2024-12-13 15:13:13.AIbase.
Harvard University Releases Million-Volume Book Dataset to Provide High-Quality Training Material for AI Models
2024-12-12 14:14:40.AIbase.
Harvard University to Release Massive Free AI Training Dataset Funded by OpenAI and Microsoft
2024-11-21 14:18:11.AIbase.
AI Data Scandal: OpenAI Accidentally Deletes Evidence, Media Giants Sue for Copyright Infringement
2024-11-06 09:53:34.AIbase.
Chinese Team Releases the World's Largest Open-source Multimodal Dataset, Achieving Record Performance with 2B Parameter Model
2024-11-06 09:29:51.AIbase.
Chinese Team Launches World's Largest Multimodal Dataset 'Infinity-MM' and Cutting-Edge Micro AI Model 'Aquila-VL-2B'
2024-10-21 10:21:55.AIbase.
Large Models Are 'Playing Dumb'! Research Finds They Know the Right Answers but Deliberately Say the Wrong Ones
2024-10-08 11:12:18.AIbase.
MOSEL Project: Building an Open-Source Voice Database for European AI Language Models
2024-09-25 13:54:53.AIbase.
Beijing Academy of Artificial Intelligence Releases Chinese Internet Corpus CCI3.0 Containing 1000GB Dataset
2024-09-24 10:26:16.AIbase.
OpenAI Releases Groundbreaking Multilingual AI Dataset to Promote Global Language Equality
2024-09-24 09:11:51.AIbase.
Zhiyuan Launches the Infinity-Instruct Dataset with Millions of Instruction Tuning Data
2024-09-20 11:01:43.AIbase.
Google Open Building 2.5D Time Dataset: AI Aids a New Chapter in Global Urbanization Development
2024-09-02 17:09:10.AIbase.
LAION Releases New AI Dataset Re-LAION-5B, Completely Removes Links to Child Sexual Abuse Material
2024-08-31 10:41:54.AIbase.
The organization behind the dataset used for training Stable Diffusion claims to have removed CSAM
2024-08-13 15:08:09.AIbase.
AI Data Crisis! MIT Research Shows Rapid Decline in Public Sharing of Web Data!
2024-08-12 14:59:02.AIbase.
MedTrinity-25M: A Medical Multimodal Dataset Containing 25 Million Medical Images
2024-07-31 09:33:34.AIbase.
The First 100 Million Parameter Seismic Wave Large Model 'Diting' Released in Chengdu
2024-07-26 09:26:21.AIbase.
Wuhan University Collaborates with China Mobile and Jiutian AI Team to Release Open-source Audio-Video Speaker Recognition Dataset VoxBlink2
2024-07-18 15:31:13.AIbase.
Microsoft Unveils Auto Evol-Instruct AI Framework: Evolving Guidance Datasets with Large Language Models Without Human Intervention
2024-07-18 11:50:55.AIbase.
Apple Clarifies: YouTube Caption Data Not Used for Apple Intelligence; OpenELM Exclusively for Research Purposes Step-by-step explanation: 1. Begin with Apple Clarifies to indicate that the company is providing a clarification or statement. 2. Mention the specific subject being clarified, which is YouTube Caption Data. 3. Use Not Used for to clearly state that this data is not being utilized for a particular purpose, in this case, Apple Intelligence. 4. Introduce the alternative purpose, OpenELM, and specify that it is Exclusively for Research Purposes. This helps to differentiate between the two purposes and emphasize the research-only nature of OpenELM.
2024-07-17 11:00:58.AIbase.