2024-12-20 10:47:59.AIbase.
Skip o2! OpenAI may plan to launch the next generation 'o3' reasoning model
2024-12-18 11:23:53.AIbase.
ByteDance Launches Doubao Visual Reasoning Model: Prices as Low as 0.003 Yuan per Thousand Tokens
2024-12-15 10:23:35.AIbase.
Ali Launches New AI Benchmark 'PROCESSBENCH' to Assess Error Recognition Capability in Mathematical Reasoning
2024-11-29 09:47:51.AIbase.
Devastating Loss! Epoch AI Launches New Mathematics Benchmark FrontierMath, Top AI Models Solve Less Than 2%
2024-11-26 08:28:14.AIbase.
Alibaba International AI Team Releases Open-Source Problem Reasoning Model Marco-o1
2024-11-19 16:58:24.AIbase.
Why LLMs Are Always Baffled by Math Problems? AI Arithmetic Reasoning Relies on 'Guessing'!
2024-11-19 11:02:21.AIbase.
Fireworks AI Launches Composite AI Model f1: A New Generation Reasoning System Beyond GPT-4
2024-11-18 07:58:19.AIbase.
Kimi Launches Mathematical Reasoning Model k0-math: Math Capabilities Benchmarking Against OpenAI's o1 Series
2024-11-12 17:07:16.AIbase.
The Mystery of CoT Reasoning in Large Models: A Memory Master and a Probability Expert?
2024-10-30 14:42:09.AIbase.
Google AI's new project 'Project Astra' delayed until 2025 for release
2024-10-28 16:11:31.AIbase.
Major AI Discovery: State-of-the-Art Visual Models Still Fall Short in Basic Visual Reasoning Abilities
2024-10-18 16:24:35.AIbase.
Simplismart Launches Personalized AI Reasoning Engine to Enhance Enterprise AI Performance
2024-10-17 14:15:54.AIbase.
Think Like the Human Brain! Meta's New Model Dualformer Integrates Fast and Slow Thinking, Significantly Enhancing Reasoning Capabilities
2024-10-14 14:51:30.AIbase.
Apple Research Team Releases New Benchmark GSM-Symbolic: Revealing the Mathematical Reasoning Limitations of Large Language Models!
2024-10-14 09:05:30.AIbase.
Apple Research Reveals Serious Deficiencies in Large Language Model Reasoning Capabilities
2024-10-12 14:59:01.AIbase.
Apple AI Research Team Discovers Limitations of Large Model Inference, Rendering OpenAI's o1 Ineffective with Just One Sentence
2024-10-11 11:06:18.AIbase.
Super Strong Reasoning Capability! Kimi Exploration Version Starts Internal Testing: Solving Complex Search Problems
2024-10-11 09:35:13.AIbase.
DeepMind Launches New Benchmark Michelangelo: Revealing Long Context LLM Reasoning Flaws
2024-10-08 09:16:16.AIbase.
New Study Reveals Significant Deficiencies in the Reasoning Abilities of Small AI Language Models
2024-09-24 14:50:52.AIbase.