Cosine Trains GPT-4o with High-Quality Data for Its Genie Assistant 'Human Reasoning Compiler'

AIbase基地

Published inAI News · 4 min read · Aug 19, 2024

241

Cosine, an AI startup based in San Francisco, has introduced a new AI model called Genie, designed specifically to assist software developers. According to the company, Genie outperforms its competitors in benchmark tests, demonstrating exceptional capabilities.

Cosine collaborated with OpenAI to train a variant of GPT-4o using high-quality data, achieving remarkable benchmark test results. The company states that the key to Genie's success lies in its ability to "mimic human reasoning," which may extend beyond the realm of software development.

QQ截图20240819092111.png

Genie Leads in the SWE Domain

Alistair Pullen, co-founder and CEO of Cosine, revealed that Genie scored 30% on the SWE-Bench test, the highest score ever achieved by an AI model in this field. This score surpasses other language models focused on coding, such as Amazon's model (19%) and Cognition's Devin (13.8% in certain SWE-Bench tests).

Genie's architecture is designed to mimic the cognitive processes of human developers, enabling it to autonomously or collaboratively fix errors, develop new features, refactor code, and perform various programming tasks.

Self-Improvement Through Synthetic Data

The development of Genie involved a proprietary process, training and fine-tuning a non-public variant of GPT-40 using billions of high-quality data. With the help of experienced developers, Cosine spent nearly a year curating this dataset, which includes 21% JavaScript and Python, 14% TypeScript and TSX, and 3% other languages (including Java, C++, and Ruby).

Genie's outstanding performance is partly due to its self-improvement training. Initially, the model primarily learned from perfect, effective code but struggled with handling its own errors. Cosine addressed this issue by using synthetic data: if Genie's initial solution was incorrect, the model was shown how to improve through the correct result. With each iteration, Genie's solutions gradually improved, requiring fewer corrections.

QQ截图20240819092121.png

Overcoming Technical Limitations

Pullen recognized the potential of large language models to support human software development as early as 2022. However, the technology at the time was not advanced enough to realize the vision for Genie. The token capacity of the context window was typically limited to 4000 tokens, a major bottleneck. Today, models like Gemini1.5Pro can process up to 2 million tokens in a single prompt. Although Cosine has not disclosed Genie's specific token capacity, this technical advancement undoubtedly provides a solid foundation for Genie's success.

Tsinghua Changgeng Hospital Collaborates with Beijing Electronic Information and Intelligence to Develop China's First Pharmaceutical Large Model: Focused on Medication Safety Evaluation for Special Populations

Beijing Tsinghua Changgeng Hospital has collaborated with Beijing Electronic Information and Intelligence to develop China's first pharmaceutical-specific large model, using AI to optimize pharmaceutical processes, improve the efficiency and accuracy of medication safety evaluation for special populations such as the elderly, children, and pregnant women, and address the challenges of rapid updates in drug information and complex individual differences.

AI Music Creation Becomes a New Side Job for Programmers: Single Track Plays Over 2 Million Times, Copyright Revenue Reaches Several Ten Thousand Yuan

In 2025, the popularity of AI music creation tools is changing the industry landscape. In January, a player from Genshin Impact used Suno to create a song with 6.4 million plays, sparking discussions about the capabilities of AI creation. Programmers have become an active group, and in March, Yapie completed a theme song using multiple tools within a few hours.

A Single Sentence Can Change AI's Creative Potential: Study Finds Simple Prompts Can Significantly Improve Output Diversity

A team from Stanford and other universities proposed the 'language sampling' method, which improves the creative diversity of generative AI by asking the model to generate five responses and their probabilities in the prompt. This method applies to both language and image models, and can stimulate richer creative outputs.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Cosine Trains GPT-4o with High-Quality Data for Its Genie Assistant 'Human Reasoning Compiler'

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Xiaomi AI Team Collaborates with Peking University to Publish New Paper, 'Talented Girl' Hired by Lei Jun Participates in Research

Tsinghua Changgeng Hospital Collaborates with Beijing Electronic Information and Intelligence to Develop China's First Pharmaceutical Large Model: Focused on Medication Safety Evaluation for Special Populations

AI Music Creation Becomes a New Side Job for Programmers: Single Track Plays Over 2 Million Times, Copyright Revenue Reaches Several Ten Thousand Yuan

A Single Sentence Can Change AI's Creative Potential: Study Finds Simple Prompts Can Significantly Improve Output Diversity

Chongqing Strengthens Regulation, Removes Over 10 Non-Compliant AI Products to Ensure Technological Safety

AI Daily: Google Gemini 3.0 Pro is being rolled out on a limited scale; Aishike Technology completes B+ round financing of 100 million yuan; Baidu releases document parsing model PaddleOCR-VL

AI Daily: ByteDance Launches DouBao Large Model 1.6; AiShi Technology Completes 100 Million RMB B+ Funding Round; Baidu Releases Document Parsing Model PaddleOCR-VL

AI Video Company Ai Shi Technology Completes 100 Million RMB B+ Round Financing: ARR Exceeds 40 Million USD, Users Exceed 100 Million

Yingmu Technology Launches New Generation AI Glasses and Expands to 2000+ Experience Stores Nationwide

Wikipedia Worries About Sustainability Due to Decline in Traffic from AI Chatbots

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Cosine Trains GPT-4o with High-Quality Data for Its Genie Assistant 'Human Reasoning Compiler'

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Xiaomi AI Team Collaborates with Peking University to Publish New Paper, 'Talented Girl' Hired by Lei Jun Participates in Research

Tsinghua Changgeng Hospital Collaborates with Beijing Electronic Information and Intelligence to Develop China's First Pharmaceutical Large Model: Focused on Medication Safety Evaluation for Special Populations

AI Music Creation Becomes a New Side Job for Programmers: Single Track Plays Over 2 Million Times, Copyright Revenue Reaches Several Ten Thousand Yuan

A Single Sentence Can Change AI's Creative Potential: Study Finds Simple Prompts Can Significantly Improve Output Diversity

Chongqing Strengthens Regulation, Removes Over 10 Non-Compliant AI Products to Ensure Technological Safety

AI Daily: Google Gemini 3.0 Pro is being rolled out on a limited scale; Aishike Technology completes B+ round financing of 100 million yuan; Baidu releases document parsing model PaddleOCR-VL

AI Daily: ByteDance Launches DouBao Large Model 1.6; AiShi Technology Completes 100 Million RMB B+ Funding Round; Baidu Releases Document Parsing Model PaddleOCR-VL

AI Video Company Ai Shi Technology Completes 100 Million RMB B+ Round Financing: ARR Exceeds 40 Million USD, Users Exceed 100 Million

Yingmu Technology Launches New Generation AI Glasses and Expands to 2000+ Experience Stores Nationwide

Wikipedia Worries About Sustainability Due to Decline in Traffic from AI Chatbots

GEO Services