Search AI Products and News
Explore worldwide AI information, discover new AI opportunities
2025-02-20 10:37:18 | AIbase | 15.5k
OpenAI's Latest Benchmark Test: AI Programming Ability Matches One-Quarter of Humans, Revealing Limitations
OpenAI recently released a report on AI programming capabilities that assesses the current state of AI in software development using $1 million worth of real-world development work. The benchmark, named SWE-Lancer, covers 1,400 real tasks from Upwork and evaluates AI performance in two areas: direct development and project management. The best-performing model, Claude 3.5 Sonnet, achieved a success rate of 26.2% on coding tasks; results were also reported for the project management tasks.
2025-02-18 16:55:26 | AIbase | 15.5k
OpenAI Launches SWE-Lancer Benchmark: Evaluating Model Performance on Real-World Freelance Software Engineering Tasks