On March 24, 2026, the Meituan LongCat team open-sourced the deep learning model LongCat-Flash-Prover, specifically designed for mathematical formalization and theorem proving. The model breaks down formal reasoning into three atomic capabilities: automatic formalization, draft generation, and proof generation. It aims to overcome the shortcomings of large language models in rigorous logical reasoning, achieving a paradigm shift from 'probabilistic prediction of answers' to 'rigorous logical proof'.
Google used a large language model to analyze 5 million news articles globally, mining unstructured data and building a global flood prediction system. This innovative approach solves the problem of traditional deep learning models struggling to predict floods in remote areas due to a lack of historical weather data.
JD.com open-sourced the large model JoyAI-LLM-Flash, with 4.8 billion parameters and 3 billion activated parameters, pre-trained on 20 trillion text, featuring advanced knowledge comprehension, reasoning, and programming capabilities. It adopts the FiberPO optimization framework, combining fiber bundle theory and reinforcement learning, using the Muon optimizer and dense multi-Token prediction technology to solve the instability problem in model scale expansion.
DeepMind CEO Demis Hassabis sleeps under 6 hours, believes less harms the brain, and splits his day into two work blocks. Co-founder known for protein structure prediction research.....
Build intelligent prediction agents to connect humans with AI for precise reasoning in multiple fields.
Tenki is an AI sports prediction tool for Kalshi traders, analyzing data to find winning trading opportunities.
An AI-driven football prediction APP that analyzes over 10,000 matches, provides accurate predictions, and can be downloaded for free.
Meta's single-image 3D reconstruction model that combines SAM 3 segmentation with geometric and texture layout prediction to generate 3D assets.
Anthropic
$105
Input tokens/M
$525
Output tokens/M
200
Context Length
Google
$2.1
$17.5
1k
$21
Alibaba
$2
-
256
$15.8
$12.7
64
$0.8
128
Bytedance
Baidu
Tencent
Openai
$0.63
$3.15
131
32
Xai
$8.75
$70
$2.4
$9.6
ai-sage
GigaChat3-10B-A1.8B is an efficient dialogue model in the GigaChat series. Based on the Mixture-of-Experts (MoE) architecture, it has a total of 10 billion parameters and 1.8 billion active parameters. It adopts the innovative Multi-head Latent Attention (MLA) and Multi-token Prediction (MTP) technologies, aiming to optimize inference throughput and generation speed. The model is trained on 20T tokens of diverse data and supports 10 languages including Chinese, suitable for dialogue scenarios requiring quick responses.
GigaChat3-10B-A1.8B is a dialogue model in the GigaChat series, based on the Mixture of Experts (MoE) architecture, with a total of 10 billion parameters, of which 1.8 billion are active parameters. This model uses multi-head latent attention and multi-token prediction technologies, supports a long context of 256,000 tokens, and performs excellently in multilingual dialogue and reasoning tasks.
GigaChat3-10B-A1.8B-base is the basic pre-trained model of the GigaChat series, adopting the Mixture of Experts (MoE) architecture with a total of 10 billion parameters and 1.8 billion active parameters. The model integrates Multi-Head Latent Attention (MLA) and Multi-Token Prediction (MTP) technologies, and has the advantage of high throughput during inference.
labhamlet
WavJEPA is an audio foundation model based on the waveform joint embedding prediction architecture. It uses advanced semantic representation learning to address the deficiencies in speech unit or token-level representation learning. It significantly outperforms the state-of-the-art time-domain audio foundation models in many downstream benchmark tasks while requiring significantly less computational resources.
Prior-Labs
TabPFN-2.5 is a tabular foundation model based on the Transformer architecture. It utilizes context learning technology and can solve tabular prediction problems in a single forward propagation, providing efficient regression and classification solutions for structured tabular data.
amazon
Chronos-2 is a time series foundation model with 120 million parameters, supporting zero-shot prediction. It supports univariate, multivariate, and covariate-aware tasks within a single architecture, achieving state-of-the-art accuracy in zero-shot prediction across multiple benchmarks, and has extremely high inference efficiency.
ByteDance-Seed
AHN is an innovative neural network architecture for efficient long context modeling. By converting lossless memory into a fixed-size compressed representation, it combines the advantages of Transformer and RNN, achieving efficient computation and accurate prediction in long sequence processing.
inclusionAI
Ming-UniVision is a large multimodal language model that first integrates continuous visual representations into the next-token prediction framework, unifying vision and language under a single autoregressive paradigm without discrete quantization or modality-specific heads. This model supports joint image understanding and generation, converges faster in vision-language training, and also supports multi-round context visual tasks.
mldi-lab
Kairos-10M is a time series foundation model designed for cross-domain zero-shot prediction, with approximately 10 million parameters. It can handle heterogeneous time series data with different information densities and achieve strong generalization ability across different domains without fine-tuning.
Kairos-50M is a time series foundation model with 50 million parameters, specifically designed for zero-shot prediction across different domains. It uses adaptive word segmentation and position encoding techniques to handle heterogeneous time series data with different information densities and can achieve strong generalization ability across different domains without fine-tuning.
alibaba-pai
Wan-Fun is a powerful text-to-video tool that supports multi-resolution video prediction and multiple languages, meeting diverse video generation needs. This model is based on the Wan2.2 architecture and has a parameter scale of 14B, specifically designed for text-to-video and image-to-video generation tasks.
yslan
STream3R is a scalable sequential 3D reconstruction model based on causal Transformer, which redefines point cloud map prediction as a decoder-only Transformer problem. It introduces a streaming processing framework, efficiently processes image sequences using causal attention, and can generalize well to various challenging scenarios, including dynamic scenes where traditional methods often fail.
AesSedai
This is the ik_llama.cpp imatrix quantized version of the zai-org/GLM-4.5 model. It uses advanced quantization techniques to reduce the model's storage space while maintaining high prediction accuracy. It is designed for performance and efficiency optimization in specific scenarios.
Salesforce
Moirai 2.0 is a decoder-only general time series prediction Transformer model that has been pre-trained on multiple high-quality datasets, including GIFT-Eval, a subset of the Chronos dataset, synthetic time series, and Salesforce's internal operational data. Compared with the first version, significant improvements have been made in aspects such as the loss function, prediction method, and data processing.
nvidia
AMPLIFY is an efficient and advanced protein language model focused on protein sequence analysis and prediction tasks.
NexaAI
Parakeet TDT 0.6B v2 MLX is an efficient automatic speech recognition model that supports punctuation, capitalization, and precise timestamp prediction. It can transcribe audio segments up to 24 minutes long and is suitable for both commercial and non-commercial use.
SAP
SAP RPT 1 OSS is a deep learning model that combines semantic understanding and context learning, specifically designed for tabular data prediction tasks. The model uses specialized embeddings for different data modalities and is trained on large-scale real-world tabular data, performing excellently in a wide range of benchmark tests.
multimolecule
ERNIE-RNA is an RNA language model based on unsupervised learning. It is pre-trained on non-coding RNA sequences and adopts the masked language model objective. This model can understand the grammar of the RNA language and provide support for downstream tasks such as RNA structure and function prediction.
andrewdalpino
A protein molecular function prediction model based on the Gene Ontology (GO) and the ESM2 architecture
Chumafly
This model is a fine-tuned BERT-uncased model for functional/non-functional requirement prediction, trained on AWS g5 instances using the PROMISE dataset and custom data, achieving an overall accuracy of 97%.
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
The Allora MCP Server is an implementation based on the Model Context Protocol (MCP) that provides the function of obtaining machine learning inference data from the Allora network, enabling AI systems to seamlessly access Allora prediction market data.
This is a Label Studio integration server based on the Model Context Protocol, which realizes programmatic interaction with Label Studio instances through the label - studio - sdk, supporting functions such as project management, task management, and prediction integration.
The Chronulus MCP service is a prediction agent server designed for Claude AI, supporting installation and configuration in multiple ways, including pip, docker, and uvx, and can be integrated with services such as the file system and network requests.
PolyMarket Prediction Market Data Service
An MCP server that provides real-time data from multi-platform prediction markets, supporting platforms such as Polymarket, PredictIt, and Kalshi. Query odds, prices, and market information through a unified interface.
The Agentipy MCP Server is a model context protocol server designed for Claude Desktop, which enables AI agents to interact with the Solana blockchain through standardized interfaces, providing a rich set of blockchain tool functions, including balance query, transaction execution, price prediction, and cross - chain bridging.
An MCP server that provides daily horoscope predictions for the twelve constellations
Titan Memory Server is a neural memory system based on Google research, providing sequence learning and prediction functions, supporting memory state management and model persistence.
Provide a k-point generation tool under the SSSP1.3 PBEsol pseudopotential for Quantum ESPRESSO, including k-point spacing prediction and grid generation functions, supporting different confidence levels and model choices.
The AlphaFold MCP Server is a comprehensive platform that provides protein structure prediction analysis tools, supporting functions such as structure retrieval, quality assessment, batch processing, and visualization integration.
The Titan Memory MCP Server is a neural memory system designed for LLMs, supporting memory retention across interactions and sequence prediction. It is integrated into tools such as Cursor to achieve automated memory management.
SolanaViz MCP is a Model Context Protocol server that enables access, analysis, and visualization of Solana blockchain data through natural language interaction, including price prediction, security assessment, and multi - wallet analysis.
The MCP server of Manifold Markets provides comprehensive prediction market interaction functions, including market creation, trading operations, liquidity management, and social functions.
CryptoWeather AI Bitcoin Signal is an AI service that provides real - time bitcoin price predictions through the Model Context Protocol (MCP). It includes functions such as trading advice, performance metrics, and signal analysis, and updates data every hour.
An MCP server that exposes the PyTorch Lightning framework to tools, agents, and orchestration systems through structured APIs, supporting functions such as training, inspection, validation, testing, prediction, and model checkpoint management.
MCP server integrating Chronulus AI prediction agents with Claude
A satellite tracking MCP server based on the N2YO API, providing functions such as real - time satellite position query, orbital data acquisition, and transit prediction.
The Label Studio MCP server project provides functions for managing annotation projects through natural language or structured calls, including project creation, task management, and prediction integration.
An MCP-based NBA player data prediction tool that generates player performance predictions through real-time data analysis and statistical modeling.