Magma

Magma is a foundational model capable of understanding and executing multimodal inputs for complex tasks and environments.

CommonProductProductivityMultimodalRobotics
Magma, developed by Microsoft Research, is a multimodal foundational model designed to enable complex task planning and execution through the combination of vision, language, and action. Pre-trained on large-scale visual-language data, it possesses capabilities in language understanding, spatial intelligence, and action planning, allowing it to excel in tasks such as UI navigation and robot operation. This model provides a powerful foundation framework for multimodal AI agent tasks, with broad application prospects.
Visit

Magma Visit Over Time

Monthly Visits

986849

Bounce Rate

51.18%

Page per Visit

2.7

Visit Duration

00:01:57

Magma Visit Trend

Magma Visit Geography

Magma Traffic Sources

Magma Alternatives