Magma
Magma is a foundational model capable of understanding and executing multimodal inputs for complex tasks and environments.
CommonProductProductivityMultimodalRobotics
Magma, developed by Microsoft Research, is a multimodal foundational model designed to enable complex task planning and execution through the combination of vision, language, and action. Pre-trained on large-scale visual-language data, it possesses capabilities in language understanding, spatial intelligence, and action planning, allowing it to excel in tasks such as UI navigation and robot operation. This model provides a powerful foundation framework for multimodal AI agent tasks, with broad application prospects.
Magma Visit Over Time
Monthly Visits
986849
Bounce Rate
51.18%
Page per Visit
2.7
Visit Duration
00:01:57