MIT Unveils New Robot Training Model to Tackle Problems in a Simpler, More Direct Way

AIbase基地

Published inAI News · 4 min read · Nov 4, 2024

128

This week, the Massachusetts Institute of Technology (MIT) showcased a novel robot training model that abandons the traditional approach of focusing on specific datasets, instead utilizing vast amounts of information similar to those used in training large language models (LLMs).

Researchers have pointed out that imitation learning—where an agent learns by mimicking individuals performing tasks—may fail when faced with minor challenges. These challenges could include varying lighting conditions, different environmental settings, or new obstacles. In such situations, robots lack sufficient data to adapt to these changes.

Robot Taking an Exam - Robot Gaokao

Image source: Picture generated by AI, provided by Midjourney, an image licensing service.

The team drew inspiration from models like GPT-4, adopting a brute-force, data-driven approach to problem-solving.

"In the field of language, data is represented by sentences," said Lirui Wang, the lead author of the paper. "In robotics, given the diversity of data, if you want to pre-train in a similar manner, we need a different architecture."

The team introduced a new architecture called the Heterogeneous Pre-trained Transformer (HPT), which integrates information from different sensors and environments. The data is then incorporated into the training model using transformers. The larger the transformer, the better the output results.

Users subsequently input the design, configuration of the robot, and the tasks they wish to accomplish.

"Our dream is to have a universal robot brain that you can download and use for your robot without any training," said David Held, an associate professor at Carnegie Mellon University, speaking about the research. "Although we are just beginning, we will continue to strive, hoping that the scaling up will bring breakthroughs to robot strategies, just as it has done with large language models."

This research was partially funded by the Toyota Research Institute. Last year at TechCrunch Disrupt, TRI demonstrated a method to train robots overnight. Recently, it achieved a watershed partnership, combining its robot learning research with Boston Dynamics' hardware.

Zhipu Announces Price Cuts for Multiple Large Language Models, with GLM-4-Plus Dropping 90%

Zhipu BigModel's open platform has adjusted prices for several of its model offerings. GLM-4-FlashX, for example, is now priced at just 10 RMB per 100 million tokens. Built on a powerful pre-trained base, this model boasts exceptionally fast inference speeds and functional capabilities comparable to GPT-4, excelling in data extraction, generation, and translation.

OpenAI Predicts $125 Billion Revenue by 2029, 3 Billion Monthly Active Users by 2030

OpenAI recently released a prediction forecasting $125 billion in total revenue by 2029. AI agent and channel revenue will be key drivers. AI agent revenue is projected to reach nearly $29 billion, representing almost a quarter of total revenue, while channel revenue is expected to reach $25 billion. Image note: Image generated by AI, image licensing service Midjourney. Following the success of ChatGPT, OpenAI's...

JEDEC Releases HBM4 Standard, Powering the Next Era of AI and High-Performance Computing

The JEDEC Solid State Technology Association has announced the highly anticipated release of the High Bandwidth Memory (HBM) standard – HBM4. Evolving from the HBM3 standard, HBM4 aims to further accelerate data processing while maintaining higher bandwidth, energy efficiency, and greater capacity per chip or stack, to meet the demands of efficient processing of large datasets and complex computations. The HBM4 standard introduces several key technological advancements, suitable for applications in generative AI, high-performance computing, high-end graphics cards, and servers. Firstly, HBM4 significantly increases bandwidth...

Shenzhen University's Artificial Intelligence Institute Officially Unveiled, Boosting AI Talent Cultivation

On April 21, 2025, Shenzhen University officially unveiled its Artificial Intelligence Institute, marking a significant step forward in the university's AI education and research. According to Shenzhen TV's Deep Vision News report, the institute will establish a basic research center and a computing platform, and will collaborate with Tencent Cloud to build an industry academy, promoting deep integration of industry, academia, and research. Image Note: Image generated by AI, image authorization service provider Midjourney. Currently, the Artificial Intelligence Institute boasts a strong team of approximately 80 teachers and researchers.

Intel Open-Sources AI Playground: Arc GPU-Powered Local AI Model Execution

Intel recently announced the open-sourcing of its AI Playground software, designed for local generative AI. AI Playground provides a powerful platform for running AI models on Intel Arc GPUs. It supports various image and video generation models, as well as Large Language Models (LLMs), significantly lowering the hardware barrier for AI applications by optimizing local computing resources. The project is available on GitHub and has attracted developers and AI enthusiasts worldwide.

UAE Pioneers AI for 70% Faster Lawmaking

The UAE recently announced it will be the first to utilize AI in drafting laws, aiming for significantly faster legislation. This initiative is expected to reduce the time it takes to draft legal documents by up to 70%. The UAE government hopes to leverage AI's analytical capabilities to quickly generate laws relevant to modern society. Image Note: Image generated by AI, licensed from Midjourney. The UAE government states that this technology will not only speed up the legislative process but also help improve the quality of laws.