According to reports from Yicai, Luo Fuli, a key developer of the open-source large model DeepSeek-V2, will join Xiaomi as the head of the Xiaomi AI Lab and will be responsible for building the large model team. This news has attracted widespread attention, especially as Xiaomi plans to strengthen its position in the large model field.
Image Source Note: The image is generated by AI, authorized by service provider Midjourney.
According to insiders, Xiaomi founder Lei Jun is concerned about the company's late entry into the AI large model field, prompting him to offer a high salary to attract Luo Fuli. Luo Fuli has an impressive background, holding a master's degree from the Institute of Computational Linguistics at Peking University and has published several papers at top conferences in natural language processing, such as ACL2019, demonstrating her profound expertise in this field.
Before joining Xiaomi, Luo Fuli worked at Alibaba's DAMO Academy as a researcher in the Machine Intelligence Lab, where she was responsible for developing the multilingual pre-training model VECO and promoting the open-source work of AliceMind. In 2022, Luo Fuli chose to leave Alibaba to join DeepSeek, participating in the development of DeepSeek-V2, further solidifying her position in the large model R&D field.
Since its establishment in 2016, the Xiaomi AI Lab has grown to a team of about 250 people, with research directions covering various fields such as vision, acoustics, speech, natural language processing, knowledge graphs, and machine learning. According to public information, Xiaomi has already established a dedicated large model team in 2023, appointing Luan Jian as the head, who reports to Wang Bin, the vice chairman of the technical committee.
Key Points:
🌟 Luo Fuli will join Xiaomi to lead the large model team at the AI Lab.
💰 Lei Jun expresses concerns about Xiaomi's development in the AI large model field and offers high salaries to attract talent.
📈 The Xiaomi AI Lab has established a dedicated team focused on advancing large model technology.