MeshAnything: A Grid Generation Model That Transforms Any 3D Into Artist-Created Meshes

AIbase

Published inAI News · 5 min read · Jun 17, 2024

422

Recently, 3D assets created through reconstruction and generation have reached the quality level of manually crafted assets, highlighting their potential in alternative fields. However, this potential has not been fully realized, as these assets always require conversion into meshes for use in the 3D industry, and the current mesh extraction methods produce meshes that are significantly inferior to those created by human artists (AMs). Specifically, current mesh extraction methods rely on dense faces and overlook geometric features, leading to inefficient, complex post-processing, and lower representation quality.

To address these issues, researchers have proposed MeshAnything, an autoregressive model for generating artist-created 3D meshes. MeshAnything seamlessly integrates with various existing models to generate high-quality text/image/shape-conditioned mesh generation.

Product Entry:https://top.aibase.com/tool/meshanything

The meshes generated by MeshAnything significantly improve storage, rendering, and simulation efficiency while achieving comparable accuracy to previous methods.

The architecture of MeshAnything includes a VQ-VAE and a shape-conditioned decoder-only transformer. First, a VQ-VAE learns the mesh vocabulary, and then a shape-conditioned decoder-only transformer is trained on this vocabulary for shape-conditioned autoregressive mesh generation. Extensive experiments demonstrate that this method generates AMs with hundreds of times fewer faces than previous methods, significantly improving storage, rendering, and simulation efficiency, while achieving comparable accuracy to previous methods.

By integrating with various 3D asset production methods, MeshAnything enables highly controllable artist-created mesh generation. Additionally, compared to ground truth, this method has advantages in mesh topology and face count, and can generate meshes with completely different topologies but similar shapes, proving that the method does not merely overfit but understands how to construct meshes with efficient topologies.

Core features of this product include:

Powerful Mesh Generation: MeshAnything leverages autoregressive transformer technology to convert various inputs, such as images and point clouds, into fine-grained mesh models, with outstanding generation capabilities and model representation.

Automated Art Creation: MeshAnything provides users with convenient tools, making art creation more automated and intelligent, allowing users to focus on creative expression without being overly concerned with technical details.

Versatile Applications: MeshAnything has a wide range of applications in various fields, including industrial design, art creation, digital entertainment, and more, meeting the creative and needs of different users.

It should be noted that MeshAnything requires approximately 7GB and 30 seconds to generate meshes on an A6000 GPU. Limited by computational resources, MeshAnything is only trained on meshes with fewer than 800 faces and cannot generate meshes with more than 800 faces. The shape of the input mesh must be sufficiently clear, otherwise, it will be very difficult to represent it with only 800 faces. Therefore, feedforward image-to-3D methods often produce poor results due to insufficient shape quality.

Try it out: https://huggingface.co/spaces/Yiwen-ntu/MeshAnything

Moonshot AI Releases and Opensources Kimi K2 Model, Strong in Code and Agentic Tasks

Moonshot AI officially released its latest creation - the Kimi K2 model, and simultaneously announced its open source. This foundation model based on the MoE architecture has gained widespread attention in the AI field since its release, thanks to its strong coding capabilities and excellent general Agent task processing abilities. The Kimi K2 model has a total of 1T parameters, with 32B activated parameters. It has achieved top performance among open-source models in a series of benchmark performance tests such as SWE Bench Verified, Tau2, and AceBench.

AI Daily: Zhipu Launches PPT Generation Function AI Slides; Ke Ling AI Releases Ketur 2.1 Model

1. Zhipu launches free AI Slides for PPT generation. 2. Keling AI introduces KeTu 2.1 with 180 styles. 3. NVIDIA's DiffusionRenderer enables 3D scene editing. 4. Modao AI offers 30-second prototype generation. 5. Higgsfield creates avatars from 10 photos. 6. Google open-sources GenAI Processors. 7. Google Veo3 adds image-to-video. 8. Mistral AI releases Devstral2507 for code generation.....

Google DeepMind Open Sources GenAI Processors: One-Click Building of Real-Time AI Workflows

Google DeepMind open sources the GenAI Processors Python library, helping developers build efficient generative AI workflows. The library supports asynchronous processing of multimodal data and optimizes Gemini API application development, significantly reducing latency in real-time applications. Core features include a modular Processor interface, streaming API design, and concurrency optimization, enabling rapid development of real-time applications such as intelligent assistants. Currently only supports Python, but with an open community contribution model, future plans include expanding functionality to cover more scenarios.

Manus AI Official Website and Social Media Undergo Changes, Chinese Users May Be Affected

General AI company Manus adjusts its China operations, lays off employees, and relocates its core technology team to Singapore. The China region had approximately 120 employees, and the company states this move is aimed at improving operational efficiency and focusing on core business. The official website now shows that the region is unavailable, replacing previous messages about the development of the Chinese version. The official Weibo and Xiaohongshu accounts have also been cleared, indicating a significant shift in the company's market strategy in China.

Modo AI Launches: Input Your Idea and Generate a High-Fidelity, Editable Prototype in 30 Seconds

Modo AI introduces a 30-second rapid prototype generation feature, supporting multi-device adaptation and conversation optimization. Users can generate high-fidelity, editable prototypes through text, sketches, and other input methods, and support iterative conversation adjustments. The AI can intelligently parse uploaded sketches, wireframes, and more, automatically generating interfaces. It offers dual-mode editing, automatic documentation generation, and code integration features, covering multiple scenarios such as e-commerce and social networking, significantly lowering the barrier to prototype creation and improving product design efficiency.

Mistral AI Releases Devstral2507: Designed for Code-Centric Language Modeling

Mistral AI launched the Devstral2507 series with two AI models: the open-source Devstral Small1.1 (24 billion parameters, SWE-Bench score of 53.6%) and the enterprise version Devstral Medium2507 (score of 61.6%). Small1.1 supports a 128k context window and local deployment, while Medium2507 outperforms some commercial models. Both are optimized for code reasoning and program synthesis, and support integration with agent frameworks.

Musk's New AI Chatbot Grok 4: Pursuing Truth or Advocating Personal Opinions?

Musk's xAI launched Grok4 AI chatbot, promoting 'truth-seeking' but sparking controversy. Tests show it often cites Musk's views on sensitive topics like Israel-Palestine conflict and immigration. Grok previously faced anti-Semitic content issues, highlighting risks of linking AI to founder's opinions. While Grok4 outperforms rivals in some tests, frequent errors and lack of transparency may hinder commercialization. xAI is promoting $300/month s....

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

MeshAnything: A Grid Generation Model That Transforms Any 3D Into Artist-Created Meshes

AIbase

This article is from AIbase Daily

AI News Recommendations