Recently, the research team at Meta Reality Labs, in collaboration with Efficient, released an innovative generative model called "Pippo," which can generate multi-view video at resolutions up to 1K from a single casually captured photo. This technology marks another significant advance in computer vision and image generation.


The core of Pippo is its multi-view diffusion transformer. Unlike traditional generative models, Pippo requires no additional inputs such as fitted parametric body models or the camera parameters of the input image. Users simply provide an ordinary photo, and the system automatically generates a multi-angle video, presenting the subject in a more vivid, three-dimensional way.

Pippo is currently released as code only, without pre-trained weights. The repository includes the models, configuration files, inference code, and sample training code for the Ava-256 dataset. Developers can get started with training and experimentation by cloning and setting up the codebase with a few simple commands.

Future plans for the Pippo project include organizing and cleaning up the code and releasing inference scripts for pre-trained models. These improvements should make the project easier to use and help the technology see wider practical adoption.

Project: https://github.com/facebookresearch/pippo

Key Points:

🌟 The Pippo model can generate high-resolution multi-view videos from a single ordinary photo without any additional input.

💻 Only the code is released, with no pre-trained weights; developers can train and apply the model themselves.

🔍 The team plans to introduce more features and improvements in the future to enhance user experience.