ViTPose

A collection of ViTPose models implemented based on the Transformer architecture.

CommonProductImageArtificial IntelligenceComputer Vision
ViTPose is a series of human pose estimation models based on the Transformer architecture. It leverages the powerful feature extraction capabilities of Transformers to provide a simple yet effective baseline for human pose estimation tasks. The ViTPose models perform exceptionally well across various datasets, demonstrating high accuracy and efficiency. Maintained and updated by the community at the University of Sydney, the model offers various versions of different scales to meet diverse application needs. The ViTPose models are open-sourced on the Hugging Face platform, allowing users to easily download and deploy these models for human pose estimation research and application development.
Visit

ViTPose Visit Over Time

Monthly Visits

21315886

Bounce Rate

45.50%

Page per Visit

5.2

Visit Duration

00:05:02

ViTPose Visit Trend

ViTPose Visit Geography

ViTPose Traffic Sources

ViTPose Alternatives