VMamba

Visual state-space model with linear complexity and global perception.

CommonProductImageVisual ModelImage Processing
VMamba is a visual state-space model that combines the advantages of convolutional neural networks (CNNs) and visual Transformers (ViTs), achieving linear complexity without sacrificing global perception. It introduces the Cross-Scan Module (CSM) to address the issue of direction sensitivity and can demonstrate excellent performance in various visual perception tasks. As the image resolution increases, it shows more significant advantages compared to existing benchmark models.
Visit

VMamba Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

VMamba Visit Trend

VMamba Visit Geography

VMamba Traffic Sources

VMamba Alternatives