Aphrodite Engine
PygmalionAI's large-scale inference engine
Common Product, Programming, Large-scale inference, Language models
Aphrodite is the official backend engine of PygmalionAI. It is designed to provide inference endpoints for the PygmalionAI website and to serve models quickly to a large number of users. It builds on vLLM's PagedAttention to provide continuous batching, efficient key-value (KV) cache management, and optimized CUDA kernels, and it supports various quantization schemes to improve inference performance.
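To give a rough feel for what paged key-value management means, here is a minimal toy sketch in Python. It is not Aphrodite's or vLLM's actual code: the class `PagedKVAllocator`, the constant `BLOCK_SIZE`, and the request id are hypothetical names chosen for illustration. It only shows the general idea that each sequence's cache grows in fixed-size blocks drawn from a shared pool, so memory is reserved on demand and returned when a request finishes.

```python
# Toy illustration only: not Aphrodite's or vLLM's actual implementation.
BLOCK_SIZE = 4  # tokens per block (hypothetical value for the example)


class PagedKVAllocator:
    """Maps each sequence to a list of fixed-size physical cache blocks."""

    def __init__(self, num_blocks):
        self.free_blocks = list(range(num_blocks))  # ids of unused blocks
        self.block_tables = {}  # sequence id -> list of physical block ids
        self.lengths = {}  # sequence id -> number of tokens stored

    def append_token(self, seq_id):
        """Reserve room for one more token; return the block that holds it."""
        table = self.block_tables.setdefault(seq_id, [])
        length = self.lengths.get(seq_id, 0)
        if length % BLOCK_SIZE == 0:  # no block yet, or the last one is full
            if not self.free_blocks:
                raise MemoryError("no free KV blocks")
            table.append(self.free_blocks.pop())
        self.lengths[seq_id] = length + 1
        return table[-1]

    def free(self, seq_id):
        """Return a finished sequence's blocks to the shared pool."""
        self.free_blocks.extend(self.block_tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)


alloc = PagedKVAllocator(num_blocks=8)
for _ in range(5):
    alloc.append_token("request-a")
print(len(alloc.block_tables["request-a"]))  # 5 tokens occupy 2 blocks of 4
alloc.free("request-a")
```

Because blocks need not be contiguous, a scheduler can in principle admit new requests as soon as other requests release their blocks, which is the property continuous batching relies on.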
Aphrodite Engine Visit Over Time
Monthly Visits: 515,580,771
Bounce Rate: 37.20%
Pages per Visit: 5.8
Visit Duration: 00:06:42