HPT
HPT is a multi-modal LLM framework from HyperGAI that understands and processes multiple input modalities, including text, images, and videos.
HPT (Hyper-Pretrained Transformers) is a multi-modal large language model framework introduced by the HyperGAI research team. It supports efficient, scalable training of large multi-modal foundation models that can understand multiple input modalities, including text, images, and videos. HPT models can be trained from scratch or fine-tuned efficiently on top of existing pre-trained vision encoders and/or large language models.