AIbase
Product LibraryTool Navigation

KOSMOS-2

Public

KOSMOS-2 is designed to handle text and images simultaneously, and redefine the way we perceive and interact with multimodal data, KOSMOS-2 is built on a Transformer-based causal language model architecture, similar to other renowned models like LLaMa-2 and Mistral AI's 7b model.

Creat2023-11-04T18:00:02
Update2024-08-07T21:07:49
https://www.analyticsvidhya.com/blog/2023/11/kosmos-2-a-multimodal-large-language-model-by-microsoft/
3
Stars
0
Stars Increase