Jina Embeddings V2 Base
English text embedding model
CommonProductProductivityText EmbeddingBert
Jina Embeddings V2 Base is an English text embedding model that supports a sequence length of 8192. It is based on the Bert architecture (JinaBert) and supports the ALiBi symmetric bidirectional variant to allow for longer sequence lengths. The model was pre-trained on the C4 dataset and further trained on a collection of over 400 million sentence pairs and negative samples from Jina AI. This model is suitable for various use cases involving long documents, including long document retrieval, semantic text similarity, text re-ranking, recommendation, RAG, and LLM-based generative search. The model has 137 million parameters and is recommended for inference on a single GPU.
Jina Embeddings V2 Base Visit Over Time
Monthly Visits
19075321
Bounce Rate
45.07%
Page per Visit
5.5
Visit Duration
00:05:32