OneGen

An efficient single-pass unified generation and retrieval framework suitable for large language models.

CommonProductProgrammingNatural Language ProcessingLarge Language Models
OneGen is an efficient single-pass generation and retrieval framework designed for large language models (LLMs), intended for fine-tuning generation, retrieval, or mixed tasks. The core idea is to integrate generation and retrieval tasks within the same context by assigning the retrieval task to retrieval tokens generated autoregressively. This enables the LLM to perform both tasks in a single forward pass. This approach not only reduces deployment costs but also significantly decreases inference costs, as it avoids the need for two forward pass computations for queries.
Visit

OneGen Visit Over Time

Monthly Visits

503747431

Bounce Rate

37.31%

Page per Visit

5.7

Visit Duration

00:06:44

OneGen Visit Trend

OneGen Visit Geography

OneGen Traffic Sources

OneGen Alternatives