DocGraphLM

A document graph language model for information extraction and question answering

CommonProductProductivityInformation ExtractionQuestion Answering

DocGraphLM is a document graph language model for information extraction and question answering. It employs advanced vision-rich document understanding techniques, combining pre-trained language models and graph semantics. Its uniqueness lies in proposing a joint encoder architecture to represent documents and a novel link prediction method to reconstruct the document graph. DocGraphLM predicts the direction and distance between nodes through a convergent joint loss function, prioritizing neighborhood restoration and minimizing the weight of remote node detection. Experiments on three SotA datasets demonstrate that incorporating graph features consistently improves performance in information extraction and question-answering tasks. Furthermore, we report that employing graph features accelerates convergence during training, despite these features being constructed solely through link prediction.

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

DocGraphLM

DocGraphLM Visit Over Time

DocGraphLM Visit Trend

DocGraphLM Visit Geography

DocGraphLM Traffic Sources

DocGraphLM Alternatives

DocGraphLM — A document graph language model for information extraction and question answering

YAYI-UIE Information Extraction Large Model — High-quality information extraction model based on massive data

Snack AI — Multilingual Model Question-Answering Assistant

Search4All — A question answering system based on a large language model, capable of answering a wide range of questions.

VideoLLaMA2-7B — A large video-language model that provides video question answering and video captioning.

PPLX Online LLMs — The first online language model API designed for question-answering

Ask Seneca — Intelligent Question Answering Assistant

Benchmark Medical RAG — Benchmark Test for Retrieval-Based Question Answering in the Medical Field

LongRAG — Enhanced Retrieval-Augmented Generation Model for Long-Text Question Answering

VideoLLaMA2-7B-16F-Base — A large video language model used for visual question answering and video subtitling generation.

ChatQA — Construct GPT-4 level conversational question answering models

VideoLLaMA2-7B-Base — A large video language model that provides visual question answering and video captioning capabilities.

Doctrine — Integrate AI question-answering functionality in minutes and embed knowledge into your applications.

MedRAG — A retrieval-augmented question-answering model for the medical field

Askwise — AI-powered intelligent question-answering assistant

LongCite — Generates fine-grained citations for large language models in long text question answering.

ScholarTurbo — ChatGPT-powered PDF Question Answering

Ask AI - Chat Bot Assistant — AI Chatbot Assistant specializing in intelligent question answering.

KnowBuddy.AI — AI assistant, providing intelligent question answering and task execution

LLaVA — Large Language and Vision Assistant, enabling multimodal chat and scientific question answering

TianGong AI Assistant — A dual-billion-parameter large language model, capable of intelligent question answering and creative text generation. TianGong is China's first AI assistant that directly compares to ChatGPT, powered by a dual-billion-parameter large language model.

Intellecs.AI — Information Simplification

Trustworthy Language Model (TLM) Playground — Try Cleanlab's Trustworthy Language Model (TLM) in your browser

Yuanxiang Large Model XChat — Leading domestic general-purpose large model

TableGPT-agent — A pre-built agent based on TableGPT2 for table-based question answering tasks.

CogVLM — A powerful open-source visual language model

AgentRE — A framework based on agents for relationship extraction in complex information environments.

Xiaomen Dao AI — One-stop AI service for image generation, question answering, and image processing

Higgsfield — Advanced Language Processing Model

idefics-80b — A general-purpose multimodal model that can be used for question answering, image description and other tasks.