Deepmark AI is a benchmark tool for evaluating large language models (LLMs) that allows you to assess a variety of task-specific metrics on your own data. It comes pre-integrated with leading generative AI APIs like GPT-4, Anthropic, GPT-3.5 Turbo, Cohere, and AI21.