Arthur has launched ArthurBench, an open-source tool for evaluating and comparing large language models. ArthurBench lets companies test how different language models perform on their specific use cases and reports metrics such as accuracy, readability, and risk mitigation for side-by-side comparison. Financial services firms, automotive manufacturers, and media platforms have already begun using ArthurBench to speed up analysis and deliver more accurate answers.