Finance Commons and the Bad Data Toolbox are a suite of models and tools for document AI research and applications. They focus on handling bad data, including OCR errors and chaotic text, to enhance the robustness of AI in document processing. These tools and models help to automate workflows, reduce the workload for businesses in preparing content, and support development of next-generation multi-modal document models.