PwC-ocr-label-studio-outamation
This project is an end-to-end workflow for processing a sample invoice with OCR and manual annotation. It demonstrates how to extract text from an invoice using Tesseract OCR, refine the results in Label Studio, and prepare a high-quality dataset for AI training. The repository includes configuration files, scripts, and documentation for document processing.
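
As a rough illustration of the OCR-to-annotation handoff described above, the sketch below converts word-level Tesseract output into a Label Studio task with pre-annotations that an annotator can then correct. It is not taken from this repository's scripts. The function names `words_to_label_studio` and `run_tesseract`, the `min_conf` parameter, the control names `bbox`, `transcription`, and `image`, and the `ocr` data key are all assumptions that would need to match the project's actual labeling configuration. The `pytesseract` and `Pillow` packages are assumed to be installed only if `run_tesseract` is called.

```python
import json
import uuid


def words_to_label_studio(ocr, image_width, image_height, image_url, min_conf=0.0):
    """Turn word-level OCR output into one Label Studio task with predictions.

    `ocr` is a dict of parallel lists with the keys text, conf, left, top,
    width and height, which is the shape returned by pytesseract's
    image_to_data(..., output_type=Output.DICT).
    """
    results = []
    rows = zip(ocr["text"], ocr["conf"], ocr["left"], ocr["top"],
               ocr["width"], ocr["height"])
    for text, conf, left, top, width, height in rows:
        # Tesseract emits page/block/line rows with empty text and conf -1.
        if not text.strip() or float(conf) < min_conf:
            continue
        # Label Studio expects box coordinates as percentages of the image size.
        box = {
            "x": 100.0 * left / image_width,
            "y": 100.0 * top / image_height,
            "width": 100.0 * width / image_width,
            "height": 100.0 * height / image_height,
            "rotation": 0,
        }
        # The rectangle and its transcription share an id so they stay linked.
        common = {
            "id": uuid.uuid4().hex[:10],
            "to_name": "image",  # assumed name of the Image tag
            "original_width": image_width,
            "original_height": image_height,
        }
        results.append({**common, "from_name": "bbox",  # assumed control name
                        "type": "rectangle", "value": box})
        results.append({**common, "from_name": "transcription",  # assumed name
                        "type": "textarea", "value": {**box, "text": [text]}})
    return {
        "data": {"ocr": image_url},  # assumed data key used by the Image tag
        "predictions": [{"model_version": "tesseract", "result": results}],
    }


def run_tesseract(image_path):
    """Run Tesseract on an image file and return (ocr_dict, width, height)."""
    # Imported lazily so the converter above works without OCR dependencies.
    import pytesseract
    from PIL import Image

    image = Image.open(image_path)
    ocr = pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)
    return ocr, image.width, image.height


if __name__ == "__main__":
    # Hand-written sample in the same shape as Tesseract's output.
    sample = {
        "text": ["", "Invoice", "#1234"],
        "conf": ["-1", "96", "91"],
        "left": [0, 100, 300],
        "top": [0, 50, 50],
        "width": [1000, 150, 100],
        "height": [500, 25, 25],
    }
    task = words_to_label_studio(sample, 1000, 500, "/data/invoice.png")
    print(json.dumps([task], indent=2))
```

The resulting JSON list could be imported into a Label Studio project as tasks, so that each OCR word appears as an editable box with its recognized text. Raising `min_conf` would drop low-confidence words and leave them for manual annotation instead.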