Docling : https://github.com/docling-project Docling Docs : https://docling-project.github.io/docling/installation/ OCR Engines : EasyOCR, Tesseract, OcrMac, RapidOCR, OnnxTR -- General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model GOT https://github.com/Ucas-HaoranWei/GOT-OCR2.0 https://huggingface.co/stepfun-ai/GOT-OCR-2.0-hf Gives CUDA version and Transformers library errors -- OLMOCR Main Page : https://olmocr.allenai.org/ https://github.com/allenai/olmocr --- ★ PDF-Extract-Kit (huge resource requirement) https://github.com/opendatalab/PDF-Extract-Kit -- Reddit https://www.reddit.com/r/LocalLLaMA/comments/172k9q2/best_model_for_document_layout_analysis_and_ocr/ -- ★ Facebook AI Research Nougat: Neural Optical Understanding for Academic Documents: https://facebookresearch.github.io/nougat/ -- Donut 🍩 : Document Understanding Transformer: https://github.com/clovaai/donut/ -- HURIDOCS New open-source AI tool unl...