Python

 Glossary :  https://docs.python.org/3/glossary.html#glossary Tutorial :  https://docs.python.org/3/tutorial/index.html Python 13 What's new :  https://docs.python.org/3.13/whatsnew/3.13.html Python Standard Library :  https://docs.python.org/3.13/library/index.html The Python Language Reference :  https://docs.python.org/3.13/reference/index.html Books :  Python Distilled Automate the Boring Stuff Quick Python

OCR


OCR Engines : EasyOCR, Tesseract, OcrMac, RapidOCR, OnnxTR

--
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
GOT
https://huggingface.co/stepfun-ai/GOT-OCR-2.0-hf

Gives CUDA version and Transformers library errors
--
OLMOCR
---
PDF-Extract-Kit (huge resource requirement)
--
Reddit https://www.reddit.com/r/LocalLLaMA/comments/172k9q2/best_model_for_document_layout_analysis_and_ocr/
--
 Facebook AI Research Nougat: Neural Optical Understanding for Academic Documents:  https://facebookresearch.github.io/nougat/
--
Donut 🍩 : Document Understanding Transformer: https://github.com/clovaai/donut/
--
HURIDOCS New open-source AI tool unlocks content and structure of PDFs effortlessly
https://huridocs.org/2024/08/new-open-source-ai-tool-unlocks-content-and-structure-of-pdfs-effortlessly/
https://github.com/huridocs/pdf-document-layout-analysis
--

Comments

Popular posts from this blog

Segmentation Models

AI Avatar

AI Video