Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
ocr openai claude camelot pymupdf pypdf ocr-python markitdown llama-parse omniai unstructured-io docling llama-vision smoldocling
-
Updated
Mar 27, 2025 - Python