A Unified Toolkit for Deep Learning Based Document Image Analysis
-
Updated
Aug 15, 2024 - Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
PdfDet aims to simplify PDF layout detect tasks for users.
Extracting structured text from GI Bill index cards for JDoc 2023 paper
Layout Parser notebook Implementation & Re-trained model for Image detection and extraction
Yolo & Layout Parser & Detectron2
A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.
Add a description, image, and links to the layout-parser topic page so that developers can more easily learn about it.
To associate your repository with the layout-parser topic, visit your repo's landing page and select "manage topics."