- User interface of initial page.
- User interface while in use.
- networkx
- nltk
- numpy
- opencv-python
- pytesseract
- python-dotenv
- quart
- Quart-CORS
- scipy
- Tesseract OCR engine
-
Python packages using pip
python -m pip install networkx nltk numpy opencv-python pytesseract python-dotenv quart Quart-CORS scipy
-
Installing Tesseract OCR engine
-
Install to the default location (Recommended)
C:\\Program Files\\Tesseract-OCR\\tesseract.exe
-
Or change the value of
TESSERACT
in.env
file accordingly.
-
Enter the follwing command in
server
directory:python server.py
-
Configured to run on
PORT 5000
by default. -
PORT
can be changed via.env
. Important: client.env
must be edited accordingly. -
Visit http://127.0.0.1:5000/api/ to test the server.
-
Visit http://127.0.0.1:5000/api/test/ocr and http://127.0.0.1:5000/api/test/summary to test both engines.
-
Use a simple web server to serve
dist
folder inclient
directory. -
For example execute
python -m http.server
insidedist
to serve the files of the directory. And visit the given link (i.e.http://localhost:8000/
) to access the web client.
- Re-implement API in
JavaScript
andExpress.js
while keeping OCR/Summarization functionality in Python.