This is the official repository for MathReader, an advanced TTS document reader for academic mathematical documents.
Project page: https://hyeonsieun.github.io/MathReader_demo/
The experimental code and test dataset developed for our research can be found here.
-
Install Nougat, NVIDIA NeMo, and the transformers library in your development environment.
-
Alternatively, create the environment from the mathreader_environment.yml file with the following command:
conda env create -f ./mathreader_environment.yml
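Whichever installation route you use, a quick import check can catch a broken environment before the first run. The following is a minimal sketch, not part of the repository: the helper name missing_packages is hypothetical, and the import names nougat, nemo, and transformers are assumptions about how the three libraries are exposed once installed.

```python
from importlib.util import find_spec

# Assumed top-level import names for Nougat, NVIDIA NeMo, and transformers.
# These are not confirmed by this repository; adjust them if your install differs.
REQUIRED = ["nougat", "nemo", "transformers"]


def missing_packages(names):
    """Return the names that cannot be located as importable top-level modules."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

An empty result only means the modules can be found; it does not verify versions or GPU support.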
-
Create a folder named 'test_audio' in the same location as MathReader.py.
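The folder can also be created from Python rather than by hand. This is a small sketch under the assumption that it is run from the directory containing MathReader.py; the helper name ensure_audio_dir is hypothetical and does not exist in the repository.

```python
from pathlib import Path


def ensure_audio_dir(script_dir="."):
    """Create the 'test_audio' folder inside script_dir if it is missing."""
    audio_dir = Path(script_dir) / "test_audio"
    # exist_ok=True makes repeated calls harmless.
    audio_dir.mkdir(parents=True, exist_ok=True)
    return audio_dir


if __name__ == "__main__":
    print("Audio folder ready at:", ensure_audio_dir().resolve())
```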
-
Edit line 102 of MathReader.py so that it contains the path of the PDF file you want to run OCR on.
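Before running the script, it may help to confirm that the path you entered points to an existing PDF. The sketch below is illustrative only: check_pdf_path is a hypothetical helper, not a function in MathReader.py, and it makes no assumption about what line 102 looks like.

```python
from pathlib import Path


def check_pdf_path(pdf_path):
    """Return pdf_path as a Path, or raise if it is not an existing .pdf file."""
    path = Path(pdf_path)
    if path.suffix.lower() != ".pdf":
        raise ValueError(f"Not a PDF file: {path}")
    if not path.is_file():
        raise FileNotFoundError(f"PDF not found: {path}")
    return path
```

Calling it with the same string you place on line 102 raises an error early if the extension is wrong or the file is missing.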
-
Run the following command in the terminal:
python MathReader.py