What is the difference between OCR_PROVIDER and VISION_LLM_PROVIDER? #254
-
What is the difference between OCR_PROVIDER and VISION_LLM_PROVIDER?
Are these two technics to OCR the document? What do I have to set to use the OCR generated by Paperless-ngx with Tesseract? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 7 replies
-
You have the option to use either:
To use the specialized AI OCR service you set the To use a LLM provider you have to set both The OCR data from paperless-gpt is saved in the 'Content' section of the document in paperless-ngx. However, unlike Tesseract OCR in paperless-ngx, it is not yet embedded in the PDF. This functionality is still a work in progress: #212. |
Beta Was this translation helpful? Give feedback.
You have the option to use either:
To use the specialized AI OCR service you set the
OCR_PROVIDER
togoogle_docai
To use a LLM provider you have to set both
OCR_PROVIDER
tollm
and also setVISION_LLM_PROVIDER
to eitherollama
oropenai
, then you also might want to look into customizing the prompt to get the ocr results you want.The OCR data from paperless-gpt is saved in the 'Content' section of the document in paperless-ngx. However, unlike Tesseract OCR in paperless-ngx, it is not …