What is the difference between OCR_PROVIDER and VISION_LLM_PROVIDER? #254

Answered by gardar
rosaLux161 asked this question in Q&A
You have the option to use either:

  • A specialized AI OCR service (currently only Google Document AI)
  • An LLM provider (Ollama or OpenAI). These services can do OCR pretty well, although they are not specifically designed as OCR services.

To use the specialized AI OCR service, set OCR_PROVIDER to google_docai.

To use an LLM provider, you have to set OCR_PROVIDER to llm and also set VISION_LLM_PROVIDER to either ollama or openai. You might also want to look into customizing the prompt to get the OCR results you want.
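
As a rough illustration of the two setups described above, here is a small Python sketch that checks whether a set of environment variables forms a valid combination. The variable names and values (google_docai, llm, ollama, openai) come from this answer. The check_ocr_config helper is hypothetical and is not part of paperless-gpt, and other settings the tool may require (model names, credentials, and so on) are outside this sketch.

```python
import os

# Values named in the answer above; the validation logic itself is illustrative only.
OCR_PROVIDERS = {"google_docai", "llm"}
VISION_LLM_PROVIDERS = {"ollama", "openai"}


def check_ocr_config(env):
    """Return a list of problems found in an environment mapping (empty if none)."""
    problems = []
    ocr = env.get("OCR_PROVIDER", "")
    vision = env.get("VISION_LLM_PROVIDER", "")

    if ocr not in OCR_PROVIDERS:
        problems.append(
            f"OCR_PROVIDER should be one of {sorted(OCR_PROVIDERS)}, got {ocr!r}"
        )
    elif ocr == "llm" and vision not in VISION_LLM_PROVIDERS:
        # The LLM route needs a second variable naming the vision-capable provider.
        problems.append(
            "OCR_PROVIDER=llm also needs VISION_LLM_PROVIDER set to one of "
            f"{sorted(VISION_LLM_PROVIDERS)}, got {vision!r}"
        )
    return problems


if __name__ == "__main__":
    # Check the environment of the current process and report anything missing.
    for problem in check_ocr_config(os.environ):
        print(problem)
```

Calling check_ocr_config with OCR_PROVIDER=google_docai alone returns an empty list, while OCR_PROVIDER=llm without VISION_LLM_PROVIDER returns one problem describing the missing setting.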

The OCR data from paperless-gpt is saved in the 'Content' section of the document in paperless-ngx. However, unlike Tesseract OCR in paperless-ngx, it is not …

Answer selected by rosaLux161