Multimodal-RAG

A multimodal retrieval-augmented generation (RAG) pipeline built with LangChain and Vertex AI for document search and question answering over text and images. It uses Google's Gemini models for knowledge retrieval.
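To make the retrieval idea concrete, here is a minimal, self-contained sketch of the general pattern: text chunks and image summaries are indexed together, the closest matches to a question are retrieved, and a prompt is assembled for a generative model. This is not the code in src/main.py. The `Chunk` type, the file names, and the bag-of-words `embed` function are hypothetical stand-ins; a real pipeline would presumably use an embedding model and a Gemini model through Vertex AI instead. Representing an image by a text summary is one common approach and is an assumption here.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    """One retrievable unit: raw text, or a text summary standing in for an image."""
    kind: str     # "text" or "image"
    source: str   # hypothetical file name, for illustration only
    content: str  # the text itself, or a summary of the image


def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding" (assumption); swap in a real embedding model.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[Chunk], k: int = 2) -> list[Chunk]:
    # Rank every chunk, text or image summary, against the query and keep the top k.
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c.content)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, hits: list[Chunk]) -> str:
    # Assemble retrieved context into a prompt that a generative model would answer.
    context = "\n".join(f"[{c.kind}:{c.source}] {c.content}" for c in hits)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


chunks = [
    Chunk("text", "report.pdf", "Quarterly revenue grew due to cloud services."),
    Chunk("image", "chart.png", "Bar chart showing revenue by quarter for cloud services."),
    Chunk("text", "notes.txt", "The office cafeteria menu changes every Monday."),
]

question = "How did cloud revenue change by quarter?"
hits = retrieve(question, chunks)
prompt = build_prompt(question, hits)
print(prompt)
```

In this toy run the image summary and the revenue text are retrieved while the unrelated note is left out. The final step, sending `prompt` (and, in a multimodal setup, the original image) to a model, is omitted because it depends on the project's Vertex AI configuration.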


Setup

Install the required packages:

pip install -r requirements.txt

Usage

Run the main script to execute the multimodal RAG pipeline:

python src/main.py