A powerful and intuitive image analysis interface powered by Google's Gemini Flash 2, built with Python and Streamlit.
- 🎯 Real-time object detection and localization
- 📦 Clear bounding box visualization with enhanced labels
- 🔄 Support for common image formats (JPG, JPEG, PNG)
- 🎨 Clean and minimalist user interface
- 💾 Download capability for analyzed images
- 🔑 Secure API key management
- Python 3.10 or higher
- A web Browser
- Google API key from Google AI Studio
- Clone the repository:
git clone https://github.com/smaranjitghose/ObjectSightAI.git
cd ObjectSightAI
- Create and activate virtual environment:
# Windows
python -m venv env
.\env\Scripts\activate
# Linux/Mac
python3 -m venv env
source env/bin/activate
- Install required packages:
pip install -r requirements.txt
- Start ObjectSight AI:
streamlit run app.py
- Open your browser and navigate to:
http://localhost:8501
- Enter your Google API key in the sidebar
- Upload an image using the file uploader
- Write a descriptive prompt about what to analyze
- Click "Run!" to start the analysis
- View results and download the analyzed image if desired
"Identify and locate all objects in this image"
"Find and label all people and furniture"
"Detect all electronic devices"
"Locate and identify different types of vehicles"
-
API Key Error
- Verify API key is entered correctly
- Check if API key has necessary permissions
- Ensure API key is active
-
Image Upload Issues
- Check if image format is supported
- Ensure image size is under limit
- Verify image is not corrupted
-
Analysis Failures
- Check internet connection
- Verify API quota hasn't been exceeded
- Ensure prompt is clear and specific
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the project
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
Made with ❤️ by Smaranjit Ghose