ObjectSight AI 👁️

A powerful and intuitive image analysis interface powered by Google's Gemini 2.0 Flash, built with Python and Streamlit.

🌟 Features

🎯 Real-time object detection and localization
📦 Clear bounding box visualization with enhanced labels
🔄 Support for common image formats (JPG, JPEG, PNG)
🎨 Clean and minimalist user interface
💾 Download capability for analyzed images
🔑 Secure API key management
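
To illustrate the bounding box visualization listed above: Gemini's documentation describes detection boxes as `[ymin, xmin, ymax, xmax]` on a 0 to 1000 scale, so they have to be rescaled to pixel coordinates before drawing. The sketch below assumes that format. `to_pixel_box` is a hypothetical helper name, and this README does not confirm that app.py works this way.

```python
from typing import Sequence, Tuple


def to_pixel_box(box_2d: Sequence[int], width: int, height: int) -> Tuple[int, int, int, int]:
    """Rescale an assumed [ymin, xmin, ymax, xmax] box (0-1000 scale) to pixels.

    Returns (left, top, right, bottom), the order most drawing APIs expect.
    """
    ymin, xmin, ymax, xmax = box_2d

    def scale(value: int, size: int) -> int:
        # Integer arithmetic avoids floating-point rounding surprises.
        pixel = value * size // 1000
        # Clamp so a slightly out-of-range model output stays inside the image.
        return max(0, min(size, pixel))

    return scale(xmin, width), scale(ymin, height), scale(xmax, width), scale(ymax, height)


if __name__ == "__main__":
    # An 800x400 image with a box covering the middle region.
    print(to_pixel_box([100, 200, 500, 600], width=800, height=400))
```

The returned tuple can be passed to a drawing routine such as Pillow's `ImageDraw.rectangle`, which accepts coordinates in `(left, top, right, bottom)` order.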

🖼️ Snapshot

🔧 Prerequisites

Python 3.10 or higher
A web browser
A Google API key from Google AI Studio

📥 Installation

Clone the repository:

git clone https://github.com/smaranjitghose/ObjectSightAI.git
cd ObjectSightAI

Create and activate a virtual environment:

# Windows
python -m venv env
.\env\Scripts\activate

# Linux/Mac
python3 -m venv env
source env/bin/activate

Install required packages:

pip install -r requirements.txt

🚀 Usage

Start ObjectSight AI:

streamlit run app.py

Open your browser and navigate to:

http://localhost:8501

💡 Quick Start Guide

Enter your Google API key in the sidebar
Upload an image using the file uploader
Write a descriptive prompt about what to analyze
Click "Run!" to start the analysis
View results and download the analyzed image if desired

🎯 Example Prompts

"Identify and locate all objects in this image"
"Find and label all people and furniture"
"Detect all electronic devices"
"Locate and identify different types of vehicles"

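One way an app could turn prompts like these into drawable results is to ask the model for a JSON list and parse the reply defensively, since models often wrap JSON in a Markdown code fence. The sketch below is an assumption about the approach, not a description of app.py. `parse_detections` and the `label` / `box_2d` keys are illustrative names.

```python
import json
from typing import Any, Dict, List

FENCE = "`" * 3  # Markdown code fence marker, built here to keep this example fence-safe


def parse_detections(response_text: str) -> List[Dict[str, Any]]:
    """Extract well-formed detections from a model reply that should contain a JSON list."""
    text = response_text.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line (for example, the fence followed by "json").
        text = text.split("\n", 1)[1] if "\n" in text else ""
        # Drop the closing fence if present.
        if text.rstrip().endswith(FENCE):
            text = text.rstrip()[: -len(FENCE)]

    items = json.loads(text)
    detections = []
    for item in items:
        if not isinstance(item, dict):
            continue
        box = item.get("box_2d")
        label = item.get("label")
        # Keep only entries that have a text label and exactly four coordinates.
        if isinstance(label, str) and isinstance(box, list) and len(box) == 4:
            detections.append({"label": label, "box_2d": box})
    return detections


if __name__ == "__main__":
    reply = FENCE + 'json\n[{"label": "laptop", "box_2d": [100, 200, 500, 600]}]\n' + FENCE
    print(parse_detections(reply))
```

Malformed JSON raises `json.JSONDecodeError`, which a caller can catch to show a "try a clearer prompt" message instead of crashing.
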
🛠️ Troubleshooting

Common Issues

API Key Error
- Verify that the API key is entered correctly
- Check that the API key has the necessary permissions
- Ensure the API key is active

Image Upload Issues
- Check that the image format is supported (JPG, JPEG, or PNG)
- Ensure the image size is under the upload limit
- Verify that the image is not corrupted

Analysis Failures
- Check your internet connection
- Verify that the API quota has not been exceeded
- Ensure the prompt is clear and specific
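
The image upload checks above can be partly automated before any API call is made. Below is a minimal sketch using only the standard library, assuming a 200 MB cap (Streamlit's default upload limit unless configured otherwise). `sniff_image_format` and `check_upload` are hypothetical names, not functions from app.py. Matching header bytes only shows that a file starts like a PNG or JPEG; it does not prove the rest of the file is intact.

```python
from typing import Optional

# Assumption: Streamlit's default server.maxUploadSize of 200 MB applies.
MAX_BYTES = 200 * 1024 * 1024

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
JPEG_SIGNATURE = b"\xff\xd8\xff"


def sniff_image_format(data: bytes) -> Optional[str]:
    """Return 'png' or 'jpeg' based on the file's leading bytes, else None."""
    if data.startswith(PNG_SIGNATURE):
        return "png"
    if data.startswith(JPEG_SIGNATURE):
        return "jpeg"
    return None


def check_upload(data: bytes, max_bytes: int = MAX_BYTES) -> str:
    """Return 'ok' or a short reason the upload is likely to fail."""
    if not data:
        return "empty file"
    if len(data) > max_bytes:
        return "file too large"
    if sniff_image_format(data) is None:
        return "unsupported or corrupted image"
    return "ok"


if __name__ == "__main__":
    print(check_upload(PNG_SIGNATURE + b"\x00" * 16))
    print(check_upload(b"GIF89a"))
```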

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the project
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

Made with ❤️ by Smaranjit Ghose
