Brazilian Sign Language Recognition Project

This is a collaborative data science project under the Omdena São Paulo Chapter focused on Brazilian Sign Language (Libras) recognition. The project aims to develop machine learning models that can classify sign language videos into corresponding Portuguese words.

Project Overview

See STRUCTURE.md for detailed project organization.

Setup Instructions

Prerequisites

  1. Python 3.11 or higher
  2. uv package manager (recommended installation via pip: pip install uv)

Environment Setup

  1. Clone the repository:

    git clone https://github.com/OmdenaAI/SaoPauloBrazilChapter_BrazilianSignLanguage.git
    cd SaoPauloBrazilChapter_BrazilianSignLanguage
  2. Install core dependencies:

    uv sync

    For additional dependencies:

    uv sync --extra <group>  # Example: uv sync --extra data

    See pyproject.toml for available dependency groups (data, model, app).

  3. Using the environment:

    # Activate the environment (uv sync creates it in .venv)
    source .venv/bin/activate    # On Windows: .venv\Scripts\activate
    
    # Run your code
    python your_script.py
    jupyter notebook

    Or run commands directly without activation:

    uv run python your_script.py
    uv run jupyter notebook
  4. Adding new dependencies:

    uv add <package>              # Add to core dependencies
    uv add --optional data <pkg>  # Add to the optional "data" group

Project Structure

SaoPauloBrazilChapter_BrazilianSignLanguage/
├── data/                  # Data files
│   ├── raw/              # Original data
│   │   ├── INES/        # INES dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   ├── SignBank/    # SignBank dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   ├── UFV/         # UFV dataset
│   │   │   └── videos/  # Video files (stored on Google Drive)
│   │   └── V-Librasil/  # V-Librasil dataset
│   │       └── videos/  # Video files (stored on Google Drive)
│   ├── interim/          # Intermediate processing
│   ├── processed/        # Final datasets
│   ├── external/         # Third party data
│   └── papers/           # Related research
├── code/                 # Source code
│   ├── data/            # Data processing
│   └── models/          # Model implementations
├── notebooks/            # Jupyter notebooks
└── tests/               # Unit tests

See STRUCTURE.md for complete structure details.

Data Management

Video Files

  • Large video files are stored on Google Drive
  • Video directories in the repository structure are placeholders
  • Download videos to your local videos/ directories as needed
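
Because the videos are not in Git, it can help to check which datasets you have already downloaded. The following is a rough sketch, assuming the `data/raw/<dataset>/videos/` layout shown in Project Structure. The helper name `video_counts` and the list of video file extensions are assumptions for illustration, not part of this repository.

```python
from pathlib import Path

# Dataset folder names taken from the Project Structure tree
DATASETS = ("INES", "SignBank", "UFV", "V-Librasil")

# Assumption: common video extensions; adjust to match the actual files
VIDEO_SUFFIXES = {".mp4", ".avi", ".mov", ".mkv", ".webm"}


def video_counts(repo_root: str = ".") -> dict[str, int]:
    """Count downloaded video files per dataset under data/raw/<name>/videos/."""
    counts = {}
    for name in DATASETS:
        videos_dir = Path(repo_root) / "data" / "raw" / name / "videos"
        if not videos_dir.is_dir():
            # Placeholder directory missing locally: nothing downloaded yet
            counts[name] = 0
            continue
        counts[name] = sum(
            1
            for p in videos_dir.iterdir()
            if p.is_file() and p.suffix.lower() in VIDEO_SUFFIXES
        )
    return counts
```

A count of 0 for a dataset suggests its videos still need to be downloaded from Google Drive into the matching `videos/` directory.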

Data Files

  • Small files like CSV files, labels, and metadata are tracked in Git
  • Store processed data (features, embeddings) in processed/
  • Document data formats in respective directories
