Skip to content
View marcomistretta's full-sized avatar
🤚
Looking for Internship!
🤚
Looking for Internship!

Highlights

  • Pro

Organizations

@miccunifi

Block or report marcomistretta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Easy wrapper for inserting LoRA layers in CLIP.

Python 27 2 Updated Jun 16, 2024

[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

32 Updated Feb 7, 2025

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning

Jupyter Notebook 145 8 Updated Sep 26, 2022

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,715 440 Updated Jan 12, 2025

A smarter cd command. Supports all major shells.

Rust 25,078 591 Updated Feb 26, 2025

Processed / Cleaned Data for Paper Copilot

316 11 Updated Feb 25, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,718 162 Updated Feb 26, 2025

Welcome to my GitHub page!

1 Updated Feb 18, 2025

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 577 24 Updated Oct 25, 2024

A framework to easily use 32 (and growing) different image matching methods

Python 372 33 Updated Feb 19, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 31,699 2,628 Updated Feb 26, 2025

Pytorch implementation of various Knowledge Distillation (KD) methods.

Python 1,665 267 Updated Nov 25, 2021

Open source implementation of "Vision Transformers Need Registers"

Python 165 15 Updated Jan 27, 2025

A beautiful portfolio Jekyll theme that works with GitHub Pages.

HTML 1,043 611 Updated Aug 14, 2024

[ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"

JavaScript 10 Updated Nov 20, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,496 1,756 Updated Feb 26, 2025

Official Pytorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR2020)

Python 75 15 Updated Aug 24, 2022

[ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans

Python 33 2 Updated Oct 24, 2024

A light webserver for monitoring RAM and GPU usage on multiple servers.

Python 21 2 Updated Mar 31, 2021

The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"

102 5 Updated Jan 2, 2025

Pen and paper exercises in machine learning

TeX 1,962 143 Updated May 21, 2024

The repository provides code for training the SegmentAnything Model (SAM) for predicting frame polygons in comic books

Jupyter Notebook 49 2 Updated Mar 14, 2024

Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024

Python 187 13 Updated Nov 17, 2024

CLIP-like model evaluation

Jupyter Notebook 664 85 Updated Feb 18, 2025

JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"

Python 54 3 Updated Jan 23, 2025
Python 1 Updated May 11, 2022

Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]

Python 97 4 Updated Aug 22, 2023

[ECCVW/TWYN 2024 - Best Workshop Paper] Are CLIP features all you need for Universal Synthetic Image Origin Attribution?

Python 8 1 Updated Feb 1, 2025
Next
Showing results