Skip to content
View teowu's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@VQAssessment @Q-Future

Block or report teowu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,394 181 Updated Jan 30, 2025

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,834 8,033 Updated Feb 26, 2025

PyTorch code for our paper "Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment"

37 1 Updated Dec 8, 2024
Python 19 Updated Nov 8, 2024

Official released code for VQA² series models

Python 30 1 Updated Jan 31, 2025
Python 87 2 Updated Dec 30, 2024

[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoArena.

Python 5 Updated Nov 29, 2024

[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoBench.

Python 10 Updated Nov 28, 2024

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Jupyter Notebook 280 45 Updated Nov 29, 2024

Image Quality Assessment: From Human to Machine Preference

8 Updated Nov 18, 2024

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 105 2 Updated Feb 23, 2025

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

92 Updated Nov 14, 2024

A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability

85 Updated Nov 28, 2024

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Python 38 Updated Feb 25, 2025

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

Python 178 Updated Feb 27, 2025

PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"

21 1 Updated Oct 7, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,006 85 Updated Jan 22, 2025
29 1 Updated Jul 8, 2024

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 640 61 Updated Jun 1, 2024

[Neurips 24 Spotlight] Training in Pairs + Inference on Single Image with Anchors

Python 31 3 Updated Feb 20, 2025
Python 28 Updated Jun 14, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,145 213 Updated Feb 28, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 1,933 280 Updated Feb 28, 2025

🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024

Python 51 4 Updated Jul 18, 2024

[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.

Python 85 2 Updated Jul 27, 2024

[ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?

142 3 Updated Feb 3, 2025

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

HTML 261 59 Updated Aug 18, 2022

An open-source implementation for training LLaVA-NeXT.

Python 380 21 Updated Oct 23, 2024

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 167 5 Updated Sep 26, 2024
Next
Showing results