Rhymes AI · Singapore
A seat that sees all tourists taking photos with Merlion
teowu.github.io · @HaoningTimothy
Stars
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition for 80+ languages, provides data annotation and synthesis tools, supports training and… (see the usage sketch after this list)
PyTorch code for our paper "Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment"
Official code release for the VQA² series models
[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoArena.
[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoBench.
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
PyTorch code for our paper "Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grain Image Quality Assessment"
Codebase for Aria - an Open Multimodal Native MoE
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
[NeurIPS'24 Spotlight] Training in Pairs + Inference on Single Image with Anchors
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winning solution for the Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024
[NeurIPS'24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
[ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
An open-source implementation for training LLaVA-NeXT.
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
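For the PaddleOCR entry above, here is a minimal usage sketch, assuming the standard `paddleocr` pip package (2.x-style API); the image path, language code, and the exact shape of the returned results are illustrative and may differ across releases.

```python
# Minimal PaddleOCR sketch (assumes: pip install paddlepaddle paddleocr).
# "sample.jpg" and lang="en" are placeholders; swap in your own image and language code.
from paddleocr import PaddleOCR

ocr = PaddleOCR(lang="en", use_angle_cls=True)  # loads detection, angle-classification, and recognition models once
result = ocr.ocr("sample.jpg", cls=True)        # returns one entry per input image

# In the 2.x API each recognized line is [bounding_box, (text, confidence)].
for box, (text, score) in result[0]:
    print(f"{score:.2f}\t{text}")
```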