Stars
An educational resource to help anyone learn deep reinforcement learning.
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
Multilingual Corpus of Web Fiction
Benchmarking LLMs via Uncertainty Quantification
DrugAssist: A Large Language Model for Molecule Optimization
Cross-Cultural Challenging Benchmark for Text-to-Image Generation
Multi-domain Zero Pronoun Recovery and Translation Dataset
Cross Sentence Neural Machine Translation
Improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.
Official Code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
A PyTorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression".
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
MAD: The first work to explore Multi-Agent Debate with Large Language Models :D