Ph.D Student from PKU ICL. Focusing on long context modeling
-
-
-
-
LongEmbed Public
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
-
DWTemplate Public
My personal workflow and tools for NLP research.
-
PoSE Public
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
-
LongMTEB Public
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark
Python Apache License 2.0 UpdatedMay 4, 2024 -
CoUDA Public
Code for the paper "CoUDA: Coherence Evaluation via Unified Data Augmentation" (NAACL 2024)
-
BiGuid Public
Code for the paper "Probing Bilingual Guidance for Cross-Lingual Summarization" (NLPCC 2023)
-
-
-
CompilerLab_2022_Spring Public
Lab for the compiler course in Spring 2022.
Python UpdatedMay 10, 2022 -
-
-
Synonym_Classification Public
Reimplementation for synonym classification task
-
-
-
-