Dawei Zhu

dwzhu · dwzhu-pku

AI & ML interests

natural language processing

authored 2 papers 3 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 79

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 81

authored 5 papers 5 months ago

ConFiguRe: Exploring Discourse-level Chinese Figures of Speech

Paper • 2209.07678 • Published Sep 16, 2022

Long Context Alignment with Short Instructions and Synthesized Positions

Paper • 2405.03939 • Published May 7, 2024

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Paper • 2412.12706 • Published Dec 17, 2024

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 50

authored 3 papers over 1 year ago

RestGPT: Connecting Large Language Models with Real-World RESTful APIs

Paper • 2306.06624 • Published Jun 11, 2023 • 1

Large Language Models are not Fair Evaluators

Paper • 2305.17926 • Published May 29, 2023 • 1

LongEmbed: Extending Embedding Models for Long Context Retrieval

Paper • 2404.12096 • Published Apr 18, 2024 • 2

authored a paper almost 2 years ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26