38 63 37

Qinghong (Kevin) Lin

KevinQHLin

http://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Human-AI Interaction

Recent Activity

reacted to Jaward's post with 🤯 about 10 hours ago

Incredible work!! They claim this is the year of recursive language models (I hope so). As models get bigger and better managing their context windows to fit longer prompts has been a standing engineering problem. They propose an inference technique that allows the model to externally crunch down long prompts into snippets that it can recursively call itself on, instead of directly feeding the entire prompt into the transformer. This could make models cheaper and more efficient but I doubt if big tech will adopt it since they profit more with the current approach (bigger models = longer context windows = more expensive the model). Once again such work came from academia/oss community cuz I doubt big tech would have shared these findings lol. They probably have much better inference methods that we may never know of haha. Paper: https://arxiv.org/pdf/2512.24601

upvoted a paper 12 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

authored a paper 18 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

View all activity

Organizations

upvoted a paper 12 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published 17 days ago • 200

upvoted 2 papers 18 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 20 days ago • 63

EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

Paper • 2512.14666 • Published 19 days ago • 8

upvoted a paper 24 days ago

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published 26 days ago • 46

upvoted a paper 28 days ago

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 67

upvoted 4 papers about 1 month ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 27

upvoted 5 papers about 2 months ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 44

Robot Learning from a Physical World Model

Paper • 2511.07416 • Published Nov 10, 2025 • 30

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 105

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 128

upvoted a collection 2 months ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Nov 19, 2025 • 30

upvoted 3 papers 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback

Paper • 2511.01678 • Published Nov 3, 2025 • 35

See the Text: From Tokenization to Visual Reading

Paper • 2510.18840 • Published Oct 21, 2025 • 3

upvoted 2 papers 3 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

KevinQHLin's activity

🎉 Free Image Generator Now Available!