1442

Joakim Lee

Reinforcement4All

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels

upvoted a paper about 3 hours ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

upvoted a paper about 3 hours ago

SPASM: Stable Persona-driven Agent Simulation for Multi-turn Dialogue Generation

View all activity

Organizations

None yet

upvoted 20 papers about 3 hours ago

Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels

Paper • 2603.22276 • Published 22 days ago • 14

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published 18 days ago • 26

SPASM: Stable Persona-driven Agent Simulation for Multi-turn Dialogue Generation

Paper • 2604.09212 • Published 5 days ago • 1

Counting to Four is still a Chore for VLMs

Paper • 2604.10039 • Published 4 days ago • 1

Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation

Paper • 2604.11290 • Published 2 days ago • 1

SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?

Paper • 2604.10718 • Published 3 days ago • 2

TorchUMM: A Unified Multimodal Model Codebase for Evaluation, Analysis, and Post-training

Paper • 2604.10784 • Published 3 days ago • 5

Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration

Paper • 2604.11446 • Published 2 days ago • 3

SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context

Paper • 2604.11716 • Published 2 days ago • 3

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Paper • 2604.11666 • Published 2 days ago • 3

Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach

Paper • 2604.11547 • Published 2 days ago • 4

Continuous Adversarial Flow Models

Paper • 2604.11521 • Published 2 days ago • 4

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published 6 days ago • 9

Zero-shot World Models Are Developmentally Efficient Learners

Paper • 2604.10333 • Published 4 days ago • 6

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

Paper • 2604.02340 • Published 4 days ago • 6

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Paper • 2604.11778 • Published 2 days ago • 6

SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Paper • 2604.09557 • Published Feb 10 • 8

Joakim Lee

AI & ML interests

Recent Activity

Organizations

Reinforcement4All's activity