Yozh

justheuristic

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Rethinking Global Text Conditioning in Diffusion Transformers

Paper • 2602.09268 • Published 24 days ago • 8

upvoted 3 papers about 1 month ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published about 1 month ago • 254

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 57

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Paper • 2601.11969 • Published Jan 17 • 27

upvoted a paper about 2 months ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3, 2025 • 21

liked a model about 2 months ago

Manmay/tortoise-tts

Updated Oct 25, 2023 • 20

upvoted a paper 2 months ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26, 2025 • 72

liked a model 7 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.28M • 4.55k

liked 2 models 8 months ago

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated Jan 9 • 1.74k • 1.18k

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Jan 30 • 142k • 2.32k

liked a dataset 8 months ago

yandex/mad-cars

Viewer • Updated Jun 29, 2025 • 5.88M • 64 • 32

upvoted an article 9 months ago

Learn the Hugging Face Kernel Hub in 5 Minutes

Article • Jun 12, 2025 • 160

upvoted 3 papers 9 months ago

upvoted an article 10 months ago

4D masks support in Transformers

Article • Jan 8, 2024

upvoted 2 papers 11 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7, 2025 • 139

commented a paper 11 months ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8, 2025 • 110

upvoted a paper 11 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published Jan 8, 2025 • 99