-
LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
Paper • 2106.13914 • Published • 1 -
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
Paper • 2506.15196 • Published • 3 -
Ascend HiFloat8 Format for Deep Learning
Paper • 2409.16626 • Published • 1 -
Recipes for Pre-training LLMs with MXFP8
Paper • 2506.08027 • Published • 1
zhangwenbin
ExceedZhang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 4 hours ago
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management
liked
a model
about 4 hours ago
Intel/Qwen3-Next-80B-A3B-Instruct-int4-mixed-AutoRound
liked
a model
about 7 hours ago
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Organizations
None yet