3 8 57

Bill H

lccurious

https://lccurious.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

MedAIBase/AntAngelMed

authored a paper 16 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

authored a paper 16 days ago

dInfer: An Efficient Inference Framework for Diffusion Language Models

View all activity

Organizations

liked a model 7 days ago

MedAIBase/AntAngelMed

103B • Updated 12 days ago • 258 • 9

authored 3 papers 16 days ago

upvoted a paper 16 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published 25 days ago • 78

published a dataset about 1 month ago

lccurious/SpudPix

Updated Dec 3, 2025 • 4

liked 3 models about 1 month ago

inclusionAI/LLaDA2.0-flash-preview

Text Generation • 103B • Updated 16 days ago • 81 • 68

inclusionAI/LLaDA2.0-flash

Text Generation • 103B • Updated 16 days ago • 397 • 59

inclusionAI/LLaDA2.0-mini

Text Generation • 16B • Updated 16 days ago • 8.06k • 51

upvoted a collection about 1 month ago

LLaDA 2.0

Collection

7 items • Updated 11 days ago • 39

updated 2 models about 1 month ago

inclusionAI/LLaDA2.0-flash

Text Generation • 103B • Updated 16 days ago • 397 • 59

inclusionAI/LLaDA2.0-mini

Text Generation • 16B • Updated 16 days ago • 8.06k • 51

updated 2 models 2 months ago

inclusionAI/LLaDA2.0-flash-preview

Text Generation • 103B • Updated 16 days ago • 81 • 68

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated 16 days ago • 3.95k • 86

published a model 2 months ago

inclusionAI/LLaDA2.0-flash-preview

Text Generation • 103B • Updated 16 days ago • 81 • 68

New activity in inclusionAI/LLaDA-MoE-7B-A1B-Base 2 months ago

Why not compare it with Qwen3-4B or Qwen3-7B?

#2 opened 3 months ago by

wisamidris7

liked a model 3 months ago

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated 16 days ago • 3.95k • 86

liked a model 4 months ago

inclusionAI/LLaDA-MoE-7B-A1B-Instruct

7B • Updated Oct 28, 2025 • 1.7k • 61

authored 2 papers 5 months ago

Energy-based Automated Model Evaluation

Paper • 2401.12689 • Published Jan 23, 2024 • 1

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Paper • 2508.07785 • Published Aug 11, 2025 • 28

Bill H

AI & ML interests

Recent Activity

Organizations

lccurious's activity

Why not compare it with Qwen3-4B or Qwen3-7B?

🎉 Free Image Generator Now Available!