view article Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 17 days ago • 52
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 18 days ago • 316
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 20 days ago • 473
deepcogito/cogito-v2-preview-deepseek-671B-MoE-FP8 Text Generation • 671B • Updated 25 days ago • 239 • 4
deepcogito/cogito-v2-preview-llama-109B-MoE Image-Text-to-Text • 109B • Updated 25 days ago • 2.04k • 29
deepcogito/cogito-v2-preview-deepseek-671B-MoE Text Generation • 671B • Updated 25 days ago • 393 • 32