25 9 23

Yoshi Suhara

suhara

https://yoshi-suhara.com/

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2:GGUF support

new activity 1 day ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2:broken config.json

updated a model 3 days ago

nvidia/Nemotron-H-8B-Reasoning-128K-FP8

View all activity

Organizations

upvoted an article 4 days ago

Article

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

and 4 others •

4 days ago

• 11

upvoted an article 6 days ago

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

and 9 others •

6 days ago

• 17

upvoted an article 2 months ago

Article

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

and 3 others •

Jun 10

• 7

upvoted a paper 3 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 42

upvoted a paper 4 months ago

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Paper • 2504.11409 • Published Apr 15 • 10

upvoted a paper 9 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted a paper 11 months ago

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published Sep 26, 2024 • 48

upvoted a collection 12 months ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 10 days ago • 60

upvoted a paper about 1 year ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

Yoshi Suhara

AI & ML interests

Recent Activity

Organizations

suhara's activity

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B