Blog, Articles, and discussions

Gemma 3n fully available in the open-source ecosystem!

By June 26, 2025 • 115

Community Articles

view all

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

and 9 others •

6 days ago

• 17

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

13 days ago

• 66

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

•

16 days ago

• 18

Code a simple RAG from scratch

•

Oct 29, 2024

• 164

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 658

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

•

10 days ago

• 20

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

•

5 days ago

• 6

From GRPO to DAPO and GSPO: What, Why, and How

•

16 days ago

• 14

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others •

14 days ago

• 25

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

May 17

• 9

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 83

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 164

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 96

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 117

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 210

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 35

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

May 20

• 38

Blazingly fast whisper transcriptions with Inference Endpoints

By May 13, 2025 • 74

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

By April 9, 2025 • 28

HuggingFace, IISc partner to supercharge model building on India's diverse languages

By February 27, 2025 • 22

FastRTC: The Real-Time Communication Library for Python

By February 25, 2025 • 172

Deploying Speech-to-Speech on Hugging Face

By October 22, 2024 • 41

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

By May 1, 2024 • 80

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

By January 19, 2024 • 39

Speculative Decoding for 2x Faster Whisper Inference

By December 20, 2023 • 30

AudioLDM 2, but faster ⚡️

By August 30, 2023 • 14

Deploy MusicGen in no time with Inference Endpoints

By August 4, 2023 • 4

Fine-tuning MMS Adapter Models for Multi-Lingual ASR

By June 19, 2023 • 20

Speech Synthesis, Recognition, and More With SpeechT5

By February 8, 2023 • 12

A Complete Guide to Audio Datasets

By December 15, 2022 • 39

Fine-Tune Whisper with 🤗 Transformers

By November 3, 2022 • 279

Community Articles

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

and 9 others •

6 days ago

• 17

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

13 days ago

• 66

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

•

15 days ago

• 27

Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era

and 1 other •

5 days ago

• 12

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

and 4 others •

4 days ago

• 11

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

•

16 days ago

• 18

Code a simple RAG from scratch

•

Oct 29, 2024

• 164

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 658

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

•

10 days ago

• 20

AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org

•

5 days ago

• 6

From GRPO to DAPO and GSPO: What, Why, and How

•

16 days ago

• 14

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others •

14 days ago

• 25

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

May 17

• 9

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 83

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 164

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 96

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 117

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 210

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 35

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

May 20

• 38

View all