AI & ML interests

Our focus: high-accuracy voice and language AI for real-world, high-noise applications. We’re exploring ASR, diarization, and NLI models optimized for real-time, low-latency inference on CPUs.

🅾 ShunyaLabs — State-of-the-Art Open Models

Pingala V1: Top-ranked ASR model setting new performance benchmarks in speech recognition.

Everything Starts From Zero: We present Pingala V1, an Automatic Speech Recognition (ASR) model that has achieved the #1 ranking on the Open ASR Leaderboard with an average Word Error Rate (WER) of 2.94% across eight benchmarks, about a 50% improvement over the closest competitor. Following the mathematical legacy of its namesake, Pingala V1 demonstrates that ancient principles of binary computation and sequential analysis remain relevant in modern AI.
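For readers unfamiliar with the metric, here is a minimal sketch of how a word error rate and a multi-benchmark average are commonly computed. It uses the standard definition (word-level edit distance divided by reference length) and assumes plain whitespace tokenization. It is not the leaderboard's scoring code, which may normalize text differently, and the function names are illustrative only.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance: substitutions + deletions + insertions."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing from hypothesis)
                curr[j - 1] + 1,     # insertion (extra word in hypothesis)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref = reference.split()  # assumption: whitespace tokenization, no normalization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


def average_wer(per_benchmark: List[float]) -> float:
    """Unweighted mean of per-benchmark WERs (assumed averaging scheme)."""
    if not per_benchmark:
        raise ValueError("need at least one benchmark score")
    return sum(per_benchmark) / len(per_benchmark)


def relative_reduction(baseline: float, new: float) -> float:
    """Relative WER reduction of `new` with respect to `baseline`."""
    return (baseline - new) / baseline
```

For example, `wer("the cat sat on the mat", "the cat sit on mat")` counts one substitution and one deletion against six reference words, giving 2/6. Under this reading, a 50% improvement would mean the competitor's average WER is roughly twice as high; the source does not state which definition of improvement it uses.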

This research details the mathematical foundations behind Pingala V1's performance, including our high-entropy architecture, Mixture of Experts (MoE) specialization, and speaker diarization across 200+ languages. The model offers two modes: Enhanced mode, optimized for call centers, and Verbatim mode, for media applications. Together they show that mathematical elegance and practical performance can go hand in hand.


Stay Updated — Follow us here on Hugging Face

