Nicholas Broad

nbroad

AI & ML interests

None yet

Recent Activity

updated a dataset about 12 hours ago

nbroad/hf-inference-providers-data

liked a model 5 days ago

zai-org/GLM-4.7-Flash

upvoted an article 5 days ago

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Article • 7 days ago • 60

upvoted an article 18 days ago

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

Article • 20 days ago • 17

upvoted a paper 5 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 212

upvoted an article 6 months ago

Welcome GPT OSS, the new open-source model family from OpenAI!

Article • Aug 5, 2025 • 509

upvoted a collection 6 months ago

GLiCLass-V3

Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy. • 8 items • Updated Aug 13, 2025 • 18

upvoted an article 9 months ago

Tiny Agents: an MCP-powered agent in 50 lines of code

Article • Apr 25, 2025 • 306

upvoted 3 papers 10 months ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 133

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4, 2025 • 35

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13, 2025 • 92

upvoted an article 11 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 273

upvoted a paper 11 months ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26, 2025 • 38

upvoted an article 11 months ago

1 Billion Classifications

Article • Feb 13, 2025 • 45

upvoted 2 papers 11 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 166

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17, 2025 • 46

upvoted a paper 12 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90

upvoted an article about 1 year ago

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Article • Jan 16, 2025 • 76

upvoted 2 papers about 1 year ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 186

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 92

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 26 days ago • 679

upvoted a paper about 1 year ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147