pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 28 days ago • 94
Hugging Face Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 • 201
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published Jul 24, 2025 • 86
Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG Paper • 2410.05983 • Published Oct 8, 2024 • 2
Search-R1-v0.2 Collection Exploration with a more stable RL pipeline with outcome-only reward and scaled-up LLMs. https://arxiv.org/abs/2503.09516 • 26 items • Updated Aug 12, 2025 • 5
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 14 days ago • 266
Cohere Labs Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Jul 31, 2025 • 57
Cohere Labs Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated Jul 31, 2025 • 43
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Paper • 2501.07888 • Published Jan 14, 2025 • 15