Check out your 2025 Hugging Face Wrapped, a small experimental recap: hf-wrapped/2025
PatchDNA, a DNA foundation model based on Meta's BLT tokenization strategy: https://www.biorxiv.org/content/10.1101/2025.11.28.691095v1
MLEB is the largest, most diverse, and most comprehensive benchmark for legal text embedding models. https://huggingface.co/blog/isaacus/introducing-mleb
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring. Paper • 2501.02045 • Published Jan 3
Bio LLMs train on many genomes, but can we encode differences within a species? TomatoTomato adds pangenome tokens to represent a domestic tomato and a wild tomato in one sequence 🍅 🧬 monsoon-nlp/tomatotomato-gLM2-150M-v0.1