Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Recent Activity

upvoted a paper 6 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 8 days ago • 138

upvoted a paper 9 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 14 days ago • 115

upvoted an article 9 days ago

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Article • ServiceNow-AI • 9 days ago • 43

liked a Space 17 days ago

MTEB Leaderboard

Space • 📊 Embedding Leaderboard • 7.48k

upvoted 2 papers 21 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 134

Hyperagents

Paper • 2603.19461 • Published Mar 19 • 51

upvoted a paper 29 days ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 220

upvoted 2 papers about 1 month ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 61

liked a model about 1 month ago

nvidia/llama-nemotron-embed-vl-1b-v2

upvoted an article about 2 months ago

Vision Language Models (Better, faster, stronger)

Article • merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 614

upvoted 2 papers 2 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 327

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

Paper • 2604.02007 • Published Apr 2 • 14

upvoted a paper 3 months ago

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

submitted a paper to Daily Papers 3 months ago

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

upvoted 2 papers 3 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

liked a dataset 3 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 121 • 71

upvoted an article 3 months ago

A New Framework for Evaluating Voice Agents (EVA)

Article • ServiceNow-AI • Mar 24 • 95

upvoted a paper 3 months ago

Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published Mar 9 • 23
