HolyFox

Holy-fox

AI & ML interests

A high school student who loves LLMs. My specialty is synthetic data.

Recent Activity

updated a model about 1 hour ago

DataPilot/ArrowIdeative-13b-Instruct-test-llm-jp-v0.1

updated a model about 1 hour ago

DataPilot/ArrowIdeative-13b-NeoBase-ZERO-llm-jp-v0.1

published a model about 1 hour ago

DataPilot/ArrowIdeative-13b-Instruct-test-llm-jp-v0.1

upvoted a collection 18 days ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 3 days ago • 41

upvoted a paper 3 months ago

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published Oct 14, 2025 • 13

upvoted 2 papers 5 months ago

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published Feb 17, 2025 • 3

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

upvoted a collection 5 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 396