一万篇论文笔记's picture

9 139

一万篇论文笔记

10Kpapers

·

AI & ML interests

None yet

Organizations

None yet

upvoted an article 6 months ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

By

and 1 other •

Aug 17, 2022

• 104

upvoted a collection 7 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 88

upvoted an article 7 months ago

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.28k

upvoted 2 collections 7 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated Apr 30 • 100

DeepSeek-Math

DeepSeek Math series • 4 items • Updated Aug 16, 2024 • 25

upvoted 3 collections 8 months ago

DeepSeek-V2

8 items • Updated Jan 3 • 32

DeepSeek-MoE

DeepSeek MoE series • 3 items • Updated Aug 16, 2024 • 20

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 368