Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

updated a model 3 days ago

hf-internal-testing/gpt-oss-20b-bf16

published a model 3 days ago

hf-internal-testing/gpt-oss-20b-bf16

new activity 4 days ago

kernels-community/triton_kernels:Kernel assertion errors on 5090 using generation with MXfp4 (gpt-oss) - (stable on 4090)

published an article 17 days ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By

and 4 others •

17 days ago

• 51

published an article 20 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

20 days ago

• 472

published an article 2 months ago

Article

Transformers backend integration in SGLang

By

and 4 others •

Jun 23

• 53

published an article 2 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

By

and 4 others •

Jun 19

• 83

published an article 3 months ago

Article

Exploring Quantization Backends in Diffusers

By

and 2 others •

May 21

• 40

published an article 4 months ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

By

and 8 others •

Apr 29

• 39

published an article 6 months ago

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

By

and 1 other •

Mar 7

• 76

published an article 10 months ago

Article

Introducing SynthID Text

By

and 5 others •

Oct 23, 2024

• 46

published an article 11 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

By

and 5 others •

Sep 18, 2024

• 265

published an article 12 months ago

Article

Accelerate 1.0.0

By

and 2 others •

Sep 13, 2024

• 53

published an article about 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By

and 7 others •

Jul 23, 2024

• 237

published an article over 1 year ago

Article

quanto: a pytorch quantization toolkit

By

and 2 others •

Mar 18, 2024

• 42

published an article almost 2 years ago

Article

Overview of natively supported quantization schemes in 🤗 Transformers

By

and 4 others •

Sep 12, 2023

• 12

published an article about 2 years ago

Article

Making LLMs lighter with AutoGPTQ and transformers

By

and 5 others •

Aug 23, 2023

• 58