LLaDA-8B-Base / README.md
nielsr's picture
nielsr HF Staff
Add model card
2f73110 verified
|
raw
history blame
466 Bytes
metadata
license: mit
library_name: transformers
pipeline_tag: text-generation

LLaDA-8B-Base

This repository contains the LLaDA-8B-Base model, as described in the paper Large Language Diffusion Models. LLaDA is a diffusion model with an 8B scale, trained entirely from scratch.

For code, see the project's GitHub repository: ML-GSAI/SMDM

Project Page