RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 22 days ago • 38.9k • 9
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 15 days ago • 6.68k • 1
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠 1 Quantization Formats & CUDA Compute Capability Support