https://huggingface.co/JackFram/llama-68m with ONNX weights to be compatible with Transformers.js.

Usage (Transformers.js)

If you haven't already, you can install the Transformers.js JavaScript library from NPM using:

npm i @huggingface/transformers

Example: Text generation.

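import { pipeline } from '@huggingface/transformers';

// Create a text-generation pipeline backed by the ONNX weights in this repo.
const generator = await pipeline('text-generation', 'Xenova/llama-68m');
// Generate up to 10 new tokens that continue the prompt.
const output = await generator('Once upon a time, there was', { max_new_tokens: 10 });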
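The example above stops once `output` is assigned. As a minimal sketch of reading the result, the snippet below assumes the pipeline returns an array of objects with a `generated_text` string that includes the original prompt. `extractContinuation` is a hypothetical helper, not part of Transformers.js, and `mockOutput` is made-up data standing in for a real pipeline call, so the snippet runs without downloading the model.

```javascript
// Hypothetical helper (not part of Transformers.js): returns only the newly
// generated continuation from a text-generation result.
function extractContinuation(prompt, output) {
  // Assumed result shape: [{ generated_text: '<prompt + continuation>' }, ...]
  const first = Array.isArray(output) ? output[0] : output;
  const text = first?.generated_text;
  if (typeof text !== 'string') {
    throw new TypeError('Expected an object with a string `generated_text` field');
  }
  // Strip the prompt only when the generated text actually starts with it.
  return text.startsWith(prompt) ? text.slice(prompt.length) : text;
}

// Mocked result standing in for `await generator(prompt, { max_new_tokens: 10 })`.
const prompt = 'Once upon a time, there was';
const mockOutput = [{ generated_text: 'Once upon a time, there was a small village.' }];
console.log(extractContinuation(prompt, mockOutput));
```

With a real pipeline, passing the `output` from the example above in place of `mockOutput` should work as long as the result has the assumed shape.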

Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using 🤗 Optimum and structuring your repo like this one (with ONNX weights located in a subfolder named onnx).
