Xenova HF Staff whitphx HF Staff commited on
Commit
80f0928
·
verified ·
1 Parent(s): 6920cab

Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#3)

Browse files

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (e3b08cae7b7be18bb56316060806138b90bb20fb)


Co-authored-by: Yuichiro Tachibana <[email protected]>

README.md CHANGED
@@ -5,4 +5,22 @@ library_name: transformers.js
5
 
6
  https://huggingface.co/t5-small with ONNX weights to be compatible with Transformers.js.
7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
 
5
 
6
  https://huggingface.co/t5-small with ONNX weights to be compatible with Transformers.js.
7
 
8
+ ## Usage (Transformers.js)
9
+
10
+ If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
11
+ ```bash
12
+ npm i @huggingface/transformers
13
+ ```
14
+
15
+ **Example:** Text-to-text generation.
16
+
17
+ ```js
18
+ import { pipeline } from '@huggingface/transformers';
19
+
20
+ const generator = await pipeline('text2text-generation', 'Xenova/t5-small');
21
+ const output = await generator('how can I become more healthy?', {
22
+ max_new_tokens: 100,
23
+ });
24
+ ```
25
+
26
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
onnx/decoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca7c0160f53931838935e9e4a096a84c37fb17e1eca67a389df27955e2f83305
3
+ size 80162109
onnx/decoder_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e90fe2cea972c0d68c6f46105378a37846b9d292e1d750e611ec6e82eda688aa
3
+ size 83411719
onnx/decoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:77e84fb92a8eeb8c2f9f6522de77ef1a37a58a2e755ef0fce7f7e37dc3542b96
3
+ size 107685752
onnx/decoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9374f8598ed15351abe4a5b854560f1bf5816f0d3e9950c5caf86d7e6dc62518
3
+ size 81734493
onnx/decoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c478b93500538dffad654dd2fa2d19d65357c7c07977e2f0135c37180d7bb7b
3
+ size 47244794
onnx/decoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:98782be9fd4baa185d3fd4f2d5db0adb42dcbdc0d97f4489c4689b2dd25b6541
3
+ size 107685791
onnx/decoder_with_past_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4f49b096981ada008409ac603c557364b6ab008b803591e1c288694398e3cad
3
+ size 78360748
onnx/decoder_with_past_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c393c9d200572722e940d3c30df4b04908af89e1a70ba4c37c8359c1458ed050
3
+ size 77095177
onnx/decoder_with_past_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b3c4c4d13f534e19d9aeb4694ac98da3327cf5cf23f2af6bffb2a0d413f37ed3
3
+ size 104498256
onnx/decoder_with_past_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c67799659db399094aec762064b23b5a36f581a16e80c164903819624bd03fd
3
+ size 79736620
onnx/decoder_with_past_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7ddd5f5abd0024f2ee28ec30ae96d76ae6b5431baa298d104856f99b2233108
3
+ size 45448448
onnx/decoder_with_past_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5d29a22102cb181a42113cb831472bfa5d7d6799232e840a8be7d091325fe533
3
+ size 104498286
onnx/encoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7aa6693b844d6e1a8308b56d5cd50439d9dcaa2f3f4425fe3bdec73f3cdb988b
3
+ size 76528655
onnx/encoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ada693644efa0fabb5037caa7c8d98307b74310509f480faace264055fa09ea
3
+ size 35472753
onnx/encoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89e93b66dc8fd108a97f5119f293510af82295f09397f808198e1b3dbf3ae699
3
+ size 77708015
onnx/encoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:86904ebc48ab8c323f939a2256a3fcefceaa728ebd4d3faef41eab269f63d294
3
+ size 43616234
onnx/encoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b0bd138f07d8bf6794dab9a11926d3c74ea788cbe326b898f563cea6b8fbc80e
3
+ size 35472768