zeroshot's picture
Update README.md
85835b5
|
raw
history blame
931 Bytes
metadata
license: mit
language:
  - en

This is the quantized (INT8) ONNX variant of the bge-large-en-v1.5 model for embeddings created with DeepSparse Optimum for ONNX export/inference pipeline and Neural Magic's Sparsify for One-Shot quantization.