README.md · RedHatAI/bge-large-en-v1.5-quant at 85835b55ca92c41a2a0d6a6eee93880602bcc585

bge-large-en-v1.5-quant / README.md

zeroshot

Update README.md

85835b5 almost 2 years ago

preview code

raw

history blame

931 Bytes

metadata

license: mit
language:
  - en

This is the quantized (INT8) ONNX variant of the bge-large-en-v1.5 model for embeddings created with DeepSparse Optimum for ONNX export/inference pipeline and Neural Magic's Sparsify for One-Shot quantization.