aurick committed on
Commit 22a50b8 · verified · 1 Parent(s): 47580e9

Update README.md

Files changed (1)
  1. README.md +5 -0
README.md CHANGED
@@ -12,6 +12,11 @@ For more details about SwiftKV and how to use it:
  * 📝 [SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation (arXiv)](https://arxiv.org/abs/2410.03960)
  * 🚀 [Getting started guide](https://github.com/Snowflake-Labs/vllm/tree/swiftkv/examples/swiftkv)
 
+ ## Revisions
+
+ * **release-2508 (Aug 2025):** Updated model weights for long-context up to 128K
+ * **release-2412 (Dec 2024):** Initial model release
+
  ## Performance Metrics
 
  To evaluate SwiftKV’s performance, we focus on the following key metrics (see more details in our [blog](https://www.snowflake.com/engineering-blog/swiftkv-llm-compute-reduction/)):