Commit History
Update README.md bce37e2 verified
Fixing nested JSON args parsing for tool-calls in streaming (#32) 7d4e437 verified
Updating streaming tool-call parser to return ChoiceDeltaToolCall (#31) b90f131 verified
Upload streaming tool call parser python file for vLLM (#30) 579351d verified
Update modeling_nemotron_h.py dbe2b5b verified
Update README.md d97784d verified
Update README.md c9beb84 verified
Update README.md e5610bb verified
Update README.md dc376c2 verified
Use remaining_tokens for max_tokens in vLLM token budget demo 41409e7 verified
Update README.md b5c2277 verified
Update README.md 3298c11 verified
Update README.md a180ba8 verified
Update config.json 7659b75 verified
Update README.md 20547a9 verified
Updating evaluation details for RULER (reasoning off) (#6) 4a28fbc verified
Update README.md bd0d6d5 verified
Update README.md a550406 verified
Update README.md 60cfbe0 verified
Sharath Turuvekere Sreenivas committed on
Update README.md d566bdf verified
Update README.md 39d09ce verified
Update README.md 786a9ff verified
Update README.md 4f47b96 verified
Update README.md 18c4545 verified
Minor fixes in example code snippets and chat template description (#2) 92f6429 verified
Update README.md 3fa6107 verified
Sharath Turuvekere Sreenivas committed on