This is a dummy copy of amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV with only two layers.
Useful for profiling / testing purposes.
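
For readers who want a similar reduced copy of another checkpoint, here is a minimal sketch of one way a two-layer variant could be derived. It is not necessarily how this copy was produced. It assumes a Hugging Face style config with a `num_hidden_layers` key and tensor names prefixed with `model.layers.<index>.`; the helper names `truncate_config` and `keep_weight` are hypothetical.

```python
import re

# Assumption: decoder-layer tensors are named "model.layers.<index>.<rest>".
_LAYER_RE = re.compile(r"^model\.layers\.(\d+)\.")


def truncate_config(config: dict, num_layers: int = 2) -> dict:
    """Return a copy of a config dict that declares only `num_layers` layers."""
    truncated = dict(config)  # shallow copy so the input stays untouched
    truncated["num_hidden_layers"] = num_layers
    return truncated


def keep_weight(name: str, num_layers: int = 2) -> bool:
    """Keep non-layer tensors and tensors from the first `num_layers` layers."""
    match = _LAYER_RE.match(name)
    return match is None or int(match.group(1)) < num_layers


if __name__ == "__main__":
    config = {"num_hidden_layers": 32, "hidden_size": 4096}
    print(truncate_config(config))
    names = [
        "model.embed_tokens.weight",
        "model.layers.0.self_attn.q_proj.weight",
        "model.layers.5.self_attn.q_proj.weight",
        "lm_head.weight",
    ]
    print([n for n in names if keep_weight(n)])
```

Filtering a checkpoint's tensors with `keep_weight` and saving them next to the truncated config would give a small model that keeps the original layer shapes, which is typically what matters for profiling.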