v1.3.3
Important Update: Add kv cache to speed up generation.
The old versions will not work because the ONNX model on Hugging Face has been updated.
Important Update: Add kv cache to speed up generation.
The old versions will not work because the ONNX model on Hugging Face has been updated.