v1.2.1: SynapseAI v1.14.0
SynapseAI v1.14
The codebase is validated with SynapseAI 1.14.0 and optimum-habana 1.10.4.
Tested configuration
- Llama 2 70B BF16 on 8x Gaudi2
Highlights
- Add support for continuous batching on Intel Gaudi
- Add batch size bucketing
- Add sequence bucketing for prefill operation
- Optimize concatenate operation
- Add speculative scheduling
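For readers unfamiliar with the bucketing items above, the general idea can be sketched as follows: batch sizes and prefill sequence lengths are rounded up to a small set of fixed sizes, so the accelerator sees fewer distinct tensor shapes. This is only an illustration and not the code shipped in this release. The function names, the bucket sizes (8 and 128), and the 2048 limit are assumptions chosen for the example.

```python
def round_up_to_multiple(value: int, multiple: int) -> int:
    """Round a positive integer up to the nearest multiple of `multiple`."""
    if value <= 0 or multiple <= 0:
        raise ValueError("value and multiple must be positive")
    return ((value + multiple - 1) // multiple) * multiple


def bucket_batch_size(batch_size: int, bucket_size: int = 8) -> int:
    """Hypothetical helper: pad the batch up to the next batch-size bucket."""
    return round_up_to_multiple(batch_size, bucket_size)


def bucket_sequence_length(seq_len: int, bucket_size: int = 128,
                           max_len: int = 2048) -> int:
    """Hypothetical helper: pad a prefill length up to the next length bucket."""
    if seq_len > max_len:
        raise ValueError("sequence is longer than max_len")
    # Cap at max_len in case max_len is not itself a multiple of bucket_size.
    return min(round_up_to_multiple(seq_len, bucket_size), max_len)


if __name__ == "__main__":
    # Requests with different raw shapes collapse onto a few padded shapes.
    for batch, length in [(3, 100), (5, 120), (9, 300)]:
        print((batch, length), "->",
              (bucket_batch_size(batch), bucket_sequence_length(length)))
```

In this sketch the first two requests share one padded shape, which is the effect bucketing aims for; the cost is some wasted compute on padding.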
Full Changelog: v1.2.0...v1.2.1