Skip to content

[BugFix]add int8 cache dtype when using attention quantization #147

[BugFix]add int8 cache dtype when using attention quantization

[BugFix]add int8 cache dtype when using attention quantization #147

mypy (3.12)

succeeded Feb 21, 2025 in 2m 45s