[BugFix]add int8 cache dtype when using attention quantization #147

mypy (3.11)

succeeded Feb 21, 2025 in 2m 28s