
[BugFix]add int8 cache dtype when using attention quantization #144

mypy (3.12): succeeded Feb 21, 2025 in 2m 59s
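
As a rough illustration of what the PR title describes (selecting an int8 KV-cache dtype when attention quantization is enabled), here is a minimal sketch. The helper name `get_kv_cache_dtype`, the `attn_quant_enabled` flag, and the dtype map are hypothetical stand-ins, not the repository's actual API.

```python
import torch

# Hypothetical dtype map; real projects typically keep a similar
# string-to-torch-dtype table for cache configuration.
STR_DTYPE_TO_TORCH_DTYPE = {
    "int8": torch.int8,
    "float16": torch.float16,
    "bfloat16": torch.bfloat16,
}


def get_kv_cache_dtype(cache_dtype: str, model_dtype: torch.dtype,
                       attn_quant_enabled: bool) -> torch.dtype:
    """Resolve the KV-cache dtype (illustrative sketch, not the PR's code).

    When attention quantization is enabled, the cache must hold int8
    values; otherwise fall back to the configured or model dtype.
    """
    if attn_quant_enabled:
        # The fix the title describes: force int8 storage for the
        # quantized attention path instead of the model dtype.
        return torch.int8
    if cache_dtype == "auto":
        return model_dtype
    return STR_DTYPE_TO_TORCH_DTYPE[cache_dtype]
```

Resolving the dtype in one place like this keeps the quantized and non-quantized paths from silently allocating caches of different widths.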