Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Core] Add Ascend Quant Config to main branch #33

Closed
wants to merge 28 commits into from

add int8 cache dtype when using attention quantization

dbc7ca2
Select commit
Loading
Failed to load commit list.
Closed

[Core] Add Ascend Quant Config to main branch #33

add int8 cache dtype when using attention quantization
dbc7ca2
Select commit
Loading
Failed to load commit list.