hugging face 关于context-length如何理解,与本地不一致 #1213
Unanswered
HeJianqiaoMVP
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
请问Qwen/Qwen2.5-32B-Instruct-GGUF Q8量化版本支持的上下文token是多少呀。是32,768 还是131,072?huggingface上看Context Length: Full 32,768 tokens and generation 8192 tokens。我本地查看时候显示的是131,072?搞懵了。


Beta Was this translation helpful? Give feedback.
All reactions