[BugFix][Refactor] Fix some bugs and refine codes for large scale simulator test #93
base: main
s5u13b commented on Jan 16, 2025 (edited)
- Simplify request timestamps implementation and add metrics
- Set max-instances for auto_scale_up loop
- Support retry binding address for zmq server
- Support power-of-k-choice for dispatch
- Change num_cpus of ProxyActor from 1 to 0
- Fix some bugs: abort in AsyncStream, host in global launch mode, simulator in global launch mode
- Reorganize simulator files
- Reorganize global_scheduler directory
- Re-sort manager and launcher functions
- Other minor changes
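
For readers unfamiliar with the "retry binding address" item, here is a minimal sketch of the general idea. It is not the code from this PR: it uses plain TCP sockets from the standard library rather than ZeroMQ, and the function name `bind_with_retry`, its parameters, and the choice to retry the same address after a short delay are all assumptions made for illustration.

```python
import socket
import time


def bind_with_retry(host: str, port: int, max_attempts: int = 5,
                    delay_s: float = 0.1) -> socket.socket:
    """Try to bind (host, port), retrying when the address is unavailable.

    Hypothetical helper: the real server in this PR uses zmq, and its retry
    policy (attempt count, delay, same or different port) is not shown here.
    """
    last_error = None
    for _ in range(max_attempts):
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        try:
            sock.bind((host, port))
            return sock
        except OSError as err:
            # Typically "address already in use"; close the socket and retry.
            sock.close()
            last_error = err
            time.sleep(delay_s)
    raise RuntimeError(
        f"could not bind {host}:{port} after {max_attempts} attempts"
    ) from last_error
```

A caller would wrap server start-up in this helper so that a port still held by a previous process does not fail the launch on the first attempt.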
commit 48c674b Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 09:41:05 2025 +0000 Fix lint
commit 322862b Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 09:39:31 2025 +0000 Fix entrypoints unit test
commit 75af824 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 08:07:26 2025 +0000 Fix lint
commit 2818c8d Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 08:06:08 2025 +0000 Fix cr
commit a172468 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 07:01:07 2025 +0000 Fix lint
commit 3f863b2 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 06:54:18 2025 +0000 Add back timestamp
commit 2e53b24 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 06:45:16 2025 +0000 Fix lint
commit eea1a3a Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 06:37:30 2025 +0000 Add back timestamps
commit b4a45ef Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 06:21:48 2025 +0000 Remove old filter
commit f2df197 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 06:12:53 2025 +0000 Add _process_model_outputs back
commit a51cf25 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 03:46:45 2025 +0000 Fix abort
commit 1058ec0 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 02:43:14 2025 +0000 Remove blank todo
commit 670018e Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Feb 7 02:36:27 2025 +0000 Filter out migrating request
commit fa2fc9c Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 06:25:35 2025 +0000 Remove process_model_outputs request timestamps
commit 2a980ca Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 06:10:49 2025 +0000 Fix linting
commit 78a1ab4 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 05:30:15 2025 +0000 Fix request leaking bug of migration
commit 774205b Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 03:11:08 2025 +0000 Fix
commit 814521e Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 02:57:20 2025 +0000 Minors
commit b3f0688 Author: s5u13b <sunbiao.sun@alibaba-inc.com> Date: Fri Jan 24 01:56:09 2025 +0000 Change ci timeout-minutes
@@ -139,6 +140,10 @@ usage: -m llumnix.entrypoints.vllm.api_server [-h]
- Possible choices: balanced, load, queue, rr
- Default: "load"

`--power-of-k-choice`
I don't recommend using the term power-of-k-choices here. It's a well-established concept in the specific context of decentralized load balancing. But here it seems to me that you are only doing some sort of randomizing inside a centralized scheduler. Calling it power-of-k-choices is pretty misleading.
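
To make the distinction concrete, here is a rough sketch contrasting the two ideas. The first function is the classical power-of-k-choices rule: sample k instances at random and take the least loaded among the sample. The second is one plausible reading of "randomizing inside a centralized scheduler": rank every instance by load and pick uniformly among the k least loaded. The function names, the `loads` dictionary, and the second function's behavior are assumptions for illustration, not the code in this PR.

```python
import random
from typing import Dict


def power_of_k_choices(loads: Dict[str, float], k: int,
                       rng: random.Random) -> str:
    """Classical rule: probe k random instances, pick the least loaded one.

    Only k load values are inspected, which is what makes it attractive for
    decentralized dispatchers without a global view.
    """
    sampled = rng.sample(sorted(loads), min(k, len(loads)))
    return min(sampled, key=loads.__getitem__)


def random_among_k_least_loaded(loads: Dict[str, float], k: int,
                                rng: random.Random) -> str:
    """Centralized variant (assumed): rank all instances by load, then
    choose uniformly at random among the k least loaded."""
    ranked = sorted(loads, key=loads.__getitem__)
    return rng.choice(ranked[:k])
```

The first needs no global load view, while the second requires one and uses randomness only to spread requests across near-best candidates, which is the difference the naming concern rests on.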