Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix cuda error when run simulator on gpu node #104

Merged
merged 2 commits into from
Feb 18, 2025

Conversation

Tong0217
Copy link
Contributor

No description provided.

@CLAassistant
Copy link

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.


tongchenghao seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

Copy link

migration_size 8.00 MB 16.00 MB 24.00 MB 32.00 MB 40.00 MB 48.00 MB 56.00 MB 64.00 MB 72.00 MB 80.00 MB 88.00 MB 96.00 MB 104.00 MB 112.00 MB 120.00 MB 128.00 MB 136.00 MB 144.00 MB 152.00 MB 160.00 MB 168.00 MB 176.00 MB 184.00 MB 192.00 MB 200.00 MB 216.00 MB 272.00 MB 280.00 MB 296.00 MB 320.00 MB 472.00 MB
rayrpc_speed(GB/s) 0.89 1.32 1.62 1.86 1.90 2.06 2.11 2.12 2.26 2.28 2.31 2.28 2.43 2.31 2.33 2.38 2.53 2.49 2.51 2.50 2.54 2.44 2.48 2.58 2.69 2.44 2.67 2.80 2.82 2.99 3.28
migration_size 8.00 MB 16.00 MB 24.00 MB 32.00 MB 40.00 MB 48.00 MB 56.00 MB 64.00 MB 72.00 MB 80.00 MB 88.00 MB 96.00 MB 104.00 MB 112.00 MB 120.00 MB 128.00 MB 136.00 MB 144.00 MB 152.00 MB 160.00 MB 168.00 MB 176.00 MB 184.00 MB 192.00 MB 200.00 MB 208.00 MB 224.00 MB 232.00 MB 240.00 MB 248.00 MB 392.00 MB 480.00 MB
gloo_speed(GB/s) 0.89 1.45 1.82 2.06 2.15 2.47 2.56 2.62 2.40 2.57 2.63 2.68 2.56 2.86 2.65 2.50 2.73 2.34 2.63 2.71 2.52 2.34 0.25 2.16 2.65 2.47 1.93 1.77 2.54 0.32 0.72 2.53
migration_size 8.00 MB 16.00 MB 24.00 MB 32.00 MB 40.00 MB 48.00 MB 56.00 MB 64.00 MB 72.00 MB 80.00 MB 88.00 MB 96.00 MB 104.00 MB 112.00 MB 120.00 MB 128.00 MB 136.00 MB 144.00 MB 152.00 MB 160.00 MB 168.00 MB 176.00 MB 184.00 MB 192.00 MB 216.00 MB 224.00 MB 232.00 MB 240.00 MB 392.00 MB 480.00 MB 536.00 MB
nccl_speed(GB/s) 0.21 0.46 0.67 0.85 1.06 1.16 1.31 1.53 1.53 1.88 1.91 1.91 1.91 2.29 2.27 2.20 2.89 2.47 2.90 3.25 2.37 3.37 3.39 1.09 3.83 0.80 2.68 2.08 1.17 1.81 1.86

Copy link

prefill p25 p50 p75 p95 p99 mean
latency(ms) 910.12 1813.26 2735.47 77695.01 128055.40 13956.77
decode p25 p50 p75 p95 p99 mean
latency(ms) 65.32 119.55 183.14 1879.04 3659.35 304.02

@s5u13b s5u13b changed the title [BugFix]:fix cuda error when run simulator on gpu node. [BugFix] Fix cuda error when run simulator on gpu node Feb 17, 2025
@Tong0217 Tong0217 requested a review from ZeldaHuang February 17, 2025 09:09
@Tong0217 Tong0217 merged commit 5066d32 into main Feb 18, 2025
13 of 14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants