-
Notifications
You must be signed in to change notification settings - Fork 197
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About thrust API's performance #3851
Comments
Here's my test result.
|
This is probably best addressed by the CCCL team working on Thrust, let me transfer it to their repo. |
Hi! While Thrust APIs can be called from within a kernel (i.e. a Try replacing your kernel launch:
by a direct call to the Thrust API:
Mind that this also includes the memory transfer of the result back to the host. That should give you better results. I think we should also dedicate some time to update some of the samples. |
@bernhardmgruber Thanks a lot. I use following code
And the result is
Significant improvement! |
I am glad you could such an improvement! Cheers! |
ENV:
cpu: Intel® Core™ i7-14700K
ubuntu 22.04
GPU: NVIDIA GeForce RTX 4070 SUPER
CUDA 12.3
I'm currently learning to use thrust API. However, in my test, the thrust API's performance is quite low. Here's my timing code of 0_Introduction/c++11_cuda.cu
Are there any mistakes in my timing code?
The text was updated successfully, but these errors were encountered: