Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Opt] the binary size of libcuvs is increasing when add .cu files #634

Open
rhdong opened this issue Jan 31, 2025 · 6 comments
Open

[Opt] the binary size of libcuvs is increasing when add .cu files #634

rhdong opened this issue Jan 31, 2025 · 6 comments
Labels
feature request New feature or request

Comments

@rhdong
Copy link
Member

rhdong commented Jan 31, 2025

I observed that multiple depulicated kernels were compiled while developing PR #618.

  • The binary size increases even when the call to cagra::build was removed from cagra::merge, which indicating that the issue of duplicated kernels exists across a wider range and possibly happens when including the file cagra.cuh or cagra.hpp in a new src/*.cu .

The incremental symbols after the PR #618:

< Base __cudaRegisterLinkedBinary_0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5643
< Base __cudaRegisterLinkedBinary_0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_f77f5a7d_3626
< Base __cudaRegisterLinkedBinary_14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_f77f5a7d_3683
< Base __cudaRegisterLinkedBinary_16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_f77f5a7d_3675
< Base __cudaRegisterLinkedBinary_19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_f77f5a7d_3673
< Base __cudaRegisterLinkedBinary_22690673_33_search_single_cta_uint8_uint32_cu_f77f5a7d_7128
< Base __cudaRegisterLinkedBinary_2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5627
< Base __cudaRegisterLinkedBinary_353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_f77f5a7d_3635
< Base __cudaRegisterLinkedBinary_36bf2ef4_31_search_multi_cta_half_uint32_cu_f77f5a7d_5692
< Base __cudaRegisterLinkedBinary_3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_f77f5a7d_3576
< Base __cudaRegisterLinkedBinary_37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_f77f5a7d_3630
< Base __cudaRegisterLinkedBinary_393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_f77f5a7d_3681
< Base __cudaRegisterLinkedBinary_41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5662
< Base __cudaRegisterLinkedBinary_461ac610_21_cagra_search_uint8_cu_f77f5a7d_3687
< Base __cudaRegisterLinkedBinary_46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_f77f5a7d_3555
< Base __cudaRegisterLinkedBinary_4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_3665
< Base __cudaRegisterLinkedBinary_4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_f77f5a7d_3671
< Base __cudaRegisterLinkedBinary_505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5621
< Base __cudaRegisterLinkedBinary_5488f3ad_21_cagra_search_float_cu_f77f5a7d_3689
< Base __cudaRegisterLinkedBinary_56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5573
< Base __cudaRegisterLinkedBinary_5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5680
< Base __cudaRegisterLinkedBinary_5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5594
< Base __cudaRegisterLinkedBinary_66aa5569_33_search_single_cta_float_uint32_cu_f77f5a7d_5699
< Base __cudaRegisterLinkedBinary_67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5623
< Base __cudaRegisterLinkedBinary_6bdad738_32_search_single_cta_half_uint32_cu_f77f5a7d_5701
< Base __cudaRegisterLinkedBinary_72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5563
< Base __cudaRegisterLinkedBinary_74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5619
< Base __cudaRegisterLinkedBinary_78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5689
< Base __cudaRegisterLinkedBinary_79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_f77f5a7d_3616
< Base __cudaRegisterLinkedBinary_79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_f77f5a7d_3542
< Base __cudaRegisterLinkedBinary_8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_f77f5a7d_3580
< Base __cudaRegisterLinkedBinary_8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_f77f5a7d_3677
< Base __cudaRegisterLinkedBinary_8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5653
< Base __cudaRegisterLinkedBinary_92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_f77f5a7d_3564
< Base __cudaRegisterLinkedBinary_9353ec83_20_cagra_search_int8_cu_f77f5a7d_3685
< Base __cudaRegisterLinkedBinary_ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5645
< Base __cudaRegisterLinkedBinary_ac1c7797_31_search_multi_cta_int8_uint32_cu_f77f5a7d_5695
< Base __cudaRegisterLinkedBinary_aefa97cb_19_compute_distance_cu_f77f5a7d_3505
< Base __cudaRegisterLinkedBinary_b0e6ed11_32_search_multi_cta_uint8_uint32_cu_f77f5a7d_5697
< Base __cudaRegisterLinkedBinary_b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_f77f5a7d_3601
< Base __cudaRegisterLinkedBinary_b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_f77f5a7d_3606
< Base __cudaRegisterLinkedBinary_bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_f77f5a7d_3639
< Base __cudaRegisterLinkedBinary_bf30f130_20_cagra_search_half_cu_f77f5a7d_3691
< Base __cudaRegisterLinkedBinary_c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_f77f5a7d_3680
< Base __cudaRegisterLinkedBinary_c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5641
< Base __cudaRegisterLinkedBinary_d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_3597
< Base __cudaRegisterLinkedBinary_d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5628
< Base __cudaRegisterLinkedBinary_d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5677
< Base __cudaRegisterLinkedBinary_d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_f77f5a7d_3669
< Base __cudaRegisterLinkedBinary_da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_f77f5a7d_3624
< Base __cudaRegisterLinkedBinary_e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5664
< Base __cudaRegisterLinkedBinary_ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_3667
< Base __cudaRegisterLinkedBinary_f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5611
< Base __cudaRegisterLinkedBinary_f1798e5b_32_search_single_cta_int8_uint32_cu_f77f5a7d_5703
< Base __cudaRegisterLinkedBinary_f425be0b_32_search_multi_cta_float_uint32_cu_f77f5a7d_5693
< Base __cudaRegisterLinkedBinary_f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5583
< Base __cudaRegisterLinkedBinary_f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_f77f5a7d_3588
< Base __cudaRegisterLinkedBinary_f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_f77f5a7d_3637
< Base __cudaRegisterLinkedBinary_fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_f77f5a7d_3613
< Base __cudaRegisterLinkedBinary_fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5687
< Base __cudaRegisterLinkedBinary_fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5603
---
> Base __cudaRegisterLinkedBinary_0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5704
> Base __cudaRegisterLinkedBinary_0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_c6ca332b_3707
> Base __cudaRegisterLinkedBinary_14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_c6ca332b_3709
> Base __cudaRegisterLinkedBinary_16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_c6ca332b_3658
> Base __cudaRegisterLinkedBinary_19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_c6ca332b_3662
> Base __cudaRegisterLinkedBinary_22690673_33_search_single_cta_uint8_uint32_cu_c6ca332b_7168
> Base __cudaRegisterLinkedBinary_2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5667
> Base __cudaRegisterLinkedBinary_353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_c6ca332b_3705
> Base __cudaRegisterLinkedBinary_36bf2ef4_31_search_multi_cta_half_uint32_cu_c6ca332b_5731
> Base __cudaRegisterLinkedBinary_3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_c6ca332b_3631
> Base __cudaRegisterLinkedBinary_37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_c6ca332b_3711
> Base __cudaRegisterLinkedBinary_393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_c6ca332b_3719
> Base __cudaRegisterLinkedBinary_41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5692
> Base __cudaRegisterLinkedBinary_461ac610_21_cagra_search_uint8_cu_c6ca332b_3723
> Base __cudaRegisterLinkedBinary_46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_c6ca332b_3602
> Base __cudaRegisterLinkedBinary_4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_3715
> Base __cudaRegisterLinkedBinary_4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_c6ca332b_3611
> Base __cudaRegisterLinkedBinary_505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5643
> Base __cudaRegisterLinkedBinary_5488f3ad_21_cagra_search_float_cu_c6ca332b_3727
> Base __cudaRegisterLinkedBinary_56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5610
> Base __cudaRegisterLinkedBinary_5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5717
> Base __cudaRegisterLinkedBinary_5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5635
> Base __cudaRegisterLinkedBinary_66aa5569_33_search_single_cta_float_uint32_cu_c6ca332b_5735
> Base __cudaRegisterLinkedBinary_67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5685
> Base __cudaRegisterLinkedBinary_6bdad738_32_search_single_cta_half_uint32_cu_c6ca332b_5737
> Base __cudaRegisterLinkedBinary_72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5618
> Base __cudaRegisterLinkedBinary_74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5669
> Base __cudaRegisterLinkedBinary_78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5709
> Base __cudaRegisterLinkedBinary_79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_c6ca332b_3698
> Base __cudaRegisterLinkedBinary_79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_c6ca332b_3650
> Base __cudaRegisterLinkedBinary_8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_c6ca332b_3628
> Base __cudaRegisterLinkedBinary_8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_c6ca332b_3667
> Base __cudaRegisterLinkedBinary_8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5687
> Base __cudaRegisterLinkedBinary_92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_c6ca332b_3681
> Base __cudaRegisterLinkedBinary_9353ec83_20_cagra_search_int8_cu_c6ca332b_3725
> Base __cudaRegisterLinkedBinary_ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5683
> Base __cudaRegisterLinkedBinary_ac1c7797_31_search_multi_cta_int8_uint32_cu_c6ca332b_5729
> Base __cudaRegisterLinkedBinary_aefa97cb_19_compute_distance_cu_c6ca332b_3574
> Base __cudaRegisterLinkedBinary_b0e6ed11_32_search_multi_cta_uint8_uint32_cu_c6ca332b_5733
> Base __cudaRegisterLinkedBinary_b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_c6ca332b_3626
> Base __cudaRegisterLinkedBinary_b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_c6ca332b_3600
> Base __cudaRegisterLinkedBinary_bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_c6ca332b_3671
> Base __cudaRegisterLinkedBinary_bf30f130_20_cagra_search_half_cu_c6ca332b_3729
> Base __cudaRegisterLinkedBinary_c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_c6ca332b_3571
> Base __cudaRegisterLinkedBinary_c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5659
> Base __cudaRegisterLinkedBinary_d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_3717
> Base __cudaRegisterLinkedBinary_d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5653
> Base __cudaRegisterLinkedBinary_d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5707
> Base __cudaRegisterLinkedBinary_d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_c6ca332b_3606
> Base __cudaRegisterLinkedBinary_da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_c6ca332b_3669
> Base __cudaRegisterLinkedBinary_e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5689
> Base __cudaRegisterLinkedBinary_ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_3721
> Base __cudaRegisterLinkedBinary_f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5629
> Base __cudaRegisterLinkedBinary_f1798e5b_32_search_single_cta_int8_uint32_cu_c6ca332b_5739
> Base __cudaRegisterLinkedBinary_f425be0b_32_search_multi_cta_float_uint32_cu_c6ca332b_5727
> Base __cudaRegisterLinkedBinary_f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5634
> Base __cudaRegisterLinkedBinary_f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_c6ca332b_3713
> Base __cudaRegisterLinkedBinary_f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_c6ca332b_3683
> Base __cudaRegisterLinkedBinary_fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_c6ca332b_3674
> Base __cudaRegisterLinkedBinary_fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5719
> Base __cudaRegisterLinkedBinary_fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5626
194,254c194,254
< Base __fatbinwrap_0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5643
< Base __fatbinwrap_0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_f77f5a7d_3626
< Base __fatbinwrap_14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_f77f5a7d_3683
< Base __fatbinwrap_16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_f77f5a7d_3675
< Base __fatbinwrap_19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_f77f5a7d_3673
< Base __fatbinwrap_22690673_33_search_single_cta_uint8_uint32_cu_f77f5a7d_7128
< Base __fatbinwrap_2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5627
< Base __fatbinwrap_353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_f77f5a7d_3635
< Base __fatbinwrap_36bf2ef4_31_search_multi_cta_half_uint32_cu_f77f5a7d_5692
< Base __fatbinwrap_3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_f77f5a7d_3576
< Base __fatbinwrap_37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_f77f5a7d_3630
< Base __fatbinwrap_393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_f77f5a7d_3681
< Base __fatbinwrap_41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5662
< Base __fatbinwrap_461ac610_21_cagra_search_uint8_cu_f77f5a7d_3687
< Base __fatbinwrap_46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_f77f5a7d_3555
< Base __fatbinwrap_4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_3665
< Base __fatbinwrap_4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_f77f5a7d_3671
< Base __fatbinwrap_505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5621
< Base __fatbinwrap_5488f3ad_21_cagra_search_float_cu_f77f5a7d_3689
< Base __fatbinwrap_56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5573
< Base __fatbinwrap_5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5680
< Base __fatbinwrap_5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5594
< Base __fatbinwrap_66aa5569_33_search_single_cta_float_uint32_cu_f77f5a7d_5699
< Base __fatbinwrap_67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5623
< Base __fatbinwrap_6bdad738_32_search_single_cta_half_uint32_cu_f77f5a7d_5701
< Base __fatbinwrap_72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5563
< Base __fatbinwrap_74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5619
< Base __fatbinwrap_78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5689
< Base __fatbinwrap_79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_f77f5a7d_3616
< Base __fatbinwrap_79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_f77f5a7d_3542
< Base __fatbinwrap_8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_f77f5a7d_3580
< Base __fatbinwrap_8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_f77f5a7d_3677
< Base __fatbinwrap_8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_5653
< Base __fatbinwrap_92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_f77f5a7d_3564
< Base __fatbinwrap_9353ec83_20_cagra_search_int8_cu_f77f5a7d_3685
< Base __fatbinwrap_ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_5645
< Base __fatbinwrap_ac1c7797_31_search_multi_cta_int8_uint32_cu_f77f5a7d_5695
< Base __fatbinwrap_aefa97cb_19_compute_distance_cu_f77f5a7d_3505
< Base __fatbinwrap_b0e6ed11_32_search_multi_cta_uint8_uint32_cu_f77f5a7d_5697
< Base __fatbinwrap_b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_f77f5a7d_3601
< Base __fatbinwrap_b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_f77f5a7d_3606
< Base __fatbinwrap_bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_f77f5a7d_3639
< Base __fatbinwrap_bf30f130_20_cagra_search_half_cu_f77f5a7d_3691
< Base __fatbinwrap_c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_f77f5a7d_3680
< Base __fatbinwrap_c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5641
< Base __fatbinwrap_d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_3597
< Base __fatbinwrap_d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5628
< Base __fatbinwrap_d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5677
< Base __fatbinwrap_d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_f77f5a7d_3669
< Base __fatbinwrap_da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_f77f5a7d_3624
< Base __fatbinwrap_e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_5664
< Base __fatbinwrap_ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_3667
< Base __fatbinwrap_f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_5611
< Base __fatbinwrap_f1798e5b_32_search_single_cta_int8_uint32_cu_f77f5a7d_5703
< Base __fatbinwrap_f425be0b_32_search_multi_cta_float_uint32_cu_f77f5a7d_5693
< Base __fatbinwrap_f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5583
< Base __fatbinwrap_f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_f77f5a7d_3588
< Base __fatbinwrap_f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_f77f5a7d_3637
< Base __fatbinwrap_fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_f77f5a7d_3613
< Base __fatbinwrap_fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_5687
< Base __fatbinwrap_fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_5603
---
> Base __fatbinwrap_0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5704
> Base __fatbinwrap_0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_c6ca332b_3707
> Base __fatbinwrap_14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_c6ca332b_3709
> Base __fatbinwrap_16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_c6ca332b_3658
> Base __fatbinwrap_19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_c6ca332b_3662
> Base __fatbinwrap_22690673_33_search_single_cta_uint8_uint32_cu_c6ca332b_7168
> Base __fatbinwrap_2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5667
> Base __fatbinwrap_353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_c6ca332b_3705
> Base __fatbinwrap_36bf2ef4_31_search_multi_cta_half_uint32_cu_c6ca332b_5731
> Base __fatbinwrap_3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_c6ca332b_3631
> Base __fatbinwrap_37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_c6ca332b_3711
> Base __fatbinwrap_393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_c6ca332b_3719
> Base __fatbinwrap_41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5692
> Base __fatbinwrap_461ac610_21_cagra_search_uint8_cu_c6ca332b_3723
> Base __fatbinwrap_46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_c6ca332b_3602
> Base __fatbinwrap_4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_3715
> Base __fatbinwrap_4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_c6ca332b_3611
> Base __fatbinwrap_505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5643
> Base __fatbinwrap_5488f3ad_21_cagra_search_float_cu_c6ca332b_3727
> Base __fatbinwrap_56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5610
> Base __fatbinwrap_5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5717
> Base __fatbinwrap_5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5635
> Base __fatbinwrap_66aa5569_33_search_single_cta_float_uint32_cu_c6ca332b_5735
> Base __fatbinwrap_67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5685
> Base __fatbinwrap_6bdad738_32_search_single_cta_half_uint32_cu_c6ca332b_5737
> Base __fatbinwrap_72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5618
> Base __fatbinwrap_74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5669
> Base __fatbinwrap_78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5709
> Base __fatbinwrap_79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_c6ca332b_3698
> Base __fatbinwrap_79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_c6ca332b_3650
> Base __fatbinwrap_8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_c6ca332b_3628
> Base __fatbinwrap_8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_c6ca332b_3667
> Base __fatbinwrap_8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_5687
> Base __fatbinwrap_92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_c6ca332b_3681
> Base __fatbinwrap_9353ec83_20_cagra_search_int8_cu_c6ca332b_3725
> Base __fatbinwrap_ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_5683
> Base __fatbinwrap_ac1c7797_31_search_multi_cta_int8_uint32_cu_c6ca332b_5729
> Base __fatbinwrap_aefa97cb_19_compute_distance_cu_c6ca332b_3574
> Base __fatbinwrap_b0e6ed11_32_search_multi_cta_uint8_uint32_cu_c6ca332b_5733
> Base __fatbinwrap_b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_c6ca332b_3626
> Base __fatbinwrap_b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_c6ca332b_3600
> Base __fatbinwrap_bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_c6ca332b_3671
> Base __fatbinwrap_bf30f130_20_cagra_search_half_cu_c6ca332b_3729
> Base __fatbinwrap_c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_c6ca332b_3571
> Base __fatbinwrap_c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5659
> Base __fatbinwrap_d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_3717
> Base __fatbinwrap_d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5653
> Base __fatbinwrap_d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5707
> Base __fatbinwrap_d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_c6ca332b_3606
> Base __fatbinwrap_da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_c6ca332b_3669
> Base __fatbinwrap_e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_5689
> Base __fatbinwrap_ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_3721
> Base __fatbinwrap_f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_5629
> Base __fatbinwrap_f1798e5b_32_search_single_cta_int8_uint32_cu_c6ca332b_5739
> Base __fatbinwrap_f425be0b_32_search_multi_cta_float_uint32_cu_c6ca332b_5727
> Base __fatbinwrap_f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5634
> Base __fatbinwrap_f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_c6ca332b_3713
> Base __fatbinwrap_f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_c6ca332b_3683
> Base __fatbinwrap_fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_c6ca332b_3674
> Base __fatbinwrap_fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_5719
> Base __fatbinwrap_fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_5626
943,988c943,988
< Base _ZN4cuda3std3__4100_GLOBAL__N__14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_f77f5a7d_36836ignoreE
< Base _ZN4cuda3std3__4100_GLOBAL__N__353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_f77f5a7d_36356ignoreE
< Base _ZN4cuda3std3__4100_GLOBAL__N__8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_f77f5a7d_36776ignoreE
< Base _ZN4cuda3std3__4100_GLOBAL__N__b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_f77f5a7d_36016ignoreE
< Base _ZN4cuda3std3__4100_GLOBAL__N__da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_f77f5a7d_36246ignoreE
< Base _ZN4cuda3std3__4100_GLOBAL__N__fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_f77f5a7d_36136ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_f77f5a7d_36266ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_f77f5a7d_36736ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_f77f5a7d_36816ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_f77f5a7d_35646ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_f77f5a7d_36806ignoreE
< Base _ZN4cuda3std3__4101_GLOBAL__N__f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_f77f5a7d_35886ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_f77f5a7d_36756ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_f77f5a7d_35766ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_f77f5a7d_35556ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_f77f5a7d_35426ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_f77f5a7d_36696ignoreE
< Base _ZN4cuda3std3__4102_GLOBAL__N__f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_f77f5a7d_36376ignoreE
< Base _ZN4cuda3std3__4103_GLOBAL__N__4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_f77f5a7d_36716ignoreE
< Base _ZN4cuda3std3__4103_GLOBAL__N__79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_f77f5a7d_36166ignoreE
< Base _ZN4cuda3std3__4103_GLOBAL__N__8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_f77f5a7d_35806ignoreE
< Base _ZN4cuda3std3__4103_GLOBAL__N__b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_f77f5a7d_36066ignoreE
< Base _ZN4cuda3std3__4109_GLOBAL__N__5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_55946ignoreE
< Base _ZN4cuda3std3__4109_GLOBAL__N__67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_56236ignoreE
< Base _ZN4cuda3std3__4109_GLOBAL__N__c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_56416ignoreE
< Base _ZN4cuda3std3__4109_GLOBAL__N__fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_56036ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_56436ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_56276ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_56626ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_f77f5a7d_36656ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_56216ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_56196ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_56536ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_56456ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_56286ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_56646ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_f77f5a7d_36676ignoreE
< Base _ZN4cuda3std3__4110_GLOBAL__N__f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_56116ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_55736ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_f77f5a7d_56806ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_55636ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_f77f5a7d_56896ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_35976ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_f77f5a7d_56776ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_55836ignoreE
< Base _ZN4cuda3std3__4111_GLOBAL__N__fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_f77f5a7d_56876ignoreE
---
> Base _ZN4cuda3std3__4100_GLOBAL__N__14b6edf8_62_compute_distance_standard_L2Expanded_int8_uint32_dim512_t32_cu_c6ca332b_37096ignoreE
> Base _ZN4cuda3std3__4100_GLOBAL__N__353026eb_62_compute_distance_standard_L2Expanded_half_uint32_dim256_t16_cu_c6ca332b_37056ignoreE
> Base _ZN4cuda3std3__4100_GLOBAL__N__8a4bf26a_62_compute_distance_standard_L2Expanded_uint8_uint32_dim128_t8_cu_c6ca332b_36676ignoreE
> Base _ZN4cuda3std3__4100_GLOBAL__N__b5758f46_62_compute_distance_standard_L2Expanded_float_uint32_dim128_t8_cu_c6ca332b_36266ignoreE
> Base _ZN4cuda3std3__4100_GLOBAL__N__da6e6d40_62_compute_distance_standard_L2Expanded_int8_uint32_dim256_t16_cu_c6ca332b_36696ignoreE
> Base _ZN4cuda3std3__4100_GLOBAL__N__fbe8a653_62_compute_distance_standard_L2Expanded_half_uint32_dim512_t32_cu_c6ca332b_36746ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__0bdc89a9_63_compute_distance_standard_L2Expanded_float_uint32_dim256_t16_cu_c6ca332b_37076ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__19e7b356_63_compute_distance_standard_InnerProduct_int8_uint32_dim128_t8_cu_c6ca332b_36626ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__393bdb37_63_compute_distance_standard_L2Expanded_uint8_uint32_dim256_t16_cu_c6ca332b_37196ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__92bc3215_63_compute_distance_standard_InnerProduct_half_uint32_dim128_t8_cu_c6ca332b_36816ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__c5040911_63_compute_distance_standard_L2Expanded_float_uint32_dim512_t32_cu_c6ca332b_35716ignoreE
> Base _ZN4cuda3std3__4101_GLOBAL__N__f7e35b8f_63_compute_distance_standard_L2Expanded_uint8_uint32_dim512_t32_cu_c6ca332b_37136ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__16c70bf1_64_compute_distance_standard_InnerProduct_int8_uint32_dim256_t16_cu_c6ca332b_36586ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__3741c0e2_64_compute_distance_standard_InnerProduct_half_uint32_dim512_t32_cu_c6ca332b_36316ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__46e294db_64_compute_distance_standard_InnerProduct_uint8_uint32_dim128_t8_cu_c6ca332b_36026ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__79dce9f7_64_compute_distance_standard_InnerProduct_float_uint32_dim128_t8_cu_c6ca332b_36506ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__d81f8b49_64_compute_distance_standard_InnerProduct_int8_uint32_dim512_t32_cu_c6ca332b_36066ignoreE
> Base _ZN4cuda3std3__4102_GLOBAL__N__f999405a_64_compute_distance_standard_InnerProduct_half_uint32_dim256_t16_cu_c6ca332b_36836ignoreE
> Base _ZN4cuda3std3__4103_GLOBAL__N__4b4971f3_65_compute_distance_standard_InnerProduct_uint8_uint32_dim512_t32_cu_c6ca332b_36116ignoreE
> Base _ZN4cuda3std3__4103_GLOBAL__N__79ae236d_65_compute_distance_standard_InnerProduct_float_uint32_dim512_t32_cu_c6ca332b_36986ignoreE
> Base _ZN4cuda3std3__4103_GLOBAL__N__8591f14b_65_compute_distance_standard_InnerProduct_uint8_uint32_dim256_t16_cu_c6ca332b_36286ignoreE
> Base _ZN4cuda3std3__4103_GLOBAL__N__b776a3d5_65_compute_distance_standard_InnerProduct_float_uint32_dim256_t16_cu_c6ca332b_36006ignoreE
> Base _ZN4cuda3std3__4109_GLOBAL__N__5d852589_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_56356ignoreE
> Base _ZN4cuda3std3__4109_GLOBAL__N__67697925_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_56856ignoreE
> Base _ZN4cuda3std3__4109_GLOBAL__N__c78ca6f8_71_compute_distance_vpq_L2Expanded_int8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_56596ignoreE
> Base _ZN4cuda3std3__4109_GLOBAL__N__fd60fa54_71_compute_distance_vpq_L2Expanded_half_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_56266ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__0bad977c_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_57046ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__2fe4f0ab_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_56676ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__41b4a96c_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_56926ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__4a33b5e2_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_4subd_half_cu_c6ca332b_37156ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__505b2ce9_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_56436ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__74124b3e_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_56696ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__8f012f76_72_compute_distance_vpq_L2Expanded_int8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_56876ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__ab4848a1_72_compute_distance_vpq_L2Expanded_int8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_56836ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__d4f794e3_72_compute_distance_vpq_L2Expanded_half_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_56536ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__e15176b1_72_compute_distance_vpq_L2Expanded_uint8_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_56896ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__ead66a3f_72_compute_distance_vpq_L2Expanded_float_uint32_dim128_t8_8pq_2subd_half_cu_c6ca332b_37216ignoreE
> Base _ZN4cuda3std3__4110_GLOBAL__N__f0bef334_72_compute_distance_vpq_L2Expanded_half_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_56296ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__56a70ec8_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_56106ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__5cac27f3_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_2subd_half_cu_c6ca332b_57176ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__72ee691f_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_56186ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__78e54024_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_4subd_half_cu_c6ca332b_57096ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__d20bb6c2_73_compute_distance_vpq_L2Expanded_float_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_37176ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__d8009ff9_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim256_t16_8pq_2subd_half_cu_c6ca332b_57076ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__f642d115_73_compute_distance_vpq_L2Expanded_float_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_56346ignoreE
> Base _ZN4cuda3std3__4111_GLOBAL__N__fc49f82e_73_compute_distance_vpq_L2Expanded_uint8_uint32_dim512_t32_8pq_4subd_half_cu_c6ca332b_57196ignoreE
994c994
< Base _ZN4cuda3std3__446_GLOBAL__N__64f16e7a_9_bitset_cu_f77f5a7d_74086ignoreE
---
> Base _ZN4cuda3std3__446_GLOBAL__N__64f16e7a_9_bitset_cu_c6ca332b_74866ignoreE
998c998
< Base _ZN4cuda3std3__450_GLOBAL__N__4f4a4758_11_distance_cu_f77f5a7d_111546ignoreE
---
> Base _ZN4cuda3std3__450_GLOBAL__N__4f4a4758_11_distance_cu_c6ca332b_112336ignoreE
1009,1010d1008
< Base _ZN4cuda3std3__452_GLOBAL__N__95959c91_19_cagra_merge_int8_cu_1519f3d66ignoreE
< Base _ZN4cuda3std3__452_GLOBAL__N__b9f68122_19_cagra_merge_half_cu_bdc4ae5e6ignoreE
1012c1010
< Base _ZN4cuda3std3__452_GLOBAL__N__db96ccea_14_gram_matrix_cu_f77f5a7d_73366ignoreE
---
> Base _ZN4cuda3std3__452_GLOBAL__N__db96ccea_14_gram_matrix_cu_c6ca332b_74286ignoreE
1017d1014
< Base _ZN4cuda3std3__453_GLOBAL__N__a7374495_20_cagra_merge_float_cu_61811a596ignoreE
1019d1015
< Base _ZN4cuda3std3__453_GLOBAL__N__b5a57128_20_cagra_merge_uint8_cu_88b0c9476ignoreE
1026c1022
< Base _ZN4cuda3std3__454_GLOBAL__N__328d012e_15_dispatch_rbf_cu_f77f5a7d_104016ignoreE
---
> Base _ZN4cuda3std3__454_GLOBAL__N__328d012e_15_dispatch_rbf_cu_c6ca332b_104046ignoreE
1032c1028
< Base _ZN4cuda3std3__455_GLOBAL__N__3178ac04_17_kernel_factory_cu_f77f5a7d_74766ignoreE
---
> Base _ZN4cuda3std3__455_GLOBAL__N__3178ac04_17_kernel_factory_cu_c6ca332b_74986ignoreE
1034c1030
< Base _ZN4cuda3std3__455_GLOBAL__N__ad2089a5_16_sample_filter_cu_f77f5a7d_170376ignoreE
---
> Base _ZN4cuda3std3__455_GLOBAL__N__ad2089a5_16_sample_filter_cu_c6ca332b_169336ignoreE
1050c1046
< Base _ZN4cuda3std3__457_GLOBAL__N__aefa97cb_19_compute_distance_cu_f77f5a7d_35056ignoreE
---
> Base _ZN4cuda3std3__457_GLOBAL__N__aefa97cb_19_compute_distance_cu_c6ca332b_35746ignoreE
1057,1058c1053,1054
< Base _ZN4cuda3std3__458_GLOBAL__N__9353ec83_20_cagra_search_int8_cu_f77f5a7d_36856ignoreE
< Base _ZN4cuda3std3__458_GLOBAL__N__bf30f130_20_cagra_search_half_cu_f77f5a7d_36916ignoreE
---
> Base _ZN4cuda3std3__458_GLOBAL__N__9353ec83_20_cagra_search_int8_cu_c6ca332b_37256ignoreE
> Base _ZN4cuda3std3__458_GLOBAL__N__bf30f130_20_cagra_search_half_cu_c6ca332b_37296ignoreE
1064c1060
< Base _ZN4cuda3std3__459_GLOBAL__N__461ac610_21_cagra_search_uint8_cu_f77f5a7d_36876ignoreE
---
> Base _ZN4cuda3std3__459_GLOBAL__N__461ac610_21_cagra_search_uint8_cu_c6ca332b_37236ignoreE
1066c1062
< Base _ZN4cuda3std3__459_GLOBAL__N__5488f3ad_21_cagra_search_float_cu_f77f5a7d_36896ignoreE
---
> Base _ZN4cuda3std3__459_GLOBAL__N__5488f3ad_21_cagra_search_float_cu_c6ca332b_37276ignoreE
1068c1064
< Base _ZN4cuda3std3__459_GLOBAL__N__e1405c66_20_fused_distance_nn_cu_f77f5a7d_111456ignoreE
---
> Base _ZN4cuda3std3__459_GLOBAL__N__e1405c66_20_fused_distance_nn_cu_c6ca332b_112206ignoreE
1090c1086
< Base _ZN4cuda3std3__463_GLOBAL__N__8f9b59ea_24_iface_pq_half_int64_t_cu_f77f5a7d_127416ignoreE
---
> Base _ZN4cuda3std3__463_GLOBAL__N__8f9b59ea_24_iface_pq_half_int64_t_cu_c6ca332b_126116ignoreE
1093c1089
< Base _ZN4cuda3std3__464_GLOBAL__N__f58201ad_25_iface_pq_float_int64_t_cu_f77f5a7d_127346ignoreE
---
> Base _ZN4cuda3std3__464_GLOBAL__N__f58201ad_25_iface_pq_float_int64_t_cu_c6ca332b_126486ignoreE
1096c1092
< Base _ZN4cuda3std3__465_GLOBAL__N__c680846e_26_iface_pq_int8_t_int64_t_cu_f77f5a7d_129816ignoreE
---
> Base _ZN4cuda3std3__465_GLOBAL__N__c680846e_26_iface_pq_int8_t_int64_t_cu_c6ca332b_126566ignoreE
1099,1100c1095,1096
< Base _ZN4cuda3std3__466_GLOBAL__N__8cb523a1_27_iface_flat_float_int64_t_cu_f77f5a7d_127186ignoreE
< Base _ZN4cuda3std3__466_GLOBAL__N__f07524a0_27_iface_pq_uint8_t_int64_t_cu_f77f5a7d_129866ignoreE
---
> Base _ZN4cuda3std3__466_GLOBAL__N__8cb523a1_27_iface_flat_float_int64_t_cu_c6ca332b_122696ignoreE
> Base _ZN4cuda3std3__466_GLOBAL__N__f07524a0_27_iface_pq_uint8_t_int64_t_cu_c6ca332b_128856ignoreE
1102,1103c1098,1099
< Base _ZN4cuda3std3__467_GLOBAL__N__cf4fff67_28_iface_flat_int8_t_int64_t_cu_f77f5a7d_127226ignoreE
< Base _ZN4cuda3std3__467_GLOBAL__N__d2287562_28_iface_cagra_half_uint32_t_cu_f77f5a7d_123436ignoreE
---
> Base _ZN4cuda3std3__467_GLOBAL__N__cf4fff67_28_iface_flat_int8_t_int64_t_cu_c6ca332b_125586ignoreE
> Base _ZN4cuda3std3__467_GLOBAL__N__d2287562_28_iface_cagra_half_uint32_t_cu_c6ca332b_121076ignoreE
1106,1107c1102,1103
< Base _ZN4cuda3std3__468_GLOBAL__N__89a0537f_29_iface_flat_uint8_t_int64_t_cu_f77f5a7d_127296ignoreE
< Base _ZN4cuda3std3__468_GLOBAL__N__f38cfc4f_29_iface_cagra_float_uint32_t_cu_f77f5a7d_123356ignoreE
---
> Base _ZN4cuda3std3__468_GLOBAL__N__89a0537f_29_iface_flat_uint8_t_int64_t_cu_c6ca332b_126026ignoreE
> Base _ZN4cuda3std3__468_GLOBAL__N__f38cfc4f_29_iface_cagra_float_uint32_t_cu_c6ca332b_119376ignoreE
1109,1110c1105,1106
< Base _ZN4cuda3std3__469_GLOBAL__N__36bf2ef4_31_search_multi_cta_half_uint32_cu_f77f5a7d_56926ignoreE
< Base _ZN4cuda3std3__469_GLOBAL__N__6ea4ef21_30_iface_cagra_int8_t_uint32_t_cu_f77f5a7d_123536ignoreE
---
> Base _ZN4cuda3std3__469_GLOBAL__N__36bf2ef4_31_search_multi_cta_half_uint32_cu_c6ca332b_57316ignoreE
> Base _ZN4cuda3std3__469_GLOBAL__N__6ea4ef21_30_iface_cagra_int8_t_uint32_t_cu_c6ca332b_122306ignoreE
1112c1108
< Base _ZN4cuda3std3__469_GLOBAL__N__ac1c7797_31_search_multi_cta_int8_uint32_cu_f77f5a7d_56956ignoreE
---
> Base _ZN4cuda3std3__469_GLOBAL__N__ac1c7797_31_search_multi_cta_int8_uint32_cu_c6ca332b_57296ignoreE
1117,1123c1113,1119
< Base _ZN4cuda3std3__470_GLOBAL__N__6bdad738_32_search_single_cta_half_uint32_cu_f77f5a7d_57016ignoreE
< Base _ZN4cuda3std3__470_GLOBAL__N__b0e6ed11_32_search_multi_cta_uint8_uint32_cu_f77f5a7d_56976ignoreE
< Base _ZN4cuda3std3__470_GLOBAL__N__c5dc515a_31_iface_cagra_uint8_t_uint32_t_cu_f77f5a7d_126816ignoreE
< Base _ZN4cuda3std3__470_GLOBAL__N__f1798e5b_32_search_single_cta_int8_uint32_cu_f77f5a7d_57036ignoreE
< Base _ZN4cuda3std3__470_GLOBAL__N__f425be0b_32_search_multi_cta_float_uint32_cu_f77f5a7d_56936ignoreE
< Base _ZN4cuda3std3__471_GLOBAL__N__22690673_33_search_single_cta_uint8_uint32_cu_f77f5a7d_71286ignoreE
< Base _ZN4cuda3std3__471_GLOBAL__N__66aa5569_33_search_single_cta_float_uint32_cu_f77f5a7d_56996ignoreE
---
> Base _ZN4cuda3std3__470_GLOBAL__N__6bdad738_32_search_single_cta_half_uint32_cu_c6ca332b_57376ignoreE
> Base _ZN4cuda3std3__470_GLOBAL__N__b0e6ed11_32_search_multi_cta_uint8_uint32_cu_c6ca332b_57336ignoreE
> Base _ZN4cuda3std3__470_GLOBAL__N__c5dc515a_31_iface_cagra_uint8_t_uint32_t_cu_c6ca332b_122676ignoreE
> Base _ZN4cuda3std3__470_GLOBAL__N__f1798e5b_32_search_single_cta_int8_uint32_cu_c6ca332b_57396ignoreE
> Base _ZN4cuda3std3__470_GLOBAL__N__f425be0b_32_search_multi_cta_float_uint32_cu_c6ca332b_57276ignoreE
> Base _ZN4cuda3std3__471_GLOBAL__N__22690673_33_search_single_cta_uint8_uint32_cu_c6ca332b_71686ignoreE
> Base _ZN4cuda3std3__471_GLOBAL__N__66aa5569_33_search_single_cta_float_uint32_cu_c6ca332b_57356ignoreE
1127c1123
< Base _ZN4cuda3std3__473_GLOBAL__N__0ab3eb41_35_dispatch_l1_half_float_float_int_cu_f77f5a7d_92676ignoreE
---
> Base _ZN4cuda3std3__473_GLOBAL__N__0ab3eb41_35_dispatch_l1_half_float_float_int_cu_c6ca332b_93986ignoreE
1129,1138c1125,1134
< Base _ZN4cuda3std3__474_GLOBAL__N__9e9dbffe_36_dispatch_l1_float_float_float_int_cu_f77f5a7d_92126ignoreE
< Base _ZN4cuda3std3__477_GLOBAL__N__55339711_38_dispatch_l_inf_half_float_float_int_cu_f77f5a7d_100496ignoreE
< Base _ZN4cuda3std3__477_GLOBAL__N__75446a56_39_dispatch_cosine_half_float_float_int_cu_f77f5a7d_83846ignoreE
< Base _ZN4cuda3std3__477_GLOBAL__N__7fe93dd3_39_dispatch_l1_double_double_double_int_cu_f77f5a7d_93666ignoreE
< Base _ZN4cuda3std3__477_GLOBAL__N__cd17b4f0_38_ivf_pq_compute_similarity_half_half_cu_f77f5a7d_148486ignoreE
< Base _ZN4cuda3std3__478_GLOBAL__N__1d31cdb8_40_dispatch_cosine_float_float_float_int_cu_f77f5a7d_83386ignoreE
< Base _ZN4cuda3std3__478_GLOBAL__N__860a2c90_39_ivf_pq_compute_similarity_float_half_cu_f77f5a7d_148586ignoreE
< Base _ZN4cuda3std3__478_GLOBAL__N__f5a96e76_39_dispatch_l_inf_float_float_float_int_cu_f77f5a7d_100096ignoreE
< Base _ZN4cuda3std3__479_GLOBAL__N__82676a98_40_ivf_pq_compute_similarity_float_float_cu_f77f5a7d_148906ignoreE
< Base _ZN4cuda3std3__479_GLOBAL__N__9de1c87e_41_dispatch_canberra_half_float_float_int_cu_f77f5a7d_74926ignoreE
---
> Base _ZN4cuda3std3__474_GLOBAL__N__9e9dbffe_36_dispatch_l1_float_float_float_int_cu_c6ca332b_93886ignoreE
> Base _ZN4cuda3std3__477_GLOBAL__N__55339711_38_dispatch_l_inf_half_float_float_int_cu_c6ca332b_100836ignoreE
> Base _ZN4cuda3std3__477_GLOBAL__N__75446a56_39_dispatch_cosine_half_float_float_int_cu_c6ca332b_84076ignoreE
> Base _ZN4cuda3std3__477_GLOBAL__N__7fe93dd3_39_dispatch_l1_double_double_double_int_cu_c6ca332b_94446ignoreE
> Base _ZN4cuda3std3__477_GLOBAL__N__cd17b4f0_38_ivf_pq_compute_similarity_half_half_cu_c6ca332b_145436ignoreE
> Base _ZN4cuda3std3__478_GLOBAL__N__1d31cdb8_40_dispatch_cosine_float_float_float_int_cu_c6ca332b_81256ignoreE
> Base _ZN4cuda3std3__478_GLOBAL__N__860a2c90_39_ivf_pq_compute_similarity_float_half_cu_c6ca332b_146306ignoreE
> Base _ZN4cuda3std3__478_GLOBAL__N__f5a96e76_39_dispatch_l_inf_float_float_float_int_cu_c6ca332b_100656ignoreE
> Base _ZN4cuda3std3__479_GLOBAL__N__82676a98_40_ivf_pq_compute_similarity_float_float_cu_c6ca332b_146446ignoreE
> Base _ZN4cuda3std3__479_GLOBAL__N__9de1c87e_41_dispatch_canberra_half_float_float_int_cu_c6ca332b_75246ignoreE
1140,1174c1136,1170
< Base _ZN4cuda3std3__480_GLOBAL__N__286cc0e0_42_dispatch_canberra_float_float_float_int_cu_f77f5a7d_74866ignoreE
< Base _ZN4cuda3std3__481_GLOBAL__N__2958776b_42_dispatch_l_inf_double_double_double_int_cu_f77f5a7d_100516ignoreE
< Base _ZN4cuda3std3__481_GLOBAL__N__b5be8adb_42_ivf_pq_compute_similarity_half_fp8_true_cu_f77f5a7d_148466ignoreE
< Base _ZN4cuda3std3__481_GLOBAL__N__c9973895_43_dispatch_cosine_double_double_double_int_cu_f77f5a7d_83916ignoreE
< Base _ZN4cuda3std3__482_GLOBAL__N__3e7d4bb6_44_dispatch_l2_expanded_half_float_float_int_cu_f77f5a7d_95656ignoreE
< Base _ZN4cuda3std3__482_GLOBAL__N__4ef5a42d_43_dispatch_russel_rao_half_float_float_int_cu_f77f5a7d_103386ignoreE
< Base _ZN4cuda3std3__482_GLOBAL__N__63fa6094_43_ivf_pq_compute_similarity_float_fp8_true_cu_f77f5a7d_148946ignoreE
< Base _ZN4cuda3std3__482_GLOBAL__N__68c72ae4_43_ivf_pq_compute_similarity_half_fp8_false_cu_f77f5a7d_145336ignoreE
< Base _ZN4cuda3std3__482_GLOBAL__N__70fbd51c_44_dispatch_correlation_half_float_float_int_cu_f77f5a7d_79076ignoreE
< Base _ZN4cuda3std3__483_GLOBAL__N__0b4c14ac_45_dispatch_canberra_double_double_double_int_cu_f77f5a7d_77616ignoreE
< Base _ZN4cuda3std3__483_GLOBAL__N__8b3dda89_45_dispatch_correlation_float_float_float_int_cu_f77f5a7d_77656ignoreE
< Base _ZN4cuda3std3__483_GLOBAL__N__8e72320f_44_ivf_pq_compute_similarity_float_fp8_false_cu_f77f5a7d_148926ignoreE
< Base _ZN4cuda3std3__483_GLOBAL__N__bd7016e1_45_dispatch_l2_expanded_float_float_float_int_cu_f77f5a7d_95616ignoreE
< Base _ZN4cuda3std3__483_GLOBAL__N__daddd4c2_44_dispatch_russel_rao_float_float_float_int_cu_f77f5a7d_102746ignoreE
< Base _ZN4cuda3std3__484_GLOBAL__N__457e59db_46_dispatch_l2_unexpanded_half_float_float_int_cu_f77f5a7d_95636ignoreE
< Base _ZN4cuda3std3__484_GLOBAL__N__8002999e_46_dispatch_kl_divergence_half_float_float_int_cu_f77f5a7d_91186ignoreE
< Base _ZN4cuda3std3__485_GLOBAL__N__87097342_46_dispatch_lp_unexpanded_half_float_float_int_cu_f77f5a7d_100616ignoreE
< Base _ZN4cuda3std3__485_GLOBAL__N__887bc1c9_47_dispatch_kl_divergence_float_float_float_int_cu_f77f5a7d_89976ignoreE
< Base _ZN4cuda3std3__485_GLOBAL__N__8e080816_47_dispatch_l2_unexpanded_float_float_float_int_cu_f77f5a7d_95596ignoreE
< Base _ZN4cuda3std3__485_GLOBAL__N__d34b4a5d_47_dispatch_jensen_shannon_half_float_float_int_cu_f77f5a7d_87376ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__071954dc_47_dispatch_lp_unexpanded_float_float_float_int_cu_f77f5a7d_100546ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__8a451b10_48_dispatch_jensen_shannon_float_float_float_int_cu_f77f5a7d_87356ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__a24dd29f_47_dispatch_russel_rao_double_double_double_int_cu_f77f5a7d_103896ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__a5560fac_47_ivf_pq_compute_similarity_half_half_bitset64_cu_f77f5a7d_151716ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__c9ed40d8_48_dispatch_l2_expanded_double_double_double_int_cu_f77f5a7d_95696ignoreE
< Base _ZN4cuda3std3__486_GLOBAL__N__e00b7f19_48_dispatch_correlation_double_double_double_int_cu_f77f5a7d_79166ignoreE
< Base _ZN4cuda3std3__487_GLOBAL__N__66c626ed_48_ivf_pq_compute_similarity_float_half_bitset64_cu_f77f5a7d_151736ignoreE
< Base _ZN4cuda3std3__488_GLOBAL__N__431c20ac_49_ivf_pq_compute_similarity_float_float_bitset64_cu_f77f5a7d_152936ignoreE
< Base _ZN4cuda3std3__488_GLOBAL__N__a618dde5_50_dispatch_kl_divergence_double_double_double_int_cu_f77f5a7d_92046ignoreE
< Base _ZN4cuda3std3__488_GLOBAL__N__bb39acf1_49_dispatch_l2_expanded_float_float_float_int64_t_cu_f77f5a7d_110986ignoreE
< Base _ZN4cuda3std3__488_GLOBAL__N__f3f9e36f_50_dispatch_l2_unexpanded_double_double_double_int_cu_f77f5a7d_96826ignoreE
< Base _ZN4cuda3std3__489_GLOBAL__N__137fc2da_50_dispatch_lp_unexpanded_double_double_double_int_cu_f77f5a7d_100636ignoreE
< Base _ZN4cuda3std3__489_GLOBAL__N__5044fb7a_51_dispatch_hellinger_expanded_half_float_float_int_cu_f77f5a7d_87206ignoreE
< Base _ZN4cuda3std3__489_GLOBAL__N__81624d7d_51_dispatch_hamming_unexpanded_half_float_float_int_cu_f77f5a7d_83766ignoreE
< Base _ZN4cuda3std3__489_GLOBAL__N__9ef6e366_51_dispatch_jensen_shannon_double_double_double_int_cu_f77f5a7d_89956ignoreE
---
> Base _ZN4cuda3std3__480_GLOBAL__N__286cc0e0_42_dispatch_canberra_float_float_float_int_cu_c6ca332b_75206ignoreE
> Base _ZN4cuda3std3__481_GLOBAL__N__2958776b_42_dispatch_l_inf_double_double_double_int_cu_c6ca332b_101446ignoreE
> Base _ZN4cuda3std3__481_GLOBAL__N__b5be8adb_42_ivf_pq_compute_similarity_half_fp8_true_cu_c6ca332b_143396ignoreE
> Base _ZN4cuda3std3__481_GLOBAL__N__c9973895_43_dispatch_cosine_double_double_double_int_cu_c6ca332b_84156ignoreE
> Base _ZN4cuda3std3__482_GLOBAL__N__3e7d4bb6_44_dispatch_l2_expanded_half_float_float_int_cu_c6ca332b_95986ignoreE
> Base _ZN4cuda3std3__482_GLOBAL__N__4ef5a42d_43_dispatch_russel_rao_half_float_float_int_cu_c6ca332b_103626ignoreE
> Base _ZN4cuda3std3__482_GLOBAL__N__63fa6094_43_ivf_pq_compute_similarity_float_fp8_true_cu_c6ca332b_146566ignoreE
> Base _ZN4cuda3std3__482_GLOBAL__N__68c72ae4_43_ivf_pq_compute_similarity_half_fp8_false_cu_c6ca332b_143346ignoreE
> Base _ZN4cuda3std3__482_GLOBAL__N__70fbd51c_44_dispatch_correlation_half_float_float_int_cu_c6ca332b_78876ignoreE
> Base _ZN4cuda3std3__483_GLOBAL__N__0b4c14ac_45_dispatch_canberra_double_double_double_int_cu_c6ca332b_77966ignoreE
> Base _ZN4cuda3std3__483_GLOBAL__N__8b3dda89_45_dispatch_correlation_float_float_float_int_cu_c6ca332b_77986ignoreE
> Base _ZN4cuda3std3__483_GLOBAL__N__8e72320f_44_ivf_pq_compute_similarity_float_fp8_false_cu_c6ca332b_146546ignoreE
> Base _ZN4cuda3std3__483_GLOBAL__N__bd7016e1_45_dispatch_l2_expanded_float_float_float_int_cu_c6ca332b_95946ignoreE
> Base _ZN4cuda3std3__483_GLOBAL__N__daddd4c2_44_dispatch_russel_rao_float_float_float_int_cu_c6ca332b_101716ignoreE
> Base _ZN4cuda3std3__484_GLOBAL__N__457e59db_46_dispatch_l2_unexpanded_half_float_float_int_cu_c6ca332b_95966ignoreE
> Base _ZN4cuda3std3__484_GLOBAL__N__8002999e_46_dispatch_kl_divergence_half_float_float_int_cu_c6ca332b_90996ignoreE
> Base _ZN4cuda3std3__485_GLOBAL__N__87097342_46_dispatch_lp_unexpanded_half_float_float_int_cu_c6ca332b_101646ignoreE
> Base _ZN4cuda3std3__485_GLOBAL__N__887bc1c9_47_dispatch_kl_divergence_float_float_float_int_cu_c6ca332b_90926ignoreE
> Base _ZN4cuda3std3__485_GLOBAL__N__8e080816_47_dispatch_l2_unexpanded_float_float_float_int_cu_c6ca332b_95866ignoreE
> Base _ZN4cuda3std3__485_GLOBAL__N__d34b4a5d_47_dispatch_jensen_shannon_half_float_float_int_cu_c6ca332b_87856ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__071954dc_47_dispatch_lp_unexpanded_float_float_float_int_cu_c6ca332b_101496ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__8a451b10_48_dispatch_jensen_shannon_float_float_float_int_cu_c6ca332b_87836ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__a24dd29f_47_dispatch_russel_rao_double_double_double_int_cu_c6ca332b_104006ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__a5560fac_47_ivf_pq_compute_similarity_half_half_bitset64_cu_c6ca332b_149386ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__c9ed40d8_48_dispatch_l2_expanded_double_double_double_int_cu_c6ca332b_96056ignoreE
> Base _ZN4cuda3std3__486_GLOBAL__N__e00b7f19_48_dispatch_correlation_double_double_double_int_cu_c6ca332b_79716ignoreE
> Base _ZN4cuda3std3__487_GLOBAL__N__66c626ed_48_ivf_pq_compute_similarity_float_half_bitset64_cu_c6ca332b_149406ignoreE
> Base _ZN4cuda3std3__488_GLOBAL__N__431c20ac_49_ivf_pq_compute_similarity_float_float_bitset64_cu_c6ca332b_151056ignoreE
> Base _ZN4cuda3std3__488_GLOBAL__N__a618dde5_50_dispatch_kl_divergence_double_double_double_int_cu_c6ca332b_92096ignoreE
> Base _ZN4cuda3std3__488_GLOBAL__N__bb39acf1_49_dispatch_l2_expanded_float_float_float_int64_t_cu_c6ca332b_112186ignoreE
> Base _ZN4cuda3std3__488_GLOBAL__N__f3f9e36f_50_dispatch_l2_unexpanded_double_double_double_int_cu_c6ca332b_98786ignoreE
> Base _ZN4cuda3std3__489_GLOBAL__N__137fc2da_50_dispatch_lp_unexpanded_double_double_double_int_cu_c6ca332b_101626ignoreE
> Base _ZN4cuda3std3__489_GLOBAL__N__5044fb7a_51_dispatch_hellinger_expanded_half_float_float_int_cu_c6ca332b_87306ignoreE
> Base _ZN4cuda3std3__489_GLOBAL__N__81624d7d_51_dispatch_hamming_unexpanded_half_float_float_int_cu_c6ca332b_84126ignoreE
> Base _ZN4cuda3std3__489_GLOBAL__N__9ef6e366_51_dispatch_jensen_shannon_double_double_double_int_cu_c6ca332b_90906ignoreE
1178,1188c1174,1184
< Base _ZN4cuda3std3__490_GLOBAL__N__00f74b8c_51_ivf_pq_compute_similarity_half_fp8_true_bitset64_cu_f77f5a7d_151696ignoreE
< Base _ZN4cuda3std3__490_GLOBAL__N__2fcca1ca_52_dispatch_hellinger_expanded_float_float_float_int_cu_f77f5a7d_83826ignoreE
< Base _ZN4cuda3std3__490_GLOBAL__N__b17912df_52_dispatch_hamming_unexpanded_float_float_float_int_cu_f77f5a7d_83686ignoreE
< Base _ZN4cuda3std3__491_GLOBAL__N__67ef10ab_52_ivf_pq_compute_similarity_float_fp8_true_bitset64_cu_f77f5a7d_154976ignoreE
< Base _ZN4cuda3std3__491_GLOBAL__N__8c62390a_52_ivf_pq_compute_similarity_half_fp8_false_bitset64_cu_f77f5a7d_151576ignoreE
< Base _ZN4cuda3std3__491_GLOBAL__N__cdac08aa_52_dispatch_l2_expanded_double_double_double_int64_t_cu_f77f5a7d_110876ignoreE
< Base _ZN4cuda3std3__492_GLOBAL__N__290f943a_53_ivf_pq_compute_similarity_float_fp8_false_bitset64_cu_f77f5a7d_154676ignoreE
< Base _ZN4cuda3std3__493_GLOBAL__N__94a62e93_55_dispatch_hellinger_expanded_double_double_double_int_cu_f77f5a7d_87336ignoreE
< Base _ZN4cuda3std3__493_GLOBAL__N__faa8f5ee_55_dispatch_hamming_unexpanded_double_double_double_int_cu_f77f5a7d_83806ignoreE
< Base _ZN4cuda3std3__499_GLOBAL__N__37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_f77f5a7d_36306ignoreE
< Base _ZN4cuda3std3__499_GLOBAL__N__bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_f77f5a7d_36396ignoreE
---
> Base _ZN4cuda3std3__490_GLOBAL__N__00f74b8c_51_ivf_pq_compute_similarity_half_fp8_true_bitset64_cu_c6ca332b_149366ignoreE
> Base _ZN4cuda3std3__490_GLOBAL__N__2fcca1ca_52_dispatch_hellinger_expanded_float_float_float_int_cu_c6ca332b_84306ignoreE
> Base _ZN4cuda3std3__490_GLOBAL__N__b17912df_52_dispatch_hamming_unexpanded_float_float_float_int_cu_c6ca332b_83906ignoreE
> Base _ZN4cuda3std3__491_GLOBAL__N__67ef10ab_52_ivf_pq_compute_similarity_float_fp8_true_bitset64_cu_c6ca332b_152306ignoreE
> Base _ZN4cuda3std3__491_GLOBAL__N__8c62390a_52_ivf_pq_compute_similarity_half_fp8_false_bitset64_cu_c6ca332b_149086ignoreE
> Base _ZN4cuda3std3__491_GLOBAL__N__cdac08aa_52_dispatch_l2_expanded_double_double_double_int64_t_cu_c6ca332b_110106ignoreE
> Base _ZN4cuda3std3__492_GLOBAL__N__290f943a_53_ivf_pq_compute_similarity_float_fp8_false_bitset64_cu_c6ca332b_151086ignoreE
> Base _ZN4cuda3std3__493_GLOBAL__N__94a62e93_55_dispatch_hellinger_expanded_double_double_double_int_cu_c6ca332b_87356ignoreE
> Base _ZN4cuda3std3__493_GLOBAL__N__faa8f5ee_55_dispatch_hamming_unexpanded_double_double_double_int_cu_c6ca332b_84236ignoreE
> Base _ZN4cuda3std3__499_GLOBAL__N__37ad16a1_61_compute_distance_standard_L2Expanded_half_uint32_dim128_t8_cu_c6ca332b_37116ignoreE
> Base _ZN4cuda3std3__499_GLOBAL__N__bcf697e2_61_compute_distance_standard_L2Expanded_int8_uint32_dim128_t8_cu_c6ca332b_36716ignoreE
@rhdong rhdong added the feature request New feature or request label Jan 31, 2025
@rhdong rhdong assigned rhdong and unassigned rhdong Jan 31, 2025
@robertmaynard
Copy link
Contributor

Entirely expected as cuvs uses whole compilation for the src/neighbors/cagra_merge_* files. In whole compilation mode each TU is a separate CUDA module and you can't call functions across CUDA module boundaries. This means that these helper functions need to be embedded into each object file.

@rhdong
Copy link
Member Author

rhdong commented Feb 19, 2025

Hi @robertmaynard, thank you for the clarification, that really makes sense. I have a question: if I want to continue to reuse cagra::build in the merge API, what should I do? May I use the extern to invoke them in the *merge*.cu? Thanks a lot!

@robertmaynard
Copy link
Contributor

May I use the extern to invoke them in the merge.cu

in whole compilation mode you can't invoke kernels defined in other translation units. It isn't clear to me if the cagra::build is a host function or device kernel you want to launch. If it is a host side function you don't need to mark it as extern to call it.

@rhdong
Copy link
Member Author

rhdong commented Feb 19, 2025

Sorry for forgetting to paste a code link, that might be helpful to understand, and as you can see, the merge need call build API, and the build includes some cuda kernel. So, as my understanding, we still have two choices:

  1. mark build via extern
    or
  2. move the instantiations of the merge into *build*.cu

am I right?

@cjnolet
Copy link
Member

cjnolet commented Feb 20, 2025 via email

@rhdong
Copy link
Member Author

rhdong commented Feb 20, 2025

James, I think the correct way to handle this situation is to 1) try and call cagra build from the public API header wherever possible, and 2) if 1 isn’t at all possible, use existing instantiations where possible (and use extern templates). It’s most ideal when we can do 1. That would be the way to fix your issue. Sent from my iPhoneOn Feb 19, 2025, at 4:36 PM, rhdong @.***> wrote:
Sorry for forgetting to paste a code link, that might be helpful to understand, and as you can see, the merge need call build API, and the build includes some cuda kernel. So, as my understanding, we still have two chooses:

mark build via extern
or
move the instantiations of the merge into build.cu
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.***>

rhdong left a comment (rapidsai/cuvs#634)
Sorry for forgetting to paste a code link, that might be helpful to understand, and as you can see, the merge need call build API, and the build includes some cuda kernel. So, as my understanding, we still have two chooses:

mark build via extern
or
move the instantiations of the merge into build.cu

—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: @.***>
I got it! Thank you @robertmaynard @cjnolet , for your clarification. That's really helpful! Since I have called the build from a public header(like here). I will try to use extern in the next step, will back you when I have result. Thanks a lot!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants