-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#18332: Fix BN hang for FPU Kernel #18634
Conversation
1b87bcf
to
f08a501
Compare
9ff289e
to
57c18ab
Compare
f08a501
to
4bc2169
Compare
57c18ab
to
75ec194
Compare
4bc2169
to
adfebf2
Compare
tile_regs_commit(); | ||
|
||
tile_regs_wait(); | ||
pack_tile_with_dt(dst0, cb_affine_or_out); | ||
pack_tile_with_dt(0, cb_affine_or_out); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor: Why was this change needed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dst0 is also 0. the value is same. Its just that for binary_dest_reuse_tiles
I passed 0 and I followed the same here.
75ec194
to
736407b
Compare
adfebf2
to
dbb139b
Compare
dbb139b
to
c4d5a9c
Compare
c4d5a9c
to
dc3aed0
Compare
Ticket
#18332
Problem description
Batch norm kernel hangs for larger shapes. This was because the buffer size is set to 2. The kernel reserves
freq
number of tiles. So when this gets more than 2, there is an hang as it tries to reserve more than the size. Hence unpack and math threads got hung at thecopy_tile
call while the pack thread was stuck atcb_reserve_back
.What's changed
freq
loops in the kernel to ensure that as each tile gets reserved, they also get popped before the next tile gets reservedcb_num
buffer as I have rearranged the computationChecklist