Thanks for the great blogpost and kernels.
In the column-major ordering:
grid_m = tl.cdiv(m, block_m)
grid_n = tl.cdiv(n, block_n)
pid_m = (pid % grid_n)
pid_n = pid // grid_m
Why is pid_m = (pid % grid_n)? grid_m is the leading dimension (number of block rows), so should it be pid_m = pid % grid_m? Apologies if I'm misunderstanding the issue.
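To make the concern concrete, here is a small plain-Python sketch (no Triton required) that enumerates the tile each pid would land on under both choices of modulus. The helper names `cdiv`, `column_major_tiles`, and `covers_grid` are hypothetical and only for illustration; `cdiv` is assumed to behave like `tl.cdiv` (ceiling division), and the problem sizes are made up.

```python
def cdiv(a, b):
    """Ceiling division, assumed to mirror tl.cdiv."""
    return (a + b - 1) // b


def column_major_tiles(m, n, block_m, block_n, row_modulus):
    """Map every pid to (pid_m, pid_n), using the given modulus for pid_m.

    row_modulus is "grid_n" (as in the snippet above) or "grid_m"
    (the variant proposed in this issue).
    """
    grid_m = cdiv(m, block_m)
    grid_n = cdiv(n, block_n)
    mod = grid_m if row_modulus == "grid_m" else grid_n
    tiles = [(pid % mod, pid // grid_m) for pid in range(grid_m * grid_n)]
    return tiles, grid_m, grid_n


def covers_grid(tiles, grid_m, grid_n):
    """True if every (row, col) tile is visited exactly once."""
    expected = {(i, j) for i in range(grid_m) for j in range(grid_n)}
    return len(tiles) == len(expected) and set(tiles) == expected


# Non-square grid: 4 block rows, 2 block columns.
for modulus in ("grid_n", "grid_m"):
    tiles, gm, gn = column_major_tiles(256, 128, 64, 64, modulus)
    print(modulus, covers_grid(tiles, gm, gn), tiles)
```

If this sketch matches what the kernel does, the two variants agree whenever grid_m == grid_n, which could explain why the difference does not show up on square problem sizes.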