Some resources I found helpful

Terminology

CUDAPTXNVIDIA hardwareOpenGLVulkan
thread block grid GPU dispatch global workgroup
thread block CTA SM (sometimes MP) work group (local) workgroup
warp warp scheduler? subgroup
thread CUDA core invocation/thread invocation