5. Numba for HSA APUsΒΆ 5.1. Overview 5.1.1. Terminology 5.1.2. Requirements 5.1.3. Installation 5.2. Writing HSA Kernels 5.2.1. Introduction 5.2.2. Introduction for CUDA Programmers 5.2.3. Kernel declaration 5.2.4. Kernel invocation 5.2.4.1. Choosing the workgroup size 5.2.4.2. Multi-dimensional workgroup and grid 5.2.5. WorkItem positioning 5.3. Memory management 5.3.1. Shared memory and thread synchronization 5.4. Writing Device Functions 5.5. Supported Atomic Operations 5.5.1. Example 5.6. Device management 5.7. Examples 5.7.1. Matrix multiplication