Kurs : Performance Engineering on CPUs and GPUs | TÜBİTAK-ULAKBİM Açık Ders Platformu

Konu özeti

Konu seçin Overview

Overview

Hepsini daralt Hepsini genişlet
Skill Level: Intermediate
Language: English
Workload: 2 hours total
Topic: Performance Engineering on CPUs and GPUs
Overview: This lecture delves into performance engineering on CPUs and GPUs, focusing on optimizing computation and memory access patterns for high-performance computing applications.
Course Description: The lecture covers simple but key architectural concepts, including pipelining, memory hierarchies, and caches. On GPUs, the emphasis is placed coalesced memory access, shared memory usage, and addressing bank conflicts. By examining real-world examples such as matrix multiplication and dot product computations, the lecture provides practical insights into maximizing computational throughput and minimizing bottlenecks. This session equips participants with the foundational knowledge to design and implement efficient algorithms for modern multicore and manycore systems.
Course Contents:
Part 1: Introduction to CPUs
Part 2: Optimizing Memory Access Patterns for Performance
Part 3: Utilizing the Cache and Spatial-Temporal Locality
Part 4: Prefetching and Latency Mitigation
Part 5: Fundamentals of GPU Architecture and Programming
Part 6: Efficient Memory Accesses on GPUs
Part 7: Bank Conflicts and Shared Memory Performance on GPUs
Part 8: GPU Occupancy and Parallel Matrix Multiplication
Who Should Enroll: Anyone who thinks performance matters and want more for their codes.
Prerequisite: Experience with C++ and CUDA
Tools, libraries, frameworks used: g++, nvcc, perf
Learning Objectives: By participating in this course, you will learn:
simple things that can be useful to improve the performance on a CPU
simple things that can be useful to improve the performance on a GPU
About the instructor(s): Kamer Kaya is an Associate Professor at the Faculty of Engineering and Natural Sciences at Sabancı University. His research areas include high-performance computing, machine learning on sparse data, and graph algorithms.
- Duyurular etkinliğini seçin
  
  Duyurular Forum
Konu seçin Course Introduction

Course Introduction
- Course Introduction Slides etkinliğini seçin
  
  Course Introduction Slides Dosya
Konu seçin Part I

Part I
- Part I Slides etkinliğini seçin
  
  Part I Slides Dosya
Konu seçin Part II

Part II
- Part II Slides etkinliğini seçin
  
  Part II Slides Dosya
Konu seçin Part III

Part III
- Part III Slides etkinliğini seçin
  
  Part III Slides Dosya
Konu seçin Part IV

Part IV
- Part IV Slides etkinliğini seçin
  
  Part IV Slides Dosya
Konu seçin Part V

Part V
- Part V Slides etkinliğini seçin
  
  Part V Slides Dosya
Konu seçin Part VI

Part VI
- Part VI Slides etkinliğini seçin
  
  Part VI Slides Dosya
Konu seçin Part VII

Part VII
- Part VII Slides etkinliğini seçin
  
  Part VII Slides Dosya
Konu seçin Part VIII

Part VIII
- Part VIII Slides etkinliğini seçin
  
  Part VIII Slides Dosya
Konu seçin Codes

Codes
- CPU and GPU etkinliğini seçin
  
  CPU and GPU Dosya

Konu özeti

Overview

Course Introduction

Part I

Part II

Part III

Part IV

Part V

Part VI

Part VII

Part VIII

Codes