High performance computing is the basic technology needed to support today's large scale scientific simulations. It covers a wide variety of issues on hardware and software for high-end computing such as high speed computation, high speed networking, large scale memory and disk storage, high speed numerical algorithms, programming schemes and the system softwares to support them. Current advanced supercomputer systems are based on large scale parallel processing systems. Nowadays, even application users are required to understand these technologies to a certain level for their effective utilization. In this class, we focus on the basic technology of high-end computing systems, programming, algorithm and performance tuning for application users who aim to use these systems for their practical simulation and computing.
Lecture Day: | February 21 (Wed), 22 (Thu), 2024 |
---|---|
Location: | Online (Zoom link will be sent by email.) |
Notice: | This intensive course will also be held as Korea-Japan HPC Winter School 2023. |
Feb. 21 (Wed) | Feb. 22 (Thu) | |
09:00 - 10:30 | Fundamentals of HPC and Parallel Processing | Parallel Numerical Algorithm 1 |
10:45 - 12:15 | Parallel Processing Systems | Parallel Numerical Algorithm 2 |
13:30 - 15:00 | Parallel Programming 1: OpenMP | Computation Optimization |
15:15 - 16:45 | Parallel Programming 2: MPI | GPU Computing |
Lecture name | Contents | Instructor | |
---|---|---|---|
1 | Fundamentals of HPC and Parallel Processing | Amdahl's law, Parallelization methods (EP, Data parallelism, Pipeline parallelism), Communication, Synchronization, Parallelization efficiency, Load balance. | Taisuke Boku |
2 | Parallel Processing Systems | Parallel processing systems (SMP, NUMA, Cluster, Grid, etc.), Memory hierarchy, Memory bandwidth, Network, Communication bandwidth, Delay. | Ryohei Kobayashi |
3 | Parallel Programming 1: OpenMP | Parallel programming model, parallel programming language OpenMP. | Akira Nukada |
4 | Parallel Programming 2: MPI | Parallel programming language MPI. | Norihisa Fujita |
5 | Parallel Numerical Algorithm 1 | Krylov subspace iterative methods and their parallelization methods. | Hiroto Tadano |
6 | Parallel Numerical Algorithm 2 | Fast Fourier Transformation (FFT) and its parallelization methods. | Daisuke Takahashi |
7 | Computation Optimization | Program optimization techniques (Register blocking, Cache blocking, Memory allocation, etc.) and performance evaluation on a compute node of parallel processing systems. | Daisuke Takahashi |
8 | GPU Computing | Introduction of GPU architecture and GPU programming. | Akira Nukada |