Questa è una vecchia versione del documento!
2h: Introduction to the course, structure of the lessons, hardware evolution and the urgency of parallel/distributed computing dictated by parallel hw (multicore CPUs, GPUs, clusters).
Assignment: look at top500.org and green500.org and try to figure out evolution of parallel machines through the “statistics” graphs in the lists.
2h: Examples of parallel execution of different kind of real life tasks:
How the tasks may be executed by a single person and by number of cooperating persons. Different way of executing tasks in parallel. Effect of “slow” persons recruited to execute a task. Figuring out general principles out of this: patterns, mechanisms, overheads, …
2h: Finding concurrency aspects:
2h: Mechanisms to implement parallel computations on multicores
(apologize I miss the last period of the lesson of the afternoon, due to my mistake)
Assigment: write a program using Cilk and C++ threads “translating” a book in parallel, where:
2h: Time and parallelism related measures
Assignment: write a program using C++ threads counting word occurrences in a text.
2h: Access to lab machines. More on mechanisms in C++ for concurrency.
2h: overheads and counter measures
Assignment: look for overheads in translate book and word count assignments.
2h: more on overheads (NUMA memory, false sharing). Introduction to vectorization (with sample code). Tools for checking sequential/vector code performance.
Assignment (1): single Jacobi iteration (as in the blackboard PDF). Create randomly generated matrix A and vectors B and X and run a single iteration of the Jacobi method verifying that the code is properly vectorized and measuring the increase in speed due to vectorization (optimization of the sequential code in general)
Assignment (2): consider a large matrix (4-8K rows/columns at least) and measure the improvement w.r.t. non optimized sequential computation of a single Jacobi iteration
Introduction to parallel patterns/skeletons
Assignment
Sample code
Parallel design patterns and algorithmic skeletons
Assignment: