ATPESC 2025 hands-on I/O exercises and reference material

This is the documentation for the hands-on exercises in the "Track 7: Data Intensive Computing and I/O" portion of ATPESC 2025. Agenda information can be found here:

ATPESC 2025 Track 7 agenda

We will describe these exercises in greater detail during the ATPESC lectures and provide hands-on support via the #io channel in Slack. Feel free to ask us questions about your own code as well.

Attendees will need a laptop with a working web browser and an SSH client. For the purposes of this tutorial, all exercises can be performed from an SSH terminal.

Reservations and file storage locations

ATPESC 2025 attendees will have access to reservations on both the Aurora and Polaris systems from 9am to 9pm CT to execute hands-on exercises as part of the I/O track:

  • Aurora (preferred): 9AM-9PM CT, 512 nodes, queue: ATPESC
  • Polaris: 9AM-9PM CT, 300 nodes, queue: ATPESC

Submit your jobs to our reservation using the ATPESC2025 allocation and the ATPESC queue (the -A ATPESC2025 and -q ATPESC options, respectively, in your job script or on the qsub command line).
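
For example, submitting one of the hands-on job scripts from the command line might look like this (assuming the helloworld job script described below; the provided scripts may already set the allocation and queue for you):

$ qsub -A ATPESC2025 -q ATPESC ./helloworld-aurora.qsub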

You can store data in the /flare/ATPESC2025/ directory on Aurora or the /eagle/ATPESC2025/ directory on Polaris. Please create subdirectories based on your username to avoid conflicts with other ATPESC attendees. Users on Aurora also have access to a DAOS storage pool, which will be covered during the DAOS session.

Note that Aurora and Polaris use separate file systems, so if you wish to move data between the two you must use Globus or scp to copy data.

Initial setup (ALCF Aurora)

Aurora is the preferred platform for the ATPESC 2025 exercises, but you are also welcome to use Polaris; note that DAOS is only available on Aurora.

  • Confirm account access if you haven't already (see presenters for details)
  • Log on to Aurora
  • Download the tutorial materials to your home directory.
    • mkdir atpesc-io
    • cd atpesc-io
    • git clone https://github.com/radix-io/hands-on.git
    • cd hands-on
  • Set up your environment to have access to the utilities needed for the hands-on exercises
    • source ./aurora-setup-env.sh

Initial setup (ALCF Polaris)

Aurora is the preferred platform for the ATPESC 2025 exercises, but you are also welcome to use Polaris; note that DAOS is only available on Aurora.

  • Confirm account access if you haven't already (see presenters for details)
  • Log on to Polaris
  • Download the tutorial materials to your home directory.
    • mkdir atpesc-io
    • cd atpesc-io
    • git clone https://github.com/radix-io/hands-on.git
    • cd hands-on
  • Set up your environment to have access to the utilities needed for the hands-on exercises
    • source ./polaris-setup-env.sh

Darshan

Running hands-on example programs

All Darshan hands-on examples are set up for use on the Aurora (ALCF) system. See the Aurora setup instructions above to configure your baseline environment.

  • Compile example programs and submit them to the job queue (see below for details on specific example programs)
    • mpicc <exampleprogram>.c -o <exampleprogram>
    • qsub ./<exampleprogram>-aurora.qsub
  • Check the queue to see when your jobs complete
    • qstat | grep <username>
  • Look for log files in /lus/flare/logs/darshan/aurora/2025/8/8/<username>* (or whatever the current day is in UTC)
    • Copy log files to your home directory
  • Use the PyDarshan job summary tool or darshan-parser to investigate Darshan characterization data
    • The python -m darshan summary <log_path> command will produce *.html files with an analysis summary
    • You can use scp to copy these to your laptop to view them in a browser

Hands-on exercise: helloworld

The hands-on material includes an example application called helloworld. Compile it, run it, and generate the Darshan job summary following the instructions above. How many files did the application open? How much data did it read, and how much data did it write? What approximate I/O performance did it achieve?

Hands-on exercise: warpdrive

NOTE: this exercise is best done some time after the MPI-IO and/or performance tuning presentations. It requires diagnosis of I/O performance problems to complete.

The hands-on material includes an example application called warpdrive. There are two versions of this application: warpdriveA and warpdriveB. Both of them do the same amount of I/O from each process, but one of them performs better than the other. Which one has the fastest I/O? Why?

Hands-on exercise: fidgetspinner

NOTE: this exercise is best done some time after the MPI-IO and/or performance tuning presentations. It requires diagnosis of I/O performance problems to complete.

The hands-on material includes an example application called fidgetspinner. There are two versions of this application: fidgetspinnerA and fidgetspinnerB. Both of them do the same amount of I/O from each process, but one of them performs better than the other. Which one has the fastest I/O? Why?

hello-io

This example is a quick introduction both to the MPI-IO interface and to how to run programs on these machines. It demonstrates basic MPI datatype usage to describe noncontiguous accesses in both memory and file.
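
A minimal sketch of that pattern follows (this is not the hello-io source itself, and for brevity it only makes the file access noncontiguous): a vector datatype describes a strided layout in the file, and each rank writes its buffer into that layout with a collective call.

/* Hedged sketch, not the actual hello-io program: each rank owns a
 * contiguous 8-int buffer and writes it into an interleaved (strided,
 * noncontiguous) region of a shared file using an MPI vector datatype. */
#include <mpi.h>

int main(int argc, char **argv)
{
    int rank, nprocs, buf[8];
    MPI_File fh;
    MPI_Datatype filetype;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);
    for (int i = 0; i < 8; i++)
        buf[i] = rank * 100 + i;

    /* file layout: 4 blocks of 2 ints each, strided so ranks interleave */
    MPI_Type_vector(4, 2, 2 * nprocs, MPI_INT, &filetype);
    MPI_Type_commit(&filetype);

    MPI_File_open(MPI_COMM_WORLD, "hello-io.out",
                  MPI_MODE_CREATE | MPI_MODE_WRONLY, MPI_INFO_NULL, &fh);
    /* each rank starts its view 2 ints further into the file */
    MPI_File_set_view(fh, (MPI_Offset)(rank * 2 * sizeof(int)), MPI_INT,
                      filetype, "native", MPI_INFO_NULL);
    MPI_File_write_all(fh, buf, 8, MPI_INT, MPI_STATUS_IGNORE);

    MPI_File_close(&fh);
    MPI_Type_free(&filetype);
    MPI_Finalize();
    return 0;
}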

Array

The presentation will walk you through several interfaces for writing an array to a file. We have provided you with some skeleton code which you can build upon during the lecture. If you get stuck you can find complete examples in the solutions directory.
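
As a reference point before the lecture, here is a hedged sketch of the plainest MPI-IO variant, assuming a 1-D integer array split evenly across ranks (the skeleton code in this repo may be organized differently): each rank computes its byte offset and issues a collective write.

/* Hedged sketch, not the provided skeleton: each rank writes its slice of
 * a 1-D array at an explicit offset using a collective MPI-IO call. */
#include <mpi.h>

#define LOCAL_N 1024   /* elements owned by each rank (made-up size) */

int main(int argc, char **argv)
{
    int rank, local[LOCAL_N];
    MPI_File fh;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    for (int i = 0; i < LOCAL_N; i++)
        local[i] = rank;

    MPI_File_open(MPI_COMM_WORLD, "array.out",
                  MPI_MODE_CREATE | MPI_MODE_WRONLY, MPI_INFO_NULL, &fh);
    MPI_Offset offset = (MPI_Offset)rank * LOCAL_N * sizeof(int);
    MPI_File_write_at_all(fh, offset, local, LOCAL_N, MPI_INT,
                          MPI_STATUS_IGNORE);
    MPI_File_close(&fh);

    MPI_Finalize();
    return 0;
}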

IOR

IOR has a lot of command-line arguments. I have included the job submission scripts I used for the talk in the 'ior' directory.

Variance

The variance subdirectory contains a hands-on example to illustrate the kind of variance you can expect from each job run in terms of I/O performance. To execute it:

  • cc variance.c -o variance
  • qsub variance.qsub

If you look in variance.qsub you will see that the job script executes the same program five times; each run displays the elapsed time of the I/O routine (a sketch of the timing pattern follows the questions below).

  • What was the slowest time?
  • What was the fastest time?
  • What was the average time?
  • This is a small example program. Do you think the variance will improve or get worse with a larger example? What strategies might help improve performance in the example code?
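
A hedged sketch of that timing pattern, assuming an MPI program (the actual variance.c may be structured differently):

/* Hedged sketch of timing an I/O routine, not the actual variance.c:
 * barrier, time the I/O, barrier again so the slowest rank is included. */
#include <mpi.h>
#include <stdio.h>

void report_io_time(void (*do_io)(void))
{
    int rank;
    double t0, elapsed;

    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Barrier(MPI_COMM_WORLD);    /* line all ranks up first        */
    t0 = MPI_Wtime();
    do_io();                        /* the I/O routine being measured */
    MPI_Barrier(MPI_COMM_WORLD);    /* wait for the slowest rank      */
    elapsed = MPI_Wtime() - t0;

    if (rank == 0)
        printf("elapsed I/O time: %f seconds\n", elapsed);
}

static void dummy_io(void) { /* stand-in for the real I/O routine */ }

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    report_io_time(dummy_io);
    MPI_Finalize();
    return 0;
}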

Parallel-NetCDF

You can install Parallel-NetCDF on your laptop easily enough if you already have MPI installed. On Polaris or Aurora, the setup script will load the necessary modules. The Parallel-NetCDF project has a QuickTutorial outlining several different ways one can do I/O with Parallel-NetCDF. We'll also explore attributes.

Hands-on exercise: comparing I/O approaches

  • The QuickTutorial has links to code and some brief discussions about what the examples are trying to demonstrate.
  • Following the "Real parallel I/O on shared files" example, build and run 'pnetcdf-write-standard' to create a (tiny) Parallel-NetCDF dataset.
  • Look at a Darshan job summary.
  • Next, follow the "Non-blocking interface" example to create another (tiny) Parallel-NetCDF dataset.
  • Compare the Darshan job summary of this approach. What's different between the two? (A sketch contrasting the two call styles appears after this list.)
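
For orientation, a hedged sketch of the two call styles being compared (not taken from the QuickTutorial sources; the surrounding file and variable setup is omitted):

/* Hedged sketch of the two Parallel-NetCDF write styles, not the actual
 * QuickTutorial examples. */
#include <mpi.h>
#include <pnetcdf.h>

void write_blocking(int ncid, int varid,
                    MPI_Offset *start, MPI_Offset *count, int *buf)
{
    /* "real parallel I/O on shared files": a blocking collective write */
    ncmpi_put_vara_int_all(ncid, varid, start, count, buf);
}

void write_nonblocking(int ncid, int varid,
                       MPI_Offset *start, MPI_Offset *count, int *buf)
{
    int req, status;

    /* non-blocking interface: post the write now...                  */
    ncmpi_iput_vara_int(ncid, varid, start, count, buf, &req);
    /* ...post more iput calls for other variables here if you like...*/

    /* ...then complete everything in one batched, collective flush   */
    ncmpi_wait_all(ncid, 1, &req, &status);
}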

Hands-on exercise: using attributes

  • Write a simple Parallel-NetCDF program that puts your name as a global attribute on the data set. You won't need to define any dimensions or variables. (If you need to cheat, look at the example C files above or at the sketch following this list.)
  • What happens if you define different attributes on different processors?
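
A minimal hedged sketch of the first part of this exercise, using made-up file and attribute names:

/* Hedged sketch: create a dataset that contains nothing but one global
 * text attribute. File and attribute names here are made up. */
#include <mpi.h>
#include <pnetcdf.h>
#include <string.h>

int main(int argc, char **argv)
{
    int ncid;
    const char *name = "Your Name Here";

    MPI_Init(&argc, &argv);

    ncmpi_create(MPI_COMM_WORLD, "attr.nc", NC_CLOBBER, MPI_INFO_NULL, &ncid);
    /* global attributes hang off NC_GLOBAL; no dimensions or variables
     * are needed */
    ncmpi_put_att_text(ncid, NC_GLOBAL, "author", strlen(name), name);
    ncmpi_enddef(ncid);
    ncmpi_close(ncid);

    MPI_Finalize();
    return 0;
}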

HDF5

Lots of sample code to draw from at https://github.com/HDFGroup/hdf5-examples/
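
If you have not written parallel HDF5 before, the rough shape of a collective write looks something like the sketch below (assuming an MPI-enabled HDF5 build and an even 1-D decomposition; file and dataset names are made up):

/* Hedged sketch of a collective parallel HDF5 write, assuming an even 1-D
 * decomposition; not taken from the hdf5-examples repository. */
#include <hdf5.h>
#include <mpi.h>

#define LOCAL_N 1024   /* elements owned by each rank (made-up size) */

int main(int argc, char **argv)
{
    int rank, nprocs, buf[LOCAL_N];

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);
    for (int i = 0; i < LOCAL_N; i++)
        buf[i] = rank;

    /* open the file collectively over MPI-IO */
    hid_t fapl = H5Pcreate(H5P_FILE_ACCESS);
    H5Pset_fapl_mpio(fapl, MPI_COMM_WORLD, MPI_INFO_NULL);
    hid_t file = H5Fcreate("example.h5", H5F_ACC_TRUNC, H5P_DEFAULT, fapl);

    /* one shared dataset; each rank selects its own hyperslab */
    hsize_t dims[1]  = { (hsize_t)nprocs * LOCAL_N };
    hsize_t start[1] = { (hsize_t)rank * LOCAL_N };
    hsize_t count[1] = { LOCAL_N };
    hid_t filespace = H5Screate_simple(1, dims, NULL);
    hid_t dset = H5Dcreate2(file, "data", H5T_NATIVE_INT, filespace,
                            H5P_DEFAULT, H5P_DEFAULT, H5P_DEFAULT);
    H5Sselect_hyperslab(filespace, H5S_SELECT_SET, start, NULL, count, NULL);
    hid_t memspace = H5Screate_simple(1, count, NULL);

    /* collective data transfer */
    hid_t dxpl = H5Pcreate(H5P_DATASET_XFER);
    H5Pset_dxpl_mpio(dxpl, H5FD_MPIO_COLLECTIVE);
    H5Dwrite(dset, H5T_NATIVE_INT, memspace, filespace, dxpl, buf);

    H5Pclose(dxpl); H5Sclose(memspace); H5Sclose(filespace);
    H5Dclose(dset); H5Pclose(fapl); H5Fclose(file);
    MPI_Finalize();
    return 0;
}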

Game of Life

We have provided a Game of Life program if you want to experiment with I/O and do not already have a program handy.

Building Notes

For Aurora, configure might pick up the right MPI libraries automatically, but if it does not you can explicitly set the MPICC and MPIF77 environment variables.

$ module add cray-parallel-netcdf
$ configure --with-pnetcdf=$PARALLEL_NETCDF_DIR \
     MPICC=cc MPIF77=ftn --host=x86_64-unknown-linux-gnu

For another example, I've built and installed MPICH and Parallel-NetCDF into my home directory on my laptop. The command might look something like this:

configure --with-mpi=${HOME}/work/soft/mpich/bin/ \
    --with-pnetcdf=${HOME}/work/soft/pnetcdf

The "game of life" lives in the examples/life directory:

$ cd examples/life
$ make mlife-mpiio mlife-pnetcdf

Execution

The mlife example takes a few arguments:

  • -x X, where X is the number of columns
  • -y Y, where Y is the number of rows
  • -i I, where I is the number of iterations to run
  • -r R, where R is the iteration number to restart from
  • -p PATH, where PATH is the prefix mlife will put on the checkpoint files

To give you an idea of how big a problem size to use, here are some run times for problem sizes on a few machines I have at hand:

  • laptop: mpiexec -np 4 ./mlife-mpiio -x 5000 -y 5000 -i 10 takes about 10 seconds.
  • Blue Gene/Q: qsub -A radix-io -t 10 -n 128 --mode c16 ./mlife-pnetcdf -x 5000 -y 5000 -i 10 -p /projects/radix-io/robl/ takes about 13 seconds.
  • Theta: ./mlife-pnetcdf -x 5000 -y 5000 -i 10 -p /projects/radix-io/robl/mlife takes about 17 seconds -- don't forget to increase the default stripe size of your destination directory!

Project Ideas

  • Run the MPI-IO and Parallel-NetCDF versions, then use Darshan to observe any differences in behavior
  • Rewrite the Parallel-NetCDF version of the Game of Life to dump out all checkpoints to a single dataset
  • Write an MLIFEIO implementation that uses HDF5
  • Experiment with Lustre stripe sizes on Theta. When is a stripe width of 1 a good idea?

Sparse Matrix I/O

A more sophisticated I/O example demonstrating non-contiguous I/O and abstraction layers. Writing a sparse matrix to a file as an N-dimensional array can be wasteful for a very sparse matrix. This example uses Compressed Sparse Row (CSR) representation to write out the file using MPI-IO routines and optimizations.
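
As a rough picture of that representation (field names here are made up rather than taken from the example's source):

/* Hedged sketch of a per-process CSR block; field names are made up and
 * not taken from the example's source. */
struct csr_block {
    int        nrows;     /* rows owned by this process              */
    long long  nnz;       /* nonzeros owned by this process          */
    long long *row_ptr;   /* nrows+1 offsets into col_idx[] / val[]  */
    int       *col_idx;   /* column index of each nonzero            */
    double    *val;       /* the nonzero values themselves           */
};

/* Each rank can then compute its file offsets from a prefix sum of nnz
 * across ranks (for example with MPI_Exscan) and write row_ptr, col_idx,
 * and val with collective MPI-IO calls such as MPI_File_write_at_all. */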

Project Ideas

  • The interface currently writes with MPI-IO. Update it to use Parallel-NetCDF or HDF5.
