Skip to content

Commit fccdcb8

Browse files
authored
Merge pull request #159 from chpc-tech-eval/tut4/release
Tut4/release
2 parents a37d029 + 1ceba8b commit fccdcb8

18 files changed

+1369
-36
lines changed

README.md

+58-31
Original file line numberDiff line numberDiff line change
@@ -198,8 +198,7 @@ You will be evaluated on your overall progress in the tutorials. Below you will
198198
1. [WirGuard VPN Cluster Access](tutorial2/README.md#wirguard-vpn-cluster-access)
199199
1. [ZeroTier](tutorial2/README.md#zerotier)
200200

201-
**Tutorial 3** will demonstrate how to configure, build, compile and install a number of applications. You will also be building these applications with different tools. Finally, you will learn how to run applications across your cluster.
202-
201+
**Tutorial 3** will demonstrate how to configure, build, compile and install a number of various system software and applications. You will also be building these applications with different tools. Finally, you will learn how to run applications across your cluster.
203202

204203
1. [Checklist](tutorial3/README.md#checklist)
205204
1. [Managing Your Environment](tutorial3/README.md#managing-your-environment)
@@ -221,39 +220,67 @@ You will be evaluated on your overall progress in the tutorials. Below you will
221220
1. [Application Benchmarks and System Evaluation](tutorial3/README.md#application-benchmarks-and-system-evaluation)
222221
1. [GROMACS (ADH Cubic)](tutorial3/README.md#gromacs-adh-cubic)
223222
1. [LAMMPS (Lennard-Jones)](tutorial3/README.md#lammps-lennard-jones)
224-
1. [Qiskit (Quantum Volume)](tutorial3/README.md#qiskit-quantum-volume)
225-
226-
223+
1. [Qiskit (Quantum Volume)](tutorial3/README.md#qiskit-quantum-volume**
224+
225+
**Tutorial 4** demonstrates how to configure docker containers to deploy a monitoring stack, comprising of a metrics database service, an exporting / scraping service and a metric visualization services. You will then learn the very basics of how to visualize and interpret data. You will then learn how to automate the deployment of your Sebowa OpenStack infrastructure. Lastly, you'll deploy a scheduler and submit a job to it.
226+
227+
1. [Checklist](tutorial4/README.md#checklist)
228+
1. [Cluster Monitoring](tutorial4/README.md#cluster-monitoring)
229+
1. [Install Docker Engine, Containerd and Docker Compose](tutorial4/README.md#install-docker-engine-containerd-and-docker-compose)
230+
1. [Installing your Monitoring Stack](tutorial4/README.md#installing-your-monitoring-stack)
231+
1. [Startup and Test the Monitoring Services](tutorial4/README.md#startup-and-test-the-monitoring-services)
232+
1. [SSH Port Local Forwarding Tunnel](tutorial4/README.md#ssh-port-local-forwarding-tunnel)
233+
1. [Create a Dashboard in Grafana](tutorial4/README.md#create-a-dashboard-in-grafana)
234+
1. [Success State, Next Steps and Troubleshooting](tutorial4/README.md#success-state-next-steps-and-troubleshooting)
235+
1. [Configuring and Connecting to your Remote JupyterLab Server](tutorial4/README.md#configuring-and-connecting-to-your-remote-jupyterlab-server)
236+
1. [Visualize Your HPL Benchmark Results](tutorial4/README.md#visualize-your-hpl-benchmark-results)
237+
1. [Visualize Your Qiskit Results](tutorial4/README.md#visualize-your-qiskit-results)
238+
1. [Automating the Deployment of your OpenStack Instances Using Terraform](tutorial4/README.md#automating-the-deployment-of-your-openstack-instances-using-terraform)
239+
1. [Install and Initialize Terraform](tutorial4/README.md#install-and-initialize-terraform)
240+
1. [Generate `clouds.yml` and `main.tf` Files](tutorial4/README.md#generate-cloudsyml-and-maintf-files)
241+
1. [Generate, Deploy and Apply Terraform Plan](tutorial4/README.md#generate-deploy-and-apply-terraform-plan)
242+
1. [Continuous Integration Using CircleCI](tutorial4/README.md#continuous-integration-using-circleci)
243+
1. [Prepare GitHub Repository](tutorial4/README.md#prepare-github-repository)
244+
1. [Reuse `providers.tf` and `main.tf` Terraform Configurations](tutorial4/README.md#reuse-providerstf-and-maintf-terraform-configurations)
245+
1. [Create `.circleci/config.yml` File and `push` Project to GitHub](tutorial4/README.md#create-circleciconfigyml-file-and-push-project-to-github)
246+
1. [Create CircleCI Account and Add Project](tutorial4/README.md#create-circleci-account-and-add-project)
247+
1. [Slurm Scheduler and Workload Manager](tutorial4/README.md#slurm-scheduler-and-workload-manager)
248+
1. [Prerequisites](tutorial4/README.md#prerequisites)
249+
1. [Head Node Configuration (Server)](tutorial4/README.md#head-node-configuration-server)
250+
1. [Compute Node Configuration (Clients)](tutorial4/README.md#compute-node-configuration-clients)
251+
1. [GROMACS Application Benchmark](tutorial4/README.md#gromacs-application-benchmark)
252+
1. [Protein Visualization](tutorial4/README.md#protein-visualization)
253+
1. [Benchmark 2 (1.5M Water)](tutorial4/README.md#benchmark-2-15m-water)
227254
# Lecture Recordings
228255

229256
In this section you will finds links to all of the livestreams of the lectures (Teams Meetings) and subsequent recordings for you to refer back to.
230257

231-
1. Day 1 - Welcome, Introduction and Getting Started
232-
* **08:30 - 09:30**: [Welcome and Introduction](https://youtu.be/T68pHDdZW3s)
233-
* **09:30 - 10:30**: [Getting Started](https://youtu.be/cIB9Gtpifpk)
234-
* **11:00 - 12:00**: [Intro to Linux & Services](https://youtu.be/pbpjAw9eVcM)
235-
* **12:00 - 13:30**: [Intro to Tools for the Modern IT Engineer](https://youtu.be/t4hD6NMw9Xg)
236-
237-
2. Day 2 - HPC Hardware, HPC Networking and Systems Administration
238-
* **08:30 - 09:30**: [HPC Hardware]()
239-
* **09:30 - 10:30**: [HPC Networking]()
240-
* **11:00 - 12:00**: [System Administration & Configuration Management]()
241-
* **12:00 - 13:30**: [Standing up a Compute Node]()
242-
243-
3. Day 3 - Benchmarking, Compilation and Parallel Computing
244-
* **08:30 - 09:30**: [HPC Benchmarking]()
245-
* **09:30 - 10:30**: [Code Compilation]()
246-
* **11:00 - 12:00**: [Parallel Computing and Intro to QC]()
247-
* **12:00 - 13:30**: [Applications: LAMMPS & Qiskit]()
248-
249-
4. Day 4 - HPC Administration and Application Visualization
250-
* **08:30 - 09:30**: [Cluster Admin, Ansible & Containers]()
251-
* **09:30 - 10:30**: [Monitoring]()
252-
* **11:00 - 12:00**: [Schedulers]()
253-
* **12:00 - 13:30**: [Data Visualization & Jupyter Lab]()
254-
255-
5. Day 5 - Career Guidance
256-
* **08:30 - 10:30** [HPC Career Panel]()
258+
1. Welcome, Introduction and Getting Started
259+
* [Welcome and Introduction](https://youtu.be/T68pHDdZW3s)
260+
* [Getting Started](https://youtu.be/cIB9Gtpifpk)
261+
* [Intro to Linux & Services](https://youtu.be/pbpjAw9eVcM)
262+
* [Intro to Tools for the Modern IT Engineer](https://youtu.be/t4hD6NMw9Xg)
263+
264+
2. HPC Hardware, HPC Networking and Systems Administration
265+
* [HPC Hardware](https://youtu.be/gkZCrdTuvrc)
266+
* [HPC Networking](https://youtu.be/pGzQutsk2-A)
267+
* [System Administration & Configuration Management](https://youtu.be/YY6D-0e5iHw)
268+
* [Standing up a Compute Node](https://youtu.be/V6bx86qJwHE)
269+
270+
3. Benchmarking, Compilation and Parallel Computing
271+
* [HPC Benchmarking]()
272+
* [Code Compilation]()
273+
* [Parallel Computing and Intro to QC]()
274+
* [Applications: LAMMPS & Qiskit]()
275+
276+
4. Administration and Application Visualization
277+
* [Cluster Admin, Ansible & Containers]
278+
* [Monitoring]()
279+
* [Schedulers]
280+
* [Data Visualization & Jupyter Lab]
281+
282+
5. Career Guidance
283+
* [HPC Career Panel]()
257284

258285
# Contributing to the Project
259286

tutorial3/README.md

+4-5
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Student Cluster Competition - Tutorial 3
1+
Student Cluster Competition - Tutorial 3
22

33
## Table of Contents
44

@@ -43,7 +43,7 @@ In this tutorial you will:
4343
- [ ] Understand the pros and cons of compiling libraries from source.
4444
- [ ] Install and make use of Intel's oneAPI framework to run HPL.
4545
- [ ] Understand theoretical system peak performance.
46-
- [ ] Appreciate the significance of the Top500 list and bencmarking.
46+
- [ ] Appreciate the significance of the Top500 list and benchmarking.
4747
- [ ] Standup and Configure a Second Compute Node, and running applications across a cluster.
4848
- [ ] Download and compile the High Performance Computing Challenge (HPCC) benchmark.
4949
- [ ] Understand that scientific computer applications are primarily used to conduct scientific research, and can also evaluate system performance.
@@ -131,7 +131,7 @@ In this section, you are going to be building and compiling Lmod from source. Lm
131131
sudo dnf install -y epel-release
132132
sudo dnf install -y tcl-devel tcl tcllib bc
133133
sudo dnf install -y lua lua-posix lua-term
134-
sduo dnf --enable-repo=devel install lua-devel
134+
sudo dnf --enable-repo=devel install lua-devel
135135
```
136136
* APT
137137
```bash
@@ -526,7 +526,7 @@ After you've successfully completed the previous section, you will be ready to r
526526
cp setup/Make.Linux_Intel64 ./
527527
528528
# Edit the configuration file to make use of your Intel oneAPI Toolkit
529-
nano Make.Linux64
529+
nano Make.Linux_Intel64
530530
```
531531

532532
1. Configure your `Make.Linux_Intel64`
@@ -592,7 +592,6 @@ The [TOP500 list](https://top500.org/lists/top500/2024/06/) is a project that ra
592592
| | Compute node using custom compiled MPI and BLAS | | | |
593593
| | Compute node using Intel oneAPI Toolkits | | | |
594594
| | Across two compute nodes | | | |
595-
| | | | | |
596595
597596
> [!IMPORTANT]
598597
> You do **NOT** need to try and Rank you VM's HPL performance. Cores and threads are used interchangeably in this context. Following the recommended configuration and guides, your head node has one CPU package with two compute cores (or threads). Continuing this same analogy, your compute node has one CPU with six cores (or threads).

0 commit comments

Comments
 (0)