Supercomputing Wales Introduction
|
A cluster is a group of computers connected together to act as one.
Clusters are formed of nodes; each node usually has several processors and tens or hundreds of gigabytes of RAM.
Supercomputing Wales has clusters for researchers at Welsh universities to use.
|
Logging in to Supercomputing Wales
|
Use ssh sunbird.swansea.ac.uk or ssh hawklogin.cf.ac.uk to log in to the system.
sinfo shows partitions and how busy they are.
slurmtop shows another view of how busy the system is.
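For example, a first session might look like this (s.jane.doe is a placeholder username, as in the SFTP example at the end):
ssh s.jane.doe@sunbird.swansea.ac.uk   # log in to the Swansea cluster
sinfo                                  # list the partitions and their current state
slurmtop                               # alternative live view of how busy the nodes are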
|
Filesystems and Storage
|
The home directory is the default place to store data.
The scratch directory is a larger space for temporary files.
On Hawk in Cardiff, home is backed up, but it is also on slower disks.
On Sunbird in Swansea, neither home nor scratch is backed up.
Quotas on home are much smaller than those on scratch.
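As a minimal sketch of staging data, assuming scratch is mounted at /scratch/$USER (the exact path may differ between Hawk and Sunbird):
cd /scratch/$USER     # move to your scratch space; this path is an assumption
cp ~/input.dat .      # copy a hypothetical input file over from home
Remember that files here are temporary and may not be backed up.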
|
Running Jobs with Slurm
|
Interactive jobs let you test out the behaviour of a command, but they aren’t practical for running lots of jobs.
Batch jobs are suited for submitting a job to run without user interaction.
Job arrays are useful to submit lots of jobs.
Slurm lets you set parameters such as how many processors or nodes are allocated, how much memory the job may use, and how long it can run (see the sketch below).
Slurm can email you when a job starts or finishes.
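A minimal batch script combining these options might look like the following sketch; the job name, resource requests, and email address are illustrative placeholders, and your site may require further options such as a project account:
#!/bin/bash
#SBATCH --job-name=example                     # placeholder job name
#SBATCH --ntasks=4                             # how many processors to allocate
#SBATCH --mem=2G                               # memory for the whole job
#SBATCH --time=01:00:00                        # the job may run for at most 1 hour
#SBATCH --array=0-9                            # optional: submit 10 copies as a job array
#SBATCH --mail-type=BEGIN,END                  # email when the job starts and finishes
#SBATCH --mail-user=s.jane.doe@example.ac.uk   # placeholder address

echo "Array task $SLURM_ARRAY_TASK_ID running on $(hostname)"
Submit the script with sbatch and watch its progress with squeue.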
|
Working with Modules and Installing Software
|
A lot of software is only available by loading extra modules. This helps prevent problems where two packages are incompatible.
Python 3 is one such package.
If you want to install pip packages use the --user option to store the packages in your home directory.
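For example (the exact module name and version vary between clusters, so check module avail first):
module avail python        # list the available Python modules
module load python         # load one; the unversioned name here is an assumption
pip install --user numpy   # installs the package under your home directory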
|
What next?
|
Remember that HPCs are shared systems, so try to avoid allocating resources that you don’t use.
Don’t create millions of files.
You will need to apply for a new project on Supercomputing Wales or join an existing one.
Make use of the Research Software Engineers to help you use the system effectively.
|
Optimising for Parallel Processing
|
|
Running on GPUs
|
|
Profiling Single Core Performance
|
Each programming language typically provides tools called profilers with which you can analyse the runtime of your code.
The estimate of pi spends most of its time generating random numbers.
The estimation of pi with the Monte Carlo method is a compute-bound problem, because pseudo-random numbers are produced purely by computation.
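For example, Python’s built-in cProfile profiler can be run from the shell; pi.py stands in for a hypothetical script containing the Monte Carlo estimate:
python3 -m cProfile -s cumtime pi.py   # list functions sorted by cumulative run time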
|
Parallel Estimation of Pi
|
Amdahl’s law describes the speed-up you can expect from your parallelisation efforts, given the fraction of the program that can run in parallel.
Use the profiling data to calculate the time consumption of hot spots in the code.
The generation and processing of random numbers can be parallelised, as it is a data-parallel task.
Time consumption of a single application can be measured using the time utility.
The run time of a serial implementation divided by the run time of the equivalent parallel program is called the speed-up.
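As a sketch with illustrative timings, where pi-serial and pi-parallel are hypothetical implementations:
time ./pi-serial                 # suppose this reports 60 seconds
time mpirun -n 4 ./pi-parallel   # suppose this reports 20 seconds
# speed-up = 60 s / 20 s = 3 on 4 processors, less than 4 as Amdahl's law predicts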
|
Distributing computations among computers with MPI
|
The MPI launcher mpirun sends compute jobs to a set of allocated computers.
The MPI software then executes these jobs on the remote hosts and lets the processes share data and synchronise by passing messages.
MPI assigns a rank to each process; usually the process with rank zero does the coordination.
MPI can be used to split a task into components and have several nodes run them.
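For example, launching the hypothetical pi-parallel program from the previous section on four processes:
mpirun -n 4 ./pi-parallel   # starts 4 copies of the program, with ranks 0 to 3
Inside a Slurm batch job, srun can play the same role using the resources the job was allocated.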
|
If you want to use SFTP from the command line instead of FileZilla, then these commands might be helpful.
sftp s.jane.doe@sunbird.swansea.ac.uk:/home/s.jane.doe/data
s.jane.doe@sunbird.swansea.ac.uk's password:
Connected to sunbird.swansea.ac.uk.
Changing to: /home/s.jane.doe/data
sftp> ls