Filesystems and Storage
Overview
Teaching: 15 min
Exercises: 10 minQuestions
Where can I store my data?
What is the difference between scratch and home filestore?
Objectives
Understand the difference between home and scratch directories
Understand how to copy files between your computer and your Supercomputing Wales home/scratch directories
Filesystems and Storage
What is a filesystem?
Storage on most compute systems is not what and where you think they are! Physical disks are bundled together into a virtual volume; this virtual volume may represent one filesystem, or may be divided up, or partitioned, into multiple filesystems. And your directories then reside within one of these fileystems. Filesystems are accessed over the network through mount points.
There are multiple storage/filesystems options available for you to do your work. The most common are:
- home: where you land when you first login. 50 GB per user. Slower access, backed up (Cardiff only). Used to store your work long term.
- project: shared between all users of a project. Same filesystem as home.
- scratch: temporary working space. Faster access, not backed up. Larger quota, but old files might get deleted. DON’T STORE RESULTS HERE!
Here’s a synopsis of filesystems on Hawk in Cardiff:
Name | Path | Default Quota | Disk Size | Backed Up |
---|---|---|---|---|
Home | /home/user.name | 50GB | 420TB | Yes |
Project | /home/scwXXXX | Negotiable | 12TB (same disk as home) | Yes |
Scratch | /scratch/user.name | 20TB + 10million files | 1200TB | No |
and on Sunbird in Swansea:
Name | Path | Default Quota | Disk Size | Backed Up |
---|---|---|---|---|
Home | /home/user.name | 100GB | 231TB | No |
Project | /home/scwXXXX | Negotiable | 231TB (same disk as home) | No |
Scratch | /scratch/user.name | 20TB + 10million files | 808TB | No |
Important!! Ensure that you don’t store anything longer than necessary on scratch, this can negatively affect other people’s jobs on the system.
Accessing your filestore
How much quota do I have left on my home directory?
Login to a head node (e.g. sunbird.swansea.ac.uk
or hawklogin.cf.ac.uk
) and run the myquota
command. This will tell you how much space is left in your home directory.
$ myquota
HOME DIRECTORY a.cos
Filesystem used quota limit grace files quota limit grace
/lustrehome 48.4G 100G 110G - 206182 220000 230000 -
SCRATCH DIRECTORY a.cos
Filesystem used quota limit grace files quota limit grace
/scratch 63.29G 19.56T 20.06T - 313075 9851800 10101800 -
Group Filestore
If you have multiple collaborators working on a particular project and
would like to share common software or data across the project, then
it will be convenient for you to use a shared filestore. These can be
created on /home
(for long-term use, e.g. software) or on /scratch
(for short-term data). If you would like one setup for you, you can
raise a support ticket or speak to one of your local RSEs.
How much scratch have I used?
The df
command tells you how much disk space is left. The -h
argument makes the output easier to read, it gives human readable units like M, G and T for Megabyte, Gigabyte and Terrabyte instead of just giving output in bytes. By default df will give us the free space on all the drives on a system, but we can just ask for the scratch drive by adding /scratch
as an argument after the -h
.
$ df -h /scratch
Filesystem Size Used Avail Use% Mounted on
172.2.1.51@o2ib:172.2.1.52@o2ib:/scratch 692T 57T 635T 9% /scratch
Copying data from your PC to Supercomputing Wales
You can copy files to/from your Supercomputing Wales home and scratch drives using the secure copy protocol (SCP) or secure file transfer protocol (SFTP) and connecting to Sunbird or Hawk.
Copying data using Filezilla
Filezilla is a graphical SCP/SFTP client available for Windows, Mac and Linux. You can download it from Filezilla download
Open filezilla and type sftp://sunbird.swansea.ac.uk
or sftp://hawklogin.cf.ac.uk
into the host box. Enter your username and password in the username/password boxes.
Click Quickconnect and a connection will be started. The first time you connect you will be asked to verify the host key, tick the “Always trust this host, add key to the cache” box to stop this message appearing again in future.
You should now have some files in the right hand side of the window. These are on the remote system, the list on the left hand side is your local system.
Files can be transferred either by dragging and dropping them from one side to the other. Or you can right click on a remote file and choose “Download” or a local file and choose “Upload”.
You can change directory on the remote host by typing a path into the “Remote site:” box. For example type in /scratch/user.name
(where user.name is your username) to access your scratch directory.
Copying on the command line using SFTP or SCP
If you prefer to use a command line interface to copy files then see the reference material on using the SFTP and SCP commands.
Exercises
Using the
df
command.
- Login to a login node
- Run the command
df -h
.- How much space does /scratch have left?
- If you had to run a large job requiring 10TB of scratch space, would there be enough space for it?
Using the
myquota
command.
- Login to a login node.
- Run the
myquota
command.- How much space have you used and how much do you have left?
- If you had a job that resulted in 60GB of files would you have enough space to store them?
Copying files.
- Login to a login node.
- Create a file called hello.txt by using the nano text editor (or the editor of your choice) and typing
nano hello.txt
. Enter some text into the file and press Ctrl+X to save it.- Use either Filezilla or SCP/SFTP to copy the file to your computer.
- Create a file on your computer using a text editor. Copy that file to your Supercomputing Wales home directory using Filezilla or SCP/SFTP and examine its conents with
nano
on the login node.
Key Points
The home directory is the default place to store data.
The scratch directory is a larger space for temporary files.
On Hawk in Cardiff home is backed up but is also a slower disk.
On Sunbird in Swansea neither is backed up.
Quotas on home are much smaller than scratch.