Supercomputing Wales Introduction
Overview
Teaching: 10 min
Exercises: 0 minQuestions
What is Supercomputing Wales and how do I get access to it?
Objectives
Understand what that a cluster is multiple computers connected together
Understand that clusters typically have shared storage
Understand what is meant by a node
Understand what is meant by a core
Prior Knowledge/Pre-requesites
- Basic use of the Linux command line, as covered in the Software Carpentry Introduction to the Unix Shell Lesson.
- An account on Supercomputing Wales.
Cluster basics
Clusters, otherwise know as high-performance computing (HPC) or high-throughput computing systems, are large collections of relatively normal computers linked together through a “interconnect”.
These tools are becoming the de facto standard tools in most research disciplines today.
What are some of reasons to use a cluster?
- Your computer does not have enough resources to run the desired analysis—for example, memory, processors, disk space, or network bandwidth.
- You want to produce results faster than your computer can.
- You cannot install software in your computer. That is, the application does not have support for your operating system, conflicts with other existing applications, or softare licensing does not allow for installation on personal laptops.
- You want to leave something running while your computer would be turned off or doing something else.
What does a cluster look like?
“High Performance Computing most generally refers to the practice of aggregating computing power in a way that delivers much higher performance than one could get out of a typical desktop computer or workstation in order to solve large problems in science, engineering, or business.” —InsideHPC
Clusters are simply a grouping of computers with the same components (RAM, disk, processors/cores, and networking cards) as those in your desktop or laptop, but with more umph! and are networked with high-speed interconnect that can be accessed (indirectly) through software, the scheduler, that manages simultaneous execution of jobs, or analyses, by multiple persons.
The user accesses the compute cluster through one or more login nodes, and submits jobs to the scheduler, which will dispatch to and collect the completed work from the compute nodes. Frequently, clusters have shared disks, or filesystems, of various flavors where you can store your data, programs, and use for in-job execution (working or scratch areas)
Nodes and Cores
Each individual computer in a cluster is commonly referred to as a “node”. Inside each node will be several processor chips that do the actual computation. Until around the mid 2000s most desktop/laptop computers had only a single processor, but since then most are multi-core meaning they effectively have multiple processors but all on one physical chip. A typical node in a cluster will have anything from 8 to 40 cores in total often across several physical processor chips.
Key Points
A cluster is a group of computers connected together to act as one.
Clusters are formed of nodes, each usually has several processors and 10s or hundreds of gigabytes of RAM.
Supercomputing Wales has clusters for researchers at Welsh universities to use