Filesystems and Storage

Overview

Teaching: 15 min
Exercises: 10 min
Questions
  • Where can I store my data?

  • What is the difference between scratch and home filestore?

Objectives
  • Understand the difference between home and scratch directories

  • Understand how to copy files between your computer and your Bert home/scratch directories

Filesystems and Storage

What is a filesystem?

Storage on most compute systems is not what and where you think they are! Physical disks are bundled together into a virtual volume; this virtual volume may represent one filesystem, or may be divided up, or partitioned, into multiple filesystems. And your directories then reside within one of these fileystems. Filesystems are accessed over the network through mount points.

There are multiple storage/filesystems options available for you to do your work. The most common are:

Here’s a synopsis of filesystems on Bert:

Name Path Default Quota Disk Size Backed Up
Home /ibers/ernie/home/user.name 250GB 42TB Yes
Groups /ibers/repository03/groups/group.name None currently 200TB Yes
Scratch /scratch/user.name None currently 109TB No
Fast Scratch /fast-scratch/user.name None currently 5.2TB No

Important!! Ensure that you don’t store anything longer than necessary on scratch, this can negatively affect other people’s jobs on the system.

Accessing your filestore

How much quota do I have left on my home directory?

Login to the cluster (e.g. bert.ibers.aber.ac.uk) and run the quota command. This will tell you how much space is left in your home directory.

$ quota
Quotas aren't currently working, don't abuse this!

Group Filestore

If you have multiple collaborators working on a particular project and would like to share common software or data across the project, then it will be convenient for you to use a shared filestore. If you would like one setup for you, you can raise a support ticket by emailing ibers-cs@aber.ac.uk.

How much scratch have I used?

The df command tells you how much disk space is left. The -h argument makes the output easier to read, it gives human readable units like M, G and T for Megabyte, Gigabyte and Terrabyte instead of just giving output in bytes. By default df will give us the free space on all the drives on a system, but we can just ask for the scratch drive by adding /scratch as an argument after the -h.

$ df -h /scratch
Filesystem         Size  Used Avail Use% Mounted on
nfs01-ib:/scratch  109T   11T   98T  10% /scratch

Copying data from your PC to the cluster

You can copy files to/from your home and scratch drives using the secure copy protocol (SCP) or secure file transfer protocol (SFTP) and connecting to Bert.

Copying data using Filezilla

Filezilla is a graphical SCP/SFTP client available for Windows, Mac and Linux. You can download it from Filezilla download

Open filezilla and type sftp://bert.ibers.aber.ac.uk into the host box. Enter your username and password in the username/password boxes.

Transferring files using FileZilla

Click Quickconnect and a connection will be started. The first time you connect you will be asked to verify the host key, tick the “Always trust this host, add key to the cache” box to stop this message appearing again in future.

Transferring files using FileZilla

You should now have some files in the right hand side of the window. These are on the remote system, the list on the left hand side is your local system.

Transferring files using FileZilla

Files can be transferred either by dragging and dropping them from one side to the other. Or you can right click on a remote file and choose “Download” or a local file and choose “Upload”.

Transferring files using FileZilla Transferring files using FileZilla

You can change directory on the remote host by typing a path into the “Remote site:” box. For example type in /scratch/user.name (where user.name is your username) to access your scratch directory.

Transferring files using FileZilla

Copying on the command line using SFTP or SCP

If you prefer to use a command line interface to copy files then see the reference material on using the SFTP and SCP commands.

Exercises

Using the df command.

  1. Login to Bert
  2. Run the command df -h.
  3. How much space does /scratch have left?
  4. If you had to run a large job requiring 10TB of scratch space, would there be enough space for it?

Copying files.

  1. Login to Bert
  2. Create a file called hello.txt by using the nano text editor (or the editor of your choice) and typing nano hello.txt. Enter some text into the file and press Ctrl+X to save it.
  3. Use either Filezilla or SCP/SFTP to copy the file to your computer.
  4. Create a file on your computer using a text editor. Copy that file to your Bert home directory using Filezilla or SCP/SFTP and examine its conents with nano on the login node.

Key Points

  • The home directory is the default place to store data.

  • The scratch directory is a larger space for temporary files.

  • Quotas on home are much smaller than scratch.

  • The SSD scratch is faster than regular scratch, but smaller.