oph:cluster:storage
Questa è una vecchia versione del documento!
Storage types
The DIFA-OPH cluster offers different storage areas with very different characteristics and limitations.
$HOME
- This is a storage area available from every node. Should be used for source files, compiling code, jobs that do not need a lot of space.
- NFS-mount via Ethernet (1Gbps): not very fast but quite responsive.
- Quota limit: 50GB (soft) / 100GB (hard); check how much you're using with
quota -u
/home/temp
- Temporary replacement for /fast (see below), useable to store work that doesn't fit $HOME. NOT AN ALTERNATIVE TO /scratch
/home/web
- This share is web-accessible (read-only) at
https://apps.difa.unibo.it/files/people/Str957-cluster
. Web access is read-only and it is not possible to create dynamic pages.
/home/work
- Used as work area for jobs that do not need very big datasets or need to have lots of files in a single directory (not recommended, might degrade performances for all users!).
/scratch
- Main archive area. To be used for large files, big datasets and archives.
- IMPORTANT: DO NOT USE FOR LOTS OF SMALL FILES!!!:
/scratch
is a clustered filesystem that on one hand allows to store big files and to scale out capacity by adding more servers, but on the other hand introduces quite a lot of overhead (it must contact 3 servers to determine where to get a file from, for example) that won't get amortized with small files. Way worse are directory lookups, that nearly crawl to a halt when a directory contains more that about a thousand files. If you need some big files (say a dataset) and your codes do not allow for specifying a different path, just use symlinks from$HOME
to the files in/scratch
.
- Quota: per-sector (see
df -kh /scratch/sector
), with extra space allocated for specific projects or bought by individuals (df -kh /scratch/extra/project_or_name
).
/fast (planned)
PLANNED
⇒ not useable (yet)
- Parallel filesystem, for quick access to the data you're working on.
- No quota, but files older than TBD1)
days will be automatically deleted without further notice.
- No cross-server redundancy, just local RAID: if (when) a server (or two disks in the same RAID) fails, all data becomes unavailable – always keep a copy of your important data archived elsewhere (maybe /scratch, but for very important data offsite is better)
$TMPDIR
Every node does have some available space on local storage. It's useful to store temporary files that do not need to be shared between nodes. Being local, latency is very low. But local disks aren't big. $TMPDIR can usually store around 200GB of data. It gets automatically cleaned when the job terminates: if you need to keep some files, copy 'em to other storages.
1)
Decision to be finalized by the board
oph/cluster/storage.1697437741.txt.gz · Ultima modifica: 2023/10/16 06:29 da diego.zuccato@unibo.it