Longwood is the newest High-Performance Compute Cluster at HMS. It is located at the Massachusetts Green High Performance Computing Center.
...
This provides a heterogeneous environment with both Intel (DGX) and ARM (Grace Hopper) architectures. Module management is supported through LMOD, allowing easy loading of software suites like the NVIDIA NeMo deep learning toolkit and more.
How to connect
Note |
---|
The cluster is currently only accessible via secure shell (ssh) command line from the HMS network:
Two-factor authentication (DUO) is not required for logins because all connections must originate from an HMS network. Currently, the login server hostname is: login.dgx.rc.hms.harvard.edu |
...
/home
Max: 100GiB
/n/scratch
Individual scratch folders are created by HMS RC when making a new user account.
Max: 25TiB or 2.5 million files
Path:
/n/scratch/users/<first_hms_id_char>/<hms_id>
/n/lw_groups
Shared folders for labs, created by request. Contact rchelp@hms.harvard.edu if interested.
To transfer data to/from the Longwood cluster, please use the server transfer.dgx.rc.hms.harvard.edu . From O2, you must initiate the transfer from a compute node.
Snapshots
.snapshot
is a feature available on Longwood. This enables recovery of data accidentally deleted by users, daily:14 days
and weekly: 60 days
.
...
gpu_dgx - the standard GPU partition
gpu_grace - this targets the special Grace Hopper nodes. You’ll need to be using software compiled for ARM
gpu_dia - the DIA dedicated GPU partition which takes priority over gpu_dgx
cpu - the partition available to run jobs that do not require a GPU card. This partition does not include Grace Hopper nodes. If you require a Grace Hopper node, you can still submit a job to
gpu_grace
without specifying a GPU resource.TimeLimit is up to 5 days for both partitions
...