Storage: Lustre (scratch)

Working with small files

As Lustre is meant for large files, the performance with small (smaller than 10MB) files will not be optimal. If possible, try to avoid working with large numbers of small files. Large numbers is greater than thousands or tens of thousands.

Working with large files

By default Lustre on Triton is configured so that as files grow larger, they get striped (split) over more storage servers. This way, small files only require one server to serve the file (reducing latency), while large files can be streamed over multiple disks.

This page previously had instructions for how to adjust the striping of files yourself, but it is now automatic.

Lustre: common recommendations

Triton’s Lustre is much better than it was 10 years ago, but it’s still worth thinking about the following things:

Minimize use of ls -l and ls --color when possible.

Several excellent recommendations are at https://www.nas.nasa.gov/hecc/support/kb/Lustre-Best-Practices_226.html , they are fully applicable to our case.

Be aware, that being a high performance filesystem Lustre still has its own bottlenecks, and even non-proper a usage by a single user can get whole system in stuck. See the recommendations at the link above how to avoid those potential situations. Common Lustre troublemakers are ls -lR, creating many small files, rm -rf, small random i/o, heavy bulk i/o.

For advanced user, these slides (from 2012) can be interesting: https://www.eofs.eu/wp-content/uploads/2024/02/06_daniel_kobras_s_c_lustre_fs_bottleneck.pdf