mathstodon.xyz is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Mastodon instance for maths people. We have LaTeX rendering in the web interface!

Server stats:

2.8K
active users

#snakemake

0 posts0 participants0 posts today

Last year at the Open Science Retreat (#OSR24NL) I have been introduced to nanopubs by @egonw and created my first nanopub declaring citations for a paper using CiTOs (citation ontologies).

Now, travelling to #OSR25CH, due to issues with the train network foreseen with plenty of time, I used the opportunity and created a new one ( w3id.org/np/RAicq7k9QHX8EG8ho7 ) for my latest publication on teaching #Snakemake on #HPC systems.

The teaching material is — again — in desperate need of additions and overhaul, but that is for another day.

w3id.orgRAicq7k9QH (explore) | nanodash

> Removing output files of failed job product2animation since they might be corrupted

😠😠😠

Thanks for nothing #SnakeMake for deleting a very precious output file, resulting from an hours/days-long process because it *might* 🪩 be corrupted 🤦

I though we were past this nonsense of build-systems deleting files at will, especially with a science-focused project such as snakemake!?

#SnakeMake has some nasty caching going on, where it sometimes reuses old versions of code you have long overwritten. Removing the .snakemake/ folder solves it. Also this behavior that if there's an error in your notebook, Snakemake will just *not* save the processed notebook so you can't actually have a look at the error.
Makes it kinda annoying to debug.

There are many HPC admins who prohibit using considerable CPU time on login nodes. This is understandable.

I want to take this opportunity to provide a data point. My student has measured the accumulated CPU time (user + system) for a 9 h (precise: 33343 s) run of a Snakemake workflow. It was 225 s or about 0.67 % - including jobs which were carried out on this login node, e.g. `mv`, `ln` or download of data.

There is certainly room for improvements. There will ever be room for improvements.

But my dear fellow admins: Running Snakemake on login nodes as a shepherd of jobs, will impair nobodies work.

Over time, I will certainly gather more and different statistics. And will invest time in necessary improvements. Regarding CPU time for checking job status, however, I believe to have demonstrated, that this is a pretty high hanging fruit.

@johanneskoester Random #snakemake question for you - should with work with SLURM where the default shell is Dash not Bash?

I have some code and workflows which works on SLURM cluster A where /bin/sh is an alias for bash, but fail on SLURM cluster B where /bin/sh is an alias for dash:

WorkflowError:
SLURM sbatch failed. The error message was /bin/sh: 1: Syntax error: "(" unexpected

I don't have any brackets in my workflows, but there are some in the data files…

Yesterday, I learned that we have only 16 - in letters: sixteen(!) - people attending my course in Dresden, so far.

So, if you are interested in creating sustainable data analysis workflows using #Snakemake on #HPC systems, this course might be something for you:

On the 26th and 27th, I will be teaching a 2-day course in #Saxony, #Dresden. See tu-dresden.de/zih/hochleistung for details and registration.

TU DresdenBuilding HPC-Compliant Snakemake Data Analysis WorkflowsNHR Tutorial (On-Site) Wednesday & Thursday, 26 & 27/02/2025, 9:00 am - 6:00 pm Speaker: Dr. Christian Meesters -- NHR SouthWest, JGU Mainz This tutorial provides participants with the …

When reading this call (mast.hpc.social/@sneuwirth/113) from @sneuwirth I think, I could tell a number of tales from my experience with developing the #Slurm plugin for #Snakemake (and other software over the years).

Particularly, issue reports telling what peculiarities fellow admins thought of (e.g. "we are not allowed to do this" and "we not that". "We have this setting" and "we that".) To develop #HPC software is a challenge in its own right. To support workflow software seems to develop into a nice challenge.

Could certainly deliver a funny few minutes. 😉 Perhaps I should collect anecdotes and write an article sometime? 🤔

HPC.social MastodonSarah Neuwirth (@sneuwirth@mast.hpc.social)📢 Call for Contributions: 2nd Workshop on Research Software Engineering in HPC (RSEHPC@ISC25), to be held in conjunction with #ISC25! RSE? HPC? HPC? RSE? Those fields are sometimes quite close to each other. One instance is the upcoming RSEHPC workshop at ISC25, June 13, 2025, in Hamburg. We are looking for contributions from the field of continuous integration and benchmarking for HPC! Lightning talk proposals are due Feb 28. Further details: https://www.helmholtz-hirse.de/events/2025_06_13-rsehpcatisc.html #HPC #Supercomputing