Thanks to @petrilaarne, BigQuery tells me that there are 49329 commits on GitHub with hashes beginning 'abc', and none starting 'abcd'. Assuming hashes are distributed uniformly, this implies there are around 49329 × 16³ ≈ 202 million commits in the dataset. In fact there are 203 million. It's nice when maths works!


Whoops, there are 3072 starting 'abcd' - I forgot to change the length of my substr :blush:
There are 15 starting 'abcdef', and none starting 'abcdefg'!
If you don't think too hard, that means we're due one!
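
For anyone who wants to redo the arithmetic, here is a rough Python sketch of the scaling used above: with uniformly distributed hex digits, a prefix of length k should match about 1 in 16^k commits, so the total is roughly count × 16^k. The names `observed`, `estimate_total` and `is_hex_prefix` are made up for illustration, and the counts are just the ones quoted in this thread, not a fresh query.

```python
import string

# Prefix counts quoted in the thread (hex prefix -> number of matching commits).
observed = {"abc": 49329, "abcd": 3072, "abcdef": 15}


def estimate_total(prefix: str, count: int) -> int:
    """Scale a prefix count up to a total, assuming uniform hex digits."""
    return count * 16 ** len(prefix)


def is_hex_prefix(prefix: str) -> bool:
    """True if every character could appear in a hexadecimal hash."""
    return all(c in string.hexdigits for c in prefix)


for prefix, count in observed.items():
    millions = estimate_total(prefix, count) / 1e6
    print(f"{prefix!r}: about {millions:.0f} million commits")

# Worth checking before waiting for an 'abcdefg' commit to turn up.
print("abcdefg is a possible prefix:", is_hex_prefix("abcdefg"))
```

The 'abcdef' estimate is much noisier than the other two, which is what you'd expect when scaling up from only 15 matches.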
