This week on Saturday Morning Math Writer's Club, I've got an interview to prepare for with Rob Schapire, inventor of boosting in machine learning. I think this is going to be a great story about a preeminently practical result that came from pure theory. I have a slew of questions to ask Rob, but in the mean time I want to find examples of people using it in production.
Now I know there's a long list of Kaggle competition winners who use XGBoost: https://github.com/dmlc/xgboost/tree/master/demo#machine-learning-challenge-winning-solutions
But I don't consider this a production setting.
The same doc has a list of "use cases" and "integrations," but the only two that seem like they count are brief notes from the "Tencent data platform team" and the "autohome.com ad platform team." I will have to dig through the integrated tools to see if they list any compelling users.
A friend also brought up the possibility of writing about cuckoo hashing, but I don't know of anyone who uses this in prod. Plus, it seems like a relatively minor upgrade over something like linear probing, so I'm not sure in what context this would be particularly useful.
@j2kun Saw this today as a supposed practical application and deployment of cuckoo hashing in the TikTok recommendation system: https://gantry.io/blog/papers-to-know-20230110/, via https://en.wikipedia.org/wiki/Special:Diff/1157764695
@11011110 Oh boy, if I can get an interview with someone from TikTok... that would be wild. Maybe also get me in trouble with various state entities???