@hexaheximal Would @Codeberg installing #Anubis help with this problem?

Techaro BotStopper (the commercial cut of #Anubis) is near ready for its first ~~victim~~, uhh customer, I mean customer. We'll have more docs in the future, but have a preview of what's to come in this thread!
https://michal.sapka.pl/2025/anubis-deployed-on-my-freebsd-server/
Anubis is now deployed on my FreeBSD server
--- well, mastodon got blocked ;-)
Oh, #anubis is already in #FreeBSD :latest ports https://www.freshports.org/www/go-anubis/
ergo: my server is going :latest
Anubis is interesting - but it's breaking a lot of the RSS feeds I follow. Several sites I regularly read using text browsers are suddenly inaccessible.
I get the urge to block AI scraper bots, but breaking basic web access doesn't feel like the right trade-off.
It appears that some #Anubis installations replace the cute mascot with a check mark emoji. This makes me sad. It was fun to be judged by an anime girl.
"Setting up Anubis to protect cgit from AI crawlers"
https://sysrq.in/en/article/cgit-with-anubis.md
This is my new attempt at writing a useful guide! In the article I try to explain my current configuration for running cgit (or any other CGI application) with Anubis.
The guide suggests using uWSGI to serve CGI, with Nginx being a reverse proxy.
Get bent, OpenAI!
{
  "time": "2025-04-22T00:27:46.651583942Z",
  "level": "INFO",
  "source": {
    "function": "github.com/TecharoHQ/anubis/lib.(*Server).MaybeReverseProxy",
    "file": "github.com/TecharoHQ/anubis/lib/anubis.go",
    "line": 235
  },
  "msg": "explicit deny",
  "user_agent": "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; ChatGPT-User/1.0; +https://openai.com/bot",
  "accept_language": "en-US,en;q=0.9",
  "priority": "",
  "x-forwarded-for": "",
  "x-real-ip": "52.255.111.58",
  "check_result": {
    "name": "bot/ai-robots-txt",
    "rule": "DENY"
  }
}
#Anubis works as intended.
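If you want to see which rules are doing the denying, something like this rough sketch could tally entries like the one above. It assumes the log is written as one JSON object per line (the entry above is shown pretty-printed), and `count_denies` is a hypothetical helper name, not part of Anubis. It only relies on the `msg` and `check_result.name` fields visible in the log entry.

```python
import json
from collections import Counter


def count_denies(lines):
    """Tally "explicit deny" log entries by the name of the rule that matched.

    Assumes one JSON object per line; lines that are blank, not valid JSON,
    or not JSON objects are skipped.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # not a JSON log line
        if not isinstance(entry, dict):
            continue
        if entry.get("msg") != "explicit deny":
            continue
        check = entry.get("check_result") or {}
        counts[check.get("name", "unknown")] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage (hypothetical log file name): python count_denies.py < anubis.log
    for name, total in count_denies(sys.stdin).most_common():
        print(f"{total}\t{name}")
```

Fed the entry above on a single line, this would count one deny under bot/ai-robots-txt.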
#Anubis now protects UNESCO, FreeBSD, Linux's git/mailing list archives, Arch Linux's wiki, GNOME's GitLab, FFmpeg's bug tracker and more. It's probably good enough for your server too! anubis.techaro.lol
I remember coming across this about 4 years ago and losing it at the "Yo Whaddap, It's ya boi Anubis" on the first panel.
That first panel is perfection. IYKYK!
Artist: featheredsnek (https://featheredsnek.bsky.social)
Very nice, #sourcehut is now using #anubis more. For code forges, it really should not be skipped. So many crawlers just do not care what you say can be used.
https://sourcehut.org/blog/2025-04-15-you-cannot-have-our-users-data/
Oh my gods, look at this release name for #anubis - https://github.com/TecharoHQ/anubis/releases/tag/v1.16.0
Woke up this morning to yet more Linode alerts and another failed server as a result of AI bots relentlessly scraping my #Gitea instance.
I heard about #Anubis (https://anubis.techaro.lol) when Xe Iaso (https://xeiaso.net) was on a recent episode of the #SelfHostedShow podcast and so it seemed like a great opportunity to give it a try. I don't really need "SEO" or any discoverability on Gitea, so hopefully the only downside is that new visitors need to wait a few secs before things load.
I was not particularly bothered by #Anubis at first, but having to wait upwards of 15 seconds to access some of the websites using it is uhhh...
bad. yeah I think I'd call that bad.
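For context on where the waiting comes from: Anubis makes the browser solve a proof-of-work challenge before letting it through. Below is a generic hashcash-style sketch of that idea, not Anubis's actual code. The names `solve` and `verify`, the challenge string, and the difficulty values are all made up for illustration. Each extra leading zero hex digit multiplies the expected number of hash attempts by 16, so a high difficulty on a slow device can plausibly turn "a few secs" into a much longer wait.

```python
import hashlib


def solve(challenge: str, difficulty: int) -> tuple[int, str]:
    """Search for a nonce whose SHA-256 digest starts with `difficulty` zero hex digits."""
    prefix = "0" * difficulty
    nonce = 0
    while True:
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
        if digest.startswith(prefix):
            return nonce, digest
        nonce += 1


def verify(challenge: str, nonce: int, difficulty: int) -> bool:
    """Checking a solution takes a single hash, so it is cheap for the server."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
    return digest.startswith("0" * difficulty)


if __name__ == "__main__":
    import time

    # Hypothetical challenge string; difficulties kept small so this finishes quickly.
    for difficulty in (2, 3, 4):
        start = time.perf_counter()
        nonce, digest = solve("example-challenge", difficulty)
        elapsed = time.perf_counter() - start
        print(f"difficulty={difficulty} nonce={nonce} time={elapsed:.3f}s")
```

The asymmetry is the point: the visitor pays for many hashes, the server pays for one.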