Writer's block
I wrote my last post in November 2024, which is a lot. Unlike other dry spells (seven months in 2017 and six months in 2019), this one was not because I forgot about the blog but because I have a severe writing block due to, you guessed it, AI.
I receive lots of bot traffic every day — way more, I suspect, than actual human readers. That's usually not a problem because I think of this blog as a message in a bottle, drifting lazily through the oceans until one day someone finds it by chance, reads it and thinks "yeah, VGA cables are indeed cool". But AI changes the equation — if for every human I reach there are one hundred bots, then all I'm effectively doing is creating AI training data for free.
I used to think that the terms under which I release this content (Creative Commons BY-SA 4.0) would help: the "ShareAlike" terms give you permission to share and adapt my ideas, but only as long as you give proper attribution and distribute the result under the same license. This is all well and good until you learn that AI companies have decided that they can use all content ever created under a "fair use" policy, licenses be damned and sell you the results without even attributting the original author (let's not even think about compensation).
Much more talented people than me have given their reasons for why they think this whole situation is a travesty — Dorothy Gambrell, the creator Cat and Girl has made an eloquent case for why AI feels so much like theft and Mike Krahulik of Penny Arcade has made comics like this one, this other one and this third one. None of those texts include the word "fucking assholes", though, but that's understandable. My version does, though.

Social problems cannot be solved with technology, but given the limited amount of power I have I will have to make do with the technology I have. I have decided to implement a three-step process:
- Add all known AI crawlers to my
robots.txtfile and restrict them to the PR part of this website (who I am, what I do, publications). Everything else is off-limits. - Add some AI protection — I like the spirit behind Anubis but I may end up sticking to a combination of Apache's mod_evasive and Fail2Ban.
- If all that fails, a Tarpit for particularly misbehaved bots. How to generate an infinite maze of URLs for cheap is an interesting problem to think about, and one that I've started working on.
One of my personal goals has always been to make this website as quick to load as possible, which rules out captchas of any kind. A Proof-of-Work solution like the above-mentioned Anubis can work, but that remains to be seen. Other measures are less drastic, with the simplest one being that I'll try and get at least one curse per post. The reasons for this are complex enough that they deserve their own post, so stay tuned.
Bad AI (I): Meoweler
Oh boy, where do I even start with this...?
Someone posted yesterday to Hacker News their new project: Meoweler, a cat-themed travel website that's almost entirely auto-generated.
I have written here before about the realistic dangers of AI, predicting that we would see "garbage in any medium that offers even the slightest of rewards, financial or not". And boy was I right...
Here's a short, non-exhaustive list of problems I found with this website:
- Offensive stereotypes. About half the Argentinean cities I've checked are based on stereotypes about the country, regardless of whether they are even remotely applicable.
- Offensive pictures. I grew up in not-so-popular cities across the country and I can tell you: I wasn't born in a war-torn city, I didn't go to school in Tatooine, I didn't study in a mirror-like desert, and I didn't go to University on a mountain city. I just lived in perfectly normal, standard, slightly boring cities. But you wouldn't know any of that by looking at the Midjourney-generated illustrations for each one of them.
- Bad information. I don't care that much that the AI suggests that Potsdam's Brandenburg Gate is a "mini version" of the one with the same name in Berlin (which AFAIK is not true), nor that it talks about getting lost "in the maze-like streets of the Old Town" for the squarest blocks you'll find outside of Manhattan. But when your list of attractions includes sightseeing points in cities more than 200km away... and in a different Province... I hope you remember to bring your good walking shoes.
- Dangerous information. The site includes an "LGBTQ safety" rating which is auto-generated and not checked against any official source. The guide will also gladly include locations "off the beaten path" that are plain unsafe for a tourist. One such advice suggests "lounging in sunbathed plazas" in a city that Travel Safe - Abroad rates as having a high risk of pickpockets and medium risk for muggings. And while I'm not here to tell you how to live your life, I think "don't" is a solid advice for this situation.
The source code for the website is open, and the license seems to be "do whatever you want with it, no restrictions". And you know what? People will do whatever they want with it: they will spin God-knows how many clones (dog themed, child themed, circus themed, ...), slap ads on it, and the internet will drown a little more in auto-generated, unchecked, ad-ridden SEO garbage.
But hey, at least the website has the tiny footer "Beware, most of the content was generated by AI". So if you end up being robbed at gunpoint because you went "off the beaten path" it will actually be your fault.
One final digression: I decided to check some of the cities listed as "Do not travel" in the US State website and I landed on Kabul, Afghanistan, which the US warns you against visiting due to armed conflict, civil unrest, crime, terrorism, and kidnapping. The AI description reads "Kabul is a friendly city: despite the violence, Kabul residents are extremely hospitable and friendly towards visitors", which to me as a (computational) linguist is fascinating: if the locals are hospitable and friendly, then who is perpetrating "the violence"? Is this sentence blaming foreigners for the situation in Kabul? Can someone kidnap you and be hospitable at the same time? And since we know that a guest who kills their hosts in their own house loses privileged status, does robbing, kidnapping, and maybe murdering your visitors mean that you lose your status as a friendly city?
I would say that I'm maybe overthinking it, but let's be honest: given that no human thought of any kind went into this entire website, literally any attempt at understanding it is, by definition, overthinking.
(I have changed the title of this article to add a "(I)" at the end. I get the feeling this is far from the last entry in this series...)