Cloudflare's next-generation "AI Labyrinth" promises to "waste resources" as-a-service, using today's machine learning models to sabotage tomorrow's - feddit.site

Arthur Besse to [email protected]


Trapping misbehaving bots in an AI Labyrinth

blog.cloudflare.com

How Cloudflare uses generative AI to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives.
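For anyone unsure what a "no crawl" directive looks like in practice: it is usually a robots.txt rule. Below is a rough sketch of how a well-behaved crawler could check one with Python's standard `urllib.robotparser`. The robots.txt contents and the `ExampleAIBot` name are made up for illustration and are not taken from the Cloudflare post.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration; the bot name is invented.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_allowed("ExampleAIBot", "https://example.com/blog/post"))
    print(is_allowed("SomeBrowser", "https://example.com/blog/post"))
```

A bot that skips this check, or ignores its answer, is the kind of crawler the post is talking about.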

Today, we’re excited to announce AI Labyrinth, a new mitigation approach that uses AI-generated content to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives. When you opt in, Cloudflare will automatically deploy an AI-generated set of linked pages when we detect inappropriate bot activity, without the need for customers to create any custom rules.

And it’s “free”! (Visibility into all of that traffic is more than sufficient payment for them 🤑)

Here are some perhaps-contradictory highlights from their blog post (emphasis mine), which I’m pretty sure was itself written with LLM assistance:

No real human would go four links deep into a maze of AI-generated nonsense.

When these links are followed, we know with high confidence that it’s automated crawler activity, as human visitors and legitimate browsers would never see or click them. This provides us with a powerful identification mechanism, generating valuable data that feeds into our machine learning models. By analyzing which crawlers are following these hidden pathways, we can identify new bot patterns and signatures that might otherwise go undetected.

But as bots have evolved, they now proactively look for honeypot techniques like hidden links, making this approach less effective.

AI Labyrinth won’t simply add invisible links, but will eventually create whole networks of linked URLs that are much more realistic, and not trivial for automated programs to spot. The content on the pages is obviously content no human would spend time-consuming, but AI bots are programmed to crawl rather deeply to harvest as much data as possible. When bots hit these URLs, we can be confident they aren’t actual humans, and this information is recorded and automatically fed to our machine learning models to help improve our bot identification. This creates a beneficial feedback loop where each scraping attempt helps protect all Cloudflare customers.

This is only the first iteration of using generative AI to thwart bots for us. Currently, while the content we generate is convincingly human, it won’t conform to the existing structure of every website. In the future, we’ll continue to work to make these links harder to spot and make them fit seamlessly into the existing structure of the website they’re embedded in. You can help us by opting in now.
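To make the quoted mechanism concrete, here is a minimal sketch of a link maze with depth-based flagging. It is not Cloudflare's implementation: the `/maze/` prefix, the fan-out of three, the hash-derived URLs, and the `MazeTracker` class are all assumptions for illustration, and the real product serves AI-generated page content that this sketch leaves out. The threshold of four comes from the "four links deep" line quoted above.

```python
import hashlib
from collections import defaultdict
from typing import List

MAZE_PREFIX = "/maze/"   # hypothetical URL prefix for decoy pages
LINKS_PER_PAGE = 3       # assumed fan-out per decoy page
FLAG_DEPTH = 4           # "four links deep", per the quoted post


def child_links(path: str) -> List[str]:
    """Derive decoy links from the current path, so no per-page state is stored."""
    base = path.rstrip("/")
    links = []
    for i in range(LINKS_PER_PAGE):
        token = hashlib.sha256(f"{path}#{i}".encode()).hexdigest()[:8]
        links.append(f"{base}/{token}")
    return links


def depth(path: str) -> int:
    """Count path segments below the maze prefix; 0 for pages outside the maze."""
    if not path.startswith(MAZE_PREFIX):
        return 0
    return len([seg for seg in path[len(MAZE_PREFIX):].split("/") if seg])


class MazeTracker:
    """Remember the deepest maze page each client has requested."""

    def __init__(self) -> None:
        self.max_depth = defaultdict(int)

    def record(self, client_id: str, path: str) -> bool:
        """Record a request; return True once the client has reached FLAG_DEPTH."""
        self.max_depth[client_id] = max(self.max_depth[client_id], depth(path))
        return self.max_depth[client_id] >= FLAG_DEPTH


if __name__ == "__main__":
    tracker = MazeTracker()
    path = MAZE_PREFIX
    for _ in range(FLAG_DEPTH):
        path = child_links(path)[0]  # a crawler blindly following the first link
        print(path, tracker.record("crawler-1", path))
```

Because the links are derived from the path itself, the maze is effectively endless without storing anything per page. The flag only says that a client went deep; what to do with that signal (block, throttle, feed a classifier) is a separate decision.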


Monkey With A Shell English

24•1Y

https://zadzmo.org/code/nepenthes/

For the self hosted version.

@[email protected] English

4•1Y

Nepenthes

9•1Y

And https://iocaine.madhouse-project.org/ (this is the one I’m running myself)

WasPentalive English

5•1Y

Like beer, AI is the cause of, and the solution to, all our problems. – Homer Simpson

Technology

[email protected]


This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed
