Cloudflare will block AI bots from crawling websites by default for new customers, and broker pay-per-crawl deals between its customers and bot operators.
Ulrich · 2d

they can already block VPN traffic unless it goes through their VPN

Yeah that’s how most VPNs work.

their whole business model is based on them being a man in the middle that decrypts SSL and analyses the requests in plaintext

Okay? Analyze all you want. They can’t stop bots on any of the other sites they regulate either.

about a third of websites worldwide use Cloudflare, so they have a pretty good bird’s-eye view of the behaviour of any machine that visits a lot of websites

Great. Bots intentionally change up their behavior and identifying information so as to go undetected.

@PowerCrazy@lemmy.ml · 2d (edited)

They can’t stop bots on any of the other sites they regulate either.

Why not? They are doing edge caching; they can literally just block the connection from visiting the site, just like they do with their DDoS mitigation.

Ulrich · 2d

they can literally just block the connection

Block which connection? Again, these AI companies know people don’t want them crawling their sites, and they do everything they can to be invisible. This has been an issue for years at this point.

just like they do with their DDoS mitigation

Blocking DDoS is trivial by comparison.

They can’t stop bots on any of the other sites they regulate either.

They can and do. What is blocked depends on the settings the website owner chooses in Cloudflare.

Bots intentionally change up their behavior and identifying information so as to go undetected.

If they have to crawl the web while behaving like a normal human, it will be orders of magnitude slower and more costly.

Ulrich · 2d

What is blocked depends on the settings the website owner chooses in Cloudflare.

And how does the owner know which connections are bots?

If they have to crawl the web while behaving like a normal human, it will be orders of magnitude slower and more costly.

They don’t care, they have trillions of dollars of VC money to power through.

The owner sets the level. If they set a strict level, all bots are blocked.

They do care. VC funding happens because the result is profitable. If it is less profitable, there will be less funding because of higher investment risk.

Ulrich · 2d

If they set a strict level, all bots are blocked.

I don’t know what you don’t understand. These bots are not labeling themselves as bots. They are camouflaging themselves to look like any other type of traffic.

VC funding happens because the result is profitable.

No, VC funding happens because investors are duped into thinking the result is profitable.

Ulrich · 2d (edited)

I’m not, and your tone is completely unnecessary, so maybe dial it back a bit.

Yes, VC funding can be profitable. It’s also often not. Like any other investment. Corporations will absolutely lie and blow smoke up their ass if they think it can get them more money.

@HelloRoot@lemy.lol · 2d (edited)

often

Rarely. The overall trends in the two links I shared show that it is more often profitable, resulting in a net return on investment.

My tone perfectly reflects my level of respect for you being continually and confidently wrong. Sorry if that hurt you. Cheers.
