Anti-Web Scraping Measures

Discussions center on techniques to detect, block, and deter web scrapers and bots, especially aggressive AI crawlers, including rate limiting, Cloudflare protections, behavioral detection, and challenges in implementation.

πŸš€ Rising 2.4x Security
4,308
Comments
20
Years Active
5
Top Authors
#5383
Topic ID

Activity Over Time

2007
3
2008
22
2009
20
2010
50
2011
77
2012
126
2013
59
2014
124
2015
65
2016
108
2017
166
2018
173
2019
150
2020
193
2021
264
2022
257
2023
399
2024
455
2025
1,499
2026
98

Keywords

e.g AI CloudFlare robots.txt API IP scrapers scraping bot ai cloudflare api detect robots txt robots crawler

Sample Comments

walterbell β€’ Jan 16, 2024 β€’ View on HN

Did they imagine it would slow down scrapers/bots?

b3ing β€’ Jan 4, 2026 β€’ View on HN

Maybe it’s to block ai scrapers

dclowd9901 β€’ Aug 1, 2012 β€’ View on HN

You mean like overloading their servers with needless scraping?

cwbriscoe β€’ Nov 16, 2025 β€’ View on HN

I am not well versed in this problem but can't the web servers rate limit by known IP addresses of these crawler/scrapers?

plumeria β€’ Sep 27, 2014 β€’ View on HN

Does Cloudflare works for preventing scrapers?

tornato7 β€’ Sep 24, 2023 β€’ View on HN

Sounds like they have identified you as a crawler / scraper and don't want another Cambridge analytics incident.

lrvick β€’ Jun 20, 2018 β€’ View on HN

Malicious how? No one is going to stop bots scraping the site so why penalize humans?

amelius β€’ Apr 23, 2018 β€’ View on HN

Wouldn't it be easy for e.g. Facebook to train a classifier on the behavior of your scrapers, and from there block you?

almost_usual β€’ Jan 18, 2018 β€’ View on HN

There is a multi-million dollar industry around blocking scrapers, it's not "just add rate limiting".

unstatusthequo β€’ Oct 21, 2025 β€’ View on HN

Did they just make us all human web scrapers?