The Web Scraping Club

Fine-Tuning LLMs for Industry-Specific Scraping
Beyond General AI: A Deep Dive into Fine-Tuning Language Models for Web Data
Sep 21 • Federico Trotta

THE LAB #93: scraping Booking.com using internal APIs
How to get travel data without a browser
Sep 19 • Pierluigi Vinciguerra
How Airproxy built its 2000 mobile proxies infrastructure from scratch
How to start and scale a mobile proxy factory
Sep 16 • Diego Sinigaglia
Understanding the Role of the X-Forwarded-For Header in Proxies
Learn everything you need to know about the X-Forwarded-For HTTP header
Sep 14 • Antonello Zanini
Implementing Anomaly Detection on Scraped Datasets
From Theory To Practice: A Guide on Anomaly Detection on Scraped Data
Sep 7 • Federico Trotta
Platinum Partner
Discover Decodo Web Scraping API
Meet the Platinum Partner of The Web Scraping Club
Feb 6 • Pierluigi Vinciguerra
The Great Web Unblocker Benchmark - Cloudflare Edition
Testing different web unblockers against Indeed.com
Sep 22, 2024 • Pierluigi Vinciguerra
The Web Unblocker Cost Benchmark
Price comparison between the most well-known web unblockers on the market
Dec 24, 2023 • Pierluigi Vinciguerra
Web Scraping
Fine-Tuning LLMs for Industry-Specific Scraping
Beyond General AI: A Deep Dive into Fine-Tuning Language Models for Web Data
Sep 21 • Federico Trotta
THE LAB #93: scraping Booking.com using internal APIs
How to get travel data without a browser
Sep 19 • Pierluigi Vinciguerra
Understanding the Role of the X-Forwarded-For Header in Proxies
Learn everything you need to know about the X-Forwarded-For HTTP header
Sep 14 • Antonello Zanini
Implementing Anomaly Detection on Scraped Datasets
From Theory To Practice: A Guide on Anomaly Detection on Scraped Data
Sep 7 • Federico Trotta
How to Scrape Booking.com in Python
Automatically retrieve data from Booking.com using a custom Playwright-based scraper
Aug 31 • Antonello Zanini
THE LAB #92: scraping Depop in a cost-effective way
Delegate or in house solution? A cost-wise perspective.
Aug 29 • Pierluigi Vinciguerra
AI
THE LAB #84: AI-Driven Web Scraping: OpenAI Codex vs Cursor vs AI Scraping Tools
Is OpenAI Codex the new silver bullet for scraping?
May 22 • Pierluigi Vinciguerra
THE LAB #79: Use Cursor as a web scraping assistant with MCP servers
Add MCP Servers to Cursor for increasing our web scraping capabilities
Mar 21 • Pierluigi Vinciguerra
The Browser Automation Landscape in 2025
How new players and tools are shaping the browser automation and scraping industries
Feb 9 • Pierluigi Vinciguerra
THE LAB #75: Building self healing scrapers with AI
How can we use LLMs to analyze HTML and fix our web scrapers?
Feb 6 • Pierluigi Vinciguerra
© 2025 Pierluigi