The Web Scraping Club
Subscribe
Sign in
Home
Consulting
Proxy pricing benchmark
The Lab
Club Deals
Archive
About
AI
Latest
Top
Discussions
Using NLP for Entity Extraction From Scraped Data
From theory to practice: how to extract entities from textual scraped data using NLP
Dec 7
•
Federico Trotta
1
Using AI to Detect Patterns in Scraped Data
A practical guide on finding patterns in scraped data with advanced techniques
Nov 23
•
Federico Trotta
1
2
When Browsers Start to Think: ChatGPT Atlas, Stagehand, Cursor, and the Future of Web Scraping
How recent browser integrations with LLMs are changing the way we explore and scrape the web.
Nov 2
•
Pierluigi Vinciguerra
1
THE LAB #84: AI-Driven Web Scraping: OpenAI Codex vs Cursor vs AI Scraping Tools
Is OpenAI Codex the new silver bullet for scraping?
May 22
•
Pierluigi Vinciguerra
4
THE LAB #79: Use Cursor as a web scraping assistant with MCP servers
Add MCP Servers to Cursor for increasing our web scraping capabilities
Mar 21
•
Pierluigi Vinciguerra
12
The Browser Automation Landscape in 2025
How new players and tools are shaping the browser automation and scraping industries
Feb 9
•
Pierluigi Vinciguerra
12
4
THE LAB #75: Building self healing scrapers with AI
How can we use LLMs to analyze HTML and fix our web scrapers?
Feb 6
•
Pierluigi Vinciguerra
8
How AI is changing the web scraping industry
How AI brought to life a new set of tools and services for web scraping
Jan 26
•
Pierluigi Vinciguerra
19
2
2
The Scriptwall: Why Google is hiding its SERP content behind Javascript
What are the implications of this move for the web scraping industry?
Jan 19
•
Pierluigi Vinciguerra
12
AI and data: different faces of the same coin
How the training of LLMs changed the web data industry
Nov 10, 2024
•
Pierluigi Vinciguerra
3
Building a custom GPT using Firecrawl
Create The Web Scraping Club GPT by scraping my own newsletter
Oct 6, 2024
•
Pierluigi Vinciguerra
3
The Oxycon 2024 wrap up
Three key insights from the Oxycon 2024 conference
Sep 29, 2024
•
Pierluigi Vinciguerra
1
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts