The Web Scraping Club
Subscribe
Sign in
Home
Consulting
Proxy pricing benchmark
The Lab
Club Deals
Archive
About
The Lab
Latest
Top
Discussions
THE LAB #80: Scraping food delivery data
Use both the website and the mobile apps to get data from food and grocery delivery data
Apr 3
•
Pierluigi Vinciguerra
1
Share this post
The Web Scraping Club
THE LAB #80: Scraping food delivery data
Copy link
Facebook
Email
Notes
More
THE LAB #79: Use Cursor as web scraping assistant with MCP servers
Add MCP Servers to Cursor for increasing our web scraping capabilities
Mar 21
•
Pierluigi Vinciguerra
11
Share this post
The Web Scraping Club
THE LAB #79: Use Cursor as web scraping assistant with MCP servers
Copy link
Facebook
Email
Notes
More
THE LAB #78: Building a Web Scraping Knowledge Assistant with RAG - Part2
Optimizing the content storage and creating a CLI for our assistant
Mar 7
•
Pierluigi Vinciguerra
6
Share this post
The Web Scraping Club
THE LAB #78: Building a Web Scraping Knowledge Assistant with RAG - Part2
Copy link
Facebook
Email
Notes
More
THE LAB #77: Building a Web Scraping Knowledge Assistant with RAG
How to include scraped data in your AI assistant
Feb 27
•
Pierluigi Vinciguerra
12
Share this post
The Web Scraping Club
THE LAB #77: Building a Web Scraping Knowledge Assistant with RAG
Copy link
Facebook
Email
Notes
More
THE LAB #76: Bypassing Kasada With Open Source Tools In 2025
How to bypass Kasada protected websites without paying a cent
Feb 13
•
Pierluigi Vinciguerra
5
Share this post
The Web Scraping Club
THE LAB #76: Bypassing Kasada With Open Source Tools In 2025
Copy link
Facebook
Email
Notes
More
THE LAB #75: Building self healing scrapers with AI
How can we use LLMs to analyze HTML and fix our web scrapers?
Feb 6
•
Pierluigi Vinciguerra
7
Share this post
The Web Scraping Club
THE LAB #75: Building self healing scrapers with AI
Copy link
Facebook
Email
Notes
More
THE LAB #74: Running scrapers on GitHub Actions
Save money and time by using GitHub infrastructure for running your scrapers
Jan 30
•
Pierluigi Vinciguerra
7
Share this post
The Web Scraping Club
THE LAB #74: Running scrapers on GitHub Actions
Copy link
Facebook
Email
Notes
More
THE LAB #73: How to Bypass Cloudflare in 2025
Scraping websites protected by Cloudflare bot protection with open source tools
Jan 23
•
Pierluigi Vinciguerra
13
Share this post
The Web Scraping Club
THE LAB #73: How to Bypass Cloudflare in 2025
Copy link
Facebook
Email
Notes
More
4
THE LAB #72: Advanced logging in Playwright
RabbitMQ, screenshots and system monitoring for your Playwright scrapers
Jan 10
•
Pierluigi Vinciguerra
6
Share this post
The Web Scraping Club
THE LAB #72: Advanced logging in Playwright
Copy link
Facebook
Email
Notes
More
THE LAB #71: Sending Scrapy logs to RabbitMQ
Saving the logs of your distributed scraping architecture to your database
Dec 19, 2024
•
Pierluigi Vinciguerra
1
Share this post
The Web Scraping Club
THE LAB #71: Sending Scrapy logs to RabbitMQ
Copy link
Facebook
Email
Notes
More
THE LAB #70: Advanced logging in Scrapy
How to extract the most meaningful metrics from your scrapers
Dec 12, 2024
•
Pierluigi Vinciguerra
5
Share this post
The Web Scraping Club
THE LAB #70: Advanced logging in Scrapy
Copy link
Facebook
Email
Notes
More
THE LAB #69: Building a dashboard for your scrapers with Grafana
Visualizing the operations of your Scrapy spider with interactive dashboards
Dec 5, 2024
•
Pierluigi Vinciguerra
8
Share this post
The Web Scraping Club
THE LAB #69: Building a dashboard for your scrapers with Grafana
Copy link
Facebook
Email
Notes
More
THE LAB #68: Scheduling Scrapers with Airflow
How to manage a fleet of scrapers with Apache Airflow
Nov 28, 2024
•
Pierluigi Vinciguerra
4
Share this post
The Web Scraping Club
THE LAB #68: Scheduling Scrapers with Airflow
Copy link
Facebook
Email
Notes
More
1
THE LAB #67: Scraping Telegram using its APIs
How to create a bot for scraping Telegram channels
Nov 21, 2024
•
Pierluigi Vinciguerra
10
Share this post
The Web Scraping Club
THE LAB #67: Scraping Telegram using its APIs
Copy link
Facebook
Email
Notes
More
THE LAB #66: How to properly scrape a booking website
Business logic and best practices for scraping booking websites like Airbnb and Booking in the most efficient way
Nov 7, 2024
•
Pierluigi Vinciguerra
3
Share this post
The Web Scraping Club
THE LAB #66: How to properly scrape a booking website
Copy link
Facebook
Email
Notes
More
THE LAB #65: Scraping Datadome protected websites with Camoufox
Discovering the features of Camoufox, a custom and stealthy version of Firefox
Oct 24, 2024
•
Pierluigi Vinciguerra
8
Share this post
The Web Scraping Club
THE LAB #65: Scraping Datadome protected websites with Camoufox
Copy link
Facebook
Email
Notes
More
2
THE LAB #64: JWT Tokens and API scraping
How to create scrapers that use token authentication for API data retrieval
Oct 17, 2024
•
Pierluigi Vinciguerra
3
Share this post
The Web Scraping Club
THE LAB #64: JWT Tokens and API scraping
Copy link
Facebook
Email
Notes
More
1
THE LAB #63: Oxymouse and Playwright for human-like mouse movements
Testing the new Oxylabs open source package for human-like mouse movements
Oct 4, 2024
•
Pierluigi Vinciguerra
2
Share this post
The Web Scraping Club
THE LAB #63: Oxymouse and Playwright for human-like mouse movements
Copy link
Facebook
Email
Notes
More
THE LAB #62: Bypassing Cloudflare with Nodriver
Testing the undetected-chromedriver successor for scraping Cloudflare protected websites
Sep 26, 2024
•
Pierluigi Vinciguerra
2
Share this post
The Web Scraping Club
THE LAB #62: Bypassing Cloudflare with Nodriver
Copy link
Facebook
Email
Notes
More
THE LAB #61: Evaluating your proxy provider
Measuring programmatically the quality of the IPs offered by proxy providers
Sep 12, 2024
•
Pierluigi Vinciguerra
9
Share this post
The Web Scraping Club
THE LAB #61: Evaluating your proxy provider
Copy link
Facebook
Email
Notes
More
THE LAB #60: Writing scrapers with LLMs
Comparing LLama3.1, GPT4 and Mistral in creating scrapers
Sep 6, 2024
•
Pierluigi Vinciguerra
1
Share this post
The Web Scraping Club
THE LAB #60: Writing scrapers with LLMs
Copy link
Facebook
Email
Notes
More
The Lab #59: Bypassing certificate pinning with Frida and Fiddler - part 2
Rooting a virtual mobile Android Device to install Frida and discover API endpoints under the hood of apps
Aug 15, 2024
•
Pierluigi Vinciguerra
6
Share this post
The Web Scraping Club
The Lab #59: Bypassing certificate pinning with Frida and Fiddler - part 2
Copy link
Facebook
Email
Notes
More
The Lab #58: Intercepting traffic from an App - part 1
Discover API endpoints called by an App to scrape its data
Aug 9, 2024
•
Pierluigi Vinciguerra
6
Share this post
The Web Scraping Club
The Lab #58: Intercepting traffic from an App - part 1
Copy link
Facebook
Email
Notes
More
4
The Lab #57: Improving your Playwright scraper and avoid CDP detection
How to use the latest advancements to avoid CDP detection in your Playwright scrapers
Jul 26, 2024
•
Pierluigi Vinciguerra
5
Share this post
The Web Scraping Club
The Lab #57: Improving your Playwright scraper and avoid CDP detection
Copy link
Facebook
Email
Notes
More
3
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts