The Web Scraping Club
The Lab
The Lab #57: Improving your Playwright scraper and avoiding CDP detection
How to use the latest advancements to avoid CDP detection in your Playwright scrapers
16 hrs ago • Pierluigi Vinciguerra
The Lab #56: Bypassing PerimeterX 3
Testing the latest PerimeterX version by scraping Crunchbase public data
Jul 11 • Pierluigi Vinciguerra
The Lab #55: Checking your browser fingerprint
Understanding your scrapers' browser fingerprint reliability with online tests.
Jul 4 • Pierluigi Vinciguerra
The Lab #54: Scraping from Algolia APIs
Why internal APIs are always the best choice for scraping a website
Jun 21 • Pierluigi Vinciguerra
The Lab #53: Bypassing AWS WAF
Scraping websites protected by AWS WAF using a hybrid approach
Jun 6 • Pierluigi Vinciguerra
The Lab #52: Scraping with LLMs and ScrapeGraphAi - part 1
Are LLMs the Holy Grail for web scraping?
May 30 • Pierluigi Vinciguerra
The Lab #51: APIs with Bearer Token
Scraping data from API endpoints requiring Bearer Token
May 17 • Pierluigi Vinciguerra
Celebrating the 50th article of The Lab series
A brief review of the first 50 episodes of The Lab series
May 9 • Pierluigi Vinciguerra
The Lab #49: Bypassing Cloudflare with open source repositories
And my two cents about these solutions
May 3 • Pierluigi Vinciguerra
The Lab #48: Scraping with AWS Lambda
Using Serverless and Selenium on Lambda for gathering data
Apr 12 • Pierluigi Vinciguerra
The Lab #47: Scraping real-time data with Python
Using WebSocket to scrape data from Bitstamp and Sofascore
Apr 4 • Pierluigi Vinciguerra
The Lab #46: Fingerprint injection in Playwright
A home-made solution to bypass anti-bots by changing your browser fingerprint.
Mar 28 • Pierluigi Vinciguerra