Discussion about this post

User's avatar
Neural Foundry's avatar

This is a really solid breakdown of Scrapling's capabilites! The Cloudflare bypass section is particularly intresting because most libraries just throw their hands up when they hit Turnstile. The fact that Scrapling can solve it in headless mode using Camoufox is impressive since that's usually where most stealth approaches break down. I'm curious about the long term maintainability though, given that Cloudflare is constantly evolving their detection methods. Does the library have a regular update cycle to keep pace with those changes, or does the fingerprint spoofing aproach provide enough flexibility that it stays ahead naturally?

shivham's avatar

I really liked your suggestion.

I have tried Option 2 and used the web unblocker API. They are working and allowing me to scrape data that doesn't require a login, protecting me from hitting a login wall. However, to get complete and accurate data, we need to log in first to see the real and full information.

I am not in favor of using the LinkedIn API from the provider. It is costly and does not yield good results for my requirements. I am more inclined to develop my own LinkedIn solution.

Your valuable input needed. Thank you

Also, where can I find the Discord community link?

10 more comments...

No posts

Ready for more?