Scrapling: A Complete Hands-On Guide

Antonello Zanini

Oct 12, 2025

Let's discover Scrapling, the Python library that makes web scraping easy and effortless—with practical examples!

Read →

12 Comments

shivham

Jan 20

I really liked your suggestion.

I have tried Option 2 and used the web unblocker API. They are working and allowing me to scrape data that doesn't require a login, protecting me from hitting a login wall. However, to get complete and accurate data, we need to log in first to see the real and full information.

I am not in favor of using the LinkedIn API from the provider. It is costly and does not yield good results for my requirements. I am more inclined to develop my own LinkedIn solution.

Your valuable input needed. Thank you

Also, where can I find the Discord community link?

Reply (1)

Antonello Zanini

Jan 21

You are welcome. Here is the link:

https://discord.com/invite/zpV3UvAhYu

(Regarding scraping data while being logged in, I do not recommend that for legal and ethical reasons.)

Reply (1)

shivham

Jan 22

Hello, I have requested to join the community. Waiting for the acceptance

You're in!

Jan 18Edited

I am starting to use it and trying to parse Link*d*n public data, which does not require a login. But after opening it with StealthySession in the browser, it redirects me to the login page; with other proxies, it bypasses that page and gives me the real HTML.

Reply (1)

Antonello Zanini

Jan 19

Scraping LinkedIn is always tricky, mostly because of the known login wall. You can bypass it on some pages with a simple trick, which I documented in a previous post:

https://substack.thewebscraping.club/p/scraping-linkedin-public-data

Keep in mind that Scrapling's StealthySession only tweaks the automated browser to look less “automated” and more human. The underlying IP is still yours (or one of your servers), so you can still get blocked.

The library itself can’t change your IP address, which is why, when scraping complex sites like LinkedIn, it’s always smart to pair good browser automation with high-quality proxies!

The IP’s location is also extremely important, sometimes even more than the IP’s quality itself. A European website is far more likely to accept a connection from a European IP (even if it comes from a datacenter) than from a very high-quality IP located in Asia or America.

Reply (1)

shivham

Jan 19

Thank you for the reply. I have seen your article, and this is the same way I was scraping LinkedIn jobs with their api and proxies.

But when it comes to scraping profiles, the game is totally different. I have list of profiles and i am trying to scrape their public data which are available without login with the ISP and scrapling. Unfortunately, the login wall prevents me from viewing the data without logging in and redirects me to the login page. Some proxies work with the API provided by the proxy providers. But for the scaling purpose, it does not seem to be a very good solution. Any help or suggestions?

Reply (2)

Antonello Zanini

Jan 20

Also, feel free to join our Discord and ask for help from other web scraping experts!

Antonello Zanini

Jan 20

Yes! Scraping LinkedIn profiles is indeed a different game. In this case, you can either try high-quality residential proxies (e.g., from Decodo, NetNut, etc.) with a retry mechanism for login challenges or go straight with Web Unlocker APIs. If I were you, I’d try the first approach to see if it works. But, realistically, it probably makes more sense to go directly with the second one.

Web unlockers handle all obstacles for you, making scraping much easier. If you don’t need custom data parsing, I’d opt for a LinkedIn Scraping API from a top provider (Bright Data, Apify, etc.) or a Web Unlocker API from providers like Zyte, Decodo, Oxylabs, Bright Data, etc.

Tamas Deak

Oct 18

Hi Karim. amazing work on Scrapling.

Quick question: would you be open to supporting Kameleo as a browser option as well? I can see how nicely you integrated Camoufox, but since it's no longer actively maintained by Daijiro, it may eventually hit limitations.

Kameleo is a paid solution, but that's precisely what allows us to continuously update our browser kernels and evolve our fingerprint masking, so we can reliably stay ahead in the anti-bot space - especially for serious web-scraping use cases.

Happy to chat anytime if you'd be interested in exploring this together.

Comment removed

Oct 31

Comment removed

Reply (2)

Antonello Zanini

Oct 31Edited

Thanks! Scrapling is definitely a fantastic library.

Regarding the maintainability aspects, I leave it to Karim, the author of the library.

Karim Shoair

Oct 31

Hello, Scrapling author here.

Thanks for all your kind words. Yes, I keep maintaining the solver since I released it, and it's actually now working better than when this article was published (updates and all).

The solver has been working since last December, even though I added it to Scrapling in v0.3 (I was using it in my daily job), but this shows that I have been maintaining it for nearly a year now, and I intend to keep doing that with the rest of the library :D

The Web Scraping Club

Scrapling: A Complete Hands-On Guide