Discussion about this post

User's avatar
Neural Foundry's avatar

Solid breakdown of the I/O bottleneck. The 6x speedup on just 10 pages really hammers home how much time gets wasted waiting on network calls in sync mode. I've seen similar gains when switching scrapers to async, and the gap only widens as scale increases. One thing that trips people up tho is managing connection pooling properly with aiohttp, since keeping too many concurrent sessions open can actualy become its own bottleneck if you're not careful.

Expand full comment

No posts

Ready for more?