Scrapling

License: BSD-3-Clause


Description:

Scrapling is an adaptive Python web scraping framework that offers a Scrapy-like Spider API, dynamic fetchers, proxy rotation, and AI integration. It supports full crawls and session-based requests, and it automatically relocates previously selected elements when a site's markup changes. Its fetchers are built to get past anti-bot measures, and its pipeline is fast and memory-efficient enough for scalable data extraction.

simdjson

License: Apache-2.0, MIT


Description:

simdjson is a high-performance C++ JSON parser that uses SIMD instructions and automatically selects the implementation best suited to the host CPU, allowing it to parse gigabytes of JSON per second. It validates UTF-8 while parsing, provides an On-Demand API, and is available through bindings and ports for multiple other languages.