Crawlee
A Python library for web scraping and browser automation
Common Product | Programming | python | crawler
Crawlee is a Python library for building reliable web crawlers that extract data for AI, LLM, RAG, and GPT applications. It offers a unified interface for both HTTP and headless browser crawling, scales parallelism automatically based on available system resources, and exposes a clean API built on standard asynchronous IO. Unlike Scrapy, Crawlee supports headless browser crawling natively. The library is written in Python with type hints, which improves the development experience and helps catch errors early. Other features include automatic retries, integrated proxy rotation and session management, configurable request routing, a persistent URL queue, and pluggable storage options.
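To make a few of the listed features concrete (a deduplicated URL queue, bounded parallelism, and automatic retries on top of asynchronous IO), here is a minimal sketch using only the Python standard library. It is not Crawlee's API or implementation: the names `crawl`, `demo`, `Handler`, `max_concurrency`, and `max_retries` are hypothetical, the queue lives in memory rather than being persistent, and the "site" is a fake dictionary so that no network access is needed.

```python
import asyncio
from typing import Awaitable, Callable, Iterable

# Hypothetical handler type: takes a URL and returns newly discovered URLs.
Handler = Callable[[str], Awaitable[Iterable[str]]]


async def crawl(
    start_urls: Iterable[str],
    handler: Handler,
    max_concurrency: int = 4,
    max_retries: int = 2,
) -> dict[str, str]:
    """Process URLs with deduplication, bounded concurrency, and retries.

    Returns a mapping of URL -> "ok" or "failed".
    """
    queue: asyncio.Queue[tuple[str, int]] = asyncio.Queue()
    seen: set[str] = set()
    results: dict[str, str] = {}

    def enqueue(url: str) -> None:
        # Deduplicate so each URL is scheduled at most once.
        if url not in seen:
            seen.add(url)
            queue.put_nowait((url, 0))

    for url in start_urls:
        enqueue(url)

    async def worker() -> None:
        while True:
            url, attempt = await queue.get()
            try:
                for link in await handler(url):
                    enqueue(link)
                results[url] = "ok"
            except Exception:
                if attempt < max_retries:
                    # Put the request back for another attempt.
                    queue.put_nowait((url, attempt + 1))
                else:
                    results[url] = "failed"
            finally:
                queue.task_done()

    # A fixed pool of workers bounds how many requests run at once.
    workers = [asyncio.create_task(worker()) for _ in range(max_concurrency)]
    await queue.join()  # Wait until every queued request has been handled.
    for w in workers:
        w.cancel()
    await asyncio.gather(*workers, return_exceptions=True)
    return results


async def demo() -> dict[str, str]:
    # Fake in-memory "site" (page -> links); purely illustrative data.
    site = {"/": ["/a", "/b"], "/a": ["/b"], "/b": []}
    attempts: dict[str, int] = {}

    async def handler(url: str) -> list[str]:
        attempts[url] = attempts.get(url, 0) + 1
        if url == "/b" and attempts[url] == 1:
            raise RuntimeError("simulated transient failure")
        return site[url]

    return await crawl(["/"], handler)
```

Running `asyncio.run(demo())` crawls the three fake pages; the first attempt on `/b` fails on purpose and is retried. A real crawler library would add what this sketch leaves out, such as persistence of the queue, proxy and session handling, and resource-aware scaling.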
Crawlee Visits Over Time
Monthly Visits: 515580771
Bounce Rate: 37.20%
Pages per Visit: 5.8
Visit Duration: 00:06:42