GitHub Detail
apify/crawlee-python
By apify
Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Live Snapshot
Stars
9,123
Forks
747
License
Apache License 2.0
Type
Python
About this open-source project
Live information fetched from GitHub.
Default Branch
master
Open Issues
75
Watchers
9,123