Menu ☰
← Back to Open Source

πŸ™ GitHub Detail

C

apify/crawlee

By apify

Crawleeβ€”A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

GitHub TypeScript Apache License 2.0 Updated 29 May 2026

Live Snapshot

⭐

Stars

23,550

🍴

Forks

1,406

πŸ“„

License

Apache License 2.0

🧩

Type

TypeScript

πŸ“˜

About this open-source project

Live information fetched from GitHub.

Crawleeβ€”A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

🌿

Default Branch

master

🐞

Open Issues

176

πŸ‘€

Watchers

23,550