Menu
Back to Open Source

🐙 GitHub Detail

P

PaddlePaddle/PaddleOCR

By PaddlePaddle

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

GitHub Python Apache License 2.0 Updated 30 May 2026

Live Snapshot

Stars

79,010

🍴

Forks

10,521

📄

License

Apache License 2.0

🧩

Type

Python

📘

About this open-source project

Live information fetched from GitHub.

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

🌿

Default Branch

main

🐞

Open Issues

215

👀

Watchers

79,010