🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: MinerU

Page 1

Showing 10 results from 54

opendatalab/MinerU

GitHub Python Other

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

★ 65,603 Forks 5,533 opendatalab Updated 29 May 2026

xerrors/Yuxi

GitHub Python MIT License

A multi-tenant agent harness platform that combines knowledge base and knowledge graph management. It integrates a LightRAG knowledge base and knowledge graphs. Built with LangChain + Vue + FastAPI, with support for DeepAgents, MinerU PDF, Neo4j, and MCP.

★ 5,318 Forks 747 xerrors Updated 29 May 2026

magicyuan876/mineru-tianshu

GitHub Python Apache License 2.0

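Tianshu (天枢) - Enterprise-grade, one-stop AI data preprocessing platform | PDF/Office to Markdown | AI assistant integration via the MCP protocol | Vue3 + FastAPI full-stack solution | Document parsing | Multimodal information extraction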

★ 655 Forks 94 magicyuan876 Updated 29 May 2026

opendatalab/MinerU-Diffusion

GitHub Python MIT License

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

★ 599 Forks 35 opendatalab Updated 28 May 2026

opendatalab/MinerU-Document-Explorer

GitHub TypeScript MIT License

Agent-native knowledge engine with MCP tools for document indexing, wiki organization, fast retrieval and deep reading across PDF/DOCX/PPTX/Markdown

★ 568 Forks 61 opendatalab Updated 29 May 2026

liuhuapiaoyuan/MinerU-webui

GitHub Python

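MinerU is an open-source, high-quality PDF parsing tool based on deep learning. It automatically extracts text, tables, images, formulas, and other content from PDF documents, and provides analysis, statistics, search, and related features. This project provides a simplified WebUI for it, so users can upload PDF files and see the extraction results in real time.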

★ 319 Forks 42 liuhuapiaoyuan Updated 22 May 2026

opendatalab/MinerU-HTML

GitHub Python Apache License 2.0

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.

★ 248 Forks 24 opendatalab Updated 28 May 2026

1sdv/lunwentocode

GitHub Python

Converts papers (in Markdown format) into runnable Python code (Mineru converts a PDF to a Markdown file in one click).

★ 179 Forks 5 1sdv Updated 25 May 2026

JuniverseCoder/MinerU2PPT

GitHub Python

High-fidelity tool to convert PDF files and images into fully editable PowerPoint presentations (.pptx). Unlike basic converters, it uses AI-powered structure extraction (MinerU) to accurately reconstruct text, layouts, and images, ensuring the output is not just a screenshot but a professional, ...

★ 178 Forks 16 JuniverseCoder Updated 28 May 2026

lpdswing/mineru-web

GitHub Python GNU Affero General Public License v3.0

A user-friendly PDF-to-Markdown conversion tool based on Mineru.

★ 143 Forks 28 lpdswing Updated 22 May 2026
Page 1 of 6

10 results on this page · 54 total found

Showing first 54 accessible GitHub results.
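
The page counts above (10 results per page, 54 total, 6 pages) follow from simple ceiling division. The sketch below illustrates that arithmetic only. The function names `page_count` and `page_bounds` are hypothetical and are not taken from the site's own code, and the page size of 10 is assumed from the text above.

```python
def page_count(total: int, per_page: int = 10) -> int:
    """Number of pages needed to show `total` results (ceiling division)."""
    if per_page <= 0:
        raise ValueError("per_page must be positive")
    return (total + per_page - 1) // per_page


def page_bounds(page: int, total: int, per_page: int = 10) -> tuple[int, int]:
    """Return 0-based (start, end) slice indices for a 1-based page number."""
    pages = page_count(total, per_page)
    # Allow page 1 even when there are no results, so an empty list can render.
    if not 1 <= page <= max(pages, 1):
        raise ValueError(f"page must be between 1 and {max(pages, 1)}")
    start = (page - 1) * per_page
    end = min(start + per_page, total)  # the last page may be shorter
    return start, end


print(page_count(54))      # 6
print(page_bounds(1, 54))  # (0, 10)
print(page_bounds(6, 54))  # (50, 54)
```

Under these assumptions, the sixth and last page would hold the remaining 4 of the 54 results.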