🌍 Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Results are paginated, with 10 results per page.
🔎 Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: MinerU
Page 1
Showing 10 of 54 results
opendatalab/MinerU
GitHub · Python · Other
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
xerrors/Yuxi
GitHub · Python · MIT License
A multi-tenant agent harness platform that combines knowledge base and knowledge graph management. An agent harness that integrates a LightRAG knowledge base and knowledge graphs. Built with LangChain + Vue + FastAPI; supports DeepAgents, MinerU PDF, Neo4j, and MCP.
magicyuan876/mineru-tianshu
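GitHub · Python · Apache License 2.0
Tianshu (天枢): an enterprise-grade, one-stop AI data preprocessing platform | PDF/Office to Markdown | AI assistant integration via the MCP protocol | Vue3 + FastAPI full-stack solution | Document parsing | Multimodal information extraction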
opendatalab/MinerU-Diffusion
GitHub · Python · MIT License
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
opendatalab/MinerU-Document-Explorer
GitHub · TypeScript · MIT License
Agent-native knowledge engine with MCP tools for document indexing, wiki organization, fast retrieval and deep reading across PDF/DOCX/PPTX/Markdown
liuhuapiaoyuan/MinerU-webui
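GitHub · Python
MinerU is an open-source, high-quality PDF parsing tool based on deep learning. It automatically extracts text, tables, images, formulas, and other content from PDF documents, and offers rich analysis, statistics, and search features. This project provides a simplified WebUI for it, so users can upload PDF files and see the extraction results in real time.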
opendatalab/MinerU-HTML
GitHub · Python · Apache License 2.0
MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.
1sdv/lunwentocode
GitHub · Python
Converts papers (in Markdown format) into runnable Python code (MinerU converts the PDF to a Markdown file in one click).
JuniverseCoder/MinerU2PPT
GitHub · Python
High-fidelity tool to convert PDF files and images into fully editable PowerPoint presentations (.pptx). Unlike basic converters, it uses AI-powered structure extraction (MinerU) to accurately reconstruct text, layouts, and images, ensuring the output is not just a screenshot but a professional, ...
lpdswing/mineru-web
GitHub · Python · GNU Affero General Public License v3.0
A user-friendly PDF-to-Markdown conversion tool based on Mineru.
10 results on this page · 54 total found
Showing the first 54 accessible GitHub results.
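
The paging figures reported on this page (10 results per page, 54 results in total) imply a simple page calculation. The sketch below illustrates that arithmetic only; it is not this site's implementation. The names `PAGE_SIZE`, `page_count`, and `page_bounds` are hypothetical, chosen for illustration.

```python
import math

# Assumption: page size taken from the page text ("Every page shows 10 results").
PAGE_SIZE = 10


def page_count(total: int, page_size: int = PAGE_SIZE) -> int:
    """Number of pages needed to show `total` results."""
    return math.ceil(total / page_size)


def page_bounds(page: int, total: int, page_size: int = PAGE_SIZE) -> tuple[int, int]:
    """Zero-based, half-open (start, end) indices for a 1-based page number."""
    last_page = max(page_count(total, page_size), 1)
    if page < 1 or page > last_page:
        raise ValueError(f"page must be between 1 and {last_page}")
    start = (page - 1) * page_size
    end = min(start + page_size, total)  # the last page may be shorter
    return start, end


if __name__ == "__main__":
    total = 54  # from the page: "54 total found"
    print(page_count(total))      # 6
    print(page_bounds(1, total))  # (0, 10)
    print(page_bounds(6, total))  # (50, 54)
```

Under these assumptions, 54 results span six pages, and the final page holds only four results.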