天枢 - 企业级 AI 一站式数据预处理平台 | PDF/Office转Markdown | 支持MCP协议AI助手集成 | Vue3+FastAPI全栈方案 | 文档解析 | 多模态信息提取
- Updated
Feb 27, 2026 - Python
天枢 - 企业级 AI 一站式数据预处理平台 | PDF/Office转Markdown | 支持MCP协议AI助手集成 | Vue3+FastAPI全栈方案 | 文档解析 | 多模态信息提取
A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
📱 A React app to preview and edit Markdown✍. You can also export it as HTML.
EdgeParse converts any digital PDF into Markdown, JSON (with bounding boxes), HTML, or plain text — deterministically, without a JVM, without a GPU, and with best-in-class accuracy on the 200-document benchmark suite included in this repository.
[Required for large models] Office to Markdown service implementation, based on Microsoft Markitdown.
DocuGenius 是一个专业的 VSCode 插件,专门为使用 AI 编程工具的产品经理设计。它能够将你的 Word、Excel、PowerPoint 和 PDF 文件转换为 AI 友好的 Markdown 格式,让 Trae AI、CodeBuddy、Qoder等 AI 编程工具能够直接理解和处理你的业务文档。
Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.
High-performance Python Excel processing library with advanced conversion capabilities
A URL Fetch Gemini Processor to be used with Gemini's genai-processors
📄 Professional MCP server for converting 29+ file formats to Markdown - Perfect for Claude Desktop and AI workflows!
CV Matcher is a Python-based application that helps analyze resumes and match them against job descriptions. It provides both CLI and server-based interfaces for resume analysis.
simplified and containerized version of MarkItDown running as a FastAPI service, with a RESTful API for file-to-Markdown conversion.
🔒 100% Local RAG pipeline for .md documents — Ollama + SQLite-vec + MarkItDown MCP. Built on Microsoft.Extensions.DataIngestion (.NET 10)
Simple FastAPI wrapper for Document-to-Markdown conversion using Microsoft's MarkItDown library.
A Clean Architecture ASP.NET Core Web API that wraps Microsoft's MarkItDown Python library. Convert PDF, DOCX, PPTX, images, audio, and URLs to Markdown. Features CQRS pattern, FluentValidation, and IIS deployment support.
convert a file to a markdown file
📄 Convert 29+ file formats to clean Markdown using the Model Context Protocol for seamless integration with AI workflows.
Enterprise Knowledge Base Management System with AI-powered document conversion, multi-user collaboration, and admin approval workflow. Supports 12+ file formats including PDF, DOCX, XLSX, images with OCR.
Add a description, image, and links to the markitdown topic page so that developers can more easily learn about it.
To associate your repository with the markitdown topic, visit your repo's landing page and select "manage topics."