opendatalab/MinerU
Other · Python
69.1k
Transforms complex documents such as PDFs and Office files into LLM-ready Markdown/JSON for your agentic workflows.
ai4science, document-analysis, docx, extract-data, layout-analysis, ocr, parser, pdf, pdf-converter, pdf-extractor-llm, pdf-extractor-pretrain, pdf-extractor-rag, pdf-parser, pptx, python, xlsx