Navigate:
PyMuPDF
~$PYMUP0.8%

PyMuPDF: Python library for PDF document processing

Python bindings for MuPDF document processing library.

LIVE RANKINGS • 10:20 AM • STEADY
OVERALL
#127
18
DEVELOPER TOOLS
#22
6
30 DAY RANKING TREND
ovr#127
·Devel#22
STARS
9.1K
FORKS
692
7D STARS
+74
7D FORKS
+1
See Repo:
Share:

Learn more about PyMuPDF

import fitz\ndoc = fitz.open("document.pdf")\nprint(doc.page_count)

PyMuPDF

1

MuPDF-based architecture

Built as Python bindings to MuPDF, a lightweight C toolkit maintained by Artifex Software. This approach provides direct access to a mature rendering engine without reimplementing PDF parsing logic in Python.

2

Multi-format support

Handles PDF, XPS, EPUB, and other document formats through a single API. The underlying MuPDF engine provides native support for these formats rather than relying on format-specific libraries.

3

Optional feature extensibility

Core functionality requires no external dependencies, while optional features like font subsetting (fontTools) and OCR (Tesseract) can be added independently. This allows users to install only the capabilities they need.


import pymupdf

doc = pymupdf.open("report.pdf")
page = doc[0]
text = page.get_text()
print(text)
doc.close()

v1.26.7

PyMuPDF 1.26.7 updates to MuPDF-1.26.12 and improves security of the embed-extract command.

  • Use MuPDF-1.26.12.
  • Retrospectively mark #4756 as fixed in 1.26.6.
  • Improved safety of pymupdf embed-extract. This now refuses to write to unsafe locations.
v1.26.6

PyMuPDF 1.26.6 updates to MuPDF-1.26.11, adds Python 3.14 support, drops Python 3.9, and fixes several reported issues.

  • Use MuPDF-1.26.11.
  • Supported Python versions are now 3.10-3.14.
  • Fixed #4699, #4712
v1.26.5

PyMuPDF 1.26.5 updates to MuPDF-1.26.10, expands Python support to 3.9-3.14, and includes multiple bug fixes and code improvements.

  • Use MuPDF-1.26.10.
  • Fixed #2883, #4507, #4613


[ EXPLORE MORE ]

Related Repositories

Discover similar tools and frameworks used by developers