Navigate:
All ReposPyMuPDF
~$PYMUPD0.2%

PyMuPDF: Python library for PDF document processing

Python bindings for MuPDF document processing library.

LIVE RANKINGS • 06:51 AM • STEADY
TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100TOP 100
OVERALL
#98
5
DEVELOPER TOOLS
#17
4
30 DAY RANKING TREND
ovr#98
·Devel#17
STARS
8.8K
FORKS
680
DOWNLOADS
7D STARS
+16
7D FORKS
+1
See Repo:
Share:

Learn more about PyMuPDF

import fitz\ndoc = fitz.open("document.pdf")\nprint(doc.page_count)

PyMuPDF

1

MuPDF-based architecture

Built as Python bindings to MuPDF, a lightweight C toolkit maintained by Artifex Software. This approach provides direct access to a mature rendering engine without reimplementing PDF parsing logic in Python.

2

Multi-format support

Handles PDF, XPS, EPUB, and other document formats through a single API. The underlying MuPDF engine provides native support for these formats rather than relying on format-specific libraries.

3

Optional feature extensibility

Core functionality requires no external dependencies, while optional features like font subsetting (fontTools) and OCR (Tesseract) can be added independently. This allows users to install only the capabilities they need.


import pymupdf

doc = pymupdf.open("report.pdf")
page = doc[0]
text = page.get_text()
print(text)
doc.close()

v1.26.6

Drops Python 3.9 support; minimum version is now 3.10. Upgrades to MuPDF 1.26.11 and fixes five reported issues.

  • Pin Python ≥3.10 before upgrading; 3.9 is no longer supported.
  • Review issues #4699, #4712, #4720, #4742, #4746 if you encountered related bugs.
v1.26.5

Drops Python 3.8, adds 3.14 support, removes duplicate Shape class; upgrades to MuPDF 1.26.10 with five bug fixes.

  • Pin Python ≥3.9 and ≤3.14; Python 3.8 is no longer supported.
  • Remove references to pymupdf.utils.Shape (duplicate removed); use pymupdf.Shape directly.
v1.26.4

Maintenance release upgrading to MuPDF 1.26.7 and fixing 11 reported issues; no breaking changes noted.

  • Upgrade to MuPDF 1.26.7 and apply fixes for issues #3806, #4388, #4457, #4462, #4533, #4565, #4571, #4590, #4614, #4639.
  • Use new Page.clip_to_rect() method for clipping operations; experimental Graal support and improved Tesseract data search included.

See how people are using PyMuPDF

Loading tweets...


[ EXPLORE MORE ]

Related Repositories

Discover similar tools and frameworks used by developers