pymupdf

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Latest version: 1.25.2 registry icon
Maintenance score
100
Safety score
100
Popularity score
100
Check your open source dependency risks. Get immediate insight about security, stability and licensing risks.
Security
  Vulnerabilities
Version Suggest Low Medium High Critical
1.25.2 0 0 0 0 0
1.25.1 0 0 0 0 0
1.25.0 0 0 0 0 0
1.24.14 0 0 0 0 0
1.24.13 0 0 0 0 0
1.24.12 0 0 0 0 0
1.24.11 0 0 0 0 0
1.24.10 0 0 0 0 0
1.24.9 0 0 0 0 0
1.24.8 0 0 0 0 0
1.24.7 0 0 0 0 0
1.24.6 0 0 0 0 0
1.24.5 0 0 0 0 0
1.24.4 0 0 0 0 0
1.24.3 0 0 0 0 0
1.24.2 0 0 0 0 0
1.24.1 0 0 0 0 0
1.24.0 0 0 0 0 0
1.23.26 0 0 0 0 0
1.23.25 0 0 0 0 0
1.23.24 0 0 0 0 0
1.23.23 0 0 0 0 0
1.23.22 0 0 0 0 0
1.23.21 0 0 0 0 0
1.23.20 0 0 0 0 0
1.23.19 0 0 0 0 0
1.23.18 0 0 0 0 0
1.23.17 0 0 0 0 0
1.23.16 0 0 0 0 0
1.23.15 0 0 0 0 0
1.23.14 0 0 0 0 0
1.23.13 0 0 0 0 0
1.23.12 0 0 0 0 0
1.23.11 0 0 0 0 0
1.23.10 0 0 0 0 0
1.23.9 0 0 0 0 0
1.23.9rc2 0 0 0 0 0
1.23.9rc1 0 0 0 0 0
1.23.8 0 0 0 0 0
1.23.7 0 0 0 0 0
1.23.6 0 0 0 0 0
1.23.5 0 0 0 0 0
1.23.4 0 0 0 0 0
1.23.3 0 0 0 0 0
1.23.2 0 0 0 0 0
1.23.2rc1 0 0 0 0 0
1.23.1 0 0 0 0 0
1.23.0rc2 0 0 0 0 0
1.23.0rc1 0 0 0 0 0
1.23.0 0 0 0 0 0
1.22.5 0 0 0 0 0
1.22.3 0 0 0 0 0
1.22.2 0 0 0 0 0
1.22.1 0 0 0 0 0
1.22.0 0 0 0 0 0
1.21.1 0 0 0 0 0
1.21.0 0 0 0 0 0
1.20.2 0 0 0 0 0
1.20.1 0 0 0 0 0
1.20.0 0 0 0 0 0
1.19.6 0 0 0 0 0
1.19.5 0 0 0 0 0
1.19.4 0 0 0 0 0
1.19.3 0 0 0 0 0
1.19.2 0 0 0 0 0
1.19.1 0 0 0 0 0
1.19.0 0 0 0 0 0
1.18.19 0 0 0 0 0
1.18.18 0 0 0 0 0
1.18.17 0 0 0 0 0
1.18.16 0 0 0 0 0
1.18.15 0 0 0 0 0
1.18.14 0 0 0 0 0
1.18.13 0 0 0 0 0
1.18.12 0 0 0 0 0
1.18.11 0 0 0 0 0
1.18.10 0 0 0 0 0
1.18.9 0 0 0 0 0
1.18.8 0 0 0 0 0
1.18.7 0 0 0 0 0
1.18.6 0 0 0 0 0
1.18.5 0 0 0 0 0
1.18.4 0 0 0 0 0
1.18.3 0 0 0 0 0
1.18.2 0 0 0 0 0
1.18.1 0 0 0 0 0
1.18.0 0 0 0 0 0
1.17.7 0 0 0 0 0
1.17.6 0 0 0 0 0
1.17.5 0 0 0 0 0
1.17.4 0 0 0 0 0
1.17.3 0 0 0 0 0
1.17.2 0 0 0 0 0
1.17.1 0 0 0 0 0
1.17.0 0 0 0 0 0
1.16.18 0 0 0 0 0
1.16.17 0 0 0 0 0
1.16.16 0 0 0 0 0
1.16.15 0 0 0 0 0
1.16.14 0 0 0 0 0
1.16.13 0 0 0 0 0
1.16.12 0 0 0 0 0
1.16.11 0 0 0 0 0
1.16.10 0 0 0 0 0
1.16.9 0 0 0 0 0
1.16.8 0 0 0 0 0
1.16.7 0 0 0 0 0
1.16.6 0 0 0 0 0
1.16.5 0 0 0 0 0
1.16.4 0 0 0 0 0
1.16.3 0 0 0 0 0
1.16.2 0 0 0 0 0
1.16.1 0 0 0 0 0
1.16.0 0 0 0 0 0
1.14.21 0 0 0 0 0
1.14.20 0 0 0 0 0
1.14.19 0 0 0 0 0
1.13.20 0 0 0 0 0
1.12.5 0 0 0 0 0
1.11.2 0 0 0 0 0
1.10.0 0 0 0 0 0
1.9.2 0 0 0 0 0

Stability
Latest release:

1.25.2 - This version is safe to use because it has no known security vulnerabilities at this time. Find out if your coding project uses this component and get notified of any reported security vulnerabilities with Meterian-X Open Source Security Platform

Licensing

Maintain your licence declarations and avoid unwanted licences to protect your IP the way you intended.

AGPL-3.0   -   GNU Affero General Public License v3.0

Not a wildcard

Not proprietary

OSI Compliant



PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.