METADATA 3.3 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980818283
  1. Metadata-Version: 2.1
  2. Name: PyMuPDF
  3. Version: 1.26.4
  4. Summary: A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  5. Description-Content-Type: text/markdown
  6. Author: Artifex
  7. Author-email: support@artifex.com
  8. License: Dual Licensed - GNU AFFERO GPL 3.0 or Artifex Commercial License
  9. Classifier: Development Status :: 5 - Production/Stable
  10. Classifier: Intended Audience :: Developers
  11. Classifier: Intended Audience :: Information Technology
  12. Classifier: Operating System :: MacOS
  13. Classifier: Operating System :: Microsoft :: Windows
  14. Classifier: Operating System :: POSIX :: Linux
  15. Classifier: Programming Language :: C
  16. Classifier: Programming Language :: C++
  17. Classifier: Programming Language :: Python :: 3 :: Only
  18. Classifier: Programming Language :: Python :: Implementation :: CPython
  19. Classifier: Topic :: Utilities
  20. Classifier: Topic :: Multimedia :: Graphics
  21. Classifier: Topic :: Software Development :: Libraries
  22. Requires-Python: >=3.9
  23. Project-URL: Documentation, https://pymupdf.readthedocs.io/
  24. Project-URL: Source, https://github.com/pymupdf/pymupdf
  25. Project-URL: Tracker, https://github.com/pymupdf/PyMuPDF/issues
  26. Project-URL: Changelog, https://pymupdf.readthedocs.io/en/latest/changes.html
  27. # PyMuPDF
  28. **PyMuPDF** is a high performance **Python** library for data extraction, analysis, conversion & manipulation of [PDF (and other) documents](https://pymupdf.readthedocs.io/en/latest/the-basics.html#supported-file-types).
  29. # Community
  30. Join us on **Discord** here: [#pymupdf](https://discord.gg/TSpYGBW4eq)
  31. # Installation
  32. **PyMuPDF** requires **Python 3.9 or later**, install using **pip** with:
  33. `pip install PyMuPDF`
  34. There are **no mandatory** external dependencies. However, some [optional features](#pymupdf-optional-features) become available only if additional packages are installed.
  35. You can also try without installing by visiting [PyMuPDF.io](https://pymupdf.io/#examples).
  36. # Usage
  37. Basic usage is as follows:
  38. ```python
  39. import pymupdf # imports the pymupdf library
  40. doc = pymupdf.open("example.pdf") # open a document
  41. for page in doc: # iterate the document pages
  42. text = page.get_text() # get plain text encoded as UTF-8
  43. ```
  44. # Documentation
  45. Full documentation can be found on [pymupdf.readthedocs.io](https://pymupdf.readthedocs.io).
  46. # <a id="pymupdf-optional-features"></a>Optional Features
  47. * [fontTools](https://pypi.org/project/fonttools/) for creating font subsets.
  48. * [pymupdf-fonts](https://pypi.org/project/pymupdf-fonts/) contains some nice fonts for your text output.
  49. * [Tesseract-OCR](https://github.com/tesseract-ocr/tesseract) for optical character recognition in images and document pages.
  50. # About
  51. **PyMuPDF** adds **Python** bindings and abstractions to [MuPDF](https://mupdf.com/), a lightweight **PDF**, **XPS**, and **eBook** viewer, renderer, and toolkit. Both **PyMuPDF** and **MuPDF** are maintained and developed by [Artifex Software, Inc](https://artifex.com).
  52. **PyMuPDF** was originally written by [Jorj X. McKie](mailto:jorj.x.mckie@outlook.de).
  53. # License and Copyright
  54. **PyMuPDF** is available under [open-source AGPL](https://www.gnu.org/licenses/agpl-3.0.html) and commercial license agreements. If you determine you cannot meet the requirements of the **AGPL**, please contact [Artifex](https://artifex.com/contact/pymupdf-inquiry.php) for more information regarding a commercial license.