|Initial release||March 4, 2005|
|Stable release||0.62.0 / December 3, 2017|
|Operating system||Linux, Unix, BSD, Windows|
|License||GPLv2 or GPLv3|
Poppler is a free software utility library for rendering Portable Document Format (PDF) documents. Its development is supported by freedesktop.org. It is commonly used on Linux systems, and is used by the PDF viewers of the open source GNOME and KDE desktop environments.
The project was started by Kristian Høgsberg with two goals: to provide PDF rendering functionality as a shared library, so that maintenance effort is centralized, and to go beyond the goals of Xpdf by integrating with functionality provided by modern operating systems.
As of the version 0.18 release in 2011, the Poppler library represents a complete implementation of ISO 32000-1, the PDF format standard, and is the first major free PDF library to support the standard's forms (AcroForms only, not full XFA forms) and annotation features.
Numerous free software applications use Poppler to render PDF documents.
Poppler can use two back-ends for drawing PDF documents: Cairo and Splash. Its features may depend on which back-end it employs. A third back-end, based on Qt4's painting framework "Arthur", is available but is incomplete and no longer under active development. Bindings exist for GLib, Qt3, and Qt4 that provide interfaces to the Poppler back-ends, although the Qt3 and Qt4 bindings support only the Splash back-end. A patch set is available that adds support for the Cairo back-end to the Qt4 bindings, but the Poppler project does not currently wish to integrate the feature into the library proper.
Some characteristics of the back-ends include:
Poppler comes with a text-rendering back-end as well, which can be invoked from the command-line utility pdftotext. It is useful for searching for strings in PDFs from the command line, using the utility grep, for instance.
pdftotext file.pdf - | grep string
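The same pipeline can be wrapped in a small shell function to search several PDFs at once. The following is a minimal sketch, not part of Poppler: the function name pdfgrep_sketch is hypothetical, and it assumes that pdftotext and grep are available on the PATH.

```shell
# pdfgrep_sketch PATTERN FILE...
# Hypothetical helper (not shipped with Poppler): for every matching line,
# prints "file:line-number:text".
# Example call: pdfgrep_sketch Poppler report.pdf notes.pdf
pdfgrep_sketch() {
    pattern=$1
    shift
    for pdf in "$@"; do
        # The "-" argument makes pdftotext write the extracted text to
        # standard output; grep -n prefixes each match with its line number
        # within that extracted text (not the PDF page number).
        pdftotext "$pdf" - | grep -n -- "$pattern" |
            while IFS= read -r line; do
                printf '%s:%s\n' "$pdf" "$line"
            done
    done
}
```

The line numbers refer to lines of the extracted text, so they are only a rough guide to where a match sits in the original document.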
poppler-utils is a collection of command-line tools, built on the Poppler library API, for managing PDF files and extracting their contents.
pdfdetach extracts embedded documents from a PDF
pdffonts lists the fonts used in a PDF
pdfimages extracts all embedded images from a PDF at native resolution
pdfinfo lists information about a PDF
pdfseparate extracts single pages from a PDF
pdftocairo converts single pages from a PDF to vector or bitmap formats using Cairo
pdftohtml converts a PDF to HTML format, retaining formatting
pdftoppm converts a PDF page to a bitmap
pdftops converts a PDF to printable PS format
pdftotext extracts all text from a PDF
pdfunite merges several PDFs
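Several of these tools can be combined in scripts. The sketch below is illustrative only: the function names pdf_page_count and pdf_extract_range are hypothetical, and it assumes that pdfinfo prints a "Pages:" line and that pdfseparate accepts the -f and -l options with a %d page pattern, as described in the poppler-utils manual pages.

```shell
# pdf_page_count FILE
# Hypothetical helper: prints the page count reported by pdfinfo.
pdf_page_count() {
    pdfinfo "$1" | awk '/^Pages:/ { print $2 }'
}

# pdf_extract_range INPUT FIRST LAST OUTPUT
# Hypothetical helper: copies pages FIRST..LAST of INPUT into OUTPUT by
# splitting with pdfseparate and recombining with pdfunite.
# Assumes FIRST < LAST, so that pdfunite receives at least two input files.
pdf_extract_range() {
    src=$1
    first=$2
    last=$3
    out=$4
    tmpdir=$(mktemp -d) || return 1

    # Write one file per page; %d is replaced by the page number.
    if ! pdfseparate -f "$first" -l "$last" "$src" "$tmpdir/page-%d.pdf"; then
        rm -rf "$tmpdir"
        return 1
    fi

    # Build the argument list in numeric page order.
    set --
    n=$first
    while [ "$n" -le "$last" ]; do
        set -- "$@" "$tmpdir/page-$n.pdf"
        n=$((n + 1))
    done

    # The last argument to pdfunite is the destination file.
    pdfunite "$@" "$out"
    rc=$?
    rm -rf "$tmpdir"
    return "$rc"
}
```

For example, pdf_extract_range report.pdf 2 4 excerpt.pdf would write pages 2 to 4 of a file named report.pdf to excerpt.pdf. Building the file list explicitly avoids relying on shell glob ordering, which would sort page-10.pdf before page-2.pdf.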