Working with PDFs in Python: Reading and Splitting Pages

This article is the first in a series on working with PDFs in Python: The PDF Document Format Today, the Portable Document Format (PDF) belongs to the most commonly used data formats. In 1990, the structure of a PDF document was defined by Adobe. The idea behind the PDF format is that transmitted data/documents look exactly the same for both parties that are involved in the communication process – the creator, author or sender, and the receiver. PDF is the […]

Read more

Working with PDFs in Python: Adding Images and Watermarks

This article is the second in a series on working with PDFs in Python: Introduction Today, a world without the Portable Document Format (PDF) seems to be unthinkable. It has become one of the most commonly used data formats ever. Up to PDF version 1.4, displaying a PDF document in an according PDF viewer works fine. Unfortunately, the features from the newer PDF revisions, such as forms, are tricky to implement, and still require further work to be fully functional […]

Read more

Working with PDFs in Python: Inserting, Deleting, and Reordering Pages

This article is the third in a series on working with PDFs in Python: Introduction This article is part three of a little series on working with PDFs in Python. In the previous articles we gave an introduction into reading PDF documents using Python. So far you have learned how to manipulate existing PDFs, and to read and extract the content – both text and images. Furthermore, we have discussed splitting documents into its single pages, as well as adding […]

Read more