Extract text to page using Python pdfMiner?

I experimented with pyPdf and pdfMiner to extract text from pdf files. I have some unfriendly pdf files that only pdfMiner can extract. I use the code here to extract text for the entire file. However, I would really like to extract page-based text as a function getPage(i).extractText()in pyPdf. Does anyone know how to extract text to a page using pdfMiner?

+5
source share
1 answer
for pageNumber, page in enumerate(PDFDocument.get_pages()):
    if pageNumber == 42:
        #do something with the page

There is a good article here.

+6
source

All Articles