PDF Miner Pdfminer.six is a python package for extracting information from PDF documents. Check out the source on github. Stack Overflow examples