
Yong Zhang's Library tagged toolkit

14 Aug 08

PDFMiner

PDFMiner is a suite of programs that aims to help analyze text data from PDF documents. It includes a PDF parser, a PDF renderer (though only text rendering is supported for now), and a couple of tools for extracting text. Unlike other PDF-related tools, it lets you obtain the exact location of text on a page, as well as other layout information such as font size or font name, which can be useful for analyzing the document.
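To make the value of position and font information concrete, here is a minimal Python sketch of the kind of analysis it enables. It does not call PDFMiner itself: the `TextSpan` record, the `find_headings` and `reading_order` helpers, and the sample values are all hypothetical, standing in for whatever text runs and layout data an extraction tool reports.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class TextSpan:
    """One run of text plus layout data (hypothetical record, not PDFMiner's API)."""
    text: str
    x0: float        # left edge of the bounding box, in page units
    y0: float        # bottom edge
    x1: float        # right edge
    y1: float        # top edge
    font_name: str
    font_size: float


def find_headings(spans, ratio=1.2):
    """Return spans whose font size is at least `ratio` times the median size."""
    if not spans:
        return []
    body_size = median(s.font_size for s in spans)
    return [s for s in spans if s.font_size >= ratio * body_size]


def reading_order(spans):
    """Sort spans top to bottom, then left to right (PDF y grows upward)."""
    return sorted(spans, key=lambda s: (-s.y1, s.x0))


# Made-up sample data for illustration only.
spans = [
    TextSpan("body text one", 72, 650, 300, 662, "Times-Roman", 12.0),
    TextSpan("Introduction", 72, 700, 200, 718, "Helvetica-Bold", 18.0),
    TextSpan("body text two", 72, 630, 300, 642, "Times-Roman", 12.0),
]

for span in reading_order(spans):
    label = "HEADING" if span in find_headings(spans) else "body"
    print(f"{label:8} {span.font_size:5.1f}  {span.text}")
```

The median font size is used as a rough estimate of the body text size, so a span set in noticeably larger type is treated as a likely heading. The 1.2 ratio is an arbitrary assumption and would need tuning for real documents.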

www.unixuser.org/...pdfminer

pdf python toolkit
