PdfBox « PDF file « Java I/O Q&A

1. Check if a pdf file is valid using PdfBox by Apache    stackoverflow.com

I am using PDFBox in Java to extract text from PDF files. Some of the input files provided are not valid, and PDFTextStripper halts on these files. Is there a clean ...
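
One cheap way to filter out obviously broken inputs before they reach PDFTextStripper is to look at the file's framing bytes. The sketch below is not PDFBox's own validation and is not taken from the question; it only assumes that a well-formed PDF starts with the "%PDF-" header and has an "%%EOF" marker within its last 1024 bytes. The class name PdfQuickCheck and the method looksLikePdf are hypothetical names chosen for illustration, and only the Java standard library is used.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of PDFBox: a coarse pre-check on PDF framing bytes.
final class PdfQuickCheck {

    private static final int WINDOW = 1024; // assumed search window for the trailer marker

    /** Returns true if the bytes start with "%PDF-" and contain "%%EOF" near the end. */
    static boolean looksLikePdf(byte[] data) {
        // ISO-8859-1 maps every byte to one char, so binary content cannot break decoding.
        String head = new String(data, 0, Math.min(data.length, WINDOW), StandardCharsets.ISO_8859_1);
        int tailStart = Math.max(0, data.length - WINDOW);
        String tail = new String(data, tailStart, data.length - tailStart, StandardCharsets.ISO_8859_1);
        return head.startsWith("%PDF-") && tail.contains("%%EOF");
    }

    /** Convenience overload that reads the whole file into memory (fine for a sketch). */
    static boolean looksLikePdf(Path file) throws IOException {
        return looksLikePdf(Files.readAllBytes(file));
    }
}
```

A file that passes this check can still be corrupt internally, so the call into PDFBox would normally still be wrapped in a try/catch; the pre-check only rejects files that are clearly not PDFs, such as truncated downloads or HTML error pages saved with a .pdf extension.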

2. Parsing PDF files (especially with tables) with PDFBox    stackoverflow.com

I need to parse a PDF file which contains tabular data. I'm using PDFBox to extract the file text and then parse the result (a String) later. The problem is that ...
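
Once the text has been extracted as a String, one simple way to recover rows and columns is to split on whitespace runs. The sketch below assumes, without confirmation from the question, that the extracted text keeps one table row per line and separates columns with two or more spaces; real PDFBox output may not preserve spacing this way. TableTextSplitter and splitRows are hypothetical names, and only the Java standard library is used.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: turns already-extracted text into rows of cell strings.
final class TableTextSplitter {

    /**
     * Splits text into rows (one per non-blank line) and each row into cells.
     * Assumption: cells are separated by runs of two or more whitespace characters,
     * so a single space inside a cell ("Widget A") is kept.
     */
    static List<List<String>> splitRows(String text) {
        List<List<String>> rows = new ArrayList<>();
        for (String line : text.split("\\R")) {      // \R matches any line break
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue;                             // skip blank lines between rows
            }
            rows.add(Arrays.asList(trimmed.split("\\s{2,}")));
        }
        return rows;
    }
}
```

For example, splitRows("Name   Qty  Price\nWidget A   2  9.99\n") yields two rows of three cells each. If the columns are separated by only a single space, this heuristic cannot tell cell boundaries from word boundaries, and a position-based approach would be needed instead.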

3. Extracting author from PDF file using PDFBox    forums.oracle.com

Hi, I am using PDFBox to write a utility that will extract the author information from a PDF file. The code that I am using is given below:

    PDFParser pdfParser = new PDFParser(inputStream);
    pdfParser.parse();
    PDDocument pdDocument = pdfParser.getPDDocument();
    PDDocumentInformation pdDocumentInfo = pdDocument.getDocumentInformation();
    System.out.println("Author is: " + pdDocumentInfo.getAuthor());

While this does print a value of the author, it is not the same as it ...