PDFBox PDF processing library v3.0.0 alpha2
PDFBox is a Java library for working with PDF documents. It supports creating new PDF documents, manipulating existing ones, and extracting their content. It also includes several command-line utilities.
Features
Extract text
Extract Unicode text from PDF files.
Split and merge
Split a single PDF into multiple files or merge multiple PDF files.
Fill in forms
Extract data from PDF forms or fill in PDF forms.
Preflight
Validate PDF files against the PDF/A-1b standard.
Print
Print PDF files using the standard Java printing API.
Save as image
Save PDFs as image files, such as PNG or JPEG.
Create PDF
Create PDFs from scratch with embedded fonts and images.
Signing
Digitally sign PDF files.
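As a rough illustration of two of the features listed above (creating a PDF and extracting its text), here is a minimal sketch. It assumes the API of the final PDFBox 3.0.x releases is on the classpath (`Loader.loadPDF`, `Standard14Fonts.FontName`); an alpha build may differ in details such as how the standard fonts are referenced. The class name `PdfBoxSketch` and its helper methods are hypothetical names chosen for this example.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;

import org.apache.pdfbox.Loader;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.PDPage;
import org.apache.pdfbox.pdmodel.PDPageContentStream;
import org.apache.pdfbox.pdmodel.font.PDType1Font;
import org.apache.pdfbox.pdmodel.font.Standard14Fonts;
import org.apache.pdfbox.text.PDFTextStripper;

// Hypothetical example class; assumes the PDFBox 3.0.x API.
public class PdfBoxSketch {

    /** Builds a one-page PDF in memory containing a single line of text. */
    static byte[] createPdf(String message) throws IOException {
        try (PDDocument doc = new PDDocument();
             ByteArrayOutputStream out = new ByteArrayOutputStream()) {
            PDPage page = new PDPage();
            doc.addPage(page);

            // The content stream must be closed before the document is saved.
            try (PDPageContentStream cs = new PDPageContentStream(doc, page)) {
                cs.beginText();
                // Helvetica is a standard 14 font and is not embedded in the file;
                // a TrueType font loaded via PDType0Font.load would be embedded instead.
                cs.setFont(new PDType1Font(Standard14Fonts.FontName.HELVETICA), 12);
                cs.newLineAtOffset(72, 700);
                cs.showText(message);
                cs.endText();
            }

            doc.save(out);
            return out.toByteArray();
        }
    }

    /** Loads a PDF from a byte array and returns its text content. */
    static String extractText(byte[] pdf) throws IOException {
        try (PDDocument doc = Loader.loadPDF(pdf)) {
            return new PDFTextStripper().getText(doc);
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] pdf = createPdf("Hello PDFBox");
        System.out.println(extractText(pdf).trim());
    }
}
```

Working on byte arrays keeps the sketch free of file paths; in practice `doc.save` and `Loader.loadPDF` also accept a `java.io.File`.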