PDFBox
DeveloperApache Software Foundation
Stable release
1.8.x:1.8.17 / 15 September 2022; 3 years ago (2022-09-15)[1]
2.0.x:2.0.32 / 24 July 2024; 22 months ago (2024-07-24)[1]
3.0.x:3.0.3 / 8 August 2024; 22 months ago (2024-08-08)[1]
Written inJava
Operating systemCross-platform
TypePortable Document Format (PDF)
LicenseApache License 2.0
Websitepdfbox.apache.org
RepositoryPDFBox Repository (Mirror)

Apache PDFBox is an open source pure-Java library that can be used to create, render, print, split, merge, alter, verify and extract text and meta-data of PDF files.

Open Hub reports over 11,000 commits (since the start as an Apache project) by 18 contributors representing more than 140,000 lines of code. PDFBox has a well established, mature codebase maintained by an average size development team with increasing year-over-year commits. Using the COCOMO model, it took an estimated 46 person-years of effort.[2]

Structure

edit

Apache PDFBox has these components:

  • PDFBox: the main part
  • FontBox: handles font information
  • XmpBox: handles XMP metadata
  • Preflight (optional): checks PDF files for PDF/A-1b conformity.

History

edit

PDFBox was started in 2002 in SourceForge by Ben Litchfield who wanted to be able to extract text of PDF files for Lucene.[3] It became an Apache Incubator project in 2008, and an Apache top level project in 2009.[4]

Preflight was originally named PaDaF and developed by Atos worldline, and donated to the project in 2011.[5]

In February 2015, Apache PDFBox was named an Open Source Partner Organization of the PDF Association.[6]

See also

edit

References

edit
  1. ^ a b c "Apache PDFBox - Blog". pdfbox.apache.org. Apache Software Foundation. Retrieved 2024-10-30.
  2. ^ "The Apache PDFBox Open Source Project on Open Hub". openhub.net. 2017-03-18. Retrieved 2017-03-18.
  3. ^ Apache PDFBox and FontBox 1.0.0 released, The H Open, 16 February 2010
  4. ^ PDFBox Project Incubation Status
  5. ^ PaDaF Preflight Codebase Intellectual Property (IP) Clearance Status
  6. ^ Apache™ PDFBox™ named an Open Source Partner Organization of the PDF Association, February 3, 2015
edit

📚 Artikel Terkait di Wikipedia

PDF

the GNU General Public License (GPL), version 2 or 3. "The Apache PDFBox project- Apache PDFBox 3.0.0 released". August 17, 2023. Archived from the original

PDF Split and Merge

to edit PDF files and SAMBox, a PDFBox fork. Free and open-source software portal List of PDF software Apache PDFBox "Release 6.0.1". 1 May 2026. Retrieved

JBIG2

are jbig2dec (AGPL), the java-based Apache PDFBox jbig2-imageio (Apache-2), the JavaScript-based jbig2.js (Apache-2), and the decoder by Glyph & Cog LLC

List of Apache Software Foundation projects

format PDFBox: Java based PDF library (reading, text extraction, manipulation, viewer) Mod_perl: module that integrates the Perl interpreter into Apache server

Apache Commons

The Apache Commons is a project of the Apache Software Foundation, formerly under the Jakarta Project. The purpose of the Commons is to provide reusable

List of PDF software

PDF including PDF/A. Apache OpenOffice Draw GNU LGPL Yes Yes Yes Yes PDF import via software, or extensions. Apache PDFBox Apache License 2.0 Yes Yes Unix

Java (programming language)

features, offering an implementation compatible with the standard library (Apache Harmony). The use of Java-related technology in Android led to a legal dispute

Outline of the Java programming language

Java Edition NetBeans Apache Software Foundation – Apache Commons, Apache Maven, Apache Tomcat, Apache Kafka Eclipse Foundation – Adoptium, Eclipse IDE