Pdf2Dom pdf2dom

Group net.sf.cssbox
描述 Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.
Packaging jar
Size 60.73 KB
文件 pom jar
网址 http://cssbox.sourceforge.net/pdf2dom
发布时间 2020-01-03 21:18

dependencies

Group Artifact Version
net.sf.cssbox cssbox 4.17
org.apache.pdfbox pdfbox 2.0.18
net.mabboud.fontverter FontVerter 1.2.22
commons-io commons-io 2.6
junit junit 4.13
org.jsoup jsoup 1.12.1
org.hamcrest hamcrest-all 1.3
commons-codec commons-codec 1.13
org.slf4j slf4j-simple 1.7.25
net.mabboud.gfxassert GfxAssert 1.0.4

developers

Radek Burget

licenses

GNU Lesser General Public License 3.0 http://www.gnu.org/licenses/lgpl-3.0.txt
索引仓库
仓库 个数
Central 592045
3856752