Apache Tika tika

Parent apache
Group org.apache
描述 Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Packaging jar
Size 188.50 KB
文件 pom jar
网址 http://lucene.apache.org/tika/
发布时间 2008-12-10 06:15

dependencies

Group Artifact Version
commons-lang commons-lang 2.1
commons-logging commons-logging 1.0.4
commons-codec commons-codec 1.3
commons-io commons-io 1.4
pdfbox pdfbox 0.7.3
org.apache.poi poi 3.1-FINAL
org.apache.poi poi-scratchpad 3.1-FINAL
net.sourceforge.nekohtml nekohtml 1.9.9
com.ibm.icu icu4j 3.8
asm asm 3.1
log4j log4j 1.2.14
junit junit 3.8.1

developers

ridabenjelloun Rida Benjelloun
kbennett Keith Bennett
mharwood Mark Harwood
mattmann Chris A. Mattmann NASA Jet Propulsion Laboratory
dmeikle Dave Meikle
siren Sami Siren
jukka Jukka Zitting

licenses

索引仓库
仓库 个数
Central 592045
5089555