Apache Tika

Apache Tika 0.7

  • Software licensing: free software
  • Software size: 1.64MB
  • Software type: foreign software
  • Update time: 2024-09-14
  • Application platform: Mac OS X
  • Software language: English
  • Version: 0.7


Basic introduction
Tika is a toolkit for text extraction. It integrates POI and PDFBox behind a single, unified interface, so callers extract text the same way regardless of the underlying file format. Tika also provides an extension API for adding support for third-party file formats.
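The idea of one interface in front of several format-specific backends, plus a hook for registering new formats, can be sketched in plain Java. This is a minimal illustration only, not Tika's actual API: the names UnifiedExtractorSketch, TextExtractor, ExtractorRegistry, and defaultRegistry are hypothetical, the backends are trivial stand-ins, and dispatching on the file extension is a simplification chosen to keep the example short.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: none of these names belong to Tika's API.
public class UnifiedExtractorSketch {

    /** The single interface that every format-specific backend implements. */
    interface TextExtractor {
        String extract(byte[] content);
    }

    /** Facade: callers use one entry point and the registry picks the backend. */
    static class ExtractorRegistry {
        private final Map<String, TextExtractor> byExtension = new HashMap<>();

        /** Extension point: plug in a backend for an additional file format. */
        void register(String extension, TextExtractor extractor) {
            byExtension.put(extension.toLowerCase(Locale.ROOT), extractor);
        }

        /** Looks up the backend by file extension (case-insensitive) and delegates to it. */
        String extract(String fileName, byte[] content) {
            int dot = fileName.lastIndexOf('.');
            String ext = dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
            TextExtractor extractor = byExtension.get(ext);
            if (extractor == null) {
                throw new IllegalArgumentException("No extractor registered for: " + fileName);
            }
            return extractor.extract(content);
        }
    }

    /** Builds a registry with one stand-in backend for plain text files. */
    static ExtractorRegistry defaultRegistry() {
        ExtractorRegistry registry = new ExtractorRegistry();
        // A real toolkit would delegate to a PDF or Office library at this point.
        registry.register("txt", bytes -> new String(bytes, StandardCharsets.UTF_8).trim());
        return registry;
    }

    public static void main(String[] args) {
        ExtractorRegistry registry = defaultRegistry();
        // A "third-party" format added through the extension point.
        registry.register("csv",
                bytes -> new String(bytes, StandardCharsets.UTF_8).replace(',', ' ').trim());

        System.out.println(registry.extract("notes.txt", " hello ".getBytes(StandardCharsets.UTF_8)));
        System.out.println(registry.extract("data.CSV", "a,b,c".getBytes(StandardCharsets.UTF_8)));
    }
}
```

The caller only ever touches extract(fileName, content); adding a format means registering one more backend, with no change to calling code.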

Tika provides support for the following file formats:

* PDF - via PDFBox
* Microsoft Office formats (MS-*) - via POI
* HTML - uses NekoHTML to normalize non-standard HTML into XHTML
* OpenOffice formats - handled by Tika itself
* Archives - zip, tar, gzip, bzip, etc.
* RTF - handled by Tika itself
* Java class files - class analysis is done with ASM
* Images - metadata extraction only
* XML
