23
I Use This!
High Activity
Analyzed about 8 hours ago. based on code collected about 10 hours ago.

Project Summary

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.

Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

Tags

apache content java lucene metadata mime parser tika

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

Project Security

Vulnerabilities per Version ( last 10 releases )

Project Vulnerability Report

Security Confidence Index

Poor security track-record
Favorable security track-record

Vulnerability Exposure Index

Many reported vulnerabilities
Few reported vulnerabilities

Did You Know...

  • ...
    in 2016, 47% of companies did not have formal process in place to track OS code
  • ...
    you can subscribe to e-mail newsletters to receive update from the Open Hub blog
  • ...
    nearly 1 in 3 companies have no process for identifying, tracking, or remediating known open source vulnerabilities
  • ...
    check out hot projects on the Open Hub
About Project Security

Languages

Java
82%
XML
16%
13 Other
2%

30 Day Summary

Aug 22 2025 — Sep 21 2025

12 Month Summary

Sep 21 2024 — Sep 21 2025
  • 719 Commits
    Down -513 (41%) from previous 12 months
  • 21 Contributors
    Down -1 (4%) from previous 12 months

Ratings

6 users rate this project:
5.0
 
5.0/5.0
Click to add your rating
  
Review this Project!