Tags : Browse Projects

Apache Tika

Claimed by Apache Software Foundation. Analyzed about 24 hours ago.

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

392K lines of code

19 current contributors

1 day since last commit

23 users on Open Hub

Very High Activity
5.0

Apache Solr for TYPO3

Analyzed about 23 hours ago

Open Source Enterprise Search meets Open Source Enterprise Content Management System. A TYPO3 extension that integrates the Apache Solr enterprise search server with TYPO3. Features include:

* User Access Groups Support
* Multi Language Handling
* File Indexing
* Faceting & Filters
* Sorting
* Field Boosting
* Spellchecking
* Search Word Highlighting
* Auto Suggest
* Multisite Support
* Advanced Templating Engine
* Index Reports

98.9K lines of code

22 current contributors

15 days since last commit

3 users on Open Hub

Moderate Activity
5.0

Apache Tika for TYPO3

Analyzed about 5 hours ago

Apache Tika for TYPO3 offers several services to extract metadata and content from files. The extension also comes with a service to detect the language of a text (requires Tika 0.8+). EXT:tika can use either a locally available Tika CLI app or a remote Apache Solr server. The provided services can then be used by other extensions, for example EXT:dam or EXT:solr.

4.94K lines of code

2 current contributors

6 months since last commit

1 user on Open Hub

Very Low Activity
5.0