The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.
Open Source Enterprise Search meets Open Source Enterprise Content Management System.
A TYPO3 extension that integrates the Apache Solr enterprise search server with TYPO3.
Features include
* User Access Groups Support
* Multi Language Handling
* File Indexing
* Facetting & Filters
*
... [More] Sorting
* Field Boosting
* Spellchecking
* Search Word Highlighting
* Auto Suggest
* Multisite Support
* Advanced Templating Engine
* Index Reports [Less]
Apache Tika for TYPO3 offers several services to extract meta data and content from files. The extension also comes with a service to detect the language of a text (requires Tika 0.8+).
EXT:tika can use either a locally available Tika CLI app or a remote Apache Solr server.
The provided
... [More] services can then be used by other extensions like EXT:dam or EXT:solr for example. [Less]
This site uses cookies to give you the best possible experience.
By using the site, you consent to our use of cookies.
For more information, please see our
Privacy Policy