I Use This!
Activity Not Available
Analyzed about 1 year ago. based on code collected about 1 year ago.

Project Summary

IntroductionLarge libraries often contain multiple catalogs, digital repositories, and other data sources. Generally, each of these must be searched independently or through federated search systems. The goal of Meercat is to provide a metadata harvesting and management system that can maintain up to date copies of the metadata in one location so that it can either be harvested by another service or used directly for discovery and retrieval of detailed resource metadata. The open source Lucene extension, Solr, is used to facilitate discovery and there is a REST interface to access more detailed information on resources.

TechnologiesLanguagesThe core system is written in Python. Harvesters and storage are independent of any runtime system, but the jobs and scheduling system requires Twisted. XSLT is used to transform chunks of metadata from one format to another.

HarvestersCurrent harvester sources implemented are Voyager ILS catalogs, SFX electronic resources, and metalib databases. We plan on adding a harvester for OAI-PMH servers. All harvesters implement an API and more can be added and integrated easily as additional Python modules.

Queriable HarvestersQueriable harvesters from data sources such as the Voyager ILS allow Meercat to stay current with circulation information about physical resources. Queriable harvesters are an extension of the base harvester API and add the ability to incrementally harvest resources and to harvest only resources that have been modified in a certain time frame.

Solr (Search Indexing)Apache Solr is used to facilitate discovery. Simple metadata such as title, creator and description are indexed directly in Solr while some complex data such as location and status are reduced to simple fields that can be indexed by Solr for faceting and filtering of search results. The metadata is transformed using a MapReduce framework in Twisted, an asynchronous, multi-threaded Python library.

Top Level ComponentsMeercat is comprised of reusable Python packages that can be replaced or upgraded independently of the rest of the system. The core package types are:

meercat meercat.harvester meercat.job meercat.server meercat.solr meercat.storage meercat.ui

Related ProjectsOther projects that we are of aware of that are looking at library resource discovery are: Extensible Catalog Blacklight VuFind


catalog code4lib discovery electronicresources library metadata metalib oai pmh python sfx solr sqlalchemy twisted voyager

In a Nutshell, meercat...

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    Black Duck offers a free trial so you can discover if there are open source vulnerabilities in your code
  • ...
    data presented on the Open Hub is available through our API
  • ...
    65% of companies leverage OSS to speed application development in 2016
  • ...
    learn about Open Hub updates and features on the Open Hub blog

30 Day Summary

Mar 16 2016 — Apr 15 2016

12 Month Summary

Apr 15 2015 — Apr 15 2016