Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

notmuch

Compare

  Analyzed 3 months ago

Notmuch is a system for indexing, searching, reading, and tagging large collections of email messages. It uses the Xapian library to provide fast, full-text search of large collection of email with a convenient search syntxs.

40.6K lines of code

22 current contributors

3 months since last commit

23 users on Open Hub

Activity Not Available
4.625
   

djapian

Compare

  Analyzed 9 months ago

If you are using Djapian please tell us about your project in reply to this post Use this package to allow full-text search in your Django project. Versions compatibility matrix: Djapian Django Xapian and python bindings = 2.31.11.0.7 Notice: there is an old issue with Xapian (< ... [More] 1.0.13) in mod_python environment. So be careful. Notice: with 2.2.2 release has been introduced database schema backward-incompatible bug fix - Change model has switched its object_id field type from integer to string. FeaturesMost of this features provided by Xapian itself and Djapian in this case plays role only as Django-compatible adaptation. High-level DSL for indexer declaration Result filtering with Django ORM like API Result set compatible with standard Django Paginator Indexing of field, method results and related model attributes Entry filtering before indexing (by trigger function) Results filtering with boolean lookups support Term tagging Spelling corrections Stemming Result ordering by fields Indexers auto discovery Index shell Model changes auto tracking Support for different index spaces Usage exampleAssume that we have this models in our imaginary application: class Person(models.Model): name = models.CharField(max_length=150) def __unicode__(self): return self.name class Entry(models.Model): author = models.ForeignKey(Person, related_name="entries") title = models.CharField(max_length=250) created_on = models.DateTimeField(default=datetime.now) is_active = models.BooleanField(default=True) text = models.TextField() editors = models.ManyToManyField(Person, related_name="edited_entries") def headline(self): return "%s - %s" % (self.author, self.title) def __unicode__(self): return self.titleAnd we want to apply indexing functionality for model Entry. The next step is to create Indexer instance with proper settings. Indexer may look like this: import djapian class EntryIndexer(djapian.Indexer): fields=["text"] tags=[ ("author", "author.name" ), ("title", "title", 3), ("date", "created_on" ), ("active", "is_active" ), ("editors", "editors" ) ] trigger=lambda indexer, obj: obj.is_active djapian.space.add_index(Entry, EntryIndexer, attach_as="indexer")In the django shell create some instances of models: >>> p = Person.objects.create(name="Alex") >>> Entry.objects.create(author=p, title="Test entry", text="Not large text field") >>> Entry.objects.create(author=p, title="Another test entry", is_active=False) >>> Entry.objects.create(author=p, title="Third small entry", text="Some another text") >>> Entry.indexer.update()Thats all! Each Entry instance has been indexed and now ready for search. Let's try: >>> result = Entry.indexer.search('title:entry') >>> len(result), result.count() 2, 2 >>> for row in result: ... row.percent, row.instance.headline() ... 99 Alex - Test entry 98 Alex - Third small entryYou can follow complete Tutorial for study Djapian basics. [Less]

1.96K lines of code

0 current contributors

over 4 years since last commit

2 users on Open Hub

Activity Not Available
5.0
 

Xappy

Compare

  Analyzed 7 months ago

The "xappy" python module is an easy-to-use interface to the Xapian search engine. Xapian provides a low level interface, dealing with terms and documents, but not really worrying about where terms come from, or how to build searches to match the way in which data has been indexed. In contrast ... [More] , "xappy" allows you to design a field structure, specifying what kind of information is held in particular fields, and then uses this field structure to index data appropriately, and to build and perform searches. [Less]

17.6K lines of code

0 current contributors

almost 6 years since last commit

2 users on Open Hub

Activity Not Available
0.0
 

mu maildir utils

Compare

  Analyzed about 2 months ago

E-mail is the 'flow' in the work flow of many people. Consequently, one spends a lot of time searching for old e-mails, to dig up some important piece of information. With people having tens of thousands of e-mails (or more), this is becoming harder and harder. How to find that one e-mail in an ... [More] evergrowing haystack? Enter mu. 'mu' is a set of command-line tools for Linux/Unix that enable you to quickly find the e-mails you are looking for, assuming that you store your e-mails in Maildirs (e-mail directories). [Less]

32.2K lines of code

35 current contributors

2 months since last commit

2 users on Open Hub

Activity Not Available
5.0
 

OpenBib

Compare

  Analyzed 3 months ago

OpenBib is a next generation discovery infrastructure for libraries. On the frontend side it features RSS feeds, mashups, tagging, tag clouds, use analyses, drilldowns as well as catalogue enrichment and much more. The entire infrastructure relies on the central priciples of REST and the Semantic ... [More] Web. A lot of different data sources - ranging from library catalogues to collections to OAI repositories - can be imported in its local data store (PostgreSQL database + Xapian search engine). Any combination of these data sources can be exposed as a separate web portal with minimum work using OO inheritance of templates on different abstraction levels. As a use case see the union catalogue of the University of Cologne, Germany: http://kug.ub.uni-koeln.de/ [Less]

143K lines of code

2 current contributors

3 months since last commit

1 users on Open Hub

Activity Not Available
0.0
 

SFeed

Compare

  Analyzed 8 months ago

SFeed is an RSS and Atom feed reader, with extensive support for searching the archive of feed entries. It is written in python, using the Xapian search engine, the Storm ORM, and Mark Pilgrim's feedparser module.

1.94K lines of code

0 current contributors

over 9 years since last commit

1 users on Open Hub

Activity Not Available
0.0
 

Flax

Compare

  Analyzed almost 6 years ago

Flax is a project to develop an open source enterprise search engine application based on the Xapian search engine library. It also contains a clean-and-simple Python interface suitable for many users of Xapian, built on the standard Xapian Python interface, together with various other add-ons such as performance testing utilities.

12K lines of code

0 current contributors

about 8 years since last commit

1 users on Open Hub

Activity Not Available
3.5
   

perlax

Compare

  No analysis available

Still in its infancy.

0 lines of code

0 current contributors

0 since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: GPL-3.0+

naipax

Compare

  Analyzed 8 months ago

DESCRIPTIONNaipax is a crawler project. Its aim is to be a generic library with which you may be able to create your own crawler applications. AVAILABILITYThere is not any formal release yet. Please check out the source code from the repository svn checkout ... [More] http://naipax.googlecode.com/svn/trunk/ naipaxINSTALLATION./configure make make installTHE NAIPAX APPWhen you type 'make install', a binary executable naipax is installed. It is a simple crawler application, and it serves no more than demonstration purposes for now. You may modify app/naipax/naipax.cpp to meet your own requirements. Start up the naipax appnaipax config_fileAnd please see here for an example of configuration file. Command it to crawl web pagesRun this: ./util/feed_url.pl 127.0.0.1 1997 http://slashdot.orgAnd naipax will start to fetch web pages. You may find the helper script here. PREREQUISITESBerkleyDB, pcrecpp, libconfuse, c-ares PLATFORMSNaipax is only tested on Debian GNU/Linux for now. [Less]

10.7K lines of code

0 current contributors

over 9 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 

jaset

Compare

  No analysis available

It's just another search engine tool. The code will come soon.

0 lines of code

0 current contributors

0 since last commit

0 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: GPL-2.0+