Whoosh Python Search Library

Settings | Report Duplicate

12

I Use This!

Activity Not Available

News

Less is more Posted almost 15 years ago by matt One thing that makes me think I'm back on the right track with the CDB format and cursor stuff is that I'm ripping out quite a bit of hard-to-follow code in my experimental branch. Feels good :)
OK, It's not so bad Posted almost 15 years ago by matt OK, so it's not as bad as I thought. First of all, I discovered the CDB database library and tried translating its format into Python. My home-grown on-disk associative array format is basically an ordered linear list, but chunked up and written ... [More]
Python performance is a fool's errand Posted almost 15 years ago by matt I'm beginning to feel like striving for performance in a Python library is counter-productive. Every thing you do to eek out performance warps the code, making it less clear. And it seems like every time you try to make something faster, it actually ... [More]
Whoosh 0.1.11 will require reindexing Posted about 15 years ago by matt The latest PyPI release of Whoosh (0.1.11) contains changes to the index format (I changed to a sane storage format for the per-document field lengths) that are not backward compatible. You will need to recrease any existing indexes with the new version. Sorry for the inconvenience.
Tradeoffs of additional speed Posted about 15 years ago by matt As a followup to my last post, I've been thinking about how speeding up Whoosh using array.tofile() and fromfile() raises a conflict between two of my goals for Whoosh. Whoosh is supposed to be a fast (for Python) search library. But I also ... [More]
Speed improvements coming Posted about 15 years ago by matt The great perversion of trying to write high-performance code in Python is that it has almost nothing to do with being clever. It really comes down to writing your program in such as way that it touches as much C code as possible. Some things you ... [More]
The Future Posted about 15 years ago by matt Well, that was interesting. Whoosh got noticed in a few places, and a few people are playing with it. Now I need to work on three things. Documentation Finishing features Exploring performance Documentation is what I'm going to be focusing on ... [More]
Anonymous access is fixed Posted about 15 years ago by matt Yay! Anonymous access to the whoosh repository is fixed.
Murphy's Library Posted about 15 years ago by matt To summarize the initial release of Whoosh, Day Two: I screwed up setup.py and uploaded incomplete distributions. Twice. Anonymous access isn't working for the Subversion repository. I've got a question in to my service provider about it, but until ... [More]
The Whoosh search blog Posted about 15 years ago by matt The trials, tribulations, and triumphs of one man's journey to create a kick-ass Python search engine library.

Edit News Feeds

←
1
2
→