Blogger

Delete comment from: A Dash of Technology

@Jason Adding a word to the set/zset keyed by the metaphone of the word is usually sufficient to get you 80-90% of the way towards great suggestions (set for when you don't care about occurrences, zset when you do). Throwing in 1 or 2 words of context can take you to 90-95%, at the cost of complexity, space, lookup speed, etc., and depending on your corpus, may require throwing in parts of other datasets to be good (like the Google 1T 5-gram corpus). Recently, I've erred on the side of simplicity, so I keep myself from over-optimizing the suggestions (as there are usually much bigger fish to fry).

Jul 6, 2010, 1:14:50 PM


Posted to Building a search engine using Redis and redis-py

Google apps
Main menu