-
- Downloads
Per this discussion, here's a patch to implement both levenshtein() and
metaphone() in a contrib. There seem to be a fair number of different approaches to both of these algorithms. I used the simplest case for levenshtein which has a cost of 1 for any character insertion, deletion, or substitution. For metaphone, I adapted the same code from CPAN that the PHP folks did. A couple of questions: 1. Does it make sense to fold the soundex contrib together with this one? 2. I was debating trying to add multibyte support to levenshtein (it would make no sense at all for metaphone), but a quick search through the contrib directory found no hits on the word MULTIBYTE. Should worry about adding multibyte support to levenshtein()? Joe Conway
Showing
- contrib/README 4 additions, 0 deletionscontrib/README
- contrib/fuzzystrmatch/Makefile 41 additions, 0 deletionscontrib/fuzzystrmatch/Makefile
- contrib/fuzzystrmatch/README.fuzzystrmatch 121 additions, 0 deletionscontrib/fuzzystrmatch/README.fuzzystrmatch
- contrib/fuzzystrmatch/fuzzystrmatch.c 631 additions, 0 deletionscontrib/fuzzystrmatch/fuzzystrmatch.c
- contrib/fuzzystrmatch/fuzzystrmatch.h 161 additions, 0 deletionscontrib/fuzzystrmatch/fuzzystrmatch.h
- contrib/fuzzystrmatch/fuzzystrmatch.sql.in 5 additions, 0 deletionscontrib/fuzzystrmatch/fuzzystrmatch.sql.in
Loading
Please register or sign in to comment