java.lang.Object
  com.swabunga.spell.engine.EditDistance
public class EditDistance
This class is based on the Levenshtein distance algorithm and calculates how similar two words are.
If the words are identical, the distance is 0. The more the words have in common, the lower the distance value.
The distance value is based on how many operations it takes to get from one word to the other. The possible operations are
swapping characters, adding a character, deleting a character, and substituting a character.
The resulting distance is the sum of these operations weighted by their cost, which can be set in the Configuration object.
When there are multiple ways to convert one word into the other, the lowest-cost distance is returned.
Another way to think about this: what is the cheapest sequence of operations that turns the "original" word into
the "similar" word? Each operation has a cost, and these costs are added up to get the distance.
See Also: Configuration.COST_REMOVE_CHAR, Configuration.COST_INSERT_CHAR, Configuration.COST_SUBST_CHARS, Configuration.COST_SWAP_CHARS
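The weighted-cost idea described above can be illustrated with a minimal, self-contained sketch. It is not this class's source code. The class name `WeightedEditDistanceSketch` and its `distance` method are hypothetical, the four costs are passed as plain `int` arguments instead of being read from the Configuration object, and "swapping" is assumed here to mean exchanging two adjacent characters.

```java
// Hypothetical illustration only; not the EditDistance implementation.
final class WeightedEditDistanceSketch {

    // Returns the cheapest total cost of turning `word` into `similar`.
    // The cost parameters stand in for the Configuration cost settings (an assumption).
    static int distance(String word, String similar,
                        int costRemove, int costInsert, int costSubst, int costSwap) {
        int n = word.length();
        int m = similar.length();
        // d[i][j] = cheapest cost of turning the first i chars of word
        // into the first j chars of similar.
        int[][] d = new int[n + 1][m + 1];
        for (int i = 1; i <= n; i++) {
            d[i][0] = d[i - 1][0] + costRemove;   // delete every character
        }
        for (int j = 1; j <= m; j++) {
            d[0][j] = d[0][j - 1] + costInsert;   // insert every character
        }
        for (int i = 1; i <= n; i++) {
            for (int j = 1; j <= m; j++) {
                char a = word.charAt(i - 1);
                char b = similar.charAt(j - 1);
                // Keep the character (cost 0) or substitute it.
                int best = d[i - 1][j - 1] + (a == b ? 0 : costSubst);
                best = Math.min(best, d[i - 1][j] + costRemove);
                best = Math.min(best, d[i][j - 1] + costInsert);
                // Swap of two adjacent characters (assumed meaning of "swap").
                if (i > 1 && j > 1
                        && a == similar.charAt(j - 2)
                        && word.charAt(i - 2) == b) {
                    best = Math.min(best, d[i - 2][j - 2] + costSwap);
                }
                d[i][j] = best;
            }
        }
        return d[n][m];
    }

    public static void main(String[] args) {
        // With every cost set to 1, one swap turns "teh" into "the".
        System.out.println(distance("teh", "the", 1, 1, 1, 1));     // prints 1
        // Identical words have distance 0.
        System.out.println(distance("spell", "spell", 1, 1, 1, 1)); // prints 0
    }
}
```

Raising one cost changes which path is cheapest: in this sketch, with a swap cost of 5 and all other costs at 1, "teh" to "the" costs 2, because two substitutions are cheaper than one swap.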
Field Summary | |
---|---|
static Configuration | config: Fetches the spell engine configuration properties. |
Constructor Summary | |
---|---|
EditDistance()
|
Method Summary | |
---|---|
static int | getDistance(java.lang.String word, java.lang.String similar): Evaluates the distance between two words. |
static int | getDistance(java.lang.String word, java.lang.String similar, int[][] matrix): Evaluates the distance between two words. |
static void | main(java.lang.String[] args): For testing edit distances. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static Configuration config
Constructor Detail |
---|
public EditDistance()
Method Detail |
---|
public static final int getDistance(java.lang.String word, java.lang.String similar)
word - One word to evaluate.
similar - The other word to evaluate.
public static final int getDistance(java.lang.String word, java.lang.String similar, int[][] matrix)
word - One word to evaluate.
similar - The other word to evaluate.
public static void main(java.lang.String[] args) throws java.lang.Exception
args - an array of the two strings whose distance is to be evaluated.
java.lang.Exception - when a problem occurs while reading args.