Calculating distance between strings has been a commonly used option for modern
spell checkers and seems to work well at least in English. In English spell correction
word boundary problems such as in the word forgot (forgot ??“ for got), run-ons such as
in form ??“ inform, or short words such as prepositions (to, or, at) are difficult. N-gram
is used to correct context-independent problems. However, Asian languages such as
Japanese and Chinese consist of run-on words, which are context dependent.
Figure 1. Show Damerau??™s simple ???arithmetic at??? system based on deletion, insertion,
transposition, and addition using ORD as example (Damerau, 1964)
0 Moon and Burste n
Copyright ?© 2007, Idea Group Inc. Copying or distributing in print or electronic forms without written permission
of Idea Group Inc. is prohibited.
The latest word processor programs are capable of suggesting a replacement for a
mistyped word. Spell checkers ???know??? how to evaluate distance between a misspelled
word and the words in its files. Words whose evaluated distance is the smallest are
suggested as candidates for replacement.
Pages:
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789