double metaphone for non-English languages

1 message · Dimitri Liakhovitski

Hello!

I am processing some strings using the package "PGRdup".
I am using its function 'DoubleMetaphone' to generate double metaphone
codes for the strings, so that I can match words that sound alike even
when they are spelled differently.

However, I have some French and German strings and discovered that
DoubleMetaphone can't process non-ASCII characters (such as umlauts in
German or accents in French).

What would you recommend I use instead?
Should I use 'phonetic' from the stringdist package?


Thank you!