User talk:Trey314159/homoglyphHunter.js

Addition

Hi, you might want to add the following to your Latin-to-Cyrillic map: 'ḯ':'ї́', 'Ḯ':'Ї́'. I was using the map to correct Ukrainian words written in the wrong script and just encountered the lowercase letter in a Ukrainian word in Reconstruction:Proto-Slavic/dojiti. — Eru·tuon 23:16, 17 April 2019 (UTC)

Actually, a way to avoid having to add more characters is to canonically decompose the text (NFD), do the replacing, and then recompose (NFC), because that is the normalization used in wikitext. Then all the letters with diacritics could be removed from the map, provided they decompose to a base letter plus combining diacritics. At the moment I can't think of any weird effects that would have in this case. — Eru·tuon 04:52, 18 April 2019 (UTC)
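
A minimal sketch of the decompose, replace, recompose idea might look like the following. The `baseMap` table and the function name `convertViaDecomposition` are hypothetical and only stand in for the script's real map; the one library call used is the standard `String.prototype.normalize`.

```javascript
// Hypothetical base-letter map (Latin -> Cyrillic); not the script's actual table.
const baseMap = {
  'i': '\u0456', // Latin i -> Cyrillic і
  'I': '\u0406', // Latin I -> Cyrillic І
  'a': '\u0430', // Latin a -> Cyrillic а
  'e': '\u0435', // Latin e -> Cyrillic е
  'o': '\u043E', // Latin o -> Cyrillic о
};

function convertViaDecomposition(text) {
  let out = '';
  // NFD splits precomposed letters into a base letter plus combining marks.
  for (const ch of text.normalize('NFD')) {
    // Replace mapped base letters; combining marks pass through unchanged.
    out += Object.prototype.hasOwnProperty.call(baseMap, ch) ? baseMap[ch] : ch;
  }
  // NFC recomposes wherever a precomposed Cyrillic letter exists.
  return out.normalize('NFC');
}
```

With this sketch, Latin ḯ (U+1E2F) decomposes to i plus combining diaeresis plus combining acute, the i is replaced, and recomposition gives Cyrillic ї (U+0457) followed by the combining acute, so no separate map entry is needed for it.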

Yep, there is a catch. The grapheme с̧ (Cyrillic small letter es plus combining cedilla), which would result from the decomposition method, is not canonically equivalent to ҫ (Cyrillic small letter es with descender). Sigh. — Eru·tuon 08:58, 18 April 2019 (UTC)
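
The catch can be reproduced in a few lines, together with one possible workaround: keep a small table of precomposed exceptions that is checked before decomposing. This is only a sketch under assumptions; `baseMap`, `exceptions`, `naiveConvert` and `convertWithExceptions` are hypothetical names, and the ç to ҫ pairing is assumed here to be the mapping the script wants.

```javascript
// Hypothetical tables for illustration; not the script's actual map.
const baseMap = { 'c': '\u0441' };          // Latin c -> Cyrillic с
const exceptions = { '\u00E7': '\u04AB' };  // Latin ç -> Cyrillic ҫ (assumed pairing)

const has = (obj, key) => Object.prototype.hasOwnProperty.call(obj, key);

// Pure decomposition approach: shows the problem.
function naiveConvert(text) {
  let out = '';
  for (const ch of text.normalize('NFD')) {
    out += has(baseMap, ch) ? baseMap[ch] : ch;
  }
  return out.normalize('NFC');
}

// Workaround sketch: look up precomposed exceptions first, then fall back
// to decomposing each remaining code point.
function convertWithExceptions(text) {
  let out = '';
  for (const ch of text.normalize('NFC')) {
    if (has(exceptions, ch)) {
      out += exceptions[ch];
      continue;
    }
    for (const part of ch.normalize('NFD')) {
      out += has(baseMap, part) ? baseMap[part] : part;
    }
  }
  return out.normalize('NFC');
}
```

Here `naiveConvert('\u00E7')` yields Cyrillic с followed by a combining cedilla (U+0441 U+0327), which NFC cannot turn into ҫ (U+04AB) because that letter has no canonical decomposition, whereas `convertWithExceptions('\u00E7')` returns ҫ directly. The cost is that such exceptions still have to be listed by hand.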