User:Любослов Езыкин/List of the Arabic letters

Table

Preliminary note

  • Every Arabic letter has four presentation forms, chosen according to its position in a text string: isolated, initial, medial and final.
  • Some letters lack initial and medial forms; this is shown with an em dash.
  • Presentation forms of additional letters are written like the forms of the basic letters they derive from.
  • Some letters have particular forms; these are shown below under Special cases.
  • If the first value is shown without a language name, it is used in most languages (listing them all would be impractical). Deviations from the letter's basic value follow.
  • For Arabic, only the values of Modern Standard Arabic are shown, for simplicity's sake. See Varieties of Arabic for the phonology of particular dialects.
  • The Arabic scripts of the Turkic languages of the former USSR were written similarly to Tatar before the 1930s, so they are not listed.
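
The four positional forms are normally selected by the text renderer from the neighbouring characters, not stored as separate code points. As a minimal sketch (the helper name `positional_forms` is hypothetical, and the use of U+200D ZERO WIDTH JOINER to request a joined shape is an assumption about how the renderer behaves), the forms of a letter can be requested like this:

```python
ZWJ = "\u200d"  # ZERO WIDTH JOINER: asks the renderer to join on that side


def positional_forms(letter: str) -> dict:
    """Build strings that request each contextual form of one Arabic letter.

    The strings only *request* a shape; what is actually drawn depends on
    the font and the shaping engine.
    """
    return {
        "isolated": letter,
        "final": ZWJ + letter,         # joined to a preceding letter
        "medial": ZWJ + letter + ZWJ,  # joined on both sides
        "initial": letter + ZWJ,       # joined to a following letter
    }


forms = positional_forms("\u0628")  # ARABIC LETTER BEH
for name, text in forms.items():
    print(name, text, [f"U+{ord(c):04X}" for c in text])
```

For a letter that lacks initial and medial forms, the renderer is expected to fall back to the isolated or final shape when those strings are displayed.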

Basic alphabet

Unicode Character Value
U+0627 ا Arabic /ʔ, aː/, Persian/Dari /ʔ, ɔ, æ, Ø/, Kurdish /ɐ/, Malay /Ø/, Mozarabic /h/, Pashto /ʔ, Ø, ə, ɑ(ː), i, u/, Sindhi /ɑ/, Turkish /ɑ, Ø; o, e/, Urdu /ɑ, ə/, Uyghur /ɑ/, Chinese /ɑ~a/ ⟨a⟩
U+0628 ب /b/, Turkish /b, p/, Chinese /b~p/ ⟨b⟩
U+0629 ة Arabic /-at/
U+062A ت /t/, Chinese /tʰ/ ⟨t⟩
U+062B ث Arabic /θ/, Mozarabic /t/, Chinese /tɕʰ, ɕ/ ⟨q, x⟩, others /s/
U+062C ج /dʒ/, Mozarabic /tʃ/, Tatar /ʓ/, Turkish /dʒ, tʃ/, Chinese /ʈʂ/ ⟨zh⟩
U+062D ح Arabic, Kurdish /ħ/, Bosnian /h, x/; Pashto, Persian/Dari, Kashmiri /h, Ø/; Malay, Mozarabic, Sindhi, Turkish, Urdu /h/, Tatar /x/, Chinese /x/ ⟨h⟩
U+062E خ Arabic /x~χ/, Kashmiri /kʰ/, Malay /x, k/, Turkish /h/, Avar, Dargwa, Lak, Lezghi /χ/, Persian, Urdu & others /x/
U+062F د /d/, Turkish /d, t/, Chinese /t~d, tɕ/ ⟨d, j⟩
U+0630 ذ Arabic /ð/, Malay /dz/, Chinese /ts/ ⟨z⟩, others /z/
U+0631 ر /r/, Chinese /-ɻ/ ⟨-r⟩
U+0632 ز /z/, Malay /z, dʒ/
U+0633 س /s/, Chinese /s, ɕ/ ⟨s, x⟩
U+0634 ش /ʃ/, Chinese /ʂ, ɕ/ ⟨sh, x⟩
U+0635 ص Arabic /sˤ/, Malay /t/, others /s/
U+0636 ض Arabic /dˤ/, Turkish /z, d/, others /z/
U+0637 ط Arabic /tˤ/, Turkish /t, d/, Chinese /ts/ ⟨z⟩, Avar, Dargwa, Lak, Lezghi /tʼ/, others /t/
U+0638 ظ Arabic /zˤ/, Chinese /ts/ ⟨z⟩, others /z/
U+0639 ع Arabic /ʕ/, Persian/Dari, Kashmiri /ʔ, Ø/, Pashto /ʔ, Ø, ɑ/, Malay, Sindhi /Ø/, Urdu /C_ɑ; Ø, ʔ, ə/, Turkish /Ø, ɑ/, Tatar /ɣ/, Chinese /ə~ɤ/ ⟨e⟩
U+063A غ Arabic /ɣ~ʁ/, Persian/Dari /ɣ, q, ɢ, x/, Pashto, Sindhi, Urdu, Tatar, Uyghur /ɣ/, Kashmiri, Bosnian, Hausa /ɡ/, Turkish /ɣ, ɡ, k, h/, Malay /ɣ, r/
U+0641 ف /f/, Kashmiri /f, pʰ/, Malay /f, p/, Hausa /β/
U+0642 ق /q/, Dari, Persian /q, ɢ/, Kashmiri, Malay /k/, Hausa /kʼ/, Chinese /k~ɡ/ ⟨g⟩, Dargwa, Lak, Lezghi /qʼ/
U+0643 ك /k/, Turkish /k, j/
U+06A9 ک Arabic variant of kaf /k/, Sindhi /kʰ/, Persian & others /k/, Chinese /kʰ/ ⟨k⟩
U+0644 ل /l/
U+0645 م /m/, Chinese /m~n/ ⟨m, n⟩
U+0646 ن /n/
U+0647 ه /h/, Persian/Dari /h, Ø, ɛ, æ/, Pashto /h, ɑ, ə, Ø/, Urdu /h, Ø/, Kashmiri /h, ə/, Turkish /ɑ, Ø, i, e/, Chinese /x/ ⟨h⟩
U+0648 و /w, uː/, Persian, Urdu & others /o(ː), u(ː), v, w, ʋ/, Chinese /u, ʊ/ ⟨wu-, u⟩, Turkic also /ø, y/
U+0649 ى Arabic /-aː/
U+064A ي /j, iː/, others /j, i(ː)~ɪ(ː), e(ː)~ɛ(ː)/, Turkic also /ɨ~ʉ~ɯ~ɤ/

Additional letters

Derived from Unicode Letter Value Unicode name Notes
ا U+0773 ݳ Burushaski /a/ ARABIC LETTER ALEF WITH EXTENDED ARABIC-INDIC DIGIT TWO ABOVE [1]
ا U+0774 ݴ Burushaski /aːː/ ARABIC LETTER ALEF WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE [1]
ا U+08AD Tatar /ɑ/ ARABIC LETTER LOW ALEF [2]
ب U+066E ٮ compatibility ARABIC LETTER DOTLESS BEH [3]
ب U+067B ٻ Sindhi, Marwari, Hausa /ɓ/ ARABIC LETTER BEEH [4]
ب U+067E پ Persian & others /p/, Chinese /pʰ/ ⟨p⟩ ARABIC LETTER PEH
ب U+0680 ڀ Sindhi /bʱ/ ARABIC LETTER BEHEH [4]
ب U+0750 ݐ Pulaar (Mali) ⟨ƴ⟩ ARABIC LETTER BEH WITH THREE DOTS HORIZONTALLY BELOW [5]
ب U+0751 ݑ Pulaar /p/ ARABIC LETTER BEH WITH DOT BELOW AND THREE DOTS ABOVE [5]
ب U+0752 ݒ Comoro, Hausa, Pulaar (Senegal) /ɓ/, Sindhi (old) /ʈʰ/ ARABIC LETTER BEH WITH THREE DOTS POINTING UPWARDS BELOW [5] [4]
ب U+0753 ݓ Hausa /c/ ARABIC LETTER BEH WITH THREE DOTS POINTING UPWARDS BELOW AND TWO DOTS ABOVE [5]
ب U+0754 ݔ Pulaar (Mali, Senegal) /ɲ/ ARABIC LETTER BEH WITH TWO DOTS BELOW AND DOT ABOVE [5]
ب U+0755 ݕ Serer /ɓ̥/ ⟨ƥ⟩ ARABIC LETTER BEH WITH INVERTED SMALL V BELOW [5]
ب U+0756 ݖ Songhoy-Zarma, Pulaar-Fulfulde /c/ ARABIC LETTER BEH WITH SMALL V [5]
ب U+08A0 Pulaar, Serer, Balant (Senegal) /ɓ/ ARABIC LETTER BEH WITH SMALL V BELOW [6]
ب U+08A1 Adamawa Fulfulde (Cameroon) /ɓ/ ARABIC LETTER BEH WITH HAMZA ABOVE [6]
ت U+0679 ٹ Balochi, Brahui, Marwari, Kashmiri, Punjabi, Urdu /ʈ/ ARABIC LETTER TTEH
ت U+067A ٺ Sindhi /ʈʰ/, Sindhi (old) /ʈ/ ARABIC LETTER TTEHEH [4]
ت U+067C ټ Pashto, Wakhi /ʈ/ ARABIC LETTER TEH WITH RING [7]
ت U+067D ٽ Sindhi /ʈ/ ARABIC LETTER TEH WITH THREE DOTS ABOVE DOWNWARDS [4]
ت U+067F ٿ Sindhi /tʰ/, Urdu (old) /ʈ/, Chinese /tɕʰ/ ⟨q⟩ ARABIC LETTER TEHEH [8] [4]
ج U+08A2 Serer /ʄ̊/ ARABIC LETTER JEEM WITH TWO DOTS ABOVE [6]
ح U+0681 ځ Pashto, Wakhi /dz/ ARABIC LETTER HAH WITH HAMZA ABOVE
ح U+0682 ڂ Pashto (variant) /dz/ ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE [7]
ح U+0683 ڃ Sindhi /ɲ/, Sindhi (old) /ʄ/, Bosniak /tɕ/, Hausa, Pulaar ⟨ƴ⟩, Wakhi /ɖʐ/, Avar, Dargwa, Lak, Lezghi /tʃʼ/ ARABIC LETTER NYEH [4] [9]
ح U+0684 ڄ Sindhi /ʄ/, Sindhi (old) /tʃʰ/, Bosniak /ts/ ARABIC LETTER DYEH [4]
ح U+0685 څ Pashto /ts/, Pashto (old) /ts~dz/, Avar, Dargwa, Lak, Lezghi /qʰ?/ ARABIC LETTER HAH WITH THREE DOTS ABOVE [7] [9]
ح U+0686 چ /tʃ/, Tatar /ɕ/, Chinese /ʈʂʰ/ ⟨ch⟩, Israel /ɡ/, Maghrebi /ɡ/ ARABIC LETTER TCHEH [10]
ح U+0687 ڇ Sindhi /tʃʰ/, Wakhi /ʈʂ/ ARABIC LETTER TCHEHEH [4]
ح U+06BF ڿ Moplah /ɲ/ ARABIC LETTER TCHEH WITH DOT ABOVE [11]
ح U+0757 ݗ Songhoy (Niger), Pulaar (Mali) /ŋ/ ARABIC LETTER HAH WITH TWO DOTS ABOVE [5]
ح U+0758 ݘ Songhoy-Zarma (Niger) /c/ ARABIC LETTER HAH WITH THREE DOTS POINTING UPWARDS BELOW [5]
ح U+076E ݮ Khowar /ɖʐ/ ARABIC LETTER HAH WITH SMALL ARABIC LETTER TAH BELOW [1]
ح U+076F ݯ Khowar /ʈʂ/ ARABIC LETTER HAH WITH SMALL ARABIC LETTER TAH AND TWO DOTS [1]
ح U+0772 ݲ Torwali /ɖʐ/ ARABIC LETTER HAH WITH SMALL ARABIC LETTER TAH ABOVE [1]
ح U+077C ݼ Burushaski /tsʰ/ ARABIC LETTER HAH WITH EXTENDED ARABIC-INDIC DIGIT FOUR BELOW [1]
د U+0688 ڈ Balochi, Kashmiri, Punjabi, Urdu /ɖ/ ARABIC LETTER DDAL
د U+0689 ډ Pashto /ɖ/ ARABIC LETTER DAL WITH RING [7]
د U+068A ڊ Sindhi, Arwi /ɖ/ ARABIC LETTER DAL WITH DOT BELOW [4]
د U+068B ڋ Saraiki /ɗ/ ARABIC LETTER DAL WITH DOT BELOW AND SMALL TAH
د U+068C ڌ Sindhi /dʰ/, Pulaar (Mali) /nd/ ARABIC LETTER DAHAL [4]
د U+068D ڍ Sindhi /ɖʰ/, Arwi /ʈ/ ARABIC LETTER DDAHAL [4]
د U+068E ڎ Sindhi (old) /ɗ/, Burushaski /??/ ARABIC LETTER DUL [4]
د U+068F ڏ Sindhi, Marwari /ɗ/ ARABIC LETTER DAL WITH THREE DOTS ABOVE DOWNWARDS [4]
د U+0690 ڐ Urdu (old) /ɖ/ ARABIC LETTER DAL WITH FOUR DOTS ABOVE [8]
د U+06EE ۮ Parkari Koli /ɗ/ ARABIC LETTER DAL WITH INVERTED V [12]
د U+0759 ݙ Saraiki /ᶑ/ ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW AND SMALL TAH [5]
د U+075A ݚ Comoro /ɗ/ ARABIC LETTER DAL WITH INVERTED SMALL V BELOW [5]
د U+08AE Tataro-Belarusian /dz/ ARABIC LETTER DAL WITH THREE DOTS BELOW [2]
ر U+0691 ڑ Balochi, Brahui, Marwari, Kashmiri, Punjabi, Urdu /ɽ/ ARABIC LETTER RREH
ر U+0692 ڒ Kurdish (old) /r/ ARABIC LETTER REH WITH SMALL V [13] [14]
ر U+0693 ړ Pashto /ɽ/ ARABIC LETTER REH WITH RING [7]
ر U+0694 ڔ Arwi, Moplah /ɽ/, Kurdish (old) /r/ ARABIC LETTER REH WITH DOT BELOW [13] [11] [14]
ر U+0695 ڕ Kurdish /r/ ARABIC LETTER REH WITH SMALL V BELOW [13] [14]
ر U+0696 ږ Pashto /ʐ/, Wakhi /ɣ/ ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE [7]
ر U+0697 ڗ Avar, Dargwa, Lak, Lezghi /tsʼ/, Moplah /ɖ/ ARABIC LETTER REH WITH TWO DOTS ABOVE [11] [9]
ر U+0698 ژ /ʒ/, Kashmiri /ts/, Moplah /ɭː/, Chinese /ʐ~ɻ/ ⟨r-⟩ ARABIC LETTER JEH [11]
ر U+0699 ڙ Sindhi /ɽ/, Wakhi /ʐ/, Urdu (old) /ɽ/ ARABIC LETTER REH WITH FOUR DOTS ABOVE [8] [4]
ر U+06EF ۯ Parkari Koli /ɭ/ ARABIC LETTER REH WITH INVERTED V [12]
ر U+075B ݛ Kanuri /ɽ/ ARABIC LETTER REH WITH STROKE [5]
ر U+076B ݫ Kalami, Ormuri, Torwali /r, ʑ/ ARABIC LETTER REH WITH TWO DOTS VERTICALLY ABOVE [15]
ر U+076C ݬ Kalami, Ormuri, Torwali /ʑ/ ARABIC LETTER REH WITH HAMZA ABOVE [15]
ر U+0771 ݱ Khowar /ʐ/ ARABIC LETTER REH WITH SMALL ARABIC LETTER TAH AND TWO DOTS [1]
ر U+08AA Rohingya /-r/ ARABIC LETTER REH WITH LOOP [6]
ز U+08B2 Berber /zˤ/ ARABIC LETTER ZAIN WITH INVERTED V ABOVE [16]
س U+069A ښ Pashto /ʂ~x/, Wakhi /x/ ARABIC LETTER SEEN WITH DOT BELOW AND DOT ABOVE [7]
س U+069B ڛ Persian (early) /s/, Avar, Lezghi /ts/ ARABIC LETTER SEEN WITH THREE DOTS BELOW [17] [9]
س U+069C ڜ Moroccan /tʃ/, Wakhi /ʂ/ ARABIC LETTER SEEN WITH THREE DOTS BELOW AND THREE DOTS ABOVE
س U+075C ݜ Shina /ʂ/ ARABIC LETTER SEEN WITH FOUR DOTS ABOVE [5]
س U+076D ݭ Kalami, Ormuri, Torwali /ɕ/ ARABIC LETTER SEEN WITH TWO DOTS VERTICALLY ABOVE [15]
س U+0770 ݰ Khowar /ʂ/ ARABIC LETTER SEEN WITH SMALL ARABIC LETTER TAH AND TWO DOTS [1]
س U+077D ݽ Burushaski /ʂ/ ARABIC LETTER SEEN WITH EXTENDED ARABIC-INDIC DIGIT FOUR ABOVE [1]
س U+077E ݾ Marwari /χ/, Persian (early) /s/ ARABIC LETTER SEEN WITH INVERTED V [17]
ش U+06FA ۺ Moplah /ʂ/ ARABIC LETTER SHEEN WITH DOT BELOW [11]
ص U+069D ڝ Dargwa, Lak /ts/ ARABIC LETTER SAD WITH TWO DOTS BELOW [9]
ص U+069E ڞ Berber /zˤ/, Burushaski /??/, Chinese /tsʰ/ ⟨c⟩ ARABIC LETTER SAD WITH THREE DOTS ABOVE
ص U+08AF Tataro-Belarusian /ts/ ARABIC LETTER SAD WITH THREE DOTS BELOW [2]
ض U+06FB ۻ Arwi /ɻ/, Moplah /ɭ/ ARABIC LETTER DAD WITH DOT BELOW [11]
ط U+069F ڟ Hausa /tsʼ/ ARABIC LETTER TAH WITH THREE DOTS ABOVE
ط U+08A3 Serer /ɗ̥/ ⟨ƭ⟩ ARABIC LETTER TAH WITH TWO DOTS ABOVE [6]
ع U+06A0 ڠ Malay /ŋ/, Dargwa, Lak, Lezghi(?) /q(ː)/ ARABIC LETTER AIN WITH THREE DOTS ABOVE [9]
ع U+06FC ۼ Moplah /ŋ/ ARABIC LETTER GHAIN WITH DOT BELOW [11]
ع U+075D ݝ Wolof, Songhoy, Soussou, Soninke, Pulaar /ŋ/, Songhoy /ɡ/ ARABIC LETTER AIN WITH TWO DOTS ABOVE [5]
ع U+075E ݞ Songhoy (Niger) /ŋ/, Hausa /ɡʷ/ ARABIC LETTER AIN WITH THREE DOTS POINTING DOWNWARDS ABOVE [5]
ع U+075F ݟ Songhoy (Mali) /ɡ/ ARABIC LETTER AIN WITH TWO DOTS VERTICALLY ABOVE [5]
ع U+08B3 Arwi /ŋ/ ARABIC LETTER AIN WITH THREE DOTS BELOW [18]
ف U+06A1 ڡ (variant) /f/, Adyghe /??/ ARABIC LETTER DOTLESS FEH [3]
ف U+06A2 ڢ Maghrebi /f/, Dargwa, Lak, Lezghi /pʼ/ ARABIC LETTER FEH WITH DOT MOVED BELOW [9]
ف U+06A3 ڣ Chechen, Ingush /pʼ/, Arwi /p/ ARABIC LETTER FEH WITH DOT BELOW [19]
ف U+06A4 ڤ Arabic, Kurdish, Comoro, Wakhi /v/, Malay /p/, Pulaar /ŋ/ (?) ARABIC LETTER VEH [5]
ف U+06A5 ڥ Maghrebi /v/ ARABIC LETTER FEH WITH THREE DOTS BELOW [10]
ف U+06A6 ڦ Sindhi /pʰ/ ARABIC LETTER PEHEH [4]
ف U+0760 ݠ Pulaar (Mali) /p/ ARABIC LETTER FEH WITH TWO DOTS BELOW [5]
ف U+0761 ݡ Comoro /β/ ARABIC LETTER FEH WITH THREE DOTS POINTING UPWARDS BELOW [5]
ف U+08A4 Chad /ⱱ/ ⟨vb⟩ ARABIC LETTER FEH WITH DOT BELOW AND THREE DOTS ABOVE [6]
ق U+066F ٯ (variant) /q/ ARABIC LETTER DOTLESS QAF [3]
ق U+06A7 ڧ Maghrebi /ɡ/ ARABIC LETTER QAF WITH DOT ABOVE
ق U+06A8 ڨ Maghrebi /ɡ/, Avar /tɬʼ/, Pulaar /ŋ/ (?) ARABIC LETTER QAF WITH THREE DOTS ABOVE [9] [10] [5]
ق U+08A5 Chad /gb/ ARABIC LETTER QAF WITH DOT BELOW [6]
ك U+06A9 ک Kurdish, Persian, Urdu /k/, Sindhi /kʰ/ ARABIC LETTER KEHEH [4]
ك U+06AC ڬ Malay /ɡ/ ARABIC LETTER KAF WITH DOT ABOVE [20]
ك U+06AD ڭ Turkish, Uyghur, Uzbek, Kazakh, Kyrgyz, Tatar /ŋ/, Berber /ɡ/ ARABIC LETTER NG [20]
ك U+06AE ڮ Berber /ɡ?/, Persian (early) ARABIC LETTER KAF WITH THREE DOTS BELOW [17]
ك U+077F ݿ Persian (early) /ŋ/ ARABIC LETTER KAF WITH TWO DOTS ABOVE [17]
ك U+08B4 Arwi /ɡ/ ARABIC LETTER KAF WITH DOT BELOW [18]
ک U+063B ػ Persian (early) /ɡ/ ARABIC LETTER KEHEH WITH TWO DOTS ABOVE [17]
ک U+063C ؼ Persian (early) /ɡ/, Avar, Lak, Lezghi /x/, Dargwa /ç/ ARABIC LETTER KEHEH WITH THREE DOTS BELOW [17] [9]
ک U+06AA ڪ Sindhi /k/ ARABIC LETTER SWASH KAF [20] [4]
ک U+06AB ګ Pashto /ɡ/ ARABIC LETTER KAF WITH RING
ک U+06AF گ Persian & others /ɡ/ ARABIC LETTER GAF
ک U+0762 ݢ Malay /ɡ/, Dargwa /ɢ/ ARABIC LETTER KEHEH WITH DOT ABOVE [5] [20] [9]
ک U+0763 ݣ Moroccan, Berber /ɡ/, Turkish /ŋ/, Chinese /ŋ-, ɲ-/ ⟨ng-, gn-⟩ ARABIC LETTER KEHEH WITH THREE DOTS ABOVE [5] [20] [21] [10]
ک U+0764 ݤ Hausa /kʼ/ ARABIC LETTER KEHEH WITH THREE DOTS POINTING UPWARDS BELOW [5]
گ U+06B0 ڰ Saraiki /ɠ/ ARABIC LETTER GAF WITH RING
گ U+06B1 ڱ Sindhi /ŋ/ ARABIC LETTER NGOEH [4]
گ U+06B2 ڲ Berber /ɡ?/, (not Sindhi) ARABIC LETTER GAF WITH TWO DOTS BELOW
گ U+06B3 ڳ Sindhi /ɠ/ ARABIC LETTER GUEH [4]
گ U+06B4 ڴ Azeri, Karakalpak /ŋ/ ARABIC LETTER GAF WITH THREE DOTS ABOVE
گ U+08B0 Turkish /j/ ⟨ğ⟩, Crimean Tatar /j/, Avar, Dargwa, Lak, Lezghi, Chechen, Ingush /kʼ/ ARABIC LETTER GAF WITH INVERTED STROKE [21] [2] [9]
ل U+06B5 ڵ Bosniak /ʎ/, Kurdish /ɫ/ ARABIC LETTER LAM WITH SMALL V [13] [14]
ل U+06B6 ڶ Kurdish (old) /ɫ/ ARABIC LETTER LAM WITH DOT ABOVE [13] [14]
ل U+06B7 ڷ Brahui /ɬ/, Kurdish (old) /ɫ/ ARABIC LETTER LAM WITH THREE DOTS ABOVE [13] [14]
ل U+06B8 ڸ Avar /ɬ/ ARABIC LETTER LAM WITH THREE DOTS BELOW [9]
ل U+076A ݪ Marwari /ɭ/ ARABIC LETTER LAM WITH BAR [22]
ل U+08A6 Chad /ɮ/ ⟨sl⟩ ARABIC LETTER LAM WITH DOUBLE BAR [6]
م U+0765 ݥ Pulaar (Mali) /ɓ/ ARABIC LETTER MEEM WITH DOT ABOVE [5]
م U+0766 ݦ Maba /mb/ ARABIC LETTER MEEM WITH DOT BELOW [5]
م U+08A7 Chad /mv/ ARABIC LETTER MEEM WITH THREE DOTS ABOVE [6]
ن U+06B9 ڹ Arwi, Moplah /ɳ/ ARABIC LETTER NOON WITH DOT BELOW [11]
ن U+06BA ں Urdu nasalization /◌̃/ ARABIC LETTER NOON GHUNNA [3]
ن U+06BB ڻ Sindhi, Balochi /ɳ/ ARABIC LETTER RNOON [4]
ن U+06BC ڼ Pashto /ɳ/ ARABIC LETTER NOON WITH RING [7]
ن U+06BD ڽ Malay /ɲ/ ARABIC LETTER NOON WITH THREE DOTS ABOVE [23]
ن U+0767 ݧ Comoro, Wolof, Songhoy, Kanuri, Mandinka, Soninke, Pulaar /ɲ/ ARABIC LETTER NOON WITH TWO DOTS BELOW [5]
ن U+0768 ݨ Marwari, Pathwari, Saraiki /ɳ/ ARABIC LETTER NOON WITH SMALL TAH [5]
ن U+0769 ݩ Gojri /ɳ/ ARABIC LETTER NOON WITH SMALL V [5]
ه U+06BE ھ Urdu aspiration /ʰ/, other Pakistani /ʰ/ ARABIC LETTER HEH DOACHASHMEE
ه U+06C0 ۀ Urdu (izafat) /ɛ/ ARABIC LETTER HEH WITH YEH ABOVE [21]
ه U+06C1 ہ Urdu /h/, other Pakistani /h/ ARABIC LETTER HEH GOAL
ه U+06C2 ۂ Urdu (izafat) /ɛ/ ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
ه U+06C3 ۃ Urdu /t~h/ ARABIC LETTER TEH MARBUTA GOAL
ه U+06D5 ە Uyghur /æ~ɛ/, Uzbek /ɛ/, Kyrgyz /e/, Karakalpak /æ~e/, Tatar /æ/, Kurdish /ɛ/ ARABIC LETTER AE [13]
ه U+06FF ۿ Parkari Koli /ɦ/ ARABIC LETTER HEH WITH INVERTED V [12]
و U+06C4 ۄ Kashmiri /ɔ/ ARABIC LETTER WAW WITH RING
و U+06C5 ۅ Kyrgyz /ø/ ARABIC LETTER KIRGHIZ OE
و U+06C6 ۆ Kazakh /v/, Uyghur, Azeri /ø/, Kurdish /o/, Tatar (Tahiri) /y/ ARABIC LETTER OE [14]
و U+06C7 ۇ Uyghur, Uzbek, Kazakh, Kyrgyz /u/ ARABIC LETTER U
و U+06C8 ۈ Uyghur /y/ ARABIC LETTER YU
و U+06C9 ۉ Kyrgyz /y/, Tatar (Tahiri) /ø/ ARABIC LETTER KIRGHIZ YU
و U+06CA ۊ Kurdish /ʉː/ ARABIC LETTER WAW WITH TWO DOTS ABOVE [13] [14]
و U+06CB ۋ Uyghur, Uzbek, Karakalpak, Kyrgyz, Tatar /w, v/, Kazakh /uw, yw, w, v/ ARABIC LETTER VE
و U+06CF ۏ Malay /v/ ARABIC LETTER WAW WITH DOT ABOVE
و U+0778 ݸ Burushaski /o/ ARABIC LETTER WAW WITH EXTENDED ARABIC-INDIC DIGIT TWO ABOVE [1]
و U+0779 ݹ Burushaski /oː/ ARABIC LETTER WAW WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE [1]
و U+08AB Rohingya /-w/ ARABIC LETTER WAW WITH DOT WITHIN [6]
و U+08B1 Tatar (Alparov) /ɤ/ ARABIC LETTER STRAIGHT WAW [2]
ي U+0620 ؠ Kashmiri /ʲ/ ARABIC LETTER KASHMIRI YEH [24]
ي U+063D ؽ Azeri /j, ɯ/ ARABIC LETTER FARSI YEH WITH INVERTED V
ي U+063E ؾ Persian (early) /j/ ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE [17]
ي U+063F ؿ Persian (early) /j/, Azeri /j, ɯ/ ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE [17]
ي U+06CC ی Persian /j, i/, Chinese /i, ɪ/ ⟨yi-, i⟩, Uyghur, Uzbek, Kazakh, Kyrgyz, Karakalpak /i~ɨ~ɯ/ ARABIC LETTER FARSI YEH [23]
ي U+06CD ۍ Pashto /əj/ ARABIC LETTER YEH WITH TAIL
ي U+06CE ێ Azeri /e/, Kurdish /e/ ARABIC LETTER YEH WITH SMALL V [13] [14]
ي U+06D0 ې Pashto, Uyghur, Uzbek /e/ ARABIC LETTER E
ي U+06D1 ۑ Malay (old) /??/ ARABIC LETTER YEH WITH THREE DOTS BELOW
ي U+06D2 ے Urdu /eː, ɛː/, Kashmiri, Punjabi, Sindhi /e(ː)/, Arabic variant of yāʾ /j, iː/ ARABIC LETTER YEH BARREE [20]
ي U+06D3 ۓ Urdu (izafat) /ɛ/ ARABIC LETTER YEH BARREE WITH HAMZA ABOVE
ي U+0775 ݵ Burushaski /i/ ARABIC LETTER FARSI YEH WITH EXTENDED ARABIC-INDIC DIGIT TWO ABOVE [1]
ي U+0776 ݶ Burushaski /iː/ ARABIC LETTER FARSI YEH WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE [1]
ي U+0777 ݷ Burushaski /ɰ/ ARABIC LETTER FARSI YEH WITH EXTENDED ARABIC-INDIC DIGIT FOUR BELOW [1]
ي U+077A ݺ Burushaski /e/ ARABIC LETTER YEH BARREE WITH EXTENDED ARABIC-INDIC DIGIT TWO ABOVE [1]
ي U+077B ݻ Burushaski /eː/ ARABIC LETTER YEH BARREE WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE [1]
ي U+08A8 Adamawa Fulfulde (Cameroon) ⟨ƴ⟩ ARABIC LETTER YEH WITH TWO DOTS BELOW AND HAMZA ABOVE [6]
ي U+08A9 Adamawa Fulfulde (Cameroon) /ɲ/ ARABIC LETTER YEH WITH TWO DOTS BELOW AND DOT ABOVE [6]
ي U+08AC Rohingya /-j/ ARABIC LETTER ROHINGYA YEH [6]
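
The "Unicode name" column can be checked against the Unicode character database that ships with a programming language. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `describe` is hypothetical, and which code points are known depends on the Unicode version bundled with the interpreter, so newer letters in the table may fall back to the placeholder text.

```python
import unicodedata


def describe(code_point: int) -> str:
    """Format one row as 'U+XXXX <character> <Unicode name>'."""
    ch = chr(code_point)
    # The second argument is returned when the database has no name for ch.
    name = unicodedata.name(ch, "<not in this Unicode database>")
    return f"U+{code_point:04X} {ch} {name}"


# PEH, TTEH and GAF, three long-established letters from the table above.
for cp in (0x067E, 0x0679, 0x06AF):
    print(describe(cp))
```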

Special cases

Language Isolated Final Medial Initial Unicode Character name
Arabic /θ/ ث ـث ـثـ ثـ U+062B ARABIC LETTER THEH
Arabic /n/ ن ـن ـنـ نـ U+0646 ARABIC LETTER NOON
Urdu /◌̃/ ں ـں ـںـ ںـ U+06BA ARABIC LETTER NOON GHUNNA
Malay /ɲ/ ڽ ـڽ ـڽـ ڽـ U+06BD ARABIC LETTER NOON WITH THREE DOTS ABOVE
Persian /p/ پ ـپ ـپـ پـ U+067E ARABIC LETTER PEH
  • The Urdu "nasalization nūn" is written without a dot in its final and isolated forms; its other forms do not differ from those of basic nūn.
  • The three dots of Malay (Jawi) nūn are written below in the initial and medial forms. The letter is therefore clearly different from Arabic thāʾ but looks like Persian pe; this causes no ambiguity, since Malay uses fāʾ with three dots (ڤ) for /p/.
Language Isolated Final Medial Initial Unicode Character name
Arabic /h/ ه ـه ـهـ هـ U+0647 ARABIC LETTER HEH
Urdu /h/ ہ ـہ ـہـ ہـ U+06C1 ARABIC LETTER HEH GOAL
Urdu /◌ʰ/ ھ ـھ ـھـ ھـ U+06BE ARABIC LETTER HEH DOACHASHMEE
  • At least two forms of Arabic hāʾ exist: one is used in the Naskh style, the other in Nastaʿlīq. As Urdu uses Nastaʿlīq almost exclusively, the Nastaʿlīq hāʾ is encoded separately for compatibility and given an Urdu name, though it is originally just a stylistic variation of the Arabic letter. In contemporary use the normal hāʾ (U+0647) should usually be preferred to this code point (U+06C1); it is up to fonts to render hāʾ (U+0647) properly in the Nastaʿlīq way.
  • Urdu also uses another hāʾ for aspiration. It looks like the Naskh hāʾ even in Nastaʿlīq fonts, but differs in its final and isolated forms.
Language Isolated Final Medial Initial Unicode Character name
Arabic /j, iː/ ي ـي ـيـ يـ U+064A ARABIC LETTER YEH
Arabic /-aː/ ى ـى U+0649 ARABIC LETTER ALEF MAKSURA
Persian /j, i/ ی ـی ـیـ یـ U+06CC ARABIC LETTER FARSI YEH
Urdu /eː, ɛː/ (1) ے ـے ـےـ ےـ U+06D2 ARABIC LETTER YEH BARREE
Urdu /eː, ɛː/ (2) ے ـے ـیـ یـ U+06D2 ARABIC LETTER YEH BARREE
  • Arabic uses two types of yāʾ: the normal one and one for /-aː/. As the value of the latter is equal to that of ʾalif, it is called ʾalif maqṣūra. It is written only at the end of words and has no initial or medial forms.
  • Persian ye is a stylistic variation of Arabic yāʾ. It lacks dots in the final and isolated forms, which therefore look like ʾalif maqṣūra. This variation is also used in some Arabic countries, such as Egypt. For compatibility it is encoded separately and given a Persian name, although historically it is not an independent Persian letter but just a variation of Arabic yāʾ.
  • There is also a third type of yāʾ. It has been used in the Nastaʿlīq style for Arabic, Persian and other languages. It might be called "returned yāʾ" after its particular final form. It was not a separate letter, just a stylistic (calligraphic) variation, but in Urdu it has become an independent letter with a separate code point and an Urdu name (ye baṛī). Its initial and medial forms do not differ from those of basic yāʾ, which causes several problems:
    • Technically, both the font and the system should render its initial and medial forms as visually identical to yāʾ, but few if any fonts do this properly. Hence result (1) above: the right code point (U+06D2) but the wrong representations.
    • An ad hoc workaround is to use yāʾ (U+064A) at the beginning or in the middle of words and ye baṛī (U+06D2) at the end. Hence result (2) above: the wrong code point (U+064A) for the initial and medial forms but the right representations. Although this looks good, it is technically incorrect. The situation must be resolved in the future by font designers and software developers.
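
The ad hoc workaround behind result (2) amounts to a simple substitution. The sketch below is only an illustration of that workaround, which the text itself regards as technically incorrect; the function name `apply_workaround` is hypothetical, the code points are the ones named in the text, and the input is assumed to be a single word with no trailing diacritics or punctuation.

```python
YEH = "\u064a"         # ARABIC LETTER YEH
YEH_BARREE = "\u06d2"  # ARABIC LETTER YEH BARREE


def apply_workaround(word: str) -> str:
    """Keep ye barree only as the last character; elsewhere substitute yeh.

    Assumption: `word` is one word whose final character is a letter.
    """
    if not word:
        return word
    head, last = word[:-1], word[-1]
    return head.replace(YEH_BARREE, YEH) + last


# A made-up character sequence, used only to show the substitution.
sample = YEH_BARREE + "\u0627" + YEH_BARREE
print([f"U+{ord(c):04X}" for c in apply_workaround(sample)])
```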
Language Isolated Final Medial Initial Unicode Character name
Arabic /k/ ك ـك ـكـ كـ U+0643 ARABIC LETTER KAF
Sindhi /kʰ/, Persian, Urdu /k/ ک ـک ـکـ کـ U+06A9 ARABIC LETTER KEHEH
Sindhi /k/ ڪ ـڪ ـڪـ ڪـ U+06AA ARABIC LETTER SWASH KAF
  • Arabic has had three shapes of kāf:
    • ك (U+0643) is used in all contemporary situations.
    • ک (U+06A9) and ڪ (U+06AA), "swash kāf", are historical and appear mostly in Quranic texts.
  • Persian uses ک (U+06A9) exclusively, though its initial and medial shapes do not differ from those of "Arabic" kāf. This shape is also used in the languages of Afghanistan, Pakistan and India.
  • Sindhi treats "Persian kāf" and "swash kāf" as two independent letters. ك (U+0643) is not used in Sindhi.
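
Because the three kāf shapes are separate code points, a text can be checked for which of them it uses. The following is a minimal sketch with hypothetical helper names (`kaf_usage`, `to_persian_kaf`). Replacing U+0643 with U+06A9 is shown as one possible clean-up for Persian text, on the assumption that a stray "Arabic" kāf there is unintended. U+06AA is left untouched, so the two letters that Sindhi keeps apart are not merged.

```python
KAF_VARIANTS = {
    "\u0643": "ARABIC LETTER KAF",
    "\u06a9": "ARABIC LETTER KEHEH",
    "\u06aa": "ARABIC LETTER SWASH KAF",
}


def kaf_usage(text: str) -> dict:
    """Count how often each kaf code point occurs in `text`."""
    counts = {name: 0 for name in KAF_VARIANTS.values()}
    for ch in text:
        if ch in KAF_VARIANTS:
            counts[KAF_VARIANTS[ch]] += 1
    return counts


def to_persian_kaf(text: str) -> str:
    """Replace Arabic kaf (U+0643) with keheh (U+06A9); leave U+06AA alone."""
    return text.replace("\u0643", "\u06a9")


mixed = "\u0643\u06a9\u06a9\u06aa"
print(kaf_usage(mixed))
print(kaf_usage(to_persian_kaf(mixed)))
```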

F and Q

Language Isolated Final Medial Initial Unicode Character name
Standard Arabic /f/ ف ـف ـفـ فـ U+0641 ARABIC LETTER FEH
Standard Arabic /q/ ق ـق ـقـ قـ U+0642 ARABIC LETTER QAF
Maghrebi /f/ ڢ ـڢ ـڢـ ڢـ U+06A2 ARABIC LETTER FEH WITH DOT MOVED BELOW
Maghrebi /ɡ~q/ ڧ ـڧ ـڧـ ڧـ U+06A7 ARABIC LETTER QAF WITH DOT ABOVE
  • In Standard Arabic fāʾ and qāf are differentiated largely by the number of superscript dots. In old Arabic manuscripts fāʾ was written with a dot below and qāf with a dot above. This ancient practice survives in Maghrebi Quranic manuscripts.
Language Isolated Final Medial Initial Unicode Character name
Maghrebi /v/ ڤ ـڤ ـڤـ ڤـ U+06A4 ARABIC LETTER VEH
Maghrebi /v/ ڥ ـڥ ـڥـ ڥـ U+06A5 ARABIC LETTER FEH WITH THREE DOTS BELOW
Maghrebi /ɡ/ ڨ ـڨ ـڨـ ڨـ U+06A8 ARABIC LETTER QAF WITH THREE DOTS ABOVE
  • In the Maghreb, qāf with three dots above is used for /ɡ/. In Morocco ݣ (U+0763) is preferred.
  • In the Maghreb, fāʾ with three dots either above or below is used for /v/. The former (ڤ) can hardly be distinguished from the three-dotted qāf, so it should be avoided.

Diacritics

Unicode Diacritic Usage Unicode name
U+064B هً Arabic /-an/ ARABIC FATHATAN
U+064C هٌ Arabic /-un/ ARABIC DAMMATAN
U+064D هٍ Arabic /-in/ ARABIC KASRATAN
U+064E هَ Arabic /a/, Persian /æ/, Urdu /ə/ ARABIC FATHA
U+064F هُ Arabic /u/, Persian /o/, Urdu /ʊ/ ARABIC DAMMA
U+0650 هِ Arabic /i/, Persian /e/, Urdu /ɪ/ ARABIC KASRA
U+0651 هّ consonants doubling ARABIC SHADDA
U+0652 هْ no vowel ARABIC SUKUN
U+0653 هٓ long vowel ARABIC MADDAH ABOVE
U+0654 هٔ glottal stop /ʔ/ ARABIC HAMZA ABOVE
U+0655 هٕ glottal stop /ʔ/ ARABIC HAMZA BELOW
U+0656 هٖ Arabic /iː/, Chadian ⟨ɨ⟩ ARABIC SUBSCRIPT ALEF
U+0657 هٗ Arabic, Urdu, Kashmiri /uː/, Chadian ⟨o⟩ ARABIC INVERTED DAMMA
U+0658 ه٘ Urdu nasalization /◌̃/ ARABIC MARK NOON GHUNNA
U+0659 هٙ Pashto /ə/ ARABIC ZWARAKAY
U+065A هٚ Chadian ⟨ɛ⟩ ARABIC VOWEL SIGN SMALL V ABOVE
U+065B هٛ Chadian ⟨ɔ⟩ ARABIC VOWEL SIGN INVERTED SMALL V ABOVE
U+065C هٜ Chadian, Guinean ⟨e⟩ ARABIC VOWEL SIGN DOT BELOW
U+065D هٝ Songhoy-Zarma, Pulaar-Fulfulde ⟨o⟩, Ormuri ARABIC REVERSED DAMMA
U+065E هٞ Gawri (Kalami) /æ/ ARABIC FATHA WITH TWO DOTS
U+065F هٟ Kashmiri /uː/ ARABIC WAVY HAMZA BELOW
U+0670 هٰ Arabic /aː/ ARABIC LETTER SUPERSCRIPT ALEF
U+0674 هٴ glottal stop /ʔ/, Kazakh "palatal" word ARABIC LETTER HIGH HAMZA
U+06EA ه۪ Kashmiri palatalization /ʲ/ ARABIC EMPTY CENTRE LOW STOP
Quranic
U+0615 هؕ Quranic al-waqf al-muṭlaq ARABIC SMALL HIGH TAH
U+0616 هؖ Quranic pause sign ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH
U+0617 هؗ Quranic al-waqf al-mujawwaz ARABIC SMALL HIGH ZAIN
U+0618 هؘ Quranic /a/ ARABIC SMALL FATHA
U+0619 هؙ Quranic /u/ ARABIC SMALL DAMMA
U+061A هؚ Quranic /i/ ARABIC SMALL KASRA
U+06D6 هۖ Quranic al-waṣlu awlā ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA
U+06D7 هۗ Quranic qif ARABIC SMALL HIGH LIGATURE QAF WITH LAM WITH ALEF MAKSURA
U+06D8 هۘ Quranic jazm ARABIC SMALL HIGH MEEM INITIAL FORM
U+06D9 هۙ Quranic lā-waqfi ʿalayhi ARABIC SMALL HIGH LAM ALEF
U+06DA هۚ Quranic al-waqf al-jāʾiz ARABIC SMALL HIGH JEEM
U+06DB هۛ Quranic muʿanaqah ARABIC SMALL HIGH THREE DOTS
U+06DC هۜ Quranic sakta ARABIC SMALL HIGH SEEN
U+06DF ه۟ Quranic ARABIC SMALL HIGH ROUNDED ZERO
U+06E0 ه۠ Quranic ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZERO
U+06E1 هۡ Quranic ARABIC SMALL HIGH DOTLESS HEAD OF KHAH
U+06E2 هۢ Quranic al-waqf al-lāzim ARABIC SMALL HIGH MEEM ISOLATED FORM
U+06E3 هۣ Quranic ARABIC SMALL LOW SEEN
U+06E4 هۤ Quranic ARABIC SMALL HIGH MADDA
U+06E5 هۥ Quranic waqfah ARABIC SMALL WAW
U+06E6 هۦ Quranic ARABIC SMALL YEH
U+06E7 هۧ Quranic ARABIC SMALL HIGH YEH
U+06E8 هۨ Quranic ARABIC SMALL HIGH NOON
U+06EB ه۫ Quranic ARABIC EMPTY CENTRE HIGH STOP
U+06EC ه۬ Quranic ARABIC ROUNDED HIGH STOP WITH FILLED CENTRE
U+06ED هۭ Quranic ARABIC SMALL LOW MEEM
U+08F0 هࣰ Quranic /-aⁿ/ ARABIC OPEN FATHATAN
U+08F1 هࣱ Quranic /-uⁿ/ ARABIC OPEN DAMMATAN
U+08F2 هࣲ Quranic /-iⁿ/ ARABIC OPEN KASRATAN
U+08F3 هࣳ Quranic /w/ ARABIC SMALL HIGH WAW
Abbreviation
U+0610 هؐ Abbreviation ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM
U+0611 هؑ Abbreviation ARABIC SIGN ALAYHE ASSALLAM
U+0612 هؒ Abbreviation ARABIC SIGN RAHMATULLAH ALAYHE
U+0613 هؓ Abbreviation ARABIC SIGN RADI ALLAHOU ANHU
U+0614 هؔ Abbreviation ARABIC SIGN TAKHALLUS
Rohingya
U+08E4 هࣤ Rohingya ⟨o⟩ ARABIC CURLY FATHA
U+08E5 هࣥ Rohingya ⟨ou⟩ ARABIC CURLY DAMMA
U+08E6 هࣦ Rohingya ⟨e⟩ ARABIC CURLY KASRA
U+08E7 هࣧ Rohingya ⟨on⟩ ARABIC CURLY FATHATAN
U+08E8 هࣨ Rohingya ⟨oun⟩ ARABIC CURLY DAMMATAN
U+08E9 هࣩ Rohingya ⟨en⟩ ARABIC CURLY KASRATAN
U+08EA ه࣪ Rohingya ⟨tone 1a⟩ ARABIC TONE ONE DOT ABOVE
U+08EB ه࣫ Rohingya ⟨tone 1b⟩ ARABIC TONE TWO DOTS ABOVE
U+08EC ه࣬ Rohingya ⟨tone 2a⟩ ARABIC TONE LOOP ABOVE
U+08ED ه࣭ Rohingya ⟨tone 2c⟩ ARABIC TONE ONE DOT BELOW
U+08EE ه࣮ Rohingya ⟨tone 3a⟩ ARABIC TONE TWO DOTS BELOW
U+08EF ه࣯ Rohingya ⟨tone 3b⟩ ARABIC TONE LOOP BELOW
African languages
U+08F4 هࣴ Wolof, Dyola ⟨ë⟩ ARABIC FATHA WITH RING
U+08F5 هࣵ Wolof ⟨à⟩ ARABIC FATHA WITH DOT ABOVE
U+08F6 هࣶ Jola-Fonyi, Balanta-Ganja /ì/ ARABIC KASRA WITH DOT BELOW
U+08F7 هࣷ Senegalese ⟨o⟩ ARABIC LEFT ARROWHEAD ABOVE
U+08F8 هࣸ Guinean ⟨u⟩ ARABIC RIGHT ARROWHEAD ABOVE
U+08F9 هࣹ Senegalese ⟨e⟩ ARABIC LEFT ARROWHEAD BELOW
U+08FA هࣺ Wolof, Jola-Fonyi, Balanta-Ganja ⟨é⟩ ARABIC RIGHT ARROWHEAD BELOW
U+08FB هࣻ Guinean ⟨un⟩ ARABIC DOUBLE RIGHT ARROWHEAD ABOVE
U+08FC هࣼ Guinean ⟨on⟩ ARABIC DOUBLE RIGHT ARROWHEAD ABOVE WITH DOT
U+08FD هࣽ Guinean ⟨o⟩ ARABIC RIGHT ARROWHEAD ABOVE WITH DOT
Philippine languages
U+08FE هࣾ Yakan ⟨o⟩ ARABIC DAMMA WITH DOT
U+08FF هࣿ Yakan ⟨ɛ⟩ ARABIC MARK SIDEWAYS NOON GHUNNA
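
Each sample in the Diacritic column is the carrier letter hāʾ (U+0647) followed by one combining mark. As a minimal sketch (the helper names `with_mark` and `strip_marks` are hypothetical), such a sample can be built, and its marks removed again, with Python's standard `unicodedata` module. The filter drops every character of general category Mn (nonspacing mark), which is an assumption about what should count as a diacritic.

```python
import unicodedata

HEH = "\u0647"  # ARABIC LETTER HEH, the carrier letter used in the table


def with_mark(mark_code_point: int, base: str = HEH) -> str:
    """Return the base letter followed by one combining mark."""
    return base + chr(mark_code_point)


def strip_marks(text: str) -> str:
    """Drop every character whose general category is Mn (nonspacing mark)."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")


sample = with_mark(0x064E)  # heh + ARABIC FATHA
print(sample, [f"U+{ord(c):04X}" for c in sample])
print(strip_marks(sample) == HEH)
```

Not every entry in the table is a nonspacing mark: U+0674 ARABIC LETTER HIGH HAMZA, for instance, is classified as a letter, so this filter would leave it in place.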

References

  1. ^ a b c d e f g h i j k l m n o p 06149
  2. ^ a b c d e 11209
  3. ^ a b c d 14211
  4. ^ a b c d e f g h i j k l m n o p q r s t u Trumpp, Ernst (1872). Grammar of the Sindhi Language. pp. 2–6.
  5. ^ a b c d e f g h i j k l m n o p q r s t u v w x y z aa ab 03223
  6. ^ a b c d e f g h i j k l m 10288r
  7. ^ a b c d e f g h Trumpp, Ernst (1873). Grammar of the Paštō Or Language of the Afghāns. pp. 1–4.
  8. ^ a b c Palmer, Edward Henry (1890). Simplified Grammar of Arabic, Persian, and Hindustani.
  9. ^ a b c d e f g h i j k l Shamkhalov, A. L. (1930). "Dagestanskie yazyki" [Dagestani languages]. Literaturnaya entsiklopediya [Literary Encyclopedia]. Vol. 3. Moscow: Izd-vo Kom. Akad. cols. 132–135.
  10. ^ a b c d Boogert, N. van den (1989). "Some notes on Maghribi script" (PDF). Manuscripts of the Middle East. 4: 30–43.
  11. ^ a b c d e f g h Moplah
  12. ^ a b c 01427
  13. ^ a b c d e f g h i 11090
  14. ^ a b c d e f g h i Kurdish Romanization
  15. ^ a b c 04025r
  16. ^ 12181r
  17. ^ a b c d e f g h 06345r
  18. ^ a b 13130r
  19. ^ 13119
  20. ^ a b c d e f 03176
  21. ^ a b c 01329
  22. ^ 03228r2
  23. ^ a b 09146
  24. ^ 10169