Unicode in web passwords

When we enter passwords, the text fields only show stars or some other character that corresponds to the number of characters. If I use an input method that produces unicode (especially Indian languages), I notice that this number varies in a different pattern (sometimes the number reduces) than the characters entered. A careful inspection showed that one star corresponds to one unicode codepoint, and format characters are not included.

I tried the malayalam word സരീക്ഷ്യൂമന്‍ by using an input method tool called Keyman, which allows to enter the word by entering the roman character sequence “sareekshyooman”.

I used Windows XP Professional (SP 2) and IE 6.0. Here is what I found.

Transliteration Unicode # stars Unicode characters Explanation
s സ് 2 D38 D4D sa (D38), virama (D4D)
sa à´¸ 1 D38 sa (D38)
sar സര്‍ 3 D38 D30 D4D 200D sa (D38), ra(D30), virama (D4D).
There may be a ZWJ(200D) also, but this doesn’t produce a star.
sare സരെ 3 D38 D30 D46 sa (D38), ra(D30), e(D46)
saree സരീ 3 D38 D30 D40 sa (D38), ra(D30), ee(D40)
sareek സരീക് 5 D38 D30 D40 D15 D4D sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D)
sareeks സരീക്സ് 7 D38 D30 D40 D15 D4D D38 D4D sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D38), virama (D4D)
sareeksh സരീക്ഷ് 7 D38 D30 D40 D15 D4D D37 D4D sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D)
sareekshy സരീക്ഷ്യ് 9 D38 D30 D40 D15 D4D D37 D4D D2F D4D sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), virama(D4D)
sareekshyo സരീക്ഷ്യൊ 9 D38 D30 D40 D15 D4D D37 D4D D2F D4A sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), o(D4A)
sareekshyoo സരീക്ഷ്യൂ 9 D38 D30 D40 D15 D4D D37 D4D D2F D42 sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), oo(D42)
sareekshyoom സരീക്ഷ്യൂം 10 D38 D30 D40 D15 D4D D37 D4D D2F D42 D02 sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), oo(D42), anuswaram(D02). Can be ma (D2E), virama(D4D) instead of anuswaram(D02).
sareekshyooma സരീക്ഷ്യൂമ 10 D38 D30 D40 D15 D4D D37 D4D D2F D42 D2E sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), oo(D42), ma (D2E)
sareekshyooman സരീക്ഷ്യൂമന്‍ 12 D38 D30 D40 D15 D4D D37 D4D D2F D42 D2E D28 D4D 200D sa (D38), ra(D30), ee(D40), ka(D15), virama(D4D), sa (D37), virama (D4D), ya (D2F), oo(D42), ma (D2E), na (D28) virama(D4D).
There may be ZWJ(200D) also here, but that doesn’t appear.