mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
I realize that the language a text is written in is very important, since it completely changes the character distribution. Our test files should take this into account, and we should create several test files in different languages for each encoding used by various languages.
6 lines
399 B
Plaintext
TIS-620
มาตรฐานผลิตภัณฑ์อุตสาหกรรม 620-2533, มอก.620-2533, หรือที่รู้จักกันทั่วไปว่า TIS-620 เป็นชุดอักขระมาตรฐานอุตสาหกรรมของไทย มีชื่อเต็มว่า รหัสสำหรับอักขระไทยที่ใช้กับคอมพิวเตอร์
รหัส TIS-620 มีรายละเอียดคล้ายรหัส ISO-8859-11 มาก แตกต่างกันแค่เพียงที่ ISO-8859-11 กำหนดให้ A0 เป็น "เว้นวรรคแบบไม่ตัดคำ" (no-break space) ส่วน TIS-620 นั้นแม้จะสงวนตำแหน่ง A0 เอาไว้ แต่ก็ไม่ได้กำหนดค่าใด ๆ ให้