uchardet/test/th/iso-8859-11.txt
Jehan fb3c47a073 LangModels: add ISO-8859-11 and regenerate TIS-620 Thai models.
ISO-8859-11 is identical to TIS-620 except that it adds the
non-breaking space character.
Our detection will therefore almost always return TIS-620; it returns
ISO-8859-11 only in the exceptional case where a text contains a
non-breaking space.
2015-12-04 03:14:52 +01:00
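
The distinction described in the commit message can be illustrated with a minimal sketch. It assumes the only byte-level difference between the two charsets is 0xA0 (no-break space in ISO-8859-11, unassigned in TIS-620). The function name `guess_thai_charset` is hypothetical and is not part of uchardet, whose real detection relies on statistical language models rather than a single-byte check.

```python
# Byte 0xA0 is a no-break space in ISO-8859-11 and is unassigned in TIS-620.
NBSP = 0xA0


def guess_thai_charset(data: bytes) -> str:
    """Hypothetical helper (not uchardet's API).

    Assumes `data` is already known to be Thai text in one of the two
    charsets, and picks between them using only the 0xA0 byte.
    """
    return "ISO-8859-11" if NBSP in data else "TIS-620"


if __name__ == "__main__":
    plain = bytes([0xA1, 0xD2])             # two Thai letters, no NBSP
    with_nbsp = bytes([0xA1, 0xA0, 0xD2])   # same letters around an NBSP
    print(guess_thai_charset(plain))
    print(guess_thai_charset(with_nbsp))
```

Because every other byte maps to the same character in both charsets, a text without 0xA0 is valid in either one, which is why TIS-620 is the expected answer in the common case.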


TIS-620
Thai Industrial Standard 620-2533 (TIS 620-2533, มอก.620-2533), commonly known as TIS-620, is the Thai industrial standard character set. Its full name is "Standard for Thai Character Codes for Computers".
TIS-620 is very similar to ISO-8859-11. The only difference is that ISO-8859-11 defines A0 as a "no-break space", whereas TIS-620 reserves position A0 but does not assign it any value.