Jehan fb3c47a073 LangModels: add ISO-8859-11 and regenerate TIS-620 Thai models.
ISO-8859-11 is basically exactly identical to TIS-620, with the added
non-breaking space character.
Basically our detection will always return TIS-620 except for
exceptional cases when a text has a non-breaking space.
2015-12-04 03:14:52 +01:00
..
bg Reorganize test files in language subdirectories. 2015-11-17 21:12:39 +01:00
de LangModels: adding German models for ISO-8859-1 and Windows-1252. 2015-12-03 23:58:41 +01:00
el Add Greek test files. 2015-11-18 02:57:09 +01:00
en Add an ASCII test file for English... 2015-11-28 17:49:13 +01:00
eo LangModels: add Esperanto ISO-8859-3 language model. 2015-12-04 01:35:56 +01:00
fr test: add a Windows-1252 French test. 2015-12-03 21:20:15 +01:00
he Add Hebrew test files. 2015-11-18 03:16:18 +01:00
hu test: add a Hungarian Windows-1250 test but skip it for now. 2015-12-03 21:18:55 +01:00
ja Add UTF-16 test files without BOM... 2015-11-28 19:50:18 +01:00
ko Adding UTF-8 file for Korean. 2015-11-18 02:36:33 +01:00
ru Add some Russian test files. 2015-11-27 18:17:20 +01:00
th LangModels: add ISO-8859-11 and regenerate TIS-620 Thai models. 2015-12-04 03:14:52 +01:00
tr LangModels: adding Turkish models for ISO-8859-3 and ISO-8859-9. 2015-12-04 02:35:09 +01:00
zh Adding some more test files for Russian and Chinese. 2015-11-18 19:27:38 +01:00
CMakeLists.txt test: add a Hungarian Windows-1250 test but skip it for now. 2015-12-03 21:18:55 +01:00
uchardet-tests.c Add automatic testing against every test file. 2015-11-18 18:18:27 +01:00