Jehan 2bade77bf9 tests: update Window-1250 test file for Hungarian.
ISO-8859-2 and Windows-1250 are absolutely similar for all letters in
the Hungarian alphabet. So for most texts, it is not an error to return
one charset or the other.
What could make the difference is for instance that Windows-1250 has
some symbols where ISO-8859-2 has control characters, like quotes,
dashes, the euro symbol…
Since control characters have a negative impact on confidence now,
texts with such symbols would tend towards Windows-1250 decision.
The new test file has such quote symbols.
2015-12-12 18:12:08 +01:00
..
bg Reorganize test files in language subdirectories. 2015-11-17 21:12:39 +01:00
de LangModels: adding German models for ISO-8859-1 and Windows-1252. 2015-12-03 23:58:41 +01:00
el Add Greek test files. 2015-11-18 02:57:09 +01:00
en Add an ASCII test file for English... 2015-11-28 17:49:13 +01:00
eo LangModels: add Esperanto ISO-8859-3 language model. 2015-12-04 01:35:56 +01:00
fr test: update UTF-16 and UTF-32 tests after label changing. 2015-12-04 19:46:51 +01:00
he Add Hebrew test files. 2015-11-18 03:16:18 +01:00
hu tests: update Window-1250 test file for Hungarian. 2015-12-12 18:12:08 +01:00
ja Add UTF-16 test files without BOM... 2015-11-28 19:50:18 +01:00
ko test: update UTF-16 and UTF-32 tests after label changing. 2015-12-04 19:46:51 +01:00
ru Add some Russian test files. 2015-11-27 18:17:20 +01:00
th LangModels: add ISO-8859-11 and regenerate TIS-620 Thai models. 2015-12-04 03:14:52 +01:00
tr LangModels: adding Turkish models for ISO-8859-3 and ISO-8859-9. 2015-12-04 02:35:09 +01:00
zh Adding some more test files for Russian and Chinese. 2015-11-18 19:27:38 +01:00
CMakeLists.txt tests: update Window-1250 test file for Hungarian. 2015-12-12 18:12:08 +01:00
uchardet-tests.c Add automatic testing against every test file. 2015-11-18 18:18:27 +01:00