mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-13 15:10:06 +08:00
This fixes the broken Russian test in Windows-1251 which once again gets a much better score with Russian. Also this adds UTF-8 support. Same as Bulgarian, I wonder why I had not regenerated this earlier. The new UTF-8 test comes from the 'Сурки' page of Wikipedia in Russian. Note that now this broke the test zh:gb18030 (the score for KOI8-R / ru (0.766388) beats GB18030 / zh (0.700000)). I think I'll have to look a bit closer at our GB18030 dedicated prober. |
||
|---|---|---|
| .. | ||
| ibm855.txt | ||
| ibm866.txt | ||
| iso-8859-5.txt | ||
| koi8-r.txt | ||
| mac-cyrillic.txt | ||
| utf-8.txt | ||
| windows-1251.txt | ||