Jehan 04f9309932 tests: update ISO-8859-15 French test file.
Previous technical text about charsets themselves were not relevant
to identify a language. In particular the special characters different
between ISO-8859-1 and ISO-8859-15 were used by themselves, out of a
char sequence context. Therefore without language understanding, they
could have as well been representing the ISO-8859-15 letters or the
ISO-8859-1 symbols at the corresponding codepoints.
Replacing with text from this Wikipedia page:
https://fr.wikipedia.org/wiki/Œuf_(cuisine)
This uses some of these same characters (in particular 'œ') but in
contextual character sequences, making it relevant for our algorithm.
2015-11-30 00:19:15 +01:00
..
iso-8859-1.txt Reorganize test files in language subdirectories. 2015-11-17 21:12:39 +01:00
iso-8859-15.txt tests: update ISO-8859-15 French test file. 2015-11-30 00:19:15 +01:00
utf-16be.txt Add UTF-16 and UTF-32 test files in French, with BOM. 2015-11-26 02:45:00 +01:00
utf-16le.txt Add UTF-16 and UTF-32 test files in French, with BOM. 2015-11-26 02:45:00 +01:00
utf-32be.txt Add UTF-16 and UTF-32 test files in French, with BOM. 2015-11-26 02:45:00 +01:00
utf-32le.txt Add UTF-16 and UTF-32 test files in French, with BOM. 2015-11-26 02:45:00 +01:00