Jehan 6bbe7da1ac LangModels: add Finnish support.
I built models for ISO-8859-1, ISO-8859-4, ISO-8859-9, ISO-8859-13,
ISO-8859-15 and WINDOWS-1252, which all contain Finnish letters.
Nevertheless most texts in these encoding end up the same (same
codepoints for the Finnish glyphs) so I keep only tests for ISO-8859-1
and UTF-8. Models for other encoding may still be useful when processing
texts with some symbols, etc.
2016-09-21 18:27:39 +02:00
..
LangModels LangModels: add Finnish support. 2016-09-21 18:27:39 +02:00
tools Update the URL links: uchardet is now a freedesktop project. 2016-07-20 01:47:50 +02:00
Big5Freq.tab Initial release. 2011-07-10 15:04:42 +08:00
CharDistribution.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
CharDistribution.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
CMakeLists.txt LangModels: add Finnish support. 2016-09-21 18:27:39 +02:00
EUCKRFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
EUCTWFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
GB2312Freq.tab Initial release. 2011-07-10 15:04:42 +08:00
JISFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
JpCntx.cpp Fixes boolean operation precedence warnings... 2015-11-18 19:38:12 +01:00
JpCntx.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsBig5Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsBig5Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsCharSetProber.cpp Initial release. 2011-07-10 15:04:42 +08:00
nsCharSetProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsCodingStateMachine.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nscore.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscCharsetProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscCharsetProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscSM.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCJPProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCJPProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCKRProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCKRProber.h src: the EUC-KR prober now returns "UHC" as encoding name. 2016-09-19 01:22:45 +02:00
nsEUCTWProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCTWProber.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsGB2312Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsGB2312Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsHebrewProber.cpp s/windows-1255/WINDOWS-1255/ to follow iconv uppercase naming. 2015-11-18 03:21:34 +01:00
nsHebrewProber.h Initial release. 2011-07-10 15:04:42 +08:00
nsLatin1Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsLatin1Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsMBCSGroupProber.cpp uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsMBCSGroupProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsMBCSSM.cpp uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsPkgInt.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsSBCharSetProber.cpp src: give a little weight to "probable sequences". 2016-05-25 17:38:20 +02:00
nsSBCharSetProber.h LangModels: add Finnish support. 2016-09-21 18:27:39 +02:00
nsSBCSGroupProber.cpp LangModels: add Finnish support. 2016-09-21 18:27:39 +02:00
nsSBCSGroupProber.h LangModels: add Finnish support. 2016-09-21 18:27:39 +02:00
nsSJISProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsSJISProber.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsUniversalDetector.cpp src: fix non-guarded 'if' warning. 2016-09-21 02:37:31 +02:00
nsUniversalDetector.h Nearly-ASCII text with NBSP is still not ASCII. 2015-12-05 21:11:29 +01:00
nsUTF8Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsUTF8Prober.h Update code from upstream. 2011-07-11 14:42:50 +08:00
prmem.h Initial release. 2011-07-10 15:04:42 +08:00
symbols.cmake cmake: minor comment cleanups 2016-03-22 01:23:06 +03:00
uchardet.cpp uchardet_get_charset() should never return NULL... 2015-11-16 17:33:16 +01:00
uchardet.h src: update documentation comments on the public API. 2016-09-21 17:36:17 +02:00