Jehan b7aebfdfda LangModels: add support for Latvian | Lithuanian / ISO-8859-4 | ISO-8859-10.
Just realizing that these 2 language can also be encoded with these
charsets (even though ISO-8859-13 would appear to be more common…
maybe?). Anyway now the models are updated and can recognize texts
using these encoding for these languages.
Added some test files as well, which work great.
2016-09-21 00:27:16 +02:00
..
LangModels LangModels: add support for Latvian | Lithuanian / ISO-8859-4 | ISO-8859-10. 2016-09-21 00:27:16 +02:00
tools Update the URL links: uchardet is now a freedesktop project. 2016-07-20 01:47:50 +02:00
Big5Freq.tab Initial release. 2011-07-10 15:04:42 +08:00
CharDistribution.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
CharDistribution.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
CMakeLists.txt LangModels: add support for Portuguese / ISO-8859-1. 2016-09-21 00:01:07 +02:00
EUCKRFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
EUCTWFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
GB2312Freq.tab Initial release. 2011-07-10 15:04:42 +08:00
JISFreq.tab Initial release. 2011-07-10 15:04:42 +08:00
JpCntx.cpp Fixes boolean operation precedence warnings... 2015-11-18 19:38:12 +01:00
JpCntx.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsBig5Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsBig5Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsCharSetProber.cpp Initial release. 2011-07-10 15:04:42 +08:00
nsCharSetProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsCodingStateMachine.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nscore.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscCharsetProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscCharsetProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEscSM.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCJPProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCJPProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCKRProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCKRProber.h src: the EUC-KR prober now returns "UHC" as encoding name. 2016-09-19 01:22:45 +02:00
nsEUCTWProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsEUCTWProber.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsGB2312Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsGB2312Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsHebrewProber.cpp s/windows-1255/WINDOWS-1255/ to follow iconv uppercase naming. 2015-11-18 03:21:34 +01:00
nsHebrewProber.h Initial release. 2011-07-10 15:04:42 +08:00
nsLatin1Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsLatin1Prober.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsMBCSGroupProber.cpp uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsMBCSGroupProber.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsMBCSSM.cpp uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsPkgInt.h Update code from upstream. 2011-07-11 14:42:50 +08:00
nsSBCharSetProber.cpp src: give a little weight to "probable sequences". 2016-05-25 17:38:20 +02:00
nsSBCharSetProber.h LangModels: add support for Latvian | Lithuanian / ISO-8859-4 | ISO-8859-10. 2016-09-21 00:27:16 +02:00
nsSBCSGroupProber.cpp LangModels: add support for Latvian | Lithuanian / ISO-8859-4 | ISO-8859-10. 2016-09-21 00:27:16 +02:00
nsSBCSGroupProber.h LangModels: add support for Latvian | Lithuanian / ISO-8859-4 | ISO-8859-10. 2016-09-21 00:27:16 +02:00
nsSJISProber.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsSJISProber.h uchardet_get_charset() must return iconv-compatible names. 2015-11-17 16:15:21 +01:00
nsUniversalDetector.cpp Nearly-ASCII text with NBSP is still not ASCII. 2015-12-05 21:11:29 +01:00
nsUniversalDetector.h Nearly-ASCII text with NBSP is still not ASCII. 2015-12-05 21:11:29 +01:00
nsUTF8Prober.cpp Update code from upstream. 2011-07-11 14:42:50 +08:00
nsUTF8Prober.h Update code from upstream. 2011-07-11 14:42:50 +08:00
prmem.h Initial release. 2011-07-10 15:04:42 +08:00
symbols.cmake cmake: minor comment cleanups 2016-03-22 01:23:06 +03:00
uchardet.cpp uchardet_get_charset() should never return NULL... 2015-11-16 17:33:16 +01:00
uchardet.h Update doc comment. 2015-12-03 20:36:09 +01:00