Mirror of https://gitlab.freedesktop.org/uchardet/uchardet.git (synced 2025-12-06 16:56:40 +08:00)
Officially supported: ISO-8859-1, ISO-8859-3, ISO-8859-9, ISO-8859-15 and WINDOWS-1252. As with Finnish, only ISO-8859-1 and UTF-8 tests were added, since the other encodings end up identical to ISO-8859-1 for most common texts (i.e. the glyphs used in Italian sit on the same codepoints in these other encodings). Test text from https://it.wikipedia.org/wiki/Architettura_longobarda
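The statement that Italian glyphs share codepoints across the supported encodings can be checked with Python's built-in codecs. This is a minimal sketch, assuming the accented letters listed in the frequency table of the log are the glyphs in question; `encodings_agree` is a hypothetical helper and not part of uchardet.

```python
# Accented letters that appear in the Italian frequency table of this log.
ITALIAN_ACCENTED = "àèùòéìáó"

# The five single-byte charsets named as officially supported.
CODECS = ["iso-8859-1", "iso-8859-3", "iso-8859-9", "iso-8859-15", "windows-1252"]


def encodings_agree(text, codecs=CODECS):
    """Return True if every codec encodes `text` to exactly the same bytes.

    A character that any codec cannot encode counts as disagreement.
    """
    try:
        encoded = {text.encode(codec) for codec in codecs}
    except UnicodeEncodeError:
        return False
    return len(encoded) == 1


if __name__ == "__main__":
    print("Italian accented letters agree:", encodings_agree(ITALIAN_ACCENTED))
    for char in ITALIAN_ACCENTED:
        print(char, char.encode("iso-8859-1").hex())
```

When the byte sequences are identical, a detector has no way to tell these charsets apart from such text, which is consistent with adding tests only for ISO-8859-1 and UTF-8.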
163 lines
4.3 KiB
Plaintext
= Logs of language model for Italian (it) =

- Generated by BuildLangModel.py
- Started: 2016-09-21 18:43:12.831409
- Maximum depth: 5
- Max number of pages: 100

== Parsed pages ==

Pieve Ligure (revision 83186252)
010 (prefisso) (revision 76157203)
1000 (revision 83185341)
1143 (revision 70627567)
1162 (revision 70627612)
118 - Emergenza sanitaria (revision 83267411)
1201 (revision 77523243)
1202 (revision 76764411)
1374 (revision 78259457)
1404 (revision 70628069)
1520 (revision 76854924)
1537 (revision 70628296)
1582 (revision 80626188)
1584 (revision 76837051)
1600 (revision 76869356)
1619 (revision 70628455)
1742 (revision 70628675)
1748 (revision 70628682)
1749 (revision 70628684)
1750 (revision 70628690)
1754 (revision 70628697)
1775 (revision 70628734)
1797 (revision 78338823)
1798 (revision 82047236)
1803 (revision 77502534)
1805 (revision 79369853)
1809 (revision 70628789)
1810 (revision 82930218)
1814 (revision 78338825)
1815 (revision 82669615)
1816 (revision 83185384)
1818 (revision 72407239)
1823 (revision 74880156)
1859 (revision 83185401)
1860 (revision 83185403)
1861 (revision 83185412)
1868 (revision 83185430)
1874 (revision 83185441)
1897 (revision 83185267)
1908 (revision 83185631)
1909 (revision 83185630)
1913 (revision 83185626)
1915 (revision 83185625)
1917 (revision 83185270)
1920 (revision 83185621)
1921 (revision 83185619)
1923 (revision 83185616)
1925 (revision 83185614)
1926 (revision 83185612)
1928 (revision 83185610)
1929 (revision 83185609)
1939 (revision 83185598)
1946 (revision 83185590)
1947 (revision 83185589)
1948 (revision 83185587)
1951 (revision 83185584)
1956 (revision 83185478)
1960 (revision 83185487)
1964 (revision 83185493)
1965 (revision 83185494)
1969 (revision 83185500)
1970 (revision 83185503)
1971 (revision 83185505)
1975 (revision 83185510)
1976 (revision 83185513)
1977 (revision 83185514)
1980 (revision 83185518)
1981 (revision 83308867)
1983 (revision 83185524)
1985 (revision 83185526)
1988 (revision 83185280)
1990 (revision 83185531)
1995 (revision 83185538)
1999 (revision 83326325)
2000 (revision 83185544)
2001 (revision 83309058)
2002 (revision 83185545)
2003 (revision 83185546)
2004 (revision 83185283)
2005 (revision 83185285)
2006 (revision 83185547)
2007 (revision 83185549)
2008 (revision 83185551)
2009 (revision 83185552)
2010 (revision 83185287)
2012 (revision 83185289)
712 (revision 70630167)
749 (revision 78272323)
ATP (Provincia di Genova) (revision 82754117)
Abbazia di San Colombano (revision 83062997)
Abbazia di San Fruttuoso (revision 83288120)
Acacia dealbata (revision 83036867)
Acquedotto (revision 82973825)
Affresco (revision 82000422)
Agricoltura (revision 82578266)
Allevamento (revision 82971452)
Altitudine (revision 82971213)
Angelo (revision 82333116)
Anni 1960 (revision 83161222)
Anni 1970 (revision 81663175)
Antica Roma (revision 83125874)

== End of Parsed pages ==
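
The "Maximum depth" and "Max number of pages" settings in the log header bound how far the page walk goes from the starting article. The sketch below illustrates one way such a bounded walk can work. It is not taken from BuildLangModel.py: the breadth-first order, the `bounded_crawl` name and the `toy_links` graph are all assumptions made for illustration, and only the page titles are borrowed from the list above.

```python
from collections import deque


def bounded_crawl(start, links, max_depth=5, max_pages=100):
    """Breadth-first walk over `links`, capped by depth and page count.

    `links` maps a page title to the titles it links to. It stands in for
    real Wikipedia lookups and is purely illustrative.
    """
    visited = [start]
    seen = {start}
    queue = deque([(start, 0)])
    while queue and len(visited) < max_pages:
        title, depth = queue.popleft()
        if depth >= max_depth:
            continue  # do not follow links beyond the depth limit
        for linked in sorted(links.get(title, [])):
            if linked in seen:
                continue
            seen.add(linked)
            visited.append(linked)
            queue.append((linked, depth + 1))
            if len(visited) >= max_pages:
                break  # page budget exhausted
    return visited


# Hypothetical link graph: the titles come from the log, the links do not.
toy_links = {
    "Pieve Ligure": ["Acquedotto", "1000", "Affresco"],
    "1000": ["Antica Roma"],
    "Acquedotto": ["Antica Roma", "Agricoltura"],
}

if __name__ == "__main__":
    for title in bounded_crawl("Pieve Ligure", toy_links, max_depth=5, max_pages=100):
        print(title)
```

Either limit alone is enough to stop the walk, so a small page budget can end the run long before the depth limit is reached.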

- Wikipedia parsing ended at: 2016-09-21 18:46:08.840718

59 characters appeared 823241 times.

First 34 characters:
[ 0] Char i: 11.823147778111148 %
[ 1] Char a: 11.252112078965942 %
[ 2] Char e: 10.910170897707962 %
[ 3] Char o: 8.936386793174782 %
[ 4] Char n: 7.317055394471364 %
[ 5] Char l: 6.931263141655967 %
[ 6] Char r: 6.521784021932824 %
[ 7] Char t: 6.386708145002497 %
[ 8] Char s: 4.572415610981475 %
[ 9] Char c: 4.116291584116923 %
[10] Char d: 3.9770856893667834 %
[11] Char u: 2.8944136650142545 %
[12] Char m: 2.762860450342002 %
[13] Char p: 2.6809889206198427 %
[14] Char g: 2.1493098618751985 %
[15] Char v: 1.5369739845318686 %
[16] Char b: 1.2855287819727153 %
[17] Char f: 0.9932692856648295 %
[18] Char z: 0.9664241698360504 %
[19] Char h: 0.7159507361756764 %
[20] Char q: 0.2416060424590126 %
[21] Char k: 0.18876610858788617 %
[22] Char à: 0.15596890825408355 %
[23] Char y: 0.12462936126844994 %
[24] Char è: 0.11600491229178332 %
[25] Char w: 0.10628722330398996 %
[26] Char x: 0.10312897438295712 %
[27] Char j: 0.07555503188009344 %
[28] Char ù: 0.05575524056746445 %
[29] Char ò: 0.03304014255849745 %
[30] Char é: 0.021014502436103158 %
[31] Char ì: 0.0191924357508919 %
[32] Char á: 0.004737373381549267 %
[33] Char ó: 0.003644133370422513 %

The first 34 characters have an accumulated ratio of 0.9997947138201325.
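
The per-character percentages and the accumulated ratio above are, in essence, normalized counts over the parsed text. The following sketch shows that arithmetic on an arbitrary sample sentence. The function names, the lowercasing and the `isalpha` filter are assumptions for illustration and may differ from what BuildLangModel.py actually does.

```python
from collections import Counter


def char_frequencies(text):
    """Return (char, ratio) pairs for alphabetic characters, most frequent first."""
    letters = [c for c in text.lower() if c.isalpha()]
    counts = Counter(letters)
    total = len(letters)
    return [(char, count / total) for char, count in counts.most_common()]


def accumulated_ratio(freqs, first_n):
    """Sum the ratios of the `first_n` most frequent characters."""
    return sum(ratio for _, ratio in freqs[:first_n])


if __name__ == "__main__":
    # Arbitrary sample text, not taken from the parsed pages.
    sample = "La pieve è una chiesa rurale con battistero"
    freqs = char_frequencies(sample)
    print(f"{len(freqs)} characters appeared "
          f"{sum(c.isalpha() for c in sample)} times.")
    for rank, (char, ratio) in enumerate(freqs[:5]):
        print(f"[{rank:2}] Char {char}: {ratio * 100} %")
    print("Accumulated ratio of the first 5:", accumulated_ratio(freqs, 5))
```

An accumulated ratio close to 1, as in the log, means the retained characters cover nearly all letters seen in the corpus, so dropping the remaining rare characters loses very little.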

872 sequences found.

First 512 (typical positive ratio): 0.9989484485502651
Next 512 (512-1024): 1.214711123474171e-06
Rest: -4.336808689942018e-17
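
The three figures above split the share of all observed two-character sequences into the 512 most frequent, the next 512, and whatever remains. The sketch below reproduces that bucketing on a toy input. It assumes that a sequence is a pair of adjacent characters drawn from the retained alphabet and that the remainder is obtained by subtraction; both points are assumptions rather than facts stated in the log, and `sequence_ratios` is a hypothetical name.

```python
from collections import Counter


def sequence_ratios(text, alphabet, head=512):
    """Bucket the share of adjacent character pairs by frequency rank.

    Returns (distinct_pairs, first, second, rest), where `first` covers the
    `head` most frequent pairs and `second` covers the following `head`.
    """
    pairs = Counter(
        a + b for a, b in zip(text, text[1:]) if a in alphabet and b in alphabet
    )
    total = sum(pairs.values())
    if total == 0:
        raise ValueError("no sequences found in text")
    ranked = [count for _, count in pairs.most_common()]
    first = sum(ranked[:head]) / total
    second = sum(ranked[head:2 * head]) / total
    # Assumption: the remainder is computed by subtraction, so rounding
    # error can leave a tiny positive or negative residue instead of 0.
    rest = 1 - first - second
    return len(pairs), first, second, rest


if __name__ == "__main__":
    found, first, second, rest = sequence_ratios("abab", alphabet="ab", head=1)
    print(f"{found} sequences found.")
    print("First:", first)
    print("Next:", second)
    print("Rest:", rest)
```

With 872 sequences found, everything fits inside the first two buckets, so a Rest value on the order of 1e-17 is consistent with floating-point rounding rather than with real leftover sequences.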

- Processing end: 2016-09-21 18:46:08.920456