mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-07 01:06:40 +08:00
Encodings: ISO-8859-1, ISO-8859-4, ISO-8859-9, ISO-8859-15 and WINDOWS-1252. Test text from https://sv.wikipedia.org/wiki/Mölle
152 lines
4.5 KiB
Plaintext
152 lines
4.5 KiB
Plaintext
= Logs of language model for Swedish (sv) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2016-09-28 22:26:37.221506
|
|
- Maximum depth: 5
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Kakapo (revision 36509929)
|
|
Akut hotad (revision 32517788)
|
|
Aotearoa (revision 36575359)
|
|
Art (revision 36771341)
|
|
Artepitet (revision 36771341)
|
|
Auckland (revision 35752058)
|
|
Auktorsnamn (revision 35976965)
|
|
BBC (revision 36508743)
|
|
Basalomsättning (revision 30567523)
|
|
Beilschmiedia tawa (revision 29101923)
|
|
Berguv (revision 36295501)
|
|
Betesmark (revision 34292168)
|
|
Biotop (revision 35528052)
|
|
BirdLife International (revision 36124283)
|
|
Bonaparte (revision 37325183)
|
|
British Museum (revision 36420244)
|
|
Bröstben (revision 30602527)
|
|
Dacrydium cupressinum (revision 32986501)
|
|
Digital object identifier (revision 27637223)
|
|
Djur (revision 37300775)
|
|
Djurpark (revision 37147093)
|
|
Domän (biologi) (revision 33377709)
|
|
Don Merton (revision 36509929)
|
|
Douglas Adams (revision 36556245)
|
|
Däggdjur (revision 37328286)
|
|
Ekologisk nisch (revision 33898643)
|
|
Ekosystem (revision 36598266)
|
|
Endemisk (revision 30647109)
|
|
Eukaryoter (revision 37095313)
|
|
Evolution (revision 37093592)
|
|
Familj (biologi) (revision 30280200)
|
|
Femininum (revision 30597527)
|
|
Fjäder (biologi) (revision 36364943)
|
|
Fjäderdräkt (revision 36364943)
|
|
Fladdermöss (revision 37307257)
|
|
Flygg (revision 36479633)
|
|
Frukter (revision 34088588)
|
|
Frö (revision 37333131)
|
|
Fågelläte (revision 34034723)
|
|
Fåglar (revision 37387306)
|
|
Fåglarnas liv (revision 36509929)
|
|
Genitiv (revision 37388438)
|
|
George Edward Grey (revision 36509929)
|
|
George Robert Gray (revision 20426710)
|
|
Haasts örn (revision 29175076)
|
|
Hauturu/Little Barrier Island (revision 36509929)
|
|
Hermelin (revision 36578682)
|
|
Hertz (revision 37104488)
|
|
Hjortdjur (revision 36493550)
|
|
Hund (revision 37351832)
|
|
Husdjur (revision 37384850)
|
|
Huskatt (revision 32922967)
|
|
Hāngi (revision 29609696)
|
|
IUCN (revision 30570280)
|
|
Iller (revision 30663158)
|
|
Infraröd (revision 36770733)
|
|
Internationella naturvårdsunionen (revision 30570280)
|
|
Jordbruk (revision 37352625)
|
|
Kahurangi National Park (revision 35956142)
|
|
Kamouflage (revision 36579595)
|
|
Kaniner (revision 36877621)
|
|
Kapiti Island (revision 37395588)
|
|
Katt (revision 36734686)
|
|
Kelp (revision 30312471)
|
|
Kivier (revision 36373234)
|
|
Klass (biologi) (revision 30280201)
|
|
Kroppsfett (revision 35066611)
|
|
Könsdimorfism (revision 30816932)
|
|
Könsfördelning (revision 24769321)
|
|
Lamm- och fårkött (revision 36187205)
|
|
Lek (fortplantningsbeteende) (revision 30508235)
|
|
Mandel (revision 36577529)
|
|
Maori (revision 32560474)
|
|
Maorier (revision 35862066)
|
|
Maoripapegojor (revision 36545138)
|
|
Mark Carwardine (revision 20375916)
|
|
Markpapegoja (revision 36295722)
|
|
Maskulinum (revision 32704551)
|
|
Masterton (revision 29859631)
|
|
Metrosideros umbellata (revision 29071212)
|
|
Milford Sound (revision 20284758)
|
|
Morrhår (revision 36533839)
|
|
Muskelmage (revision 31196380)
|
|
Mustela (revision 20934105)
|
|
Mårddjur (revision 37306347)
|
|
Māori (revision 32560474)
|
|
NHNZ (revision 36509929)
|
|
Nattpapegoja (revision 33486517)
|
|
Nordön (revision 24810231)
|
|
Nya Zeeland (revision 36575359)
|
|
Näbb (revision 23648463)
|
|
Ollonår (revision 36509929)
|
|
Ordning (biologi) (revision 30280196)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2016-09-28 22:29:21.480287
|
|
|
|
48 characters appeared 594415 times.
|
|
|
|
First 31 characters:
|
|
[ 0] Char a: 10.070741821791172 %
|
|
[ 1] Char e: 9.737136512369304 %
|
|
[ 2] Char r: 9.110638190489809 %
|
|
[ 3] Char n: 8.378826240925951 %
|
|
[ 4] Char t: 7.481305148759705 %
|
|
[ 5] Char s: 5.828587771169974 %
|
|
[ 6] Char i: 5.359891658184939 %
|
|
[ 7] Char l: 5.173489901836259 %
|
|
[ 8] Char o: 4.694195133029954 %
|
|
[ 9] Char d: 4.597293136949774 %
|
|
[10] Char k: 3.297359588839447 %
|
|
[11] Char m: 3.1898589369379975 %
|
|
[12] Char g: 3.004466576381821 %
|
|
[13] Char v: 2.2324470277499726 %
|
|
[14] Char f: 2.1988005013332437 %
|
|
[15] Char p: 2.06017681249632 %
|
|
[16] Char u: 2.0499146219392173 %
|
|
[17] Char ä: 2.0475593650900468 %
|
|
[18] Char h: 2.028380845032511 %
|
|
[19] Char å: 1.5443755625278637 %
|
|
[20] Char c: 1.442594820117258 %
|
|
[21] Char ö: 1.3515809661600062 %
|
|
[22] Char b: 1.268642278542769 %
|
|
[23] Char j: 0.7302978558751041 %
|
|
[24] Char y: 0.6699023409570755 %
|
|
[25] Char x: 0.2111319532649748 %
|
|
[26] Char w: 0.10262190557102362 %
|
|
[27] Char z: 0.09151855185350302 %
|
|
[28] Char é: 0.021197311642539303 %
|
|
[29] Char ā: 0.011103353717520588 %
|
|
[30] Char q: 0.007570468443764037 %
|
|
|
|
The first 31 characters have an accumulated ratio of 0.999936071599808.
|
|
|
|
748 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.997323508584682
|
|
Next 512 (512-1024): 1.6823263208364526e-06
|
|
Rest: 1.7780915628762273e-17
|
|
|
|
- Processing end: 2016-09-28 22:29:21.590354
|