mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
Encodings: Windows-1250, ISO-8859-2, IBM852 and Mac-CentralEurope. Other encodings are known to have been used for Czech: Kamenicky, KOI-8 CS2 and Cork. But these are uncommon enough that I decided not to support them (especially since I can't find them supported in iconv either, or at least not under an alias which I could recognize). This web page, which contents was made under the Public Domain, is a good reference for encodings which were used historically for Czech and Slovak: http://luki.sdf-eu.org/txt/cs-encodings-faq.html
162 lines
5.3 KiB
Plaintext
162 lines
5.3 KiB
Plaintext
= Logs of language model for Czech (cs) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2016-09-21 03:20:56.824516
|
|
- Maximum depth: 5
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Sociální fobie (revision 13567590)
|
|
Adaptace (revision 13991192)
|
|
Agorafobie (revision 13013445)
|
|
Alkoholismus (revision 13822064)
|
|
Alprazolam (revision 14082425)
|
|
Antidepresivum (revision 14113423)
|
|
Asertivita (revision 14111958)
|
|
Atenolol (revision 12051880)
|
|
Automatické negativní myšlenky (revision 13567590)
|
|
Benzodiazepin (revision 13947546)
|
|
Beta-blokátory (revision 13428762)
|
|
Blud (revision 13888988)
|
|
Bohatství (revision 13556478)
|
|
Bupropion (revision 13686045)
|
|
Citaloparam (revision 13567590)
|
|
Clonazepan (revision 13567590)
|
|
Crohnova nemoc (revision 13745254)
|
|
Deprese (psychologie) (revision 13695735)
|
|
Diagnostický a statický manuál mentálních poruch (revision 13567590)
|
|
Diagnostický a statistický manuál mentálních poruch (revision 13714660)
|
|
Diagnóza (medicína) (revision 13052239)
|
|
Dichotomické myšlení (revision 13567590)
|
|
Digital object identifier (revision 14138049)
|
|
Dopamin (revision 13714274)
|
|
Dystymie (revision 13567267)
|
|
Důkaz kruhem (revision 13190761)
|
|
Elektivní mutismus (revision 9940891)
|
|
Emoce (revision 14110033)
|
|
Escitalopram (revision 12954987)
|
|
Evoluce (revision 13951488)
|
|
Expozice (psychologie) (revision 14119474)
|
|
Extraverze a introverze (revision 13872996)
|
|
Fluoxetin (revision 12955006)
|
|
Fluvoxamin (revision 12955006)
|
|
Gen (revision 13907182)
|
|
Generalizovaná úzkostná porucha (revision 14006709)
|
|
Halucinaci (revision 12188143)
|
|
Hněv (revision 14057864)
|
|
Inteligence (revision 14009781)
|
|
International Standard Serial Number (revision 12869806)
|
|
Interpersonální psychoterapie (revision 13567590)
|
|
Iracionalita (revision 4765977)
|
|
Ján Praško Pavlov (revision 14086840)
|
|
Klinické testování (revision 13530979)
|
|
Kognitivní omyl (revision 13107294)
|
|
Kognitivní psychologie (revision 11629465)
|
|
Kognitivní restrukturalizace (revision 13567360)
|
|
Kognitivně behaviorální terapie (revision 13980494)
|
|
Komorbidita (revision 11351714)
|
|
Lymská borelióza (revision 14068446)
|
|
Malé sebevědomí (revision 13567590)
|
|
Medical Subject Headings (revision 12239331)
|
|
Meditace (revision 13180783)
|
|
Mentální černý filtr (revision 13567590)
|
|
Mezinárodní klasifikace nemocí (revision 12531067)
|
|
Michael Liebowitz (revision 13567590)
|
|
Moclobemid (revision 13567590)
|
|
Moritova terapie (revision 11960292)
|
|
Musturbace (revision 13567590)
|
|
Nervozita (revision 13847097)
|
|
Noradrenalin (revision 14054165)
|
|
Obsedantně kompulzivní porucha (revision 13950365)
|
|
Panická ataka (revision 13253537)
|
|
Panická porucha (revision 13253537)
|
|
Paranoia (revision 14027052)
|
|
Paroxetin (revision 12955006)
|
|
Pohlavnost (revision 13564689)
|
|
Porucha (revision 11039108)
|
|
Pravděpodobnost (revision 13596041)
|
|
Predestinace (revision 12467403)
|
|
Profese (revision 13975485)
|
|
Propanolol (revision 12972658)
|
|
Psychiatr (revision 12767960)
|
|
Psychické trauma (revision 11227535)
|
|
Psychoaktivní droga (revision 13939232)
|
|
Psychodynamická léčba (revision 13567590)
|
|
Psychofarmaka (revision 9928215)
|
|
Psycholog (revision 12358728)
|
|
Psychoterapie (revision 13874178)
|
|
Puberta (revision 12540014)
|
|
RIMA (revision 10234728)
|
|
Remise (revision 9896748)
|
|
Richard Heimberg (revision 13567590)
|
|
Rámování myšlenek (revision 13567590)
|
|
Schizofrenie (revision 13977456)
|
|
Sebevražda (revision 14053884)
|
|
Selektivní abstrakce (revision 13567590)
|
|
Selektivní inhibitor zpětného vychytávání serotoninu (revision 12955027)
|
|
Serotonin (revision 13975104)
|
|
Sertralin (revision 12955006)
|
|
Skupinová terapie (revision 11964235)
|
|
Sociální chování (revision 13507313)
|
|
Sociální dovednost (revision 12226347)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2016-09-21 03:28:11.731386
|
|
|
|
47 characters appeared 594800 times.
|
|
|
|
First 41 characters:
|
|
[ 0] Char o: 8.323806321452588 %
|
|
[ 1] Char e: 8.040013449899126 %
|
|
[ 2] Char n: 6.895595158036315 %
|
|
[ 3] Char a: 6.263113651647613 %
|
|
[ 4] Char i: 5.650470746469401 %
|
|
[ 5] Char t: 5.40383322125084 %
|
|
[ 6] Char s: 4.588937457969065 %
|
|
[ 7] Char v: 3.8685272360457295 %
|
|
[ 8] Char p: 3.6914929388029587 %
|
|
[ 9] Char r: 3.6302958977807664 %
|
|
[10] Char l: 3.6017148621385338 %
|
|
[11] Char í: 3.5733019502353733 %
|
|
[12] Char k: 3.301950235373235 %
|
|
[13] Char u: 3.1782111634162744 %
|
|
[14] Char c: 3.1383658372562206 %
|
|
[15] Char d: 3.120208473436449 %
|
|
[16] Char m: 2.758406186953598 %
|
|
[17] Char h: 2.2747141896435776 %
|
|
[18] Char á: 2.156186953597848 %
|
|
[19] Char z: 2.0260591795561536 %
|
|
[20] Char y: 1.9894082044384667 %
|
|
[21] Char j: 1.8979488903833224 %
|
|
[22] Char b: 1.8189307330195021 %
|
|
[23] Char ě: 1.277236045729657 %
|
|
[24] Char é: 1.2291526563550772 %
|
|
[25] Char č: 0.9502353732347008 %
|
|
[26] Char ž: 0.9214862138533961 %
|
|
[27] Char ř: 0.8955951580363146 %
|
|
[28] Char ý: 0.7646267652992602 %
|
|
[29] Char š: 0.6605581708137189 %
|
|
[30] Char f: 0.6260928043039677 %
|
|
[31] Char ů: 0.5016812373907196 %
|
|
[32] Char g: 0.47041022192333554 %
|
|
[33] Char ú: 0.19502353732347008 %
|
|
[34] Char x: 0.13685272360457296 %
|
|
[35] Char ň: 0.05447209145931405 %
|
|
[36] Char w: 0.04488903833221251 %
|
|
[37] Char ó: 0.03429724277067922 %
|
|
[38] Char ť: 0.02269670477471419 %
|
|
[39] Char ď: 0.012104909213180902 %
|
|
[40] Char q: 0.007229320780094149 %
|
|
|
|
The first 41 characters have an accumulated ratio of 0.9999613315400132.
|
|
|
|
1025 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.9786035192432675
|
|
Next 512 (512-1024): 1.6812373907195695e-06
|
|
Rest: 2.0246480655940202e-06
|
|
|
|
- Processing end: 2016-09-21 03:28:12.235582
|