Mirror of https://gitlab.freedesktop.org/uchardet/uchardet.git (synced 2025-12-06 08:46:40 +08:00)
I also added pairings with ISO-8859-9, ISO-8859-15 and Windows-1252. However, these charsets do not differ on the main characters used in Portuguese, so they can hardly be told apart, and detection will usually return ISO-8859-1 only.
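The statement that these charsets agree on Portuguese letters can be checked directly. The following is a minimal sketch, not part of uchardet: it assumes Python's standard codec names (`iso8859_1`, `iso8859_9`, `iso8859_15`, `cp1252`) and takes as its alphabet the accented letters listed in this log's frequency table plus their uppercase forms. The helper name `encodings_agree` is hypothetical.

```python
# Accented letters that appear in the Portuguese frequency table of this log.
PT_ACCENTED = "éãçíáóêõúâàô"
LETTERS = PT_ACCENTED + PT_ACCENTED.upper()

# Python codec names for the four single-byte charsets being compared.
CHARSETS = ["iso8859_1", "iso8859_9", "iso8859_15", "cp1252"]


def encodings_agree(text, charsets=CHARSETS):
    """Return True if every charset encodes `text` to the same bytes."""
    encoded = {text.encode(cs) for cs in charsets}
    return len(encoded) == 1


if __name__ == "__main__":
    print(encodings_agree(LETTERS))
    # A character outside the Portuguese set does separate the charsets,
    # e.g. the euro sign: 0xA4 in ISO-8859-15, 0x80 in Windows-1252.
    print("€".encode("iso8859_15"), "€".encode("cp1252"))
```

Because the byte values are identical for these letters, a frequency-based detector has nothing to distinguish the charsets by on ordinary Portuguese text.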
167 lines
5.6 KiB
Plaintext
= Logs of language model for Portuguese (pt) =

- Generated by BuildLangModel.py
- Started: 2016-09-20 23:44:39.722451
- Maximum depth: 5
- Max number of pages: 100

== Parsed pages ==

Papagaio-das-mascarenhas (revision 46763149)
Albinismo (revision 46498446)
Alfred Newton (revision 43617011)
Alphonse Milne-Edwards (revision 39740747)
Animalia (revision 46727732)
Asa (revision 46338820)
August von Pelzeln (revision 34726241)
Aves (revision 46728980)
Bico (revision 45311553)
Carl Wilhelm Hahn (revision 45025566)
Carlos Lineu (revision 46625396)
Carolus Linnaeus (revision 46625396)
Cauda (revision 43275401)
Charles Lucien Bonaparte (revision 45529712)
Chordata (revision 46640101)
Cladograma (revision 46700307)
Classe (biologia) (revision 46701409)
Classificação científica (revision 46306288)
Coleção Leverian (revision 45026647)
Comores (revision 46181501)
Coracopsinae (revision 36946101)
Coracopsis nigra (revision 44338845)
Coracopsis vasa (revision 42905822)
Cylindraspis indica (revision 42905410)
Cúlmen (revision 45311553)
Digital object identifier (revision 42172651)
Eclectus roratus (revision 44380798)
Edward Newton (revision 39261469)
Endemismo (revision 45260961)
Epíteto específico (revision 35101647)
Espécie (revision 45685675)
Esquilo-vermelho (revision 43489595)
Estado de conservação (revision 46662839)
Extinção (revision 46526607)
Família (biologia) (revision 46636004)
Filo (revision 46704246)
França (revision 46740839)
François-Nicolas Martinet (revision 43679514)
François Levaillant (revision 40142351)
Fredrik Hasselqvist (revision 44381122)
Fregilupus varius (revision 46555765)
Fumigação (revision 42458244)
George Robert Gray (revision 39047844)
Georges-Louis Leclerc, conde de Buffon (revision 45622418)
Género (biologia) (revision 45296588)
Hermann Schlegel (revision 43137605)
Herpetologista (revision 46207704)
Histoire Naturelle (revision 44293456)
Holótipo (revision 44029660)
Ilha da Reunião (revision 45458206)
Ilha vulcânica (revision 37924535)
Ilhas Mascarenhas (revision 45858660)
Ilhas Molucas (revision 45476933)
International Standard Book Number (revision 46326494)
Jacques Barraband (revision 45007769)
Jean Feuilley (revision 43140791)
Johann Georg Wagler (revision 34585234)
John Gerrard Keulemans (revision 39664498)
Julian Hume (revision 41876605)
Leiolopisma (revision 43997173)
Lionel Walter Rothschild (revision 46022922)
Lista Vermelha da IUCN (revision 46569884)
Lista Vermelha da União Internacional para a Conservação da Natureza e dos Recursos Naturais (revision 46569884)
Lista Vermelha de Espécies Ameaçadas da IUCN (revision 46569884)
Lista de aves extintas (revision 45507420)
Londres (revision 46310311)
Língua inglesa (revision 46609785)
Madagascar (revision 46617630)
Mascarenotus grucheti (revision 43145662)
Mathurin Jacques Brisson (revision 36018826)
Maurício (revision 46723599)
Maximiliano I José da Baviera (revision 46372080)
Melanina (revision 46762903)
Museu Nacional de História Natural (França) (revision 43731807)
Naturhistorisches Museum (revision 46694247)
Nesoenas duboisi (revision 43995805)
Nome científico (revision 46671641)
Nomenclatura binomial (revision 46671641)
Nycticorax duboisi (revision 43816214)
Nível do mar (revision 46414695)
Ordem (biologia) (revision 46360024)
Otto Finsch (revision 42362273)
Papagaio (revision 46738207)
Papagaio-cinzento (revision 46673943)
Papagaio-cinzento-de-maurício (revision 46664408)
Pedro Mascarenhas (c. 1484-1555) (revision 45541977)
Periquito-de-maurício (revision 43010883)
Periquito-de-reunião (revision 43048764)
Peter Mundy (revision 43563846)
Piton des Neiges (revision 45632497)
Pleistoceno (revision 45916874)
Plumagem (revision 34951058)
Ponto quente (revision 45375495)
Porphyrio coerulescens (revision 43672493)
Praslin (revision 40728143)
Psitacídeos (revision 46598835)
Psittaciformes (revision 46598835)
Psittacula (revision 42856453)
Psittaculinae (revision 46760737)
Psittaculini (revision 43015966)
Psittrichasiidae (revision 44385977)

== End of Parsed pages ==

- Wikipedia parsing ended at: 2016-09-20 23:47:27.346826

51 characters appeared 558324 times.

First 38 characters:
[ 0] Char a: 11.864795351802895 %
[ 1] Char e: 11.44604208309154 %
[ 2] Char o: 9.868284365350585 %
[ 3] Char s: 8.346587286235232 %
[ 4] Char i: 7.118089138206489 %
[ 5] Char r: 6.394136737808154 %
[ 6] Char n: 5.568272186042513 %
[ 7] Char d: 5.243192125002687 %
[ 8] Char t: 4.80061756256224 %
[ 9] Char m: 4.498105042949971 %
[10] Char c: 3.9747530107965985 %
[11] Char u: 3.7229279056605127 %
[12] Char l: 3.207814817202914 %
[13] Char p: 2.77562848811801 %
[14] Char g: 1.3850380782484721 %
[15] Char v: 1.3210967108703908 %
[16] Char f: 1.122466524813549 %
[17] Char b: 0.9702251739133549 %
[18] Char h: 0.9130898904578704 %
[19] Char é: 0.7026386112723079 %
[20] Char ã: 0.7022803963290133 %
[21] Char q: 0.5903382265494588 %
[22] Char ç: 0.5856814322866293 %
[23] Char í: 0.41391736697688086 %
[24] Char x: 0.3913498255493226 %
[25] Char á: 0.34567742027926435 %
[26] Char z: 0.3170202248156984 %
[27] Char ó: 0.22925756370852768 %
[28] Char j: 0.20454073262120204 %
[29] Char ê: 0.20239144296143458 %
[30] Char õ: 0.16155493942585308 %
[31] Char y: 0.15080849112701586 %
[32] Char w: 0.09241945537000021 %
[33] Char ú: 0.08794176857881804 %
[34] Char k: 0.08364318925928313 %
[35] Char â: 0.07898639499645367 %
[36] Char à: 0.06859816164091102 %
[37] Char ô: 0.031164700066627977 %

The first 38 characters have an accumulated ratio of 0.9998137282294869.
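
The character figures above are per-character shares of all counted characters, followed by the summed share of the top-ranked ones. A minimal sketch of that computation follows. It is not the code of BuildLangModel.py: the function name `char_stats`, the lowercasing, and the letters-only filter are assumptions made for illustration.

```python
from collections import Counter


def char_stats(text, keep=38):
    """Rank letters by frequency and sum the shares of the top `keep`."""
    # Assumption of this sketch: fold case and ignore non-letters.
    counts = Counter(ch for ch in text.lower() if ch.isalpha())
    total = sum(counts.values())
    if total == 0:
        return [], 0.0
    ranked = counts.most_common()
    # Percentage of all counted characters held by each character.
    percents = [(ch, 100.0 * n / total) for ch, n in ranked]
    # Fraction (not percent) covered by the `keep` most frequent characters.
    accumulated = sum(n for _, n in ranked[:keep]) / total
    return percents, accumulated


if __name__ == "__main__":
    sample = "A extinção da espécie ocorreu na ilha."
    percents, accumulated = char_stats(sample, keep=5)
    for rank, (ch, pct) in enumerate(percents[:5]):
        print(f"[{rank:2}] Char {ch}: {pct} %")
    print(f"The first 5 characters have an accumulated ratio of {accumulated}.")
```

Read this way, an accumulated ratio of 0.9998 means the 13 characters outside the top 38 (51 were seen in total) account for roughly 0.02 % of the counted text.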

891 sequences found.

First 512 (typical positive ratio): 0.9953179582313172
Next 512 (512-1024): 1.7910747164728723e-06
Rest: 2.42861286636753e-17
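
The sequence figures above describe how much of the observed two-character sequences falls into the 512 most frequent ones, the next 512, and the remainder. The sketch below shows one way such buckets could be computed. It is an illustration under stated assumptions, not BuildLangModel.py's implementation: the name `sequence_ratios`, the case folding, and the rule that a pair must consist of two adjacent letters are all hypothetical. In this sketch the three buckets sum to 1 by construction, whereas the figures in this log do not, so the real script presumably uses a different denominator.

```python
from collections import Counter


def sequence_ratios(text, first=512, second=1024):
    """Share of all two-letter sequences held by each frequency bucket."""
    text = text.lower()
    # Count pairs of adjacent letters; pairs touching a non-letter are skipped.
    pairs = Counter(
        a + b for a, b in zip(text, text[1:]) if a.isalpha() and b.isalpha()
    )
    total = sum(pairs.values())
    if total == 0:
        return 0, 0.0, 0.0, 0.0
    # Occurrence counts, most frequent sequence first.
    ranked = [n for _, n in pairs.most_common()]
    top = sum(ranked[:first]) / total
    middle = sum(ranked[first:second]) / total
    rest = sum(ranked[second:]) / total
    return len(pairs), top, middle, rest


if __name__ == "__main__":
    found, top, middle, rest = sequence_ratios("O papagaio era endémico da ilha.")
    print(f"{found} sequences found.")
    print(f"First 512: {top}")
    print(f"Next 512: {middle}")
    print(f"Rest: {rest}")
```

With only 891 distinct sequences found, everything beyond the first 512 fits in the second bucket, which is consistent with the "Rest" value being indistinguishable from zero.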

- Processing end: 2016-09-20 23:47:27.489355