Home // Cultura // Linguagens esfriam enquanto expandem…

Linguagens esfriam enquanto expandem…


 Languages cool as they expand: Allometric scaling and the decreasing need for new words


Alexander M. PetersenJoel N. TenenbaumShlomo HavlinH. Eugene StanleyMatjaz Perc
(Submitted on 11 Dec 2012)

We analyze the occurrence frequencies of over 15 million words recorded in millions of books published during the past two centuries in seven different languages. For all languages and chronological subsets of the data we confirm that two scaling regimes characterize the word frequency distributions, with only the more common words obeying the classic Zipf law. Using corpora of unprecedented size, we test the allometric scaling relation between the corpus size and the vocabulary size of growing languages to demonstrate a decreasing marginal need for new words, a feature that is likely related to the underlying correlations between words. We calculate the annual growth fluctuations of word use which has a decreasing trend as the corpus size increases, indicating a slowdown in linguistic evolution following language expansion. This “cooling pattern” forms the basis of a third statistical regularity, which unlike the Zipf and the Heaps law, is dynamical in nature.

Comments: 9 two-column pages, 7 figures; accepted for publication in Scientific Reports
Subjects: Physics and Society (physics.soc-ph); Statistical Mechanics (cond-mat.stat-mech); Computation and Language (cs.CL); Applications (stat.AP)
Journal reference: Sci. Rep. 2 (2012) 943
DOI: 10.1038/srep00943
Cite as: arXiv:1212.2616 [physics.soc-ph]
(or arXiv:1212.2616v1 [physics.soc-ph] for this version)

Submission history

Posted in Cultura, Sem categoria and tagged as , , , , , , , , , ,

Deixe uma resposta

O seu endereço de e-mail não será publicado. Campos obrigatórios são marcados com *

Esse site utiliza o Akismet para reduzir spam. Aprenda como seus dados de comentários são processados.