Form of presentation | Articles in international journals and collections |
Year of publication | 2014 |
|
Bochkarev Vladimir Vladimirovich, author
Lerner Eduard Yulevich, author
Shevlyakova Anna Vladimirovna, author
|
Bibliographic description in the original language |
Deviations in the Zipf and Heaps laws in natural languages.
J. Phys.: Conf. Ser., 2014, V.490, 012009 |
Annotation |
This paper is devoted to verifying of the empirical Zipf and Hips laws in natural languages using Google Books Ngram corpus data. The connection between the Zipf and Heaps law which predicts the power dependence of the vocabulary size on the text size is discussed. In fact, the Heaps exponent in this dependence varies with the increasing of the text
corpus. To explain it, the obtained results are compared with the probability model of text generation. Quasiperiodic variations with characteristic time periods of 60-100 years were also found. |
Keywords |
Zipf law, Heaps law, Google Books Ngram |
The name of the journal |
J. Phys. Conf. Series
|
URL |
http://iopscience.iop.org/1742-6596/490/1/012009/pdf/1742-6596_490_1_012009.pdf |
Please use this ID to quote from or refer to the card |
https://repository.kpfu.ru/eng/?p_id=95309&p_lang=2 |
Full metadata record |
Field DC |
Value |
Language |
dc.contributor.author |
Bochkarev Vladimir Vladimirovich |
ru_RU |
dc.contributor.author |
Lerner Eduard Yulevich |
ru_RU |
dc.contributor.author |
Shevlyakova Anna Vladimirovna |
ru_RU |
dc.date.accessioned |
2014-01-01T00:00:00Z |
ru_RU |
dc.date.available |
2014-01-01T00:00:00Z |
ru_RU |
dc.date.issued |
2014 |
ru_RU |
dc.identifier.citation |
Deviations in the Zipf and Heaps laws in natural languages.
J. Phys.: Conf. Ser., 2014, V.490, 012009 |
ru_RU |
dc.identifier.uri |
https://repository.kpfu.ru/eng/?p_id=95309&p_lang=2 |
ru_RU |
dc.description.abstract |
J. Phys. Conf. Series |
ru_RU |
dc.description.abstract |
This paper is devoted to verifying of the empirical Zipf and Hips laws in natural languages using Google Books Ngram corpus data. The connection between the Zipf and Heaps law which predicts the power dependence of the vocabulary size on the text size is discussed. In fact, the Heaps exponent in this dependence varies with the increasing of the text
corpus. To explain it, the obtained results are compared with the probability model of text generation. Quasiperiodic variations with characteristic time periods of 60-100 years were also found. |
ru_RU |
dc.language.iso |
ru |
ru_RU |
dc.subject |
Zipf law |
ru_RU |
dc.subject |
Heaps law |
ru_RU |
dc.subject |
Google Books Ngram |
ru_RU |
dc.title |
Deviations in the Zipf and Heaps laws in natural languages |
ru_RU |
dc.type |
Articles in international journals and collections |
ru_RU |
|