Форма представления | Статьи в зарубежных журналах и сборниках |
Год публикации | 2020 |
Язык | английский |
|
Иванов Владимир Владимирович, автор
Солнышкина Марина Ивановна, автор
Соловьев Валерий Дмитриевич, автор
|
Библиографическое описание на языке оригинала |
Solovyev V., Ivanov V., Solnyshkina M. (2020) Thesaurus-Based Methods for Assessment of Text Complexity in Russian. In: Martínez-Villaseñor L., Herrera-Alcántara O., Ponce H., Castro-Espinoza F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science, vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_14 |
Аннотация |
The study explores the problem of assessing complexity of Russian educational texts. In this paper, we focus on measuring conceptual complexity which is rarely selected as a research question and propose to use a thesaurus (or a linguistic ontology) to this end. We also compiled an original corpus of school textbooks on Social Studies, History used in high school, and textbooks for elementary school specifically for this set of text complexity experiments. On the first stage of the research, RuThes-Lite thesaurus, a linguistic knowledge base with the total size of 100,000 concepts, was used to elicit concepts in the texts of schoolbooks and represent them as graphs. To the best of our knowledge, we a new method for text complexity assessment using RuThes-Lite graphs and identify graphs-based semantic characteristics of texts that impact complexity. The most significant findings of the research include identification of statistically significant correlations of the selected features, such as node degree, with complexity of educational texts. |
Ключевые слова |
Text complexity, Thesaurus, Russian language |
Название журнала |
Lecture Notes in Computer Science Proceedings, Springer. - LNCS 3777.
|
URL |
https://link.springer.com/chapter/10.1007/978-3-030-60887-3_14 |
Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на эту карточку |
https://repository.kpfu.ru/?p_id=239434 |
Полная запись метаданных |
Поле DC |
Значение |
Язык |
dc.contributor.author |
Иванов Владимир Владимирович |
ru_RU |
dc.contributor.author |
Солнышкина Марина Ивановна |
ru_RU |
dc.contributor.author |
Соловьев Валерий Дмитриевич |
ru_RU |
dc.date.accessioned |
2020-01-01T00:00:00Z |
ru_RU |
dc.date.available |
2020-01-01T00:00:00Z |
ru_RU |
dc.date.issued |
2020 |
ru_RU |
dc.identifier.citation |
Solovyev V., Ivanov V., Solnyshkina M. (2020) Thesaurus-Based Methods for Assessment of Text Complexity in Russian. In: Martínez-Villaseñor L., Herrera-Alcántara O., Ponce H., Castro-Espinoza F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science, vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_14 |
ru_RU |
dc.identifier.uri |
https://repository.kpfu.ru/?p_id=239434 |
ru_RU |
dc.description.abstract |
Lecture Notes in Computer Science Proceedings, Springer. - LNCS 3777. |
ru_RU |
dc.description.abstract |
The study explores the problem of assessing complexity of Russian educational texts. In this paper, we focus on measuring conceptual complexity which is rarely selected as a research question and propose to use a thesaurus (or a linguistic ontology) to this end. We also compiled an original corpus of school textbooks on Social Studies, History used in high school, and textbooks for elementary school specifically for this set of text complexity experiments. On the first stage of the research, RuThes-Lite thesaurus, a linguistic knowledge base with the total size of 100,000 concepts, was used to elicit concepts in the texts of schoolbooks and represent them as graphs. To the best of our knowledge, we a new method for text complexity assessment using RuThes-Lite graphs and identify graphs-based semantic characteristics of texts that impact complexity. The most significant findings of the research include identification of statistically significant correlations of the selected features, such as node degree, with complexity of educational texts. |
ru_RU |
dc.language.iso |
ru |
ru_RU |
dc.subject |
Text complexity |
ru_RU |
dc.subject |
Thesaurus |
ru_RU |
dc.subject |
Russian language |
ru_RU |
dc.title |
Thesaurus-Based Methods for Assessment of Text Complexity in Russian |
ru_RU |
dc.type |
Статьи в зарубежных журналах и сборниках |
ru_RU |
|