All issues
- 2024 Vol. 16
- 2023 Vol. 15
- 2022 Vol. 14
- 2021 Vol. 13
- 2020 Vol. 12
- 2019 Vol. 11
- 2018 Vol. 10
- 2017 Vol. 9
- 2016 Vol. 8
- 2015 Vol. 7
- 2014 Vol. 6
- 2013 Vol. 5
- 2012 Vol. 4
- 2011 Vol. 3
- 2010 Vol. 2
- 2009 Vol. 1
-
Extracting knowledge from text messages: overview and state-of-the-art
Computer Research and Modeling, 2021, v. 13, no. 6, pp. 1291-1315In general, solving the information explosion problem can be delegated to systems for automatic processing of digital data. These systems are intended for recognizing, sorting, meaningfully processing and presenting data in formats readable and interpretable by humans. The creation of intelligent knowledge extraction systems that handle unstructured data would be a natural solution in this area. At the same time, the evident progress in these tasks for structured data contrasts with the limited success of unstructured data processing, and, in particular, document processing. Currently, this research area is undergoing active development and investigation. The present paper is a systematic survey on both Russian and international publications that are dedicated to the leading trend in automatic text data processing: Text Mining (TM). We cover the main tasks and notions of TM, as well as its place in the current AI landscape. Furthermore, we analyze the complications that arise during the processing of texts written in natural language (NLP) which are weakly structured and often provide ambiguous linguistic information. We describe the stages of text data preparation, cleaning, and selecting features which, alongside the data obtained via morphological, syntactic, and semantic analysis, constitute the input for the TM process. This process can be represented as mapping a set of text documents to «knowledge». Using the case of stock trading, we demonstrate the formalization of the problem of making a trade decision based on a set of analytical recommendations. Examples of such mappings are methods of Information Retrieval (IR), text summarization, sentiment analysis, document classification and clustering, etc. The common point of all tasks and techniques of TM is the selection of word forms and their derivatives used to recognize content in NL symbol sequences. Considering IR as an example, we examine classic types of search, such as searching for word forms, phrases, patterns and concepts. Additionally, we consider the augmentation of patterns with syntactic and semantic information. Next, we provide a general description of all NLP instruments: morphological, syntactic, semantic and pragmatic analysis. Finally, we end the paper with a comparative analysis of modern TM tools which can be helpful for selecting a suitable TM platform based on the user’s needs and skills.
-
Dynamics regimes of population with non-overlapping generations taking into account genetic and stage structures
Computer Research and Modeling, 2020, v. 12, no. 5, pp. 1165-1190This paper studies a model of a population with non-overlapping generations and density-dependent regulation of birth rate. The population breeds seasonally, and its reproductive potential is determined genetically. The model proposed combines an ecological dynamic model of a limited population with non-overlapping generations and microevolutionary model of its genetic structure dynamics for the case when adaptive trait of birth rate controlled by a single diallelic autosomal locus with allelomorphs A and a. The study showed the genetic composition of the population, namely, will it be polymorphic or monomorphic, is mainly determined by the values of the reproductive potentials of heterozygote and homozygotes. Moreover, the average reproductive potential of mature individuals and intensity of self-regulation processes determine population dynamics. In particularly, increasing the average value of the reproductive potential leads to destabilization of the dynamics of age group sizes. The intensity of self-regulation processes determines the nature of emerging oscillations, since scenario of stability loss of fixed points depends on the values of this parameter. It is shown that patterns of occurrence and evolution of cyclic dynamics regimes are mainly determined by the features of life cycle of individuals in population. The life cycle leading to existence of non-overlapping generation gives isolated subpopulations in different years, which results in the possibility of independent microevolution of these subpopulations and, as a result, the complex dynamics emergence of both stage structure and genetic one. Fixing various adaptive mutations will gradually lead to genetic (and possibly morphological) differentiation and to differences in the average reproductive potentials of subpopulations that give different values of equilibrium subpopulation sizes. Further evolutionary growth of reproductive potentials of limited subpopulations leads to their number fluctuations which can differ in both amplitude and phase.
Indexed in Scopus
Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU
The journal is included in the Russian Science Citation Index
The journal is included in the RSCI
International Interdisciplinary Conference "Mathematics. Computing. Education"