All issues
- 2024 Vol. 16
- 2023 Vol. 15
- 2022 Vol. 14
- 2021 Vol. 13
- 2020 Vol. 12
- 2019 Vol. 11
- 2018 Vol. 10
- 2017 Vol. 9
- 2016 Vol. 8
- 2015 Vol. 7
- 2014 Vol. 6
- 2013 Vol. 5
- 2012 Vol. 4
- 2011 Vol. 3
- 2010 Vol. 2
- 2009 Vol. 1
-
Extracting knowledge from text messages: overview and state-of-the-art
Computer Research and Modeling, 2021, v. 13, no. 6, pp. 1291-1315In general, solving the information explosion problem can be delegated to systems for automatic processing of digital data. These systems are intended for recognizing, sorting, meaningfully processing and presenting data in formats readable and interpretable by humans. The creation of intelligent knowledge extraction systems that handle unstructured data would be a natural solution in this area. At the same time, the evident progress in these tasks for structured data contrasts with the limited success of unstructured data processing, and, in particular, document processing. Currently, this research area is undergoing active development and investigation. The present paper is a systematic survey on both Russian and international publications that are dedicated to the leading trend in automatic text data processing: Text Mining (TM). We cover the main tasks and notions of TM, as well as its place in the current AI landscape. Furthermore, we analyze the complications that arise during the processing of texts written in natural language (NLP) which are weakly structured and often provide ambiguous linguistic information. We describe the stages of text data preparation, cleaning, and selecting features which, alongside the data obtained via morphological, syntactic, and semantic analysis, constitute the input for the TM process. This process can be represented as mapping a set of text documents to «knowledge». Using the case of stock trading, we demonstrate the formalization of the problem of making a trade decision based on a set of analytical recommendations. Examples of such mappings are methods of Information Retrieval (IR), text summarization, sentiment analysis, document classification and clustering, etc. The common point of all tasks and techniques of TM is the selection of word forms and their derivatives used to recognize content in NL symbol sequences. Considering IR as an example, we examine classic types of search, such as searching for word forms, phrases, patterns and concepts. Additionally, we consider the augmentation of patterns with syntactic and semantic information. Next, we provide a general description of all NLP instruments: morphological, syntactic, semantic and pragmatic analysis. Finally, we end the paper with a comparative analysis of modern TM tools which can be helpful for selecting a suitable TM platform based on the user’s needs and skills.
-
Hydrodynamical activation of blood coagulation in stenosed vessels. Theoretical analysis
Computer Research and Modeling, 2012, v. 4, no. 1, pp. 155-183Views (last year): 2. Citations: 5 (RSCI).The mechanisms of hydrodynamical activation of blood coagulation system are investigated in stenosed vessels for a wide range of Reynolds number values (from 10 up to 500). It is assumed that the vessel wall permeability for procoagulant factors rapidly increases when wall shear stress exceeds specific threshold value. A number of patterns of blood coagulation processes development are described. The influence of blood flow topology changes on activation of blood coagulation is explored. It is established that not only blood flow decrease, but also its increase may promote activation of blood coagulation. It was found that dependence of thrombogenic danger of stenosis on vessel lumen blockage ratio is non-monotonic. The relevance of obtained theoretical results for clinical practice is discussed.
-
Semantic structuring of text documents based on patterns of natural language entities
Computer Research and Modeling, 2022, v. 14, no. 5, pp. 1185-1197The technology of creating patterns from natural language words (concepts) based on text data in the bag of words model is considered. Patterns are used to reduce the dimension of the original space in the description of documents and search for semantically related words by topic. The process of dimensionality reduction is implemented through the formation of patterns of latent features. The variety of structures of document relations is investigated in order to divide them into themes in the latent space.
It is considered that a given set of documents (objects) is divided into two non-overlapping classes, for the analysis of which it is necessary to use a common dictionary. The belonging of words to a common vocabulary is initially unknown. Class objects are considered as opposition to each other. Quantitative parameters of oppositionality are determined through the values of the stability of each feature and generalized assessments of objects according to non-overlapping sets of features.
To calculate the stability, the feature values are divided into non-intersecting intervals, the optimal boundaries of which are determined by a special criterion. The maximum stability is achieved under the condition that the boundaries of each interval contain values of one of the two classes.
The composition of features in sets (patterns of words) is formed from a sequence ordered by stability values. The process of formation of patterns and latent features based on them is implemented according to the rules of hierarchical agglomerative grouping.
A set of latent features is used for cluster analysis of documents using metric grouping algorithms. The analysis applies the coefficient of content authenticity based on the data on the belonging of documents to classes. The coefficient is a numerical characteristic of the dominance of class representatives in groups.
To divide documents into topics, it is proposed to use the union of groups in relation to their centers. As patterns for each topic, a sequence of words ordered by frequency of occurrence from a common dictionary is considered.
The results of a computational experiment on collections of abstracts of scientific dissertations are presented. Sequences of words from the general dictionary on 4 topics are formed.
-
Pattern formation of a three-species predator – prey model with prey-taxis and omnivorous predator
Computer Research and Modeling, 2023, v. 15, no. 6, pp. 1617-1634The spatiotemporal dynamics of a three-component model for food web is considered. The model describes the interactions among resource, prey and predator that consumes both species. In a previous work, the author analyzed the model without taking into account spatial heterogeneity. This study continues the model study of the community considering the diffusion of individuals, as well as directed movements of the predator. It is assumed that the predator responds to the spatial change in the resource and prey density by occupying areas where species density is higher or avoiding them. Directed predator movement is described by the advection term, where velocity is proportional to the gradient of resource and prey density. The system is considered on a one-dimensional domain with zero-flux conditions as boundary ones. The spatiotemporal dynamics produced by model is determined by the system stability in the vicinity of stationary homogeneous state with respect to small inhomogeneous perturbations. The paper analyzes the possibility of wave instability leading to the emergence of autowaves and Turing instability, as a result of which stationary patterns are formed. Sufficient conditions for the existence of both types of instability are obtained. The influence of local kinetic parameters on the spatial structure formation was analyzed. It was shown that only Turing instability is possible when taxis on the resource is positive, but with a negative taxis, both types of instability are possible. The numerical solution of the system was found by using method of lines (MOL) with the numerical integration of ODE system by means of splitting techniques. The spatiotemporal dynamics of the system is presented in several variants, realizing one of the instability types. In the case of a positive taxis on the prey, both autowave and stationary structures are formed in smaller regions, with an increase in the region size, Turing structures are not formed. For negative taxis on the prey, stationary patterns is observed in both regions, while periodic structures appear only in larger areas.
-
Features of social interactions: the basic model
Computer Research and Modeling, 2023, v. 15, no. 6, pp. 1673-1693The paper considers the basic model of competitive interactions and its use for the analysis and description of social processes. The peculiarity of the model is that it describes the interaction of several competing actors, while actors can vary the strategy of their actions, in particular, form coalitions to jointly counter a common enemy. As a result of modeling, various modes of competitive interaction were identified, their classification was conducted, and their features were described. In the course of the study, the attention is paid to the so-called “rough” (according to A.A. Andronov) cases of the implementation of competitive interaction, which until now have rarely been considered in the scientific literature, but are quite common in real life. Using a basic mathematical model, the conditions for the implementation of various modes of competitive interactions are considered, the conditions for the transition from one mode to another are determined, examples of the implementation of these modes in the economy, social and political life are given. It is shown that with a relatively low level of competition, which is non-antagonistic in nature, competition can lead to an increase in the activity of interacting actors and to overall economic growth. Moreover, in the presence of expanding resource opportunities (as long as such opportunities remain), this growth may have a hyperbolic character. With a decrease in resource capabilities and increased competition, there is a transition to an oscillatory mode, when weaker actors unite to jointly counteract stronger ones. With a further decrease in resource opportunities and increased competition, there is a transition to the formation of stable hierarchical structures. At the same time, the model shows that at a certain moment there is a loss of stability, the system becomes “rough” according to A.A. Andronov and sensitive to fluctuations in parameter changes. As a result, the existing hierarchies may collapse and be replaced by new ones. With a further increase in the intensity of competition, the actor-leader completely suppresses his opponents and establishes monopolism. Examples from economic, social, and political life are given, illustrating the patterns identified on the basis of modeling using the basic model of competition. The obtained results can be used in the analysis, modeling and forecasting of socioeconomic and political processes.
Indexed in Scopus
Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU
The journal is included in the Russian Science Citation Index
The journal is included in the RSCI
International Interdisciplinary Conference "Mathematics. Computing. Education"