Результаты поиска по 'decision-making':
Найдено статей: 34
  1. Lubashevsky I.A., Lubashevskiy V.I.
    Dynamical trap model for stimulus – response dynamics of human control
    Computer Research and Modeling, 2024, v. 16, no. 1, pp. 79-87

    We present a novel model for the dynamical trap of the stimulus – response type that mimics human control over dynamic systems when the bounded capacity of human cognition is a crucial factor. Our focus lies on scenarios where the subject modulates a control variable in response to a certain stimulus. In this context, the bounded capacity of human cognition manifests in the uncertainty of stimulus perception and the subsequent actions of the subject. The model suggests that when the stimulus intensity falls below the (blurred) threshold of stimulus perception, the subject suspends the control and maintains the control variable near zero with accuracy determined by the control uncertainty. As the stimulus intensity grows above the perception uncertainty and becomes accessible to human cognition, the subject activates control. Consequently, the system dynamics can be conceptualized as an alternating sequence of passive and active modes of control with probabilistic transitions between them. Moreover, these transitions are expected to display hysteresis due to decision-making inertia.

    Generally, the passive and active modes of human control are governed by different mechanisms, posing challenges in developing efficient algorithms for their description and numerical simulation. The proposed model overcomes this problem by introducing the dynamical trap of the stimulus-response type, which has a complex structure. The dynamical trap region includes two subregions: the stagnation region and the hysteresis region. The model is based on the formalism of stochastic differential equations, capturing both probabilistic transitions between control suspension and activation as well as the internal dynamics of these modes within a unified framework. It reproduces the expected properties in control suspension and activation, probabilistic transitions between them, and hysteresis near the perception threshold. Additionally, in a limiting case, the model demonstrates the capability of mimicking a similar subject’s behavior when (1) the active mode represents an open-loop implementation of locally planned actions and (2) the control activation occurs only when the stimulus intensity grows substantially and the risk of the subject losing the control over the system dynamics becomes essential.

  2. The paper presents a physico-mathematical model of the perturbed region formed in the lower D-layer of the ionosphere under the action of directed radio emission flux from a terrestrial stand of the megahertz frequency range, obtained as a result of comprehensive theoretical studies. The model is based on the consideration of a wide range of kinetic processes taking into account their nonequilibrium and in the two-temperature approximation for describing the transformation of the radio beam energy absorbed by electrons. The initial data on radio emission achieved by the most powerful radio-heating stands are taken in the paper. Their basic characteristics and principles of functioning, and features of the altitude distribution of the absorbed electromagnetic energy of the radio beam are briefly described. The paper presents the decisive role of the D-layer of the ionosphere in the absorption of the energy of the radio beam. On the basis of theoretical analysis, analytical expressions are obtained for the contribution of various inelastic processes to the distribution of the absorbed energy, which makes it possible to correctly describe the contribution of each of the processes considered. The work considers more than 60 components. The change of the component concentration describe about 160 reactions. All the reactions are divided into five groups according to their physical content: ionization-chemical block, excitation block of metastable electronic states, cluster block, excitation block of vibrational states and block of impurities. Blocks are interrelated and can be calculated both jointly and separately. The paper presents the behavior of the parameters of the perturbed region in daytime and nighttime conditions is significantly different at the same radio flux density: under day conditions, the maximum electron concentration and temperature are at an altitude of ~45–55 km; in night ~80 km, with the temperature of heavy particles rapidly increasing, which leads to the occurrence of a gas-dynamic flow. Therefore, a special numerical algorithm are developed to solve two basic problems: kinetic and gas dynamic. Based on the altitude and temporal behavior of concentrations and temperatures, the algorithm makes it possible to determine the ionization and emission of the ionosphere in the visible and infrared spectral range, which makes it possible to evaluate the influence of the perturbed region on radio engineering and optoelectronic devices used in space technology.

    Views (last year): 17.
  3. Rudenko V.D., Yudin N.E., Vasin A.A.
    Survey of convex optimization of Markov decision processes
    Computer Research and Modeling, 2023, v. 15, no. 2, pp. 329-353

    This article reviews both historical achievements and modern results in the field of Markov Decision Process (MDP) and convex optimization. This review is the first attempt to cover the field of reinforcement learning in Russian in the context of convex optimization. The fundamental Bellman equation and the criteria of optimality of policy — strategies based on it, which make decisions based on the known state of the environment at the moment, are considered. The main iterative algorithms of policy optimization based on the solution of the Bellman equations are also considered. An important section of this article was the consideration of an alternative to the $Q$-learning approach — the method of direct maximization of the agent’s average reward for the chosen strategy from interaction with the environment. Thus, the solution of this convex optimization problem can be represented as a linear programming problem. The paper demonstrates how the convex optimization apparatus is used to solve the problem of Reinforcement Learning (RL). In particular, it is shown how the concept of strong duality allows us to naturally modify the formulation of the RL problem, showing the equivalence between maximizing the agent’s reward and finding his optimal strategy. The paper also discusses the complexity of MDP optimization with respect to the number of state–action–reward triples obtained as a result of interaction with the environment. The optimal limits of the MDP solution complexity are presented in the case of an ergodic process with an infinite horizon, as well as in the case of a non-stationary process with a finite horizon, which can be restarted several times in a row or immediately run in parallel in several threads. The review also reviews the latest results on reducing the gap between the lower and upper estimates of the complexity of MDP optimization with average remuneration (Averaged MDP, AMDP). In conclusion, the real-valued parametrization of agent policy and a class of gradient optimization methods through maximizing the $Q$-function of value are considered. In particular, a special class of MDPs with restrictions on the value of policy (Constrained Markov Decision Process, CMDP) is presented, for which a general direct-dual approach to optimization with strong duality is proposed.

  4. The main aim, formulated in the first part of article, is to carry out detailed numerical studies of the chemical, ionization, optical, and temperature characteristics of the lower ionosphere perturbed by powerful radio emission. The brief review of the main experimental and theoretical researches of physical phenomena occurring in the ionosphere when it is heated by high-power high-frequency radio waves from heating facilities is given. The decisive role of the $D$-region of the ionosphere in the absorption of radio beam energy is shown. A detailed analysis of kinetic processes in the disturbed $D$-region, which is the most complex in kinetic terms, has been performed. It is shown that for a complete description of the ionization-chemical and optical characteristics of the disturbed region, it is necessary to take into account more than 70 components, which, according to their main physical content, can be conveniently divided into five groups. A kinetic model is presented to describe changes in the concentrations of components interacting (the total number of reactions is 259). The system of kinetic equations was solved using a semi-implicit numerical method specially adapted to such problems. Based on the proposed structure, a software package was developed in which the algorithm scheme allowed changing both the content of individual program blocks and their number, which made it possible to conduct detailed numerical studies of individual processes in the behavior of the parameters of the perturbed region. The complete numerical algorithm is based on the two-temperature approximation, in which the main attention was paid to the calculation of the electron temperature, since its behavior is determined by inelastic kinetic processes involving electrons. The formulation of the problem is of a rather general nature and makes it possible to calculate the parameters of the disturbed ionosphere in a wide range of powers and frequencies of radio emission. Based on the developed numerical technique, it is possible to study a wide range of phenomena both in the natural and disturbed ionosphere.

  5. Suvorov N.V., Shleymovich M.P.
    Mathematical model of the biometric iris recognition system
    Computer Research and Modeling, 2020, v. 12, no. 3, pp. 629-639

    Automatic recognition of personal identity by biometric features is based on unique peculiarities or characteristics of people. Biometric identification process consist in making of reference templates and comparison with new input data. Iris pattern recognition algorithms presents high accuracy and low identification errors percent on practice. Iris pattern advantages over other biometric features are determined by its high degree of freedom (nearly 249), excessive density of unique features and constancy. High recognition reliability level is very important because it provides search in big databases. Unlike one-to-one check mode that is applicable only to small calculation count it allows to work in one-to-many identification mode. Every biometric identification system appears to be probabilistic and qualitative characteristics description utilizes such parameters as: recognition accuracy, false acceptance rate and false rejection rate. These characteristics allows to compare identity recognition methods and asses the system performance under any circumstances. This article explains the mathematical model of iris pattern biometric identification and its characteristics. Besides, there are analyzed results of comparison of model and real recognition process. To make such analysis there was carried out the review of existing iris pattern recognition methods based on different unique features vector. The Python-based software package is described below. It builds-up probabilistic distributions and generates large test data sets. Such data sets can be also used to educate the identification decision making neural network. Furthermore, synergy algorithm of several iris pattern identification methods was suggested to increase qualitative characteristics of system in comparison with the use of each method separately.

  6. Malkov S.Yu., Shpyrko O.A.
    Formalized decision-making model: taking into account value motivation
    Computer Research and Modeling, 2025, v. 17, no. 2, pp. 323-338

    The paper considers the problems of mathematical description of deontological aspects influencing the behavior of decision makers. A methodology is proposed for correlating utilitarian (material) and deontological (value) aspects in their decision-making, taking into account their psychological characteristics. A mathematical model is proposed for the joint consideration of utilitarian and deontological factors in decision-making in various situations. Some patterns related to this consideration are identified, and their formal description is given. The model shows that there is a tendency for a gradual decrease in the level of deontology in evaluating alternatives when making decisions (compared to what the outside world inclines to) towards greater utilitarianism. Over time, this trend begins to influence public opinion and society’s attitude to moral norms, gradually reducing the overall level of morality in society. This process can be stopped only by constantly and purposefully maintaining a high level of deontology by society and the state (ideological work, promotion of traditional values, educational work at school, etc.), otherwise society will inevitably become utilitarian over time, focusing exclusively on material factors when making decisions.

    In the future, it is planned to use the developed tools for analyzing specific situations, including for analyzing the patterns of civilizational cycles: the rise and fall of the Roman Empire, the USSR, and modern Western civilization).

  7. Shamiev M.O., Trofimov A.G.
    Learning spatio-temporal precursors of dam instability using a CNN–BiGRU framework
    Computer Research and Modeling, 2026, v. 18, no. 2, pp. 377-397

    Dam safety assessment increasingly relies on continuous monitoring of hydrometeorological variables; however, identifying early-stage instability remains challenging due to complex spatio-temporal interactions and highly imbalanced failure observations. This study proposes a deep learning framework based on a Convolutional Bidirectional Gated Recurrent Unit (CNN–BiGRU) architecture to learn spatio-temporal precursors of dam instability from multivariate hydrometeorological time series. The convolutional component extracts localized temporal patterns associated with short-term fluctuations, while the bidirectional recurrent structure captures long-range dependencies and evolving dynamics preceding critical states.

    The proposed model is evaluated on a real-world dam monitoring dataset comprising multiple water-level, meteorological, and derived dynamic indicators. To address class imbalance, a cost-sensitive training strategy using class weighting is adopted without synthetic oversampling. Experimental results demonstrate strong predictive performance, achieving an accuracy of 0.961, precision of 0.901, recall of 0.757, and an F1-score of 0.823. The model further attains a ROC-AUC of 0.907 and a PR-AUC of 0.819, indicating robust discrimination capability under imbalanced conditions.

    Feature importance analysis reveals that short- and medium-term water level variability, including rolling standard deviation, volatility, and multi-scale gradients, play a dominant role in characterizing pre-instability behavior, providing physically interpretable insights into dam response dynamics. The findings suggest that the CNN–BiGRU framework effectively captures meaningful spatio-temporal precursors and offers a reliable data-driven tool for supporting dam safety monitoring and decision-making under real operational conditions.

  8. Orlova E.V.
    Model for economic interests agreement in duopoly’s making price decisions
    Computer Research and Modeling, 2015, v. 7, no. 6, pp. 1309-1329

    The model of market pricing in duopoly describing the prices dynamics as a two-dimensional map is presented. It is shown that the fixed point of the map coincides with the local Nash-equilibrium price in duopoly game. There have been numerically identified a bifurcation of the fixed point, shown the scheme of transition from periodic to chaotic mode through a doubling period. To ensure the sustainability of local Nashequilibrium price the controlling chaos mechanism has been proposed. This mechanism allows to harmonize the economic interests of the firms and to form the balanced pricing policy.

    Views (last year): 10. Citations: 2 (RSCI).
  9. Varshavsky L.E.
    Studying indicators of development of oligopolistic markets on the basis of operational calculus
    Computer Research and Modeling, 2019, v. 11, no. 5, pp. 949-963

    The traditional approach to computing optimal game strategies of firms on oligopolistic markets and of indicators of such markets consists in studying linear dynamical games with quadratic criteria and solving generalized matrix Riccati equations.

    The other approach proposed by the author is based on methods of operational calculus (in particular, Z-transform). This approach makes it possible to achieve economic meaningful decisions under wider field of parameter values. It characterizes by simplicity of computations and by necessary for economic analysis visibility. One of its advantages is that in many cases important for economic practice, it, in contrast to the traditional approach, provides the ability to make calculations using widespread spreadsheets, which allows to study the prospects for the development of oligopolistic markets to a wide range of professionals and consumers.

    The article deals with the practical aspects of determining the optimal Nash–Cournot strategies of participants in oligopolistic markets on the basis of operational calculus, in particular the technique of computing the optimal Nash–Cournot strategies in Excel. As an illustration of the opportinities of the proposed methods of calculation, examples close to the practical problems of forecasting indicators of the markets of high-tech products are studied.

    The results of calculations obtained by the author for numerous examples and real economic systems, both using the obtained relations on the basis of spreadsheets and using extended Riccati equations, are very close. In most of the considered practical problems, the deviation of the indicators calculated in accordance with the two approaches, as a rule, does not exceed 1.5–2%. The highest value of relative deviations (up to 3–5%) is observed at the beginning of the forecasting period. In typical cases, the period of relatively noticeable deviations is 3–5 moments of time. After the transition period, there is almost complete agreement of the values of the required indicators using both approaches.

  10. Petrov A.P., Podlipskaia O.G., Pronchev G.B.
    Modeling the dynamics of public attention to extended processes on the example of the COVID-19 pandemic
    Computer Research and Modeling, 2022, v. 14, no. 5, pp. 1131-1141

    The dynamics of public attention to COVID-19 epidemic is studied. The level of public attention is described by the daily number of search requests in Google made by users from a given country. In the empirical part of the work, data on the number of requests and the number of infected cases for a number of countries are considered. It is shown that in all cases the maximum of public attention occurs earlier than the maximum daily number of newly infected individuals. Thus, for a certain period of time, the growth of the epidemics occurs in parallel with the decline in public attention to it. It is also shown that the decline in the number of requests is described by an exponential function of time. In order to describe the revealed empirical pattern, a mathematical model is proposed, which is a modification of the model of the decline in attention after a one-time political event. The model develops the approach that considers decision-making by an individual as a member of the society in which the information process takes place. This approach assumes that an individual’s decision about whether or not to make a request on a given day about COVID is based on two factors. One of them is an attitude that reflects the individual’s long-term interest in a given topic and accumulates the individual’s previous experience, cultural preferences, social and economic status. The second is the dynamic factor of public attention to the epidemic, which changes during the process under consideration under the influence of informational stimuli. With regard to the subject under consideration, information stimuli are related to epidemic dynamics. The behavioral hypothesis is that if on some day the sum of the attitude and the dynamic factor exceeds a certain threshold value, then on that day the individual in question makes a search request on the topic of COVID. The general logic is that the higher the rate of infection growth, the higher the information stimulus, the slower decreases public attention to the pandemic. Thus, the constructed model made it possible to correlate the rate of exponential decrease in the number of requests with the rate of growth in the number of cases. The regularity found with the help of the model was tested on empirical data. It was found that the Student’s statistic is 4.56, which allows us to reject the hypothesis of the absence of a correlation with a significance level of 0.01.

Pages: previous next last »

Indexed in Scopus

Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU

The journal is included in the Russian Science Citation Index

The journal is included in the RSCI

International Interdisciplinary Conference "Mathematics. Computing. Education"