All issues
- 2024 Vol. 16
- 2023 Vol. 15
- 2022 Vol. 14
- 2021 Vol. 13
- 2020 Vol. 12
- 2019 Vol. 11
- 2018 Vol. 10
- 2017 Vol. 9
- 2016 Vol. 8
- 2015 Vol. 7
- 2014 Vol. 6
- 2013 Vol. 5
- 2012 Vol. 4
- 2011 Vol. 3
- 2010 Vol. 2
- 2009 Vol. 1
-
Image classification based on deep learning with automatic relevance determination and structured Bayesian pruning
Computer Research and Modeling, 2024, v. 16, no. 4, pp. 927-938Deep learning’s power stems from complex architectures; however, these can lead to overfitting, where models memorize training data and fail to generalize to unseen examples. This paper proposes a novel probabilistic approach to mitigate this issue. We introduce two key elements: Truncated Log-Uniform Prior and Truncated Log-Normal Variational Approximation, and Automatic Relevance Determination (ARD) with Bayesian Deep Neural Networks (BDNNs). Within the probabilistic framework, we employ a specially designed truncated log-uniform prior for noise. This prior acts as a regularizer, guiding the learning process towards simpler solutions and reducing overfitting. Additionally, a truncated log-normal variational approximation is used for efficient handling of the complex probability distributions inherent in deep learning models. ARD automatically identifies and removes irrelevant features or weights within a model. By integrating ARD with BDNNs, where weights have a probability distribution, we achieve a variational bound similar to the popular variational dropout technique. Dropout randomly drops neurons during training, encouraging the model not to rely heavily on any single feature. Our approach with ARD achieves similar benefits without the randomness of dropout, potentially leading to more stable training.
To evaluate our approach, we have tested the model on two datasets: the Canadian Institute For Advanced Research (CIFAR-10) for image classification and a dataset of Macroscopic Images of Wood, which is compiled from multiple macroscopic images of wood datasets. Our method is applied to established architectures like Visual Geometry Group (VGG) and Residual Network (ResNet). The results demonstrate significant improvements. The model reduced overfitting while maintaining, or even improving, the accuracy of the network’s predictions on classification tasks. This validates the effectiveness of our approach in enhancing the performance and generalization capabilities of deep learning models.
-
Modelling diameter measurement errors of a wide-aperture laser beam with flat profile
Computer Research and Modeling, 2015, v. 7, no. 1, pp. 113-124Views (last year): 3. Citations: 3 (RSCI).Work is devoted to modeling instrumental errors of a laser beam diameter measurement using a method based on a lambertian transmissive screen. Super-Lorenz distribution was used as a model of the beam. To determine the effect of each parameter on the measurement error were performed computational experiments, results of which were approximated by analytic functions. There were obtained the errors depending on relative beam size, spatial non-uniformity of the transmission screen, lens distortion, physical vignetting, beam tilt, CCD spatial resolution, ADC resolution of a camera. There was shown that the error can be less then 1 %.
-
Languages in China provinces: quantitative estimation with incomplete data
Computer Research and Modeling, 2016, v. 8, no. 4, pp. 707-716Views (last year): 3.This paper formulates and solves a practical problem of data recovery regarding the distribution of languages on regional level in context of China. The necessity of this recovery is related to the problem of the determination of the linguistic diversity indices, which, in turn, are used to analyze empirically and to predict sources of social and economic development as well as to indicate potential conflicts at regional level. We use Ethnologue database and China census as the initial data sources. For every language spoken in China, the data contains (a) an estimate of China residents who claim this language to be their mother tongue, and (b) indicators of the presence of such residents in China provinces. For each pair language/province, we aim to estimate the number of the province inhabitants that claim the language to be their mother tongue. This base problem is reduced to solving an undetermined system of algebraic equations. Given additional restriction that Ethnologue database introduces data collected at different time moments because of gaps in Ethnologue language surveys and accompanying data collection expenses, we relate those data to a single time moment, that turns the initial task to an ’ill-posed’ system of algebraic equations with imprecisely determined right hand side. Therefore, we are looking for an approximate solution characterized by a minimal discrepancy of the system. Since some languages are much less distributed than the others, we minimize the weighted discrepancy, introducing weights that are inverse to the right hand side elements of the equations. This definition of discrepancy allows to recover the required variables. More than 92% of the recovered variables are robust to probabilistic modelling procedure for potential errors in initial data.
-
Mathematical modeling of thrombin propagation during blood coagulation
Computer Research and Modeling, 2017, v. 9, no. 3, pp. 469-486In case of vessel wall damage or contact of blood plasma with a foreign surface, the chain of chemical reactions called coagulation cascade is launched that leading to the formation of a fibrin clot. A key enzyme of the coagulation cascade is thrombin, which catalyzes formation of fibrin from fibrinogen. The distribution of thrombin concentration in blood plasma determines spatio-temporal dynamics of clot formation. Contact pathway of blood coagulation triggers the production of thrombin in response to the contact with a negatively charged surface. If the concentration of thrombin generated at this stage is large enough, further production of thrombin takes place due to positive feedback loops of the coagulation cascade. As a result, thrombin propagates in plasma cleaving fibrinogen that results in the clot formation. The concentration profile and the speed of propagation of thrombin are constant and do not depend on the type of the initial activator.
Such behavior of the coagulation system is well described by the traveling wave solutions in a system of “reaction – diffusion” equations on the concentration of blood factors involved in the coagulation cascade. In this study, we carried out detailed analysis of the mathematical model describing the main reaction of the intrinsic pathway of coagulation cascade.We formulate necessary and sufficient conditions of the existence of the traveling wave solutions. For the considered model the existence of such solutions is equivalent to the existence of the wave solutions in the simplified one-equation model describing the dynamics of thrombin concentration derived under the quasi-stationary approximation.
Simplified model also allows us to obtain analytical estimate of the thrombin propagation rate in the considered model. The speed of the traveling wave for one equation is estimated using the narrow reaction zone method and piecewise linear approximation. The resulting formulas give a good approximation of the velocity of propagation of thrombin in the simplified, as well as in the original model.
Keywords: traveling waves, blood coagulation.Views (last year): 10. Citations: 1 (RSCI). -
On some properties of short-wave statistics of FOREX time series
Computer Research and Modeling, 2017, v. 9, no. 4, pp. 657-669Views (last year): 10.Financial mathematics is one of the most natural applications for the statistical analysis of time series. Financial time series reflect simultaneous activity of a large number of different economic agents. Consequently, one expects that methods of statistical physics and the theory of random processes can be applied to them.
In this paper, we provide a statistical analysis of time series of the FOREX currency market. Of particular interest is the comparison of the time series behavior depending on the way time is measured: physical time versus trading time measured in the number of elementary price changes (ticks). The experimentally observed statistics of the time series under consideration (euro–dollar for the first half of 2007 and for 2009 and British pound – dollar for 2007) radically differs depending on the choice of the method of time measurement. When measuring time in ticks, the distribution of price increments can be well described by the normal distribution already on a scale of the order of ten ticks. At the same time, when price increments are measured in real physical time, the distribution of increments continues to differ radically from the normal up to scales of the order of minutes and even hours.
To explain this phenomenon, we investigate the statistical properties of elementary increments in price and time. In particular, we show that the distribution of time between ticks for all three time series has a long (1-2 orders of magnitude) power-law tails with exponential cutoff at large times. We obtained approximate expressions for the distributions of waiting times for all three cases. Other statistical characteristics of the time series (the distribution of elementary price changes, pair correlation functions for price increments and for waiting times) demonstrate fairly simple behavior. Thus, it is the anomalously wide distribution of the waiting times that plays the most important role in the deviation of the distribution of increments from the normal. As a result, we discuss the possibility of applying a continuous time random walk (CTRW) model to describe the FOREX time series.
-
Methods and problems in the kinetic approach for simulating biological structures
Computer Research and Modeling, 2018, v. 10, no. 6, pp. 851-866Views (last year): 31.The biological structure is considered as an open nonequilibrium system which properties can be described on the basis of kinetic equations. New problems with nonequilibrium boundary conditions are introduced. The nonequilibrium distribution tends gradually to an equilibrium state. The region of spatial inhomogeneity has a scale depending on the rate of mass transfer in the open system and the characteristic time of metabolism. In the proposed approximation, the internal energy of the motion of molecules is much less than the energy of translational motion. Or in other terms we can state that the kinetic energy of the average blood velocity is substantially higher than the energy of chaotic motion of the same particles. We state that the relaxation problem models a living system. The flow of entropy to the system decreases in downstream, this corresponds to Shrödinger’s general ideas that the living system “feeds on” negentropy. We introduce a quantity that determines the complexity of the biosystem, more precisely, this is the difference between the nonequilibrium kinetic entropy and the equilibrium entropy at each spatial point integrated over the entire spatial region. Solutions to the problems of spatial relaxation allow us to estimate the size of biosystems as regions of nonequilibrium. The results are compared with empirical data, in particular, for mammals we conclude that the larger the size of animals, the smaller the specific energy of metabolism. This feature is reproduced in our model since the span of the nonequilibrium region is larger in the system where the reaction rate is shorter, or in terms of the kinetic approach, the longer the relaxation time of the interaction between the molecules. The approach is also used for estimation of a part of a living system, namely a green leaf. The problems of aging as degradation of an open nonequilibrium system are considered. The analogy is related to the structure, namely, for a closed system, the equilibrium of the structure is attained for the same molecules while in the open system, a transition occurs to the equilibrium of different particles, which change due to metabolism. Two essentially different time scales are distinguished, the ratio of which is approximately constant for various animal species. Under the assumption of the existence of these two time scales the kinetic equation splits in two equations, describing the metabolic (stationary) and “degradative” (nonstationary) parts of the process.
-
Mathematical and numerical modeling of a drop-shaped microcavity laser
Computer Research and Modeling, 2019, v. 11, no. 6, pp. 1083-1090This paper studies electromagnetic fields, frequencies of lasing, and emission thresholds of a drop-shaped microcavity laser. From the mathematical point of view, the original problem is a nonstandard two-parametric eigenvalue problem for the Helmholtz equation on the whole plane. The desired positive parameters are the lasing frequency and the threshold gain, the corresponding eigenfunctions are the amplitudes of the lasing modes. This problem is usually referred to as the lasing eigenvalue problem. In this study, spectral characteristics are calculated numerically, by solving the lasing eigenvalue problem on the basis of the set of Muller boundary integral equations, which is approximated by the Nystr¨om method. The Muller equations have weakly singular kernels, hence the corresponding operator is Fredholm with zero index. The Nyström method is a special modification of the polynomial quadrature method for boundary integral equations with weakly singular kernels. This algorithm is accurate for functions that are well approximated by trigonometric polynomials, for example, for eigenmodes of resonators with smooth boundaries. This approach leads to a characteristic equation for mode frequencies and lasing thresholds. It is a nonlinear algebraic eigenvalue problem, which is solved numerically by the residual inverse iteration method. In this paper, this technique is extended to the numerical modeling of microcavity lasers having a more complicated form. In contrast to the microcavity lasers with smooth contours, which were previously investigated by the Nyström method, the drop has a corner. We propose a special modification of the Nyström method for contours with corners, which takes also the symmetry of the resonator into account. The results of numerical experiments presented in the paper demonstrate the practical effectiveness of the proposed algorithm.
-
Computer simulation of the process soil treatment by tillage tools of soil processing machines
Computer Research and Modeling, 2020, v. 12, no. 3, pp. 607-627The paper analyzes the methods of studying the process of interaction of soil environments with the tillage tools of soil processing machines. The mathematical methods of numerical modeling are considered in detail, which make it possible to overcome the disadvantages of analytical and empirical approaches. A classification and overview of the possibilities the continuous (FEM — finite element method, CFD — computational fluid dynamics) and discrete (DEM — discrete element method, SPH — hydrodynamics of smoothed particles) numerical methods is presented. Based on the discrete element method, a mathematical model has been developed that represents the soil in the form of a set of interacting small spherical elements. The working surfaces of the tillage tool are presented in the framework of the finite element approximation in the form of a combination of many elementary triangles. The model calculates the movement of soil elements under the action of contact forces of soil elements with each other and with the working surfaces of the tillage tool (elastic forces, dry and viscous friction forces). This makes it possible to assess the influence of the geometric parameters of the tillage tools, technological parameters of the process and soil parameters on the geometric indicators of soil displacement, indicators of the self-installation of tools, power loads, quality indicators of loosening and spatial distribution of indicators. A total of 22 indicators were investigated (or the distribution of the indicator in space). This makes it possible to reproduce changes in the state of the system of elements of the soil (soil cultivation process) and determine the total mechanical effect of the elements on the moving tillage tools of the implement. A demonstration of the capabilities of the mathematical model is given by the example of a study of soil cultivation with a disk cultivator battery. In the computer experiment, a virtual soil channel of 5×1.4 m in size and a 3D model of a disk cultivator battery were used. The radius of the soil particles was taken to be 18 mm, the speed of the tillage tool was 1 m/s, the total simulation time was 5 s. The processing depth was 10 cm at angles of attack of 10, 15, 20, 25 and 30°. The verification of the reliability of the simulation results was carried out on a laboratory stand for volumetric dynamometry by examining a full-scale sample, made in full accordance with the investigated 3D-model. The control was carried out according to three components of the traction resistance vector: $F_x$, $F_y$ and $F_z$. Comparison of the data obtained experimentally with the simulation data showed that the discrepancy is not more than 22.2%, while in all cases the maximum discrepancy was observed at angles of attack of the disk battery of 30°. Good consistency of data on three key power parameters confirms the reliability of the whole complex of studied indicators.
-
Fast method for analyzing the electromagnetic field perturbation by small spherical scatterer
Computer Research and Modeling, 2020, v. 12, no. 5, pp. 1039-1050In this work, we consider a special approximation of the general perturbation formula for the electromagnetic field by a set of electrically small inhomogeneities located in the domain of interest. The problem considered in this paper arises in many applications of technical electrodynamics, radar technologies and subsurface remote sensing. In the general case, it is formulated as follows: at some point in the perturbed domain, it is necessary to determine the amplitude of the electromagnetic field. The perturbation of electromagnetic waves is caused by a set of electrically small scatterers distributed in space. The source of electromagnetic waves is also located in perturbed domain. The problem is solved by introducing the far field approximation and through the formulation for the scatterer radar cross section value. This, in turn, allows one to significantly speed up the calculation process of the perturbed electromagnetic field by a set of a spherical inhomogeneities identical to each other with arbitrary electrophysical parameters. In this paper, we consider only the direct scattering problem; therefore, all parameters of the scatterers are known. In this context, it may be argued that the formulation corresponds to the well-posed problem and does not imply the solution of the integral equation in the generalized formula. One of the features of the proposed algorithm is the allocation of a characteristic plane at the domain boundary. All points of observation of the state of the system belong to this plane. Set of the scatterers is located inside the observation region, which is formed by this surface. The approximation is tested by comparing the results obtained with the solution of the general formula method for the perturbation of the electromagnetic field. This approach, among other things, allows one to remove a number of restrictions on the general perturbation formula for E-filed analysis.
-
Application of Random Forest to construct a local operator for flow fields refinement in external aerodynamics problems
Computer Research and Modeling, 2021, v. 13, no. 4, pp. 761-778Numerical modeling of turbulent flows requires finding the balance between accuracy and computational efficiency. For example, DNS and LES models allow to obtain more accurate results, comparing to RANS models, but are more computationally expensive. Because of this, modern applied simulations are mostly performed with RANS models. But even RANS models can be computationally expensive for complex geometries or series simulations due to the necessity of resolving the boundary layer. Some methods, such as wall functions and near-wall domain decomposition, allow to significantly improve the speed of RANS simulations. However, they inevitably lose precision due to using a simplified model in the near-wall domain. To obtain a model that is both accurate and computationally efficient, it is possible to construct a surrogate model based on previously made simulations using the precise model.
In this paper, an operator is constructed that allows reconstruction of the flow field obtained by an accurate model based on the flow field obtained by the simplified model. Spalart–Allmaras model with approximate nearwall domain decomposition and Spalart–Allmaras model resolving the near-wall region are taken as the simplified and the base models respectively. The operator is constructed using a local approach, i. e. to reconstruct a point in the flow field, only features (flow variables and their derivatives) at this point in the field are used. The operator is constructed using the Random Forest algorithm. The efficiency and accuracy of the obtained surrogate model are demonstrated on the supersonic flow over a compression corner with different values for angle $\alpha$ and Reynolds number. The investigation has been conducted into interpolation and extrapolation both by $Re$ and $\alpha$.
Indexed in Scopus
Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU
The journal is included in the Russian Science Citation Index
The journal is included in the RSCI
International Interdisciplinary Conference "Mathematics. Computing. Education"