Результаты поиска по 'accelerated methods':
Найдено статей: 32
  1. Gasnikov A.V., Gorbunov E.A., Kovalev D.A., Mohammed A.A., Chernousova E.O.
    The global rate of convergence for optimal tensor methods in smooth convex optimization
    Computer Research and Modeling, 2018, v. 10, no. 6, pp. 737-753

    In this work we consider Monteiro – Svaiter accelerated hybrid proximal extragradient (A-HPE) framework and accelerated Newton proximal extragradient (A-NPE) framework. The last framework contains an optimal method for rather smooth convex optimization problems with second-order oracle. We generalize A-NPE framework for higher order derivative oracle (schemes). We replace Newton’s type step in A-NPE that was used for auxiliary problem by Newton’s regularized (tensor) type step (Yu. Nesterov, 2018). Moreover we generalize large step A-HPE/A-NPE framework by replacing Monteiro – Svaiter’s large step condition so that this framework could work for high-order schemes. The main contribution of the paper is as follows: we propose optimal highorder methods for convex optimization problems. As far as we know for that moment there exist only zero, first and second order optimal methods that work according to the lower bounds. For higher order schemes there exists a gap between the lower bounds (Arjevani, Shamir, Shiff, 2017) and existing high-order (tensor) methods (Nesterov – Polyak, 2006; Yu.Nesterov, 2008; M. Baes, 2009; Yu.Nesterov, 2018). Asymptotically the ratio of the rates of convergences for the best existing methods and lower bounds is about 1.5. In this work we eliminate this gap and show that lower bounds are tight. We also consider rather smooth strongly convex optimization problems and show how to generalize the proposed methods to this case. The basic idea is to use restart technique until iteration sequence reach the region of quadratic convergence of Newton method and then use Newton method. One can show that the considered method converges with optimal rates up to a logarithmic factor. Note, that proposed in this work technique can be generalized in the case when we can’t solve auxiliary problem exactly, moreover we can’t even calculate the derivatives of the functional exactly. Moreover, the proposed technique can be generalized to the composite optimization problems and in particular to the constraint convex optimization problems. We also formulate a list of open questions that arise around the main result of this paper (optimal universal method of high order e.t.c.).

    Views (last year): 75.
  2. Ryabtsev A.B.
    The error accumulation in the conjugate gradient method for degenerate problem
    Computer Research and Modeling, 2021, v. 13, no. 3, pp. 459-472

    In this paper, we consider the conjugate gradient method for solving the problem of minimizing a quadratic function with additive noise in the gradient. Three concepts of noise were considered: antagonistic noise in the linear term, stochastic noise in the linear term and noise in the quadratic term, as well as combinations of the first and second with the last. It was experimentally obtained that error accumulation is absent for any of the considered concepts, which differs from the folklore opinion that, as in accelerated methods, error accumulation must take place. The paper gives motivation for why the error may not accumulate. The dependence of the solution error both on the magnitude (scale) of the noise and on the size of the solution using the conjugate gradient method was also experimentally investigated. Hypotheses about the dependence of the error in the solution on the noise scale and the size (2-norm) of the solution are proposed and tested for all the concepts considered. It turned out that the error in the solution (by function) linearly depends on the noise scale. The work contains graphs illustrating each individual study, as well as a detailed description of numerical experiments, which includes an account of the methods of noise of both the vector and the matrix.

  3. Shaklein A.A., Karpov A.I., Bolkisev A.A.
    Analysis of a numerical method for studying upward flame spread over solid material
    Computer Research and Modeling, 2018, v. 10, no. 6, pp. 755-774

    Reduction of the fire hazard of polymeric materials is one of the important scientific and technical problems. Since complexity of experimental procedures associated with flame spread, establishing reacting flows theoretical basics turned out to be crucial field of modern fundamental science. In order to determine parameters of flame spread over solid combustible materials numerical modelling methods have to be improved. Large amount of physical and chemical processes taking place needed to be resolved not just separately one by one but in connection with each other in gas and solid phases.

    Upward flame spread over vertical solid combustible material is followed by unsteady eddy structures of gas flow in the vicinity of flame zone caused by thermal instability and natural convection forces accelerating hot combustion products. At every moment different amount of heat energy is transferred from hot gas-phase flame to solid material because of eddy flow structures. Therefore, satisfactory heat flux and eddy flow modelling are important to estimate flame spread rate.

    In the current study we evaluated parameters of numerical method for flame spread over solid combustible material problem taking into account coupled nature of complex interaction between gas phase, solid material and eddy flow resulted from natural convection. We studied aspects of different approximation schemes used in differential equations integration process over space and time, of fields relaxation during iterations procedure carried out inside time step, of different time step values.

    Mathematical model formulated allows to simulate flame spread over solid combustible material. Fluid dynamics is modeled by Navier – Stokes system of equations, eddy flow is described by combined turbulent model RANS–LES (DDES), turbulent combustion is resolved by modified turbulent combustion model Eddy Break-Up taking into account kinetic effects, radiation transfer is modeled by spherical harmonics method of first order approximation (P1). The equations presented are solved in OpenFOAM software.

    Views (last year): 33.
  4. Alkousa M.S., Gasnikov A.V., Dvurechensky P.E., Sadiev A.A., Razouk L.Ya.
    An approach for the nonconvex uniformly concave structured saddle point problem
    Computer Research and Modeling, 2022, v. 14, no. 2, pp. 225-237

    Recently, saddle point problems have received much attention due to their powerful modeling capability for a lot of problems from diverse domains. Applications of these problems occur in many applied areas, such as robust optimization, distributed optimization, game theory, and many applications in machine learning such as empirical risk minimization and generative adversarial networks training. Therefore, many researchers have actively worked on developing numerical methods for solving saddle point problems in many different settings. This paper is devoted to developing a numerical method for solving saddle point problems in the nonconvex uniformly-concave setting. We study a general class of saddle point problems with composite structure and H\"older-continuous higher-order derivatives. To solve the problem under consideration, we propose an approach in which we reduce the problem to a combination of two auxiliary optimization problems separately for each group of variables, the outer minimization problem w.r.t. primal variables, and the inner maximization problem w.r.t the dual variables. For solving the outer minimization problem, we use the Adaptive Gradient Method, which is applicable for nonconvex problems and also works with an inexact oracle that is generated by approximately solving the inner problem. For solving the inner maximization problem, we use the Restarted Unified Acceleration Framework, which is a framework that unifies the high-order acceleration methods for minimizing a convex function that has H\"older-continuous higher-order derivatives. Separate complexity bounds are provided for the number of calls to the first-order oracles for the outer minimization problem and higher-order oracles for the inner maximization problem. Moreover, the complexity of the whole proposed approach is then estimated.

  5. Aristova E.N., Karavaeva N.I.
    Bicompact schemes for the HOLO algorithm for joint solution of the transport equation and the energy equation
    Computer Research and Modeling, 2023, v. 15, no. 6, pp. 1429-1448

    The numerical solving of the system of high-temperature radiative gas dynamics (HTRGD) equations is a computationally laborious task, since the interaction of radiation with matter is nonlinear and non-local. The radiation absorption coefficients depend on temperature, and the temperature field is determined by both gas-dynamic processes and radiation transport. The method of splitting into physical processes is usually used to solve the HTRGD system, one of the blocks consists of a joint solving of the radiative transport equation and the energy balance equation of matter under known pressure and temperature fields. Usually difference schemes with orders of convergence no higher than the second are used to solve this block. Due to computer memory limitations it is necessary to use not too detailed grids to solve complex technical problems. This increases the requirements for the order of approximation of difference schemes. In this work, bicompact schemes of a high order of approximation for the algorithm for the joint solution of the radiative transport equation and the energy balance equation are implemented for the first time. The proposed method can be applied to solve a wide range of practical problems, as it has high accuracy and it is suitable for solving problems with coefficient discontinuities. The non-linearity of the problem and the use of an implicit scheme lead to an iterative process that may slowly converge. In this paper, we use a multiplicative HOLO algorithm named the quasi-diffusion method by V.Ya.Goldin. The key idea of HOLO algorithms is the joint solving of high order (HO) and low order (LO) equations. The high-order equation (HO) is the radiative transport equation solved in the energy multigroup approximation, the system of quasi-diffusion equations in the multigroup approximation (LO1) is obtained by averaging HO equations over the angular variable. The next step is averaging over energy, resulting in an effective one-group system of quasi-diffusion equations (LO2), which is solved jointly with the energy equation. The solutions obtained at each stage of the HOLO algorithm are closely related that ultimately leads to an acceleration of the convergence of the iterative process. Difference schemes constructed by the method of lines within one cell are proposed for each of the stages of the HOLO algorithm. The schemes have the fourth order of approximation in space and the third order of approximation in time. Schemes for the transport equation were developed by B.V. Rogov and his colleagues, the schemes for the LO1 and LO2 equations were developed by the authors. An analytical test is constructed to demonstrate the declared orders of convergence. Various options for setting boundary conditions are considered and their influence on the order of convergence in time and space is studied.

  6. Gasnikov A.V., Kovalev D.A.
    A hypothesis about the rate of global convergence for optimal methods (Newton’s type) in smooth convex optimization
    Computer Research and Modeling, 2018, v. 10, no. 3, pp. 305-314

    In this paper we discuss lower bounds for convergence of convex optimization methods of high order and attainability of this bounds. We formulate a hypothesis that covers all the cases. It is noticeable that we provide this statement without a proof. Newton method is the most famous method that uses gradient and Hessian of optimized function. However, it converges locally even for strongly convex functions. Global convergence can be achieved with cubic regularization of Newton method [Nesterov, Polyak, 2006], whose iteration cost is comparable with iteration cost of Newton method and is equivalent to inversion of Hessian of optimized function. Yu.Nesterov proposed accelerated variant of Newton method with cubic regularization in 2008 [Nesterov, 2008]. R.Monteiro and B. Svaiter managed to improve global convergence of cubic regularized method in 2013 [Monteiro, Svaiter, 2013]. Y.Arjevani, O. Shamir and R. Shiff showed that convergence bound of Monteiro and Svaiter is optimal (cannot be improved by more than logarithmic factor with any second order method) in 2017 [Arjevani et al., 2017]. They also managed to find bounds for convex optimization methods of p-th order for $p ≥ 2$. However, they got bounds only for first and second order methods for strongly convex functions. In 2018 Yu.Nesterov proposed third order convex optimization methods with rate of convergence that is close to this lower bounds and with similar to Newton method cost of iteration [Nesterov, 2018]. Consequently, it was showed that high order methods can be practical. In this paper we formulate lower bounds for p-th order methods for $p ≥ 3$ for strongly convex unconstrained optimization problems. This paper can be viewed as a little survey of state of the art of high order optimization methods.

    Views (last year): 21. Citations: 1 (RSCI).
  7. Kozhevnikov V.S., Matyushkin I.V., Chernyaev N.V.
    Analysis of the basic equation of the physical and statistical approach within reliability theory of technical systems
    Computer Research and Modeling, 2020, v. 12, no. 4, pp. 721-735

    Verification of the physical-statistical approach within reliability theory for the simplest cases was carried out, which showed its validity. An analytical solution of the one-dimensional basic equation of the physicalstatistical approach is presented under the assumption of a stationary degradation rate. From a mathematical point of view this equation is the well-known continuity equation, where the role of density is played by the density distribution function of goods in its characteristics phase space, and the role of fluid velocity is played by intensity (rate) degradation processes. The latter connects the general formalism with the specifics of degradation mechanisms. The cases of coordinate constant, linear and quadratic degradation rates are analyzed using the characteristics method. In the first two cases, the results correspond to physical intuition. At a constant rate of degradation, the shape of the initial distribution is preserved, and the distribution itself moves equably from the zero. At a linear rate of degradation, the distribution either narrows down to a narrow peak (in the singular limit), or expands, with the maximum shifting to the periphery at an exponentially increasing rate. The distribution form is also saved up to the parameters. For the initial normal distribution, the coordinates of the largest value of the distribution maximum for its return motion are obtained analytically.

    In the quadratic case, the formal solution demonstrates counterintuitive behavior. It consists in the fact that the solution is uniquely defined only on a part of an infinite half-plane, vanishes along with all derivatives on the boundary, and is ambiguous when crossing the boundary. If you continue it to another area in accordance with the analytical solution, it has a two-humped appearance, retains the amount of substance and, which is devoid of physical meaning, periodically over time. If you continue it with zero, then the conservativeness property is violated. The anomaly of the quadratic case is explained, though not strictly, by the analogy of the motion of a material point with an acceleration proportional to the square of velocity. Here we are dealing with a mathematical curiosity. Numerical calculations are given for all cases. Additionally, the entropy of the probability distribution and the reliability function are calculated, and their correlation is traced.

  8. Ivanova A.S., Omelchenko S.S., Kotliarova E.V., Matyukhin V.V.
    Calibration of model parameters for calculating correspondence matrix for Moscow
    Computer Research and Modeling, 2020, v. 12, no. 5, pp. 961-978

    In this paper, we consider the problem of restoring the correspondence matrix based on the observations of real correspondences in Moscow. Following the conventional approach [Gasnikov et al., 2013], the transport network is considered as a directed graph whose edges correspond to road sections and the graph vertices correspond to areas that the traffic participants leave or enter. The number of city residents is considered constant. The problem of restoring the correspondence matrix is to calculate all the correspondence from the $i$ area to the $j$ area.

    To restore the matrix, we propose to use one of the most popular methods of calculating the correspondence matrix in urban studies — the entropy model. In our work, which is based on the work [Wilson, 1978], we describe the evolutionary justification of the entropy model and the main idea of the transition to solving the problem of entropy-linear programming (ELP) in calculating the correspondence matrix. To solve the ELP problem, it is proposed to pass to the dual problem. In this paper, we describe several numerical optimization methods for solving this problem: the Sinkhorn method and the Accelerated Sinkhorn method. We provide numerical experiments for the following variants of cost functions: a linear cost function and a superposition of the power and logarithmic cost functions. In these functions, the cost is a combination of average time and distance between areas, which depends on the parameters. The correspondence matrix is calculated for multiple sets of parameters and then we calculate the quality of the restored matrix relative to the known correspondence matrix.

    We assume that the noise in the restored correspondence matrix is Gaussian, as a result, we use the standard deviation as a quality metric. The article provides an overview of gradient-free optimization methods for solving non-convex problems. Since the number of parameters of the cost function is small, we use the grid search method to find the optimal parameters of the cost function. Thus, the correspondence matrix calculated for each set of parameters and then the quality of the restored matrix is evaluated relative to the known correspondence matrix. Further, according to the minimum residual value for each cost function, we determine for which cost function and at what parameter values the restored matrix best describes real correspondence.

  9. Russkikh S.V., Shklyarchuk F.N.
    Numerical solution of systems of nonlinear second-order differential equations with variable coefficients by the one-step Galerkin method
    Computer Research and Modeling, 2023, v. 15, no. 5, pp. 1153-1167

    A nonlinear oscillatory system described by ordinary differential equations with variable coefficients is considered, in which terms that are linearly dependent on coordinates, velocities and accelerations are explicitly distinguished; nonlinear terms are written as implicit functions of these variables. For the numerical solution of the initial problem described by such a system of differential equations, the one-step Galerkin method is used. At the integration step, unknown functions are represented as a sum of linear functions satisfying the initial conditions and several given correction functions in the form of polynomials of the second and higher degrees with unknown coefficients. The differential equations at the step are satisfied approximately by the Galerkin method on a system of corrective functions. Algebraic equations with nonlinear terms are obtained, which are solved by iteration at each step. From the solution at the end of each step, the initial conditions for the next step are determined.

    The corrective functions are taken the same for all steps. In general, 4 or 5 correction functions are used for calculations over long time intervals: in the first set — basic power functions from the 2nd to the 4th or 5th degrees; in the second set — orthogonal power polynomials formed from basic functions; in the third set — special linear-independent polynomials with finite conditions that simplify the “docking” of solutions in the following steps.

    Using two examples of calculating nonlinear oscillations of systems with one and two degrees of freedom, numerical studies of the accuracy of the numerical solution of initial problems at various time intervals using the Galerkin method using the specified sets of power-law correction functions are performed. The results obtained by the Galerkin method and the Adams and Runge –Kutta methods of the fourth order are compared. It is shown that the Galerkin method can obtain reliable results at significantly longer time intervals than the Adams and Runge – Kutta methods.

  10. Dzhinchvelashvili G.A., Dzerzhinsky R.I., Denisenkova N.N.
    Quantitative assessment of seismic risk and energy concepts of earthquake engineering
    Computer Research and Modeling, 2018, v. 10, no. 1, pp. 61-76

    Currently, earthquake-resistant design of buildings based on the power calculation and presentation of effect of the earthquake static equivalent forces, which are calculated using elastic response spectra (linear-spectral method) that connects the law of motion of the soil with the absolute acceleration of the model in a nonlinear oscillator.

    This approach does not directly take into account either the influence of the duration of strong motion or the plastic behavior of the structure. Frequency content and duration of ground vibrations directly affect the energy received by the building and causing damage to its elements. Unlike power or kinematic calculation of the seismic effect on the structure can be interpreted without considering separately the forces and displacements and to provide, as the product of both variables, i.e., the work or input energy (maximum energy that can be purchased building to the earthquake).

    With the energy approach of seismic design, it is necessary to evaluate the input seismic energy in the structure and its distribution among various structural components.

    The article provides substantiation of the energy approach in the design of earthquake-resistant buildings and structures instead of the currently used method based on the power calculation and presentation of effect of the earthquake static equivalent forces, which are calculated using spectra of the reaction.

    Noted that interest in the use of energy concepts in earthquake-resistant design began with the works of Housner, which provided the seismic force in the form of the input seismic energy, using the range of speeds, and suggested that the damage in elastic-plastic system and elastic system causes one and the same input seismic energy.

    The indices of the determination of the input energy of the earthquake, proposed by various authors, are given in this paper. It is shown that modern approaches to ensuring seismic stability of structures, based on the representation of the earthquake effect as a static equivalent force, do not adequately describe the behavior of the system during an earthquake.

    In this paper, based on quantitative estimates of seismic risk analyzes developed in the NRU MSUCE Standard Organization (STO) “Seismic resistance structures. The main design provisions”. In the developed document a step forward with respect to the optimal design of earthquake-resistant structures.

    The proposed concept of using the achievements of modern methods of calculation of buildings and structures on seismic effects, which are harmonized with the Eurocodes and are not contrary to the system of national regulations.

    Views (last year): 21.
Pages: next last »

Indexed in Scopus

Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU

The journal is included in the Russian Science Citation Index

The journal is included in the RSCI

International Interdisciplinary Conference "Mathematics. Computing. Education"