Результаты поиска по 'GPU':
Найдено статей: 8
  1. Kuz'min I.M., Tonkov L.E., Kopysov S.P.
    Algorithms and Software for Solving Coupled Fluid-Structure Interaction Problems on Hybrid HPC Platform
    Computer Research and Modeling, 2013, v. 5, no. 2, pp. 153-164

    In this paper, we propose a new software for simulation of fluid-structure interaction. The software is designed for solving coupled problems and provides an interface for synchronization synchronisation and data exchange between existing fluid and structural solvers. Algorithms of coupling solvers and mesh deformation are discussed. The software can be used on hybrid CPU/GPU platforms.

    Views (last year): 1. Citations: 11 (RSCI).
  2. Dzhoraev A.R.
    GPU-accelerated hybrid systems for high-performance computing in bio-informatics
    Computer Research and Modeling, 2010, v. 2, no. 2, pp. 163-167

    Modern GPUs are massively-parallel processors, offering substantial amount of computational power in energy-efficient package. We discuss the benefits of utilizing this computing power for modeling problems in bio-informatics, such as molecular dynamics, quantum chemistry and sequence analysis.

    Views (last year): 2. Citations: 6 (RSCI).
  3. Kazennov A.M.
    Basic concepts of CUDA technology
    Computer Research and Modeling, 2010, v. 2, no. 3, pp. 295-308

    The history of the development of CUDA technology and its fundamental limitations are discribed. The article is intended for those readers who are not familiar with graphics adapter programming features but want to evaluate the possibilities for GPU computing applications.

    Views (last year): 5. Citations: 4 (RSCI).
  4. Geller O.V., Vasilev M.O., Kholodov Y.A.
    Building a high-performance computing system for simulation of gas dynamics
    Computer Research and Modeling, 2010, v. 2, no. 3, pp. 309-317

    The aim of research is to develop software system for solving gas dynamic problem in multiply connected integration domains of regular shape by high-performance computing system. Comparison of the various technologies of parallel computing has been done. The program complex is implemented using multithreaded parallel systems to organize both multi-core and massively parallel calculation. The comparison of numerical results with known model problems solutions has been done. Research of performance of different computing platforms has been done.

    Views (last year): 5. Citations: 6 (RSCI).
  5. Zhmurov A.A., Barsegov V.A., Trifonov S.V., Kholodov Y.A., Kholodov A.S.
    Efficient Pseudorandom number generators for biomolecular simulations on graphics processors
    Computer Research and Modeling, 2011, v. 3, no. 3, pp. 287-308

    Langevin Dynamics, Monte Carlo, and all-atom Molecular Dynamics simulations in implicit solvent require a reliable source of pseudorandom numbers generated at each step of calculation. We present the two main approaches for implementation of pseudorandom number generators on a GPU. In the first approach, inherent in CPU-based calculations, one PRNG produces a stream of pseudorandom numbers in each thread of execution, whereas the second approach builds on the ability of different threads to communicate, thus, sharing random seeds across the entire device. We exemplify the use of these approaches through the development of Ran2, Hybrid Taus, and Lagged Fibonacci algorithms. As an application-based test of randomness, we carry out LD simulations of N independent harmonic oscillators coupled to a stochastic thermostat. This model allows us to assess statistical quality of pseudorandom numbers. We also profile performance of these generators in terms of the computational time, memory usage, and the speedup factor (CPU/GPU time).

    Views (last year): 11. Citations: 2 (RSCI).
  6. Bogdanov A.V., Gankevich I.G., Gayduchok V.Yu., Yuzhanin N.V.
    Running applications on a hybrid cluster
    Computer Research and Modeling, 2015, v. 7, no. 3, pp. 475-483

    A hybrid cluster implies the use of computational devices with radically different architectures. Usually, these are conventional CPU architecture (e.g. x86_64) and GPU architecture (e. g. NVIDIA CUDA). Creating and exploiting such a cluster requires some experience: in order to harness all computational power of the described system and get substantial speedup for computational tasks many factors should be taken into account. These factors consist of hardware characteristics (e.g. network infrastructure, a type of data storage, GPU architecture) as well as software stack (e.g. MPI implementation, GPGPU libraries). So, in order to run scientific applications GPU capabilities, software features, task size and other factors should be considered.

    This report discusses opportunities and problems of hybrid computations. Some statistics from tests programs and applications runs will be demonstrated. The main focus of interest is open source applications (e. g. OpenFOAM) that support GPGPU (with some parts rewritten to use GPGPU directly or by replacing libraries).

    There are several approaches to organize heterogeneous computations for different GPU architectures out of which CUDA library and OpenCL framework are compared. CUDA library is becoming quite typical for hybrid systems with NVIDIA cards, but OpenCL offers portability opportunities which can be a determinant factor when choosing framework for development. We also put emphasis on multi-GPU systems that are often used to build hybrid clusters. Calculations were performed on a hybrid cluster of SPbU computing center.

    Views (last year): 4.
  7. Zhmurov A.A., Alekseenko A.E., Barsegov V.A., Kononova O.G., Kholodov Y.A.
    Phase transition from α-helices to β-sheets in supercoils of fibrillar proteins
    Computer Research and Modeling, 2013, v. 5, no. 4, pp. 705-725

    The transition from α-helices to β-strands under external mechanical force in fibrin molecule containing coiled-coils is studied and free energy landscape is resolved. The detailed theoretical modeling of each stage of coiled-coils fragment pulling process was performed. The plots of force (F) as a function of molecule expansion (X) for two symmetrical fibrin coiled-coils (each ∼17 nm in length) show three distinct modes of mechanical behaviour: (1) linear (elastic) mode when coiled-coils behave like entropic springs (F<100−125 pN and X<7−8 nm), (2) viscous (plastic) mode when molecule resistance force does not increase with increase in elongation length (F≈150 pN and X≈10−35 nm) and (3) nonlinear mode (F>175−200 pN and X>40−50 nm). In linear mode the coiled-coils unwind at 2π radian angle, but no structural transition occurs. Viscous mode is characterized by the phase transition from the triple α-spirals to three-stranded parallel β-sheet. The critical tension of α-helices is 0.25 nm per turn, and the characteristic energy change is equal to 4.9 kcal/mol. Changes in internal energy Δu, entropy Δs and force capacity cf per one helical turn for phase transition were also computed. The observed dynamic behavior of α-helices and phase transition from α-helices to β-sheets under tension might represent a universal mechanism of regulation of fibrillar protein structures subject to mechanical stresses due to biological forces.

    Views (last year): 6. Citations: 1 (RSCI).
  8. Minkin A.S., Knizhnik A.A., Potapkin B.V.
    OpenCL realization of some many-body potentials
    Computer Research and Modeling, 2015, v. 7, no. 3, pp. 549-558

    Modeling of carbon nanostructures by means of classical molecular dynamics requires a lot of computations. One of the ways to improve the performance of basic algorithms is to transform them for running on SIMD-type computing systems such as systems with dedicated GPU. In this work we describe the development of algorithms for computation of many-body interaction based on Tersoff and embedded-atom potentials by means of OpenCL technology. OpenCL standard provides universality and portability of the algorithms and can be successfully used for development of the software for heterogeneous computing systems. The performance of algorithms is evaluated on CPU and GPU hardware platforms. It is shown that concurrent memory writes is effective for Tersoff bond order potential. The same approach for embedded-atom potential is shown to be slower than algorithm without concurrent memory access. Performance evaluation shows a significant GPU acceleration of energy-force evaluation algorithms for many-body potentials in comparison to the corresponding serial implementations.

    Views (last year): 4. Citations: 1 (RSCI).

Indexed in Scopus

Full-text version of the journal is also available on the web site of the scientific electronic library eLIBRARY.RU

The journal is included in the Russian Science Citation Index

The journal is included in the RSCI

International Interdisciplinary Conference "Mathematics. Computing. Education"