A gpgpu parallel computing method for sph fluid simulation

A parallel computing and fluid simulation technology, applied in multi-programming devices, processor architecture/configuration, inter-program communication, etc., can solve problems such as low cache hit rate, inefficient global memory access mode, and insufficient potential

Active Publication Date: 2019-09-10
EAST CHINA NORMAL UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, existing shared memory solutions do not bring the desired performance improvement, and are even 20% to 70% slower than traditional methods
This solution uses the explicit thread synchronization method in CTA to ensure the correctness of data, but thread synchronization brings very high performance overhead; at the same time, this method occupies too much shared memory, making it run on most devices It has a low device occupancy rate; secondly, its inefficient global memory access mode does not make good use of the access characteristics of global memory; and its random task allocation strategy will lead to a lower cache hit rate
[0004] All in all, although the performance of SPH has been significantly improved by taking advantage of the high parallel processing capability of GPGPU devices, its potential is still far from being fully exploited.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A gpgpu parallel computing method for sph fluid simulation
  • A gpgpu parallel computing method for sph fluid simulation
  • A gpgpu parallel computing method for sph fluid simulation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0032] (1) Fluid model based on SPH method

[0033] The SPH method is a meshless particle method. As a Lagrangian-based approach, the persistent simulation space is considered to consist of a number of discrete particles. The properties of each particle (such as density, force) are calculated by smooth kernel function W(r, h) with weighted interpolation:

[0034]

[0035] where m and ρ represent the mass and density represented by the particle, respectively. h in W(r,h) represents the smooth kernel radius.

[0036] Before calculating the force of each particle, it is necessary to first calculate the density ρ represented by each particle and generate the pressure p:

[0037] ρ i =∑ j m j W(r i -r j , h), (2)

[0038] p i =κ(ρ i -ρ 0 ), (3)

[0039] where κ is the gas constant, ρ 0 is the rest density. Next, calculate the force f on each particle by...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a GPGPU (General Purpose Graphics Processing Unit) parallel computing method oriented to SPH (Smoothed Particle Hydrodynamics) fluid simulation. The method comprises the step of: grouping particle computing tasks in an SPH method to accord with a tread level structure in a GPGPU, wherein a thread in a cooperative thread array (CTA) can process logically similar work. The grouping method is developed for a CTA scheduling rule in the GPGPU and is realized on the GPGPU, thus the cache hit rate of a system can be improved while allocating tasks rapidly. An on-chip shared memory can be used for caching read data from a global memory on the basis of the allocating method; the memory bandwidth can be fully utilized according to the storage access characteristics of the global memory; and meanwhile through utilization of SIMD (Single Instruction Multiple Data) characteristics of the GPGPU, the thread synchronization overhead brought by use of the shared memory can be effectively avoided.

Description

technical field [0001] The invention belongs to the field of computer graphics and high-performance computing, specifically a new parallel method based on a general-purpose computing graphics processor (GPGPU), which is used for fluid simulation calculation based on the SPH algorithm. It involves computer graphics and simulation, computational fluid dynamics, single instruction stream multiple data stream (SIMD) architecture, GPGPU, etc. Background technique [0002] The SPH method is a grid-free Lagrangian numerical method. This method has been widely used in fluid simulation, astrodynamics, molecular dynamics and biological simulation in recent years. As a particle method, SPH requires a large number of particles to simulate large-scale scenes with high precision. And a large number of high-density particles will lead to massive computing requirements. So for users, they are always forced to make a trade-off between simulation effects and system performance. Therefore,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/50G06F9/54G06T1/20
Inventor 阮骥鸣王长波秦洪
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products