Title: Performance analysis of parallel smoothed particle hydrodynamics on multi-core CPUs
Abstract: This paper presents a parallel smoothed particle hydrodynamics (SPH) implementation for multi-core CPUs. The implementation stores particle data in a hash table and divides the program into two parts for parallelization: the first part is free of data races, while the second part is not. The paper compares the running time and parallel speedup of each part to locate the bottleneck of the parallel SPH program. The results show that, when the search radius is large, the program achieves linear speedup with only the first part parallelized. The second part becomes a performance bottleneck only when the search radius is small enough that each cell contains just one or two particles on average. We present a method that parallelizes the second part without affecting the performance of the first part, and the results show that it eases the bottleneck when the search radius is small.
Publication Year: 2014
Publication Date: 2014-12-01
Language: en
Type: article
Indexed In: ['crossref']
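
The abstract mentions a hash table of particles organized into cells whose size depends on the search radius. The following is a minimal 2D sketch of that general idea, not the paper's implementation: the function names (build_cell_table, neighbors), the cell size equal to the search radius h, and the use of a Python dictionary are all assumptions made for illustration. The abstract does not state which computation belongs to which of its two parts; the sketch only shows that lookups leave the table unmodified, whereas concurrent insertions into shared cell lists would need synchronization.

```python
from collections import defaultdict
from itertools import product
import math


def build_cell_table(positions, h):
    """Map each grid cell (ix, iy) to the indices of the particles inside it.

    Assumption: the cell edge length equals the search radius h.
    Appending to shared cell lists is the step that would race if several
    threads inserted particles at the same time.
    """
    table = defaultdict(list)
    for i, (x, y) in enumerate(positions):
        table[(math.floor(x / h), math.floor(y / h))].append(i)
    return table


def neighbors(i, positions, table, h):
    """Return sorted indices of particles within distance h of particle i.

    Only the 3x3 block of cells around particle i is scanned. The table is
    read, never written (table.get does not create missing cells).
    """
    xi, yi = positions[i]
    cx, cy = math.floor(xi / h), math.floor(yi / h)
    found = []
    for dx, dy in product((-1, 0, 1), repeat=2):
        for j in table.get((cx + dx, cy + dy), ()):
            if j != i:
                xj, yj = positions[j]
                if (xi - xj) ** 2 + (yi - yj) ** 2 <= h * h:
                    found.append(j)
    return sorted(found)


def mean_particles_per_cell(table):
    """Average occupancy over non-empty cells."""
    return sum(len(cell) for cell in table.values()) / len(table)
```

Shrinking h in this sketch spreads the same particles over more cells, so the average occupancy per non-empty cell falls toward one, which is the small-radius regime the abstract identifies as the bottleneck case.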