Skip to content
Prev 1854 / 2152 Next

Why pure computation time in parallel is longer than the serial version?

hi

2014-02-22 19:30 GMT+09:00 Xuening Zhu <puddingnnn529 at gmail.com>:
Logical performance of your CPU..

use SIMD 4 FLOPSparClock x 2.5GHz x 2phisicalcore = 20GFLOPS
use AVX   8 FLOPSparClock x 2.5GHz x 2phisicalcore = 40GFLOPS
# Because a physical core is two, the computing unit is two.

amount of the operation of DGEMM is O(N^3).
amount of the operation of the procession is O(N^3).
require calculation ... 2*(3e3 * 4e3 * (3e3+4e3)/2) = 84GFLOPS

use SIMD 84/20 = 4.2 sec
use AVX   84/40 = 2.1 sec

so becomes 2.1 seconds in the logical peak performance in your CPU.
Maybe, because the effective efficiency is about range 80% to 90%, it
is likely to become about 2.6 seconds normally.
maybe this performance of NEHALEM Core.
and hyperthread decreases the efficiency of cache in the procession.

Best Regards,

Thread (19 messages)

Wesley Goi Why pure computation time in parallel is longer thanthe serial version? Feb 17 Xuening Zhu Why pure computation time in parallel is longer thanthe serial version? Feb 17 wesley goi Why pure computation time in parallel is longer thanthe serial version? Feb 17 Roger Bivand Why pure computation time in parallel is longer than the serial version? Feb 18 wesley goi Why pure computation time in parallel is longer than the serial version? Feb 18 Xuening Zhu Why pure computation time in parallel is longer than the serial version? Feb 22 Claudia Beleites Why pure computation time in parallel is longer than the serial version? Feb 22 Xuening Zhu Why pure computation time in parallel is longer than the serial version? Feb 22 Roger Bivand Why pure computation time in parallel is longer than the serial version? Feb 22 Xuening Zhu Why pure computation time in parallel is longer than the serial version? Feb 22 Roger Bivand Why pure computation time in parallel is longer than the serial version? Feb 23 Ei-ji Nakama Why pure computation time in parallel is longer than the serial version? Feb 23 Ei-ji Nakama Why pure computation time in parallel is longer than the serial version? Feb 23 Simon Urbanek Why pure computation time in parallel is longer than the serial version? Feb 25 Jim Gattiker Why pure computation time in parallel is longer than the serial version? Feb 25 George Ostrouchov Why pure computation time in parallel is longer than the serial version? Feb 25 Norm Matloff Why pure computation time in parallel is longer than the serial version? Feb 25 Jim Gattiker Why pure computation time in parallel is longer than the serial version? Feb 26 George Ostrouchov Why pure computation time in parallel is longer than the serial version? Feb 26