
CUDA MB V12b for multi-GPU multicore hosts.

glennaxl:
Another round of tests, on the same rigs as before. A test on a 3rd rig is coming, but somehow it's crashing on Test B. The 3rd rig is a Q6600 with dual GTX 260s on a P55 chipset board.


Test Cases:
TEST A: 1 GPU, No CPU
TEST B: 1 GPU, 100% CPU
TEST C: v12 ALL GPU, 100% CPU
TEST D: v12b ALL GPU, 100% CPU
TEST E: v12b x4 ALL GPU, 100% CPU

TEST A:
1) a. GPU0 @ GTX295-CORE0 (v12 vs v12b)
   b. GPU0 @ GTX295-CORE0 (v12 vs v12b x4)
2) a. GPU1 @ GTX260 (v12 vs v12b)
   b. GPU1 @ GTX260 (v12 vs v12b x4)
3) a. GPU2 @ GTX295-CORE1 (v12 vs v12b)
   b. GPU2 @ GTX295-CORE1 (v12 vs v12b x4)

TEST B:
CPU0-7 @i7 920 (AKv8 vs AKv8b)
1) a. GPU0   @ GTX295-CORE0 (v12 vs v12b)
   b. GPU0   @ GTX295-CORE0 (v12 vs v12b x4)
2) a. GPU1   @ GTX260 (v12 vs v12b)
   b. GPU1   @ GTX260 (v12 vs v12b x4)
3) a. GPU2   @ GTX295-CORE1 (v12 vs v12b)
   b. GPU2   @ GTX295-CORE1 (v12 vs v12b x4)

TEST C:
1) GPU0 @ GTX295-CORE0 (stock609 vs v12)
   GPU1 @ GTX260 (stock609 vs v12)
   GPU2 @ GTX295-CORE1 (stock609 vs v12)
   CPU0-7 @i7 920 (AKv8 vs AKv8b)

TEST D:
1) GPU0 @ GTX295-CORE0 (v12 vs v12b)
   GPU1 @ GTX260 (v12 vs v12b)
   GPU2 @ GTX295-CORE1 (v12 vs v12b)
   CPU0-7 @i7 920 (AKv8 vs AKv8b)

TEST E:
1) GPU0 @ GTX295-CORE0 (v12 vs v12b x4)
   GPU1 @ GTX260 (v12 vs v12b x4)
   GPU2 @ GTX295-CORE1 (v12 vs v12b x4)
   CPU0-7 @i7 920 (AKv8 vs AKv8b)

glennaxl:
The 3rd rig I mentioned: I upgraded to 196.21 from 195.62 and that fixed the issue.

It seems the speedup is smaller on the Q6600 than on the i7 920.

Raistmer:
Thanks a lot!
Will look at results.

glennaxl:

Results are from Test D and E.

Raistmer:
Looks like V12b makes sense for hosts with 3 GPUs but not for hosts with only 2 GPUs. [Rig 3 has only 2 GPUs too...]
V12b x4 takes only 1 CPU per instance, and on an i7 CPU that means 2 instances sit on the same physical core because of HyperThreading. That is sub-optimal, of course, so the x4 results are almost always worse.
V12b takes 2 CPUs per instance, that is, always a full i7 core, but it uses only the first 4 CPUs, so again 3 instances will use 2 i7 cores instead of 3.
I will try some i7-related tuning, and maybe the results will become clearer...
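
To make the core-sharing argument concrete, here is a rough Python sketch of the two layouts described above. It is only a toy model, not the actual V12b affinity code: the `assign` helper and its round-robin wrap-around are hypothetical, and it assumes that logical CPUs 2k and 2k+1 are HyperThreading siblings on physical core k, which may not match how every OS enumerates an i7 920.

```python
from collections import defaultdict

LOGICAL_PER_CORE = 2  # HyperThreading: 2 logical CPUs per physical core


def physical_core(logical_cpu):
    # Assumption: logical CPUs 2k and 2k+1 share physical core k.
    return logical_cpu // LOGICAL_PER_CORE


def assign(instances, cpus_per_instance, usable_cpus=4):
    """Hypothetical pinning: round-robin over the first `usable_cpus` logical CPUs."""
    slots = usable_cpus // cpus_per_instance
    layout = {}
    for i in range(instances):
        start = (i % slots) * cpus_per_instance
        layout[i] = list(range(start, start + cpus_per_instance))
    return layout


def cores_used(layout):
    """Sorted list of physical cores touched by any instance."""
    return sorted({physical_core(c) for cpus in layout.values() for c in cpus})


def shared_cores(layout):
    """Physical cores that host more than one instance."""
    by_core = defaultdict(set)
    for inst, cpus in layout.items():
        for c in cpus:
            by_core[physical_core(c)].add(inst)
    return {core: sorted(insts) for core, insts in by_core.items() if len(insts) > 1}


x4_layout = assign(instances=3, cpus_per_instance=1)    # "x4"-style: 1 CPU each
v12b_layout = assign(instances=3, cpus_per_instance=2)  # V12b-style: 2 CPUs each

for name, layout in (("x4", x4_layout), ("V12b", v12b_layout)):
    print(name, layout, "cores used:", cores_used(layout),
          "shared:", shared_cores(layout))
```

Under these assumptions, both layouts put 3 GPU-feeding instances on only 2 physical cores, so in each case one core ends up serving two instances while the other two cores of the i7 are left out.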
