Recent Posts

Pages: 1 2 [3] 4 5 ... 10
21
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Raistmer on 26 Aug 2016, 02:02:31 pm »
And data from GT720 on busy i5-3470 (high_prec timer enabled):

MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep0.exe -verb -nog :
  Elapsed 3018.575 secs, speedup: 46.57%  ratio: 1.87x
      CPU 358.599 secs, speedup: 90.99%  ratio: 11.10x
MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep1.exe -verb -nog :
  Elapsed 3024.707 secs, speedup: 46.46%  ratio: 1.87x
      CPU 326.494 secs, speedup: 91.80%  ratio: 12.19x
MB8_win_x86_SSE3_OpenCL_NV_SoG_STT.exe -verb -nog :
  Elapsed 3034.625 secs, speedup: 46.29%  ratio: 1.86x
      CPU 334.591 secs, speedup: 91.59%  ratio: 11.89x

Sleep0:class SleepQuantum:      total=5073.9668,   N=3152,   <>=1.609761,   min=0.011221858   max=8.9496584
Sleep1:class SleepQuantum:      total=3132.7358,   N=3153,   <>=0.99357305,   min=0.85221332   max=3.1896715
STT:    class SleepQuantum:      total=15702.391,   N=2136,   <>=7.3513065,   min=0.01114194   max=16.63485

Nothing new here, just support of prev conclusions.

GT720 on busy i5-3470 (timer at default after host power cycle):
MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep0.exe  :
  Elapsed 3095.420 secs, speedup: 45.21%  ratio: 1.83x
      CPU 268.571 secs, speedup: 93.25%  ratio: 14.82x
MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep1.exe  :
  Elapsed 3051.709 secs, speedup: 45.98%  ratio: 1.85x
      CPU 273.017 secs, speedup: 93.14%  ratio: 14.58x
MB8_win_x86_SSE3_OpenCL_NV_SoG_STT.exe  :
  Elapsed 3014.658 secs, speedup: 46.64%  ratio: 1.87x
      CPU 319.958 secs, speedup: 91.96%  ratio: 12.44x

Sleep0:class SleepQuantum:      total=38777.035,   N=1595,   <>=24.311621,   min=3.4529493   max=51.648083
Sleep1:class SleepQuantum:      total=24096.59,   N=1575,   <>=15.299422,   min=14.763614   max=15.852066
STT:class SleepQuantum:      total=13195.877,   N=2761,   <>=4.7793832,   min=0.012540359   max=23.805403

And here advantage of STT finally appeared. With sleep quantum only ~15ms STT remained on ~4-5ms range.

CPU idle:
MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep0.exe  :
  Elapsed 29005.994 secs, speedup: -413.41%  ratio: 0.19x(suspended through night)
      CPU 1725.527 secs, speedup: 56.64%  ratio: 2.31x
MB8_win_x86_SSE3_OpenCL_NV_SoG_Sleep1.exe  :
  Elapsed 3032.034 secs, speedup: 46.33%  ratio: 1.86x
      CPU 299.272 secs, speedup: 92.48%  ratio: 13.30x
MB8_win_x86_SSE3_OpenCL_NV_SoG_STT.exe  :
  Elapsed 3007.939 secs, speedup: 46.76%  ratio: 1.88x
      CPU 1729.599 secs, speedup: 56.54%  ratio: 2.30x

Sleep0:class SleepQuantum:      total=49.532238,   N=3197,   <>=0.015493349,   min=0.012275338   max=8.6543369
Sleep1:class SleepQuantum:      total=24069.621,   N=1575,   <>=15.282299,   min=4.3649426   max=15.597382
STT:    class SleepQuantum:      total=40.04808,   N=3215,   <>=0.012456635,   min=0.012152191   max=0.0152032

approx context switch overhead for i5-3470: 0.015493349ms-0.012456635ms=0.003036714ms~3us
22
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Mike on 24 Aug 2016, 04:51:08 pm »
Quote
Could it be power issues? Maybe more strong power supply needed?

A Corsair AX 750I should be enough.
I also tested a 850 Watt.

I only have rock solid components.
Mobo Asus Sabertooth
Corsair AX 750i PSU
Kingston RAM
Noctua Heat think
Sapphire tricool GPU

Believe me its the FX.
The FX only has 4 FPU`s but 8 physical CPU cores and since seti app uses mostly FPU you don`t need a calculator.
There is no room  left for OS specific operations.
That`s why i usually run seti on 4 cores only and it is fully loaded.
I can encode HD video on 8 cores for 24 hours without any issues cause its not FPU bound.
CPU dont exceed 55°C.
23
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Raistmer on 24 Aug 2016, 12:26:06 pm »
Host reseted after 5 minutes.
FX can`t use permanent all 8 cores.
Could it be power issues? Maybe more strong power supply needed?

stderr attached.
thanks. app processed OK some time, even found some spikes.
24
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Mike on 24 Aug 2016, 09:15:44 am »
I have to remove r3500 from this bench  because it doesn`t even start with all cores in use.

Ok for now, there is separate issue we just discovered...

Also sleep versions doesn`t even start.
Zero CPU usage on GPU task so i aborted after 5 minutes.
Not even wisgen started.

Please remove all wisgen tasks, run bench, await ~5mins, locate stderr.txt in ScienceApps folder and attach it as is.

Host reseted after 5 minutes.
FX can`t use permanent all 8 cores.
I told you that before.

stderr attached.
25
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Raistmer on 24 Aug 2016, 08:58:42 am »
I have to remove r3500 from this bench  because it doesn`t even start with all cores in use.

Ok for now, there is separate issue we just discovered...

Also sleep versions doesn`t even start.
Zero CPU usage on GPU task so i aborted after 5 minutes.
Not even wisgen started.

Please remove all wisgen tasks, run bench, await ~5mins, locate stderr.txt in ScienceApps folder and attach it as is.
26
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Mike on 24 Aug 2016, 08:56:46 am »
I have to remove r3500 from this bench  because it doesn`t even start with all cores in use.

Ok for now, there is separate issue we just discovered...

Also sleep versions doesn`t even start.
Zero CPU usage on GPU task so i aborted after 5 minutes.
Not even wisgen started.
27
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Raistmer on 24 Aug 2016, 08:52:43 am »
I have to remove r3500 from this bench  because it doesn`t even start with all cores in use.

Ok for now, there is separate issue we just discovered...
28
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Mike on 24 Aug 2016, 08:49:51 am »
I have to remove r3500 from this bench  because it doesn`t even start with all cores in use.
29
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Raistmer on 24 Aug 2016, 01:48:45 am »
That shows the need of fixed amount sleep in case of underloaded CPU.
GPU app has bigger priority so, if some free CPU resource awailable, it will be scheduled for exection there.
What strange is no differencies in STT and Sleep(0) behavior. From what I read on main forums Sleep(0) should return to the same process immediately so just spin with full CPU busy while STT should give up CPU slice always(wrong, only if there are ready threads on the same CPU). So, in SleepQuantum counter it should have bigger mean value (hard to imagine that with absolute most of 2704 occurencies process was exactly at the end of its current time slice). Nevertheless once can see VERY close mean times (<>) for Sleep(0) and STT. Strange. If so I don't see any advantage of STT at all  :-\
[NB: Windows time slice ~10-15 ms and STT mean is 0.0014 ms]
30
Discussion Forum / Re: Better sleep on Windows - new round
« Last post by Mike on 23 Aug 2016, 04:39:29 am »
r3500:class SleepQuantum:      total=2.8579862,   N=3,   <>=0.95266207,   min=0.93661302   max=0.97626472
Sleep0: class SleepQuantum:      total=4.8358912,   N=2704,   <>=0.0017884213,   min=0.00054984231   max=0.4228799
Sleep1: class SleepQuantum:      total=2148.8459,   N=1791,   <>=1.1998023,   min=0.86739361   max=3.0483601
STT: class SleepQuantum:      total=3.9076965,   N=2704,   <>=0.001445154,   min=0.0004952898   max=0.0027276319

The same question. CPU idle or busy? Or, maybe, single CPU core free only?
Sleep behavior strongly depends from host load that's I always ask for full description of test conditions.
And for prev run w/o sleep enabled - no explanation why these builds consume much more CPU  :o

Yep 7 cores were in use.
Pages: 1 2 [3] 4 5 ... 10
Powered by EzPortal