+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: New cruncher on the way  (Read 28580 times)

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
New cruncher on the way
« on: 21 Jan 2012, 06:50:51 am »
So back in October 2006, I put together a machine that I planned to make last five years and still be competitive to new stuff that comes along.  I went with a 2P Opteron setup.  Until the capacitors got tired of full-bore 24/7 for 4.5 years, it was doing a fantastic job of staying competitive.  I replaced the bulged caps and that fixed the problem for a few months, but then it came back.  Replaced more caps and it didn't change anything at all.  I kind of limped along the past three months with anticipation of building a new rig, but the funds didn't cooperate.

Finally got the funds, and went with a nice new setup:

ASRock 970 EXTREME3
AMD FX-6100 Zambezi 3.3GHz
G.SKILL Ripjaws Series 8GB (2 x 4GB) 240-Pin DDR3 1600 (PC3 12800) x 2

Seems like a pretty nice setup.  I kept a lot of my old setup, with the exception of my OS drive (was a PATA 80gb WD Caviar).  Boards these days don't have PATA at all, and I've been wanting to move over to SATA for my OS drive anyway.  Dug around in my pile of computers and found an 80gb SATA drive.  Ran a lot of tests on it and there are zero bad sectors, so that's all I needed.  My plan is to use Acronis or something similar and copy it over to this SSD that I plan on getting sometime in the near future, when funds permit.

Then I'll upgrade my 8800GT to something like an HD 6970.  I'll also need to go for a new power supply as this one is 5 years old and has been at 50-75% load 24/7.  It seems fine, but the potential for failure is getting pretty high.  Not to mention that the mobo reports the 12v rail as 11.3, 5v as 4.35, 3.3 as 3.1.  Since they're all low, it may just be the board is reporting low, or it could just be that the PSU is getting weak.

So I got it all put together pretty easily.  I was ecstatic to see that when you enter BIOS to change settings.. you can use the mouse, and it is 1024x768.  That was pretty cool.  Disabled Cool'n'Quiet because I'm going to be crunching and need full-speed.  After I got Windows installed and pulled up CPU-Z, I noticed I was only running at 1.4GHz.  Pulled up something to load the CPU and it jumped to 3.3.  So I guess even when you disable cool'n'quiet.. it's still enabled.  Might just have to go to the overclocking section and punch in what it should be.

[rant]

So I was hoping to get BOINC installed and start attempting to build an AP-only cache, but there's just too many problems.  For starters, I'm going to RMA the board first-thing Monday.  During the Windows installation, I had sound coming out of the speakers.  Once I was at the desktop, I started installing drivers for everything.  After installing the drivers for the onboard sound, I was getting constant balloon notifications about "you have just plugged an audio device in" and "you have just unplugged an audio device."  On like half-second intervals.  Constantly.  Pulled up the Realtek HD Audio Manager and it shows you the ports either bold or faded for occupied/vacant.  Well instead of the green port changing states, all six of them were doing all kinds of random state-changes.  Unplugged the speakers, and it stopped.  Plugged it back in, crazy crap.

My previous board had a screwy issue with the Realtek sound.  The drivers from the board manufacturer didn't work right, so I ended up using the reference set from the Realtek website and that fixed it.  Figured I'd give that a try this time.  Didn't make any difference at all.  Uninstalled the sound card in device manager and let it go back to the default windows drivers and that still didn't fix it.  I think the board is just defective or something.

In addition to that, I think one of the USB2.0 stacks on the rear panel are defective.  I plugged a cord in for my camera (nothing plugged in at the other end), and there was an electrical arc/pop and two of my HDDs spun down and came back up.  So that's a bad socket if I can't handle having something plugged in without arcing and shorting out.  Also, I have singing capacitors.  They are almost silent if I disable the onboard NIC, but the volume and note they sing changes depending on CPU and general system load.  I don't know if it is related to the possibly low voltage output from the PSU, but it just seems there's too many problems and I'm not going to just "make do" with it.

I would just contact ASRock about it and try to RMA through them, buuuut.. they just started a Chinese holiday for the next 10 days, so they're entirely useless until the 30th.  I'm hoping that if I can RMA it with Newegg on Monday and ship it to them, I can hopefully get the replacement Friday, but that's probably very wishful thinking.

Then during all of my installations and configurations, the OS drive would go into this read/write cycle where the entire system would become unresponsive (even the mouse) for exactly 300 seconds and then be fine again.  I disabled about 10 services that are basically entirely useless and it was still doing it.  Finally figured out that it was Homegroup.  Left the homegroup and disabled the services for that.  Problem went away.  Of course it didn't help that while I was trying to set up my email accounts in Outlook, Homegroup was doing whatever-the-hell it was doing, and at some point during updates, Windows decided to change my update setting from "check for updates and notify for download and install" to "automatic download and install, with automatic restart".  So I'm trying to import my Outlook archive file which was about 1.3gb, .NET 4 is doing an install in the background, and Homegroup is hosing everything up.  Damn that was an annoying hour and a half.

[/rant]

So I'm finishing up with installing everything but BOINC and getting everything configured the way I want it.  Since it's going to be the same model mobo and all the same hardware, I won't have to worry about starting all over again when the replacement shows up.  Also, since I haven't been able to play games for the past 3 months, I'm going to install them but not play them until I get a working replacement board, otherwise it'll just make that week (or so) just pure agony.

Just felt like sharing my woes.

Offline Mike

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 2427
Re: New cruncher on the way
« Reply #1 on: 21 Jan 2012, 07:14:52 am »

Hi cosmic.

I have the 990 FX Extreme 3 with a FX 8150.
Did you update your Bios ?
Also check your settings in Bios.
I couldn´t run it with default settings.
Have set up everything to manual since nearly no value did match requirements.

Mike

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
Re: New cruncher on the way
« Reply #2 on: 21 Jan 2012, 07:31:01 am »
Board came with the newest BIOS version already.  Regarding the CPU being throttled even though that feature is disabled is one that I will have to manually overclock it to the stock value.  However, BIOS settings won't fix the defective USB port and sound card.

Offline Mike

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 2427
Re: New cruncher on the way
« Reply #3 on: 21 Jan 2012, 08:52:44 am »

Regarding CPU throttling you need to disable CPU Load Line Calibration.

Mike

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
Re: New cruncher on the way
« Reply #4 on: 21 Jan 2012, 10:10:22 pm »
Finally just went through and got the WEI scores.  Not bad, but the HDD brings the whole score down.

Processor: 7.3
RAM: 7.6
Graphics: 6.9
Gaming graphics: 6.9
primary disk: 5.4

Haven't done the suggestion for fixing the throttling yet.

Offline skildude

  • Knight o' The Round Table
  • ***
  • Posts: 168
Re: New cruncher on the way
« Reply #5 on: 22 Jan 2012, 12:50:16 am »
amd throttles the CPU and has turbo when its working hard.  To stop the throttling just go into the Catalyst control center. Under performance click the CPU POWER.  Then simply slide the lowest setting to the far right.  This forces the CPU to stay at its max speed

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
Re: New cruncher on the way
« Reply #6 on: 22 Jan 2012, 01:16:21 am »
amd throttles the CPU and has turbo when its working hard.  To stop the throttling just go into the Catalyst control center. Under performance click the CPU POWER.  Then simply slide the lowest setting to the far right.  This forces the CPU to stay at its max speed
Yeah, I don't have Catalyst.  According to AMD, there is no 900-series chipset.  The drivers that are on the ASRock website (and on the CD) are labeled as "AMD all in 1 driver ver: 8.70a_WHQL", and there are many references to 8xx in the filenames and folder structures within.  I'm thinking the 900-series is 800-based with a few new features or something, but isn't different enough to need different drivers?  I don't know.

By the way, disabling spread spectrum in UEFI setup did not force it into full speed.  I did also disable turbo, because as I read for how it operates, it will turn two or three cores completely off to turbo the remaining ones.

Offline skildude

  • Knight o' The Round Table
  • ***
  • Posts: 168
Re: New cruncher on the way
« Reply #7 on: 22 Jan 2012, 09:29:54 am »
My Biostar TA 990 has a v8.71.  For giggles I checked the 890 mobo from biostar and its chipset was v8.70

Offline Mike

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 2427
Re: New cruncher on the way
« Reply #8 on: 22 Jan 2012, 09:44:14 am »
Regarding CPU throttling disable  “CPU Thermal Throttle” in UEFI.

Please disable Cool’n’Quiet.

Please try different settings for “CPU Load-Line Calibration” and “AMD Turbo Core Technology” in UEFI.

Settings in CCC doesn´t prevent CPU throttling.


Mike


Offline skildude

  • Knight o' The Round Table
  • ***
  • Posts: 168
Re: New cruncher on the way
« Reply #9 on: 22 Jan 2012, 02:12:09 pm »
It sure doesn't I hadn't noticed it because I was running Seti full time so I never dropped down

Offline Mike

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 2427
Re: New cruncher on the way
« Reply #10 on: 22 Jan 2012, 05:10:22 pm »

Settings in CCC are similar to power saving settings which only take effect if some cores are idle.
That doesn´t happen while seti is running.

CPU Thermal Throttle and CPU Load Line callibration prevent overheating the CPU by reducing core voltage.

I tested it while overclocking.
4.3 GHZ was just to much for my heatsink, so i ordered a better one.

Mike

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
Re: New cruncher on the way
« Reply #11 on: 22 Jan 2012, 07:09:09 pm »
So.  Since I'm going to be calling in the morning and getting an RMA set up, I was thinking "what am I going to do in the meantime?  using the laptop is just weird.  OO! I know.. set the laptop up ontop of my case, hook the USB keyboard and mouse up, hook the monitor up, and bam.

Then I got an even better idea.  I hadn't nuked my old PATA C drive, and there really wasn't anything stopping me from just putting the old setup back together.  Well, except for two bulged caps.  Replaced those (again) and well.. here I am, back on the old setup.  I can do a lot of stuff, except for write to the array at all, or play games.  I can watch TV shows, listen to music, check email and frequent some forums until the replacement board comes back.

Offline Cosmic_Ocean

  • Knight o' The Realm
  • **
  • Posts: 71
Re: New cruncher on the way
« Reply #12 on: 24 Jan 2012, 03:51:20 am »
Sent the board off today.  I'm thinking Tuesday (31st) to have the replacement board.  After reading the reviews for it (like I should have done in the first place), it seems it's a real crapshoot as to whether there is going to be a problem with it or not.  And the problems range all over the map; there's no one specific common issue.  I've heard (from fellow nerds) of ASRock boards being pretty durable and reliable along with being affordable, so I figured I'd give it a shot.  We'll see what happens with the replacement.

I do have a question about the opt apps though.  If I install BOINC but don't attach to any projects, and then create the project folder in the data directory and put my app_info and EXE in there, THEN attach to the project, will that work right away?  My goal is to keep my application details from having any references to stock apps, so I don't want to attach and get assigned work right away after the stock downloads.  I want the new machine's app details to look like my single core machine's does.  It has been running an app_info since before the app details came along and therefore has no references to stock apps.


[unrelated background information]

In the meantime, I'm doing a lot of data sorting/archiving to clear out enough space on the externals to hold the contents of the array for a few hours.  Some time ago (I think it was ~2 years), one drive punted itself from the array randomly.  Normally, that's not really that big of a problem (it's a 4x500gb raid-5).  I just go in and delete the ghost raidset that thinks three disks are missing, then designate that disk as a hot spare, and the controller pulls it in and auto-rebuilds.

Well that one time two years ago, disk 2 punted itself, and for some stupid reason, I pulled the side panel off and touched one of the cables just right and two more removed themselves from the array as well.  So then I had three ghost raidsets and the one correct one.  Since each disk reported that it was a member of a raidset and lost contact with all the others, it was technically destroyed and not recoverable.  I did a lot of searching around on the Internet and found that in the BIOS setup for the card, there's supposed to be an option for rebuilding where the initialize method is "None (Rescue)."  Well mine didn't have that.  After some more research, I found that particular option in the BIOS setup didn't come along until the next firmware revision after the original base release that my card has.  However, the web interface (localhost:81) has the option in there.

So the fix for this scenario was to delete ALL of the volumesets and raidsets, create a new raidset with exactly the same parameters as before, but make sure initialize was set to Rescue.  After an adrenaline explosion and many butterflies in my stomach, I hit the "confirm operation" button.  It worked.  The controller didn't automatically do it, but I went and told it to do a consistency check just to make sure everything was good.  Ended up not having any problems once it was done.  Fortunately there wasn't any write operations going on when I had my moment of stupidity.

But in the aftermath of that issue, the name for the raidset that appears is the same as it was before, except instead of #00, it is #R-00, R for "rescued."  I had originally set it up for a 16KB stripe size, but I think I can get much better performance if I go to 64KB.  I believe this because when I do read benchmarks, or just read large files from it, it tops out at 210MB/sec.  Now the first time I had a disk punt itself, I thought for sure it was just a failed disk, so I panicked and ordered a new one from Newegg, and overnight shipping was only $6 more than the regular shipping that was like $2.49 or something like that.  While I was waiting for the new disk to arrive, I put the "failed" disk into another machine and ran the WD DLG tools on it, specifically the long test that looks for errors.  It found none.  Did the short pass of zeroes (first and last million sectors), put it back into the main rig, booted up and told the controller it was a hot spare.  Automatic rebuild.

Worked fine for a long time.  But now I had a "spare" disk laying around.  Of course I turned it into a storage disk that could be used as a raid spare (after swapping the cable from onboard to the card).  Same model and everything, and that disk gets 115MB/sec on reads in the first 50GB, and doesn't drop below 100MB/sec until about 125-150GB.  Even near the end, it's still pulling 65-70MB/sec.  So since raid-5 on reads is basically a raid-0 stripe, I should be getting 300+ at the beginning, and well over 150 at the very end.  But since I've only got 16KB for a stripe size, I max out at 210 at the beginning, and it drops down to 105 at the end, with a 175 average.  That's still pretty good, but it should easily be twice what I'm getting.  Of course I filled the array up too quickly and ran out of storage space elsewhere to rebuild it with a larger stripe size.

It sucks that HDD prices are sky-high right now, too.  I was looking forward to picking up a 2TB drive to put in my external enclosure (it's a 5-bay trayless backplane on eSATA), so now my only option to fix my array is to burn stuff onto DVD+R's to clear out enough space to hold the array contents.  I think I might just make enough room.. it'll be close.  I've also wanted to upgrade the firmware on the raid card, but I'm not totally certain--nor confident--that it won't destroy the array in the process.  Safest thing to do is have the data backed-up and then upgrade the firmware.  Upgrading the firmware will give me a lot of new options.. such as "no alarm on fan fail."  I've gone through five 40mm chipset fans in the 4 years I've had this card.  I finally just said "enough!" and put the passive heatsink that came with the card on it, but with no fan, the alarm goes off.  Yay old firmware.. ::)

[/unrelated background information]

Okay, so I don't really expect anyone to spend 5-10 whole minutes reading all of that, but I felt like explaining.  If you read all of it.. I totally gave you an opt-out at the beginning.  :P

Offline arkayn

  • Janitor o' the Board
  • Knight who says 'Ni!'
  • *****
  • Posts: 1230
  • Aaaarrrrgggghhhh
    • My Little Place On The Internet
Re: New cruncher on the way
« Reply #13 on: 24 Jan 2012, 07:42:01 am »
I think your best bet would be to copy the old folder over, clean out all the files but leave all the folders in place.

Offline Claggy

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 3111
    • My computers at Seti Beta
Re: New cruncher on the way
« Reply #14 on: 24 Jan 2012, 08:40:07 am »
I do have a question about the opt apps though.  If I install BOINC but don't attach to any projects, and then create the project folder in the data directory and put my app_info and EXE in there, THEN attach to the project, will that work right away?  My goal is to keep my application details from having any references to stock apps, so I don't want to attach and get assigned work right away after the stock downloads.  I want the new machine's app details to look like my single core machine's does.  It has been running an app_info since before the app details came along and therefore has no references to stock apps.
You can try attaching with an app_info and apps already popolated in the setiathome directory,

If i don't want to get Stock apps when i attach a host, i Deselect Use CPU & Use Nvidia GPU in my Default Project Preferences,
then when Boinc does the Prroject Initialisation it finds it can't ask for CPU or Nvidia GPU work, so it gets no Stock work, then you can do the app_info and apps,

Claggy

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 355
Most Online Ever: 983
(20 Jan 2020, 03:17:55 pm)
Users Online
Members: 0
Guests: 199
Total: 199
Powered by EzPortal