+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: optimized sources  (Read 615664 times)

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #285 on: 05 Feb 2008, 05:33:42 pm »
Hi all who are reading here...
long time no post....we are working  :)

heinz

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #286 on: 04 Mar 2008, 03:33:41 pm »
Hi all,
we are still working...
Today one year in the forum now  ;)

heinz

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #287 on: 19 Mar 2008, 04:00:09 pm »
We are still there and the work is going on.

Happy Eastern
Joyeuse Paques 

heinz

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #288 on: 22 Mar 2008, 12:24:15 pm »
Hi,
during all the developement time(one year now) I'm crunching parallel 24/7, all on one machine, a  Northwood P4 2.66Mhz.
Since august 2007 (25566 at 2007-08-11) I'm going backwards in the "World Position List" and the curve looks like a typical "bathtub curve" as you can see here
this means much more new users has better machine like mine.
« Last Edit: 22 Mar 2008, 12:29:25 pm by seti_britta »

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #289 on: 22 Mar 2008, 12:30:51 pm »
Heinz, keep in mind i did pay Aus$197 for my Wolfdale  , which is really a piece of sand about 0.25 cm squared.  That is more expensive, yet more useful than gold...

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #290 on: 22 Mar 2008, 12:46:37 pm »
At the moment the price of a Intel Xeon E5405 activ (BX80574E5405A) Harperton Quadcore/2000MHz, L2 Cache 12MB, Bustakt 1333, Socket LGA 771 Boxed(80Watt) is still 176,90 Euros, thats a very interesting alternative to build a 8 core.
But these dualboards are very expensive.....
heinz
modify:
178.00 Euros = 296.34 Australian dollars 
 
Exchange rate: 1.664847 
Rate valid as of: 22/3/2008 
-----------------------------------------
here in europa is the wolfdale E8400 2x3.0GHz BOX 6MB with 200 Euro more expensive than a Xeon E5405 Quadcore 2GHz 178,62 Euro
this give me some thinking   :o
« Last Edit: 22 Mar 2008, 01:24:33 pm by seti_britta »

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #291 on: 22 Mar 2008, 01:15:51 pm »
Heinz, keep in mind i did pay Aus$197 for my Wolfdale  , which is really a piece of sand about 0.25 cm squared.  That is more expensive, yet more useful than gold...

197.00 Australian dollars = 118.33 Euros 
 
Exchange rate: 0.600656 
Rate valid as of: 22/3/2008 
-----------------------------------------
Intel Core2 Duo E8400 2x3.0GHz BOX 6MB
199,90 €
here in Germany ---> http://www.kmelektronik.de/
200.00 Euros = 332.97 Australian dollars 
 
Exchange rate: 1.664847 
Rate valid as of: 22/3/2008 
-------------------------------------------

you got a very hot price in australia   ::)


heinz
« Last Edit: 22 Mar 2008, 01:35:41 pm by seti_britta »

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #292 on: 22 Mar 2008, 08:27:40 pm »
Yeah Not anymore Though, I checked at the same place I bought my E8400:

    " Intel Core 2 Duo E8400 (3.00Ghz/6MB/1333FSB/EMT64/XD/Dual Core)
**SHORT SUPPLY. NEXT SHIPMENT DUE MID TO LATE APRIL** "  $399

 :o

Q6600 Quads are now $299 at the same place

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #293 on: 22 Mar 2008, 10:32:29 pm »
Sometime I'm not sure what todo, SkullTrail yes/no with E5405 (178 Euro)
or better wait for Nehalem and Next Generation Intel Microarchitecture
or meanwhile a cheaper resulution:  board XFX GeForce 7150/MCP630i (70 Euro) no graphiccard necessary, with a Intel Core2 Quad Q9450 4x2.67GHz BOX (300 Euro), case ram disk..all together ca 680 Euro for the hardware + software XP Professional (130 Euro) for testing our parallel stuff.....
« Last Edit: 23 Mar 2008, 08:51:24 am by seti_britta »

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #294 on: 23 Mar 2008, 08:09:19 am »
Sometime I'm not sure what todo, SkullTrail yes/no with E5405 (178 Euro)
or better wait for Nehalem
or meanwhile a cheaper resulution:  board XFX GeForce 7150/MCP630i (70 Euro) no graphiccard necessary, with a Intel Core2 Quad Q9450 4x2.67GHz BOX (300 Euro), case ram disk..all together ca 680 Euro for the hardware + software XP Professional (130 Euro) for testing our parallel stuff.....

Honestly Heinz, I'd say it'd be difficult to go past the Q6600 at the moment.  I'd guess the Yorkfields are being held off 'till the stock of those clears a bit.  Then the Yorkfields will be awesome [If this wolfdale is anything to go by].  I am getting the feeling that the Nehalem architecture will be a fairly radical departure from what we're used to, and it may take some time for the software to follow.  Perhaps something like the OpenMP standard gives some insight there,  many cores with shared memory.

Jason
« Last Edit: 23 Mar 2008, 08:11:36 am by Jason G »

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #295 on: 23 Mar 2008, 09:03:57 am »
Hi Jason,
please read Next Generation Intel Microarchitecture with Intel QuickPath Architecture it is a revolution and a step forward to the next generation.  :o
under this circumstances is it better to wait for it and meanwhile use a cheaper solution as described before.
please give feedback if you have read it.

heinz
« Last Edit: 23 Mar 2008, 09:24:04 am by seti_britta »

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #296 on: 23 Mar 2008, 09:18:50 am »
That pretty much describes the reasons I chose to settle for a Wolfdale Right now:
  - I needed a fairlly inexpensive upgrade (well it was at the time cheap, now they have gone too high in price here)
  - My other jobs require me to support development of SSE4.1 functionality
  - My current system has limited power & cooling considerations (45 degree C desert heat, 1 10amp power circuit @240Vac, for everything)

 So the dual core suits my needs perfectly for now, but I recognise the software infrastructure required to make better use of parallelism is improving immensely, both through fine grained and thread level approaches,. So my next upgrade will probably be either Nehalem or the 32nm variant after Nehalem, depending on what my work demands.. I saw the name but don't remeber it of the next generation...
« Last Edit: 23 Mar 2008, 09:21:23 am by Jason G »

Gecko_R7

  • Guest
Re: optimized sources
« Reply #297 on: 23 Mar 2008, 12:20:27 pm »
Sometime I'm not sure what todo, SkullTrail yes/no with E5405 (178 Euro)
or better wait for Nehalem
or meanwhile a cheaper resulution: board XFX GeForce 7150/MCP630i (70 Euro) no graphiccard necessary, with a Intel Core2 Quad Q9450 4x2.67GHz BOX (300 Euro), case ram disk..all together ca 680 Euro for the hardware + software XP Professional (130 Euro) for testing our parallel stuff.....

Honestly Heinz, I'd say it'd be difficult to go past the Q6600 at the moment. I'd guess the Yorkfields are being held off 'till the stock of those clears a bit. Then the Yorkfields will be awesome [If this wolfdale is anything to go by]. I am getting the feeling that the Nehalem architecture will be a fairly radical departure from what we're used to, and it may take some time for the software to follow. Perhaps something like the OpenMP standard gives some insight there, many cores with shared memory.

Jason


I think Nehalem will be a quite expensive upgrade/transition for a bit when all costs are factored.
It's LGA1366 so factor a brand new Mobo.
Also, DDR3 is almost a given to take advantage of the new arch.

Likely to be massive price gouging and very little supply initially...ltd mobo options, and buggy release bios's.  Wouldn't be surprised for it to be at least Q2 09' before we see decent pricing and availability for us mere mortals that have budgets to consider.
We should also see some nice price drops on Penryn/Yorkfield and perhaps a new stepping as Nehalem is released.

Hard to argue against the current value and mobo selection of c2d & quadcore chips.
Pretty cheap $$$ / performance ratio.

Cheers!

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #298 on: 23 Mar 2008, 01:17:34 pm »
Likely to be massive price gouging and very little supply initially...ltd mobo options, and buggy release bios's.  Wouldn't be surprised for it to be at least Q2 09' before we see decent pricing and availability for us mere mortals that have budgets to consider.
We should also see some nice price drops on Penryn/Yorkfield and perhaps a new stepping as Nehalem is released.

Well priorities have a way of shifting depending on need.  As you point out p4's & AMDs of SSE2 vintage are still extremely popular according to boincstats, and dominate throughput in many respects.

What the tests seem to be showing is that Alex pretty well nailed the Core2 code, and unless we decide to tackle the other end there may be little left to do there for now (Unless, that is,  some of the relaxed validation requirements that have been spoken about are put in place, then the parallelism race may be back on in force). 

Early p4's have special characteristics to do with cache that aren't necessarily all that happy with techniques used in builds targeted for the core2 architecture.  There are speed improvement showing in the p4(SSE3) I tested, but not as great as the core2 improvements.  There might be plenty of room to tweak that and the SSE3 instructions may as well be macro encapsualted while we're there, allowing SSE2 substitution.

There is though possibly still quite a bit more opportunity to squeeze more performance from the core2 build first.  We have spoken about profile guided optimisations, which haven't been touched yet, and in fact no profiles have even been run yet to identify possible bottlenecks or problems with the build,  That is why, in my book,  it is still considered pre-alpha.  Valid results is one thing, but releasing substandard builds I'd rather leave to the software companies who have the excuse of pressure from the marketing department.

Jason

Gecko_R7

  • Guest
Re: optimized sources
« Reply #299 on: 23 Mar 2008, 01:33:30 pm »

Well priorities have a way of shifting depending on need. As you point out p4's & AMDs of SSE2 vintage are still extremely popular according to boincstats, and dominate throughput in many respects.

What the tests seem to be showing is that Alex pretty well nailed the Core2 code, and unless we decide to tackle the other end there may be little left to do there for now (Unless, that is, some of the relaxed validation requirements that have been spoken about are put in place, then the parallelism race may be back on in force).

Early p4's have special characteristics to do with cache that aren't necessarily all that happy with techniques used in builds targeted for the core2 architecture. There are speed improvement showing in the p4(SSE3) I tested, but not as great as the core2 improvements. There might be plenty of room to tweak that and the SSE3 instructions may as well be macro encapsualted while we're there, allowing SSE2 substitution.

There is though possibly still quite a bit more opportunity to squeeze more performance from the core2 build first. We have spoken about profile guided optimisations, which haven't been touched yet, and in fact no profiles have even been run yet to identify possible bottlenecks or problems with the build, That is why, in my book, it is still considered pre-alpha. Valid results is one thing, but releasing substandard builds I'd rather leave to the software companies who have the excuse of pressure from the marketing department.

Jason

Did you intend this rseponse to be attached to other thread?
« Last Edit: 23 Mar 2008, 01:35:54 pm by Gecko_R7 »

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 355
Most Online Ever: 983
(20 Jan 2020, 03:17:55 pm)
Users Online
Members: 0
Guests: 25
Total: 25
Powered by EzPortal