+- +-
Say hello if visiting :) by Gecko
11 Jan 2023, 07:43:05 pm

Seti is down again by Mike
09 Aug 2017, 10:02:44 am

Some considerations regarding OpenCL MultiBeam app tuning from algorithm view by Raistmer
11 Dec 2016, 06:30:56 am

Loading APU to the limit: performance considerations by Mike
05 Nov 2016, 06:49:26 am

Better sleep on Windows - new round by Raistmer
26 Aug 2016, 02:02:31 pm

Author Topic: optimized sources  (Read 548412 times)

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #555 on: 11 Mar 2010, 06:07:50 pm »
For 10? I think you need two million. Oh, and I thought you were German.  :)
Congrats anyway.
10 mio ? -->Target will be reached in 41.63 days on April 4 2010 (if i have no hardware outage)
 :)
9 Mio total today
10 mio ? -->Target will be reached in 18.28 days on March 30 2010
 ;)

Offline The Grinch

  • Knight o' The Realm
  • **
  • Posts: 58
    • www.Au-Ja.de
Re: optimized sources
« Reply #556 on: 12 Mar 2010, 06:55:16 am »
Kommt hier eigentlich mal was Produktives, oder wird hier nur das "optimale" System präsentiert?

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #557 on: 12 Mar 2010, 08:50:24 am »
Kommt hier eigentlich mal was Produktives, oder wird hier nur das "optimale" System präsentiert?
natürlich gibts was productives, wir arbeiten an speed optimierten astropulse Apps
mit den neuesten Technologien für CPU und GPU(NVIDIA uind ATI)
Ziel sind 50% speedup gegenüber der standard astropulse app.
downlods zum testen gibts hier nicht, sondern nur im PubliC Release Beta Testforum
http://lunatics.kwsn.net/14-public-release-beta-test-forum/index.0.html
final versionen gibts dann auf der startseite, wie immer.

@Grinch: as always, next time in english

Offline The Grinch

  • Knight o' The Realm
  • **
  • Posts: 58
    • www.Au-Ja.de
Re: optimized sources
« Reply #558 on: 13 Mar 2010, 01:40:50 am »
Das mag sein das es das Ziel ist, aber auf den meisten der 36 Seiten dieses Threads
les ich nur was von irgend welcher Hardware oder Software, die mal funktioniert oder auch nicht.

Und mein Englisch ist leider nicht so gut für dieses Forum, lesen klappt bescheiden.

AMpractice

  • Guest
Re: optimized sources
« Reply #559 on: 01 Apr 2010, 11:57:32 pm »
Quote
on my old xp-machine this older version 2.2 compiles complete without error. So I thought it is a good complete test.
...

some other projects(arprec) I tried did not have this problem... will try some others now too

with akv8 I believe I must use IPP5.3,  have still 5.2beta.
there is still something to install (ITBB and latest IPP)

some closer collaboration would I like  ;D

heinz
Quote

Hello:
I am a newbie in installing external packages (ARPREC) in VS2008 c++ on Windows XP.  I am having trouble in configuring Visual Studio 2008 to recognize arprec lib.  After running the arprec.vcproj and compiling the projects successfully, the runtime error was:
RegSvr32 Message:

No DLL name specified.

Usage:  regsvr32[/u][/s][/n][/i[:cmdline]]dllname

/u-Unregistered server

/s-Silent; display no message boxes

/i-Call DllInstall passing it an optional [cmdline]; when used with /u calls dll uninstall

/n-do not call DllRegisterServer; this option must be used with /i

If someone can provide the directions for installing ARPREC properly in MSVS2008 C++ on WinXP, it would be greatly appreciated.   The README for the pkg may not have been specific for Visual Studio.  Sorry for the newbie question.     :o

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #560 on: 02 Apr 2010, 09:44:13 am »
Hi AMpractice,

configure the project as /Release
if you have compiled the project sucessful, you must write your own batchfiles to execute the different programs. If you study the source, you can find which parameters the exe need to execute. This is the main object of the batchfiles.
You must copy all files from /Release including the arprec.lib, arprec.dll, arprec.exp into your testdirectory where the batchfiles are.
Write your own batchfiles is a good exercise to understunding the project.
On my website you can find the arprec samples behind some other interesting math.
Ask, if you have questions

Regards
heinz

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #561 on: 05 Apr 2010, 05:28:34 pm »
For 10? I think you need two million. Oh, and I thought you were German.  :)
Congrats anyway.
10 mio ? -->Target will be reached in 41.63 days on April 4 2010 (if i have no hardware outage)
 :)
9 Mio total today
10 mio ? -->Target will be reached in 18.28 days on March 30 2010
 ;)
got the 10 Mio on 4th of April  ;)

Offline sunu

  • Alpha Tester
  • Knight who says 'Ni!'
  • ***
  • Posts: 771
Re: optimized sources
« Reply #562 on: 06 Apr 2010, 02:17:50 pm »
got the 10 Mio on 4th of April  ;)

Very nice milestone!  :)

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #563 on: 11 Apr 2010, 04:37:04 pm »
Hi,
Since nearly a half of year I had have laying around a "W7 Ultimate 64".
Now its time to install:
- Updated my host machine to VMWare Server 2.02.:
- Installed now unsupported  :o "W7 Ultimate 64" as V8-VM7
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
We have now a 64Bit (V8-VM7)  "Windows 7" machine for Beta testing.

 ;)


Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #564 on: 14 Apr 2010, 10:36:18 am »
Hi Jason,
I tested  beta Unified installer on V8-VM7 , no problems on W7 Ultimate 64.
Installed AK_v8b_win_x64_SSE41 and ap_5.05r339_SSE, get work for AK_v8b and it runs.
 ;)

Offline Jason G

  • Construction Fraggle
  • Knight who says 'Ni!'
  • *****
  • Posts: 8980
Re: optimized sources
« Reply #565 on: 14 Apr 2010, 10:40:31 am »
Cheers Heinz.  This should really be released already (for a while), but I have some hectic schedule preventing my full attention to do things right. I hope to sort things out soon so we can move onto the next stage.

Jason

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #566 on: 15 Apr 2010, 08:49:50 pm »
how p8 compiles:
p8 - New Optimizations for 32-bit applications on 45nm Intel® Core™2 Duo (Penryn,Nehalem,Westmere) family processors
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1>------ Neues Erstellen gestartet: Projekt: ap_client, Konfiguration: AP_SSE2_CSP_QaxAVX Win32 ------
1>Deleting intermediate files and output files for project 'ap_client', configuration 'AP_SSE2_CSP_QaxAVX|Win32'.
1>AP SSE2_IPP_ICC_CSP_QaxSSE2_Qparallel_MKLP_BLANKIT_O2_Oii Win32 (Microsoft VC++ Environment)
1>Generating new BuildInfo
1>SubWCRev: 'C:\I\SC\apwk\ap_ICCIPP'
1>Last committed at revision 374
1>Updated to revision 374
1>Local modifications found
1>
1>APREV IS 374
1>Deleting old output files
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_schema.cpp
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_fold.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>intrinsics.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_fileio.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_science.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>sbtf.cpp
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_remove_radar.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_debug.cpp
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_client_main.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>version.cpp
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>mtrand.cpp
1>dm_chunk_parallel.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_gfx_main.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Compiling with Intel(R) C++ 11.1.060 [IA-32]... (Intel C++ Environment)
1>ap_timer.cpp
1>-----USE_AVX activ-----
1>-----AVX undefined in x32, p8 used -----
1>Linking... (Intel C++ Environment)
1>xilink: executing 'link'
1>Embedding manifest... (Microsoft VC++ Environment)
1>AP SSE2_IPP_ICC_CSP_QaxSSE2_Qparallel_MKLP_BLANKIT_O2_Oi Win32 (Microsoft VC++ Environment)
1>Post Build revision number extraction
1>
1>APREV IS 374
1>Copy/Renaming Output Files
1>
1>Build log was saved at "file://C:\I\SC\apwk\ap_ICCIPP\client\WinBuild\ICC11_2k8\Win32\Output_ext\ap_client\AP_SSE2_CSP_QaxAVX\Intermediate\BuildLog.htm"
1>ap_client - 0 error(s), 0 warning(s)
========== Alles neu erstellen: 1 erfolgreich, Fehler bei 0, 0 übersprungen ==========
This is a build with dispatcher and include optimized code for the selected processor
from min SSE2 to SSE4.2
Its a Qax and run on any processor with SSE2 and above and Pentium3, and all Athlon with min SSE2
resultfile attached
A full 64Bit AVX build is in preparation  ;)

[attachment deleted by admin]

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #567 on: 01 May 2010, 04:59:14 pm »
Hi all,

update6 for parallel composer and ICSP is out. A lot todo with updates.

Composer update6
1>Compiling with Intel(R) C++ Compiler 11.1.082 [IA-32]... (Intel C++ Environment)

ICSP update6
1>Compiling with Intel(R) C++ 11.1.065 [IA-32]... (Intel C++ Environment)


thanks to all readers of this epic thread...

by the way, got "Brown Belt" from Intel today  ;)

and 11 Mio today: 1st of Mai
Current Credit (based on incremental update) 11,016,471.91

regards
 
« Last Edit: 01 May 2010, 06:53:21 pm by _heinz »

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #568 on: 25 May 2010, 09:42:08 pm »
Hi,
the new driver 197.45 WHQL added Open CL 1.0  support to all devices upto Geforce 8 and later GPU's
Fügt Unterstützung für OpenCL(Open Computing Language) 1.0 für alle GeForce 8-Serie und später GPUs.
Version: 197.45 WHQL
Freigabedatum: 2010.04.13
Betriebssystem: Windows 7 64-bit
Sprache: Deutsch
Dateigröße: 143 MB 

 ;)
http://www.nvidia.de/object/win7_winvista_64bit_197.45_whql_de.html

really great to get OCL this way  ;D

Offline _heinz

  • Volunteer Developer
  • Knight who says 'Ni!'
  • *****
  • Posts: 2117
Re: optimized sources
« Reply #569 on: 26 May 2010, 09:28:39 am »
And does it really worked you can see here:
~~~~~~~~~~~~~~~~~~~~~~~~~~
oclDeviceQuery.exe Starting...

OpenCL SW Info:

 CL_PLATFORM_NAME:      NVIDIA CUDA
 CL_PLATFORM_VERSION:   OpenCL 1.0 CUDA 3.0.1
 OpenCL SDK Revision:   5537818


OpenCL Device Info:

 1 devices found supporting OpenCL:

 ---------------------------------
 Device ION
 ---------------------------------
  CL_DEVICE_NAME:                       ION
  CL_DEVICE_VENDOR:                     NVIDIA Corporation
  CL_DRIVER_VERSION:                    197.45
  CL_DEVICE_TYPE:                       CL_DEVICE_TYPE_GPU
  CL_DEVICE_MAX_COMPUTE_UNITS:          2
  CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS:   3
  CL_DEVICE_MAX_WORK_ITEM_SIZES:        512 / 512 / 64
  CL_DEVICE_MAX_WORK_GROUP_SIZE:        512
  CL_DEVICE_MAX_CLOCK_FREQUENCY:        1100 MHz
  CL_DEVICE_ADDRESS_BITS:               32
  CL_DEVICE_MAX_MEM_ALLOC_SIZE:         128 MByte
  CL_DEVICE_GLOBAL_MEM_SIZE:            241 MByte
  CL_DEVICE_ERROR_CORRECTION_SUPPORT:   no
  CL_DEVICE_LOCAL_MEM_TYPE:             local
  CL_DEVICE_LOCAL_MEM_SIZE:             16 KByte
  CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE:   64 KByte
  CL_DEVICE_QUEUE_PROPERTIES:           CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE
  CL_DEVICE_QUEUE_PROPERTIES:           CL_QUEUE_PROFILING_ENABLE
  CL_DEVICE_IMAGE_SUPPORT:              1
  CL_DEVICE_MAX_READ_IMAGE_ARGS:        128
  CL_DEVICE_MAX_WRITE_IMAGE_ARGS:       8
  CL_DEVICE_SINGLE_FP_CONFIG:           INF-quietNaNs round-to-nearest round-to-
zero round-to-inf fma

  CL_DEVICE_IMAGE <dim>                 2D_MAX_WIDTH     8192
                                        2D_MAX_HEIGHT    8192
                                        3D_MAX_WIDTH     2048
                                        3D_MAX_HEIGHT    2048
                                        3D_MAX_DEPTH     2048

  CL_DEVICE_EXTENSIONS:                 cl_khr_byte_addressable_store
                                        cl_khr_icd
                                        cl_khr_gl_sharing
                                        cl_nv_d3d9_sharing
                                        cl_nv_d3d10_sharing
                                        cl_nv_d3d11_sharing
                                        cl_nv_compiler_options
                                        cl_nv_device_attribute_query
                                        cl_nv_pragma_unroll
                                        cl_khr_global_int32_base_atomics
                                        cl_khr_global_int32_extended_atomics


  CL_DEVICE_COMPUTE_CAPABILITY_NV:      1.1
  NUMBER OF MULTIPROCESSORS:            2
  NUMBER OF CUDA CORES:                 16
  CL_DEVICE_REGISTERS_PER_BLOCK_NV:     8192
  CL_DEVICE_WARP_SIZE_NV:               32
  CL_DEVICE_GPU_OVERLAP_NV:             CL_FALSE
  CL_DEVICE_KERNEL_EXEC_TIMEOUT_NV:     CL_FALSE
  CL_DEVICE_INTEGRATED_MEMORY_NV:       CL_TRUE
  CL_DEVICE_PREFERRED_VECTOR_WIDTH_<t>  CHAR 1, SHORT 1, INT 1, LONG 1, FLOAT 1,
 DOUBLE 0


oclDeviceQuery, Platform Name = NVIDIA CUDA, Platform Version = OpenCL 1.0 CUDA
3.0.1, SDK Revision = 5537818, NumDevs = 1, Device = ION

System Info:

 Local Time/Date = 15:24:44, 5/26/2010
 CPU Arch: 0
 CPU Level: 6
 # of CPU processors: 2
 Windows Build: 6002
 Windows Ver: 6.0


PASSED


Press <Enter> to Quit...
-----------------------------------------------------------

 

Welcome, Guest.
Please login or register.
 
 
 
Forgot your password?
Members
Total Members: 97
Latest: ToeBee
New This Month: 0
New This Week: 0
New Today: 0
Stats
Total Posts: 59559
Total Topics: 1672
Most Online Today: 40
Most Online Ever: 983
(20 Jan 2020, 03:17:55 pm)
Users Online
Members: 0
Guests: 31
Total: 31
Powered by EzPortal