Update Readme: CUDA, CMake #25

ax3l · 2019-12-06T01:05:10Z

Update the readme with CMake notes, CUDA-focus of the fork, newer instructions and authors.
Remove old Makefile, we do not use nor support that.

Related to #22 #24

ax3l · 2019-12-06T01:06:08Z

Makefile

-
-#if you are using AMD GPUs, uncomment the following line and set the install path correctly 
-#TARGET=ocl_memtest
-AMD_INSTALL_PATH ?=/usr/local/ati-stream-sdk-v2.1-lnx64/


wow, I am old enough to remember ATI stream.

I did use it in the good old times xD

Update the readme with CMake notes, CUDA-focus of the fork, newer instructions and authors.

ax3l · 2019-12-06T01:29:25Z

cc @grische @sbastrakov

sbastrakov · 2019-12-06T08:05:45Z

README.md

+
+### Compile
+
+Inside the source directory, run:


Don't we normally recommend an out-of-source build? Guess for this in-source should be fine when done like described here.

Running in a quick-&-dirty but empty dir is not perfect but good enough. Simplifies instructions below.

sbastrakov

Great! Some language nitpicking, in case you agree to it I can implement myself.

sbastrakov · 2019-12-06T08:08:24Z

README.md

+	the address wires. 
+
+Test 1 `[Own address test]`  
+	Each Memory location is filled with its own address. The next kernel checks if the 


Why is Memory capitalized?

sbastrakov · 2019-12-06T08:09:14Z

README.md

+
+### Known Issues
+
+* If your machine is cuda 2.2, killing the program while it is running test 10 (the memory stress test) could result 


machine is cuda 2.2 should perhaps be machine runs cuda 2.2 ?

sbastrakov · 2019-12-06T08:11:03Z

README.md

+### Known Issues
+
+* If your machine is cuda 2.2, killing the program while it is running test 10 (the memory stress test) could result 
+  in your GPUs in bad state. This is a bug from the nvidia driver. A detailed description can be found in 


I think there is a word missing between GPUs and in, like being or remaining.

nvidia is also written as Nvidia, both versions are in several places.

Or retro NVidia or nVidia? Nobody knows

sbastrakov · 2019-12-06T08:14:03Z

README.md

+Then we exit the kernel so that the memory can be flushed. Then we start a new kernel to read
+and check if the value matches the pattern. An error is recorded if it does not match for each 
+memory location. In the same kernel, the compliment of the pattern is written after the checking. 
+The third kernel is launched to read the value again and checks against the compliment of the pattern. 


checks -> check ?

sbastrakov · 2019-12-06T08:14:38Z

README.md

+### Detailed Description
+
+Test 0 `[Walking 1 bit]`  
+	This test changes one bit a time in memory address to see it


to see it -> to see if it?

sbastrakov · 2019-12-06T08:16:46Z

README.md

+	are completed the data patterns are checked.  Because the data is checked
+	only after the memory moves are completed it is not possible to know
+	where the error occurred.  The addresses reported are only for where the
+	bad pattern was found.


This paragraph has double spaces between sentences.

sbastrakov · 2019-12-06T08:17:55Z

README.md

+
+Test 8 `[Modulo 20, random pattern]`  
+	A random pattern is generated. This pattern is used to set every 20th memory location
+	in memory. The rest of the memory location is set to the complimemnt of the pattern.


in memory is excessive, due to memory location.

The rest of the memory location is - shouldn't it be plural?

sbastrakov · 2019-12-06T08:19:32Z

README.md

+	The bit fade test initializes all of memory with a pattern and then
+	sleeps for 90 minutes. Then memory is examined to see if any memory bits
+	have changed. All ones and all zero patterns are used. This test takes
+	3 hours to complete. The Bit Fade test is disabled by default


Here is says 3 hours, but in the name 90 minutes.

sbastrakov · 2019-12-06T08:20:15Z

README.md

+Test 10 `[memory stress test]`  
+	Stress memory as much as we can. A random pattern is generated and a kernel of large grid size
+	and block size is launched to set all memory to the pattern. A new read and write kernel is launched
+	immediately after the previous write kernel to check if there is any errors in memory and set the


there is any errors is singular and plural mismatch.

sbastrakov · 2019-12-06T08:20:50Z

README.md

+	immediately after the previous write kernel to check if there is any errors in memory and set the
+	memory to the compliment. This process is repeated for 1000 times for one pattern. The kernel is 
+	written as to achieve the maximum bandwidth between the global memory and GPU.
+	This will increase the chance of catching software error. In practice, we found this test quite useful 


software error should either be plural or singular with a in front.

ax3l · 2019-12-06T19:19:15Z

I did not change the original lingo, but you are welcome to just add a follow-up PR ;)

ax3l added the enhancement label Dec 6, 2019

ax3l requested a review from psychocoderHPC December 6, 2019 01:05

ax3l commented Dec 6, 2019

View reviewed changes

Update Readme: CUDA, CMake

0f12452

Update the readme with CMake notes, CUDA-focus of the fork, newer instructions and authors.

ax3l force-pushed the doc-updateReadme branch 8 times, most recently from 3859027 to 64f3c0d Compare December 6, 2019 01:27

ax3l requested a review from sbastrakov December 6, 2019 01:29

README: Markdownify

88a975f

ax3l force-pushed the doc-updateReadme branch from 64f3c0d to 88a975f Compare December 6, 2019 01:30

ax3l mentioned this pull request Dec 6, 2019

Creating a release/tag #24

Open

sbastrakov reviewed Dec 6, 2019

View reviewed changes

sbastrakov suggested changes Dec 6, 2019

View reviewed changes

ax3l merged commit edb66a4 into ComputationalRadiationPhysics:dev Dec 6, 2019

ax3l deleted the doc-updateReadme branch December 6, 2019 19:21


		### Known Issues

		* If your machine is cuda 2.2, killing the program while it is running test 10 (the memory stress test) could result

Update Readme: CUDA, CMake #25

Update Readme: CUDA, CMake #25

Uh oh!

Conversation

ax3l commented Dec 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l commented Dec 6, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l Dec 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbastrakov left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l Dec 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l commented Dec 6, 2019

Uh oh!

Uh oh!

ax3l commented Dec 6, 2019 •

edited

Loading

ax3l Dec 6, 2019 •

edited

Loading

ax3l Dec 6, 2019 •

edited

Loading