Writing /fs1/eecg/moshovos/a/a3/moshovos/public_www/CUDA11/data/cache/7/7c550b6d6735ca17d562cd954514f0e4.i failed

Unable to save cache file. Hint: disk full; file permissions; safe_mode setting.

Writing /fs1/eecg/moshovos/a/a3/moshovos/public_www/CUDA11/data/cache/7/7c550b6d6735ca17d562cd954514f0e4.i failed

Unable to save cache file. Hint: disk full; file permissions; safe_mode setting.

Writing /fs1/eecg/moshovos/a/a3/moshovos/public_www/CUDA11/data/cache/7/7c550b6d6735ca17d562cd954514f0e4.xhtml failed

Tools
Books
CUDA Reference Material
Other

Tools

Cuda Occupancy Calculator External Link. Use this to see how many threads/warps will run on your GPU at any given point of time. You need to specify:

The Compute Capability: 1.3 for GTX280
Threads/block: You decide this
Registers per thread: Use the nvcc compiler option -ptxas-options=”-v“. It will report the number of registers per thread.
Shared memory per block: see the item above. Again you'll have to use nvcc's output.

Books

Programming Massively Parallel Processors: A Hands-on Approach, David Kirk and Wen-mei Hwu, External Link.
Cuda by Example, Jason Sanders and Edward Kandrot, External Link.
The OpenCL Programming Book, Ryoji Tsuchiyama, Takashi Nakamura, Takuro Iizuka, Akihiro Asahara, Satoshi Miki, External Link

CUDA Reference Material

We will be using CUDA v3.2. A newer *candidate* version exists today (v4.0 RC). The lab machines will not be upgraded for the duration of this course as we want to stick with a stable version.

You can download all reference material and tools from here
CUDA Programming Guide, External Link
CUDA Best Practices Guide, External Link
CUDA Toolkit Reference Guide, External Link
More reference guides from NVidia (CUDA and OpenCL). Look at the Linux section and at the CUDA Tookit row External Link

Other

What Every Computer Scientist Should Know About Floating-Point Arithmetic local copy, External Link