syntax.us Let the syntax do the talking
Blog Contact Posts Questions Tags Hire Me

Question:
How to install Nvidia Cuda 7.5 on Ubuntu 14.04?

I bought a laptop recently with an Nvidia GPU chip in it.

I knew that a Nvidia GPU could speed up machine learning calculations via the Cuda API:

https://developer.nvidia.com/cuda-gpus

I detected the chip with this shell command:
lspci | grep -i nvidia
I saw this:
01:00.0 3D controller: NVIDIA Corporation GM204M [GeForce GTX 980M] (rev a1)
I was curious, 'How to install Cuda on this laptop?'

I wanted to run Ubuntu but I assumed that Ubuntu 14.04 Desktop might interfere with Cuda.

So, I started by installation of Ubuntu 14.04 Server instead of Ubuntu-Desktop:

http://releases.ubuntu.com/14.04/ubuntu-14.04.4-server-amd64.iso

After installation I enhanced it:
sudo echo 'blacklist nouveau'         >  /etc/modprobe.d/blacklist-nouveau.conf
sudo echo 'options nouveau modeset=0' >> /etc/modprobe.d/blacklist-nouveau.conf
sudo apt-get install xterm gitk build-essential openssh-server emacs libssl-dev
Then I downloaded a package:

cuda-repo-ubuntu1404-7-5-local_7.5-18_amd64.deb

Which I found at the URL listed below:

https://developer.nvidia.com/cuda-downloads

I installed it with this syntax:
dpkg -i cuda-repo-ubuntu1404-7-5-local_7.5-18_amd64.deb
Next, I installed cuda:
sudo apt-get update
sudo apt-get install cuda
That command installed software in this location:
/usr/local/cuda-7.5/
Also it installed a convenient soft-link:
/usr/local/cuda
It looked like this:

  /usr/local/cuda-7.5:
  drwxr-xr-x 13 root root 4096 Jun  8 00:10 .
  drwxr-xr-x 11 root root 4096 Jun  8 00:17 ..
  drwxr-xr-x  3 root root 4096 Jun  8 00:10 bin
  drwxr-xr-x  5 root root 4096 Jun  8 00:10 doc
  drwxr-xr-x  4 root root 4096 Jun  8 00:09 extras
  lrwxrwxrwx  1 root root   28 Aug 15  2015 include -> targets/x86_64-linux/include
  lrwxrwxrwx  1 root root   24 Aug 15  2015 lib64 -> targets/x86_64-linux/lib
  drwxr-xr-x  8 root root 4096 Jun  8 00:10 libnsight
  drwxr-xr-x  7 root root 4096 Jun  8 00:10 libnvvp
  -rw-r--r--  1 root root  365 Aug 15  2015 LICENSE
  drwxr-xr-x  7 root root 4096 Jun  8 00:09 nvvm
  -rw-r--r--  1 root root  365 Aug 15  2015 README
  drwxr-xr-x 11 root root 4096 Jun  8 00:16 samples
  drwxr-xr-x  3 root root 4096 Jun  8 00:09 share
  drwxr-xr-x  2 root root 4096 Jun  8 00:09 src
  drwxr-xr-x  3 root root 4096 Jun  8 00:09 targets
  drwxr-xr-x  2 root root 4096 Jun  8 00:09 tools
  -rw-r--r--  1 root root   20 Aug 15  2015 version.txt
I found useful html docs here:

  /usr/local/cuda-7.5/doc/html:
  drwxr-xr-x 47 root root   4096 Jun  8 00:10 .
  drwxr-xr-x  5 root root   4096 Jun  8 00:10 ..
  drwxr-xr-x  5 root root   4096 Jun  8 00:10 common
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cublas
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-binary-utilities
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-c-best-practices-guide
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-compiler-driver-nvcc
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-c-programming-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-driver-api
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-gdb
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-installation-guide-linux
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-installation-guide-mac-os-x
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-installation-guide-microsoft-windows
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-math-api
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-memcheck
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-quick-start-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-runtime-api
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 cuda-samples
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cuda-toolkit-release-notes
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cufft
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cupti
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 curand
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cusolver
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 cusparse
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 debugger-api
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 eula
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 floating-point
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 gpudirect-rdma
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 incomplete-lu-cholesky
  -rw-r--r--  1 root root  32526 Aug 15  2015 index.html
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 inline-ptx-assembly
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 kepler-tuning-guide
  drwxr-xr-x  2 root root  20480 Jun  8 00:10 libdevice-users-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 libnvvm-api
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 maxwell-compatibility-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 maxwell-tuning-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 npp
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 nsight-eclipse-edition-getting-started-guide
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 nvblas
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 nvrtc
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 nvvm-ir-spec
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 optimus-developer-guide
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 parallel-thread-execution
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 profiler-users-guide
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 ptx-writers-guide-to-interoperability
  drwxr-xr-x  3 root root   4096 Jun  8 00:10 search
  -rw-r--r--  1 root root 138760 Aug 15  2015 sitemap.xml
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 thrust
  drwxr-xr-x  2 root root   4096 Jun  8 00:10 video-decoder
Then, I added /usr/local/cuda/bin to my PATH in ~/.bashrc
export PATH=/usr/local/cuda/bin:$PATH
I did:
. ~/.bashrc
Next, I copied the samples to my home folder:
cd /usr/local/cuda/
rsync -a samples ~/
I found a Makefile there so I ran make:
cd ~/samples/
make
I saw this:

dan@srvr1404:~ $ cd samples/
dan@srvr1404:~/samples $ 
dan@srvr1404:~/samples $ 
dan@srvr1404:~/samples $ ll
total 148
drwxr-xr-x 12 dan dan  4096 Jul 11 16:07 ./
drwxr-xr-x 10 dan dan  4096 Jul 11 16:47 ../
drwxr-xr-x 47 dan dan  4096 Jun  8 00:10 0_Simple/
drwxr-xr-x  6 dan dan  4096 Jun  8 00:10 1_Utilities/
drwxr-xr-x 12 dan dan  4096 Jun  8 00:10 2_Graphics/
drwxr-xr-x 20 dan dan  4096 Jun  8 00:10 3_Imaging/
drwxr-xr-x 10 dan dan  4096 Jun  8 00:09 4_Finance/
drwxr-xr-x  9 dan dan  4096 Jun  8 00:10 5_Simulations/
drwxr-xr-x 30 dan dan  4096 Jun  8 00:10 6_Advanced/
drwxr-xr-x 28 dan dan  4096 Jun  8 00:10 7_CUDALibraries/
drwxrwxr-x  3 dan dan  4096 Jul 11 16:07 bin/
drwxr-xr-x  6 dan dan  4096 Jun  8 00:10 common/
-rw-r--r--  1 dan dan 96407 Aug 14  2015 EULA.txt
-rw-r--r--  1 dan dan  2652 Jun  8 00:16 Makefile
dan@srvr1404:~/samples $ 
dan@srvr1404:~/samples $ 

dan@srvr1404:~/samples $ make
make[1]: Entering directory `/home/dan/samples/0_Simple/simpleVoteIntrinsics_nvrtc'
g++ -I../../common/inc -I/usr/local/cuda-7.5/include   -o simpleVoteIntrinsics.o -c simpleVoteIntrinsics.cpp
g++  -L/usr/local/cuda-7.5/lib64 -L/usr/local/cuda-7.5/lib64/stubs -o simpleVoteIntrinsics_nvrtc simpleVoteIntrinsics.o  -lcuda -lnvrtc
mkdir -p ../../bin/x86_64/linux/release

snip...

mkdir -p ../../bin/x86_64/linux/release
cp cuSolverSp_LinearSolver ../../bin/x86_64/linux/release
make[1]: Leaving directory `/home/dan/samples/7_CUDALibraries/cuSolverSp_LinearSolver'
Finished building CUDA samples
dan@srvr1404:~/samples $ 
dan@srvr1404:~/samples $ 
dan@srvr1404:~/samples $ 
The executables landed here:
~/samples/bin/x86_64/linux/release/
They looked like this:

  /home/dan/samples/bin/x86_64/linux/release:
  drwxrwxr-x 2 dan dan      4096 Jul 11 16:32 .
  drwxrwxr-x 3 dan dan      4096 Jul 11 16:07 ..
  -rwxrwxr-x 1 dan dan    670048 Jul 11 16:22 alignedTypes
  -rwxrwxr-x 1 dan dan    558005 Jul 11 16:12 asyncAPI
  -rwxrwxr-x 1 dan dan    563406 Jul 11 16:13 bandwidthTest
  -rwxrwxr-x 1 dan dan    563198 Jul 11 16:30 batchCUBLAS
  -rwxrwxr-x 1 dan dan    861114 Jul 11 16:16 bicubicTexture
  -rwxrwxr-x 1 dan dan    814694 Jul 11 16:17 bilateralFilter
  -rwxrwxr-x 1 dan dan    822647 Jul 11 16:14 bindlessTexture
  -rwxrwxr-x 1 dan dan    747576 Jul 11 16:18 binomialOptions
  -rwxrwxr-x 1 dan dan     54381 Jul 11 16:18 binomialOptions_nvrtc
  -rwxrwxr-x 1 dan dan    591628 Jul 11 16:18 BlackScholes
  -rwxrwxr-x 1 dan dan     45834 Jul 11 16:18 BlackScholes_nvrtc
  -rwxrwxr-x 1 dan dan    987729 Jul 11 16:16 boxFilter
  -rwxrwxr-x 1 dan dan   6323670 Jul 11 16:31 boxFilterNPP
  -rwxrwxr-x 1 dan dan   3778517 Jul 11 16:27 cdpAdvancedQuicksort
  -rwxrwxr-x 1 dan dan   3667738 Jul 11 16:20 cdpBezierTessellation
  -rwxrwxr-x 1 dan dan  41500775 Jul 11 16:28 cdpLUDecomposition
  -rwxrwxr-x 1 dan dan   4142333 Jul 11 16:21 cdpQuadtree
  -rwxrwxr-x 1 dan dan   3626502 Jul 11 16:08 cdpSimplePrint
  -rwxrwxr-x 1 dan dan   3639147 Jul 11 16:10 cdpSimpleQuicksort
  -rwxrwxr-x 1 dan dan    556180 Jul 11 16:12 clock
  -rwxrwxr-x 1 dan dan     39209 Jul 11 16:09 clock_nvrtc
  -rwxrwxr-x 1 dan dan    564535 Jul 11 16:22 concurrentKernels
  -rwxrwxr-x 1 dan dan   3557384 Jul 11 16:31 conjugateGradient
  -rwxrwxr-x 1 dan dan    548143 Jul 11 16:31 conjugateGradientPrecond
  -rwxrwxr-x 1 dan dan    543696 Jul 11 16:31 conjugateGradientUM
  -rwxrwxr-x 1 dan dan    669104 Jul 11 16:17 convolutionFFT2D
  -rwxrwxr-x 1 dan dan    620539 Jul 11 16:17 convolutionSeparable
  -rwxrwxr-x 1 dan dan    596883 Jul 11 16:15 convolutionTexture
  -rwxrwxr-x 1 dan dan    561199 Jul 11 16:12 cppIntegration
  -rwxrwxr-x 1 dan dan    577661 Jul 11 16:11 cppOverload
  -rwxrwxr-x 1 dan dan    305415 Jul 11 16:15 cudaDecodeGL
  -rwxrwxr-x 1 dan dan    551716 Jul 11 16:12 cudaOpenMP
  -rwxrwxr-x 1 dan dan    529718 Jul 11 16:31 cuHook
  -rwxrwxr-x 1 dan dan    592822 Jul 11 16:30 cuSolverDn_LinearSolver
  -rwxrwxr-x 1 dan dan    601794 Jul 11 16:30 cuSolverRf
  -rwxrwxr-x 1 dan dan    592752 Jul 11 16:32 cuSolverSp_LinearSolver
  -rwxrwxr-x 1 dan dan    761229 Jul 11 16:16 dct8x8
  -rwxrwxr-x 1 dan dan    542694 Jul 11 16:13 deviceQuery
  -rwxrwxr-x 1 dan dan    534759 Jul 11 16:12 deviceQueryDrv
  -rwxrwxr-x 1 dan dan    601192 Jul 11 16:16 dwtHaar1D
  -rwxrwxr-x 1 dan dan    688350 Jul 11 16:16 dxtc
  -rwxrwxr-x 1 dan dan    865743 Jul 11 16:27 eigenvalues
  -rwxrwxr-x 1 dan dan    596040 Jul 11 16:22 fastWalshTransform
  -rwxrwxr-x 1 dan dan    632212 Jul 11 16:30 FDTD3d
  -rwxrwxr-x 1 dan dan    876786 Jul 11 16:19 fluidsGL
  -rwxrwxr-x 1 dan dan   3079765 Jul 11 16:30 freeImageInteropNPP
  -rwxrwxr-x 1 dan dan    860151 Jul 11 16:20 FunctionPointers
  -rwxrwxr-x 1 dan dan   3099037 Jul 11 16:31 histEqualizationNPP
  -rwxrwxr-x 1 dan dan    654653 Jul 11 16:15 histogram
  -rwxrwxr-x 1 dan dan    713063 Jul 11 16:15 HSOpticalFlow
  -rwxrwxr-x 1 dan dan    933298 Jul 11 16:17 imageDenoising
  -rwxrwxr-x 1 dan dan    552302 Jul 11 16:08 inlinePTX
  -rwxrwxr-x 1 dan dan     35089 Jul 11 16:10 inlinePTX_nvrtc
  -rwxrwxr-x 1 dan dan    795847 Jul 11 16:28 interval
  -rwxrwxr-x 1 dan dan    578257 Jul 11 16:31 jpegNPP
  -rwxrwxr-x 1 dan dan    542180 Jul 11 16:31 libcuhook.so.1
  -rwxrwxr-x 1 dan dan   2415664 Jul 11 16:29 lineOfSight
  -rwxrwxr-x 1 dan dan   1071716 Jul 11 16:14 Mandelbrot
  -rwxrwxr-x 1 dan dan   2150888 Jul 11 16:14 marchingCubes
  -rwxrwxr-x 1 dan dan    576877 Jul 11 16:08 matrixMul
  -rwxrwxr-x 1 dan dan    543225 Jul 11 16:11 matrixMulCUBLAS
  -rwxrwxr-x 1 dan dan    556602 Jul 11 16:11 matrixMulDrv
  -rwxrwxr-x 1 dan dan    605543 Jul 11 16:29 matrixMulDynlinkJIT
  -rw-rw-r-- 1 dan dan     40837 Jul 11 16:11 matrixMul_kernel64.ptx
  -rwxrwxr-x 1 dan dan     39315 Jul 11 16:11 matrixMul_nvrtc
  -rwxrwxr-x 1 dan dan   1348754 Jul 11 16:31 MC_EstimatePiInlineP
  -rwxrwxr-x 1 dan dan   1329245 Jul 11 16:30 MC_EstimatePiInlineQ
  -rwxrwxr-x 1 dan dan    610571 Jul 11 16:32 MC_EstimatePiP
  -rwxrwxr-x 1 dan dan    610720 Jul 11 16:30 MC_EstimatePiQ
  -rwxrwxr-x 1 dan dan   1464962 Jul 11 16:31 MC_SingleAsianOptionP
  -rwxrwxr-x 1 dan dan    838786 Jul 11 16:21 mergeSort
  -rwxrwxr-x 1 dan dan  54089864 Jul 11 16:31 MersenneTwisterGP11213
  -rwxrwxr-x 1 dan dan   1708130 Jul 11 16:18 MonteCarloMultiGPU
  -rwxrwxr-x 1 dan dan   1431920 Jul 11 16:19 nbody
  -rwxrwxr-x 1 dan dan    803885 Jul 11 16:28 newdelete
  -rw-rw-r-- 1 dan dan     11785 Jul 11 16:15 NV12ToARGB_drvapi64.ptx
  -rwxrwxr-x 1 dan dan    854396 Jul 11 16:20 oceanFFT
  -rwxrwxr-x 1 dan dan    604657 Jul 11 16:13 p2pBandwidthLatencyTest
  -rwxrwxr-x 1 dan dan   2186769 Jul 11 16:19 particles
  -rwxrwxr-x 1 dan dan    834180 Jul 11 16:15 postProcessGL
  -rwxrwxr-x 1 dan dan    544739 Jul 11 16:27 ptxjit
  -rwxrwxr-x 1 dan dan    612392 Jul 11 16:18 quasirandomGenerator
  -rwxrwxr-x 1 dan dan     50114 Jul 11 16:18 quasirandomGenerator_nvrtc
  -rwxrwxr-x 1 dan dan   4081846 Jul 11 16:26 radixSortThrust
  -rwxrwxr-x 1 dan dan    787155 Jul 11 16:31 randomFog
  -rwxrwxr-x 1 dan dan    857886 Jul 11 16:15 recursiveGaussian
  -rwxrwxr-x 1 dan dan   2067705 Jul 11 16:22 reduction
  -rwxrwxr-x 1 dan dan    570803 Jul 11 16:29 scalarProd
  -rwxrwxr-x 1 dan dan    592225 Jul 11 16:21 scan
  -rwxrwxr-x 1 dan dan  12067583 Jul 11 16:25 segmentationTreeThrust
  -rwxrwxr-x 1 dan dan    622797 Jul 11 16:21 shfl_scan
  -rwxrwxr-x 1 dan dan    556327 Jul 11 16:11 simpleAssert
  -rwxrwxr-x 1 dan dan     35050 Jul 11 16:09 simpleAssert_nvrtc
  -rwxrwxr-x 1 dan dan    566413 Jul 11 16:12 simpleAtomicIntrinsics
  -rwxrwxr-x 1 dan dan     41245 Jul 11 16:12 simpleAtomicIntrinsics_nvrtc
  -rwxrwxr-x 1 dan dan    552153 Jul 11 16:10 simpleCallback
  -rwxrwxr-x 1 dan dan    588854 Jul 11 16:10 simpleCubemapTexture
  -rwxrwxr-x 1 dan dan    538603 Jul 11 16:31 simpleCUBLAS
  -rwxrwxr-x 1 dan dan    799902 Jul 11 16:16 simpleCUDA2GL
  -rwxrwxr-x 1 dan dan    561260 Jul 11 16:31 simpleCUFFT
  -rwxrwxr-x 1 dan dan    572727 Jul 11 16:31 simpleCUFFT_2d_MGPU
  -rwxrwxr-x 1 dan dan 187064153 Jul 11 16:30 simpleCUFFT_callback
  -rwxrwxr-x 1 dan dan    556872 Jul 11 16:31 simpleCUFFT_MGPU
  -rwxrwxr-x 1 dan dan  41352849 Jul 11 16:32 simpleDevLibCUBLAS
  -rwxrwxr-x 1 dan dan    803851 Jul 11 16:13 simpleGL
  -rwxrwxr-x 1 dan dan    572948 Jul 11 16:29 simpleHyperQ
  -rwxrwxr-x 1 dan dan    555960 Jul 11 16:11 simpleIPC
  -rwxrwxr-x 1 dan dan    584771 Jul 11 16:09 simpleLayeredTexture
  -rwxrwxr-x 1 dan dan    561251 Jul 11 16:08 simpleMultiCopy
  -rwxrwxr-x 1 dan dan    551730 Jul 11 16:10 simpleMultiGPU
  -rwxrwxr-x 1 dan dan    552759 Jul 11 16:10 simpleOccupancy
  -rwxrwxr-x 1 dan dan    555784 Jul 11 16:09 simpleP2P
  -rwxrwxr-x 1 dan dan    590270 Jul 11 16:08 simplePitchLinearTexture
  -rwxrwxr-x 1 dan dan    556164 Jul 11 16:08 simplePrintf
  -rwxrwxr-x 1 dan dan    586822 Jul 11 16:11 simpleSeparateCompilation
  -rwxrwxr-x 1 dan dan    565111 Jul 11 16:12 simpleStreams
  -rwxrwxr-x 1 dan dan    641367 Jul 11 16:09 simpleSurfaceWrite
  -rwxrwxr-x 1 dan dan    575293 Jul 11 16:10 simpleTemplates
  -rwxrwxr-x 1 dan dan     45521 Jul 11 16:10 simpleTemplates_nvrtc
  -rwxrwxr-x 1 dan dan    628302 Jul 11 16:09 simpleTexture
  -rwxrwxr-x 1 dan dan    786456 Jul 11 16:13 simpleTexture3D
  -rwxrwxr-x 1 dan dan    563138 Jul 11 16:09 simpleTextureDrv
  -rw-rw-r-- 1 dan dan     18876 Jul 11 16:09 simpleTexture_kernel64.ptx
  -rwxrwxr-x 1 dan dan    573426 Jul 11 16:09 simpleVoteIntrinsics
  -rwxrwxr-x 1 dan dan     39640 Jul 11 16:07 simpleVoteIntrinsics_nvrtc
  -rwxrwxr-x 1 dan dan    556317 Jul 11 16:12 simpleZeroCopy
  -rwxrwxr-x 1 dan dan   2905534 Jul 11 16:20 smokeParticles
  -rwxrwxr-x 1 dan dan    822069 Jul 11 16:17 SobelFilter
  -rwxrwxr-x 1 dan dan   1400534 Jul 11 16:18 SobolQRNG
  -rwxrwxr-x 1 dan dan    679133 Jul 11 16:29 sortingNetworks
  -rwxrwxr-x 1 dan dan    602848 Jul 11 16:17 stereoDisparity
  -rwxrwxr-x 1 dan dan    556089 Jul 11 16:28 StreamPriorities
  -rwxrwxr-x 1 dan dan    567669 Jul 11 16:10 template
  -rwxrwxr-x 1 dan dan    547523 Jul 11 16:09 template_runtime
  -rwxrwxr-x 1 dan dan   1219146 Jul 11 16:21 threadFenceReduction
  -rwxrwxr-x 1 dan dan    550354 Jul 11 16:29 threadMigration
  -rw-rw-r-- 1 dan dan       581 Jul 11 16:29 threadMigration_kernel64.ptx
  -rwxrwxr-x 1 dan dan    632213 Jul 11 16:25 transpose
  -rwxrwxr-x 1 dan dan    557223 Jul 11 16:11 UnifiedMemoryStreams
  -rwxrwxr-x 1 dan dan    546985 Jul 11 16:11 vectorAdd
  -rwxrwxr-x 1 dan dan    546060 Jul 11 16:09 vectorAddDrv
  -rw-rw-r-- 1 dan dan      1162 Jul 11 16:09 vectorAdd_kernel64.ptx
  -rwxrwxr-x 1 dan dan     39255 Jul 11 16:12 vectorAdd_nvrtc
  -rwxrwxr-x 1 dan dan   1009035 Jul 11 16:13 volumeFiltering
  -rwxrwxr-x 1 dan dan    822837 Jul 11 16:14 volumeRender
The oldest was this:
-rwxrwxr-x 1 dan dan     39640 Jul 11 16:07 simpleVoteIntrinsics_nvrtc
The youngest was this:
-rwxrwxr-x 1 dan dan    592752 Jul 11 16:32 cuSolverSp_LinearSolver
So it took about 25 minutes to run make.

I read this page:

http://docs.nvidia.com/cuda/cuda-installation-guide-linux/#running-binaries

I tried this command line:

dan@srvr1404:~/samples/bin/x86_64/linux/release $ 
dan@srvr1404:~/samples/bin/x86_64/linux/release $ ll deviceQuery
-rwxrwxr-x 1 dan dan 542694 Jul 11 16:13 deviceQuery*
dan@srvr1404:~/samples/bin/x86_64/linux/release $ 
dan@srvr1404:~/samples/bin/x86_64/linux/release $ ./deviceQuery
./deviceQuery Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 980M"
  CUDA Driver Version / Runtime Version          7.5 / 7.5
  CUDA Capability Major/Minor version number:    5.2
  Total amount of global memory:                 4096 MBytes (4294770688 bytes)
  (12) Multiprocessors, (128) CUDA Cores/MP:     1536 CUDA Cores
  GPU Max Clock rate:                            1126 MHz (1.13 GHz)
  Memory Clock rate:                             2505 Mhz
  Memory Bus Width:                              256-bit
  L2 Cache Size:                                 2097152 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers
  Total amount of constant memory:               65536 bytes
  Total amount of shared memory per block:       49152 bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          2147483647 bytes
  Texture alignment:                             512 bytes
  Concurrent copy and kernel execution:          Yes with 2 copy engine(s)
  Run time limit on kernels:                     No
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  Device supports Unified Addressing (UVA):      Yes
  Device PCI Domain ID / Bus ID / location ID:   0 / 1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 7.5, CUDA Runtime Version = 7.5, NumDevs = 1, Device0 = GeForce GTX 980M
Result = PASS
dan@srvr1404:~/samples/bin/x86_64/linux/release $ 
dan@srvr1404:~/samples/bin/x86_64/linux/release $ 
I liked the look of that.

Next, I worked on the task of connecting my Cuda software to Machine Learning software.

If you have questions, e-me: bikle101@gmail.com


syntax.us Let the syntax do the talking
Blog Contact Posts Questions Tags Hire Me