
Sat Sep 12 11:56:55 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd -JN -N 123 -N 1234 --range 10:90:10 --range 100:900:100 --range 1000:10000:1000 --lapack
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 11:57:01 2015
% Usage: ../testing/testing_ssyevd [options] [-h|--help]

% jobz = No vectors, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

 1234      0.08             0.09
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   10      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   20      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   30      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   40      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   50      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   60      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   70      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   80      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   90      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  100      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  200      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  300      0.01             0.01
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  400      0.01             0.01
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  500      0.02             0.02
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  600      0.02             0.02
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  700      0.03             0.03
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  800      0.03             0.03
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  900      0.04             0.04
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

 1000      0.05             0.05
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

 2000      0.22             0.22
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

 3000      0.54             0.88
    | S_magma - S_lapack | / |S| = 7.59e-10   ok

 4000      1.07             1.53
    | S_magma - S_lapack | / |S| = 7.87e-11   ok

 5000      2.00             2.50
    | S_magma - S_lapack | / |S| = 6.41e-11   ok

 6000      3.70             3.72
    | S_magma - S_lapack | / |S| = 5.00e-11   ok

 7000      5.93             5.30
    | S_magma - S_lapack | / |S| = 4.62e-11   ok

 8000      9.21             7.18
    | S_magma - S_lapack | / |S| = 4.14e-11   ok

 9000     13.46             9.56
    | S_magma - S_lapack | / |S| = 4.70e-10   ok

10000     19.20            12.14
    | S_magma - S_lapack | / |S| = 3.07e-11   ok

Sat Sep 12 11:58:49 EDT 2015

Sat Sep 12 11:58:49 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd -JV -N 123 -N 1234 --range 10:90:10 --range 100:900:100 --range 1000:10000:1000 --lapack
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 11:58:55 2015
% Usage: ../testing/testing_ssyevd [options] [-h|--help]

% jobz = Vectors needed, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

 1234      0.13             0.10
    | S_magma - S_lapack | / |S| = 1.60e-10   ok

   10      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   20      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   30      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   40      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   50      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   60      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   70      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   80      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

   90      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  100      0.00             0.00
    | S_magma - S_lapack | / |S| = 0.00e+00   ok

  200      0.00             0.01
    | S_magma - S_lapack | / |S| = 1.14e-09   ok

  300      0.01             0.01
    | S_magma - S_lapack | / |S| = 1.36e-09   ok

  400      0.01             0.02
    | S_magma - S_lapack | / |S| = 1.14e-09   ok

  500      0.02             0.02
    | S_magma - S_lapack | / |S| = 1.59e-09   ok

  600      0.03             0.03
    | S_magma - S_lapack | / |S| = 2.03e-09   ok

  700      0.04             0.04
    | S_magma - S_lapack | / |S| = 7.48e-10   ok

  800      0.05             0.04
    | S_magma - S_lapack | / |S| = 1.90e-10   ok

  900      0.06             0.06
    | S_magma - S_lapack | / |S| = 3.77e-10   ok

 1000      0.07             0.06
    | S_magma - S_lapack | / |S| = 5.50e-10   ok

 2000      0.31             0.21
    | S_magma - S_lapack | / |S| = 6.10e-11   ok

 3000      0.76             0.89
    | S_magma - S_lapack | / |S| = 1.90e-10   ok

 4000      1.79             1.61
    | S_magma - S_lapack | / |S| = 6.10e-11   ok

 5000      3.20             2.53
    | S_magma - S_lapack | / |S| = 1.56e-10   ok

 6000      4.84             3.83
    | S_magma - S_lapack | / |S| = 2.95e-11   ok

 7000      7.49             5.47
    | S_magma - S_lapack | / |S| = 1.10e-10   ok

 8000     11.24             7.33
    | S_magma - S_lapack | / |S| = 3.81e-11   ok

 9000     15.72             9.63
    | S_magma - S_lapack | / |S| = 3.54e-11   ok

10000     21.62            12.44
    | S_magma - S_lapack | / |S| = 4.18e-11   ok

Sat Sep 12 12:00:58 EDT 2015

Sat Sep 12 12:00:58 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd_gpu -JN -N 123 -N 1234 --range 10:90:10 --range 100:900:100 --range 1000:10000:1000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:01:04 2015
% Usage: ../testing/testing_ssyevd_gpu [options] [-h|--help]

% jobz = No vectors, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123       ---              0.00
 1234       ---              0.09
   10       ---              0.00
   20       ---              0.00
   30       ---              0.00
   40       ---              0.00
   50       ---              0.00
   60       ---              0.00
   70       ---              0.00
   80       ---              0.00
   90       ---              0.00
  100       ---              0.00
  200       ---              0.00
  300       ---              0.01
  400       ---              0.01
  500       ---              0.02
  600       ---              0.02
  700       ---              0.03
  800       ---              0.04
  900       ---              0.05
 1000       ---              0.06
 2000       ---              0.24
 3000       ---              0.88
 4000       ---              1.52
 5000       ---              2.49
 6000       ---              3.72
 7000       ---              5.28
 8000       ---              7.17
 9000       ---              9.55
10000       ---             12.16
Sat Sep 12 12:01:57 EDT 2015

Sat Sep 12 12:01:57 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd_gpu -JV -N 123 -N 1234 --range 10:90:10 --range 100:900:100 --range 1000:10000:1000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:02:04 2015
% Usage: ../testing/testing_ssyevd_gpu [options] [-h|--help]

% jobz = Vectors needed, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123       ---              0.00
 1234       ---              0.11
   10       ---              0.00
   20       ---              0.00
   30       ---              0.00
   40       ---              0.00
   50       ---              0.00
   60       ---              0.00
   70       ---              0.00
   80       ---              0.00
   90       ---              0.00
  100       ---              0.00
  200       ---              0.01
  300       ---              0.01
  400       ---              0.02
  500       ---              0.03
  600       ---              0.03
  700       ---              0.04
  800       ---              0.05
  900       ---              0.07
 1000       ---              0.08
 2000       ---              0.28
 3000       ---              0.94
 4000       ---              1.60
 5000       ---              2.57
 6000       ---              3.86
 7000       ---              5.55
 8000       ---              7.61
 9000       ---             10.13
10000       ---             13.08
Sat Sep 12 12:03:02 EDT 2015

Sat Sep 12 12:45:59 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd -JN -N 123 -N 1234 --range 12000:20000:2000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:46:06 2015
% Usage: ../testing/testing_ssyevd [options] [-h|--help]

% jobz = No vectors, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123     ---               0.00
 1234     ---               0.10
12000     ---              19.38
14000     ---              28.24
16000     ---              39.84
18000     ---              54.66
20000     ---              71.74
Sat Sep 12 12:50:09 EDT 2015

Sat Sep 12 12:50:09 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd -JV -N 123 -N 1234 --range 12000:20000:2000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:50:15 2015
% Usage: ../testing/testing_ssyevd [options] [-h|--help]

% jobz = Vectors needed, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123     ---               0.00
 1234     ---               0.10
12000     ---              20.27
14000     ---              29.65
16000     ---              41.64
18000     ---              57.79
20000     ---              75.25
Sat Sep 12 12:54:36 EDT 2015

Sat Sep 12 12:54:36 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd_gpu -JN -N 123 -N 1234 --range 12000:20000:2000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:54:42 2015
% Usage: ../testing/testing_ssyevd_gpu [options] [-h|--help]

% jobz = No vectors, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123       ---              0.00
 1234       ---              0.10
12000       ---             19.11
14000       ---             28.13
16000       ---             39.57
18000       ---             54.60
20000       ---             70.85
Sat Sep 12 12:58:46 EDT 2015

Sat Sep 12 12:58:46 EDT 2015
numactl --interleave=all ../testing/testing_ssyevd_gpu -JV -N 123 -N 1234 --range 12000:20000:2000
% MAGMA 1.7.0  compiled for CUDA capability >= 3.5, 32-bit magma_int_t, 64-bit pointer.
% CUDA runtime 7000, driver 7000. OpenMP threads 16. MKL 11.2.2, MKL threads 16. 
% device 0: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 1: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% device 2: Tesla K40c, 745.0 MHz clock, 11519.6 MB memory, capability 3.5
% Sat Sep 12 12:58:52 2015
% Usage: ../testing/testing_ssyevd_gpu [options] [-h|--help]

% jobz = Vectors needed, uplo = Lower
%   N   CPU Time (sec)   GPU Time (sec)
%======================================
  123       ---              0.00
 1234       ---              0.11
12000       ---             21.31
14000       ---             31.31
16000       ---             44.91
18000       ---             62.66
20000       ---             82.70
Sat Sep 12 13:03:33 EDT 2015
