You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The following are the mean value found by running the specified implementation multiple times on the default matrix of the chosen size generated by application.
Times are expressed in milliseconds.
The speedup refers to the corrisponding serial implementation.
Matrix size: 1023 X 1024, for a total of 1047552 elements
(serial) Intel i5-7600 CPU @ 3.50GHz
Gaussian elimination kernel
Solution vector kernel
TOT
Speedup
No pivot
/
/
940.37
/
Partial pivot
/
/
1007.63
/
(intel) Intel i5-7600 CPU @ 3.50GHz
Gaussian elimination kernel
Solution vector kernel
TOT
Speedup
No pivot texture
1021.48
13.63
1035.11
x0.9
No pivot texture vec 4
268.06
15.97
284.03
x3.3
No pivot buffer
305.05
12.39
317.44
x3.0
No pivot buffer vec 4
260.94
12.37
273.31
x3.4
Partial pivot texture
5387.87
14.58
5402.45
x0.2
Partial pivot texture vec 4
1349.53
15.51
1365.04
x0.7
Partial pivot buffer
4264.74
12.56
4277.30
x0.2
Partial pivot buffer vec 4
1194.86
12.45
1207.31
x0.8
(pocl) Intel i5-7600 CPU @ 3.50GHz
Gaussian elimination kernel
Solution vector kernel
TOT
Speedup
No pivot texture vec 4
1363.69
23.95
1387.64
x0.7
No pivot buffer
296.43
19.55
315.98
x2.9
No pivot buffer vec 4
266.76
19.63
286.39
x3.3
Partial pivot texture vec 4
3002.90
23.81
3026.71
x0.3
Partial pivot buffer
3546.58
18.58
3565.16
x0.3
Partial pivot buffer vec 4
1057.72
20.67
1078.39
x0.9
Nvidia GeForce GTX 1060 3GB
Gaussian elimination kernel
Solution vector kernel
TOT
Speedup
No pivot texture
45.83
4.47
50.30
x18.7
No pivot texture vec 4
26.44
4.79
31.24
x30.1
No pivot buffer
57.56
5.26
62.82
x15.0
No pivot buffer vec 4
39.64
5.16
44.80
x21.0
Partial pivot texture
185.57
3.93
189.51
x5.3
Partial pivot texture vec 4
39.60
3.94
43.54
x23.1
Partial pivot buffer
399.86
4.44
404.30
x2.5
Partial pivot buffer vec 4
57.83
4.45
62.27
x16.2
Matrix size: 2047 X 2048, for a total of 4192256 elements