Skip to content

Commit d556ce7

Browse files
committed
Merge branch 'master' of https://github.com/bertaye/BasicSIMD
2 parents 293c21d + ee433b1 commit d556ce7

File tree

3 files changed

+24
-24
lines changed

3 files changed

+24
-24
lines changed
26.3 KB
Loading
6.83 KB
Loading

benchmark_results_linux_gcc/summary_report.txt

Lines changed: 24 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -4,143 +4,143 @@
44

55
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
66
|---------|---------------|----------------|------------|
7-
| 100000 | 0.069 | 0.460 | 6.67x |
7+
| 100000 | 0.079 | 0.520 | 6.58x |
88

99
#### float256 Subtraction
1010

1111
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
1212
|---------|---------------|----------------|------------|
13-
| 100000 | 0.069 | 0.471 | 6.83x |
13+
| 100000 | 0.079 | 0.511 | 6.47x |
1414

1515
#### float256 Multiplication
1616

1717
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
1818
|---------|---------------|----------------|------------|
19-
| 100000 | 0.070 | 0.572 | 8.17x |
19+
| 100000 | 0.079 | 0.577 | 7.30x |
2020

2121
#### float256 Division
2222

2323
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
2424
|---------|---------------|----------------|------------|
25-
| 100000 | 0.096 | 0.664 | 6.92x |
25+
| 100000 | 0.107 | 0.769 | 7.19x |
2626

2727
#### double256 Addition
2828

2929
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
3030
|---------|---------------|----------------|------------|
31-
| 100000 | 0.071 | 0.240 | 3.38x |
31+
| 100000 | 0.079 | 0.261 | 3.30x |
3232

3333
#### double256 Subtraction
3434

3535
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
3636
|---------|---------------|----------------|------------|
37-
| 100000 | 0.074 | 0.245 | 3.31x |
37+
| 100000 | 0.079 | 0.261 | 3.30x |
3838

3939
#### double256 Multiplication
4040

4141
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
4242
|---------|---------------|----------------|------------|
43-
| 100000 | 0.075 | 0.330 | 4.40x |
43+
| 100000 | 0.081 | 0.388 | 4.79x |
4444

4545
#### double256 Division
4646

4747
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
4848
|---------|---------------|----------------|------------|
49-
| 100000 | 0.118 | 0.467 | 3.96x |
49+
| 100000 | 0.141 | 0.565 | 4.01x |
5050

5151
#### int128_with_int32 t_Addition
5252

5353
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
5454
|---------|---------------|----------------|------------|
55-
| 1000000 | 2.480 | 3.050 | 1.23x |
55+
| 1000000 | 0.829 | 2.540 | 3.06x |
5656

5757
#### int128_with_int32 t_Subtraction
5858

5959
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
6060
|---------|---------------|----------------|------------|
61-
| 1000000 | 2.340 | 3.110 | 1.33x |
61+
| 1000000 | 0.829 | 2.560 | 3.09x |
6262

6363
#### int128_with_int32 t_Multiplication
6464

6565
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
6666
|---------|---------------|----------------|------------|
67-
| 100000 | 0.070 | 0.274 | 3.91x |
67+
| 100000 | 0.075 | 0.276 | 3.68x |
6868

6969
#### int128_with_int16 t_Addition
7070

7171
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
7272
|---------|---------------|----------------|------------|
73-
| 100000 | 0.072 | 0.473 | 6.57x |
73+
| 100000 | 0.074 | 0.513 | 6.93x |
7474

7575
#### int128_with_int16 t_Subtraction
7676

7777
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
7878
|---------|---------------|----------------|------------|
79-
| 100000 | 0.069 | 0.545 | 7.90x |
79+
| 100000 | 0.076 | 0.511 | 6.72x |
8080

8181
#### int128_with_int16 t_Multiplication
8282

8383
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
8484
|---------|---------------|----------------|------------|
85-
| 100000 | 0.073 | 0.559 | 7.66x |
85+
| 100000 | 0.074 | 0.615 | 8.31x |
8686

8787
#### int128_with_int8 t_Addition
8888

8989
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
9090
|---------|---------------|----------------|------------|
91-
| 100000 | 0.061 | 0.975 | 15.98x |
91+
| 100000 | 0.061 | 1.040 | 17.05x |
9292

9393
#### int128_with_int8 t_Subtraction
9494

9595
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
9696
|---------|---------------|----------------|------------|
97-
| 100000 | 0.059 | 1.090 | 18.47x |
97+
| 100000 | 0.067 | 1.030 | 15.37x |
9898

9999
#### int256_with_int32 t_Addition
100100

101101
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
102102
|---------|---------------|----------------|------------|
103-
| 100000 | 0.091 | 0.503 | 5.53x |
103+
| 100000 | 0.079 | 0.505 | 6.39x |
104104

105105
#### int256_with_int32 t_Subtraction
106106

107107
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
108108
|---------|---------------|----------------|------------|
109-
| 100000 | 0.084 | 0.500 | 5.95x |
109+
| 100000 | 0.079 | 0.508 | 6.43x |
110110

111111
#### int256_with_int32 t_Multiplication
112112

113113
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
114114
|---------|---------------|----------------|------------|
115-
| 100000 | 0.080 | 0.532 | 6.65x |
115+
| 100000 | 0.079 | 0.554 | 7.01x |
116116

117117
#### int256_with_int16 t_Addition
118118

119119
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
120120
|---------|---------------|----------------|------------|
121-
| 100000 | 0.074 | 1.060 | 14.32x |
121+
| 100000 | 0.079 | 1.030 | 13.04x |
122122

123123
#### int256_with_int16 t_Subtraction
124124

125125
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
126126
|---------|---------------|----------------|------------|
127-
| 100000 | 0.075 | 0.905 | 12.07x |
127+
| 100000 | 0.079 | 1.020 | 12.91x |
128128

129129
#### int256_with_int16 t_Multiplication
130130

131131
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
132132
|---------|---------------|----------------|------------|
133-
| 100000 | 0.079 | 1.130 | 14.30x |
133+
| 100000 | 0.079 | 1.160 | 14.68x |
134134

135135
#### int256_with_int8 t_Addition
136136

137137
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
138138
|---------|---------------|----------------|------------|
139-
| 100000 | 0.083 | 1.780 | 21.45x |
139+
| 100000 | 0.079 | 2.060 | 26.08x |
140140

141141
#### int256_with_int8 t_Subtraction
142142

143143
| Variant | SIMD Time (ms) | Plain Time (ms) | Speedup (x) |
144144
|---------|---------------|----------------|------------|
145-
| 100000 | 0.070 | 2.150 | 30.71x |
145+
| 100000 | 0.079 | 2.200 | 27.85x |
146146

0 commit comments

Comments
 (0)