Table of Contents

Benchmark - YGroup2Transpose

(← Back)

See Group

  • YGroup2Transpose: Transpose a 22 matrix (对22矩阵进行转置). Mnemonic: result0[i] := (0==(i&1))?( x[i&~1] ):( y[i&~1] ), result1[i] := (0==(i&1))?( x[(i&~1) + 1] ):( y[(i&~1) + 1] ).

X86

AMD Ryzen 7 7840H

AMD Ryzen 7 7840H w/ Radeon 780M Graphics

.NET Framework

VectorTraits.Benchmarks.NetFw
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 6.2.9200.0
Environment.Version:	4.0.30319.42000
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Windows\Microsoft.NET\Framework64\v4.0.30319\
RuntimeInformation.FrameworkDescription:	.NET Framework 4.8.9277.0
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631 
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///E:/zylSelf/Code/cs/base/VectorTraits/tests/VectorTraits.Benchmarks.NetFw/bin/Release/System.Numerics.Vectors.DLL
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Standard 1.1
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Framework 4.5
VectorTraitsGlobal.InitCheckSum:	-25396097	# 0xFE7C7C7F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	
Vectors.Instance:	VectorTraits256Base	// 
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
SumBase_Basic	181.880	1441.302
SumBase_ShiftByMul	30.933	8474.611	5.879829
SumBase	31.101	8428.903	5.848116
SumTraits	30.121	8703.029	6.038309

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
SumBase_Basic	245.468	1067.937
SumBase_ShiftByMul	49.738	5270.541	4.935255
SumBase	48.411	5414.970	5.070496
SumTraits	49.579	5287.450	4.951088

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
SumBase_Basic	381.770	686.654
SumBase_ShiftByMul	271.608	965.156	1.405593
SumBase	378.347	692.867	1.009048
SumTraits	381.611	686.941	1.000418

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
SumBase_Basic	663.608	395.028
SumBase	654.926	400.265	1.013257
SumTraits	654.973	400.236	1.013184

.NET Core 2.1

VectorTraits.Benchmarks.NetCore
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.22631.0
Environment.Version:	4.0.30319.42000
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\2.1.30\
RuntimeInformation.FrameworkDescription:	.NET Core 4.6.30411.01
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631 
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/2.1.30/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Standard 2.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Core 2.1
VectorTraitsGlobal.InitCheckSum:	-25396097	# 0xFE7C7C7F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	
Vectors.Instance:	VectorTraits256Base	// 
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
SumBase_Basic	190.917	1373.075
SumBase_ShiftByMul	62.429	4199.055	3.058140
SumBase	33.204	7895.027	5.749888
SumTraits	31.791	8245.866	6.005402

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
SumBase_Basic	238.259	1100.250
SumBase_ShiftByMul	120.098	2182.759	1.983875
SumBase	51.756	5064.974	4.603476
SumTraits	50.522	5188.689	4.715918

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
SumBase_Basic	387.801	675.976
SumBase_ShiftByMul	441.338	593.976	0.878693
SumBase	385.987	679.152	1.004699
SumTraits	388.597	674.592	0.997952

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
SumBase_Basic	669.084	391.795
SumBase	684.803	382.802	0.977047
SumTraits	673.999	388.938	0.992708

.NET Core 3.1

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 6.2.9200.0
Environment.Version:	3.1.32
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\3.1.32\
RuntimeInformation.FrameworkDescription:	.NET Core 3.1.32
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/3.1.32/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Core 3.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Core 3.1
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	184.921	1417.602
SumBase_ShiftByMul	27.612	9493.831	6.697107
SumBase	27.044	9693.378	6.837871
SumTraits	9.569	27394.422	19.324484
Sum128Base_Basic	560.804	467.443	0.329742
Sum128Base	555.538	471.874	0.332868
Sum128Sse_Shift	18.765	13970.084	9.854731
Sum128Sse_Shuffle	25.445	10302.426	7.267504
Sum128Sse_ShuffleXImm	19.691	13313.028	9.391233
Sum128Sse_Unpack	18.997	13799.266	9.734233
Sum128Traits	17.222	15221.510	10.737508
Sum256Base_Basic	441.308	594.016	0.419029
Sum256Base	445.657	588.219	0.414940
Sum256Avx2_Shift	8.264	31721.948	22.377194
Sum256Avx2_Shuffle	13.398	19566.632	13.802630
Sum256Avx2_ShuffleXImm	9.777	26811.564	18.913327
Sum256Avx2_Unpack	9.314	28146.623	19.855099
Sum256Traits	8.406	31186.379	21.999394

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	229.664	1141.423
SumBase_ShiftByMul	45.989	5700.205	4.993947
SumBase	44.843	5845.755	5.121463
SumTraits	11.641	22518.090	19.728087
Sum128Base_Basic	902.352	290.512	0.254517
Sum128Base	913.718	286.898	0.251351
Sum128Sse_Shift	35.998	7282.159	6.379896
Sum128Sse_Shuffle	53.848	4868.215	4.265041
Sum128Sse_ShuffleXImm	22.492	11654.850	10.210808
Sum128Sse_Unpack	30.018	8732.909	7.650897
Sum128Traits	23.493	11158.214	9.775706
Sum256Base_Basic	651.473	402.386	0.352530
Sum256Base	649.531	403.590	0.353585
Sum256Avx2_Shift	16.682	15714.290	13.767281
Sum256Avx2_Shuffle	26.272	9977.978	8.741702
Sum256Avx2_ShuffleXImm	11.406	22983.748	20.136050
Sum256Avx2_Unpack	15.198	17248.222	15.111158
Sum256Traits	11.336	23125.151	20.259933

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-468672550
SumBase_Basic	344.146	761.723
SumBase_ShiftByMul	269.651	972.161	1.276266
SumBase	340.023	770.960	1.012127
SumTraits	24.643	10637.484	13.965035
Sum128Base_Basic	1541.624	170.044	0.223236
Sum128Base	1517.720	172.722	0.226752
Sum128Sse_Shift	72.031	3639.344	4.777781
Sum128Sse_Shuffle	104.618	2505.720	3.289544
Sum128Sse_ShuffleXImm	45.135	5807.998	7.624820
Sum128Sse_Unpack	46.056	5691.822	7.472302
Sum128Traits	42.017	6238.989	8.190631
Sum256Base_Basic	988.729	265.132	0.348069
Sum256Base	912.543	287.267	0.377129
Sum256Avx2_Shift	36.383	7205.144	9.459013
Sum256Avx2_Shuffle	58.426	4486.803	5.890336
Sum256Avx2_ShuffleXImm	23.506	11152.278	14.640865
Sum256Avx2_Unpack	22.517	11641.980	15.283751
Sum256Traits	23.253	11273.651	14.800205

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571615236
SumBase_Basic	602.842	434.847
SumBase	601.989	435.463	1.001417
SumTraits	36.511	7179.881	16.511281
Sum128Base_Basic	2826.370	92.749	0.213292
Sum128Base_Move	3246.863	80.738	0.185669
Sum128Base	2806.293	93.413	0.214818
Sum128Sse_Shuffle	210.702	1244.145	2.861109
Sum128Sse_ShuffleXImm	63.642	4119.010	9.472319
Sum128Sse_Unpack	64.027	4094.288	9.415466
Sum128Traits	62.757	4177.127	9.605968
Sum256Base_Basic	1810.463	144.794	0.332977
Sum256Base	1604.768	163.353	0.375657
Sum256Avx2_Shuffle	102.946	2546.429	5.855918
Sum256Avx2_ShuffleXImm	33.151	7907.564	18.184701
Sum256Avx2_Unpack	36.188	7243.872	16.658438
Sum256Traits	34.791	7534.895	17.327690

.NET 5.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.22631.0
Environment.Version:	5.0.17
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\5.0.17\
RuntimeInformation.FrameworkDescription:	.NET 5.0.17
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/5.0.17/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 5.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 5.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	189.921	1380.282
SumBase_ShiftByMul	23.349	11227.262	8.134034
SumBase	23.523	11144.113	8.073794
SumTraits	6.817	38455.627	27.860700
Sum128Base_Basic	560.770	467.472	0.338678
Sum128Base	566.026	463.131	0.335534
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	12.988	20183.828	14.622973
Sum128Sse_Shuffle	18.188	14413.063	10.442113
Sum128Sse_ShuffleXImm	19.552	13407.321	9.713464
Sum128Sse_Unpack	18.032	14537.709	10.532418
Sum128Traits	12.727	20597.994	14.923031
Sum256Base_Basic	533.959	490.944	0.355684
Sum256Base	500.724	523.530	0.379292
Sum256Avx2_Shift	6.765	38748.703	28.073031
Sum256Avx2_Shuffle	9.656	27149.089	19.669231
Sum256Avx2_ShuffleXImm	9.749	26889.428	19.481110
Sum256Avx2_Unpack	9.262	28303.023	20.505245
Sum256Traits	6.517	40224.465	29.142205

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	242.895	1079.248
SumBase_ShiftByMul	42.709	6137.886	5.687188
SumBase	43.204	6067.628	5.622088
SumTraits	11.549	22699.253	21.032472
Sum128Base_Basic	936.467	279.929	0.259374
Sum128Base	931.715	281.356	0.260697
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	26.778	9789.487	9.070656
Sum128Sse_Shuffle	37.964	6905.152	6.398114
Sum128Sse_ShuffleXImm	22.558	11620.795	10.767493
Sum128Sse_Unpack	29.378	8923.009	8.267802
Sum128Traits	21.810	12019.639	11.137050
Sum256Base_Basic	875.602	299.387	0.277403
Sum256Base	791.916	331.025	0.306718
Sum256Avx2_Shift	13.323	19675.733	18.230966
Sum256Avx2_Shuffle	19.544	13412.948	12.428050
Sum256Avx2_ShuffleXImm	11.081	23657.104	21.919988
Sum256Avx2_Unpack	15.307	17126.294	15.868729
Sum256Traits	11.262	23276.756	21.567569

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-468672550
SumBase_Basic	380.727	688.536
SumBase_ShiftByMul	273.494	958.500	1.392084
SumBase	389.656	672.758	0.977085
SumTraits	23.385	11209.907	16.280786
Sum128Base_Basic	1533.128	170.986	0.248333
Sum128Base	1534.326	170.853	0.248139
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	51.945	5046.597	7.329460
Sum128Sse_Shuffle	78.729	3329.717	4.835938
Sum128Sse_ShuffleXImm	43.410	6038.754	8.770425
Sum128Sse_Unpack	45.824	5720.625	8.308388
Sum128Traits	41.528	6312.395	9.167851
Sum256Base_Basic	1411.759	185.686	0.269682
Sum256Base	1287.903	203.543	0.295617
Sum256Avx2_Shift	28.651	9149.492	13.288327
Sum256Avx2_Shuffle	39.993	6554.810	9.519923
Sum256Avx2_ShuffleXImm	23.221	11289.191	16.395933
Sum256Avx2_Unpack	23.385	11210.155	16.281145
Sum256Traits	22.879	11457.623	16.640558

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571615236
SumBase_Basic	673.172	389.416
SumBase	677.746	386.788	0.993252
SumTraits	34.365	7628.245	19.588935
Sum128Base_Basic	2822.655	92.871	0.238489
Sum128Base_Move	3259.652	80.421	0.206517
Sum128Base	2804.003	93.489	0.240075
Sum128AdvSimd_AlignRight	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shuffle	151.574	1729.481	4.441216
Sum128Sse_ShuffleXImm	67.959	3857.377	9.905544
Sum128Sse_Unpack	67.028	3910.977	10.043184
Sum128Traits	90.610	2893.098	7.429326
Sum256Base_Basic	2624.393	99.887	0.256506
Sum256Base	2394.604	109.473	0.281120
Sum256Avx2_Shuffle	74.790	3505.087	9.000880
Sum256Avx2_ShuffleXImm	33.846	7745.265	19.889437
Sum256Avx2_Unpack	35.582	7367.321	18.918896
Sum256Traits	33.622	7796.693	20.021501

.NET 6.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.22631.0
Environment.Version:	6.0.33
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\6.0.33\
RuntimeInformation.FrameworkDescription:	.NET 6.0.33
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/6.0.33/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 6.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 6.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	187.295	1399.632
SumBase_ShiftByMul	23.452	11177.763	7.986214
SumBase	23.511	11149.732	7.966186
SumTraits	6.661	39356.226	28.118976
Sum128Base_Basic	558.825	469.098	0.335158
Sum128Base	556.816	470.791	0.336367
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	13.329	19667.740	14.052076
Sum128Sse_Shuffle	18.551	14131.058	10.096264
Sum128Sse_ShuffleXImm	19.608	13369.118	9.551879
Sum128Sse_Unpack	18.882	13883.004	9.919036
Sum128Traits	12.839	20417.316	14.587629
Sum256Base_Basic	443.370	591.254	0.422435
Sum256Base	445.742	588.107	0.420187
Sum256Avx2_Shift	6.631	39535.056	28.246745
Sum256Avx2_Shuffle	9.987	26249.757	18.754752
Sum256Avx2_ShuffleXImm	9.829	26671.605	19.056152
Sum256Avx2_Unpack	9.273	28270.208	20.198310
Sum256Traits	6.742	38880.264	27.778913

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	222.622	1177.530
SumBase_ShiftByMul	43.734	5994.017	5.090329
SumBase	42.735	6134.151	5.209336
SumTraits	11.215	23374.006	19.850025
Sum128Base_Basic	908.367	288.588	0.245079
Sum128Base	908.717	288.477	0.244985
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	26.567	9867.260	8.379623
Sum128Sse_Shuffle	38.656	6781.425	5.759023
Sum128Sse_ShuffleXImm	22.972	11411.385	9.690948
Sum128Sse_Unpack	29.607	8854.057	7.519176
Sum128Traits	23.346	11228.669	9.535779
Sum256Base_Basic	645.287	406.244	0.344997
Sum256Base	660.004	397.185	0.337304
Sum256Avx2_Shift	13.639	19219.787	16.322117
Sum256Avx2_Shuffle	20.044	13078.338	11.106583
Sum256Avx2_ShuffleXImm	11.165	23479.687	19.939773
Sum256Avx2_Unpack	14.836	17669.918	15.005914
Sum256Traits	11.419	22957.086	19.495962

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-468672550
SumBase_Basic	342.768	764.784
SumBase_ShiftByMul	271.270	966.360	1.263571
SumBase	343.945	762.168	0.996579
SumTraits	22.576	11611.570	15.182802
Sum128Base_Basic	1520.487	172.408	0.225433
Sum128Base	1522.807	172.145	0.225090
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	52.848	4960.324	6.485911
Sum128Sse_Shuffle	78.555	3337.062	4.363403
Sum128Sse_ShuffleXImm	44.182	5933.314	7.758152
Sum128Sse_Unpack	46.995	5578.087	7.293673
Sum128Traits	45.871	5714.850	7.472498
Sum256Base_Basic	914.032	286.800	0.375007
Sum256Base	912.493	287.283	0.375640
Sum256Avx2_Shift	27.415	9562.074	12.502966
Sum256Avx2_Shuffle	41.533	6311.726	8.252947
Sum256Avx2_ShuffleXImm	22.415	11694.822	15.291658
Sum256Avx2_Unpack	23.480	11164.635	14.598408
Sum256Traits	23.189	11304.557	14.781364

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571615236
SumBase_Basic	593.441	441.736
SumBase	594.324	441.080	0.998515
SumTraits	33.734	7770.801	17.591520
Sum128Base_Basic	2818.007	93.025	0.210589
Sum128Base_Move	3262.220	80.358	0.181913
Sum128Base	2820.107	92.955	0.210432
Sum128AdvSimd_AlignRight	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shuffle	154.039	1701.800	3.852530
Sum128Sse_ShuffleXImm	61.991	4228.732	9.572993
Sum128Sse_Unpack	64.733	4049.626	9.167533
Sum128Traits	66.042	3969.378	8.985868
Sum256Base_Basic	1615.792	162.239	0.367276
Sum256Base	1605.480	163.281	0.369635
Sum256Avx2_Shuffle	77.410	3386.457	7.666253
Sum256Avx2_ShuffleXImm	35.191	7449.184	16.863445
Sum256Avx2_Unpack	34.882	7515.070	17.012598
Sum256Traits	35.050	7479.196	16.931386

.NET 7.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.22631.0
Environment.Version:	7.0.20
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\7.0.20\
RuntimeInformation.FrameworkDescription:	.NET 7.0.20
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	True
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/7.0.20/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 7.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 7.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	196.980	1330.814
SumBase_Shift	7.055	37155.325	27.919244
SumBase_ShiftByMul	19.215	13642.917	10.251557
SumBase	7.104	36901.097	27.728213
SumTraits	7.013	37378.897	28.087241
Sum128Base_Basic	225.474	1162.633	0.873626
Sum128Base_Shift	14.047	18662.427	14.023316
Sum128Base_Shuffle	953.678	274.877	0.206548
Sum128Base	13.358	19625.135	14.746714
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	13.897	18863.977	14.174765
Sum128Sse_Shuffle	15.973	16411.431	12.331873
Sum128Sse_ShuffleXImm	19.943	13144.409	9.876969
Sum128Sse_Unpack	19.254	13615.353	10.230845
Sum128Traits	13.758	19054.344	14.317810
Sum256Base_Basic	193.965	1351.502	1.015545
Sum256Base_Shift	6.821	38432.651	28.879053
Sum256Base_Shuffle	846.848	309.553	0.232604
Sum256Base	6.688	39193.919	29.451084
Sum256Avx2_Shift	6.804	38526.091	28.949265
Sum256Avx2_Shuffle	9.689	27054.776	20.329493
Sum256Avx2_ShuffleXImm	9.829	26671.291	20.041334
Sum256Avx2_Unpack	9.525	27520.772	20.679651
Sum256Traits	6.845	38296.407	28.776676

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	245.437	1068.071
SumBase_Shift	13.917	18836.299	17.635811
SumBase_ShiftByMul	26.148	10025.228	9.386293
SumBase	13.735	19086.052	17.869646
SumTraits	11.307	23184.277	21.706680
Sum128Base_Basic	318.792	822.303	0.769896
Sum128Base_Shift	28.725	9125.848	8.544233
Sum128Base_Shuffle	1876.847	139.673	0.130771
Sum128Base	27.528	9522.829	8.915913
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	28.211	9292.319	8.700095
Sum128Sse_Shuffle	31.389	8351.413	7.819154
Sum128Sse_ShuffleXImm	22.271	11770.778	11.020594
Sum128Sse_Unpack	30.593	8568.649	8.022546
Sum128Traits	22.846	11474.512	10.743210
Sum256Base_Basic	246.963	1061.470	0.993820
Sum256Base_Shift	14.027	18689.176	17.498064
Sum256Base_Shuffle	1716.102	152.755	0.143020
Sum256Base	13.740	19078.670	17.862734
Sum256Avx2_Shift	13.382	19588.936	18.340480
Sum256Avx2_Shuffle	19.836	13215.429	12.373174
Sum256Avx2_ShuffleXImm	11.613	22574.068	21.135361
Sum256Avx2_Unpack	15.576	16830.173	15.757540
Sum256Traits	11.673	22457.159	21.025903

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-468672550
SumBase_Basic	392.314	668.199
SumBase_Shift	29.538	8874.686	13.281508
SumBase_ShiftByMul	280.311	935.189	1.399568
SumBase	28.979	9045.898	13.537737
SumTraits	23.986	10928.948	16.355836
Sum128Base_Basic	549.317	477.218	0.714186
Sum128Base_Shift	54.753	4787.752	7.165163
Sum128Base_Shuffle	3823.755	68.557	0.102599
Sum128Base	55.730	4703.784	7.039500
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shift	55.428	4729.456	7.077919
Sum128Sse_Shuffle	73.790	3552.559	5.316620
Sum128Sse_ShuffleXImm	45.296	5787.299	8.661045
Sum128Sse_Unpack	48.321	5425.090	8.118978
Sum128Traits	45.924	5708.247	8.542739
Sum256Base_Basic	393.110	666.846	0.997975
Sum256Base_Shift	29.943	8754.774	13.102052
Sum256Base_Shuffle	3429.727	76.433	0.114387
Sum256Base	30.762	8521.597	12.753089
Sum256Avx2_Shift	29.827	8788.961	13.153215
Sum256Avx2_Shuffle	38.790	6757.990	10.113744
Sum256Avx2_ShuffleXImm	23.675	11072.780	16.571090
Sum256Avx2_Unpack	24.222	10822.715	16.196851
Sum256Traits	23.665	11077.138	16.577612

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571615236
SumBase_Basic	672.648	389.720
SumBase	668.884	391.912	1.005627
SumTraits	34.307	7641.226	19.606981
Sum128Base_Basic	970.425	270.133	0.693147
Sum128Base_Move	1708.317	153.452	0.393749
Sum128Base_Shuffle	8272.743	31.688	0.081309
Sum128Base	1703.334	153.901	0.394901
Sum128AdvSimd_AlignRight	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128Sse_Shuffle	146.673	1787.274	4.586049
Sum128Sse_ShuffleXImm	66.036	3969.725	10.186103
Sum128Sse_Unpack	64.378	4071.961	10.448435
Sum128Traits	73.638	3559.901	9.134517
Sum256Base_Basic	669.351	391.639	1.004925
Sum256Base_Shuffle	7506.062	34.924	0.089614
Sum256Base	662.972	395.408	1.014595
Sum256Avx2_Shuffle	75.903	3453.686	8.861974
Sum256Avx2_ShuffleXImm	38.022	6894.503	17.690928
Sum256Avx2_Unpack	36.612	7159.992	18.372161
Sum256Traits	36.713	7140.319	18.321680

.NET 8.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	16
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.22631.0
Environment.Version:	8.0.8
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\8.0.8\
RuntimeInformation.FrameworkDescription:	.NET 8.0.8
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.22631
RuntimeInformation.RuntimeIdentifier:	win-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	True
Vector512.IsHardwareAccelerated:	True
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/8.0.8/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 8.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 8.0
VectorTraitsGlobal.InitCheckSum:	-2122844161	# 0x8177F7FF
VectorEnvironment.CpuModelName:	AMD Ryzen 7 7840H w/ Radeon 780M Graphics
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Avx512BW, Avx512CD, Avx512DQ, Avx512F, Avx512Vbmi, Avx512VL, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2, Avx512VL
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2, Avx512VL
Vector512s.Instance:	WVectorTraits512Avx512	// Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi, Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2, Avx512VL
Vectors.BaseInstance:	VectorTraits256Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
# Sum512Base_Basic	114
SumBase_Basic	192.499	1361.794
SumBase_Shift	6.851	38262.719	28.097279
SumBase_ShiftByMul	17.858	14679.149	10.779269
SumBase	6.720	39010.284	28.646234
SumTraits	5.450	48098.762	35.320132
Sum128Base_Basic	226.595	1156.885	0.849530
Sum128Base_Shift	13.532	19372.725	14.225880
Sum128Base_Shuffle	943.578	277.819	0.204010
Sum128Base	14.267	18374.476	13.492840
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	14.064	18639.392	13.687375
Sum128Sse_Shuffle	15.212	17233.052	12.654664
Sum128Sse_ShuffleX	9.672	27102.285	19.901890
Sum128Sse_ShuffleXImm	19.903	13171.379	9.672076
Sum128Sse_Unpack	19.032	13774.146	10.114702
Sum128Traits	9.790	26775.661	19.662042
Sum256Base_Basic	196.697	1332.730	0.978657
Sum256Base_Shift	6.606	39684.337	29.141208
Sum256Base_Shuffle	1015.517	258.138	0.189558
Sum256Base	6.883	38085.830	27.967384
Sum256Avx2_Shift	6.880	38101.763	27.979085
Sum256Avx2_Shuffle	8.062	32517.279	23.878257
Sum256Avx2_ShuffleX	5.208	50338.633	36.964926
Sum256Avx2_ShuffleXImm	10.630	24661.834	18.109806
Sum256Avx2_Unpack	10.475	25026.818	18.377823
Sum256Traits	5.733	45721.618	33.574536
Sum512Base_Basic	182.853	1433.632	1.052752
Sum512Base_Shift	5.403	48515.893	35.626442
Sum512Base_Shuffle	975.994	268.592	0.197234
Sum512Base	5.144	50963.073	37.423468
Sum512Avx512_Shift	5.033	52089.390	38.250551
Sum512Avx512_Shuffle	7.284	35990.248	26.428545
Sum512Avx512_ShuffleX	3.739	70120.064	51.490929
Sum512Avx512_ShuffleXImm	8.779	29861.591	21.928119
Sum512Avx512_Unpack	9.270	28278.590	20.765681
Sum512Traits	3.855	67996.564	49.931590

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
# Sum512Base_Basic	-24614
SumBase_Basic	241.760	1084.313
SumBase_Shift	13.768	19040.668	17.560119
SumBase_ShiftByMul	24.434	10728.752	9.894514
SumBase	14.008	18714.113	17.258956
SumTraits	9.730	26941.241	24.846366
Sum128Base_Basic	321.155	816.253	0.752784
Sum128Base_Shift	26.147	10025.922	9.246334
Sum128Base_Shuffle	1818.991	144.115	0.132909
Sum128Base	26.709	9814.710	9.051545
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	26.295	9969.224	9.194045
Sum128Sse_Shuffle	30.727	8531.394	7.868017
Sum128Sse_ShuffleX	19.176	13670.158	12.607205
Sum128Sse_ShuffleXImm	21.692	12085.037	11.145338
Sum128Sse_Unpack	28.909	9067.771	8.362687
Sum128Traits	17.814	14715.764	13.571507
Sum256Base_Basic	238.454	1099.348	1.013866
Sum256Base_Shift	13.717	19110.914	17.624902
Sum256Base_Shuffle	1673.062	156.685	0.144502
Sum256Base	13.419	19534.603	18.015647
Sum256Avx2_Shift	13.850	18927.024	17.455312
Sum256Avx2_Shuffle	16.346	16037.189	14.790183
Sum256Avx2_ShuffleX	10.090	25981.645	23.961385
Sum256Avx2_ShuffleXImm	11.237	23329.431	21.515400
Sum256Avx2_Unpack	14.606	17947.498	16.551951
Sum256Traits	9.671	27107.054	24.999286
Sum512Base_Basic	195.625	1340.036	1.235839
Sum512Base_Shift	10.319	25404.622	23.429230
Sum512Base_Shuffle	1969.458	133.105	0.122755
Sum512Base	10.297	25459.478	23.479820
Sum512Avx512_Shift	10.259	25553.372	23.566413
Sum512Avx512_Shuffle	15.704	16693.189	15.395174
Sum512Avx512_ShuffleX	7.262	36099.104	33.292139
Sum512Avx512_ShuffleXImm	9.389	27919.269	25.748345
Sum512Avx512_Unpack	13.649	19205.735	17.712351
Sum512Traits	7.234	36236.553	33.418900

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468672550
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-468672550
# Sum512Base_Basic	-468213798
SumBase_Basic	379.306	691.115
SumBase_Shift	27.875	9404.298	13.607437
SumBase_ShiftByMul	274.637	954.510	1.381116
SumBase	27.728	9454.060	13.679439
SumTraits	20.374	12866.286	18.616719
Sum128Base_Basic	550.856	475.885	0.688576
Sum128Base_Shift	51.201	5119.904	7.408185
Sum128Base_Shuffle	3614.286	72.530	0.104946
Sum128Base	52.300	5012.278	7.252456
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	52.309	5011.418	7.251211
Sum128Sse_Shuffle	60.859	4307.408	6.232552
Sum128Sse_ShuffleX	36.652	7152.264	10.348883
Sum128Sse_ShuffleXImm	43.361	6045.618	8.747635
Sum128Sse_Unpack	43.180	6070.926	8.784254
Sum128Traits	38.510	6807.102	9.849455
Sum256Base_Basic	379.085	691.518	1.000583
Sum256Base_Shift	28.797	9103.098	13.171620
Sum256Base_Shuffle	3404.876	76.991	0.111401
Sum256Base	27.391	9570.426	13.847814
Sum256Avx2_Shift	27.959	9375.981	13.566464
Sum256Avx2_Shuffle	33.921	7727.964	11.181885
Sum256Avx2_ShuffleX	20.746	12635.989	18.283495
Sum256Avx2_ShuffleXImm	22.528	11636.384	16.837128
Sum256Avx2_Unpack	22.927	11433.758	16.543940
Sum256Traits	20.707	12659.967	18.318189
Sum512Base_Basic	286.440	915.180	1.324208
Sum512Base_Shift	21.089	12430.099	17.985585
Sum512Base_Shuffle	3591.991	72.980	0.105598
Sum512Base	21.073	12439.970	17.999867
Sum512Avx512_Shift	21.200	12365.036	17.891443
Sum512Avx512_Shuffle	31.650	8282.516	11.984289
Sum512Avx512_ShuffleX	15.912	16475.017	23.838331
Sum512Avx512_ShuffleXImm	18.666	14043.646	20.320287
Sum512Avx512_Unpack	19.453	13475.853	19.498726
Sum512Traits	15.652	16748.758	24.234417

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571615236
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571615236
# Sum512Base_Basic	5910372263571811844
SumBase_Basic	659.280	397.622
SumBase	657.287	398.827	1.003032
SumTraits	32.869	7975.323	20.057564
Sum128Base_Basic	965.041	271.640	0.683162
Sum128Base_Move	939.463	279.036	0.701763
Sum128Base_Shuffle	7453.719	35.170	0.088450
Sum128Base	941.635	278.392	0.700144
Sum128AdvSimd_AlignRight	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shuffle	124.920	2098.488	5.277598
Sum128Sse_ShuffleX	73.756	3554.219	8.938695
Sum128Sse_ShuffleXImm	63.971	4097.879	10.305974
Sum128Sse_Unpack	63.349	4138.089	10.407101
Sum128Traits	64.749	4048.649	10.182163
Sum256Base_Basic	655.702	399.791	1.005456
Sum256Base_Shuffle	8131.135	32.240	0.081081
Sum256Base	662.777	395.524	0.994723
Sum256Avx2_Shuffle	64.316	4075.881	10.250650
Sum256Avx2_ShuffleX	38.489	6810.950	17.129220
Sum256Avx2_ShuffleXImm	33.193	7897.670	19.862270
Sum256Avx2_Unpack	33.369	7856.013	19.757505
Sum256Traits	33.770	7762.624	19.522636
Sum512Base_Basic	467.141	561.166	1.411307
Sum512Base_Shuffle	7265.229	36.082	0.090745
Sum512Avx512_Shuffle	62.614	4186.646	10.529218
Sum512Avx512_ShuffleX	29.688	8829.946	22.206901
Sum512Avx512_ShuffleXImm	28.702	9133.388	22.970042
Sum512Avx512_Unpack	28.987	9043.362	22.743631
Sum512Traits	28.180	9302.326	23.394914

Intel Core i5-8250U

Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz

.NET Framework

VectorTraits.Benchmarks.NetFw
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 6.2.9200.0
Environment.Version:	4.0.30319.42000
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Windows\Microsoft.NET\Framework64\v4.0.30319\
RuntimeInformation.FrameworkDescription:	.NET Framework 4.8.9195.0
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045 
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///E:/910Soft/MyCode/VectorTraits_test/RunBenchmarks_All/VectorTraits.Benchmarks.NetFw/bin/Release/System.Numerics.Vectors.DLL
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Standard 1.1
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Framework 4.5
VectorTraitsGlobal.InitCheckSum:	-25396097	# 0xFE7C7C7F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	
Vectors.Instance:	VectorTraits256Base	// 
Vectors.BaseInstance:	VectorTraits256Base


.NET Core 2.1

VectorTraits.Benchmarks.NetCore
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.19045.0
Environment.Version:	4.0.30319.42000
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\2.1.30\
RuntimeInformation.FrameworkDescription:	.NET Core 4.6.30411.01
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045 
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/2.1.30/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Standard 2.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Core 2.1
VectorTraitsGlobal.InitCheckSum:	-25396097	# 0xFE7C7C7F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	
Vectors.Instance:	VectorTraits256Base	// 
Vectors.BaseInstance:	VectorTraits256Base


.NET Core 3.1

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 6.2.9200.0
Environment.Version:	3.1.32
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\3.1.32\
RuntimeInformation.FrameworkDescription:	.NET Core 3.1.32
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/3.1.32/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Core 3.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Core 3.1
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


.NET 5.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.19045.0
Environment.Version:	5.0.17
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\5.0.17\
RuntimeInformation.FrameworkDescription:	.NET 5.0.17
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/5.0.17/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 5.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 5.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


.NET 6.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.19045.0
Environment.Version:	6.0.24
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\6.0.24\
RuntimeInformation.FrameworkDescription:	.NET 6.0.24
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/6.0.24/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 6.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 6.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


.NET 7.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Microsoft Windows NT 10.0.19045.0
Environment.Version:	7.0.13
Stopwatch.Frequency:	10000000
RuntimeEnvironment.GetRuntimeDirectory:	C:\Program Files\dotnet\shared\Microsoft.NETCore.App\7.0.13\
RuntimeInformation.FrameworkDescription:	.NET 7.0.13
RuntimeInformation.OSArchitecture:	X64
RuntimeInformation.OSDescription:	Microsoft Windows 10.0.19045
RuntimeInformation.RuntimeIdentifier:	win10-x64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	32	# 256bit
Vector<float>.Count:	8	# 256bit
Vector<T>.Assembly.CodeBase:	file:///C:/Program Files/dotnet/shared/Microsoft.NETCore.App/7.0.13/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 7.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 7.0
VectorTraitsGlobal.InitCheckSum:	7960959	# 0x0079797F
VectorEnvironment.CpuModelName:	Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz
VectorEnvironment.SupportedInstructionSets:	Aes, Avx, Avx2, Bmi1, Bmi2, Fma, Lzcnt, Pclmulqdq, Popcnt, Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, X86Base
Vector128s.Instance:	WVectorTraits128Avx2	// Sse, Sse2, Sse3, Ssse3, Sse41, Sse42, Avx, Avx2
Vector256s.Instance:	WVectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.Instance:	VectorTraits256Avx2	// Avx, Avx2, Sse, Sse2
Vectors.BaseInstance:	VectorTraits256Base


Arm

Apple M2

Apple M2

.NET 6.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 15.0.1
Environment.Version:	6.0.33
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/usr/local/share/dotnet/shared/Microsoft.NETCore.App/6.0.33/
RuntimeInformation.FrameworkDescription:	.NET 6.0.33
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Darwin 24.0.0 Darwin Kernel Version 24.0.0: Tue Sep 24 23:37:13 PDT 2024; root:xnu-11215.1.12~1/RELEASE_ARM64_T8112
RuntimeInformation.RuntimeIdentifier:	osx.15-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector<T>.Assembly.CodeBase:	file:///usr/local/share/dotnet/shared/Microsoft.NETCore.App/6.0.33/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 6.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 6.0
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Apple M2
VectorEnvironment.SupportedInstructionSets:	AdvSimd, ArmBase, Crc32
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	106.512	2461.178
SumBase_ShiftByMul	23.828	11001.569	4.470041
SumBase	23.829	11000.927	4.469780
SumTraits	9.636	27204.165	11.053308
Sum128Base_Basic	268.920	974.803	0.396072
Sum128Base	268.715	975.545	0.396373
Sum128AdvSimd_Shift	9.672	27104.651	11.012875
Sum128AdvSimd_Shuffle	24.383	10751.218	4.368321
Sum128AdvSimdB64	9.633	27213.746	11.057201
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	9.632	27215.292	11.057829
Sum256Base_Basic	256.151	1023.397	0.415816
Sum256Base	262.904	997.107	0.405134
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	262.530	998.531	0.405712

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	117.748	2226.313
SumBase_ShiftByMul	33.829	7749.020	3.480651
SumBase	33.822	7750.723	3.481416
SumTraits	19.268	13605.242	6.111108
Sum128Base_Basic	321.126	816.329	0.366673
Sum128Base	319.626	820.157	0.368393
Sum128AdvSimd_Shift	19.303	13580.446	6.099971
Sum128AdvSimd_Shuffle	48.736	5378.852	2.416036
Sum128AdvSimdB64	19.277	13599.072	6.108337
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	19.269	13604.684	6.110858
Sum256Base_Basic	341.059	768.618	0.345243
Sum256Base	341.142	768.430	0.345158
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	341.044	768.651	0.345257

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	145.595	1800.507
SumBase_ShiftByMul	113.237	2314.998	1.285747
SumBase	145.553	1801.026	1.000288
SumTraits	38.536	6802.625	3.778171
Sum128Base_Basic	406.637	644.664	0.358046
Sum128Base	406.179	645.390	0.358449
Sum128AdvSimd_Shift	38.578	6795.208	3.774052
Sum128AdvSimd_Shuffle	97.455	2689.890	1.493962
Sum128AdvSimdB64	38.541	6801.708	3.777662
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	38.535	6802.761	3.778246
Sum256Base_Basic	494.542	530.074	0.294403
Sum256Base	493.327	531.380	0.295128
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	494.928	529.661	0.294173

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	199.332	1315.112
SumBase	199.508	1313.956	0.999120
SumTraits	77.074	3401.200	2.586243
Sum128Base_Basic	672.061	390.060	0.296598
Sum128Base_Move	576.701	454.558	0.345642
Sum128Base	671.578	390.340	0.296811
Sum128AdvSimd_AlignRight	77.088	3400.579	2.585771
Sum128AdvSimd_Shuffle	194.923	1344.860	1.022620
Sum128AdvSimdB64	77.063	3401.672	2.586602
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	77.083	3400.783	2.585926
Sum256Base_Basic	759.122	345.325	0.262582
Sum256Base	759.020	345.372	0.262618
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	758.881	345.435	0.262666

.NET 7.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 15.0.1
Environment.Version:	7.0.20
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/usr/local/share/dotnet/shared/Microsoft.NETCore.App/7.0.20/
RuntimeInformation.FrameworkDescription:	.NET 7.0.20
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Darwin 24.0.0 Darwin Kernel Version 24.0.0: Tue Sep 24 23:37:13 PDT 2024; root:xnu-11215.1.12~1/RELEASE_ARM64_T8112
RuntimeInformation.RuntimeIdentifier:	osx.15-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	False
Vector<T>.Assembly.CodeBase:	file:///usr/local/share/dotnet/shared/Microsoft.NETCore.App/7.0.20/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 7.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 7.0
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Apple M2
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Dp, Rdm, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	106.610	2458.904
SumBase_Shift	9.650	27165.424	11.047776
SumBase_ShiftByMul	22.325	11742.220	4.775387
SumBase	9.651	27161.862	11.046327
SumTraits	9.630	27220.518	11.070182
Sum128Base_Basic	106.581	2459.565	1.000269
Sum128Base_Shift	9.657	27145.381	11.039625
Sum128Base_Shuffle	793.432	330.392	0.134366
Sum128Base	9.658	27142.831	11.038588
Sum128AdvSimd_Shift	9.659	27139.730	11.037327
Sum128AdvSimd_Shuffle	24.373	10755.464	4.374088
Sum128AdvSimdB64	9.630	27221.482	11.070574
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	9.629	27223.667	11.071463
Sum256Base_Basic	345.877	757.911	0.308231
Sum256Base_Shift	474.358	552.629	0.224746
Sum256Base_Shuffle	1472.625	178.011	0.072395
Sum256Base	360.933	726.296	0.295374
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	360.939	726.284	0.295369

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	117.649	2228.193
SumBase_Shift	19.278	13598.427	6.102893
SumBase_ShiftByMul	26.524	9883.169	4.435508
SumBase	19.277	13598.763	6.103044
SumTraits	19.262	13609.477	6.107853
Sum128Base_Basic	117.694	2227.333	0.999614
Sum128Base_Shift	19.349	13548.513	6.080492
Sum128Base_Shuffle	1804.115	145.303	0.065211
Sum128Base	19.295	13585.946	6.097292
Sum128AdvSimd_Shift	19.297	13584.619	6.096697
Sum128AdvSimd_Shuffle	48.711	5381.586	2.415224
Sum128AdvSimdB64	19.267	13605.659	6.106139
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	19.265	13606.980	6.106732
Sum256Base_Basic	460.501	569.259	0.255480
Sum256Base_Shift	715.705	366.274	0.164382
Sum256Base_Shuffle	2956.847	88.657	0.039789
Sum256Base	460.829	568.853	0.255298
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	460.516	569.240	0.255471

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
Check-Sum128Base_Shift	Check `Sum128Base_Shift` mismatch. -833942548!=-468148262
Check-Sum128Base	Check `Sum128Base` mismatch. -833942548!=-468148262
Check-Sum128AdvSimd_Shift	Check `Sum128AdvSimd_Shift` mismatch. -833942548!=-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	145.372	1803.269
SumBase_Shift	38.548	6800.516	3.771216
SumBase_ShiftByMul	103.579	2530.852	1.403480
SumBase	38.544	6801.109	3.771544
SumTraits	38.518	6805.713	3.774098
Sum128Base_Basic	145.473	1802.007	0.999300
Sum128Base_Shift	38.562	6797.973	3.769805
Sum128Base_Shuffle	3478.954	75.351	0.041786
Sum128Base	38.565	6797.431	3.769505
Sum128AdvSimd_Shift	38.565	6797.408	3.769492
Sum128AdvSimd_Shuffle	97.402	2691.355	1.492487
Sum128AdvSimdB64	38.534	6803.010	3.772598
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	38.536	6802.570	3.772354
Sum256Base_Basic	658.854	397.879	0.220643
Sum256Base_Shift	1173.882	223.314	0.123838
Sum256Base_Shuffle	5879.766	44.584	0.024724
Sum256Base	664.045	394.769	0.218918
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	663.918	394.844	0.218960

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	199.420	1314.531
SumBase	199.416	1314.559	1.000021
SumTraits	77.077	3401.077	2.587293
Sum128Base_Basic	199.249	1315.664	1.000861
Sum128Base_Move	77.098	3400.162	2.586596
Sum128Base_Shuffle	6473.247	40.497	0.030807
Sum128Base	77.100	3400.032	2.586497
Sum128AdvSimd_AlignRight	77.088	3400.571	2.586908
Sum128AdvSimd_Shuffle	194.772	1345.902	1.023865
Sum128AdvSimdB64	77.067	3401.493	2.587609
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	77.085	3400.729	2.587028
Sum256Base_Basic	1016.640	257.853	0.196156
Sum256Base_Shuffle	10541.249	24.868	0.018918
Sum256Base	1017.750	257.572	0.195942
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1017.614	257.606	0.195968

.NET 8.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	8
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 15.0.1
Environment.Version:	8.0.4
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/usr/local/share/dotnet/shared/Microsoft.NETCore.App/8.0.4/
RuntimeInformation.FrameworkDescription:	.NET 8.0.4
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Darwin 24.0.0 Darwin Kernel Version 24.0.0: Tue Sep 24 23:37:13 PDT 2024; root:xnu-11215.1.12~1/RELEASE_ARM64_T8112
RuntimeInformation.RuntimeIdentifier:	osx-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	False
Vector512.IsHardwareAccelerated:	False
Vector<T>.Assembly.CodeBase:	file:///usr/local/share/dotnet/shared/Microsoft.NETCore.App/8.0.4/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 8.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 8.0
VectorTraitsGlobal.InitCheckSum:	-2122844159	# 0x8177F801
VectorEnvironment.CpuModelName:	Apple M2
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Dp, Rdm, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
# Sum512Base_Basic	114
SumBase_Basic	106.158	2469.367
SumBase_Shift	9.654	27153.733	10.996234
SumBase_ShiftByMul	19.652	13339.615	5.402039
SumBase	9.652	27158.974	10.998357
SumTraits	9.648	27170.146	11.002881
Sum128Base_Basic	106.154	2469.467	1.000041
Sum128Base_Shift	9.654	27153.784	10.996255
Sum128Base_Shuffle	793.201	330.489	0.133835
Sum128Base	9.654	27153.139	10.995994
Sum128AdvSimd_Shift	9.652	27160.417	10.998941
Sum128AdvSimd_Shuffle	24.369	10757.417	4.356347
Sum128AdvSimd_ShuffleX	13.876	18892.350	7.650687
Sum128AdvSimdB64	9.626	27231.946	11.027908
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	9.625	27234.776	11.029054
Sum256Base_Basic	94.001	2788.745	1.129336
Sum256Base_Shift	6.242	41998.174	17.007671
Sum256Base_Shuffle	782.390	335.056	0.135685
Sum256Base	94.263	2780.995	1.126198
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	94.263	2780.990	1.126196
Sum512Base_Basic	91.580	2862.453	1.159185
Sum512Base_Shift	5.463	47986.257	19.432618
Sum512Base_Shuffle	1071.354	244.685	0.099088
Sum512Base	91.603	2861.747	1.158899
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	91.597	2861.939	1.158977

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
# Sum512Base_Basic	8154
SumBase_Basic	116.635	2247.551
SumBase_Shift	19.297	13584.430	6.044104
SumBase_ShiftByMul	23.056	11369.988	5.058835
SumBase	19.288	13591.099	6.047072
SumTraits	19.255	13614.084	6.057298
Sum128Base_Basic	116.691	2246.473	0.999520
Sum128Base_Shift	19.290	13589.474	6.046348
Sum128Base_Shuffle	1800.141	145.624	0.064792
Sum128Base	19.291	13589.010	6.046142
Sum128AdvSimd_Shift	19.292	13587.903	6.045649
Sum128AdvSimd_Shuffle	48.721	5380.546	2.393960
Sum128AdvSimd_ShuffleX	27.724	9455.404	4.206982
Sum128AdvSimdB64	19.261	13609.829	6.055405
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	19.262	13609.635	6.055319
Sum256Base_Basic	110.978	2362.116	1.050973
Sum256Base_Shift	12.955	20235.683	9.003438
Sum256Base_Shuffle	1799.368	145.687	0.064820
Sum256Base	110.955	2362.625	1.051200
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	110.971	2362.269	1.051042
Sum512Base_Basic	114.853	2282.426	1.015517
Sum512Base_Shift	11.305	23187.864	10.316949
Sum512Base_Shuffle	2293.734	114.287	0.050850
Sum512Base	114.885	2281.791	1.015235
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	114.846	2282.569	1.015581

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
# Sum512Base_Basic	-467853350
SumBase_Basic	145.014	1807.714
SumBase_Shift	38.562	6798.027	3.760566
SumBase_ShiftByMul	101.171	2591.105	1.433360
SumBase	38.567	6797.169	3.760091
SumTraits	38.529	6803.854	3.763789
Sum128Base_Basic	144.954	1808.465	1.000415
Sum128Base_Shift	38.566	6797.316	3.760173
Sum128Base_Shuffle	3461.778	75.725	0.041890
Sum128Base	38.569	6796.693	3.759828
Sum128AdvSimd_Shift	38.574	6795.905	3.759392
Sum128AdvSimd_Shuffle	97.386	2691.816	1.489072
Sum128AdvSimd_ShuffleX	55.433	4729.037	2.616032
Sum128AdvSimdB64	38.532	6803.200	3.763427
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	38.526	6804.319	3.764047
Sum256Base_Basic	140.524	1865.475	1.031953
Sum256Base_Shift	25.792	10163.700	5.622405
Sum256Base_Shuffle	3588.573	73.050	0.040410
Sum256Base	140.572	1864.833	1.031598
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	140.627	1864.114	1.031200
Sum512Base_Basic	153.386	1709.050	0.945421
Sum512Base_Shift	22.493	11654.494	6.447090
Sum512Base_Shuffle	4869.200	53.837	0.029782
Sum512Base	153.368	1709.251	0.945532
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	153.348	1709.468	0.945652

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
# Sum512Base_Basic	5910372263571975684
SumBase_Basic	197.288	1328.741
SumBase	197.669	1326.174	0.998068
SumTraits	77.073	3401.236	2.559744
Sum128Base_Basic	197.662	1326.224	0.998106
Sum128Base_Move	77.069	3401.417	2.559880
Sum128Base_Shuffle	6475.218	40.484	0.030468
Sum128Base	77.055	3402.059	2.560363
Sum128AdvSimd_AlignRight	77.060	3401.812	2.560177
Sum128AdvSimd_Shuffle	194.686	1346.495	1.013362
Sum128AdvSimd_ShuffleX	110.819	2365.506	1.780262
Sum128AdvSimdB64	77.047	3402.379	2.560604
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	77.070	3401.381	2.559853
Sum256Base_Basic	192.138	1364.356	1.026804
Sum256Base_Shuffle	6358.432	41.228	0.031028
Sum256Base	192.164	1364.171	1.026665
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	192.128	1364.423	1.026854
Sum512Base_Basic	228.325	1148.118	0.864065
Sum512Base_Shuffle	8702.115	30.124	0.022671
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	228.307	1148.210	0.864134

AWS Arm t4g.small

Neoverse-N1

.NET Core 3.1

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	2
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 6.8.0.1015
Environment.Version:	3.1.32
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/3.1.32/
RuntimeInformation.FrameworkDescription:	.NET Core 3.1.32
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Linux 6.8.0-1015-aws #16~22.04.1-Ubuntu SMP Mon Aug 19 20:21:57 UTC 2024
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector<T>.Assembly.CodeBase:	file:///home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/3.1.32/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET Core 3.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET Core 3.1
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Neoverse-N1
VectorEnvironment.CpuFlags:	fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp ssbs
VectorEnvironment.SupportedInstructionSets:	
Vector128s.Instance:	WVectorTraits128Base	// 
Vectors.Instance:	VectorTraits128Base	// 
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	342.238	765.971
SumBase_ShiftByMul	115.820	2263.365	2.954897
SumBase	115.887	2262.062	2.953197
SumTraits	115.938	2261.075	2.951908
Sum128Base_Basic	848.664	308.890	0.403266
Sum128Base	855.009	306.598	0.400274
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	854.762	306.687	0.400389
Sum256Base_Basic	806.485	325.045	0.424357
Sum256Base	824.369	317.994	0.415151
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	939.746	278.952	0.364181

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	444.486	589.769
SumBase_ShiftByMul	137.632	1904.680	3.229538
SumBase	137.561	1905.657	3.231195
SumTraits	137.641	1904.554	3.229325
Sum128Base_Basic	1295.906	202.286	0.342993
Sum128Base	1296.342	202.218	0.342877
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	1294.060	202.575	0.343482
Sum256Base_Basic	1349.783	194.212	0.329302
Sum256Base	1384.601	189.328	0.321021
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1613.572	162.462	0.275467

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	725.979	361.090
SumBase_ShiftByMul	343.816	762.455	2.111536
SumBase	731.316	358.455	0.992702
SumTraits	732.413	357.918	0.991216
Sum128Base_Basic	2153.422	121.734	0.337128
Sum128Base	2224.334	117.853	0.326381
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	2164.657	121.102	0.335378
Sum256Base_Basic	2200.667	119.120	0.329891
Sum256Base	2276.937	115.130	0.318840
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	2716.558	96.499	0.267242

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	705.866	371.379
SumBase	689.973	379.934	1.023034
SumTraits	689.403	380.248	1.023880
Sum128Base_Basic	1519.369	172.535	0.464578
Sum128Base_Move	1839.464	142.511	0.383735
Sum128Base	1509.416	173.672	0.467642
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	1512.254	173.347	0.466764
Sum256Base_Basic	3384.977	77.443	0.208529
Sum256Base	3535.103	74.155	0.199673
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	4447.914	58.936	0.158696

.NET 5.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	2
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 6.8.0.1015
Environment.Version:	5.0.17
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/5.0.17/
RuntimeInformation.FrameworkDescription:	.NET 5.0.17
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Linux 6.8.0-1015-aws #16~22.04.1-Ubuntu SMP Mon Aug 19 20:21:57 UTC 2024
RuntimeInformation.RuntimeIdentifier:	ubuntu.22.04-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector<T>.Assembly.CodeBase:	file:///home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/5.0.17/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 5.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 5.0
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Neoverse-N1
VectorEnvironment.CpuFlags:	fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp ssbs
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	334.542	783.590
SumBase_ShiftByMul	100.832	2599.811	3.317821
SumBase	97.969	2675.793	3.414787
SumTraits	19.944	13143.803	16.773827
Sum128Base_Basic	847.979	309.140	0.394517
Sum128Base	853.016	307.314	0.392188
Sum128AdvSimd_Shift	30.037	8727.499	11.137839
Sum128AdvSimd_Shuffle	82.619	3172.917	4.049206
Sum128AdvSimdB64	19.845	13209.795	16.858045
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	19.838	13214.100	16.863539
Sum256Base_Basic	755.256	347.093	0.442952
Sum256Base	773.134	339.067	0.432710
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	773.247	339.017	0.432646

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	459.008	571.110
SumBase_ShiftByMul	115.955	2260.738	3.958500
SumBase	115.833	2263.118	3.962667
SumTraits	37.387	7011.678	12.277288
Sum128Base_Basic	1266.279	207.019	0.362486
Sum128Base	1258.685	208.268	0.364673
Sum128AdvSimd_Shift	62.235	4212.182	7.375434
Sum128AdvSimd_Shuffle	166.755	1572.034	2.752596
Sum128AdvSimdB64	37.343	7019.852	12.291601
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	37.454	6999.152	12.255356
Sum256Base_Basic	989.666	264.881	0.463801
Sum256Base	988.192	265.277	0.464493
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	986.800	265.651	0.465148

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	740.756	353.887
SumBase_ShiftByMul	321.432	815.549	2.304547
SumBase	746.580	351.127	0.992199
SumTraits	75.122	3489.579	9.860713
Sum128Base_Basic	2162.396	121.228	0.342563
Sum128Base	2144.708	122.228	0.345388
Sum128AdvSimd_Shift	124.033	2113.498	5.972238
Sum128AdvSimd_Shuffle	336.098	779.963	2.203990
Sum128AdvSimdB64	74.834	3502.993	9.898616
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	75.369	3478.128	9.828354
Sum256Base_Basic	1493.926	175.473	0.495845
Sum256Base	1496.001	175.230	0.495157
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1493.811	175.487	0.495883

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	695.543	376.891
SumBase	695.758	376.775	0.999691
SumTraits	160.781	1630.443	4.326034
Sum128Base_Basic	1611.833	162.637	0.431523
Sum128Base_Move	1013.827	258.569	0.686057
Sum128Base	1624.894	161.330	0.428055
Sum128AdvSimd_AlignRight	178.176	1471.262	3.903680
Sum128AdvSimd_Shuffle	677.594	386.875	1.026490
Sum128AdvSimdB64	161.309	1625.103	4.311866
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	161.371	1624.481	4.310216
Sum256Base_Basic	1888.790	138.789	0.368248
Sum256Base	1927.001	136.037	0.360946
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1921.246	136.445	0.362027

.NET 6.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	2
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 6.8.0.1015
Environment.Version:	6.0.19
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/6.0.19/
RuntimeInformation.FrameworkDescription:	.NET 6.0.19
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Linux 6.8.0-1015-aws #16~22.04.1-Ubuntu SMP Mon Aug 19 20:21:57 UTC 2024
RuntimeInformation.RuntimeIdentifier:	ubuntu.22.04-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector<T>.Assembly.CodeBase:	file:///home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/6.0.19/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 6.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 6.0
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Neoverse-N1
VectorEnvironment.CpuFlags:	fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp ssbs
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	333.454	786.147
SumBase_ShiftByMul	98.656	2657.141	3.379955
SumBase	98.655	2657.184	3.380010
SumTraits	18.314	14313.830	18.207577
Sum128Base_Basic	850.175	308.341	0.392218
Sum128Base	851.044	308.026	0.391818
Sum128AdvSimd_Shift	29.950	8752.732	11.133711
Sum128AdvSimd_Shuffle	85.345	3071.592	3.907147
Sum128AdvSimdB64	18.318	14311.042	18.204030
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	18.276	14343.457	18.245264
Sum256Base_Basic	755.187	347.124	0.441552
Sum256Base	765.220	342.573	0.435762
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	765.289	342.542	0.435723

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	459.203	570.867
SumBase_ShiftByMul	114.667	2286.127	4.004659
SumBase	114.790	2283.680	4.000372
SumTraits	40.057	6544.259	11.463723
Sum128Base_Basic	1247.915	210.066	0.367976
Sum128Base	1255.656	208.771	0.365708
Sum128AdvSimd_Shift	61.506	4262.090	7.465997
Sum128AdvSimd_Shuffle	167.778	1562.449	2.736977
Sum128AdvSimdB64	37.412	7006.907	12.274154
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	37.450	6999.869	12.261824
Sum256Base_Basic	971.316	269.885	0.472764
Sum256Base	974.586	268.980	0.471178
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	972.885	269.450	0.472002

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	747.071	350.896
SumBase_ShiftByMul	306.198	856.125	2.439828
SumBase	743.891	352.396	1.004275
SumTraits	75.024	3494.116	9.957711
Sum128Base_Basic	2138.278	122.596	0.349380
Sum128Base	2145.112	122.205	0.348267
Sum128AdvSimd_Shift	124.072	2112.835	6.021266
Sum128AdvSimd_Shuffle	342.191	766.075	2.183200
Sum128AdvSimdB64	74.882	3500.749	9.976614
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	75.142	3488.660	9.942161
Sum256Base_Basic	1464.213	179.034	0.510220
Sum256Base	1473.945	177.852	0.506852
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1469.844	178.348	0.508266

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	685.849	382.218
SumBase	695.905	376.695	0.985550
SumTraits	149.646	1751.755	4.583128
Sum128Base_Basic	1592.738	164.587	0.430610
Sum128Base_Move	1070.460	244.889	0.640705
Sum128Base	1587.306	165.150	0.432084
Sum128AdvSimd_AlignRight	175.833	1490.869	3.900569
Sum128AdvSimd_Shuffle	683.269	383.661	1.003775
Sum128AdvSimdB64	149.568	1752.669	4.585519
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	149.140	1757.701	4.598684
Sum256Base_Basic	1837.431	142.669	0.373265
Sum256Base	1843.422	142.205	0.372052
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1841.473	142.356	0.372446

.NET 7.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	2
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 6.8.0.1015
Environment.Version:	7.0.8
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/7.0.8/
RuntimeInformation.FrameworkDescription:	.NET 7.0.8
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Linux 6.8.0-1015-aws #16~22.04.1-Ubuntu SMP Mon Aug 19 20:21:57 UTC 2024
RuntimeInformation.RuntimeIdentifier:	ubuntu.22.04-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	False
Vector<T>.Assembly.CodeBase:	file:///home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/7.0.8/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 7.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 7.0
VectorTraitsGlobal.InitCheckSum:	7960961	# 0x00797981
VectorEnvironment.CpuModelName:	Neoverse-N1
VectorEnvironment.CpuFlags:	fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp ssbs
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
SumBase_Basic	303.824	862.815
SumBase_Shift	30.434	8613.444	9.982961
SumBase_ShiftByMul	102.335	2561.626	2.968918
SumBase	30.487	8598.616	9.965776
SumTraits	16.957	15459.072	17.917029
Sum128Base_Basic	303.780	862.941	1.000147
Sum128Base_Shift	28.389	9233.887	10.702053
Sum128Base_Shuffle	1816.126	144.342	0.167293
Sum128Base	28.616	9160.895	10.617455
Sum128AdvSimd_Shift	28.890	9073.843	10.516563
Sum128AdvSimd_Shuffle	69.831	3753.951	4.350821
Sum128AdvSimdB64	16.997	15423.396	17.875680
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	16.963	15453.771	17.910884
Sum256Base_Basic	1042.465	251.465	0.291448
Sum256Base_Shift	1590.976	164.769	0.190967
Sum256Base_Shuffle	3529.634	74.269	0.086078
Sum256Base	1049.530	249.773	0.289486
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1051.108	249.398	0.289052

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
SumBase_Basic	431.239	607.885
SumBase_Shift	59.531	4403.506	7.243974
SumBase_ShiftByMul	115.453	2270.565	3.735185
SumBase	59.628	4396.289	7.232101
SumTraits	33.151	7907.458	13.008138
Sum128Base_Basic	431.384	607.681	0.999664
Sum128Base_Shift	53.676	4883.821	8.034114
Sum128Base_Shuffle	3027.874	86.577	0.142423
Sum128Base	53.723	4879.563	8.027110
Sum128AdvSimd_Shift	53.994	4855.094	7.986856
Sum128AdvSimd_Shuffle	136.918	1914.612	3.149627
Sum128AdvSimdB64	33.240	7886.405	12.973505
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	33.177	7901.401	12.998175
Sum256Base_Basic	1355.661	193.370	0.318103
Sum256Base_Shift	2247.943	116.615	0.191837
Sum256Base_Shuffle	5768.642	45.443	0.074756
Sum256Base	1345.606	194.815	0.320480
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1346.965	194.618	0.320156

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
Check-Sum128Base_Shift	Check `Sum128Base_Shift` mismatch. -833942548!=-468148262
Check-Sum128Base	Check `Sum128Base` mismatch. -833942548!=-468148262
Check-Sum128AdvSimd_Shift	Check `Sum128AdvSimd_Shift` mismatch. -833942548!=-468148262
# Sum256Base_Basic	-467951654
SumBase_Basic	721.926	363.118
SumBase_Shift	119.542	2192.905	6.039103
SumBase_ShiftByMul	285.703	917.540	2.526840
SumBase	119.322	2196.948	6.050237
SumTraits	66.618	3935.018	10.836758
Sum128Base_Basic	720.679	363.746	1.001731
Sum128Base_Shift	107.110	2447.425	6.740033
Sum128Base_Shuffle	6044.139	43.372	0.119442
Sum128Base	107.312	2442.812	6.727330
Sum128AdvSimd_Shift	107.228	2444.736	6.732630
Sum128AdvSimd_Shuffle	274.680	954.362	2.628246
Sum128AdvSimdB64	66.495	3942.317	10.856861
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	66.501	3941.958	10.855873
Sum256Base_Basic	1767.511	148.313	0.408442
Sum256Base_Shift	2960.174	88.557	0.243880
Sum256Base_Shuffle	10894.215	24.063	0.066267
Sum256Base	1768.481	148.231	0.408218
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	1767.815	148.287	0.408372

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
SumBase_Basic	679.517	385.780
SumBase	689.484	380.203	0.985544
SumTraits	133.260	1967.158	5.099175
Sum128Base_Basic	688.934	380.507	0.986332
Sum128Base_Move	187.259	1399.903	3.628763
Sum128Base_Shuffle	12082.297	21.697	0.056241
Sum128Base	187.246	1400.001	3.629018
Sum128AdvSimd_AlignRight	160.308	1635.254	4.238828
Sum128AdvSimd_Shuffle	548.629	477.816	1.238573
Sum128AdvSimdB64	132.967	1971.497	5.110422
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	133.438	1964.536	5.092379
Sum256Base_Basic	2142.779	122.338	0.317120
Sum256Base_Shuffle	20654.563	12.692	0.032899
Sum256Base	2165.523	121.053	0.313789
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	2170.585	120.771	0.313057

.NET 8.0

VectorTraits.Benchmarks
benchmarkMode:	0

IsRelease:	True
Environment.ProcessorCount:	2
Environment.Is64BitProcess:	True
Environment.OSVersion:	Unix 6.8.0.1015
Environment.Version:	8.0.7
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/8.0.7/
RuntimeInformation.FrameworkDescription:	.NET 8.0.7
RuntimeInformation.OSArchitecture:	Arm64
RuntimeInformation.OSDescription:	Ubuntu 22.04.2 LTS
RuntimeInformation.RuntimeIdentifier:	linux-arm64
IntPtr.Size:	8
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	True
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	False
Vector512.IsHardwareAccelerated:	False
Vector<T>.Assembly.CodeBase:	file:///home/ubuntu/.dotnet/shared/Microsoft.NETCore.App/8.0.7/System.Private.CoreLib.dll
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 8.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 8.0
VectorTraitsGlobal.InitCheckSum:	-2122844159	# 0x8177F801
VectorEnvironment.CpuModelName:	Neoverse-N1
VectorEnvironment.CpuFlags:	fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp ssbs
VectorEnvironment.SupportedInstructionSets:	AdvSimd, Aes, ArmBase, Crc32, Dp, Rdm, Sha1, Sha256
Vector128s.Instance:	WVectorTraits128AdvSimdB64	// AdvSimd
Vectors.Instance:	VectorTraits128AdvSimdB64	// AdvSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
# Sum512Base_Basic	114
SumBase_Basic	299.508	875.250
SumBase_Shift	23.350	11226.945	12.827134
SumBase_ShiftByMul	90.494	2896.796	3.309680
SumBase	23.384	11210.388	12.808218
SumTraits	16.603	15789.241	18.039700
Sum128Base_Basic	299.427	875.485	1.000269
Sum128Base_Shift	23.373	11215.464	12.814018
Sum128Base_Shuffle	1711.601	153.157	0.174987
Sum128Base	23.384	11210.609	12.808470
Sum128AdvSimd_Shift	23.350	11226.761	12.826925
Sum128AdvSimd_Shuffle	68.299	3838.192	4.385254
Sum128AdvSimd_ShuffleX	46.488	5638.952	6.442678
Sum128AdvSimdB64	16.605	15786.867	18.036987
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	16.602	15790.296	18.040905
Sum256Base_Basic	263.546	994.681	1.136454
Sum256Base_Shift	18.211	14395.012	16.446750
Sum256Base_Shuffle	1770.435	148.068	0.169172
Sum256Base	263.232	995.868	1.137811
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	263.218	995.918	1.137867
Sum512Base_Basic	265.342	987.949	1.128762
Sum512Base_Shift	15.336	17093.738	19.530128
Sum512Base_Shuffle	2917.012	89.867	0.102676
Sum512Base	265.066	988.975	1.129935
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	265.186	988.528	1.129424

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
# Sum512Base_Basic	8154
SumBase_Basic	438.426	597.921
SumBase_Shift	49.681	5276.586	8.824891
SumBase_ShiftByMul	88.049	2977.261	4.979356
SumBase	50.042	5238.511	8.761213
SumTraits	33.875	7738.568	12.942463
Sum128Base_Basic	438.427	597.919	0.999998
Sum128Base_Shift	49.652	5279.656	8.830026
Sum128Base_Shuffle	3019.466	86.818	0.145200
Sum128Base	50.835	5156.775	8.624513
Sum128AdvSimd_Shift	50.033	5239.406	8.762710
Sum128AdvSimd_Shuffle	138.841	1888.091	3.157762
Sum128AdvSimd_ShuffleX	98.930	2649.799	4.431689
Sum128AdvSimdB64	33.973	7716.261	12.905157
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	33.962	7718.848	12.909483
Sum256Base_Basic	310.348	844.677	1.412690
Sum256Base_Shift	37.069	7071.845	11.827394
Sum256Base_Shuffle	3037.952	86.290	0.144316
Sum256Base	311.012	842.874	1.409675
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	311.194	842.382	1.408851
Sum512Base_Basic	327.301	800.926	1.339519
Sum512Base_Shift	32.108	8164.370	13.654602
Sum512Base_Shuffle	5316.418	49.308	0.082466
Sum512Base	327.449	800.564	1.338913
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	327.472	800.509	1.338821

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
# Sum512Base_Basic	-467853350
SumBase_Basic	707.176	370.691
SumBase_Shift	100.420	2610.484	7.042202
SumBase_ShiftByMul	292.983	894.741	2.413709
SumBase	100.516	2607.987	7.035468
SumTraits	68.475	3828.336	10.327555
Sum128Base_Basic	707.489	370.527	0.999557
Sum128Base_Shift	100.541	2607.325	7.033680
Sum128Base_Shuffle	5982.815	43.816	0.118201
Sum128Base	100.579	2606.337	7.031016
Sum128AdvSimd_Shift	100.551	2607.062	7.032973
Sum128AdvSimd_Shuffle	281.028	932.805	2.516393
Sum128AdvSimd_ShuffleX	200.493	1307.495	3.527180
Sum128AdvSimdB64	68.394	3832.846	10.339724
Sum128PackedSimd_Shift	Run fail! Requires hardware support PackedSimd!
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	68.580	3822.432	10.311629
Sum256Base_Basic	493.359	531.346	1.433391
Sum256Base_Shift	74.532	3517.207	9.488236
Sum256Base_Shuffle	5905.791	44.388	0.119743
Sum256Base	491.916	532.904	1.437594
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	493.477	531.218	1.433047
Sum512Base_Basic	422.876	619.908	1.672301
Sum512Base_Shift	64.532	4062.257	10.958596
Sum512Base_Shuffle	10096.132	25.965	0.070044
Sum512Base	423.660	618.760	1.669206
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	423.586	618.868	1.669498

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
# Sum512Base_Basic	5910372263571975684
SumBase_Basic	678.915	386.122
SumBase	679.427	385.831	0.999247
SumTraits	128.830	2034.799	5.269835
Sum128Base_Basic	685.031	382.675	0.991072
Sum128Base_Move	202.653	1293.562	3.350138
Sum128Base_Shuffle	12073.120	21.713	0.056234
Sum128Base	202.810	1292.560	3.347543
Sum128AdvSimd_AlignRight	159.598	1642.530	4.253915
Sum128AdvSimd_Shuffle	559.266	468.729	1.213940
Sum128AdvSimd_ShuffleX	398.623	657.624	1.703152
Sum128AdvSimdB64	128.569	2038.941	5.280562
Sum128PackedSimd_Shuffle	Run fail! Requires hardware support PackedSimd!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	128.570	2038.923	5.280517
Sum256Base_Basic	529.756	494.839	1.281562
Sum256Base_Shuffle	11744.017	22.321	0.057809
Sum256Base	529.787	494.810	1.281486
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	529.629	494.958	1.281869
Sum512Base_Basic	600.748	436.363	1.130116
Sum512Base_Shuffle	20426.732	12.833	0.033237
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	600.910	436.245	1.129811

Wasm

Wasm on AMD Ryzen 7 7840H

``

.NET 8.0

VectorTraits.Benchmarks.Wasm


IsRelease:	True
Environment.ProcessorCount:	1
Environment.Is64BitProcess:	False
Environment.OSVersion:	Other 1.0.0.0
Environment.Version:	8.0.7
Stopwatch.Frequency:	1000000000
RuntimeEnvironment.GetRuntimeDirectory:	/
RuntimeInformation.FrameworkDescription:	.NET 8.0.7
RuntimeInformation.OSArchitecture:	Wasm
RuntimeInformation.OSDescription:	Browser
RuntimeInformation.RuntimeIdentifier:	browser-wasm
IntPtr.Size:	4
BitConverter.IsLittleEndian:	True
Vector.IsHardwareAccelerated:	False
Vector<byte>.Count:	16	# 128bit
Vector<float>.Count:	4	# 128bit
Vector128.IsHardwareAccelerated:	True
Vector256.IsHardwareAccelerated:	False
Vector512.IsHardwareAccelerated:	False
Vector<T>.Assembly.CodeBase:	
GetTargetFrameworkDisplayName(VectorTextUtil):	.NET 8.0
GetTargetFrameworkDisplayName(TraitsOutput):	.NET 8.0
VectorTraitsGlobal.InitCheckSum:	-2122844158	# 0x8177F802
VectorEnvironment.CpuModelName:	
VectorEnvironment.SupportedInstructionSets:	PackedSimd
Vector128s.Instance:	WVectorTraits128PackedSimd	// PackedSimd
Vectors.Instance:	VectorTraits128PackedSimd	// PackedSimd
Vectors.BaseInstance:	VectorTraits128Base


[YGroup2TransposeBenchmark_Byte(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	114
# Sum128Base_Basic	114
# Sum256Base_Basic	114
# Sum512Base_Basic	114
SumBase_Basic	11831.250	22.157
SumBase_Shift	6892.708	38.032	1.716488
SumBase_ShiftByMul	42616.667	6.151	0.277620
SumBase	11564.583	22.668	1.023059
SumTraits	2560.937	102.363	4.619890
Sum128Base_Basic	11986.458	21.870	0.987051
Sum128Base_Shift	2114.062	124.000	5.596452
Sum128Base_Shuffle	4698.437	55.794	2.518124
Sum128Base	2106.250	124.460	5.617211
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!This process(32bit) is not 64-bit!
Sum128PackedSimd_Shift	2162.500	121.223	5.471098
Sum128PackedSimd_Shuffle	2133.073	122.895	5.546575
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	2069.010	126.700	5.718313
Sum256Base_Basic	12821.875	20.445	0.922739
Sum256Base_Shift	14313.542	18.314	0.826577
Sum256Base_Shuffle	52608.333	4.983	0.224893
Sum256Base	13213.542	19.839	0.895388
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	13525.000	19.382	0.874769
Sum512Base_Basic	13283.333	19.735	0.890684
Sum512Base_Shift	19766.667	13.262	0.598546
Sum512Base_Shuffle	52695.833	4.975	0.224520
Sum512Base	14180.208	18.487	0.834350
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	14408.333	18.194	0.821139

[YGroup2TransposeBenchmark_Int16(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-24614
# Sum128Base_Basic	-24614
# Sum256Base_Basic	-24614
# Sum512Base_Basic	8154
SumBase_Basic	16918.750	15.494
SumBase_Shift	12526.042	20.928	1.350686
SumBase_ShiftByMul	33845.833	7.745	0.499877
SumBase	16354.167	16.029	1.034522
SumTraits	5200.000	50.412	3.253606
Sum128Base_Basic	16037.500	16.346	1.054949
Sum128Base_Shift	4230.208	61.970	3.999508
Sum128Base_Shuffle	9546.875	27.459	1.772177
Sum128Base	4171.875	62.836	4.055431
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!This process(32bit) is not 64-bit!
Sum128PackedSimd_Shift	4185.937	62.625	4.041807
Sum128PackedSimd_Shuffle	4336.458	60.451	3.901513
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	4264.844	61.466	3.967027
Sum256Base_Basic	18962.500	13.824	0.892221
Sum256Base_Shift	27277.083	9.610	0.620255
Sum256Base_Shuffle	104225.000	2.515	0.162329
Sum256Base	18685.417	14.029	0.905452
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	20877.083	12.557	0.810398
Sum512Base_Basic	20352.083	12.880	0.831303
Sum512Base_Shift	39487.500	6.639	0.428458
Sum512Base_Shuffle	105491.667	2.485	0.160380
Sum512Base	20095.833	13.045	0.841903
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	21535.417	12.173	0.785624

[YGroup2TransposeBenchmark_Int32(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	-468148262
# Sum128Base_Basic	-468148262
# Sum256Base_Basic	-467951654
# Sum512Base_Basic	-467853350
SumBase_Basic	24447.917	10.723
SumBase_Shift	21891.667	11.975	1.116768
SumBase_ShiftByMul	104375.000	2.512	0.234232
SumBase	24418.750	10.735	1.001194
SumTraits	10519.792	24.919	2.323992
Sum128Base_Basic	22589.583	11.605	1.082265
Sum128Base_Shift	8638.542	30.346	2.830098
Sum128Base_Shuffle	18770.833	13.965	1.302442
Sum128Base	8582.292	30.545	2.848647
Sum128AdvSimd_Shift	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!This process(32bit) is not 64-bit!
Sum128PackedSimd_Shift	8315.625	31.524	2.939998
Sum128PackedSimd_Shuffle	8753.125	29.949	2.793050
Sum128Sse_Shift	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	8341.667	31.426	2.930819
Sum256Base_Basic	28575.000	9.174	0.855570
Sum256Base_Shift	54908.333	4.774	0.445250
Sum256Base_Shuffle	205266.666	1.277	0.119103
Sum256Base	28812.500	9.098	0.848518
Sum256Avx2_Shift	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	32952.083	7.955	0.741923
Sum512Base_Basic	31166.667	8.411	0.784425
Sum512Base_Shift	76800.000	3.413	0.318332
Sum512Base_Shuffle	216000.000	1.214	0.113185
Sum512Base	30716.667	8.534	0.795917
Sum512Avx512_Shift	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	31716.667	8.265	0.770822

[YGroup2TransposeBenchmark_Int64(262144)]
---
NAME	US	MOPS	SCALE
# SumBase_Basic	5910372263571877380
# Sum128Base_Basic	5910372263571877380
# Sum256Base_Basic	5910372263571942916
# Sum512Base_Basic	5910372263571975684
SumBase_Basic	72566.667	3.612
SumBase	72675.000	3.607	0.998509
SumTraits	61325.000	4.275	1.183313
Sum128Base_Basic	132983.333	1.971	0.545682
Sum128Base_Move	128283.333	2.043	0.565675
Sum128Base_Shuffle	140533.333	1.865	0.516366
Sum128Base	118233.333	2.217	0.613758
Sum128AdvSimd_AlignRight	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_Shuffle	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimd_ShuffleX	Run fail! Requires hardware support AdvSimd!
Sum128AdvSimdB64	Run fail! Requires hardware support AdvSimd!This process(32bit) is not 64-bit!
Sum128PackedSimd_Shuffle	16702.083	15.695	4.344767
Sum128Sse_Shuffle	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleX	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_ShuffleXImm	Run fail! Requires hardware support Sse, Sse2!
Sum128Sse_Unpack	Run fail! Requires hardware support Sse, Sse2!
Sum128Traits	17056.250	15.369	4.254550
Sum256Base_Basic	151933.333	1.725	0.477622
Sum256Base_Shuffle	519516.667	0.505	0.139681
Sum256Base	146700.000	1.787	0.494660
Sum256Avx2_Shuffle	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleX	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_ShuffleXImm	Run fail! Requires hardware support Avx, Avx2!
Sum256Avx2_Unpack	Run fail! Requires hardware support Avx, Avx2!
Sum256Traits	65633.333	3.994	1.105637
Sum512Base_Basic	144550.000	1.814	0.502018
Sum512Base_Shuffle	525050.000	0.499	0.138209
Sum512Avx512_Shuffle	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleX	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_ShuffleXImm	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Avx512_Unpack	Run fail! Requires hardware support Avx512BW, Avx512DQ, Avx512F, Avx512Vbmi!
Sum512Traits	57908.333	4.527	1.253130