================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 25.0.2+10-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6666           6681          21         15.0          66.7       1.0X
DataFrame                                          1191           1248          81         84.0          11.9       5.6X
Dataset                                            1894           1934          57         52.8          18.9       3.5X

OpenJDK 64-Bit Server VM 25.0.2+10-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                7499           7585         121         13.3          75.0       1.0X
DataFrame                                          2457           2470          18         40.7          24.6       3.1X
Dataset                                            7231           7239          11         13.8          72.3       1.0X

OpenJDK 64-Bit Server VM 25.0.2+10-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4371           4412          58         22.9          43.7       1.0X
DataFrame                                           819            872          70        122.1           8.2       5.3X
Dataset                                            1633           1639           8         61.2          16.3       2.7X

OpenJDK 64-Bit Server VM 25.0.2+10-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                1990           2035          64         50.3          19.9       1.0X
DataFrame                                           117            130          12        855.6           1.2      17.0X
Dataset                                            2133           2151          26         46.9          21.3       0.9X

OpenJDK 64-Bit Server VM 25.0.2+10-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1228           1235          10         81.4          12.3       1.0X
DataFrame sum                                        73             88          12       1362.2           0.7      16.7X
Dataset sum using Aggregator                       1569           1574           8         63.7          15.7       0.8X
Dataset complex Aggregator                         4644           4679          49         21.5          46.4       0.3X


