================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2964           3045          89          1.7         592.8       1.0X
UTF-8 is set                                       4365           4382          18          1.1         873.1       0.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2326           2381          52          2.1         465.2       1.0X
UTF-8 is set                                       3834           3846          17          1.3         766.7       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        4599           4622          26          0.2        4599.4       1.0X
UTF-8 is set                                       6079           6120          62          0.2        6078.8       0.8X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                       12217          12443         256          0.0      244340.4       1.0X
UTF-8 is set                                      13720          13823         113          0.0      274409.6       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                  2291           2308          18          0.4        2291.5       1.0X
Select 1 column                                    1485           1491           8          0.7        1485.2       1.5X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                       689            691           3          1.5         688.7       1.0X
Short column with UTF-8                             973            977           3          1.0         972.8       0.7X
Wide column without encoding                       7239           7283          71          0.1        7238.6       0.1X
Wide column with UTF-8                             9634           9667          30          0.1        9634.3       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                            95            100           9         10.5          95.1       1.0X
from_json                                          1638           1646           7          0.6        1638.5       0.1X
json_tuple                                         1971           1996          39          0.5        1970.6       0.0X
get_json_object                                    1799           1809          13          0.6        1799.3       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           390            393           5         12.8          78.0       1.0X
schema inferring                                   2396           2400           3          2.1         479.2       0.2X
parsing                                            2899           2908          10          1.7         579.8       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           923            927           4          5.4         184.6       1.0X
Schema inferring                                   3256           3267          11          1.5         651.3       0.3X
Parsing without charset                            3347           3355           7          1.5         669.4       0.3X
Parsing with UTF-8                                 4877           4882           8          1.0         975.4       0.2X

OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                      182            194          11          5.5         181.7       1.0X
to_json(timestamp)                                  979            987           8          1.0         978.8       0.2X
write timestamps to files                           921            923           2          1.1         921.0       0.2X
Create a dataset of dates                           194            201          10          5.2         193.6       0.9X
to_json(date)                                       726            730           4          1.4         725.7       0.3X
write dates to files                                617            623           8          1.6         616.6       0.3X

OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Read dates and timestamps:                                             Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                                                   248            252           6          4.0         247.7       1.0X
read timestamps from files                                                      2436           2460          22          0.4        2436.5       0.1X
infer timestamps from files                                                     6217           6231          14          0.2        6216.7       0.0X
read date text from files                                                        209            215           6          4.8         209.0       1.2X
read date from files                                                             843            849           5          1.2         843.2       0.3X
timestamp strings                                                                263            264           1          3.8         263.2       0.9X
parse timestamps from Dataset[String]                                           2616           2617           1          0.4        2616.0       0.1X
infer timestamps from Dataset[String]                                           6254           6256           2          0.2        6254.0       0.0X
date strings                                                                     333            339           7          3.0         333.2       0.7X
parse dates from Dataset[String]                                                1076           1082           6          0.9        1075.6       0.2X
from_json(timestamp)                                                            3691           3696           6          0.3        3691.1       0.1X
from_json(date)                                                                 2112           2118           6          0.5        2112.4       0.1X
infer error timestamps from Dataset[String] with default format                 1753           1758           5          0.6        1753.3       0.1X
infer error timestamps from Dataset[String] with user-provided format           1741           1741           0          0.6        1740.7       0.1X
infer error timestamps from Dataset[String] with legacy format                  1709           1718           8          0.6        1708.9       0.1X

OpenJDK 64-Bit Server VM 17.0.7+7 on Linux 5.15.0-1037-azure
Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz
Filters pushdown:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters                                       18384          18399          14          0.0      183840.1       1.0X
pushdown disabled                                 17599          17610          10          0.0      175990.2       1.0X
w/ filters                                          957            985          43          0.1        9565.9      19.2X


