- DataSource API — Loading and Saving Datasets
- Spark SQL’s Performance Tuning Tips and Tricks (aka Case Studies)
- Partitioning — Specification of Physical Operator’s Output Partitions
- SessionCatalog — Metastore of Session-Specific Relational Entities
- Tungsten Execution Backend (aka Project Tungsten)
- ExternalAppendOnlyUnsafeRowArray — Append-Only Array for UnsafeRows (with Disk Spill Threshold)
- AggregationIterator — Generic Iterator of UnsafeRows for Aggregate Physical Operators