Spark: Adding simple custom partition sort order option to RewriteManifests Spark Action#9731
Open
zachdisc wants to merge 1019 commits intoapache:mainfrom zachdisc:rewrite-manifest-reorder
+212,956-117,511
Commits
This pull request is big! We're only showing the most recent 250 commits
Commits on Dec 12, 2024
Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection (apache#11254)
Core: lazily load default Hadoop Configuration to avoid NPE with HadoopFileIO because FileIOParser doesn't serialize Hadoop configuration (apache#10926)
AWS: Switch to base2 entropy in ObjectStoreLocationProvider for optimized S3 performance (apache#11112)
Flink: Add IcebergSinkBuilder interface allowed unification of most of operations on FlinkSink and IcebergSink Builders (apache#11305)
API, Arrow, Core, Data, Spark: Replace usage of deprecated ContentFile#path API with location API (apache#11563)
Core: Inherited classes from SnapshotProducer has TableOperations redundantly as member (apache#11578)
Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten (apache#11131)
Revert "Core: Update TableMetadataParser to ensure all streams closed (apache#11220)" (apache#11621)
Flink: Backport apache#11244 to Flink 1.19 (Add table.exec.iceberg.use-v2-sink option) (apache#11665)
Flink: Backport Avro planned reader (and corresponding tests) on Flink v1.18 and v1.19 (apache#11668)
Core: Propagate custom metrics reporter when table is created/replaced through Transaction (apache#11671)
Spark 3.5: Align RewritePositionDeleteFilesSparkAction filter with Spark case sensitivity (apache#11700)
Spark 3.3,3.4: Align RewritePositionDeleteFilesSparkAction filter with Spark case sensitivity (apache#11710)
- committed