Iceberg/Comet integration POC #9841

huaxingao · 2024-03-01T02:34:25Z

This PR shows how I plan to integrate Comet with Iceberg. It doesn't compile yet because Comet hasn't been released, but it shows how we are going to change the Iceberg code to integrate Comet. Also, Comet doesn't support Spark 3.5 yet, so I am doing this on 3.4; we will add 3.5 support in Comet.

In VectorizedSparkParquetReaders.buildReader, if the Comet library is available, a CometIcebergColumnarBatchReader is created, which uses the Comet batch reader to read data. We can also add a property later to control whether to use Comet.
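As a rough illustration of the selection described above, here is a minimal sketch that picks a reader type from a configuration property and falls back to the built-in reader when a probe class cannot be loaded. Everything in it is an assumption made for the example: the `ReaderType` enum, the `example.parquet.reader-type` key, the default of preferring Comet when present, and the idea of probing by class name are not taken from this PR.

```java
import java.util.Locale;
import java.util.Map;

/** Illustrative sketch only; none of these names come from the PR. */
final class ReaderSelectionSketch {

  /** Hypothetical enum for the two reader implementations. */
  enum ReaderType { ICEBERG, COMET }

  /** Hypothetical property key used only in this example. */
  static final String READER_TYPE_KEY = "example.parquet.reader-type";

  /** Returns true if the class can be loaded (without running its static initializers). */
  static boolean isClassAvailable(String className) {
    try {
      Class.forName(className, false, ReaderSelectionSketch.class.getClassLoader());
      return true;
    } catch (ClassNotFoundException | LinkageError e) {
      return false;
    }
  }

  /** Chooses COMET only when it is requested (the assumed default) and the probe class loads. */
  static ReaderType chooseReader(Map<String, String> conf, String cometProbeClass) {
    String requested = conf.getOrDefault(READER_TYPE_KEY, "comet").toLowerCase(Locale.ROOT);
    if (requested.equals("comet") && isClassAvailable(cometProbeClass)) {
      return ReaderType.COMET;
    }
    return ReaderType.ICEBERG;
  }

  public static void main(String[] args) {
    // java.lang.String stands in for a Comet class that is on the classpath.
    System.out.println(chooseReader(Map.of(), "java.lang.String"));
    // A class name that does not exist stands in for a missing Comet jar.
    System.out.println(chooseReader(Map.of(), "no.such.CometClass"));
    // The property can force the built-in reader even when the probe class loads.
    System.out.println(chooseReader(Map.of(READER_TYPE_KEY, "iceberg"), "java.lang.String"));
  }
}
```

Loading the probe class with `initialize = false` keeps the availability check cheap and avoids triggering native library loading just to answer the question.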

The logic in CometIcebergVectorizedReaderBuilder is very similar to VectorizedReaderBuilder. It builds Comet column readers instead of Iceberg column readers.

The delete logic in CometIcebergColumnarBatchReader is exactly the same as the one in ColumnarBatchReader. I will extract the common code into a base class.
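One possible shape for that extraction is sketched below, under heavy simplification: a base class owns the delete filtering, and each reader flavor supplies only its own decoding. The class names, the `long[]` column, and the `LongPredicate` standing in for a delete filter are all hypothetical and are not the PR's actual types.

```java
import java.util.Arrays;
import java.util.function.LongPredicate;
import java.util.stream.LongStream;

/** Illustrative sketch only; these classes are hypothetical stand-ins. */
final class SharedDeleteLogicSketch {

  /** Base class: owns the delete handling so that subclasses only decode values. */
  abstract static class BaseBatchLoader {
    private final LongPredicate isDeletedPosition;

    BaseBatchLoader(LongPredicate isDeletedPosition) {
      this.isDeletedPosition = isDeletedPosition;
    }

    /** Subclass hook: decode numRows values starting at file position startPos. */
    abstract long[] decode(long startPos, int numRows);

    /** Shared logic: decode a batch, then drop rows whose position is deleted. */
    final long[] loadLiveRows(long startPos, int numRows) {
      long[] decoded = decode(startPos, numRows);
      long[] live = new long[numRows];
      int liveCount = 0;
      for (int i = 0; i < numRows; i++) {
        if (!isDeletedPosition.test(startPos + i)) {
          live[liveCount++] = decoded[i];
        }
      }
      return Arrays.copyOf(live, liveCount);
    }
  }

  /** Stand-in for the existing reader path. */
  static final class IcebergStyleLoader extends BaseBatchLoader {
    IcebergStyleLoader(LongPredicate isDeletedPosition) {
      super(isDeletedPosition);
    }

    @Override
    long[] decode(long startPos, int numRows) {
      long[] values = new long[numRows];
      for (int i = 0; i < numRows; i++) {
        values[i] = (startPos + i) * 10;
      }
      return values;
    }
  }

  /** Stand-in for a reader path that decodes differently but shares the delete logic. */
  static final class CometStyleLoader extends BaseBatchLoader {
    CometStyleLoader(LongPredicate isDeletedPosition) {
      super(isDeletedPosition);
    }

    @Override
    long[] decode(long startPos, int numRows) {
      return LongStream.range(startPos, startPos + numRows).map(pos -> pos * 10).toArray();
    }
  }

  public static void main(String[] args) {
    LongPredicate deleted = pos -> pos == 1 || pos == 3;
    System.out.println(Arrays.toString(new IcebergStyleLoader(deleted).loadLiveRows(0, 5)));
    System.out.println(Arrays.toString(new CometStyleLoader(deleted).loadLiveRows(0, 5)));
  }
}
```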

The main motivation of this PR is to improve performance using native execution. Comet's Parquet reader is a hybrid implementation: IO and decompression are done in the JVM while decoding is done natively. There is some performance gain from native decoding, but the gain is not much. However, by switching to the Comet Parquet reader, Comet will recognize that this is a Comet scan and will convert the Spark physical plan into a Comet plan for native execution. The major performance gain will be from this native execution.

huaxingao · 2024-03-01T02:41:26Z

cc @aokolnychyi @sunchao

...k/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometIcebergColumnReader.java

aokolnychyi

I think this is the right direction to take. I did an initial high-level pass. Looking forward to having a Comet release soon.

...k/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometIcebergColumnReader.java

aokolnychyi · 2024-04-16T03:57:07Z

spark/v3.4/build.gradle

    }

+    compileOnly "org.apache.comet:comet-spark-spark${sparkMajorVersion}_${scalaVersion}:0.1.0-SNAPSHOT"


I assume this library will only contain the reader, not the operators.

Right. This only contains the reader.

Does it need to be Spark Version Dependent? Just wondering

We are currently doing some experiments to see if we can provide a Spark Version independent jar.

+1 for exploring that.

...ain/java/org/apache/iceberg/spark/data/vectorized/comet/CometIcebergColumnarBatchReader.java

...rk/src/main/java/org/apache/iceberg/spark/data/vectorized/VectorizedSparkParquetReaders.java

...in/java/org/apache/iceberg/spark/data/vectorized/comet/CometIcebergPositionColumnReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkConfParser.java

...v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/BaseColumnBatchLoader.java

....4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometColumnReader.java

api/src/main/java/org/apache/iceberg/ReaderType.java

aokolnychyi · 2024-04-22T22:27:03Z

build.gradle

@@ -45,6 +45,7 @@ buildscript {
  }
 }

+String sparkMajorVersion = '3.4'


I hope we can soon have a snapshot for Comet jar independent of Spark to clean up deps here.
We can't have parquet module depend on a jar with any Spark deps.

spark/v3.4/build.gradle

aokolnychyi · 2024-04-22T22:27:57Z

spark/v3.4/build.gradle

    }

+    compileOnly "org.apache.comet:comet-spark-spark${sparkMajorVersion}_${scalaVersion}:0.1.0-SNAPSHOT"


+1 for exploring that.

gradle.properties

aokolnychyi · 2024-04-23T00:54:35Z

...v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/BaseColumnBatchLoader.java

+import org.apache.spark.sql.vectorized.ColumnVector;
+import org.apache.spark.sql.vectorized.ColumnarBatch;
+
+@SuppressWarnings("checkstyle:VisibilityModifier")


These changes would require a bit more time to review. I'll do that tomorrow. I think we would want to restructure the original implementation a bit. Not a concern for now.

We would want to structure this a bit differently. Let me think more.

...rk/src/main/java/org/apache/iceberg/spark/data/vectorized/VectorizedSparkParquetReaders.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkColumnarReaderFactory.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/BaseBatchReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkBatch.java

huaxingao · 2024-04-29T16:59:05Z

@aokolnychyi I have addressed the comments. Could you please take one more look when you have a moment? Thanks a lot!

aokolnychyi · 2024-04-30T17:27:41Z

Will check today.

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/ParquetReaderType.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkSQLProperties.java

aokolnychyi · 2024-04-30T19:04:36Z

...v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/BaseColumnBatchLoader.java

+import org.apache.spark.sql.vectorized.ColumnVector;
+import org.apache.spark.sql.vectorized.ColumnarBatch;
+
+@SuppressWarnings("checkstyle:VisibilityModifier")


We would want to structure this a bit differently. Let me think more.

...k/v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/ColumnarBatchReader.java

....4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometColumnReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/BaseBatchReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkBatch.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkColumnarReaderFactory.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/BatchReadConf.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkReadConf.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkSQLProperties.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/CometColumnReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/BaseBatchReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/BatchDataReader.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/source/SparkBatch.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkReadConf.java

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/BatchReadConf.java

cornelcreanga · 2024-06-20T14:09:01Z

@huaxingao - Hi, is the Comet Parquet reader able to support page skipping/use page indexes? For example, see #193 for the initial issue on the Iceberg Parquet reader.

huaxingao · 2024-06-20T15:41:53Z

@cornelcreanga The Comet Parquet reader doesn't support page skipping yet.

PaulLiang1 · 2024-09-04T04:13:51Z

hey @huaxingao
We are really interested in this feature. Is there anything we can do to help get this integrated?

huaxingao · 2024-09-04T04:25:24Z

@PaulLiang1 Thank you for your interest! We are currently working on a binary release of DataFusion Comet. Once the binary release is available, I will proceed with this PR.

PaulLiang1 · 2024-09-04T04:53:39Z

@huaxingao
I think we have an internal build of DataFusion Comet and publish a JAR internally.
Is there anything we can help with on that front?

Thanks

huaxingao · 2024-09-04T05:24:49Z

@PaulLiang1 Thanks! I'll check with my colleague tomorrow to find out where we are in the binary release process.

huaxingao · 2024-09-05T04:50:30Z

@PaulLiang1 We are pretty close to this and will have a binary release for Comet soon.

PaulLiang1 · 2024-09-05T05:00:49Z

> @PaulLiang1 Thanks! I'll check with my colleague tomorrow to find out where we are in the binary release process.

Got it, thanks for letting me know. Please feel free to let us know if there is anything we can help with. Thanks!

aokolnychyi · 2025-01-07T21:51:37Z

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/ColumnarBatchUtil.java

+   * @param rowStartPosInBatch The starting position of the row in the batch.
+   * @param hasIsDeletedColumn Indicates whether the columnar batch includes _deleted column.
+   */
+  public static void applyDeletesToColumnarBatch(


I think we inherit the existing loading logic that is too complicated. First, we mix isDeleted and rowIdMapping cases. Second, we create ColumnarBatch prior to having all column vectors (e.g. isDeleted array).

What if we add methods like this to the util class instead? Only one of them will be needed in a query, right? We either mark records as removed or hide them.

    public static Pair<int[], Integer> buildRowIdMapping(
        ColumnVector[] vectors,
        DeleteFilter<InternalRow> deletes,
        long rowStartPosInBatch,
        int batchSize) {
      if (deletes == null) {
        return null;
      }

      PositionDeleteIndex deletedPositions = deletes.deletedRowPositions();
      Predicate<InternalRow> eqDeleteFilter = deletes.eqDeletedRowFilter();
      ColumnarBatchRow row = new ColumnarBatchRow(vectors);
      int[] rowIdMapping = new int[batchSize];
      int liveRowId = 0;

      for (int rowId = 0; rowId < batchSize; rowId++) {
        long pos = rowStartPosInBatch + rowId;
        row.rowId = rowId;
        if (isDeleted(pos, row, deletedPositions, eqDeleteFilter)) {
          deletes.incrementDeleteCount();
        } else {
          rowIdMapping[liveRowId] = rowId;
          liveRowId++;
        }
      }

      return liveRowId == batchSize ? null : Pair.of(rowIdMapping, liveRowId);
    }

    public static boolean[] buildIsDeleted(
        ColumnVector[] vectors,
        DeleteFilter<InternalRow> deletes,
        long rowStartPosInBatch,
        int batchSize) {
      boolean[] isDeleted = new boolean[batchSize];

      if (deletes == null) {
        return isDeleted;
      }

      PositionDeleteIndex deletedPositions = deletes.deletedRowPositions();
      Predicate<InternalRow> eqDeleteFilter = deletes.eqDeletedRowFilter();
      ColumnarBatchRow row = new ColumnarBatchRow(vectors);

      for (int rowId = 0; rowId < batchSize; rowId++) {
        long pos = rowStartPosInBatch + rowId;
        row.rowId = rowId;
        isDeleted[rowId] = isDeleted(pos, row, deletedPositions, eqDeleteFilter);
      }

      return isDeleted;
    }

    // use separate if statements to reduce the chance of speculative execution for equality tests
    private static boolean isDeleted(
        long pos,
        InternalRow row,
        PositionDeleteIndex deletedPositions,
        Predicate<InternalRow> eqDeleteFilter) {
      if (deletedPositions != null && deletedPositions.isDeleted(pos)) {
        return true;
      }

      if (!eqDeleteFilter.test(row)) {
        return true;
      }

      return false;
    }

Then our loading logic can look like:

Initialize the vector array.

Load all data vectors (leaving metadata vectors as null).

If you need to discard deleted records, call buildRowIdMapping and either wrap loaded data vectors into other vectors or mutate them in place via setRowIdMapping.

If you need to mark deleted records, call buildIsDeleted to compute the flags.

Load all metadata vectors (we will have the is_deleted array fully populated now).
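The flow in the steps above can be sketched in a self-contained form. This is a simplified illustration and not the Iceberg implementation: a `LongPredicate` stands in for the position and equality delete checks, a `long[]` stands in for a loaded data vector, and `buildRowIdMapping` returns a trimmed `int[]` instead of a `Pair<int[], Integer>`. The class name `DeleteLoadingSketch` and the helper `loadHidingDeletes` are hypothetical.

```java
import java.util.Arrays;
import java.util.function.LongPredicate;

/** Illustrative sketch only; simplified stand-ins for the suggested util methods. */
final class DeleteLoadingSketch {

  /** Returns the ids of live rows, or null when no row in the batch is deleted. */
  static int[] buildRowIdMapping(LongPredicate isDeleted, long rowStartPosInBatch, int batchSize) {
    if (isDeleted == null) {
      return null;
    }
    int[] rowIdMapping = new int[batchSize];
    int liveRowId = 0;
    for (int rowId = 0; rowId < batchSize; rowId++) {
      if (!isDeleted.test(rowStartPosInBatch + rowId)) {
        rowIdMapping[liveRowId++] = rowId;
      }
    }
    return liveRowId == batchSize ? null : Arrays.copyOf(rowIdMapping, liveRowId);
  }

  /** Returns one flag per row; all false when there is no delete information. */
  static boolean[] buildIsDeleted(LongPredicate isDeleted, long rowStartPosInBatch, int batchSize) {
    boolean[] flags = new boolean[batchSize];
    if (isDeleted == null) {
      return flags;
    }
    for (int rowId = 0; rowId < batchSize; rowId++) {
      flags[rowId] = isDeleted.test(rowStartPosInBatch + rowId);
    }
    return flags;
  }

  /** "Hide" path: data is loaded first, then deleted rows are dropped via the mapping. */
  static long[] loadHidingDeletes(long[] dataColumn, LongPredicate isDeleted, long rowStartPosInBatch) {
    int[] mapping = buildRowIdMapping(isDeleted, rowStartPosInBatch, dataColumn.length);
    if (mapping == null) {
      return dataColumn; // nothing deleted, so the loaded vector is used as-is
    }
    long[] live = new long[mapping.length];
    for (int i = 0; i < mapping.length; i++) {
      live[i] = dataColumn[mapping[i]];
    }
    return live;
  }

  public static void main(String[] args) {
    LongPredicate deleted = pos -> pos == 11 || pos == 13;
    long[] data = {100, 101, 102, 103, 104}; // rows at file positions 10..14

    // Hide path: only live rows remain.
    System.out.println(Arrays.toString(loadHidingDeletes(data, deleted, 10)));
    // Mark path: data stays intact and the flags feed the metadata column.
    System.out.println(Arrays.toString(buildIsDeleted(deleted, 10, data.length)));
  }
}
```

Because a query needs only one of the two paths, keeping them as separate methods means the flags array is never allocated when rows are hidden, and no mapping is built when rows are merely marked.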

github-actions bot added spark build labels Mar 1, 2024

huaxingao mentioned this pull request Mar 1, 2024

Dynamically support Spark native engine in Iceberg #9826

Closed

huaxingao mentioned this pull request Mar 5, 2024

Dynamically support Spark native engine in Iceberg #9721

Closed

sunchao mentioned this pull request Mar 7, 2024

Explore integration with Delta Lake apache/datafusion-comet#174

Open

RussellSpitzer reviewed Apr 2, 2024

...k/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometIcebergColumnReader.java

aokolnychyi reviewed Apr 16, 2024

github-actions bot added the API label Apr 18, 2024

RussellSpitzer reviewed Apr 18, 2024

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkConfParser.java

RussellSpitzer reviewed Apr 18, 2024

...v3.4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/BaseColumnBatchLoader.java

RussellSpitzer reviewed Apr 18, 2024

....4/spark/src/main/java/org/apache/iceberg/spark/data/vectorized/comet/CometColumnReader.java

aokolnychyi reviewed Apr 23, 2024

github-actions bot removed the API label Apr 26, 2024

aokolnychyi reviewed Apr 30, 2024

aokolnychyi reviewed May 3, 2024

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/SparkReadConf.java

aokolnychyi reviewed May 9, 2024

spark/v3.4/spark/src/main/java/org/apache/iceberg/spark/BatchReadConf.java

huaxingao closed this Jun 20, 2024

huaxingao reopened this Jun 20, 2024

aokolnychyi reviewed Jan 7, 2025

huaxingao mentioned this pull request Jan 9, 2025

Spark 3.5: Refactor delete logic in batch reading #11933

Merged

RussellSpitzer added this to the Iceberg 1.8.0 milestone Jan 15, 2025

Huaxin Gao and others added 21 commits January 24, 2025 19:34

Iceberg/Comet integration

c3ae540

address comments

b14c009

address comments

e05637d

remove unnecessary code

73f3ece

address comments

d8fd83e

address comments

097c6f0

remove unnecessary public

d01552f

address comments

3eefb51

address comments

31122cb

minor changes

d8116d3

update to use comet 0.3.0

d51d9d2

use the new Comet Utils.getColumnReader method

fe0eff9

change PARQUET_READER_TYPE_DEFAULT to Comet to test CometReader

9cea397

Ignore SmokeTest#testGettingStarted for now

91c4422

rebase

df862fb

add setRowGroupInfo(PageReadStore pageStore, Map<ColumnPath, ColumnChunkMetaData> metaData)

5e0b339

formatting

e5859c4

ignore a few tests for now

85907df

remove comet dependency in build.gradle

091dd84

Trigger Build

dfe5616

add ColumnarBatchUtil

d431803

huaxingao force-pushed the comet3 branch 4 times, most recently from c3ad611 to 5d609e1, January 26, 2025 03:07

rebase

f69013f

huaxingao force-pushed the comet3 branch from 5d609e1 to f69013f, January 26, 2025 03:24


