ParallelIterable: Queue Size w/ O(1) #11895

shanielh · 2024-12-31T10:04:53Z

Instead of using ConcurrentLinkedQueue.size() which runs over the Linked Queue
in order to get the size of the queue, manage an AtomicInteger with the size
of the queue.

ConcurrentLinkedQueue.size() documentation states that this method is not
useful for concurrent applications.

Note: I have a JFR dump that shows this method uses 35% CPU utilization, this
is why I think this commit is important.

Instead of using ConcurrentLinkedQueue.size() which runs over the Linked Queue in order to get the size of the queue, manage an AtomicInteger with the size of the queue. ConcurrentLinkedQueue.size() documentation states that this method is not useful for concurrent applications.

singhpk234

LGTM as well ! Thank you for the fix !

have a JFR dump that shows this method uses 35% CPU utilization, this
is why I think this commit is important

interesting queue must really be huge, do you know what the manifest size / count we are looking at or more details of the table state ?

shanielh · 2025-01-01T11:56:24Z

LGTM as well ! Thank you for the fix !

have a JFR dump that shows this method uses 35% CPU utilization, this
is why I think this commit is important

interesting queue must really be huge, do you know what the manifest size / count we are looking at or more details of the table state ?

Actually I was using ParallelIterable in order to read multiple parquet files in order to compact them, and to scan manifest files.

Table had 180 manifest files with a lot of files:

select count(*), 
       sum(added_data_files_count), 
       sum(existing_data_files_count), 
       sum(deleted_data_files_count) 
  from schema."table$manifests";

count(*)	sum(added_data_files_count)	sum(existing_data_files_count)	sum(deleted_data_files_count)
180	1826	2703684	6844

RussellSpitzer · 2025-01-02T18:30:02Z

I wonder if this is as important if we switch ParallelIterable to use the implementation suggested here #11768 which limits the queue depth significantly and changes the yielding behavior.

I think it's a good perf change here but I do worry about disconnecting the poll/push operations from actually changing the size tracker for the queue. We probably aren't actually going to have any issues here though since we are already check the size as basically random times without regard to ongoing concurrent operations.

shanielh · 2025-01-03T10:12:52Z

I wonder if this is as important if we switch ParallelIterable to use the implementation suggested here #11768 which limits the queue depth significantly and changes the yielding behavior.

I think it's a good perf change here but I do worry about disconnecting the poll/push operations from actually changing the size tracker for the queue. We probably aren't actually going to have any issues here though since we are already check the size as basically random times without regard to ongoing concurrent operations.

Since we poll the size and it's a concurrent data structure, it doesn't really matter if the size is accurate or not, but eventually it is accurate.

As for #11768, we use a different S3FileIO which uses a different mechanism for InputStream, instead of keeping the connection open against S3, we download chunks of data and store it in the memory (on demand, of course). This way we can use ParallelIterable without having to think on the number of connections against S3. This will increase the cost as you might download a file using multiple GET calls instead of one, but allows you to run long lasting InputStream(s).

shanielh · 2025-01-13T07:47:05Z

@RussellSpitzer, I see that #11768 is closed now, we use the PR in a forked version for over a week now and we've observed no issues, any chance to merge this? BTW, the fix for #11768 added another usage of queue.size() which is highly not recommended for concurrent applications, so the state would be even less optimal now, I can rebase if needed.

RussellSpitzer · 2025-01-13T16:23:41Z

@RussellSpitzer, I see that #11768 is closed now, we use the PR in a forked version for over a week now and we've observed no issues, any chance to merge this? BTW, the fix for #11768 added another usage of queue.size() which is highly not recommended for concurrent applications, so the state would be even less optimal now, I can rebase if needed.

As I mentioned before, 11768 reduces the number of calls to queue.size() from O(number of elements) to O(number of files) by moving the check out of the hot path so size() should now be drastically reduced in the number of times it is actually called. Because of this I'm not sure this PR is actually required anymore but it still probably makes sense from a best practices prospective. That said I really would like us to work on a full rework (as I note here #11781 (comment) )

tbaeg · 2025-01-13T23:52:18Z

I think incremental improvement for the existing implementation (even if slated for rewrite) should be included.

Of note, we cherry-picked commits from #11768 and this commit as we were experiencing deadlocking and high CPU usage. We have been running stable in production for over a week now.

github-actions bot added the core label Dec 31, 2024

yinonupsolver approved these changes Dec 31, 2024

View reviewed changes

singhpk234 approved these changes Dec 31, 2024

View reviewed changes

Fokko requested a review from findepi January 13, 2025 12:41

shanielh closed this Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ParallelIterable: Queue Size w/ O(1) #11895

ParallelIterable: Queue Size w/ O(1) #11895

shanielh commented Dec 31, 2024

singhpk234 left a comment

shanielh commented Jan 1, 2025

RussellSpitzer commented Jan 2, 2025

shanielh commented Jan 3, 2025

shanielh commented Jan 13, 2025

RussellSpitzer commented Jan 13, 2025

tbaeg commented Jan 13, 2025

ParallelIterable: Queue Size w/ O(1) #11895

ParallelIterable: Queue Size w/ O(1) #11895

Conversation

shanielh commented Dec 31, 2024

singhpk234 left a comment

Choose a reason for hiding this comment

shanielh commented Jan 1, 2025

RussellSpitzer commented Jan 2, 2025

shanielh commented Jan 3, 2025

shanielh commented Jan 13, 2025

RussellSpitzer commented Jan 13, 2025

tbaeg commented Jan 13, 2025