-
Notifications
You must be signed in to change notification settings - Fork 900
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Always use index for insert conflict resolution #7529
base: main
Are you sure you want to change the base?
Conversation
39d7f54
to
4decd26
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe it makes sense to merge it to the 2.17.x directly. Approving since this is an urgent fix.
I think the proper fix would need more changes, e.g. the key_columns
is not needed anymore. It also looks suboptimal that we're going through every index for every input tuple again.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving since this is an important fix.
Still, I think this function needs refactoring. Like @akuzm points out, key_columns is no longer needed and it looks like this is more complicated than it needs to be without delving too deep into it now.
Could we at least get rid of the key_columns arg in this fix since it is a dead argument?
Also, wondering if the other similar functions to build scan keys have this problem too since they use key_columns too? (I didn't actually check how it is used so cannot tell now whether it is an issue or not).
4decd26
to
75d728e
Compare
100% agree. Finding the index should be separated from building the scan key. Then we should open the index in the chunk insert state or similar and keep it open until the insert is finished. Only the scankey needs to be updated for the next tuple. |
Actually, key columns are necessary, we just need to skip over columns which are not in key columns. Previous implementation could potentially give wrong results. As far as the index caching, I don't disagree: |
Ok, I guess I need to take another look. How could it give wrong results? Did it decompress the wrong segments, or? |
if (!bms_is_member(column_attno, key_columns)) | ||
{ | ||
break; | ||
continue; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will this really work? Don't you need at least the first index column as a scan key?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a check later for number of scankeys a particular index can use, we only use the index if we can generate at least one scan key.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But we can't use an index if we only match, e.g., the last index key right?
With this check you might generate a suffix.
Let's say an index includes a,b,c
then you can use it if you match <a>
, <a,b>
, <a,b,c>
, or <a,c>
right?
But you cannot use the index if you match <b>
or <c>
or <b,c>
... or am I missing something?
Here it looks like you can generate a match for the latter examples too since you can skip any column, including the first one.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ideally, we should also look for longest prefix match. Here it looks like we are happy with first matching index even if there is a better one that matches more keys.
Maybe the simple approach works for our case because we know what indexes exist, but I am not sure the code is really doing the right thing here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Even though not optimal, there are cirumstances where picking an index without the leading edge is actually done in Postgres?
feike=# \d a
Table "public.a"
Column | Type | Collation | Nullable | Default
--------+---------+-----------+----------+---------
odd | boolean | | |
j | integer | | |
t | text | | |
Indexes:
"a_odd_j_idx" btree (odd, j)
feike=# explain select * from a where j=4;
QUERY PLAN
----------------------------------------------------------------------------
Index Scan using a_odd_j_idx on a (cost=0.42..18484.43 rows=1 width=1909)
Index Cond: (j = 4)
(2 rows)
It's more a statistics based algorithm that should pick the index though?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's more a statistics based algorithm that should pick the index though?
Also, this index matching code isn't using stats at all.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FWIW, If you are confident that the current code is OK despite the concerns/questions I raised, then feel free to go ahead.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But we can't use an index if we only match, e.g., the last index key right?
It's going to return the correct results. It'll be a kind of "index seq scan", where you scan an index but don't actually have any search keys, only plain scan keys ("index cond" vs "index filter" in explain). We actually had a bug like this in our catalog lookups where we used a wrong index.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But we can't use an index if we only match, e.g., the last index key right?
It's going to return the correct results. It'll be a kind of "index seq scan", where you scan an index but don't actually have any search keys, only plain scan keys ("index cond" vs "index filter" in explain). We actually had a bug like this in our catalog lookups where we used a wrong index.
Sure, but this isn't only about correctness, but about performance. Mostly wondering if falling back to a seqscan isn't faster in that case... 🤷♂️
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I highly doubt using a seq scan is ever faster, unless there is a very small number of batches. Just for the fact that we are looking for a specific tuple.
I think its an OK assumption that we want to do an index scan here whenever we can.
75d728e
to
d7278e6
Compare
What's the current status of this fix? Is it ready to be merged? |
d7278e6
to
05752e0
Compare
During insert conflict resolution, we find the index on the compressed chunk based on the unique constraint on the chunk. However, we can/should always use the index since we have all the index column values in the slot thats being inserted so create scan keys until we find a column that doesn't exist in the uncompressed chunk.
05752e0
to
0e3600f
Compare
During insert conflict resolution, we find the index on the compressed chunk based on the unique constraint on the chunk. However, we can/should always use the index since we have all the index column values in the slot thats being inserted so create scan keys until we find a column that doesn't exist in the uncompressed chunk.