fix: Allow bootstrap to complete without updating col stats #12952
Conversation
...lient-common/src/main/java/org/apache/hudi/table/action/commit/BaseCommitActionExecutor.java
      updateColumnsToIndexForColumnStats(metaClient, columnsToIndex);
      return null;
    });
  } catch (UnsupportedOperationException uoe) {
Could we catch a specific exception for bootstrap here?
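One way to read this suggestion, as a minimal sketch: have the unsupported path throw a dedicated subtype and catch only that, so an unrelated UnsupportedOperationException still propagates. The class `ColStatsNotSupportedException` and the helper `updateOrSkip` are hypothetical names invented for this illustration; they are not part of Hudi, and the PR itself catches UnsupportedOperationException.

```java
public class SpecificExceptionSketch {

    /** Hypothetical exception type for illustration only; not a Hudi class. */
    static class ColStatsNotSupportedException extends UnsupportedOperationException {
        ColStatsNotSupportedException(String message) {
            super(message);
        }
    }

    /**
     * Runs the given col stats update. Returns "updated" on success and
     * "skipped" when the operation signals that col stats are unsupported.
     * Any other exception, including a plain UnsupportedOperationException,
     * is not caught here and propagates to the caller.
     */
    static String updateOrSkip(Runnable updateColStats) {
        try {
            updateColStats.run();
            return "updated";
        } catch (ColStatsNotSupportedException e) {
            // Only the dedicated subtype is swallowed.
            return "skipped";
        }
    }

    public static void main(String[] args) {
        System.out.println(updateOrSkip(() -> { }));
        System.out.println(updateOrSkip(() -> {
            throw new ColStatsNotSupportedException("bootstrap doesn't support col stats");
        }));
    }
}
```

The narrower catch keeps an unexpected UnsupportedOperationException from an unrelated code path visible instead of being logged and ignored.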
Have you checked that, in this case, there is no failed deltacommit in the MDT, the files partition is intact, and the list of ready-to-use indexes in the table config does not contain col_stats?
Would be good to add this validation in the test.
      return null;
    });
  } catch (UnsupportedOperationException uoe) {
    LOG.warn("Failed to update col stats, bootstrap doesn't support col stats", uoe);
Similarly, let’s add or modify a test case to increase branch coverage.
315f643 to cd644f1
#12977 seems to be able to fix this issue, but I have not tested it yet. I'll test it later.
This issue exists in the released Hudi 1.0.2 as well, so this fix will be needed.
…able/action/commit/BaseCommitActionExecutor.java Co-authored-by: Y Ethan Guo <[email protected]>
d29bc45 to 74e0ae3
Describe the issue this Pull Request addresses
Since #12529, BaseCommitActionExecutor updates col stats by default. However, the bootstrap operation doesn't support col stats, so bootstrapping fails:
hudi/hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/bootstrap/SparkBootstrapCommitActionExecutor.java
Line 225 in 63362ad
This PR allows bootstrap to complete without updating col stats.
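The approach can be illustrated with a small, self-contained sketch of the catch-and-continue pattern used in the diff: attempt the col stats update, and if the operation reports it is unsupported, record a warning and let the commit proceed. The `ColStatsUpdater` interface, the `tryUpdateColStats` helper, and the in-memory `WARNINGS` list are stand-ins invented for this example; the actual change lives in BaseCommitActionExecutor and logs through LOG.warn.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class ColStatsFallbackSketch {

    /** Hypothetical stand-in for the step that updates col stats. */
    interface ColStatsUpdater {
        void update(List<String> columnsToIndex);
    }

    // Collected warnings, used here in place of a logging framework.
    static final List<String> WARNINGS = new ArrayList<>();

    /**
     * Attempts the col stats update. Returns true if it ran, or false if the
     * updater threw UnsupportedOperationException, in which case a warning is
     * recorded and the caller can continue with the commit.
     */
    static boolean tryUpdateColStats(ColStatsUpdater updater, List<String> columnsToIndex) {
        try {
            updater.update(columnsToIndex);
            return true;
        } catch (UnsupportedOperationException uoe) {
            WARNINGS.add("Failed to update col stats, bootstrap doesn't support col stats: "
                + uoe.getMessage());
            return false;
        }
    }

    public static void main(String[] args) {
        ColStatsUpdater regularCommit = cols -> { };
        ColStatsUpdater bootstrapCommit = cols -> {
            throw new UnsupportedOperationException("col stats not supported");
        };
        List<String> columns = Collections.singletonList("id");
        System.out.println(tryUpdateColStats(regularCommit, columns));
        System.out.println(tryUpdateColStats(bootstrapCommit, columns));
        System.out.println(WARNINGS.size());
    }
}
```

In this sketch the failure is downgraded to a warning instead of aborting, which mirrors the intent stated above: bootstrap finishes, and col stats are simply left un-updated for that commit.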
Summary and Changelog
Impact
None
Risk Level
None
Documentation Update
None
Contributor's checklist