-
-
Notifications
You must be signed in to change notification settings - Fork 27
Implement __len__ and leverage parquet statistics
#102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
38 commits
Select commit
Hold shift + click to select a range
13b828f
start experimenting with parquet statistics
rjzamora f5f4e19
Merge remote-tracking branch 'upstream/main' into pq-statistics-len
rjzamora 990ba4c
adopt parts of #40
rjzamora 1c62f4c
experimenting with dedicated Metadata class structure
rjzamora afd59d7
add missing file
rjzamora 8302305
go back to and remove sub-class for now
rjzamora a3c5f2c
add parquet test
rjzamora cbced80
use assume vs inherit
rjzamora 5fe5862
use assume vs inherit
rjzamora b0946f8
split test
rjzamora bfd8710
fix doc-string
rjzamora 2d343c7
fix typos
rjzamora aa27c96
Merge remote-tracking branch 'upstream/main' into pq-statistics-len
rjzamora 4ce604d
use _lengths ILO statistics
rjzamora 4ad6fb2
start pushing on _column_statistics
rjzamora d5e93a4
add _collect_statistics machinery to ReadParquet
rjzamora 7b137c5
move utilities out of class body
rjzamora f6823d1
introduce _partitioning
rjzamora e600ea1
add simple test coverage for _partitions
rjzamora 1dbfb18
improve test and fix bug
rjzamora 5020657
Merge remote-tracking branch 'upstream/main' into simple-statistics
rjzamora 423cfcb
remove leftover
rjzamora 58ebf5a
fix parquet len test
rjzamora 5790fb1
fix calculate_divisions default
rjzamora 4be0221
Merge remote-tracking branch 'upstream/main' into simple-statistics
rjzamora cc01ebb
strip out _partitioning changes
rjzamora 0345d19
missing calculate_divisions default
rjzamora 7052a26
move _lengths to a method with force option
rjzamora e26d6cd
cache pd lengths
rjzamora 5c376b9
missing annotations import
rjzamora cd6a5d6
Merge remote-tracking branch 'upstream/main' into simple-statistics
rjzamora 62fbcfa
Merge remote-tracking branch 'upstream/main' into HEAD
rjzamora 253cfeb
use Lengths
rjzamora 1318219
Merge remote-tracking branch 'upstream/main' into simple-statistics
rjzamora 32e4f94
partial fixup
rjzamora bd5395a
improve testing
rjzamora be4af18
cleanup
rjzamora 47ff1d3
remove _len for now
rjzamora File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.