[WIP] C++ implementation of TPCH query 3 #650
Draft
This PR implements TPC-H query 3 in rapidsmpf C++ (perhaps sub-optimally), based on #629. It's not fully self-contained: it requires a local merge of @wence-'s https://github.com/wence-/rapidsmpf/tree/wence/fea/streaming-examples/cpp branch.
It's not currently working. Unfortunately, I have less bandwidth than I anticipated, so I'm putting up this PR in case someone wants to implement q3 and is interested in reviewing this as a starting point.
At SF1K, I expect to get:
DuckDB Output:
However, on this PR I get the following output with one MPI rank (I get errors with more than one rank):