Skip to content

Conversation

@beckernick
Copy link
Member

@beckernick beckernick commented Nov 11, 2025

This PR implements TPC-H query 3 in rapidsmpf C++ (perhaps sub-optimally) based on #629 . It's not fully self-contained -- it requires a local merge of @wence- 's https://github.com/wence-/rapidsmpf/tree/wence/fea/streaming-examples/cpp branch.

It's not currently working. I unfortunately have less bandwidth than I anticipated, so putting up this PR in case someone wants to implement q3 and is interested in reviewing this as a starting point.

At SF1K, I expect to get:

DuckDB Output:

┌────────────┬────────────────────┬─────────────────────┬────────────────┐
│ l_orderkey │      revenue       │     o_orderdate     │ o_shippriority │
│   int64    │       double       │      timestamp      │     int32      │
├────────────┼────────────────────┼─────────────────────┼────────────────┤
│   18869634 │        512508.6578 │ 1995-01-10 00:00:00 │              0 │
│ 3947421511 │ 507889.04639999993 │ 1995-03-14 00:00:00 │              0 │
│ 1319897249 │        503401.9508 │ 1995-03-05 00:00:00 │              0 │
│ 2036965252 │  495852.8691999999 │ 1995-03-03 00:00:00 │              0 │
│ 1980912577 │ 493605.46589999995 │ 1995-02-14 00:00:00 │              0 │
│ 4803840546 │ 492521.94299999997 │ 1995-02-18 00:00:00 │              0 │
│ 3407391428 │ 491379.51860000007 │ 1995-03-09 00:00:00 │              0 │
│ 5289035781 │        488004.5812 │ 1995-03-11 00:00:00 │              0 │
│ 5530172133 │        487671.6623 │ 1995-02-07 00:00:00 │              0 │
│ 3885365216 │  487236.4125999999 │ 1995-03-04 00:00:00 │              0 │
├────────────┴────────────────────┴─────────────────────┴────────────────┤
│ 10 rows                                                      4 columns │
└────────────────────────────────────────────────────────────────────────┘

However, on this PR I get the following output with one MPI rank (I get errors using >1 rank):

   l_orderkey        revenue                      o_orderdate  o_shippriority
0  4509702337  5.867792e-315 1970-01-26 16:40:00.002000+00:00               0
1  2936266215  5.867792e-315 1970-01-26 16:40:00.002000+00:00            2374
2  1994653760  5.867792e-315 1970-01-26 16:40:00.002000+00:00            4822
3  4700410533  5.867792e-315 1970-01-26 16:40:00.004000+00:00            7268
4   405172768  5.867792e-315 1970-01-26 16:40:00.004000+00:00            9688
5  5112124259  5.867792e-315 1970-01-26 16:40:00.004000+00:00           12085
6  2819660386  5.867792e-315 1970-01-26 16:40:00.004000+00:00           14531
7   533490498  5.867792e-315 1970-01-26 16:40:00.004000+00:00           16950
8  2210399939  5.867792e-315 1970-01-26 16:40:00.006000+00:00           19350
9  1487582913  5.867792e-315 1970-01-26 16:40:00.006000+00:00           21803

@copy-pr-bot
Copy link

copy-pr-bot bot commented Nov 11, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant