Skip to content

[Bug] Optimization is now much slower in TPCh benchmarks #796

@rjzamora

Description

@rjzamora

I tried running the TPCh benchmarks in https://github.com/coiled/benchmarks and noticed that dask-expr was much slower than before (both with "pandas" and "cudf" as the backend). After looking at a performance report, it was clear that the slow down was not in compute. I confirmed that the regression was coming from optimize() (a change from ~1s to >10s for query 1).

I did a cursory bisection, and found that the regression was definitely introduced in #395

I haven't had time to figure out why 395 is slowing things down yet.

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions