Support multi-partition groupby standard deviation aggregations#21944
Support multi-partition groupby standard deviation aggregations#21944Matt711 wants to merge 5 commits intorapidsai:mainfrom
Conversation
|
This is mathematically correct but numerically tremendously unstable. We really want the grouped aggregation machinery to expose the "intermediate" value before apply finalisation. The one-pass approach uses welford's algorithm iirc. If we xan get that out then we could use it as an input to merge two grouped objects and then continue |
Yeah I even had to switch around some of the OOOs to get the test to pass (trying to avoid small numerical precision differences). The alternative you're suggesting is interesting. Reading variance algorithims... |
|
Interesting, so we already have #21962 computes |
Description
Supports streaming groupby
stdandvaraggregations.Checklist