Optimizing processing of monthly customer invoices as MultiPartitionsDefinition
#20595
Replies: 2 comments
Hi @the4thamigo-uk - specifying a non-contiguous set of partitions to backfill in a single run isn't currently possible, although it seems like a reasonable thing to add support for. I recommend filing a GitHub issue for this if you're interested in it, although I can't promise how soon we'll be able to get to it.
Thanks @sryza, I'll make a request, but there seems to be a huge backlog of issues, so it is quite possible it will never see the light of day. In the meantime, I'm planning to tackle this with an op job that calls
Hi, I would like to have an asset that represents an invoice pushed to the accounting system for a customer for a given month.
I think I should model this in Dagster using MultiPartitionsDefinition, where the month dimension is represented by an instance of MonthlyPartitionsDefinition, and the customer dimension is represented by an instance of DynamicPartitionsDefinition.
Each month I would like to materialize invoices for the previous month for all the customers in the customer DynamicPartitionsDefinition.
For performance reasons I would like to materialize all the customers in a single run, rather than launching separate runs for each customer, as the materialization is quick compared to the time taken to launch a pod for the run in Kubernetes.
Is MultiPartitionsDefinition the right model for this scenario? And how can I ensure that the process of materialization is efficient?
Update: it looks like you can kinda do this using a partition range, but I'm not sure you can specify a particular set of partitions to execute in the run: #17956
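To make the target set concrete, here is a minimal plain-Python sketch (not Dagster API) of the partitions a single monthly run would need to cover: the previous month paired with every known customer. The helper names `previous_month_key` and `invoice_partitions`, the dimension names "month" and "customer", and the customer IDs are all hypothetical. The key format assumes monthly keys are the first day of the month written as `%Y-%m-%d`, which should be checked against the actual partitions definition.

```python
from datetime import date

# Assumption: monthly partition keys look like "2024-01-01" (first day of the
# month, "%Y-%m-%d"). Change this if your partitions definition uses another format.
MONTH_KEY_FORMAT = "%Y-%m-%d"


def previous_month_key(today: date) -> str:
    """Return the partition key of the month immediately before `today`."""
    year, month = today.year, today.month - 1
    if month == 0:  # January rolls back to December of the previous year
        year, month = year - 1, 12
    return date(year, month, 1).strftime(MONTH_KEY_FORMAT)


def invoice_partitions(today: date, customers: list[str]) -> list[dict[str, str]]:
    """Build one {dimension: key} mapping per customer for the previous month.

    The dimension names "month" and "customer" are illustrative only.
    """
    month_key = previous_month_key(today)
    return [{"month": month_key, "customer": c} for c in sorted(customers)]


# Hypothetical customer IDs; in practice these would come from the dynamic
# customer partition set.
batch = invoice_partitions(date(2024, 1, 15), ["globex", "acme"])
for partition in batch:
    print(partition)
```

Note that this set is fixed in the month dimension and spans every customer, so it is a slice of the two-dimensional partition space rather than an arbitrary selection of keys.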