Replies: 5 comments 1 reply
-
|
Thanks for picking your brain on this. Before looking at the details, I was wondering if we could also discuss a practical way of creating new "nodes" and automatically parsing the configuration file. I have something in mind like a I tried to make a toy example as follows. The idea is to end up with the simplest API as possible. In my example the layout is the following: The main (user-defined) script looks like this. We define three tasks (nodes) using a decorator The decorator itself is defined in the Whenever something is decorated with this and imported (either in the main script or through the user package in a The function that builds the pipeline simply reads the configuration file, loops over the steps and looks in the node registry to build the pipeline: For the main script shown above, I use this configuration file: Summary:
I think it would be nice to:
I uploaded the toy manager if you want to try it out: |
Beta Was this translation helpful? Give feedback.
-
|
Thanks a lot for this insight ! It could be very nice, but I think we should first converge on some "manual" exemples to decide if pipelines on plaid objects natively make sense. |
Beta Was this translation helpful? Give feedback.
-
|
For the moment, I have a deepcopy of the dataset in order to have a clean copy of the modified dataset as output of each node of the pipeline, leaving the input dataset unmodified at each node. This is a safe mechanism, but quite expensive. In particuler, applying twice the pipeline on the same dataset do not return different results. An alternative could be:
Pros:
Cons:
|
Beta Was this translation helpful? Give feedback.
-
|
Dataset and sample utilities for updating and extracting from features in progress in #112 |
Beta Was this translation helpful? Give feedback.
-
|
Working PCA-GP examples based on @xroynard idea to wrap sklearn blocks in #101 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
A relatively clean tentative made here: https://github.com/PLAID-lib/plaid/tree/pipefunc_tests/examples/pipelines
Beta Was this translation helpful? Give feedback.
All reactions