Add Model class #1126

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

edknv merged 23 commits into NVIDIA-Merlin:main from edknv:torch/model_cls

Jun 13, 2023

Contributor

edknv commented May 30, 2023 •

edited

Loading

This PR introduces the Model class for constructing models from individual blocks in the pytorch backend.

Some follow-up will be necessary, such as testing with a toy dataset using dataloader, batch_predict, etc.


          Add Model class

1dcddcc

github-actions bot commented May 30, 2023

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1126

edknv added 6 commits

May 30, 2023 15:09


          Merge branch 'main' into torch/model_cls

5f17863


          Use BlockContainer in Models class

bcd5c9f


          Add unit tests

13cc5f9


          Add docstrings

27bfe16


          Add module_utils unit tests

6de5e29


          Remove future work

c2394c8

edknv self-assigned this

edknv added enhancement area/pytorch labels

edknv added this to the Merlin 23.06 milestone

edknv requested review from marcromeyn and oliverholworthy and removed request for marcromeyn

June 1, 2023 03:07

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                  ...    BinaryOutput(schema.select_by_tag(Tags.TARGET).first),
+                  ... )
+                  ... trainer = Trainer(max_epochs=1)
+                  ... with Loader(dataset, batch_size=16) as loader:

Contributor

marcromeyn Jun 1, 2023

Is it required to use it as a context-manager?

Contributor Author

edknv Jun 2, 2023

No, it's probably not necessary in the torch case, but I want to promote this idiom using the context manager everywhere, because the tensorflow equivalent has a memory leak in some cases without a context manager.

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                      super().__init__()
+                      self.schema = schema
+                      self.pre = BlockContainer(name="pre")

Contributor

marcromeyn Jun 1, 2023

Should we break this part (until line 77) as a function an use it inside Block as well?

Contributor Author

edknv Jun 2, 2023

Yeah, that would be cleaner, but I'm having trouble with torchscript shenanigans.

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                      """Finds all instances of `ModelOutput` in the model."""
+                      return module_utils.find_all_instances(self, ModelOutput)
+                  def first(self) -> nn.Module:

Contributor

marcromeyn Jun 1, 2023

Isn't this part of the Block class as well?

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py Show resolved Hide resolved

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py Outdated Show resolved Hide resolved

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                      and len(model_outputs) > 1
+                  ):
+                      raise RuntimeError("Multiple outputs but only one target was provided.")

Contributor

marcromeyn Jun 1, 2023

Do we need to check if all model_outputs have a target property set?

Contributor Author

edknv Jun 4, 2023 •

edited

Loading

I updated the logic to check if model_outputs has a target when no targets are provided in lines 205-206.

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                      self, inputs: Union[torch.Tensor, Dict[str, torch.Tensor]], batch: Optional[Batch] = None
+                  ):
+                      """Performs a forward pass through the model."""
+                      outputs = inputs

Contributor

marcromeyn Jun 1, 2023

Same here as for the init, should we break this out in a function so it can be shared between here and the Block class.

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py Outdated

+                  def training_step(self, batch, batch_idx):
+                      """Performs a training step with a single batch."""
+                      del batch_idx
+                      inputs, targets = batch

Contributor

marcromeyn Jun 1, 2023

We need some logic here to construct the Batch class and pass it to the forward-pass.

marcromeyn reviewed

View reviewed changes

merlin/models/torch/models/base.py

+                      if self.schema:
+                          return self.schema
+                      return Schema([])

Contributor

marcromeyn Jun 1, 2023

We would need to add a method here for output_schema that combines all the output-schema's of the various model-outputs.

Contributor Author

edknv Jun 4, 2023

Added an output_schema() method.

edknv added 5 commits

June 1, 2023 15:23


          Merge branch 'main' into torch/model_cls

fc013ab


          move initialize() to module_utils

b651dbb


          handle batch in training_step

63ac24c


          Add output_schema()

88aa188


          check if model outputs have no targets when no target is provided

a49f371

edknv and others added 6 commits

June 2, 2023 07:43


          Merge branch 'main' into torch/model_cls

78886c0


          put loss and metrics on the same device

bdda50e


          Merge branch 'torch/model_cls' of github.com:edknv/models into torch/…

7ea78a0

…model_cls


          add docstrings to module_utils functions

68856c0


          move metric device setting to initialize

d58819e


          update logic for using model output targets

36f6155

edknv marked this pull request as ready for review

June 4, 2023 23:34

edknv and others added 5 commits

June 5, 2023 17:30


          Merge branch 'main' into torch/model_cls

2096a37


          Merge branch 'main' into torch/model_cls

58e1013


          Merge branch 'main' into torch/model_cls

3f76c32


          Merge branch 'main' into torch/model_cls

3b4dc51


          Merge branch 'main' into torch/model_cls

8f821b0

marcromeyn approved these changes

View reviewed changes

edknv merged commit 92833fa into NVIDIA-Merlin:main

edknv deleted the torch/model_cls branch

June 13, 2023 16:05

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/pytorch enhancement