Skip to content
Discussion options

You must be logged in to vote

Hi @virginiafdez, let me try to answer.

  1. Yes, the persistor can be configured to read checkpoints from the disk. NVFlare doesn't manage existing pretrained checkpoints. You could have a static mount on the server where these can be loaded from. Also, you can define different apps for the server and clients. So, only the server app can contain the ckpt if you want to upload it as part of the job. See a meta.json with a deployment map for that.

  2. We don't have an auto-resume option at this moment. In a production environment, the approach would be to extract the job result from the job storage and use the global model as initialization for the next job. This wouldn't restore all the states…

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@virginiafdez
Comment options

@holgerroth
Comment options

Answer selected by virginiafdez
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants