It is unclear why data must be given in a .db format in the fine tuning script. Is it possible to get support for generic ase-compatible files? I am unsure if there is some specific to a .db file for training (although I will admit it would allow for asynchronous updates of the training set dynamically).
As a related note, is there a pragmatic way to supply my own dataset batch by batch? I guess I would do this by defining my own dataloader?