Unify datasets cache path from references with regular PyTorch cache? #6727

Description

@pmeier
Collaborator

In the classification and video_classification references, we cache here:

However, this directory is not used by PyTorch core, which uses ~/.cache/torch instead. For example, torch.hub caches in ~/.cache/torch/hub. The datasets v2 use the same root folder and will by default store the datasets in

_HOME = os.path.join(_get_torch_home(), "datasets", "vision")

which expands to ~/.cache/torch/datasets/vision.

Maybe we can use ~/.cache/torch/cached_datasets or something similar as cache path in the references?
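For reference, torch.hub resolves its cache root roughly as sketched below. This is a minimal illustration of the lookup order (not the actual implementation), assuming the usual $TORCH_HOME and $XDG_CACHE_HOME conventions:

```python
import os

def torch_home():
    """Sketch of torch.hub's cache-root resolution: $TORCH_HOME wins,
    then $XDG_CACHE_HOME/torch, then the ~/.cache/torch fallback."""
    return os.path.expanduser(
        os.getenv(
            "TORCH_HOME",
            os.path.join(os.getenv("XDG_CACHE_HOME", "~/.cache"), "torch"),
        )
    )

# The datasets v2 default mentioned above would then be:
datasets_home = os.path.join(torch_home(), "datasets", "vision")
```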

cc @datumbox @vfdev-5

Activity

datumbox

datumbox commented on Oct 10, 2022

@datumbox
Contributor

Thanks for reporting @pmeier. Ideally we would like to move away from needing to pre-read the dataset and cache it. This is currently necessary due to the way the Video Clipping class works, but it causes issues with streamed datasets. @YosuaMichael is looking to fix this.

pmeier

pmeier commented on Oct 10, 2022

@pmeier
Collaborator (Author)

@YosuaMichael if we won't support caching in the future, feel free to close this issue.

YosuaMichael

YosuaMichael commented on Oct 10, 2022

@YosuaMichael
Contributor

@datumbox In the case of VideoClipping, we indeed cache the dataset because we pre-compute the start and end of all the non-sampled clips. However, it seems this caching concept is not just for video datasets but for datasets in general (it applies to classification too).

Also, I am not yet sure whether we will get rid of the cache (for performance reasons) even if we change the clip sampler design, so I think this issue should stay open for now.
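To illustrate why this is worth caching, here is a hypothetical helper showing the kind of per-video precomputation the clipping logic performs. The function name is illustrative (not torchvision's actual API); the expensive part in practice is decoding every video just to learn its frame count, which is what makes a cache attractive:

```python
def compute_clip_indices(num_frames, clip_len, stride):
    """Enumerate the (start, end) frame indices of every fixed-length
    clip in a video, stepping by `stride` frames between clip starts."""
    return [
        (start, start + clip_len)
        for start in range(0, num_frames - clip_len + 1, stride)
    ]

# e.g. a 10-frame video, 4-frame clips, stride 3:
# compute_clip_indices(10, 4, 3) -> [(0, 4), (3, 7), (6, 10)]
```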

NicolasHug

NicolasHug commented on Oct 10, 2022

@NicolasHug
Member

The datasets v2 use the same root folder and will by default store the datasets in

_HOME = os.path.join(_get_torch_home(), "datasets", "vision")

which expands to ~/.cache/torch/datasets/vision.

This will more likely be ~/.cache/torch/vision/datasets to keep domains properly separated. FYI @mthrok @parmeet and I had agreed on the following API for setting / getting assets folders, as well as their default paths (at the time we didn't consider "dataset cache" but it's just another asset type):

def set_home(root, asset="all"):
    # asset can be "all", "datasets", "models", "tutorials", etc.
    # this is placed in the main namespace e.g. torchvision.set_home() or torchtext.set_home()
    # Note: using set_home(root=...) doesn't persist across Python executions

def get_home(asset):
    # Priority (highest = 0)
    # 0. whatever was set earlier in the program through `set_home(root=root, asset=asset)`
    # 1. asset-specific env variable e.g. $TORCHTEXT_DATASETS_HOME
    # 2. domain-wide env variable + asset name, e.g. $TORCHTEXT_HOME / datasets
    # 3. default, which corresponds to torch.hub._get_torch_home() / DOMAIN_NAME / ASSET_NAME
    #    typically ~/.cache/torch/vision/datasets
    #                ^^^^^^^^^^^^
    #            This is returned by _get_torch_home()
    #            and can get overridden with the $TORCH_HOME variable as well.
    pass

So perhaps we'll want to go with ~/.cache/torch/vision/cached_datasets . The difference between "cached_datasets" and "datasets" isn't obvious, but I don't have a much better suggestion.


          Unify datasets cache path from references with regular PyTorch cache? · Issue #6727 · pytorch/vision