Currently almost each scripts is linked to an input folder where the raw/processed data must be copy pasted in before we can run the script. This leads to multiple copies of the raw/processed data, and possibly multiple versions which are hard to keep track of. It's also a very inconvenient and time consuming procedure.
A better solution is to create script(s) which reads the raw/processed data, does necessary reformatting and data cleanup, and finally stores .Rdata objects which all the other labelling scripts can import.