Skip to content

Conversation

@Layth17
Copy link
Member

@Layth17 Layth17 commented Dec 19, 2022

  • add a script to collect all target files

  • a unique way of naming files so it is clear which one belongs to which process
    -- grabbing the last 5 "/"-dilemeted fields seems unique enough + creating /full_path to include full paths if needed

  • iterate through the bucket dir to grab the files
    -- downloading folders locally is inefficient. Each folder is +100GB.
    -- instead I am only grabbing the files I want after generating their paths through gsutil ls gs://BUCKET/.../**/monitoring.log
    -- copying files over takes about 15 min

  • automatically copy pull_monitor_logs.sh script to /shared

  • add summary.log

example cmd: bash /shared/pull_monitor_logs.sh --gs-path griffith-lab-test-layth/cromwell-executions/immuno --wf-id b6ef294d-080b-41cb-924e-ea53f6c54a2a

@Layth17 Layth17 marked this pull request as draft December 19, 2022 21:27
@Layth17 Layth17 marked this pull request as ready for review January 18, 2023 04:06
@Layth17 Layth17 mentioned this pull request Jan 18, 2023
1 task
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant