
Adding cycling to 3dvar_cf suite#752

Open
mer-a-o wants to merge 21 commits intodevelopfrom
feature/mer-a-o/add-3dvar-cf-cycling

Conversation

Contributor

@mer-a-o mer-a-o commented Mar 26, 2026

Description

This PR adds a new cycling 3DVar suite for GEOS-CF. The experiment runs a 12h forecast starting from the beginning of the assimilation window and applies the increment from the middle of the window using IAU. The forecast length and output frequency can be set with forecast_length and forecast_output_frequency, though changing the forecast length has not been tested.
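To make the window layout concrete, here is a minimal sketch (illustrative only; function and variable names are hypothetical, not actual swell APIs) of the timing for one cycle, assuming a 6h assimilation window: the forecast starts at the window beginning and the IAU increment is centered at the window middle.

```python
# Hypothetical sketch of the cycle timing; names are illustrative,
# not actual swell code.
from datetime import datetime, timedelta


def cycle_times(cycle_middle: datetime, window_length_h: int = 6,
                forecast_length_h: int = 12):
    """Return (forecast_start, iau_center, forecast_end) for one cycle."""
    half_window = timedelta(hours=window_length_h / 2)
    forecast_start = cycle_middle - half_window  # beginning of the window
    forecast_end = forecast_start + timedelta(hours=forecast_length_h)
    return forecast_start, cycle_middle, forecast_end


start, center, end = cycle_times(datetime(2026, 3, 26, 12))
print(start, center, end)
# 2026-03-26 09:00:00 2026-03-26 12:00:00 2026-03-26 21:00:00
```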

Summary of changes:

  • RC files that get templated with SWELL are in src/swell/configuration/jedi/interfaces/geos_cf/namelists. The remaining RC files (static) are copied into the scratch directory from geos_cf_run_dir.

  • get_background task fetches backgrounds from R2D2 using background_experiment in the config as the experiment ID. In cycling experiments (those with "cycle" in their name), the experiment ID is set to background_experiment for the first cycle. For subsequent cycles, it is set to the r2d2_experiment_id of the current experiment, so that backgrounds from the previous cycle are fetched for the variational task in the current one.

  • save_forecast task stores forecasts in R2D2 using r2d2_experiment_id as the experiment ID. These files are then fetched from R2D2 during the get_background task.

  • Restarts are saved (save_restart) and fetched (get_restart) from R2D2. To save space, restart files are not stored on R2D2 at every cycle. The rst_store_interval key controls how many cycles pass before restart files are stored as real files rather than symlinks. In intermediate cycles, restarts are saved as symlinks.

  • prep_forecast prepares the scratch directory within the current cycle's run directory:

    • Copies static RC files from geos_cf_run_dir and copies/edits RC files from src/swell/configuration/jedi/interfaces/geos_cf/namelists.
    • Copies GEOS-FP files for replay into the scratch directory.
    • Changes the format of the JEDI increment file using inc_template.
  • run_forecast submits gcm_run_geoscf.j (a lighter version of gcm_run.j used for running GEOS-CF) to the queue and waits until the job finishes. One limitation: the Cylc interface shows this job as "running" whether it is waiting in the queue or actually executing. There may be better approaches for handling gcm_run.j execution.

  • clean_cycle.py now cleans the scratch directory of the previous cycle after the current cycle completes. This way, the restart files from the previous cycle that are saved as symlinks are kept until the current cycle uses them.

@mer-a-o mer-a-o requested review from jeromebarre and viral211 March 26, 2026 22:40
@mer-a-o mer-a-o added the compo Atmospheric composition related issues label Mar 26, 2026
@mer-a-o mer-a-o requested a review from rtodling March 30, 2026 14:05
Contributor

This file should not exist in the repo; it should only exist in the experiment. A way to avoid keeping this file in the repo is to have swell create it based on the initial date/time of the experiment. From there on, the experiment (GCM) will create the file on its own, and it can then be recycled from one cycle to the next.

Contributor Author

Removing cap_restart. Note that prep_forecast.py handles writing the correct initial date/time for the experiment.

Comment thread src/swell/configuration/jedi/interfaces/geos_cf/namelists/CAP_c360.rc Outdated
Comment thread src/swell/configuration/jedi/interfaces/geos_cf/namelists/AGCM_c90.rc Outdated
geoscf_jedi.format: 'CFIO',
geoscf_jedi.mode: 'instantaneous' ,
#geoscf_jedi.frequency: 010000,
geoscf_jedi.frequency: >>SWELL_FC_OUTPUT_FREQ<<,
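A >>NAME<< placeholder like the one above gets filled in when the RC file is templated. The following is only a sketch of that substitution step (the render_rc function is hypothetical, not the actual swell templating code):

```python
# Hypothetical sketch of >>NAME<< placeholder substitution in a
# templated RC file; not the actual swell implementation.
import re


def render_rc(text: str, values: dict) -> str:
    """Replace every >>NAME<< placeholder with its configured value."""
    return re.sub(r'>>(\w+)<<', lambda m: str(values[m.group(1)]), text)


line = "geoscf_jedi.frequency: >>SWELL_FC_OUTPUT_FREQ<<,"
print(render_rc(line, {'SWELL_FC_OUTPUT_FREQ': '010000'}))
# geoscf_jedi.frequency: 010000,
```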
Contributor

We should talk about the naming conventions for these templated frequencies and such.

bkg_steps = []

# Parse config
background_experiment = self.config.background_experiment()
Contributor

I am trying to understand the reason for the changes here. Is this because the backgrounds for the various cycles of the swell experiment can be located outside of r2d2?

Contributor Author

Please see the comment in the code. For the first cycle, the backgrounds are fetched from R2D2 using the experiment_id specified by the user. For subsequent cycles, swell (get_background.py) fetches the background files from the 12h forecast done in the previous cycle. Those forecast files are uploaded to R2D2 under the current experiment ID; in the following cycle, that ID is used to fetch them for use as backgrounds.

Comment thread src/swell/tasks/get_restart.py
@@ -0,0 +1,19 @@
SpeciesName: CO
Contributor

I'm puzzled as to why this RC, the other RCs, and the GEOS yaml need to be here. These files should be in the tag of GEOS that supports CF, no? What am I missing?

Contributor Author

For each cycle, FileTemplate is replaced to point to the JEDI increment file. I'm keeping all the RC files that are modified each cycle in SWELL; the rest of the RC files are copied from a tagged version of GEOS specified in suite_config.py: geos_cf_run_dir('/discover/nobackup/mabdiosk/rundir/GCv14.0_GCMv1.17_c90_Skylab').
I think this way we can easily keep track of what goes into the swell experiment.

@@ -0,0 +1,20 @@
SpeciesName: NO2
Contributor

Same comment as above for RC files ...

@@ -0,0 +1,2769 @@
Samplings:
Contributor

Same comment as for the RC files - this should be in the GEOS tag and in the settings of your experiment.

Comment thread src/swell/tasks/prep_forecast.py Outdated
@jeromebarre
Contributor

jeromebarre commented Apr 20, 2026

Should this be moved out of draft status at this point? Ideally we want this ready to merge ASAP; even if it isn't perfect, it is IMO good enough and replicates what has been done in skylab.
There are fixes still to be done on the coding norms and the swell CI, which is something to clean up in save_forecast.

Over time, a few improvements off the top of my head:

  • getting met forcings for replay from their original locations rather than from my directory in geos
  • considering tarballs for restarts rather than individual files
  • GEOS-CF forecast logs written to the correct locations (though maybe this was fixed since I last tested)

Comment thread src/swell/tasks/save_forecast.py Outdated
@mer-a-o mer-a-o marked this pull request as ready for review May 5, 2026 21:01
@mer-a-o mer-a-o requested review from jeromebarre, mranst and rtodling May 5, 2026 21:01
Collaborator

Since this script and the RunForecast task are relatively simple, would it be possible to run the executable directly from the task runtime, similar to how Doruk uses gcm_run.j?

    [[RunForecast-{{model_component}}]]
        script = "{{experiment_path}}/run/$datetime/{{model_component}}/gcm_run_geoscf.j"
        platform = {{platform}}
        execution time limit = {{scheduling["RunForecast"]["execution_time_limit"]}}
        [[[directives]]]
            --output="{{experiment_path}}/run/$datetime/{{model_component}}/gcm_run.log"
        {%- for key, value in scheduling["RunForecast"]["directives"][model_component].items() %}
            --{{key}} = {{value}}
        {%- endfor %}

echo "Submitting GCM job: ${GCM_SCRIPT}"

set +e
sbatch --wait --output=${LOG_FILE} ${GCM_SCRIPT}
Collaborator

If I'm not mistaken, the RunForecast task will be queued into Slurm, and then this command will allocate its own job on top of that. It also failed for me when I tried it.

Comment on lines +46 to +56
# For experiments with 'cycle' in the suite name:
# for the first cycle, use background_experiment in the config
# as the experiment id for fetching from r2d2; for cycles after
# the first, use the current experiment id for fetching from r2d2.
if self.cycle_time_dto() != self.start_cycle_point_dto() and 'cycle' in self.suite_name():
background_experiment = self.config.r2d2_experiment_id()
else:
background_experiment = self.config.background_experiment()

self.logger.info(f'Fetching background from experiment {background_experiment}')

Collaborator

Will this affect 3dvar_marine_cycle and 3dfgat_marine_cycle?
