U/alxmrs/experiments/kr2 by alxmrs · Pull Request #681 · m2lines/Samudra

alxmrs · 2026-04-13T19:32:32Z

This PR makes four contributions towards implementing O1KR2 (#615):

Updates the encoder to use dual perceivers to separately represent prognostic and boundary forcings. This is needed because at inference, we plan to use coarse forcings for high res prognostic prediction. Latents from both boundary and prognostic perceivers are linearly mixed, then positional encodings are added before the output is passed on (to the processor).
To enable (1) in the model, this PR also updates the data plumbing across the whole training and inference process to separate the prognostic and boundary tensors. These are no longer concatenated as an "input" tensor.
Also to enable (1), we now update the training and inference AR process to update the GridContext input resolution after step 1. In this new scheme, step 0 does either downscaling or upscaling, and step 1+ updates the grid context to make the input resolution the output resolution. This is needed during "mix" schedule multiscale training, since step 1 and after the prognostic is fed back to the previous decoder step at the output resolution. This does not affect "match" schedule multiscale training, because both resolutions are already equal.
In addition to unit tests being updated, we add a new test_fomo_cross_resolution set of integration tests. These vet that single step cross resolution (prog/boundary), single step mix schedule, and two step AR mixed schedule training of the FOMO model all work as expected.

alxmrs · 2026-04-13T23:37:43Z

@@ -28,26 +29,34 @@ def patch_from(


 class PerceiverEncoder(nn.Module):


The changes in this branch are in service of this encoder update.

alxmrs · 2026-04-14T17:33:56Z

@codex may I have your review?

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 2a89013570

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-04-14T17:40:47Z

+                prognostic = self.maybe_add_3d_coordinates(
+                    prognostic, ctx.output_resolution_cpu
+                )
+
+            fts = self.encoder(prognostic, boundary, ctx.output_resolution_cpu)


Use input-resolution coordinates for FOMO prognostic path

When src and dst resolutions differ in mix schedule training, TrainData carries prognostic tensors on ctx.input_resolution_cpu (see TorchTrainDataset.ctx), but this code feeds ctx.output_resolution_cpu into the prognostic path. That makes spatial metadata inconsistent with the tensor shape: Concat3dCoordinates can fail with a size mismatch if 3D coordinates are enabled, and even without 3D coordinates the encoder’s positional encoding (pos_scale_enc) is computed from output-grid lat/lon with input-grid patch sizes, producing a token count mismatch at the x + pos_encoding add. As written, cross-resolution mix runs can crash on the first forward pass.

Useful? React with 👍 / 👎.

alxmrs · 2026-04-14T18:02:23Z

@codex may I have your review?

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d1c7d2e68d

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

alxmrs · 2026-04-14T20:07:47Z

@codex may I have another review?

chatgpt-codex-connector · 2026-04-14T20:14:01Z

Codex Review: Didn't find any major issues. 🚀

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

alxmrs · 2026-04-14T20:52:26Z

+                # so the encoder uses the correct resolution.
+                ctx = dataclasses.replace(
+                    ctx, input_resolution_cpu=ctx.output_resolution_cpu
                )


This addresses the TODO on the left.

alxmrs · 2026-04-14T21:26:09Z

-class Stepper:
-    def __init__(self):
-        pass
+def train_batch(


The Stepper class was really three functions in a trenchcoat. So, I extracted them.

…oading to split the input.

…boundary tensors after projecting the boundary.

… Encoder gets cross attention hparams Refactor: extracted function to created patches.

… catch a bug. Added fourier pos encoding to the boundary layer (new module in augment_input.py).

…d test to confirm the need for the fix.

alxmrs force-pushed the u/alxmrs/experiments/kr2 branch from a736b1a to c159eb8 Compare April 13, 2026 19:34

alxmrs commented Apr 13, 2026

View reviewed changes

Comment thread src/ocean_emulators/models/modules/encoder.py Outdated

alxmrs commented Apr 13, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Apr 14, 2026

View reviewed changes

Comment thread src/ocean_emulators/models/fomo.py

alxmrs force-pushed the u/alxmrs/experiments/kr2 branch from 944c435 to 022914a Compare April 14, 2026 20:18

alxmrs commented Apr 14, 2026

View reviewed changes

This was referenced Apr 15, 2026

Separate prognostic and boundary tensors end-to-end #701

Merged

Perceiver encoder updated for cross-resolution fusion #702

Open

Track prognostic resolution across autoregressive rollout steps #703

Draft

alxmrs added 15 commits April 22, 2026 15:25

Scaffolding: models take prog separate from boundary. Refactor data l…

73106d2

…oading to split the input.

Experiment: Encoder perceives over prognostic and cross attends over …

396b69b

…boundary tensors after projecting the boundary.

AR 2: Fix bug in FlashPerceiver to actually pass kwargs to perceiver.…

03254c2

… Encoder gets cross attention hparams Refactor: extracted function to created patches.

AIR 3: optional self attention after fusion. Added a simple assert to…

4116d63

… catch a bug. Added fourier pos encoding to the boundary layer (new module in augment_input.py).

AIR 4, part 1: a few minor catches.

50f719e

AIR 5: Minor code fixes, primarily in test code.

510a6ea

AIR 5: Added integration tests of coarse forcings.

56d32c4

Fixed issues from merge.

0e4f16c

Way simpler: just use a second, smaller perceiver.

289163f

rm unneeded module.

924df14

Remove redundant linear layer. Make more flexible to configure latents.

eff5c2a

Cleanup: we never return embeddings.

d855a16

Do asserts first.

a11afdf

rename variable: in channels bc they are not always prog channels.

96427de

minimize diff

58f54d1

alxmrs added 13 commits April 22, 2026 15:27

Rename because we are not merging anymore.

3bb98cf

rm stale function.

88afc3f

min diff: add back checkpointing.

fcd9916

import at file level.

2b1eace

Added a TODO for an experiment later.

a54ed52

AIR 6: nit: unneeded line.

980fb24

AIR 6 + Codex: Fixed resolution issue (grid ctx) in FOMO wiring. Adde…

53f76d1

…d test to confirm the need for the fix.

Codex: works at steps>1

8534d3e

Added a comment that adds useful context.

3c97c9f

Technically correct in_channels, but the value is not used anyway.

8d332d2

Added TODO

57a3ba7

rm new unit tests, better covered by the integration test.

e8f0856

oops, deleted too many tests.

8f7ffa2

alxmrs force-pushed the u/alxmrs/experiments/kr2 branch from 58f278e to 8f7ffa2 Compare April 22, 2026 22:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

U/alxmrs/experiments/kr2#681

U/alxmrs/experiments/kr2#681
alxmrs wants to merge 28 commits into
mainfrom
u/alxmrs/experiments/kr2

alxmrs commented Apr 13, 2026 •

edited

Loading

Uh oh!

Uh oh!

alxmrs Apr 13, 2026

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 14, 2026

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 14, 2026

Uh oh!

alxmrs Apr 14, 2026

Uh oh!

alxmrs Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		@@ -28,26 +29,34 @@ def patch_from(


		class PerceiverEncoder(nn.Module):

Conversation

alxmrs commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

alxmrs Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

alxmrs commented Apr 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Apr 14, 2026

Uh oh!

alxmrs Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

alxmrs Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

alxmrs commented Apr 13, 2026 •

edited

Loading