Sketch: tying the knot for ref validator #1235

frenchy64 · 2025-10-30T19:14:19Z

This way we can fully compile a ref validator up front, which is faster and uses less memory: there is no need to stop, compile, and cache each new level of recursion.

Removing the dynamic variable here is a strong motivator for -validator to take opts. OTOH, we're gaining perf and only need to use the dynamic variable once before it's cached.

opqdonut

We read this through with @mattiuusitalo and think this would be a great thing to do! See our comments below.

opqdonut · 2025-11-07T07:18:00Z

src/malli/core.cljc

+                         (int? (nth x 0))
+                         (@rec (nth x 1)))))]
+  (vreset! rec f)
+  f)


I'm following so far, sounds like a good idea!
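For readers following along, here is a minimal standalone sketch of the knot-tying idea in the hunk above. It is not Malli's code: `int-chain-validator` and the data shape (an int paired with either nil or another such pair) are assumptions made up for illustration. The point is only that the recursive call goes through a volatile that is filled in after the function is created.

```clojure
;; Hypothetical sketch, not Malli's implementation.
;; Validates values shaped like [1 [2 [3 nil]]].
(defn int-chain-validator []
  (let [rec (volatile! nil)               ; placeholder for the validator itself
        f   (fn [x]
              (or (nil? x)
                  (and (vector? x)
                       (= 2 (count x))
                       (int? (nth x 0))
                       (@rec (nth x 1)))))] ; recursion goes through the volatile
    (vreset! rec f)                       ; tie the knot: placeholder now points at f
    f))

(def valid-chain? (int-chain-validator))

(valid-chain? [1 [2 [3 nil]]]) ;; => true
(valid-chain? [1 [:two nil]])  ;; => false
```

The validator is compiled exactly once, and every recursive step is a volatile read followed by a function call.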

opqdonut · 2025-11-07T07:19:42Z

src/malli/core.cljc

+;;           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+;;
+;; Because of this essential difference, validators for eager refs can be fully compiled, but
+;; lazy refs must  <START FROM HERE, WE CAN FIX THIS>


Looks like the excellent docs trail off here. So what is the difference in the plan eager vs. lazy?

I got distracted by the question of whether we can tie the knot at schema-creation time using the same approach. Shelving that idea for now, and I'll fill this explanation back in.

opqdonut · 2025-11-07T07:24:15Z

src/malli/core.cljc

+            f (binding [*ref-validators* (assoc ref-validators id vol)]
+                (-validator s))]
+        (vreset! vol f)
+        f))))


This impl looks good, thanks! I wonder if the s logic could be replaced with something like deref?

opqdonut · 2025-11-07T07:26:04Z

src/malli/core.cljc

+        id (-identify-ref-schema this)]
+    (if-some [vol (ref-validators id)]
+      #(@vol %)
+      (let [vol (volatile! nil)


The rationale for using a volatile could be documented. Something like this:

the volatile can never be raced, since the created volatile is only written to in this single-threaded context. After the validator has been compiled, the volatile is only read, which is safe to do in parallel.

we want the perf

For posterity it's more like:

volatile can race if multiple threads realize lazy refs and generate validators for them simultaneously

synchronization logic adds unnecessary overhead since all racing validators are equivalent

"extra" validators that are written to the volatile first but then overwritten will be garbage collected because we always get the latest validator from the volatile, so over time there will be one canonical validator for recursion

opqdonut · 2025-11-07T07:26:58Z

src/malli/core.cljc

+                            (when-not allow-invalid-refs
+                              (-fail! ::invalid-ref {:type :ref, :ref ref}))))
+             _ (when-not lazy (?schema))
+             rf (-memoize #(schema (?schema) options))


why this change?

In the spirit of your other comment, I wanted to understand what exactly the difference was between lazy and eager schemas. It was a bit obscured by the original rf. At this point, I think the difference is (when-not lazy (?schema)), which you can see clearly in the code now.

opqdonut · 2025-11-07T07:27:30Z

src/malli/core.cljc

+                                   (-validator (rf)))]
+                           (vreset! vol f)
+                           (f x))))))
+                 (if-some [vol (ref-validators id)]


yep, this is the eager case, looks good

opqdonut · 2025-11-07T07:32:15Z

src/malli/core.cljc

+                   (c/assert (not @vol) "invariant: we tie the knot on the way back up")
+                   (if-some [f @vol]
+                     @f
+                     (fn [x]


Ok, so in the lazy case, the knot-tying is delayed until the validator is run. That makes sense. Could the code for the lazy and eager cases be made to look even more symmetric, so that the added fn wrapper stands out as the only difference?

I tried this multiple times and it was not as easy as I anticipated. I don't fully understand why.
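A rough sketch of the eager/lazy distinction discussed in this thread, written against plain Clojure rather than Malli. All names here (`eager-ref-validator`, `lazy-ref-validator`, `chain-body`, `compilations`) are hypothetical; `compile-body` stands in for compiling the ref's child schema and is handed a function that performs the recursive call. In the eager case the knot is tied when the validator is built; in the lazy case it is tied on the first validation call.

```clojure
;; Hypothetical sketch, not Malli's implementation.
(def compilations (atom 0)) ; counts how often the body is compiled

(defn chain-body
  "Stand-in for compiling a child schema; `self` performs the recursive call."
  [self]
  (swap! compilations inc)
  (fn [x]
    (or (nil? x)
        (and (vector? x) (= 2 (count x)) (int? (nth x 0)) (self (nth x 1))))))

(defn eager-ref-validator [compile-body]
  (let [vol (volatile! nil)
        f   (compile-body (fn [x] (@vol x)))] ; compiled right away
    (vreset! vol f)                           ; knot tied at compile time
    f))

(defn lazy-ref-validator [compile-body]
  (let [vol (volatile! nil)]
    (fn self [x]
      (if-some [f @vol]
        (f x)                                 ; already compiled: just call it
        (let [f (compile-body self)]          ; first call: compile now
          (vreset! vol f)                     ; knot tied at validation time
          (f x))))))

(def eager-valid? (eager-ref-validator chain-body)) ; compiles immediately
(def lazy-valid?  (lazy-ref-validator chain-body))  ; compiles on first use
```

In the lazy version two threads may both see an empty volatile and both compile. Under the assumption that the compiled validators are equivalent, whichever write lands last simply becomes the one that later calls pick up, which is the trade-off described in the volatile discussion above.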

frenchy64 · 2025-11-11T23:53:49Z

Plumatic Schema solves this by caching validators for all schemas during compilation. In addition to handling recursive schemas, this also prevents a nasty exponential blowup of compilation size for even non-recursive schemas, which Malli also suffers from.

Malli's validator compilation is exponential. This registry demonstrates how:

(def registry {::creates-1-validator [:tuple]
               ::creates-4-validators [:tuple ::creates-1-validator ::creates-1-validator ::creates-1-validator ::creates-1-validator]
               ::creates-16-validators [:tuple ::creates-4-validators ::creates-4-validators ::creates-4-validators ::creates-4-validators]
               ::creates-64-validators [:tuple ::creates-16-validators ::creates-16-validators ::creates-16-validators ::creates-16-validators]
               ::creates-256-validators [:tuple ::creates-64-validators ::creates-64-validators ::creates-64-validators ::creates-64-validators]
               ::creates-1024-validators [:tuple ::creates-256-validators ::creates-256-validators ::creates-256-validators ::creates-256-validators]
               ::creates-4096-validators [:tuple ::creates-1024-validators ::creates-1024-validators ::creates-1024-validators ::creates-1024-validators]
               ::creates-16384-validators [:tuple ::creates-4096-validators ::creates-4096-validators ::creates-4096-validators ::creates-4096-validators]
               ::creates-65536-validators [:tuple ::creates-16384-validators ::creates-16384-validators ::creates-16384-validators ::creates-16384-validators]
               ::creates-262144-validators [:tuple ::creates-65536-validators ::creates-65536-validators ::creates-65536-validators ::creates-65536-validators]
               ::creates-1048576-validators [:tuple ::creates-262144-validators ::creates-262144-validators ::creates-262144-validators ::creates-262144-validators]
               ::creates-4194304-validators [:tuple ::creates-1048576-validators ::creates-1048576-validators ::creates-1048576-validators ::creates-1048576-validators]})

With this registry, a schema at depth N compiles (m/validator ::creates-1-validator) 4^N times, because each level references the level below it four times.

e.g., (m/validator ::creates-4194304-validators) compiles (m/validator ::creates-1-validator) 4,194,304 (4^11) times.

Plumatic Schema would only compile it once. It's not so trivial to achieve with dynamically scoped refs, but it's the same idea as detecting ref cycles, which we can now do reliably.

Here's a reproduction of the issue https://github.com/frenchy64/malli/pull/36/files which I have been pondering since discussing #1180

I mention this because, like Plumatic Schema, I think it makes sense to have the same solution solve both exponential and recursive validators.
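To make the blowup concrete, here is a toy model of the compilation strategy, independent of both Malli and Plumatic Schema. Everything in it (`toy-registry`, `compile-naive`, `compile-cached`, `tuple-validator`) is hypothetical: a "schema" is just a keyword naming a registry entry, and each entry lists four references to the level below. The naive compiler recompiles every occurrence; the cached one compiles each id once.

```clojure
;; Hypothetical toy model, not Malli or Plumatic Schema code.
(def toy-registry
  {:level-0 []
   :level-1 [:level-0 :level-0 :level-0 :level-0]
   :level-2 [:level-1 :level-1 :level-1 :level-1]
   :level-3 [:level-2 :level-2 :level-2 :level-2]})

(defn tuple-validator
  "Checks that x is a vector whose elements satisfy the child validators."
  [children]
  (fn [x]
    (and (vector? x)
         (= (count x) (count children))
         (every? true? (map (fn [child v] (child v)) children x)))))

(defn compile-naive
  "Compiles every occurrence of every child. `counts` is an atom of id -> compilations."
  [counts id]
  (swap! counts update id (fnil inc 0))
  (tuple-validator (mapv #(compile-naive counts %) (toy-registry id))))

(defn compile-cached
  "Like compile-naive, but reuses validators stored in the `cache` atom."
  [counts cache id]
  (or (@cache id)
      (let [_ (swap! counts update id (fnil inc 0))
            f (tuple-validator
               (mapv #(compile-cached counts cache %) (toy-registry id)))]
        (swap! cache assoc id f)
        f)))

(let [counts (atom {})]
  (compile-naive counts :level-3)
  @counts)
;; => {:level-3 1, :level-2 4, :level-1 16, :level-0 64}

(let [counts (atom {}) cache (atom {})]
  (compile-cached counts cache :level-3)
  @counts)
;; => {:level-3 1, :level-2 1, :level-1 1, :level-0 1}
```

This toy cache would loop forever on a recursive registry, since an id is only cached after its children are compiled. Handling cycles would need a placeholder stored before descending, which is the same knot-tying step as in the ref validator.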

sketch tying the knot for ref validator

b373b68

frenchy64 added the for discussion Discussion is main focus of PR label Oct 30, 2025

frenchy64 added 4 commits October 30, 2025 14:27

handle lazy

207d68c

smaller

3d2090e

actually retrieve vol

a59b475

precompute more

94a49a2

opqdonut added this to Metosin Open Source Backlog Oct 31, 2025

opqdonut moved this to 📬 Inbox in Metosin Open Source Backlog Oct 31, 2025

frenchy64 added 7 commits October 31, 2025 09:14

Merge branch 'master' into ref-validator-knot

fbf78df

start doc

469e4be

share child in ref

a0bcc56

precise notion of "lazy" being whether to deref child at schema creation

70c0b95

bit shorter

3256139

we don't need delay's coordination

5b1e0cc

TODO test a recursive lazy schema

84d8456

opqdonut reviewed Nov 7, 2025

opqdonut moved this from 📬 Inbox to ⌛Waiting in Metosin Open Source Backlog Nov 7, 2025

opqdonut moved this from ⌛Waiting to 📬 Inbox in Metosin Open Source Backlog Nov 12, 2025
