feat: Improve unstable size analysis support #935

ChayimFriedman2 · 2025-07-10T04:29:26Z

Include an option panic_if_missing that will panic if there is an ingredient with no heap_size() defined, to ensure coverage.
Add heap_size() to tracked structs, interneds an inputs.

netlify · 2025-07-10T04:29:48Z

✅ Deploy Preview for salsa-rs canceled.

Name	Link
🔨 Latest commit	`d0ceeb4`
🔍 Latest deploy log	https://app.netlify.com/projects/salsa-rs/deploys/6872f3beef22be000857eba8

codspeed-hq · 2025-07-10T04:33:15Z

CodSpeed Performance Report

Merging #935 will degrade performances by 5.56%

_{Comparing ChayimFriedman2:size-analysis-improved (d0ceeb4) with master (d28d66b)}

Summary

❌ 1 regressions
✅ 11 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`BASE`	`HEAD`	Change
❌	`amortized[InternedInput]`	3.4 µs	3.6 µs	-5.56%

ChayimFriedman2 · 2025-07-10T04:37:14Z

How exactly does this regress performance?

MichaReiser · 2025-07-10T05:49:19Z

I think it would be great if we can split this PR. 3. is fairly uncontroversial. The use cases for 1 and 2 are less clear to me.

For 1. can you give me an example on what this is uesd for? I'm asking because adding extra arguments makes using it in combination with getsize2 more annoying because it's no longer possible to use the trait method directly

For 2. It prefer an enum as argument rather than a bool. It makes it more expressive what true or false means.

ChayimFriedman2 · 2025-07-10T06:05:13Z

For 1. can you give me an example on what this is uesd for? I'm asking because adding extra arguments makes using it in combination with getsize2 more annoying because it's no longer possible to use the trait method directly

For example, I want to analyze memory usage in rust-analyzer per type and not per query (it is more convenient to draw conclusions this way). This provides a way to do that and more (e.g. what I'm doing now, checking the effect of rust-lang/rust-analyzer#18403) with minimum effort. The disturbance you described is minimal because you can have a single method fn heap_size<T: GetSize>(value: &T, _data: &mut dyn Any) -> usize { ... } and pass it everywhere.

For 2. It prefer an enum as argument rather than a bool. It makes it more expressive what true or false means.

Fair enough.

ChayimFriedman2 · 2025-07-10T06:56:23Z

Edited to use an enum instead of bool.

MichaReiser · 2025-07-10T08:51:13Z

For example, I want to analyze memory usage in rust-analyzer per type and not per query (it is more convenient to draw conclusions this way). This provides a way to do that and more (e.g. what I'm doing now, checking the effect of rust-lang/rust-analyzer#18403) with minimum effort. The disturbance you described is minimal because you can have a single method

I don't think I fully understand the use case. The table in the linked issue is grouped by query? I think we also have an API to iterate over all memos. Could your impelmentation use that instead?

ChayimFriedman2 · 2025-07-10T09:13:58Z

I want e.g. if I have struct Foo(Vec<Bar>), to not have all memory attributed to Foo, but the Vec to be attributed to Bar. This is because the type hierarchy in rust-analyzer is complex and query-level filtering doesn't tell me a lot.

Currently I have nowhere to store this attribute information (besides a static, which is really ugly). This is what I want to change.

ChayimFriedman2 · 2025-07-10T11:01:43Z

When using this in rust-analyzer I noticed there is still no way to have heap_size for the automatically-generated interned for memos with more than one parameter, but I'll fix that in another PR.

MichaReiser · 2025-07-10T11:34:26Z

Currently I have nowhere to store this attribute information (besides a static, which is really ugly). This is what I want to change.

I see. This seems very specific. Have you considered using a thread local in your code? That should give you the same without that we need to change salsa for this less common case.

ChayimFriedman2 · 2025-07-10T12:02:18Z

I can do that, but it will be cleaner if it will be a parameter to the function. And this enables a lot of patterns (basically, you can apply any operation to the whole database, you can even select the operation at runtime should there be such need), and the cost is minimal - if you don't use this feature, you need to replace heap_size with a generic function that has the extra parameter, and pass &mut (). And also from a maintenance of Salsa POV, the cost is only an additional parameter in few places.

And FWIW, even with get-size2 itself you may want to share the tracker between queries (e.g. if queries share Arcs) - that's not possible to do without this PR.

And in general, passing an addition data parameter to a callback is a common pattern (especially in C, where there are no closures, but here we can't use closures so it's back to the C idiom).

MichaReiser · 2025-07-10T12:12:38Z

I can do that, but it will be cleaner if it will be a parameter to the function. And this enables a lot of patterns (basically, you can apply any operation to the whole database, you can even select the operation at runtime should there be such need), and the cost is minimal - if you don't use this feature, you need to replace heap_size with a generic function that has the extra parameter, and pass &mut (). And also from a maintenance of Salsa POV, the cost is only an additional parameter in few places.

I'm not oposed to having such a function that allows analysing the entire db. It only feels to me that this goes beyond what heap_size is intended for and we should maybe consider using a different design for it / generalizing the functionality (and renaming it!) instead of making those changes in place

ChayimFriedman2 · 2025-07-10T12:25:19Z

The design seems perfect to me. The alternative design will be to have a trait for that analysis that is called automatically (if it is implemented, via autoref specialization or explicitly specifying) but that means it'll be foreign for the Salsa user, at least for my framework of size analysis this will cause major trouble, as I need a lot of impls for third-party types.

The naming... I agree it is not ideal. In fact, I originally renamed this function to size_of() (to signify it may do more work than just returning the heap size), but I reverted that change because that name didn't look better for me. In my local prototypes of such feature (before it was implemented in Salsa) I was calling it size_analysis(). But maybe it should have an even more general name, as it can be used for non-size things too (although I have hard time imagining what they could look like). There is also a tension between the fact that it can be used like a simple size counter by returning a number, or it can ignore that completely and do other things. Maybe that suggests we should split the feature into a simple heap_size() and a more complicated function that can do other things, but then I don't see why the simpler function needs to exist, as the only thing that cannot be implemented with the more general framework is calculating the Salsa overhead. And maybe that's indeed what we should do - return the Salsa overhead and only it, and also provide a way to run a custom analysis on ingredient values.

MichaReiser · 2025-07-10T12:37:53Z

Either way. I suggest splitting the PR. I'm on board with 2 and 3 but still think that 1 is looking for a generic traversal logic rather than what get size provides but I'm interested to hear from others.

ChayimFriedman2 · 2025-07-10T13:56:53Z

Okay. I've omitted 1 from the PR and will open a Zulip discussion.

MichaReiser · 2025-07-10T14:08:34Z

@ibraheemdev would you mind taking a look at the changes?

ibraheemdev · 2025-07-10T14:51:39Z

This looks fine to me, but it could use some tests for both the panicking behaviour and the new heap_size attribute on structs.

ibraheemdev · 2025-07-10T14:53:33Z

components/salsa-macro-rules/src/setup_input_struct.rs

@@ -89,6 +92,12 @@ macro_rules! setup_input_struct {

                type Revisions = [$zalsa::Revision; $N];
                type Durabilities = [$zalsa::Durability; $N];
+
+                $(
+                    fn heap_size(value: &Self::Fields, _panic_if_missing: $zalsa::PanicIfHeapSizeMissing) -> usize {


Instead of threading the PanicIfHeapSizeMissing everywhere, you could add a has_heap_size method to Ingredient and do a single check that all ingredients return true if PanicIfHeapSizeMissing::Yes is set. You'd need a HAS_HEAP_SIZE constant on all the configurations, but that should be simple to add.

Or you could have Ingredient::memory_usage return whether or not the heap size was accounted for, and have Configuration::heap_size default to None instead of 0. Either way seems fine.

I actually like the current way more. It is simpler (especially if we want to show the name of the problematic ingredient), and the threading is not that bad. Do you have a preference?

It's fine if you think that's easier, just an idea. It might be nice to add an is_yes helper method to convert to a boolean though.

Or you could have Ingredient::memory_usage return whether or not the heap size was accounted for,

I'd prefer this approach as it's more flexible (if I understand it correctly). It leaves the decision of what to do if heap size isn't implemented to the caller. ty includes the memory analysis in production builts where panicking isn't an option, but we probably want to log a message if a heap size implementation is missing instead. This isn't possible with PanicIfHeapSizeMissing but is possible if the function returns Option

I don't think @ibraheemdev meant to expose that, just as an internal implementation detail.

If we do expose that, how do we? Do we change IngredientInfo::size_of_fields to Option<usize>?

That's what I had in mind but I see the API works slightly different than I thought. My assumption was that it returns an Iteratror over all memos without doing any aggregation but MemoInfo does a first level of aggregation. @ibraheemdev is this something we could change (return a list of SlotInfo instead).

I also think this requires splitting size_of_fields into two:

fields_stack_size which returns usize

fields_heap_size which returns Option<usize>

Why is the memo aggregation a problem here? If we split size_of_fields I think that should be sufficient to detect whether or not the heap size was tracked for a given memo type.

Why is the memo aggregation a problem here?

The API technically allows for one memo to return Some but None for another (both of the same ingredient) which is a bit awkward to handle. This shouldn't be possible but it isn't something we can express easily in the current API.

ChayimFriedman2 · 2025-07-10T22:15:21Z

I added tests, but I can't make sense of the numbers.

ibraheemdev · 2025-07-10T22:27:24Z

tests/memory-usage.rs

-    let input3 = MyInput::new(&db, 3);
+    let input1 = MyInput::new(&db, String::with_capacity(100));
+    let input2 = MyInput::new(&db, String::with_capacity(100));
+    let input3 = MyInput::new(&db, String::with_capacity(100));


You should make the fields different to ensure they aren't all interned to the same ID (the interned/tracked structs store the field not the entire input).

Okay I did that, but I still can't make sense of the numbers.

Which part doesn't make sense? It looks fine to me. The reason for the metadata size increase is alignment (previously the u32 field could fit into the padding).

Why is it 522? It should be 400, no?

capacity can be larger than the String::length. You Can use shrink_to_fit to shrink the String to its length which should get you closer to the expected value

The capacity is actually exact here, it just also includes the stack size of the strings: 250+150+50+(24*3) = 522.

Ah, got it.

1. Include an option `panic_if_missing` that will panic if there is an ingredient with no `heap_size()` defined, to ensure coverage. 2. Add `heap_size()` to tracked structs, interneds an inputs.

ChayimFriedman2 force-pushed the size-analysis-improved branch from 89c8a47 to 0e29abf Compare July 10, 2025 06:56

ChayimFriedman2 marked this pull request as draft July 10, 2025 10:59

ChayimFriedman2 marked this pull request as ready for review July 10, 2025 11:00

ChayimFriedman2 force-pushed the size-analysis-improved branch from 0e29abf to 9d0acfb Compare July 10, 2025 13:56

ibraheemdev reviewed Jul 10, 2025

View reviewed changes

ChayimFriedman2 force-pushed the size-analysis-improved branch from 9d0acfb to dc4d5c7 Compare July 10, 2025 22:14

ibraheemdev reviewed Jul 10, 2025

View reviewed changes

Improve unstable size analysis support

d0ceeb4

1. Include an option `panic_if_missing` that will panic if there is an ingredient with no `heap_size()` defined, to ensure coverage. 2. Add `heap_size()` to tracked structs, interneds an inputs.

ChayimFriedman2 force-pushed the size-analysis-improved branch from dc4d5c7 to d0ceeb4 Compare July 12, 2025 23:46

feat: Improve unstable size analysis support #935

Are you sure you want to change the base?

feat: Improve unstable size analysis support #935

Uh oh!

Conversation

ChayimFriedman2 commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

netlify bot commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for salsa-rs canceled.

Uh oh!

codspeed-hq bot commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging #935 will degrade performances by 5.56%

Summary

Benchmarks breakdown

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

MichaReiser commented Jul 10, 2025

Uh oh!

ibraheemdev commented Jul 10, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ChayimFriedman2 commented Jul 10, 2025

Uh oh!

ibraheemdev Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MichaReiser Jul 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

ChayimFriedman2 commented Jul 10, 2025 •

edited

Loading

netlify bot commented Jul 10, 2025 •

edited

Loading

codspeed-hq bot commented Jul 10, 2025 •

edited

Loading

MichaReiser commented Jul 10, 2025 •

edited

Loading

ibraheemdev Jul 10, 2025 •

edited

Loading

MichaReiser Jul 13, 2025 •

edited

Loading