feat(tests): Update benchmark test to query newly introduced `max_code_size()` from fork config #1649

jochem-brouwer · 2025-05-25T19:41:42Z

🗒️ Description

This PR expands the benchmark test that queries as many other accounts as possible (via EXTCODE* and CALL* operations) so that it reads the max code size setting from the fork.

Adds max_code_size() to forks
Adds max_initcode_size() to forks
Edits benchmark test to calculate a correct gas limit w.r.t. deploying the to-be-attacked contracts vs. the actual contract

Note that the deposit costs under EIP-7907 are (relatively) much higher than the original costs. This includes the 200 gas/byte code deposit cost as well as the quadratic memory expansion cost.

The cost of querying one such contract (the "loop cost" of the benchmark) is calculated as 2687 gas/contract. The deposit, however, costs at least 52M gas due to the 200 gas/byte deposit cost for the 0x40000 bytes. So depositing such a contract costs about 20_000 times more than querying it 👀
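The arithmetic behind these figures can be checked with a short sketch. The constants are the numbers quoted above; the function names are illustrative and not part of the test framework, and the memory figure assumes memory is expanded to exactly the size of the returned code, which is only a lower bound.

```python
# Figures quoted in the description above.
CODE_DEPOSIT_GAS_PER_BYTE = 200    # gas per byte of deployed code
EIP_7907_MAX_CODE_SIZE = 0x40000   # 262_144 bytes
LOOP_GAS_PER_CONTRACT = 2687       # gas to query one contract in the benchmark loop


def code_deposit_gas(code_size: int) -> int:
    """Per-byte code deposit cost only; a lower bound on the deployment cost."""
    return CODE_DEPOSIT_GAS_PER_BYTE * code_size


def memory_expansion_gas(size_in_bytes: int) -> int:
    """Standard EVM memory cost: 3 gas per 32-byte word plus words**2 // 512."""
    words = (size_in_bytes + 31) // 32
    return 3 * words + words * words // 512


deposit = code_deposit_gas(EIP_7907_MAX_CODE_SIZE)
memory = memory_expansion_gas(EIP_7907_MAX_CODE_SIZE)
ratio = deposit / LOOP_GAS_PER_CONTRACT

print(f"code deposit:     {deposit:_} gas")
print(f"memory expansion: {memory:_} gas")
print(f"deposit / query:  {ratio:_.0f}x")
```

The code deposit term dominates; the quadratic memory term is small by comparison at this size, but it grows faster as the limit is raised.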

🔗 Related Issues

feat(tests): add benchmark for the worst initcode jumpdest analysis #1646 can use the max_initcode_size() as well
~~Expands the benchmark to also include the bumped sizes in https://eips.ethereum.org/EIPS/eip-7907~~
The benchmark reads the max initcode/code size from the fork config, so to test bumped limits such as those in EIP-7907, the max_initcode_size or max_code_size for Osaka has to be manually edited to return that value.
Part of the Osaka tracker: Osaka CFI'd Tracker Issue #1509

✅ Checklist

All: Set appropriate labels for the changes.
All: Considered squashing commits to improve commit history.
All: Added an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
Tests: All converted JSON/YML tests from ethereum/tests have been added to converted-ethereum-tests.txt.
Tests: A PR with removal of converted JSON/YML tests from ethereum/tests has been opened.
Tests: Included the type and version of evm t8n tool used to locally execute test cases: e.g., ref with commit hash or geth 1.13.1-stable-3f40e65.
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry to the list.

jochem-brouwer · 2025-05-26T03:33:49Z

@jsign I'm coming to this test from the EIP-7907 discussions (not zk related). I'm wondering how this benchmark should be run. It includes the pre-setup of deploying these large contracts. It is clear that the final block contains the actual data we care about, but I'm not sure if test executors have a direct way to measure this. Should I just hack in a performance timer to measure the time it takes to execute the last block?

src/ethereum_test_forks/base_fork.py

src/ethereum_test_forks/forks/forks.py

tests/zkevm/test_worst_bytecode.py

jsign · 2025-05-26T16:29:51Z

@jsign I'm coming to this test from the EIP-7907 discussions (not zk related). I'm wondering how this benchmark should be run. It includes the pre-setup of deploying these large contracts. It is clear that the final block contains the actual data we care about, but I'm not sure if test executors have a direct way to measure this. Should I just hack in a performance timer to measure the time it takes to execute the last block?

AFAIK, test executors today have no special handling for benchmark-like tests, so if you want to measure wall-clock time you probably have to do a hack, yes.
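A minimal sketch of such a timer hack, assuming the executor exposes some callable that runs a single block. Both `time_last_block` and the `execute_block` parameter are hypothetical names for illustration; no existing executor API is implied.

```python
import time
from typing import Callable, Sequence


def time_last_block(
    blocks: Sequence[object],
    execute_block: Callable[[object], None],
) -> float:
    """Run every block, but return the wall-clock seconds of the final one only."""
    if not blocks:
        raise ValueError("need at least one block")

    # Setup blocks (e.g. deploying the large contracts) run untimed.
    for block in blocks[:-1]:
        execute_block(block)

    # Only the last block carries the benchmark payload.
    start = time.perf_counter()
    execute_block(blocks[-1])
    return time.perf_counter() - start


# Usage with a stand-in executor that just records what it was given.
executed: list = []
elapsed = time_last_block(["setup-1", "setup-2", "benchmark"], executed.append)
```

`time.perf_counter()` is used because it is monotonic and intended for measuring short durations.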

marioevz

LGTM, it just needs a rebase to fix CI and conflicts.

Thanks for implementing this :)

src/ethereum_test_forks/base_fork.py

jochem-brouwer · 2025-05-29T16:39:39Z

I'll get this one ready for review either tonight or tomorrow 😄 👍 (But also feel free to pick this up and get it over the finish line; the comment about editing the predeploy gas limit back to the original high limit, #1649 (comment), should be taken into account.)

jochem-brouwer · 2025-05-30T22:39:53Z

Have also edited the other tests which now depend on the max code size to read it from the fork.

Could, once this is merged, all other PRs which have the MAX_CODE_SIZE now also read this from the fork?

@jsign also see my comment on the "changed" gas limit here: #1649 (comment). I think it is clearer to make this a formula, but I am also fine with hardcoding a very high gas limit.

@marioevz Typecheck correctly complains that my max_code_size() returns int | None, so comparisons like > do not work. I'm not sure how to handle this, because Frontier for instance has no code size limit, and forks before Shanghai do not have an initcode limit. Hardcoding a super high constant is a possibility, but it feels incorrect. Thoughts? Feel free to push the update here also. (I'll mark this ready for review, since this seems to be the last thing to fix.)

jsign · 2025-05-31T00:46:01Z

@jochem-brouwer, replied in the thread we had! (but overall lgtm).

Could you ping me before merging (or when this is fully ready) so I can fill these changed tests and double-check the cycles, as a last check that there is no difference?

Could, once this is merged, all other PRs which have the MAX_CODE_SIZE now also read this from the fork?

Agree, I can do that after this is merged.

jochem-brouwer · 2025-06-03T00:26:39Z

I want to get this PR merged, but I need to solve this int | None problem: marking Frontier/Chainstart to have a code size limit is incorrect, but marking it as None will make the typecheck complain (because it's not a number). Should we handle this in tests to default to some reasonable value? For instance make all code sizes default to the current code size of EIP-170 (24 KiB)

marioevz · 2025-06-03T01:05:14Z

I want to get this PR merged, but I need to solve this int | None problem: marking Frontier/Chainstart to have a code size limit is incorrect, but marking it as None will make the typecheck complain (because it's not a number). Should we handle this in tests to default to some reasonable value? For instance make all code sizes default to the current code size of EIP-170 (24 KiB)

Makes sense to me, default from frontier should be 24 KiB, and every test that uses this parameter should take this into account, although I doubt it will even be a problem.
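A minimal sketch of the agreed approach, using hypothetical stand-in classes rather than the real fork definitions: the earliest fork returns the first limit that was ever set, so the return type stays a plain int and the type checker accepts comparisons. The 0x6000 and 0xC000 values are the EIP-170 and EIP-3860 limits, and 0x40000 is the EIP-7907 size mentioned in the description; the class and helper names are assumptions.

```python
class Frontier:
    """Hypothetical stand-in for the first fork in the hierarchy."""

    @classmethod
    def max_code_size(cls) -> int:
        # No limit is enforced this early; default to the first limit
        # ever set (EIP-170, 24 KiB) so callers always receive an int.
        return 0x6000

    @classmethod
    def max_initcode_size(cls) -> int:
        # Same idea: default to the first initcode limit (EIP-3860).
        return 0xC000


class RaisedLimitFork(Frontier):
    """Hypothetical later fork that bumps the code size limit."""

    @classmethod
    def max_code_size(cls) -> int:
        return 0x40000


def fits_code_size(fork: type[Frontier], code: bytes) -> bool:
    """Comparison type-checks because max_code_size() is int, not int | None."""
    return len(code) <= fork.max_code_size()
```

The trade-off is that the value reported for the earliest forks is a convenience default rather than a rule those forks enforce, so tests that care about the difference need to account for it.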

jochem-brouwer · 2025-06-03T01:47:00Z

Before this gets merged, could @jsign give this an explicit ✔️ after:

Could you ping me before merging (or when this is fully ready) so I can fill these changed tests and double-check the cycles, as a last check that there is no difference?

To see if I did not mess up any tests 😄 👍

Ready for review! 🚀

jochem-brouwer · 2025-06-03T01:50:40Z

I also removed the EIP-7907 config from Osaka. For experimenting it is fine, but I don't think we want to activate it by default (?)

jsign · 2025-06-03T11:36:11Z

Before this gets merged, could @jsign give this an explicit ✔️ after:

Could you ping me before merging (or when this is fully ready) so I can fill these changed tests and double-check the cycles, as a last check that there is no difference?

To see if I did not mess up any tests 😄 👍

Ready for review! 🚀

I'll do a full run of current master vs this branch to compare them and report back!

jsign · 2025-06-03T12:33:58Z

@jochem-brouwer, something feels odd: filling tests in this branch seems to take way longer than in main (still waiting for it to finish).

What I'm using to fill: uv run fill --from=Cancun --until=Cancun -m "zkevm and blockchain_test" --block-gas-limit 36000000 -n auto (use -n 8 if you don't have a lot of RAM)

I think that shouldn't be the case, no?

jochem-brouwer · 2025-06-03T17:28:12Z

Definitely not! I do not have time to look at it today, but will pick this up tomorrow.

What filler are you using? Standard EELS? How many tests does this pick up?

The gas limit is set to 36M, which is reasonable. Might this therefore also have uncovered a performance degradation in the filler?

However, my hunch is that it has to do with the tests where I raised the gas limit above the setting you provided. Maybe I messed up the calculation and it generates way too many contracts, which the raised gas limit makes possible? It likely has to do with a test with elevated gas limits.

jsign · 2025-06-03T17:59:29Z

Definitely not! I do not have time to look at it today, but will pick this up tomorrow.

What filler are you using? Standard EELS? How many tests does this pick up?

I'm using geth as a filler, but in theory it shouldn't matter since I also used it for the main filling.

However, my hunch is that it has to do with the tests where I raised the gas limit above the setting you provided. Maybe I messed up the calculation and it generates way too many contracts, which the raised gas limit makes possible? It likely has to do with a test with elevated gas limits.

Yep, I suspect the same. It is probably related to the only test that now calculates the number of contracts differently; maybe something is odd there!

tests/zkevm/test_worst_bytecode.py

jsign

Compared cycles and all looks good 👍 (assuming latest Mario code change is applied)

jochem-brouwer · 2025-06-04T05:29:19Z

Once CI passes I will merge this one; the rebase conflicts around the max code sizes are getting annoying. After the merge, please read the max code size from the fork instead of using a constant (this will likely be enforced on existing files, but should also be done in new test files).
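On the test side, reading the limit from the fork might look like the sketch below. `HasMaxCodeSize`, `DemoFork`, and `pad_to_max_code_size` are hypothetical names for illustration; only the `max_code_size()` method name comes from this PR, and whether it is called on a class or an instance here is an assumption.

```python
from typing import Protocol


class HasMaxCodeSize(Protocol):
    """Anything that reports a max code size, e.g. a fork object."""

    def max_code_size(self) -> int: ...


JUMPDEST = b"\x5b"  # single-byte no-op opcode, handy as padding


def pad_to_max_code_size(fork: HasMaxCodeSize, prefix: bytes) -> bytes:
    """Pad `prefix` with JUMPDESTs up to the fork's limit, not a module constant."""
    max_size = fork.max_code_size()
    if len(prefix) > max_size:
        raise ValueError(f"prefix is {len(prefix)} bytes, limit is {max_size}")
    return prefix + JUMPDEST * (max_size - len(prefix))


class DemoFork:
    """Tiny stand-in fork with an 8-byte limit, for demonstration only."""

    def max_code_size(self) -> int:
        return 8


padded = pad_to_max_code_size(DemoFork(), b"\x00")
```

Because the helper takes the fork as a parameter, the same test body adapts automatically when a fork changes its limit.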

jochem-brouwer changed the title ~~Bench/worst bytecode raised max codes size~~ feat(tests): Update benchmark test to query newly introduced max_code_size() from fork config May 25, 2025

jochem-brouwer mentioned this pull request May 25, 2025

feat(tests): add benchmark for the worst initcode jumpdest analysis #1646

Merged

9 tasks

chfast reviewed May 26, 2025

src/ethereum_test_forks/base_fork.py (outdated)

src/ethereum_test_forks/base_fork.py (outdated)

src/ethereum_test_forks/forks/forks.py (outdated)

jsign reviewed May 26, 2025

tests/zkevm/test_worst_bytecode.py (outdated)

marioevz reviewed May 28, 2025

src/ethereum_test_forks/base_fork.py (outdated)

jochem-brouwer force-pushed the bench/worst_bytecode_raised_max_codes_size branch from 1574a08 to d5b19d2 Compare May 30, 2025 22:35

jochem-brouwer marked this pull request as ready for review May 30, 2025 22:39

chfast mentioned this pull request Jun 2, 2025

zkevm: add SELFDESTRUCT coverage #1678

Merged

jochem-brouwer force-pushed the bench/worst_bytecode_raised_max_codes_size branch from d5e6317 to a14cb3d Compare June 3, 2025 01:45

jochem-brouwer requested review from jsign and marioevz June 3, 2025 01:47

marioevz reviewed Jun 3, 2025

tests/zkevm/test_worst_bytecode.py (outdated)

jsign approved these changes Jun 3, 2025

jochem-brouwer and others added 4 commits June 4, 2025 07:22

feat(forks): introduce max code size / max initcode size (491a210)

feat(tests): read max code size from fork / scale gas limit as appropriate for test (f50f9a4)

Update src/ethereum_test_forks/base_fork.py (caf996e), Co-authored-by: Paweł Bylica

Update src/ethereum_test_forks/base_fork.py (e6218c0), Co-authored-by: Paweł Bylica

jochem-brouwer and others added 11 commits June 4, 2025 07:22

Update src/ethereum_test_forks/forks/forks.py (06e310c), Co-authored-by: Paweł Bylica

feat(tests): fix type error complaints (b20320e)

feat(tests): make tests dependent on MAX_CODE_SIZE now depend on the fork-based max code size (5e823ff)

feat(tests): use a more logical gas limit formula for contract deployment (4ac1b68)

feat(tests): default (init)code size limit to first limit set (4b515b8)

feat(tests): remove EIP 7907 from Osaka (8980cf7)

feat(tests): make typecheck happy (676d87f)

feat(tests): do not use fallback for code size being returned None anymore (08c421c)

feat(tests): read code size from forks from newly added tests after rebase (9961f11)

Update tests/zkevm/test_worst_bytecode.py (3734fc8), Co-authored-by: Mario Vega

feat(tests): insert max_code_size for modarith zkevm test (0ed1dc1)

jochem-brouwer force-pushed the bench/worst_bytecode_raised_max_codes_size branch from 608a439 to 0ed1dc1 Compare June 4, 2025 05:26

jochem-brouwer merged commit ee9b84d into ethereum:main Jun 4, 2025
14 checks passed
