perf(cli): speed up catalog merge key partitioning #2540
almsh wants to merge 2 commits into lingui:main
Conversation
Codecov Report ✅ All modified and coverable lines are covered by tests.

@@           Coverage Diff            @@
##             main    #2540      +/- ##
===========================================
+ Coverage   77.05%   89.39%   +12.34%
===========================================
  Files          84      118       +34
  Lines        2157     3385     +1228
  Branches      555     1001      +446
===========================================
+ Hits         1662     3026     +1364
+ Misses        382      324       -58
+ Partials      113       35       -78
Pull request overview
Improves mergeCatalog performance in the CLI by optimizing how catalog keys are partitioned during merges, which is a hot path for large catalogs.
Changes:
- Replaces repeated `Array.includes` scans with `Set.has` lookups when both catalogs are non-empty.
- Adds a fast path for cases where either the previous or next catalog has no keys, avoiding unnecessary `Set` creation.
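Taken together, the two changes above could look roughly like the following sketch (illustrative names only; this is not the actual lingui implementation):

```typescript
// Hypothetical sketch of the fast path plus Set-based key partitioning
// (names like partitionKeys are made up for illustration).
function partitionKeys(prevKeys: string[], nextKeys: string[]) {
  // Fast path: one side is empty, so there is nothing to intersect —
  // skip Set creation entirely (initial extraction / fully-obsolete cases).
  if (prevKeys.length === 0) {
    return { obsoleteKeys: [], newKeys: nextKeys }
  }
  if (nextKeys.length === 0) {
    return { obsoleteKeys: prevKeys, newKeys: [] }
  }

  // General case: O(1) Set.has lookups instead of O(n) Array.includes scans.
  const nextSet = new Set(nextKeys)
  const prevSet = new Set(prevKeys)
  return {
    obsoleteKeys: prevKeys.filter((key) => !nextSet.has(key)),
    newKeys: nextKeys.filter((key) => !prevSet.has(key)),
  }
}
```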
Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.
Description
In our project, we have around 20k messages in Lingui catalogs, and this change cuts our extraction time nearly in half.
`mergeCatalog` currently partitions catalog keys with repeated `Array.includes` checks. For large catalogs, this becomes expensive because each `includes` call may scan the other key array. This PR uses `Set.has` when both catalogs contain keys, reducing key partitioning from O(N * P) repeated scans to O(N + P) Set construction and lookup work. It also keeps a fast path for empty catalogs, avoiding unnecessary Set creation during initial extraction or fully-obsolete cases.

Operation Count Estimate
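A minimal sketch contrasting the two strategies (simplified, with hypothetical function names; not the actual `mergeCatalog` source):

```typescript
// Old strategy (sketch): repeated Array.includes scans, O(N * P).
function partitionWithIncludes(prevKeys: string[], nextKeys: string[]) {
  const obsoleteKeys = prevKeys.filter((key) => !nextKeys.includes(key)) // each call scans nextKeys
  const newKeys = nextKeys.filter((key) => !prevKeys.includes(key)) // each call scans prevKeys
  return { obsoleteKeys, newKeys }
}

// New strategy (sketch): one-time Set construction, O(N + P).
function partitionWithSets(prevKeys: string[], nextKeys: string[]) {
  const nextSet = new Set(nextKeys)
  const prevSet = new Set(prevKeys)
  const obsoleteKeys = prevKeys.filter((key) => !nextSet.has(key))
  const newKeys = nextKeys.filter((key) => !prevSet.has(key))
  return { obsoleteKeys, newKeys }
}
```

Both functions partition the same way; only the lookup cost per key changes.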
Let:

- P = number of keys in the previous catalog
- N = number of keys in the next (freshly extracted) catalog

Current implementation: roughly N * P comparison steps, since each `Array.includes` call scans the other catalog's key array.

Optimized version: roughly N + P operations — one pass to build each Set, plus a constant-time `Set.has` lookup per key.
In our case, both the previous and next catalogs contain around 20k messages. With the current implementation, each extraction can require hundreds of millions of array comparison steps while partitioning catalog keys. With this change, the same step is reduced to roughly 100k Set build and lookup operations.
This keeps the behavior unchanged while making large catalog merges significantly cheaper.
One question about the checklist: since this change keeps the existing behavior covered by the current `mergeCatalog` tests and only changes the key partitioning strategy, would you like me to add a small performance check or benchmark test for this path?

Types of changes
Checklist