fixed super.init in SentenceTransformerPatched #617

raul3820 · 2025-07-10T08:07:55Z

Related Issue

The problem stems from the __init__ method of the parent class, SentenceTransformer. This method uses self.__class__.__name__ to set a configuration value.

When subclass SentenceTransformerPatched calls super().__init__(), self.__class__.__name__ evaluates to "SentenceTransformerPatched". The parent class's internal logic is not designed for this and expects the name to be "SentenceTransformer", which leads to incorrect config and problems like this:

WARNING: No sentence-transformers model found with name temp/models/model_1752105604572. Creating a new one with mean pooling.

The fix avoids calling super().__init__() directly. Instead, it creates a temporary instance of the base SentenceTransformer class, which initializes correctly. It then copies the state from this temporary object to the current instance (self), effectively bypassing the problematic check while still properly initializing the object.

Checklist

I have read the CONTRIBUTING guidelines.
I have added tests to cover my changes.
I have updated the documentation (docs folder) accordingly.

Additional Notes

I tried to run the poetry tests but got stuck here:

Installing dependencies from lock file

Finding the necessary packages for the current system

Package operations: 167 installs, 1 update, 0 removals, 3 skipped

  - Installing certifi (2024.8.30): Pending...
Checking if keyring is available
  - Installing charset-normalizer (3.4.0): Pending...
  - Installing frozenlist (1.4.1): Pending...
  - Installing idna (3.10): Pending...
  - Installing multidict (6.1.0): Pending...
  - Installing nvidia-nvjitlink-cu12 (12.4.127): Pending...
  - Installing propcache (0.2.0): Pending...
  - Installing urllib3 (2.2.3): Pending...
[keyring:keyring.backend] Loading KWallet
[keyring:keyring.backend] Loading SecretService
[keyring:keyring.backend] Loading Windows
[keyring:keyring.backend] Loading chainer
[keyring:keyring.backend] Loading libsecret
[keyring:keyring.backend] Loading macOS
Using keyring backend 'SecretService Keyring'

greptile-apps

PR Summary

Fixed initialization issue in SentenceTransformerPatched where super().init() caused incorrect configuration due to class name checks in parent class.

Implements workaround in libs/infinity_emb/infinity_emb/transformer/embedder/sentence_transformer.py by creating temporary SentenceTransformer instance and copying state
Resolves warning 'No sentence-transformers model found' by avoiding direct super() call that triggered name-based config checks
Potentially introduces future maintenance challenges due to state copying approach
Tests are currently incomplete due to poetry dependency installation issues

_{1 file reviewed, 1 comment}
_{Edit PR Review Bot Settings | Greptile}

greptile-apps · 2025-07-10T08:08:37Z

libs/infinity_emb/infinity_emb/transformer/embedder/sentence_transformer.py

+        temp_model = SentenceTransformer(**dict(
+            model_name_or_path=engine_args.model_name_or_path,
            revision=engine_args.revision,
            trust_remote_code=engine_args.trust_remote_code,
            device=ls.device_placement,
            model_kwargs=model_kwargs,
-        )
+        ))
+        self.__dict__.update(temp_model.__dict__)
        self.to(ls.device_placement)


style: This pattern of creating a temp instance and copying state bypasses normal inheritance. Consider adding comment explaining why dictionary update is safer than inheritance here

Suggested change

temp_model = SentenceTransformer(**dict(

model_name_or_path=engine_args.model_name_or_path,

revision=engine_args.revision,

trust_remote_code=engine_args.trust_remote_code,

device=ls.device_placement,

model_kwargs=model_kwargs,

)

))

self.__dict__.update(temp_model.__dict__)

self.to(ls.device_placement)

# Create temporary model instance and copy its state to bypass SentenceTransformer's

# __init__ which doesn't support our extended configuration. This allows us to

# customize initialization while preserving all the model's internal state.

temp_model = SentenceTransformer(**dict(

model_name_or_path=engine_args.model_name_or_path,

revision=engine_args.revision,

trust_remote_code=engine_args.trust_remote_code,

device=ls.device_placement,

model_kwargs=model_kwargs,

))

self.__dict__.update(temp_model.__dict__)

self.to(ls.device_placement)

fixed super.init in SentenceTransformerPatched

02653d3

greptile-apps bot reviewed Jul 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fixed super.init in SentenceTransformerPatched #617

fixed super.init in SentenceTransformerPatched #617

Uh oh!

raul3820 commented Jul 10, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fixed super.init in SentenceTransformerPatched #617

Are you sure you want to change the base?

fixed super.init in SentenceTransformerPatched #617

Uh oh!

Conversation

raul3820 commented Jul 10, 2025

Related Issue

Checklist

Additional Notes

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

Uh oh!

greptile-apps bot Jul 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant