refactor: clean up default settings & duplicate json prompting. #4

bpblanken · 2025-11-21T23:13:21Z

Unifies the duplicated JSON prompting code and the settings definitions.

github-actions · 2025-11-22T02:08:48Z

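| Package | Line Rate | Health |
| --- | --- | --- |
| . | 99% | ✔ |
| evagg | 89% | ➖ |
| evagg.content | 99% | ✔ |
| evagg.library | 100% | ✔ |
| evagg.llm | 72% | ➖ |
| evagg.prompts | 100% | ✔ |
| evagg.ref | 94% | ✔ |
| evagg.types | 100% | ✔ |
| evagg.utils | 95% | ✔ |
| Summary | 95% (2180 / 2298) | ✔ |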

Minimum allowed line rate is 50%

bpblanken · 2025-11-22T02:11:21Z

lib/evagg/prompts/__init__.py

+    pass
+
+
+class PromptSpec(NamedTuple):


This is forward-looking and doesn't necessarily belong on this PR. I've got some thoughts on a pydantic -> prompt -> pydantic flow. There's some cleanup work ahead, but I think pydantic-based prompting should make everything easier to read and maintain.

I defer to @theferrit32 to review since he has some experience with pydantic on the vrs-python work.
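To make the pydantic -> prompt -> pydantic idea concrete, here is a minimal sketch using only the standard library. The fields on `PromptSpec`, the `render` and `parse` methods, and the `VariantMention` model are all hypothetical (the diff only shows the class declaration), and a dataclass stands in for what would be a pydantic model.

```python
import json
from dataclasses import dataclass, fields
from typing import Any, NamedTuple, Type


@dataclass
class VariantMention:
    """Hypothetical response model; a pydantic BaseModel would play this role."""

    gene: str
    hgvs: str


class PromptSpec(NamedTuple):
    """Hypothetical fields; the PR only shows the class declaration."""

    template: str  # user prompt with str.format placeholders
    response_type: Type[Any]  # model the JSON reply is parsed into

    def render(self, **params: str) -> str:
        # Append the expected keys so the model knows the reply shape.
        keys = ", ".join(f.name for f in fields(self.response_type))
        return self.template.format(**params) + "\nRespond with JSON keys: " + keys

    def parse(self, raw: str) -> Any:
        # Turn the raw JSON reply back into the typed model.
        return self.response_type(**json.loads(raw))
```

The point of pairing the template with its response type is that the prompt text and the parsing code cannot drift apart: both are derived from the same model definition.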

bpblanken · 2025-11-22T02:11:55Z

test/evagg/content/test_content.py

    )

-    prompts = mock_prompt("{invalid json")
+    prompts = mock_prompt({})


We'd need to change how the mocks work to do better than just returning {} from the mocked call to prompt_json.
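One possible direction, sketched under assumptions: mock the raw text call one level below `prompt_json`, so the real parsing still runs and the test can feed it `"{invalid json"` again. `FakeClient`, its `prompt` method, and the `{}` fallback on a parse failure are hypothetical stand-ins, not the actual evagg client.

```python
import asyncio
import json
from typing import Any, Dict
from unittest.mock import AsyncMock


class FakeClient:
    """Hypothetical stand-in for the LLM client; not the evagg class."""

    async def prompt(self, user_prompt: str) -> str:
        raise NotImplementedError("replaced by a mock in tests")

    async def prompt_json(self, user_prompt: str) -> Dict[str, Any]:
        raw = await self.prompt(user_prompt)
        try:
            return json.loads(raw)
        except json.JSONDecodeError:
            # Assumed fallback, matching the {} the current mock returns.
            return {}


def client_with_raw_reply(raw: str) -> FakeClient:
    client = FakeClient()
    # Mock only the raw text call so the real JSON parsing still runs.
    client.prompt = AsyncMock(return_value=raw)
    return client
```

With this shape, `asyncio.run(client_with_raw_reply("{invalid json").prompt_json("p"))` exercises the failure path instead of hardcoding its result.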

bpblanken · 2025-11-22T02:12:31Z

lib/evagg/content/observation.py

        self._variant_factory = variant_factory
        self._variant_comparator = variant_comparator

-    async def _run_json_prompt(


This function was duplicated wholesale; it now lives in only one place!

bpblanken · 2025-11-22T02:13:10Z

lib/evagg/llm/aoai.py

    async def prompt_file(
        self,
-        user_prompt_file: str,
-        system_prompt: Optional[str] = None,


I hardcoded the system prompt. I can't see why we'd need to support more than one here.
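For reference, the hardcoded approach amounts to something like the sketch below. `SYSTEM_PROMPT` and `build_messages` are hypothetical names, and the constant is abbreviated to its first sentence; the full text is the system message asserted in test/evagg/test_llm.py.

```python
from typing import Dict, List

# Abbreviated placeholder; the full text appears in test/evagg/test_llm.py.
SYSTEM_PROMPT = "You are an intelligent assistant to a genetic analyst."


def build_messages(user_prompt: str) -> List[Dict[str, str]]:
    """Hypothetical helper: every request gets the same fixed system message."""
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": user_prompt},
    ]
```

Callers then pass only the user prompt, which is why `system_prompt` can drop out of the `prompt_file` signature.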

bpblanken · 2025-11-22T02:16:16Z

test/evagg/test_llm.py

-            {"role": "system", "content": "Extract field"},
+            {
+                "role": "system",
+                "content": "You are an intelligent assistant to a genetic analyst. Their task is to identify the genetic variant or variants that\nare causing a patient's disease. One approach they use to solve this problem is to seek out evidence from the academic\nliterature that supports (or refutes) the potential causal role that a given variant is playing in a patient's disease.\n\nAs part of that process, you will assist the analyst in collecting specific details about genetic variants that have\nbeen observed in the literature.\n\nAll of your responses should be provided in the form of a JSON object. These responses should never include long,\nuninterrupted sequences of whitespace characters.",


The one thing that may now be broken is the library relevance check. Its prompt is currently written to not force JSON, but both the system prompt and the parsing functionality are changing. I think we might be better off just removing the relevance check from this repo to avoid the maintenance burden.

I don't want to slow things down, but I'd like you to explain this at a future meeting.

Yes, this is a good question. I think Ashley will be helpful here. From what I could tell, the "rare disease relevance" prompting flowed through a different (potentially older?) code path that didn't produce JSON and instead parsed a simple string. I think it's better to be consistent with the output format, but I wasn't even sure we would need a "rare disease relevance" prompt with a bring-your-own-paper approach.

larrybabb

deferring to @theferrit32

chore: refactor default settings

96d2ad9

bpblanken changed the title ~~chore: refactor default settings~~ refactor: clean up default settings Nov 21, 2025

bpblanken added 5 commits November 21, 2025 18:22

this one is text

bab5519

removing json

ff89948

closer

48014ed

json too

8c6a2ca

make linty

cb0d23f

bpblanken changed the title ~~refactor: clean up default settings~~ refactor: clean up default settings & duplicate json prompting. Nov 22, 2025

bpblanken commented Nov 22, 2025

bpblanken assigned theferrit32 Nov 24, 2025

bpblanken requested review from larrybabb and theferrit32 November 24, 2025 16:24

larrybabb reviewed Nov 24, 2025

4 participants
