
laetitia-wilhelm (Contributor):

Websearch on RAG output

Input:

  • The output file produced by the RAG model

Output (see the sketch below):

  • Original query
  • Summary of the RAG output
  • Brief answer to the query
  • Detailed answer
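
For illustration only, the saved output record might look roughly like the following sketch; the key names are assumptions, not the module's actual schema.

# Illustrative shape of the websearch output; all key names are assumed,
# not mmore's actual schema.
result = {
    "query": "original user query",
    "rag_summary": "summary of the RAG output",
    "brief_answer": "short answer to the query",
    "detailed_answer": "longer answer combining the RAG output with web findings",
}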

How to Run

python -m mmore websearch --config-file examples/websearch/config.yaml

Key Parameters

  • n_loops: Number of search loops
  • max_searches: Maximum number of sources retrieved per web search

Pipeline Overview

  1. Load Input Data

    • Extract the original query and the initial answer generated by the RAG model.
  2. Generate Initial Summary

    • Summarize the RAG answer with respect to the original query.
  3. Iterative Search and Analysis (Repeated n_loops times; see the sketch after this list)

    • Generate Search Query:
      Formulate a refined search query by combining the original query, current knowledge, and previous findings (if any).
    • Perform Web Search:
      Retrieve relevant web results using DuckDuckGo.
    • Analyze Search Results:
      Use the large language model (LLM) to integrate the new web information with existing knowledge, updating the summary accordingly.
    • Update the current knowledge and previous analysis for the next iteration.
  4. Save Final Results

    • Store the final combined summary derived from both web search and RAG output.
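
The loop in step 3 can be pictured as the minimal sketch below. It is written under assumptions: the function and parameter names are illustrative, and the query-generation, web-search, and analysis steps are passed in as callables rather than naming mmore's actual helpers.

from typing import Callable, List, Optional

def iterative_websearch(
    query: str,
    rag_summary: str,
    generate_query: Callable[[str, str, Optional[str]], str],
    web_search: Callable[[str, int], List[str]],
    analyze: Callable[[str, str, List[str]], str],
    n_loops: int = 2,
    max_searches: int = 10,
) -> str:
    """Refine a RAG summary with n_loops rounds of web search and LLM analysis."""
    knowledge = rag_summary
    previous: Optional[str] = None
    for _ in range(n_loops):
        # Step 3a: formulate a refined search query from the original query,
        # the current knowledge, and the previous round's findings (if any).
        search_query = generate_query(query, knowledge, previous)
        # Step 3b: retrieve up to max_searches relevant web results
        # (DuckDuckGo in this PR).
        results = web_search(search_query, max_searches)
        # Step 3c: let the LLM integrate the new web information and
        # update the summary.
        knowledge = analyze(query, knowledge, results)
        previous = knowledge
    # Step 4: the caller stores this final combined summary.
    return knowledge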

@fabnemEPFL (Collaborator) left a comment:


Great work so far, sounds promising. Some changes are needed.
I have to go, so there will be a follow-up review later today.

@fabnemEPFL (Collaborator) left a comment:


Additional comments

@fabnemEPFL (Collaborator) left a comment:


I will soon make the changes related to the few additional comments I added.

n_loops: int = 2
max_searches: int = 10
llm_config: Dict[str, Any] = field(
    default_factory=lambda: {"llm_name": "gpt-4", "max_new_tokens": 1200}
)
A collaborator commented on this snippet:


Make it a field of type LLMConfig
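
A sketch of what that change might look like; the class name WebsearchConfig, the import path, and the LLMConfig constructor arguments are assumptions, not mmore's actual definitions.

from dataclasses import dataclass, field

from mmore.rag.llm import LLMConfig  # assumed import path for mmore's LLMConfig

@dataclass
class WebsearchConfig:  # hypothetical name for the config class under review
    n_loops: int = 2
    max_searches: int = 10
    # Typed LLMConfig field instead of a raw dict, per the review comment.
    # The constructor arguments mirror the original dict and are assumed to exist.
    llm_config: LLMConfig = field(
        default_factory=lambda: LLMConfig(llm_name="gpt-4", max_new_tokens=1200)
    )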

@fabnemEPFL merged commit 9d0af5c into swiss-ai:master on Sep 4, 2025.
4 checks passed