ChatQnA Example with OpenAI-Compatible Endpoint #2091

Status: Open. Wants to merge 53 commits into base: main.

Conversation

edlee123 (Contributor)

Description

Allows ChatQnA to be used with thousands of OpenAI-compatible endpoints, e.g. OpenRouter.ai, Hugging Face, and Denvr, and improves the developer experience so OPEA can be spun up quickly even in low-resource environments.

Key Changes Made:

  • Created ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md: instructions to spin up the example.
  • Created ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml: replaces vLLM with an OpenAI-like endpoint.
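The endpoint swap in the compose file might look roughly like the following sketch. This is an illustration only: the service name, image tag, and environment variable names are assumptions, not the actual contents of compose_endpoint_openai.yaml.

```yaml
# Hypothetical excerpt: instead of running a local vLLM container, the LLM
# microservice is pointed at a remote OpenAI-compatible API via environment
# variables supplied at deploy time.
services:
  llm:
    image: opea/llm-textgen:latest          # illustrative image tag
    environment:
      OPENAI_API_BASE: ${REMOTE_ENDPOINT}   # e.g. https://openrouter.ai/api/v1
      OPENAI_API_KEY: ${OPENAI_API_KEY}     # key for the hosted endpoint
      LLM_MODEL_ID: ${LLM_MODEL_ID}         # e.g. anthropic/claude-3.7-sonnet
```

Because only environment variables change, the same ChatQnA pipeline can target any OpenAI-compatible provider without a local GPU-backed serving container.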

Also:

  • Fixed the align_generator function to properly detect and skip chunks whose content is null in OpenAI-like endpoint responses. Previously, the raw null JSON was shown in the UI.
  • Added better error handling and debug logging for easier troubleshooting of endpoint issues.
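The null-chunk fix described above can be sketched roughly as follows. Function, variable, and field names here are assumptions based on the OpenAI streaming chunk format, not the exact code in chatqna.py:

```python
import json

def align_generator_sketch(raw_chunks):
    """Yield only SSE chunks that carry real content.

    OpenAI-compatible endpoints stream lines like
        data: {"choices": [{"delta": {"content": "..."}}]}
    and the first/last chunks often carry "content": null.
    """
    for line in raw_chunks:
        line = line.strip()
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        try:
            chunk = json.loads(payload)
        except json.JSONDecodeError:
            continue  # skip malformed chunks instead of surfacing raw JSON
        choices = chunk.get("choices") or []
        content = choices[0].get("delta", {}).get("content") if choices else None
        if content is None:
            continue  # previously these null chunks showed up in the UI
        yield content

# Simulated stream: a null "role" chunk, two content chunks, then [DONE].
stream = [
    'data: {"choices": [{"delta": {"role": "assistant", "content": null}}]}',
    'data: {"choices": [{"delta": {"content": "Hello"}}]}',
    'data: {"choices": [{"delta": {"content": " world"}}]}',
    'data: [DONE]',
]
print("".join(align_generator_sketch(stream)))  # -> Hello world
```

The key point is filtering on the parsed content field rather than forwarding every raw chunk downstream, so null and malformed chunks never reach the UI.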

Issues

N/A

Type of change

List the type of change as below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

N/A

Tests

Tested with the following endpoints and models:

  • OpenRouter.ai: anthropic/claude-3.7-sonnet
  • Denvr: meta-llama/Llama-3.1-70B-Instruct
  • Hugging Face Inference Endpoint: microsoft/phi-4

edlee123 and others added 30 commits June 24, 2025 18:08
…w null json. Also improved exception handling and logging

Signed-off-by: Ed Lee <[email protected]>
Integrate MultimodalQnA set_env to ut scripts.
Add README.md for UT scripts.

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
…nt (opea-project#1996)

Signed-off-by: Mustafa <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Ed Lee <[email protected]>
Signed-off-by: Yi Yao <[email protected]>
Co-authored-by: Copilot <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
Signed-off-by: ZePan110 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Ed Lee <[email protected]>
…archQnA and Translation (opea-project#2038)

update secrets token name for ProductivitySuite, RerankFinetuning, SearchQnA and Translation
Fix shellcheck issue

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
…rkflowExecAgent (opea-project#2039)

update secrets token name for InstructionTuning, MultimodalQnA and WorkflowExecAgent
Fix shellcheck issue

Signed-off-by: ZePan110 <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
Copilot AI review requested due to automatic review settings, June 24, 2025 23:34

github-actions bot commented Jun 24, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Copilot AI (Contributor) left a comment

Pull Request Overview

This pull request introduces an OpenAI-compatible endpoint for ChatQnA, updates the deployment documentation, and includes improvements in error handling and logging.

  • Added new Docker Compose file (compose_endpoint_openai.yaml) to support OpenAI-like endpoints.
  • Updated README files for clearer deployment instructions and configuration details.
  • Fixed the align_generator function in chatqna.py to better handle and filter null content chunks.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| CodeGen/docker_compose/intel/cpu/xeon/README.md | Updated docker compose command and environment variable documentation; notes a markdown table formatting issue. |
| ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml | Added new compose file for OpenAI-compatible endpoint integration. |
| ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md | New documentation with detailed instructions for deploying ChatQnA using the new endpoint. |
| ChatQnA/chatqna.py | Improved logging and error handling in input/output alignment and generator functions. |
Comments suppressed due to low confidence (1)

CodeGen/docker_compose/intel/cpu/xeon/README.md:111

  • The table row for LLM_ENDPOINT appears to be broken into two columns due to an unintended pipe character. Please merge the content into a single cell to ensure the URL displays correctly.
| `LLM_ENDPOINT`                          | Internal URL for the LLM serving endpoint (used by `codegen-llm-server`). Configured in `compose.yaml`.             | `http://codegen-vllm                           | tgi-server:9000/v1/chat/completions` |

edlee123 requested a review from letonghan, July 2, 2025 05:09
edlee123 (Contributor, Author) commented Jul 2, 2025

Hi @yao531441 @letonghan if either of you can, I'm looking for one more reviewer please :)
