
Conversation

@lybtt (Contributor) commented Oct 20, 2025

No description provided.

@siyuan-youtu (Contributor) commented:

Thank you for your attention and contribution.

For YouTube-GraphRAG, streaming offers little benefit during graph construction, since that phase inherently requires the complete LLM output before any processing can happen. During inference, the main factors affecting speed and user experience are question decomposition and the multiple retrieval iterations, where the time cost is dominated by the LLM's output generation.

Since these are intermediate steps, switching them to streaming output is probably unnecessary. Only the final step, where the LLM generates the answer from the retrieved context, would meaningfully benefit from streaming.
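To make the suggested split concrete, here is a minimal Python sketch (not the project's actual code; `decompose`, `retrieve`, and `generate_stream` are hypothetical stand-ins for the real LLM calls): intermediate steps stay blocking because their complete output feeds the next step, while only the final answer is streamed to the user.

```python
# Sketch only: blocking intermediate steps, streaming final answer.
# All three helpers below are placeholders, not the repo's real API.
from typing import Iterator

def decompose(question: str) -> list[str]:
    # Placeholder for a blocking LLM call; the full decomposition is
    # required before retrieval can start, so streaming gains nothing.
    return [question]

def retrieve(sub_questions: list[str]) -> str:
    # Placeholder for the graph retrieval iterations.
    return "retrieved context for: " + "; ".join(sub_questions)

def generate_stream(prompt: str) -> Iterator[str]:
    # Placeholder for a streaming LLM call, e.g. an OpenAI-compatible
    # client with stream=True yielding delta chunks.
    for token in prompt.split():
        yield token + " "

def answer(question: str) -> Iterator[str]:
    subs = decompose(question)          # blocking: full output needed
    context = retrieve(subs)            # blocking: full output needed
    prompt = f"Context: {context}\nQuestion: {question}"
    yield from generate_stream(prompt)  # streaming: tokens reach the user early

for chunk in answer("Who founded the lab?"):
    print(chunk, end="")
```

Only the last call changes shape; the caller can start rendering as soon as the first chunk arrives instead of waiting for the whole answer.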

Additionally, please remove any files unrelated to this specific change from the commit.

@lybtt (Contributor, Author) commented Oct 21, 2025

> Thank you for your attention and contribution.
>
> For YouTube-GraphRAG, during the graph construction, since it inherently requires obtaining the complete LLM output for processing, switching to a streaming approach would not be very meaningful. During the inference phase, the primary factors affecting the speed and user experience are the question decomposition and the multiple retrieval iterations, where the time cost is dominated by the LLM's output generation.
>
> Since these are intermediate steps, switching to streaming output is probably unnecessary. Only the final step, where the LLM generates the answer based on the retrieved context, would be meaningful to change to streaming output.
>
> Additionally, if there are any files unrelated to this specific change, it is recommended to remove them from the commit.

Just wanted to add that for lengthy text outputs, especially with privately deployed models that often have shorter timeout settings, streaming can help avoid timeouts by delivering content incrementally.
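The timeout point can be illustrated with a small self-contained simulation (hypothetical names and numbers, not any real client's API): a blocking call must finish inside one fixed window, while a streaming consumer only needs each chunk to arrive within an inactivity window, so total generation time can exceed the window without tripping a timeout.

```python
# Sketch: per-chunk inactivity window vs. a fixed whole-request timeout.
# `slow_model`, `read_stream`, and CHUNK_TIMEOUT are illustrative only.
import time

CHUNK_TIMEOUT = 0.15  # hypothetical per-chunk inactivity limit (seconds)

def slow_model(n_chunks: int, delay: float):
    # Stand-in for a model that emits one chunk every `delay` seconds.
    for i in range(n_chunks):
        time.sleep(delay)
        yield f"chunk-{i} "

def read_stream(stream) -> str:
    # Reset the deadline on every chunk: total time is unbounded as long
    # as no single gap between chunks exceeds CHUNK_TIMEOUT.
    out, last = [], time.monotonic()
    for chunk in stream:
        now = time.monotonic()
        if now - last > CHUNK_TIMEOUT:
            raise TimeoutError("no data within per-chunk window")
        out.append(chunk)
        last = now
    return "".join(out)

# Total generation here is ~0.4 s, well over the 0.15 s window, yet each
# individual gap is only ~0.05 s, so the streaming reader never times out.
# A blocking call with a 0.15 s request timeout would have failed.
print(read_stream(slow_model(8, 0.05)))
```

With a non-streaming request the same 0.15 s budget would have to cover the entire generation, which is exactly where short-timeout private deployments fall over on long outputs.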

