feat: Integrate OpenAI proxy controller, token usage tracking, and dependency updates #4
Conversation
…pendency updates Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
…d usage tracking, and improved error handling Refactored ChatController initialization, added CORS for local environments, and optimized environment variable loading.
Nice structure overall (BYOK fallback, shared streaming helper, centralized error handler, usage tracking abstraction). The implementation is solid directionally, but there are a few must-fix issues before merging — mainly around Fastify streaming lifecycle, Responses API streaming usage, and separation of concerns.
🚨 Must-fix
1. Fastify streaming lifecycle
- You write directly to `reply.raw`. When manually taking over the response stream in Fastify, you should call `reply.hijack()`; otherwise you can hit lifecycle/headers issues depending on environment and plugins.
- Consider also:
  - `flushHeaders()` (if available)
  - `X-Accel-Buffering: no`
  - `Cache-Control: no-transform`

  These help prevent buffering issues when running behind nginx/CDNs.
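A minimal sketch of that takeover, using structural stand-ins for Fastify's types (`Reply`, `RawReply`, `sseHeaders`, and `beginSse` are illustrative names, not the PR's actual code):

```typescript
// Structural stand-ins for Fastify's reply objects (illustrative, not the
// real FastifyReply type from "fastify").
interface RawReply {
  writeHead(status: number, headers: Record<string, string>): void;
  write(chunk: string): boolean;
  end(): void;
}
interface Reply {
  hijack(): void;
  raw: RawReply;
}

// The headers recommended above, collected in one place.
function sseHeaders(): Record<string, string> {
  return {
    "Content-Type": "text/event-stream",
    "Cache-Control": "no-cache, no-transform", // no-transform: keep proxies from re-encoding
    Connection: "keep-alive",
    "X-Accel-Buffering": "no", // tell nginx not to buffer the stream
  };
}

// Take over the response stream: hijack first, then write headers manually.
function beginSse(reply: Reply): void {
  reply.hijack(); // Fastify must not touch the reply after this point
  reply.raw.writeHead(200, sseHeaders());
}
```

In a real handler this would run before the first `reply.raw.write(...)` of an SSE frame.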
2. Client disconnect handling
- If the client closes the connection mid-stream, we should stop consuming the upstream OpenAI stream. Otherwise we may continue generating tokens and incurring cost.
- Add a `request.raw.on('close' | 'aborted')` handler and abort/cancel the OpenAI stream if supported (or break the loop).
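One possible wiring (the `abortOnDisconnect` helper is hypothetical): tie an `AbortController` to the client socket's `close` event, and pass the resulting signal to the upstream SDK call, which the OpenAI Node SDK accepts as a `signal` request option.

```typescript
import { EventEmitter } from "node:events";

// Hypothetical wiring: abort the upstream OpenAI request when the client
// socket closes. `rawRequest` stands in for Fastify's `request.raw`, which
// is a Node EventEmitter.
function abortOnDisconnect(rawRequest: EventEmitter): AbortSignal {
  const controller = new AbortController();
  rawRequest.once("close", () => controller.abort());
  return controller.signal;
}
```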
3. Responses API streaming usage is not captured
- Current `handleStreamingRequest()` checks only `chunk.usage`.
- In Responses streaming, usage typically arrives nested (e.g. under `response.completed` → `response.usage`).
- As implemented, `/responses` streaming will likely never record usage and will log “No usage data…”.

We need a small usage extractor that handles: `chunk.usage ?? chunk.response?.usage`

4. Don’t throw JSON errors after streaming has started
- If an error occurs after we start writing SSE frames, calling `reply.status().send()` in `handleOpenAIError()` can fail because headers/body were already sent.
- For streaming paths:
  - Either emit an SSE error event and close
  - Or just end the socket
- JSON error responses should be reserved for non-streaming paths only.
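A sketch of the in-band error path (the `sseErrorFrame` helper and its payload shape are illustrative, not the PR's actual code):

```typescript
// Illustrative in-band error frame: once SSE streaming has started, errors
// are reported as an SSE `error` event instead of a JSON error response.
function sseErrorFrame(message: string): string {
  return `event: error\ndata: ${JSON.stringify({ error: { message } })}\n\n`;
}
```

The streaming handler would `reply.raw.write(sseErrorFrame(...))` and then `reply.raw.end()`; JSON error responses stay on the non-streaming path.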
5. Usage tracking after response end can break the handler
- `handleStreamingRequest()` ends the response and then awaits `trackUsage()`.
- If `trackUsage()` throws, the outer catch may attempt to send an error response on a closed reply.
- Wrap `trackUsage()` in its own try/catch and log failures instead of propagating.
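The wrapper could be as small as this (`safeTrackUsage` and the `log` callback are illustrative names):

```typescript
// Illustrative wrapper: usage-tracking failures are logged, never thrown,
// so they cannot reach the outer catch after the response has ended.
async function safeTrackUsage(
  track: () => Promise<void>,
  log: (msg: string) => void
): Promise<void> {
  try {
    await track();
  } catch (err) {
    log(`trackUsage failed: ${err instanceof Error ? err.message : String(err)}`);
  }
}
```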
🏗 Architectural Concern – Controller Is Doing Too Much
The controller currently handles:
- Route registration
- HTTP/SSE transport
- BYOK resolution
- Usage limit checks
- OpenAI client instantiation
- Streaming orchestration
- Usage extraction
- Usage accounting
- Error translation
This mixes transport concerns with application/business logic. It’s not unmanageable yet, but it’s already trending toward a “god controller.”
Recommendation (incremental refactor, not a rewrite)
Keep in Controller:
- Route registration
- Extracting authenticated `userId`
- SSE header setup / hijack
- Writing SSE frames
- Mapping service errors → HTTP responses
Extract to an `OpenAIProxyService` (application layer):
- `checkUsageLimit`
- BYOK resolution (`resolveOpenAIClient`)
- OpenAI API invocation
- Streaming vs non-streaming orchestration
- Usage extraction logic
- Calling `tokenUsageService.incrementUsage`
Optionally:
- `OpenAIClientFactory` for BYOK/client creation
- `UsageExtractor` utility to normalize Chat vs Responses usage formats
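The `UsageExtractor` could be a single function. This sketch assumes the snake_case usage fields the OpenAI SDK returns and covers the two chunk shapes described in point 3 (top-level `usage` for Chat Completions, nested `response.usage` for Responses events):

```typescript
// Usage fields follow the OpenAI SDK's snake_case usage objects (assumption).
interface TokenUsage {
  prompt_tokens?: number;
  completion_tokens?: number;
  total_tokens?: number;
}

// A chunk from either stream: Chat Completions puts `usage` at the top
// level; Responses events nest it under `response` (e.g. on
// `response.completed`).
interface StreamChunk {
  usage?: TokenUsage | null;
  response?: { usage?: TokenUsage | null };
}

// Normalize both shapes; returns undefined when the chunk carries no usage.
function extractUsage(chunk: StreamChunk): TokenUsage | undefined {
  return chunk.usage ?? chunk.response?.usage ?? undefined;
}
```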
Why this matters
- Better separation of concerns
- Proper unit testability (without Fastify/SSE)
- Easier evolution (model allowlists, org quotas, pricing tiers, retries, observability)
- Reduced risk of future streaming edge-case bugs
Not blocking for this PR if time is tight, but strongly recommend either addressing now or creating a follow-up refactor task.
⚠️ Strong recommendations
- Do not trust the `x-user-id` header for auth/quota/BYOK resolution. Derive `userId` from auth middleware (JWT/session). Header spoofing can otherwise spend other users’ quota.
- Add request validation/schema for `/responses` instead of `Record<string, any>`.
- Consider model allowlisting and sensible limits (`max_output_tokens`, tools usage, etc.) to prevent abuse/cost spikes.
- Consider whether to append `data: [DONE]` for `/responses`. Responses streaming is JSON event-based; `[DONE]` may break strict SSE JSON consumers. Align with expected client behavior.
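If both endpoints share one streaming helper, the sentinel decision could be isolated in a tiny function (illustrative names; whether `/responses` should get a sentinel at all is the open question above):

```typescript
// Illustrative: only the Chat Completions stream ends with the `data: [DONE]`
// sentinel; Responses streaming is a typed JSON event stream and gets none.
type ProxyEndpoint = "chat" | "responses";

function streamTerminator(endpoint: ProxyEndpoint): string | null {
  return endpoint === "chat" ? "data: [DONE]\n\n" : null;
}
```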
Overall: good foundation, but we need to harden streaming lifecycle handling and clean up separation of concerns before this grows further.
Key Changes & Addressed Feedback
- Fastify Streaming Lifecycle:
  - Implemented `reply.hijack()` for manual socket control.
  - Added `X-Accel-Buffering: no` and `Cache-Control: no-transform` headers.
- Client Disconnect Handling:
  - Added `reply.raw.on('close')` listeners to trigger upstream aborts via `AbortController`.
  - Stops upstream consumption immediately to save tokens.
- Responses API Streaming Usage:
  - Validated and fixed usage extraction for nested `chunk.response.usage` fields.
- Error Handling:
  - Prevented JSON error responses after streaming headers are sent; falls back to SSE error events or stream termination.
- Safe Usage Tracking:
  - Wrapped `trackUsage` in a try/catch block so that usage-tracking failures are logged and never crash the response handler.
- Architectural Separation:
  - Extracted `OpenAIProxyService` to handle business logic (BYOK resolution, validation, API calls).
  - Controller now purely handles transport (HTTP/SSE, headers, error mapping).
Testing & Validation
Validated compatibility with popular AI SDKs using an external project which utilizes the OpenAI Proxy:
- LangChain JS: Verified `ChatOpenAI` works correctly in both streaming and non-streaming modes.
- Vercel AI SDK: Verified `generateText` and `streamText` work correctly in both streaming and non-streaming modes.

In addition to the above, I tested that the connection closes correctly when the client disconnects mid-stream.
Re: Strong recommendations section
- The `x-user-id` header is secure and verified before the request reaches the server. There is an SPOE service earlier in the request routing path.
- Route `body` payloads are now typed with OpenAI types:
    async registerRoutes(): Promise<void> {
      this.fastify.post<{ Body: ChatCompletionCreateParamsStreaming | ChatCompletionCreateParamsNonStreaming }>(
        `${this.basePath}/chat/completions`,
        this.handleChatCompletion.bind(this)
      );
      this.fastify.post<{ Body: ResponseCreateParamsStreaming | ResponseCreateParamsNonStreaming }>(
        `${this.basePath}/responses`,
        this.handleResponses.bind(this)
      );
    }

What do you think about this approach?
2 & 3. Skipped for now
…or handling and usage tracking Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
…streaming/non-streaming requests Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
… handling Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
@piotrswierzy Thanks. I'll add a placeholder task to the board so we don't forget about further improvements. You can fill it with details later.
…ith regex pattern matching Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
pat-rg
left a comment
- Change proxy basePath: `/api/openai/v1` → `/api/playground/assistant/...`
The HAProxy API Gateway uses an ACL to route traffic to the ai-assistant microservice:
    acl allowed_path path_beg /api/playground/assistant
    http-request deny deny_status 404 if !allowed_path
Any path not starting with /api/playground/assistant will be rejected with 404. By changing the basePath, the new proxy routes will automatically inherit authentication (SPOE), rate limiting, and CORS already configured in HAProxy.
- Fix unused `chatService` in `index.ts` - either inject it into `ChatControllerImpl` or remove it (currently dead code)
📝 Note about branches
If it's not an inconvenience, could you create a branch directly in the main repository instead of using a fork? This allows the CI workflow (ai-assistant-build.yaml) to build and deploy a version to the develop environment for testing before merging. For future PRs, please follow this approach as well.
Thank you for your review, @pat-rg. I’ll implement changes 1 and 2. Regarding point 2, thanks for catching that - I’ll remove the variable. Unfortunately, I’m not able to create a branch directly in your repository since I don’t have write access.
…initialization Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
@pat-rg I have pushed the changes. Please re-review.
pat-rg
left a comment
Thanks for the quick fix!
I pushed your changes to a branch in the main repository to deploy them to the develop environment. During deployment, this ESM import error came up:
Error [ERR_MODULE_NOT_FOUND]: Cannot find module '/app/dist/utils/environment'
In index.ts line 17, the import is missing the .js extension:
    // Current
    import { isLocal } from "./utils/environment";
    // Should be
    import { isLocal } from "./utils/environment.js";
This error only shows up when running the compiled code (npm run build && npm start), not during development with ts-node. For future changes, please test the production build locally before pushing.
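For context, this requirement comes from Node's ESM resolution: with `NodeNext` module settings (an assumption about this project's tsconfig), TypeScript emits relative import specifiers verbatim, so they must already carry the `.js` extension the compiled file will have:

```json
{
  "compilerOptions": {
    "module": "NodeNext",
    "moduleResolution": "NodeNext"
  }
}
```

ts-node resolves the extensionless specifier against the `.ts` source, which is why the error only appears in the compiled build.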
Could you fix this and push to your fork? Once updated, I'll pull the changes and redeploy.
Also, I'm looking into getting you write access for future PRs.
Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
@pat-rg I have fixed it, but there is another issue now. I'll let you know when it's ready.
Signed-off-by: skurzyp-blockydevs <stanislaw.kurzyp@blockydevs.com>
@pat-rg it should be all good now. Could you please try to redeploy?
@skurzyp-blockydevs The updated version is now deployed to the develop environment. The API is available at:
Thank you. We are still waiting on deployment of the hedera portal frontend. Our PR is stuck due to the Snyk check failing, and I don't have access to the logs. Can you by any chance help me out with it? It would be great if you could provide them. Also, is this check strictly necessary for the development environment? I assume there is some problem in a sub-dependency of our Agentic SDK, which means we will need to make a new release. That might take a couple of days, and if possible we would like to be able to test the FE in the meantime.
Changes:
- `/chat/completions` and `/responses`
- `TokenUsageService` - it is reused in both `AiAssistant` and the `OpenAiProxy`

Todo:
- `AiAssistant` - Blocked by not having access to required services