Add anthropic retrieve batches and retrieve file content support #17700
verbose_proxy_logger.info(
    f"Stored Anthropic batch managed object with unified_object_id={unified_object_id}, batch_id={model_object_id}"
)
Check failure
Code scanning / CodeQL
Clear-text logging of sensitive information (High)
sensitive data (password)
Copilot Autofix (AI, 3 days ago)
To address this issue, we should avoid logging the raw unified_object_id and batch_id values if they could incorporate or be derived from sensitive user data (such as API keys, passwords, identifiers, etc.). The best fix is to redact or mask the sensitive portions before logging, or simply omit logging these fields altogether if not strictly necessary. If logging is required for troubleshooting or audit, log only non-sensitive, high-level information (e.g., log that a batch managed object was stored, but do not log any identifiers).
The problematic code is in litellm/proxy/pass_through_endpoints/llm_provider_handlers/anthropic_passthrough_logging_handler.py, lines 513–514.
The fix involves updating the logging statement to either:
- Redact sensitive information;
- Omit the identifiers;
- Or hash the identifiers (e.g., use a one-way hash and log only the hash).
Given best practices, it’s preferable to simply remove the logging of the identifier or replace it with a generic statement that does not include potentially sensitive IDs.
@@ -510,7 +510,7 @@
         )

         verbose_proxy_logger.info(
-            f"Stored Anthropic batch managed object with unified_object_id={unified_object_id}, batch_id={model_object_id}"
+            "Stored Anthropic batch managed object for cost tracking."
         )
     else:
         verbose_proxy_logger.warning("Managed files hook not available, cannot store batch object for cost tracking")
else:
    # Fallback to model name
    actual_model_id = model_name
    verbose_proxy_logger.warning(f"Model not found in router, using model name: {actual_model_id}")
Check failure
Code scanning / CodeQL
Clear-text logging of sensitive information (High)
sensitive data (password)
Copilot Autofix (AI, 3 days ago)
Sensitive data should never be logged, especially if it originates from untrusted input. Here, the problematic log statement is in AnthropicPassthroughLoggingHandler.get_actual_model_id_from_router. To fix the issue:
- Avoid logging the actual, raw model name if it is user-supplied and not guaranteed to be safe (sanitize, redact, or hash).
- Instead of printing the full model name, log only that the fallback occurred, or log a redacted version (e.g., truncating, masking, or indicating its class/type).
- Optionally, use a utility to sanitize strings before logging, e.g., only print the model provider, or a fixed prefix, or a hash.
- Apply this fix specifically to line 537 (and optionally 541 for consistency), in litellm/proxy/pass_through_endpoints/llm_provider_handlers/anthropic_passthrough_logging_handler.py.
- No additional dependencies are needed, as the fix can be implemented in plain Python.
@@ -534,9 +534,9 @@
     else:
         # Fallback to model name
         actual_model_id = model_name
-        verbose_proxy_logger.warning(f"Model not found in router, using model name: {actual_model_id}")
+        verbose_proxy_logger.warning("Model not found in router, using fallback model identifier.")
     return actual_model_id
 else:
     # Fallback if router is not available
-    verbose_proxy_logger.warning(f"Router not available, using model name: {model_name}")
+    verbose_proxy_logger.warning("Router not available, using fallback model identifier.")
     return model_name
    return actual_model_id
else:
    # Fallback if router is not available
    verbose_proxy_logger.warning(f"Router not available, using model name: {model_name}")
Check failure
Code scanning / CodeQL
Clear-text logging of sensitive information (High)
sensitive data (password)
Copilot Autofix (AI, 3 days ago)
To fix this issue, we want to ensure that potentially sensitive user-supplied values, such as model_name, are never included in clear-text logs. The best approach is to avoid including the real value of model_name in the log message and instead log a generic placeholder, such as [REDACTED], or simply do not mention the specific model name at all. This preserves logging context (e.g., that a fallback occurred) without risking data leakage. We will edit the log statement at line 541 in litellm/proxy/pass_through_endpoints/llm_provider_handlers/anthropic_passthrough_logging_handler.py, replacing the message string so the potentially sensitive data is not logged. No other changes are required.
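Beyond fixing individual log statements, a more general defense (my own suggestion, not part of the autofix) is a `logging.Filter` that scrubs known-sensitive patterns from every record before any handler sees it. The regex below, covering API-key-like tokens and the two identifier fields from these alerts, is an assumption for this sketch.

```python
import logging
import re

# Patterns assumed sensitive for this sketch: sk-style API keys and
# the identifier fields flagged by the CodeQL alerts above.
SENSITIVE = re.compile(r"(sk-[A-Za-z0-9]+|(?:batch_id|unified_object_id)=\S+)")

class RedactingFilter(logging.Filter):
    """Replace anything matching SENSITIVE with [REDACTED]."""
    def filter(self, record: logging.LogRecord) -> bool:
        # getMessage() applies %-formatting, so redaction also covers args.
        record.msg = SENSITIVE.sub("[REDACTED]", record.getMessage())
        record.args = ()  # message is already fully formatted
        return True

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("redaction-demo")
logger.addFilter(RedactingFilter())

logger.warning("Router not available, using model name: sk-abc123")
# the handler receives: "Router not available, using model name: [REDACTED]"
```

A filter like this is a safety net, not a substitute for removing sensitive values at the call site; it only catches patterns you thought to enumerate.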
@@ -538,5 +538,5 @@
     return actual_model_id
 else:
     # Fallback if router is not available
-    verbose_proxy_logger.warning(f"Router not available, using model name: {model_name}")
+    verbose_proxy_logger.warning("Router not available, using model name: [REDACTED]")
     return model_name
Title
Add anthropic retrieve batches and retrieve file content support
Relevant issues
Fixes LIT-1457
Pre-Submission checklist
Please complete all items before asking a LiteLLM maintainer to review your PR
- Added tests in the tests/litellm/ directory (adding at least 1 test is a hard requirement - see details)
- Ran make test-unit

Type
🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
Changes