
Update max_tokens default from 4096 to 65535 #108


Merged
2 commits merged into main on Jul 18, 2025

Conversation

devin-ai-integration[bot] (Contributor) commented on Jul 18, 2025

Update token defaults from 4096/1024 to 65535

Summary

Updated the default token limits in both the Python and Node.js SDKs, raising the maximums from 4096/1024 to 65535. This change affects:

  • Python SDK: GenerationConfig.max_tokens default (4096 → 65535)
  • Node.js SDK: Fine-tuning max_tokens (4096 → 65535) and max_new_tokens (1024 → 65535)

The changes allow users to process longer content by default without requiring explicit token limit configuration.
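
For context, the Python-side edit amounts to changing a single default value. The snippet below is a minimal sketch, assuming GenerationConfig is a Pydantic model; the import path and field layout are illustrative, not the SDK's confirmed definition.

```python
# Minimal sketch of the change to vlmrun/client/types.py (field layout assumed).
from pydantic import BaseModel


class GenerationConfig(BaseModel):
    # Default raised from 4096 to 65535 so longer outputs work without
    # explicit configuration by the caller.
    max_tokens: int = 65535
```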

Review & Testing Checklist for Human

  • API Compatibility: Verify that the VLM API backend actually supports 65535 tokens and doesn't reject requests with this limit
  • Integration Testing: Run integration tests (npm run test:integration for Node.js, full test suite for Python) to confirm end-to-end functionality works with new defaults
  • Cost Impact Assessment: Consider whether the 16x increase in default token limits could lead to unexpected cost increases for users, and whether documentation or warnings are needed
  • Backward Compatibility: Evaluate if this constitutes a breaking change that might affect existing user code or requires deprecation notices

Recommended Test Plan: Create a test request with content that would exceed the old limits (>4096 tokens) but stay within the new limit, and verify it processes successfully through the full API pipeline.
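
A sketch of such a test is below, assuming the Python SDK exposes a VLMRun client with a generate-style call; the import path, method name, and response fields are assumptions and would need to be aligned with the SDK's actual API before use.

```python
# Hypothetical integration test: rely on the new 65535-token default and verify
# that a response longer than the old 4096-token ceiling comes back intact.
# Client, method, and field names here are assumptions, not confirmed SDK API.
import os

from vlmrun.client import VLMRun  # assumed import path

client = VLMRun(api_key=os.environ["VLMRUN_API_KEY"])

# No explicit max_tokens: the request should pick up the new 65535 default.
response = client.generate(
    prompt="Transcribe and summarize every page of the attached document.",
)

# ~4 characters per token is a rough rule of thumb; anything well past the old
# 4096-token ceiling indicates the higher default was honored end to end.
assert len(response.text) > 4096 * 4
```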


Diagram

%%{ init : { "theme" : "default" }}%%
graph TD
    subgraph Legend
        L1["Major Edit"]:::major-edit
        L2["Minor Edit"]:::minor-edit  
        L3["Context/No Edit"]:::context
    end

    PythonTypes["vlmrun/client/types.py<br/>GenerationConfig.max_tokens"]:::major-edit
    NodeFinetuning["src/client/fine_tuning.ts<br/>max_tokens & max_new_tokens"]:::major-edit
    
    PythonClient["Python SDK Client"]:::context
    NodeClient["Node.js SDK Client"]:::context
    VlmAPI["VLM API Backend"]:::context
    
    PythonClient --> PythonTypes
    NodeClient --> NodeFinetuning
    PythonTypes --> VlmAPI
    NodeFinetuning --> VlmAPI
    
    classDef major-edit fill:#90EE90
    classDef minor-edit fill:#87CEEB  
    classDef context fill:#FFFFFF

Notes

- Updated GenerationConfig.max_tokens default value to 65535
- This increases the maximum token limit for generation requests

Co-Authored-By: [email protected] <[email protected]>
devin-ai-integration[bot] (Contributor, Author) commented:

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

  • Disable automatic comment and CI monitoring

- Version increment to trigger automatic deployment on merge
- Includes max_tokens default update from 4096 to 65535

Co-Authored-By: [email protected] <[email protected]>
@dineshreddy91 merged commit b5a11de into main on Jul 18, 2025
4 checks passed