Conversation


@graelo graelo commented Aug 9, 2025

Add Mistral Tokenizer Support for Tool Calling

Summary

This PR adds basic support for Mistral tokenizers in MLX-LM, with a focus on tool calling functionality. The implementation includes optional mistral-common integration and practical examples to help users work with Mistral models for function calling.

Changes Made

Core Tokenizer Integration

  • Optional Mistral support: Added mistral-common as an optional dependency
  • Enhanced TokenizerWrapper: Extended to handle both HuggingFace and Mistral tokenizers
  • Chat template method: Added apply_chat_template() with automatic OpenAI-to-Mistral format conversion (sketched after this list)
  • Streaming detokenizer: New MistralStreamingDetokenizer for proper special token handling

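To make the conversion concrete, here is a minimal sketch of mapping OpenAI-style message dicts onto mistral-common's message classes. The helper name and the role table are illustrative assumptions, not the PR's actual code:

from mistral_common.protocol.instruct.messages import (
    AssistantMessage,
    SystemMessage,
    ToolMessage,
    UserMessage,
)

# Hypothetical role-to-class table; the real conversion also has to carry
# tool calls and tool results, which this sketch omits.
_ROLE_TO_CLASS = {
    "system": SystemMessage,
    "user": UserMessage,
    "assistant": AssistantMessage,
    "tool": ToolMessage,
}

def convert_messages(openai_messages):
    # Map each {"role": ..., "content": ...} dict to its Mistral counterpart.
    return [_ROLE_TO_CLASS[m["role"]](content=m["content"]) for m in openai_messages]
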
Examples and Documentation

  • mistral_tool_use.py: Multi-turn tool calling example with weather and math functions
  • mistral_parallel_tool_use.py: Example showing parallel tool calls

Key Features

  • Format conversion: Automatically converts OpenAI-style messages to Mistral format
  • Tool calling: Supports Mistral's [TOOL_CALLS] format with robust parsing (see the sketch after this list)
  • Graceful fallbacks: Works with or without mistral-common installed
  • OpenAI compatibility: Uses standard OpenAI message format as input
  • Quantization: Saves the original tekken.json file alongside the quantized model files and uploads it to HF

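For illustration, here is a minimal sketch of the kind of [TOOL_CALLS] parsing involved, assuming the model emits the token followed by a JSON list of calls. This is not the PR's exact parser:

import json

TOOL_CALLS_TOKEN = "[TOOL_CALLS]"

def parse_tool_calls(text):
    # Split the generation at the special token and decode the JSON payload.
    # Returns (tool_calls, plain_text); on malformed JSON, fall back to text.
    if TOOL_CALLS_TOKEN not in text:
        return [], text
    plain, _, payload = text.partition(TOOL_CALLS_TOKEN)
    try:
        return json.loads(payload.strip()), plain.strip()
    except json.JSONDecodeError:
        return [], text
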
Usage Example

from mlx_lm import load, generate

model, tokenizer = load("graelo/Devstral-Small-2507-4bits")

# Standard OpenAI format
messages = [{"role": "user", "content": "What's the weather in Paris?"}]
tools = [{"type": "function", "function": {...}}]

prompt = tokenizer.apply_chat_template(messages, tools=tools)
response = generate(model, tokenizer, prompt, max_tokens=100)
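
A second turn would feed the tool result back through the same template. The message shapes below follow the standard OpenAI format; the id and payload values are made up for illustration, and the shipped mistral_tool_use.py example covers the full flow:

# Append the assistant's tool call and the tool's result, then re-encode.
messages.append({
    "role": "assistant",
    "content": "",
    "tool_calls": [{"id": "call_0", "type": "function",
                    "function": {"name": "get_weather",
                                 "arguments": '{"city": "Paris"}'}}],
})
messages.append({"role": "tool", "tool_call_id": "call_0",
                 "content": '{"temperature_c": 21}'})

prompt = tokenizer.apply_chat_template(messages, tools=tools)
response = generate(model, tokenizer, prompt, max_tokens=100)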

Dependencies

Added an optional mistral extra in setup.py:

pip install mlx-lm[mistral]  # Includes mistral-common
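
The graceful fallback mentioned above typically boils down to an import guard; a minimal sketch (illustrative, not the PR's exact code):

try:
    from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
    MISTRAL_AVAILABLE = True
except ImportError:
    # mistral-common not installed: stay on the HuggingFace tokenizer path.
    MISTRAL_AVAILABLE = False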

Files Modified

  • mlx_lm/tokenizer_utils.py - Core tokenizer wrapper enhancements
  • mlx_lm/examples/mistral_*.py - New tool calling examples
  • TOOL_CALLING_TUTORIAL.md - Tutorial documentation
  • setup.py - Optional mistral dependency

This is a first step toward better Mistral integration in MLX-LM. The implementation is straightforward but should provide a good foundation for users wanting to experiment with Mistral tool calling.

@graelo graelo marked this pull request as draft August 10, 2025 22:51

graelo commented Aug 10, 2025

I'm putting this on hold: I now understand that tool results are not properly encoded using ToolMessage, which I had missed. I'll hopefully reopen the PR soon.


graelo commented Aug 11, 2025

I find the Mistral addition works really well, but it clutters the code in tokenizer_utils.py, so I'll try to factor these parts into a separate file. Once that's done, I'll mark the PR as ready for review.

@graelo graelo marked this pull request as ready for review August 11, 2025 12:41

graelo commented Aug 20, 2025

Rebased on main

