Skip to content

Commit 2080cad

Browse files
committed
v3.2.2: auto combine
1 parent aca83cd commit 2080cad

16 files changed

+53
-2701
lines changed

CHANGELOG.md

Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,44 @@ All notable changes to this project will be documented in this file.
55
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
66
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
77

8+
## [3.2.2] - 2025-06-26
9+
10+
### 🎵 Combined Audio Functionality
11+
12+
This release introduces the revolutionary combined audio feature that allows generating single, seamless audio files from long text content.
13+
14+
### ✨ Added
15+
16+
- **Combined Audio Endpoints**: New `/api/generate-combined` and `/v1/audio/speech-combined` endpoints
17+
- **Intelligent Text Splitting**: Smart algorithm that splits text at sentence boundaries, then word boundaries, preserving natural speech flow
18+
- **Seamless Audio Combination**: Professional audio processing to merge chunks into single continuous files
19+
- **OpenAI Compatibility**: Full OpenAI TTS API compatibility for combined audio generation
20+
- **Advanced Fallback System**: Multiple fallback mechanisms for audio combination (PyDub → WAV concatenation → raw concatenation)
21+
- **Rich Metadata**: Response headers with chunk count, file size, and processing information
22+
- **Comprehensive Testing**: Full test suite with unit tests, integration tests, and performance benchmarks
23+
24+
### 🔄 Changed
25+
26+
- **Extended Character Limits**: No longer limited to 4096 characters per request
27+
- **Enhanced Web Interface**: Updated documentation with combined audio endpoint information
28+
- **Improved Error Handling**: Better validation and error messages for long text processing
29+
30+
### 🛠️ Technical Features
31+
32+
- **Concurrent Processing**: Parallel chunk processing for faster generation
33+
- **Memory Optimization**: Efficient memory usage for large text processing
34+
- **Format Support**: Works with all supported audio formats (MP3, WAV, OPUS, AAC, FLAC, PCM)
35+
- **Performance Monitoring**: Built-in performance tracking and optimization
36+
- **Unicode Support**: Full Unicode text handling for international content
37+
38+
### 📋 Use Cases
39+
40+
- **Long Articles**: Convert blog posts and articles to single audio files
41+
- **Audiobooks**: Generate chapters as continuous audio
42+
- **Educational Content**: Transform learning materials to audio format
43+
- **Accessibility**: Enhanced support for visually impaired users
44+
- **Podcast Creation**: Convert scripts to professional audio content
45+
846
## [3.1.0] - 2024-12-19
947

1048
### 🔧 Format Support Improvements

COMBINED_AUDIO_API.md

Lines changed: 0 additions & 245 deletions
This file was deleted.

0 commit comments

Comments
 (0)