Skip to content

Conversation

@polariton
Copy link

@polariton polariton commented Jul 10, 2025

Changes proposed in this pull request:

  • Implementation of to_ordinal for Czech language.
  • Fix several issues with to_ordinal for Korean language.

Status

  • READY
  • HOLD
  • WIP (Work-In-Progress)

How to verify this change

Test Czech and Korean ordinals as follows:

n = 100
num2words(n, ordinal=True, lang='cs')
num2words(n, ordinal=True, lang='ko')

Additional notes

I introduced these changes to properly convert ordinal numbers in Czech and Korean for creating the multilanguage speech datasets with CoquiTTS.

Stanislav Kruchinin added 2 commits July 10, 2025 13:23
- There should be no space after 백 (hundred) because Korean numbers are written as compound words without spaces.
- Handle place values properly: Check if the number ends with 백, 천, 만, 억 first.
- Handle 십 (ten) properly.
- Use proper string splitting with limit parameter.
- Use .get() method instead of try/except for dictionary lookups.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant