Skip to content

LMDeploy Release V0.0.3

Choose a tag to compare

@lvhan028 lvhan028 released this 09 Aug 09:55
· 1484 commits to main since this release
4bd0b48

What's Changed

🚀 Features

  • Support tensor parallelism without offline splitting model weights by @grimoire in #158
  • Add script to split HuggingFace model to the smallest sharded checkpoints by @LZHgrla in #199
  • Add non-stream inference api for chatbot by @lvhan028 in #200

💥 Improvements

🐞 Bug fixes

  • Fix build test error and move turbmind csrc test cases to tests/csrc by @lvhan028 in #188
  • Fix launching client error by moving lmdeploy/turbomind/utils.py to lmdeploy/utils.py by @lvhan028 in #191

📚 Documentations

New Contributors

Full Changelog: v0.0.2...v0.0.3