v0.2
What’s Changed
Breaking Changes
- added new dependency NVIDIA CUB as submodule
New Features
- SparseTensor is supported as embedding input, in addition to Dense and Ragged Tensor.
- Add support and example for keras model.fit() api through custom train_step() function
Improvements
- Improved embedding lookup speed when input is multi-hot with combiner.
- Improved embedding lookup speed when input is one-hot, regardless of its combiner and format(Tensor, SparseTensor or RaggedTensor)
- Add support for data parallel input, cpu embedding and TF native embedding api as options in benchmark
Bug Fixes
- fix build with tensorflow 2.10+
- fix a bug where batch dimension could be None at early stage in graph mode
Full Changelog: v0.1...v0.2