v0.2

FDecaYed released this 09 Feb 08:06


6e7b613

What’s Changed

Breaking Changes

Added NVIDIA CUB as a new dependency, included as a submodule.

New Features

SparseTensor is now supported as embedding input, in addition to dense Tensor and RaggedTensor.
Added support and an example for the Keras model.fit() API through a custom train_step() function.
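
To make the three input formats concrete, the following is a minimal pure-Python sketch (no TensorFlow required) of how one multi-hot batch might be expressed in dense, ragged, and sparse (COO) form. The helper names and the padding ID of -1 are assumptions made for illustration and are not part of this library's API; only the field names row_splits, indices, values, and dense_shape mirror tf.RaggedTensor and tf.SparseTensor.

```python
# Illustrative only: plain-Python stand-ins for dense, ragged, and sparse inputs.
PAD = -1  # hypothetical padding ID used by the dense form in this sketch


def from_dense(rows, pad=PAD):
    """Dense form: fixed-width rows, with unused slots filled by `pad`."""
    return [[i for i in row if i != pad] for row in rows]


def from_ragged(values, row_splits):
    """Ragged form: flat values plus row boundaries (like RaggedTensor.row_splits)."""
    return [values[row_splits[r]:row_splits[r + 1]]
            for r in range(len(row_splits) - 1)]


def from_sparse(indices, values, dense_shape):
    """Sparse COO form: (row, col) indices, values, and the dense shape."""
    rows = [[] for _ in range(dense_shape[0])]
    # Sort by (row, col) so IDs come out in column order within each row.
    for (r, _c), v in sorted(zip(indices, values)):
        rows[r].append(v)
    return rows


# The same batch of three samples, written three ways.
dense = [[3, 7, PAD], [5, PAD, PAD], [1, 2, 4]]
ragged = ([3, 7, 5, 1, 2, 4], [0, 2, 3, 6])
sparse = ([(0, 0), (0, 1), (1, 0), (2, 0), (2, 1), (2, 2)],
          [3, 7, 5, 1, 2, 4],
          (3, 3))

ids_dense = from_dense(dense)
ids_ragged = from_ragged(*ragged)
ids_sparse = from_sparse(*sparse)
```

All three calls decode to the same per-sample ID lists, which is why an embedding layer can treat the formats as interchangeable views of one input.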

Improvements

Improved embedding lookup speed when the input is multi-hot with a combiner.
Improved embedding lookup speed when the input is one-hot, regardless of its combiner and format (Tensor, SparseTensor, or RaggedTensor).
Added support for data-parallel input, CPU embedding, and the TF native embedding API as options in the benchmark.
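
For readers unfamiliar with the terms used above, here is a small pure-Python sketch of what one-hot and multi-hot lookups with a combiner compute. It is a reference illustration under assumed names (lookup, table), not the optimized implementation shipped in this release, and it assumes every sample contains at least one ID.

```python
def lookup(table, batch_ids, combiner=None):
    """Look up embedding rows for each sample and optionally combine them.

    table:     list of embedding vectors, indexed by ID
    batch_ids: list of per-sample ID lists (length 1 means one-hot)
    combiner:  None, "sum", or "mean"
    """
    if combiner not in (None, "sum", "mean"):
        raise ValueError(f"unsupported combiner: {combiner!r}")
    out = []
    for ids in batch_ids:
        vectors = [table[i] for i in ids]
        if combiner is None:
            # No combiner: this sketch only handles the one-hot case here.
            if len(vectors) != 1:
                raise ValueError("multi-hot input needs a combiner")
            out.append(list(vectors[0]))
            continue
        # Element-wise sum across the looked-up vectors.
        combined = [sum(col) for col in zip(*vectors)]
        if combiner == "mean":
            combined = [c / len(vectors) for c in combined]
        out.append(combined)
    return out


table = [[0.0, 0.0], [1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

one_hot = lookup(table, [[1], [3]])                         # one ID per sample
multi_sum = lookup(table, [[1, 2], [3]], combiner="sum")    # variable IDs per sample
multi_mean = lookup(table, [[1, 2], [3]], combiner="mean")
```

With one ID per sample, "sum" and "mean" both return the looked-up row unchanged, so the one-hot case can in principle be served by a plain gather whatever combiner is configured.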

Bug Fixes

Fixed the build with TensorFlow 2.10+.
Fixed a bug where the batch dimension could be None at an early stage in graph mode.

Full Changelog: v0.1...v0.2
