Hello, first of all thank you for this library.
For a school assignment, I've written a C-MinHash implementation by extending the MinHash class. It is supposed to be strictly more accurate than MinHash, while running at about the same speed.
If you are interested, I could submit a pull request with the additions. I also wanted to ask if you would prefer it to be a separate class, or an extension of MinHash.
Hello, first of all thank you for this library.
For a school assignment, I've written a C-MinHash implementation by extending the MinHash class. It is supposed to be strictly more accurate than MinHash, while running at about the same speed.
If you are interested, I could submit a pull request with the additions. I also wanted to ask if you would prefer it to be a separate class, or an extension of MinHash.