Minhashing#

Minhash#

This module contains functions and classes that will be useful in creating the minhash signature given a dask bag of texts.

MinhashLSH(shingle_size, num_shingle_bucket, ...)

Base class definition for extraction of the minhash signature given a dask bag of text data.