# One Permutation Hashing

Ping Li, Art Owen, Cun-Hui Zhang

While minwise hashing is promising for large-scale learning on massive binary data, the preprocessing cost is prohibitive because it requires applying many (e.g., $k=500$) permutations to the data. The testing time is also expensive if a new data point (e.g., a new document or a new image) has not been processed...
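To make the preprocessing cost concrete, here is a minimal sketch of conventional $k$-permutation minwise hashing, the baseline the abstract refers to. It is an illustration under stated assumptions, not the authors' implementation: the function names, the dimensionality `dim`, the example sets, and the use of explicit shuffled lists as permutations are all hypothetical choices made for readability.

```python
import random


def make_permutations(dim, k, seed=0):
    """Build k random permutations of range(dim) (hypothetical helper)."""
    rng = random.Random(seed)
    perms = []
    for _ in range(k):
        perm = list(range(dim))
        rng.shuffle(perm)
        perms.append(perm)
    return perms


def minhash_signature(nonzeros, perms):
    """Keep the smallest permuted index of a non-empty set, once per permutation."""
    return [min(perm[i] for i in nonzeros) for perm in perms]


def estimate_resemblance(sig_a, sig_b):
    """Fraction of permutations on which the two minima agree."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


# Assumed toy setup: binary vectors of dimension 1000, stored as sets of nonzero indices.
dim, k = 1000, 500
perms = make_permutations(dim, k)

set_a = set(range(0, 300))
set_b = set(range(100, 400))

# Every data point must be passed through all k permutations.
sig_a = minhash_signature(set_a, perms)
sig_b = minhash_signature(set_b, perms)

estimate = estimate_resemblance(sig_a, sig_b)
exact = len(set_a & set_b) / len(set_a | set_b)  # 200 / 400
```

In this sketch each signature costs one pass over the nonzeros per permutation, so hashing a single data point touches its nonzeros $k$ times. The same work must be repeated for any new, unprocessed data point at test time, which is the expense the abstract describes.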
