no code implementations • NeurIPS 2012 • Ping Li, Art Owen, Cun-Hui Zhang
While minwise hashing is promising for large-scale learning in massive binary data, the preprocessing cost is prohibitive as it requires applying (e. g.,) $k=500$ permutations on the data.