MinHash

id: minhash-258-1901868
title: MinHash
text: In computer science and data mining, MinHash is a technique for quickly estimating how similar two sets are. The scheme was invented by Andrei Broder (1997), and initially used in the AltaVista search engine to detect duplicate web pages and eliminate them from search results. It has also been applied in large-scale clustering problems, such as clustering documents by the similarity of their sets of words.
brand slug: wiki
category slug: encyclopedia
description: Data mining technique
original url: https://en.wikipedia.org/wiki/MinHash
date created:
date modified: 2023-12-04T23:20:29Z
main entity: {"identifier":"Q11091745","url":"https://www.wikidata.org/entity/Q11091745"}
image:
fields total: 13
integrity: 14

Related Entries

Explore Next Part