Fisichella, M.: Siamese coding network and pair similarity prediction for near-duplicate image detection. In: International Journal of Multimedia Information Retrieval 11 (2022), Nr. 2, S. 159-170. DOI: https://doi.org/10.1007/s13735-022-00233-w
Abstract: | |
Near-duplicate detection in a dataset involves finding the elements that are closest to a new query element according to a given similarity function and proximity threshold. The brute force approach is very computationally intensive as it evaluates the similarity between the queried item and all items in the dataset. The potential application domain is an image sharing website that checks for plagiarism or piracy every time a new image is uploaded. Among the various approaches, near-duplicate detection was effectively addressed by SimPair LSH (Fisichella et al., in Decker, Lhotská, Link, Spies, Wagner (eds) Database and expert systems applications, Springer, 2014). As the name suggests, SimPair LSH uses locality sensitive hashing (LSH) and computes and stores in advance a small set of near-duplicate pairs present in the dataset and uses them to reduce the candidate set returned for a given query using the Triangle inequality. We develop an algorithm that predicts how the candidate set will be reduced. We also develop a new efficient method for near-duplicate image detection using a deep Siamese coding neural network that is able to extract effective features from images useful for building LSH indices. Extensive experiments on two benchmark datasets confirm the effectiveness of our deep Siamese coding network and prediction algorithm. | |
License of this version: | CC BY 4.0 Unported |
Document Type: | Article |
Publishing status: | publishedVersion |
Issue Date: | 2022 |
Appears in Collections: | Forschungszentren |
pos. | country | downloads | ||
---|---|---|---|---|
total | perc. | |||
1 | Germany | 16 | 38.10% | |
2 | United States | 11 | 26.19% | |
3 | Netherlands | 4 | 9.52% | |
4 | China | 2 | 4.76% | |
5 | Australia | 2 | 4.76% | |
6 | Vietnam | 1 | 2.38% | |
7 | Philippines | 1 | 2.38% | |
8 | Lebanon | 1 | 2.38% | |
9 | Greece | 1 | 2.38% | |
10 | France | 1 | 2.38% | |
other countries | 2 | 4.76% |
Hinweis
Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.