Vis enkel innførsel

dc.contributor.authorBreznik, Eva
dc.contributor.authorWetzer, Elisabeth
dc.contributor.authorLindblad, Joakim
dc.contributor.authorSladoje, Nataša
dc.date.accessioned2024-11-11T09:42:11Z
dc.date.available2024-11-11T09:42:11Z
dc.date.issued2024-08-13
dc.description.abstractIn tissue characterization and cancer diagnostics, multimodal imaging has emerged as a powerful technique. Thanks to computational advances, large datasets can be exploited to discover patterns in pathologies and improve diagnosis. However, this requires efficient and scalable image retrieval methods. Cross-modality image retrieval is particularly challenging, since images of similar (or even the same) content captured by different modalities might share few common structures. We propose a new application-independent content-based image retrieval (CBIR) system for reverse (sub-)image search across modalities, which combines deep learning to generate representations (embedding the different modalities in a common space) with robust feature extraction and bag-of-words models for efficient and reliable retrieval. We illustrate its advantages through a replacement study, exploring a number of feature extractors and learned representations, as well as through comparison to recent (cross-modality) CBIR methods. For the task of (sub-)image retrieval on a (publicly available) dataset of brightfield and second harmonic generation microscopy images, the results show that our approach is superior to all tested alternatives. We discuss the shortcomings of the compared methods and observe the importance of equivariance and invariance properties of the learned representations and feature extractors in the CBIR pipeline. Code is available at: https://github.com/MIDA-group/CrossModal_ImgRetrieval.en_US
dc.identifier.citationBreznik, Wetzer, Lindblad, Sladoje. Cross-modality sub-image retrieval using contrastive multimodal image representations. Scientific Reports. 2024;14(1)en_US
dc.identifier.cristinIDFRIDAID 2291677
dc.identifier.doi10.1038/s41598-024-68800-1
dc.identifier.issn2045-2322
dc.identifier.urihttps://hdl.handle.net/10037/35621
dc.language.isoengen_US
dc.publisherSpringer Natureen_US
dc.relation.journalScientific Reports
dc.relation.projectIDNorges forskningsråd: 309439en_US
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2024 The Author(s)en_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)en_US
dc.titleCross-modality sub-image retrieval using contrastive multimodal image representationsen_US
dc.type.versionpublishedVersionen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution 4.0 International (CC BY 4.0)
Med mindre det står noe annet, er denne innførselens lisens beskrevet som Attribution 4.0 International (CC BY 4.0)