dc.contributor.author | Sarıgül, Mehmet | |
dc.contributor.author | Karacan, Levent | |
dc.date.accessioned | 2023-12-28T10:29:57Z | |
dc.date.available | 2023-12-28T10:29:57Z | |
dc.date.issued | 2023 | en_US |
dc.identifier.citation | Sarıgül, M., Karacan, L. (2023). Region contrastive camera localization. Pattern Recognition Letters, 169, pp. 110-117.
https://doi.org/10.1016/j.patrec.2023.03.030 | en_US |
dc.identifier.issn | 0167-8655 | |
dc.identifier.issn | 1872-7344 | |
dc.identifier.uri | https://doi.org/10.1016/j.patrec.2023.03.030 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12508/2836 | |
dc.description.abstract | Visual camera localization is a well-studied computer vision problem and has many applications. Recently, deep convolutional neural networks have begun to be utilized to solve six-degree-of-freedom (6-DoF) camera pose estimation via scene coordinate regression from a single RGB image and they outperform the traditional methods. However, recent works do not consider scene variations such as viewpoint, light, scale, etc due to the camera motion. In this work, we propose a region contrastive representation learning approach to alleviate these problems. The proposed approach maps image features from different camera views of the same 3D region to nearby points in the learned feature space. In contrast, it pushes visual features of other regions to distant points. Our method improves the existing camera localization methods and achieves state-of-the-art results on indoor 7-Scenes and outdoor Cambridge Landmarks datasets. Experimental results show that the proposed approach reduces the pose and angle errors and increases the average accuracy from 84.8% to 85.62% on the state-of-the-art baseline model. In addition, we perform an ablation study on a baseline network with different settings to demonstrate the efficiency of the proposed region contrastive camera localization method. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Elsevier | en_US |
dc.relation.isversionof | 10.1016/j.patrec.2023.03.030 | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Camera localization | en_US |
dc.subject | Contrastive learning | en_US |
dc.subject | Pose estimation | en_US |
dc.subject.classification | Image Retrieval | |
dc.subject.classification | Scene Recognition | |
dc.subject.classification | Bag | |
dc.subject.classification | Electrical Engineering, Electronics & Computer Science
- Robotics
- Simultaneous Localization and Mapping | |
dc.subject.other | Cameras | |
dc.subject.other | Convolutional neural networks | |
dc.subject.other | Deep neural networks | |
dc.subject.other | Camera localization | |
dc.subject.other | Camera pose estimation | |
dc.subject.other | Computer vision problems | |
dc.subject.other | Contrastive learning | |
dc.subject.other | Convolutional neural network | |
dc.subject.other | Localization method | |
dc.subject.other | Pose-estimation | |
dc.subject.other | RGB images | |
dc.subject.other | Six-degree-of-freedom (6-DoF) | |
dc.subject.other | State of the art | |
dc.subject.other | Degrees of freedom (mechanics) | |
dc.title | Region contrastive camera localization | en_US |
dc.type | article | en_US |
dc.relation.journal | Pattern Recognition Letters | en_US |
dc.contributor.department | Mühendislik ve Doğa Bilimleri Fakültesi -- Bilgisayar Mühendisliği Bölümü | en_US |
dc.identifier.volume | 169 | en_US |
dc.identifier.startpage | 110 | en_US |
dc.identifier.endpage | 117 | en_US |
dc.relation.tubitak | 120E447 | |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.contributor.isteauthor | Karacan, Levent | |
dc.relation.index | Web of Science - Scopus | en_US |
dc.relation.index | Web of Science Core Collection - Science Citation Index Expanded | |