Repository: gmberton/awesome-Visual-Place-Recognition
Branch: main
Commit: 3e1738a6d31b
Files: 104
Total size: 66.0 KB
Directory structure:
gitextract_gwxqzf2c/
├── .gitignore
├── LICENSE
├── README.md
├── citations/
│ ├── Alibey_2022_gsvcities.txt
│ ├── Alibey_2023_mixvpr.txt
│ ├── Arandjelovic_2016_NetVLAD.txt
│ ├── Asha_2019_todaygan.txt
│ ├── Barbarani_2023_reranking_benchmark.txt
│ ├── Barros_2022_Survey.txt
│ ├── Berton_2021_geowarp.txt
│ ├── Berton_2021_svox.txt
│ ├── Berton_2022_benchmark_berton.txt
│ ├── Berton_2022_cosPlace.txt
│ ├── Berton_2023_EigenPlaces.txt
│ ├── Berton_2024_meshvpr.txt
│ ├── Berton_2025_MegaLoc.txt
│ ├── Cai_2022_stun.txt
│ ├── Cao_2020_delg.txt
│ ├── CarlevarisBianco_2016_nclt.txt
│ ├── Cepeda_2023_GeoCLIP.txt
│ ├── Chen_2011_san_francisco_landmark.txt
│ ├── Cheng_2020_Unifying.txt
│ ├── Clark_2023_GeoGuessNet.txt
│ ├── Cummins_2009_eynsham.txt
│ ├── Doan_2019_scalable.txt
│ ├── Dusmanu_2019_D2Net.txt
│ ├── Fu_2023_matc_net.txt
│ ├── Fuwen_2021_reranking_transformers.txt
│ ├── Garg_2019_semanticGeometric.txt
│ ├── Garg_2021_SeqNet.txt
│ ├── Garg_2021_Survey.txt
│ ├── Garg_2024_revisitanything.txt
│ ├── Gastruc_2024_OSV5M.txt
│ ├── Ge_2020_sfrs.txt
│ ├── Geiger_2013_kitti.txt
│ ├── Gronat_2013_cvpr_pitts.txt
│ ├── Haas_2024_pigeon.txt
│ ├── Hausler_2019_fusion.txt
│ ├── Hausler_2021_patch_netvlad.txt
│ ├── Hausler_2025_PairVPR.txt
│ ├── Hays_2008_im2gps.txt
│ ├── Ibrahimi_2021_insideout_vpr.txt
│ ├── Izquierdo_2024_cliquemining.txt
│ ├── Izquierdo_2024_salad.txt
│ ├── Keetha_2023_AnyLoc.txt
│ ├── Khaliq_2024_vladbuff.txt
│ ├── Kim_2017_crn.txt
│ ├── Knopp_2010_geotagged_streetview.txt
│ ├── Kordopatis_2021_EfficientNetGeoloc.txt
│ ├── Lee_2022_cvnet.txt
│ ├── Leyvavallina_2021_gcl.txt
│ ├── Liu_2019_sare.txt
│ ├── Liu_2021_densernet.txt
│ ├── Lowry_2016_Survey.txt
│ ├── Lu_2024_CricaVPR.txt
│ ├── Lu_2024_SelaVPR.txt
│ ├── Lu_2025_SelaVPRpp.txt
│ ├── Maddern_2017_robotCar.txt
│ ├── Masone_2021_survey.txt
│ ├── Mereu_2022_SeqVLAD.txt
│ ├── Milford_2008_st_lucia.txt
│ ├── Milford_2012_SeqSLAM.txt
│ ├── Muller_2018_hierarchical_geolocation.txt
│ ├── Noh_2017_delf.txt
│ ├── Paolicelli_2022_Semantic_VPR.txt
│ ├── Peng_2021_appsvr.txt
│ ├── Peng_2021_sralNet.txt
│ ├── Pion_2020_benchmark_VisLoc.txt
│ ├── Pramanick_2022_transformer_geoloc.txt
│ ├── Revaud_2019_r2d2.txt
│ ├── Ros_2016_synthia.txt
│ ├── Sarlin_2020_superglue.txt
│ ├── Sattler_2012_aachen.txt
│ ├── Sattler_2018_aachen_daynight.txt
│ ├── Schubert_2021_graph_optimization.txt
│ ├── Schubert_2023_vpr_tutorial.txt
│ ├── Seo_2018_CPlaNet.txt
│ ├── Sferrazza_2025_match.txt
│ ├── Sun_2021_loftr.txt
│ ├── Sunderhauf_2013_nordland.txt
│ ├── Theiner_2022_WACV.txt
│ ├── Torii_2015_pitts250k.txt
│ ├── Torii_2018_tokyo247.txt
│ ├── Torii_2021_large_scale3D.txt
│ ├── Trivigno_2023_DivideClassify.txt
│ ├── Vo_2017_revIm2GPS.txt
│ ├── Vysotska_2016_sequences_matching.txt
│ ├── Wang_2019_ageAgnosticVPR.txt
│ ├── Wang_2022_TransVPR.txt
│ ├── Warburg_2020_msls.txt
│ ├── Warburg_2021_bayesian.txt
│ ├── Weyand_2016_PlaNet.txt
│ ├── Weyand_2020_gldv2.txt
│ ├── Yildiz_2022_AmsterTime.txt
│ ├── Yin_2025_GeneralPR.txt
│ ├── Zaffar_2021_vprbench.txt
│ ├── Zaffar_2024_estimation.txt
│ ├── Zamir_2014_102k_streetview.txt
│ ├── Zhang_2021_Survey.txt
│ ├── Zhang_2021_gated_netvlad.txt
│ ├── Zhang_2023_SpatioTemporalAttention.txt
│ ├── Zhu_2018_apanet.txt
│ └── Zhu_2023_R2Former.txt
└── how-to-PR.md
================================================
FILE CONTENTS
================================================
================================================
FILE: .gitignore
================================================
.spyproject
.idea
.DS_Store
================================================
FILE: LICENSE
================================================
MIT License
Copyright (c) 2023 Gabriele Berton
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
================================================
FILE: README.md
================================================
# Awesome Visual Place Recognition [](https://github.com/sindresorhus/awesome)
A curated list of papers on Visual Place Recognition and related fields, inspired by [awesome-NeRF](https://github.com/awesome-NeRF/awesome-NeRF).
PRs are very much appreciated.
#### [How to submit a pull request?](https://github.com/gmberton/awesome-Visual-Place-Recognition/blob/main/how-to-PR.md)
## Table of Contents
- [Surveys](#surveys)
- [Papers](#papers)
- [Talks](#talks)
## Surveys
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [Visual Place Recognition: A Tutorial](https://ieeexplore.ieee.org/document/10261441) | Stefan Schubert | RAM 2023 | | [BibTex](citations/Schubert_2023_vpr_tutorial.txt) |
| [Place recognition survey: An update on deep learning approaches](https://arxiv.org/abs/2106.10458) | Tiago Barros | ArXiv 2022 | | [BibTex](citations/Barros_2022_Survey.txt) |
| [General Place Recognition Survey: Towards the Real-world Autonomy Age](https://ieeexplore.ieee.org/abstract/document/10937370) | Peng Yin | T-RO 2025 | | [BibTex](citations/Yin_2025_GeneralPR.txt) |
| [A Survey on Deep Visual Place Recognition](https://ieeexplore.ieee.org/document/9336674) | Carlo Masone | IEEE Access 2021 | | [BibTex](citations/Masone_2021_survey.txt) |
| [Visual place recognition: A survey from deep learning perspective](https://www.sciencedirect.com/science/article/abs/pii/S003132032030563X) | Xiwu Zhang | Pattern Recognition 2021 | | [BibTex](citations/Zhang_2021_Survey.txt) |
| [Where is your place, Visual Place Recognition?](https://arxiv.org/abs/2103.06443) | Sourav Garg | IJCAI 2021 | | [BibTex](citations/Garg_2021_Survey.txt) |
| [Visual Place Recognition: A Survey](https://ieeexplore.ieee.org/document/7339473) | Stephanie Lowry | IEEE Transactions on Robotics 2016 | | [BibTex](citations/Lowry_2016_Survey.txt) |
## Papers
Papers are roughly split into categories, and a paper can (rarely) appear in more than one category. Papers are sorted in reverse chronological order within each category.
<details open>
<summary>Standard VPR</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [MegaLoc: One Retrieval to Place Them All](https://arxiv.org/abs/2502.17237) | Gabriele Berton | | [GitHub](https://github.com/gmberton/MegaLoc) | [BibTex](./citations/Berton_2025_MegaLoc.txt)
| [Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition](https://arxiv.org/abs/2407.02422) | Sergio Izquierdo | ECCV 2024 | [GitHub](https://github.com/serizba/cliquemining) | [BibTex](./citations/Izquierdo_2024_cliquemining.txt)
| [VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition](https://arxiv.org/abs/2409.19293) | Ahmad Khaliq | ECCV 2024 | [GitHub](https://github.com/Ahmedest61/VLAD-BuFF/) | [BibTex](./citations/Khaliq_2024_vladbuff.txt)
| [Revisit Anything: Visual Place Recognition via Image Segment Retrieval](https://arxiv.org/abs/2409.18049) | Kartik Garg | ECCV 2024 | [GitHub](https://github.com/AnyLoc/Revisit-Anything) | [Bibtex](./citations/Garg_2024_revisitanything.txt)
| [Optimal Transport Aggregation for Visual Place Recognition](https://arxiv.org/abs/2311.15937) | Sergio Izquierdo | CVPR 2024 | [GitHub](https://github.com/serizba/salad) | [BibTex](./citations/Izquierdo_2024_salad.txt) |
| [CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition](https://arxiv.org/abs/2402.19231) | Feng Lu | CVPR 2024 | [GitHub](https://github.com/Lu-Feng/CricaVPR) | [BibTex](./citations/Lu_2024_CricaVPR.txt) |
| [EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition](https://arxiv.org/abs/2308.10832) | Gabriele Berton | ICCV 2023 | [GitHub](https://github.com/gmberton/EigenPlaces) | [BibTex](./citations/Berton_2023_EigenPlaces.txt) |
| [AnyLoc: Towards Universal Visual Place Recognition](https://anyloc.github.io/) | Nikhil Keetha | ArXiv 2023 | [GitHub](https://github.com/AnyLoc/AnyLoc) | [BibTex](./citations/Keetha_2023_AnyLoc.txt) |
| [MixVPR: Feature Mixing for Visual Place Recognition](https://arxiv.org/abs/2303.02190) | Ali-bey Amar | WACV 2023 | [GitHub](https://github.com/amaralibey/MixVPR) | [BibTex](./citations/Alibey_2023_mixvpr.txt) |
| [Data-efficient Large Scale Place Recognition with Graded Similarity Supervision](https://arxiv.org/abs/2303.11739) | María Leyva-Vallina | CVPR 2023 | [GitHub](https://github.com/marialeyvallina/generalized_contrastive_loss) | [BibTex](./citations/Leyvavallina_2021_gcl.txt) |
| [Rethinking Visual Geo-localization for Large-Scale Applications](https://arxiv.org/abs/2204.02287) | Berton Gabriele | CVPR 2022 | [GitHub](https://github.com/gmberton/CosPlace) | [BibTex](./citations/Berton_2022_cosPlace.txt) |
| [GSV-Cities: Toward appropriate supervised visual place recognition](https://arxiv.org/abs/2210.10239) | Ali-bey Amar | Neurocomputing 2022 | [GitHub](https://github.com/amaralibey/gsv-cities) | [BibTex](./citations/Alibey_2022_gsvcities.txt) |
| [Learning Semantics for Visual Place Recognition through Multi-Scale Attention](https://arxiv.org/abs/2201.09701) | Valerio Paolicelli | ICIAP 2022 | [GitHub](https://github.com/valeriopaolicelli/SegVPR) | [BibTex](./citations/Paolicelli_2022_Semantic_VPR.txt) |
| [Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition](https://openaccess.thecvf.com/content/ICCV2021/papers/Peng_Attentional_Pyramid_Pooling_of_Salient_Visual_Residuals_for_Place_Recognition_ICCV_2021_paper.pdf) | Peng Guohao | ICCV 2021 | | [BibTex](./citations/Peng_2021_appsvr.txt) |
| [Vector of Locally and Adaptively Aggregated Descriptors for Image Feature Representation](https://www.sciencedirect.com/science/article/abs/pii/S0031320321001394) | Jian Zhang | PR 2021 | | [BibTex](./citations/Zhang_2021_gated_netvlad.txt) |
| [Semantic Reinforced Attention Learning for Visual Place Recognition](https://arxiv.org/abs/2108.08443) | Guohao Peng | ICRA 2021 | | [BibTex](./citations/Peng_2021_sralNet.txt) |
| [Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization](https://arxiv.org/abs/2006.03926) | Ge Yixiao | ECCV 2020 | [GitHub](https://github.com/yxgeee/OpenIBL) | [BibTex](./citations/Ge_2020_sfrs.txt) |
| [Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization](https://arxiv.org/abs/1808.08779) | Liu Liu | ICCV 2019 | | [BibTex](./citations/Liu_2019_sare.txt) |
| [Attention-based Pyramid Aggregation Network for Visual Place Recognition](https://arxiv.org/abs/1808.00288) | Yingying Zhu | ACM MM 2018 | | [BibTex](./citations/Zhu_2018_apanet.txt) |
| [Learned Contextual Feature Reweighting for Image Geo-Localization](https://openaccess.thecvf.com/content_cvpr_2017/papers/Kim_Learned_Contextual_Feature_CVPR_2017_paper.pdf) | Kim Hyo Jin | CVPR 2017 | | [BibTex](./citations/Kim_2017_crn.txt) |
| [NetVLAD: CNN architecture for weakly supervised place recognition](https://arxiv.org/abs/1511.07247) | Relja Arandjelović | CVPR 2016 | [GitHub](https://github.com/Relja/netvlad) | [BibTex](./citations/Arandjelovic_2016_NetVLAD.txt) |
| [Visual Place Recognition with Repetitive Structures](https://openaccess.thecvf.com/content_cvpr_2013/papers/Torii_Visual_Place_Recognition_2013_CVPR_paper.pdf) | A. Torii | PAMI 2015 | | [BibTex](./citations/Torii_2015_pitts250k.txt) |
| [Learning and Calibrating Per-Location Classifiers for Visual Place Recognition](https://www.cv-foundation.org/openaccess/content_cvpr_2013/papers/Gronat_Learning_and_Calibrating_2013_CVPR_paper.pdf) | Gronat Petr | CVPR 2013 | | [BibTex](./citations/Gronat_2013_cvpr_pitts.txt) |
</details>
<details open>
<summary>VPR Datasets</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [Rethinking Visual Geo-localization for Large-Scale Applications](https://arxiv.org/abs/2204.02287) | Berton Gabriele | CVPR 2022 | [GitHub](https://github.com/gmberton/CosPlace) | [BibTex](./citations/Berton_2022_cosPlace.txt) |
| [GSV-Cities: Toward appropriate supervised visual place recognition](https://arxiv.org/abs/2210.10239) | Ali-bey Amar | Neurocomputing 2022 | [GitHub](https://github.com/amaralibey/gsv-cities) | [BibTex](./citations/Alibey_2022_gsvcities.txt) |
| [AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift](https://arxiv.org/abs/2203.16291) | B. Yildiz | ICPR 2022 | | [BibTex](./citations/Yildiz_2022_AmsterTime.txt) |
| [Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach](https://arxiv.org/abs/2010.06897) | Berton Gabriele | WACV 2021 | [GitHub](https://github.com/valeriopaolicelli/adageo-pytorch) | [BibTex](./citations/Berton_2021_svox.txt) |
| [Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval](https://arxiv.org/abs/2004.01804) | Tobias Weyand | CVPR 2020 | [GitHub](https://github.com/cvdfoundation/google-landmark) | [BibTex](./citations/Weyand_2020_gldv2.txt) |
| [Mapillary Street-Level Sequences: A Dataset for Lifelong Place Recognition](https://openaccess.thecvf.com/content_CVPR_2020/papers/Warburg_Mapillary_Street-Level_Sequences_A_Dataset_for_Lifelong_Place_Recognition_CVPR_2020_paper.pdf) | Warburg Frederik | CVPR 2020 | [GitHub](https://github.com/mapillary/mapillary_sls) | [BibTex](./citations/Warburg_2020_msls.txt) |
| [24/7 Place Recognition by View Synthesis](https://openaccess.thecvf.com/content_cvpr_2015/papers/Torii_247_Place_Recognition_2015_CVPR_paper.pdf) | A. Torii | PAMI 2018 | | [BibTex](./citations/Torii_2018_tokyo247.txt) |
| [Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions](https://arxiv.org/abs/1707.09092) | T. Sattler | cvpr 2018 | | [BibTex](./citations/Sattler_2018_aachen_daynight.txt) |
| [1 Year, 1000km: The Oxford RobotCar Dataset](https://robotcar-dataset.robots.ox.ac.uk/images/robotcar_ijrr.pdf) | W. Maddern | IJRR 2017 | [website](https://oxford-robotics-institute.github.io/radar-robotcar-dataset/) | | [BibTex](./citations/Maddern_2017_robotCar.txt) |
| [The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes](https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Ros_The_SYNTHIA_Dataset_CVPR_2016_paper.pdf) | G. Ros | cvpr 2016 | | [BibTex](./citations/Ros_2016_synthia.txt) |
| [University of Michigan North Campus long-term vision and lidar dataset](http://robots.engin.umich.edu/nclt/nclt.pdf) | N. Carlevaris-Bianco | ijrr 2016 | [webisite](http://robots.engin.umich.edu/nclt/) | | [BibTex](./citations/CarlevarisBianco_2016_nclt.txt) |
| [Image Geo-localization Based on Multiple Nearest Neighbor Feature Matching using Generalized Graphs](https://www.crcv.ucf.edu/papers/PAMI_Amir%20Zamir.pdf) | Zamir A.R. | PAMI 2014 | [website](https://www.crcv.ucf.edu/projects/GMCP_Geolocalization/) | | [BibTex](./citations/Zamir_2014_102k_streetview.txt) |
| [Vision meets robotics: The KITTI dataset](https://www.cvlibs.net/publications/Geiger2013IJRR.pdf) | A Geiger | IJRR 2013 | [website](https://www.cvlibs.net/datasets/kitti/) | | [BibTex](./citations/Geiger_2013_kitti.txt) |
| [Are we there yet? Challenging SeqSLAM on a 3000 km journey across all four seasons](https://www.researchgate.net/publication/283623386_Are_we_there_yet_challenging_SeqSLAM_on_a_3000_km_journey_across_all_four_seasons) | N. Suenderhauf | ICRAW 2013 | | [BibTex](./citations/Sunderhauf_2013_nordland.txt) |
| [Image retrieval for image-based localization revisited](https://www.graphics.rwth-aachen.de/media/papers/sattler_weyand_bmvc12.pdf) | Torsten Sattler | cvpr 2012 | | [BibTex](./citations/Sattler_2012_aachen.txt) |
| [City-scale landmark identification on mobile devices](https://ieeexplore.ieee.org/document/5995610) | D. M. Chen | CVPR 2011 | | [BibTex](./citations/Chen_2011_san_francisco_landmark.txt) |
| [Avoiding confusing features in place recognition](https://www.researchgate.net/publication/221304796_Avoiding_Confusing_Features_in_Place_Recognition) | Knopp J. | ECCV 2010 | | [BibTex](./citations/Knopp_2010_geotagged_streetview.txt) |
| [Highly scalable appearance-only SLAM - FAB-MAP 2.0](https://www.roboticsproceedings.org/rss05/p39.pdf) | M. Cummins | RSS 2009 | | [BibTex](./citations/Cummins_2009_eynsham.txt) |
| [Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System](https://www.researchgate.net/publication/224329582_Mapping_a_Suburb_With_a_Single_Camera_Using_a_Biologically_Inspired_SLAM_System) | Michael Milford | TRO 2008 | | [BibTex](./citations/Milford_2008_st_lucia.txt) |
</details>
<details open>
<summary>Uncertainty estimation in VPR</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition](https://arxiv.org/abs/2504.06116) | Davide Sferrazza | ArXiv 2025 | [GitHub](https://github.com/FarInHeight/To-Match-or-Not-to-Match) | [BibTex](./citations/Sferrazza_2025_match.txt) |
| [On the Estimation of Image-matching Uncertainty in Visual Place Recognition](https://arxiv.org/abs/2404.00546) | Mubariz Zaffar | CVPR 2024 | [GitHub](https://github.com/MubarizZaffar/SUE) | [BibTex](./citations/Zaffar_2024_estimation.txt) |
| [STUN: Self-Teaching Uncertainty Estimation for Place Recognition](https://arxiv.org/abs/2203.01851) | Kaiwen Cai | IROS 2022 | [GitHub](https://github.com/ramdrop/stun) | [BibTex](./citations/Cai_2022_stun.txt) |
| [Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval](https://arxiv.org/abs/2011.12663) | Frederik Warburg | ICCV 2021 | | [BibTex](./citations/Warburg_2021_bayesian.txt) |
</details>
<details open>
<summary>Cross-domain VPR</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach](https://arxiv.org/abs/2010.06897) | Berton Gabriele | WACV 2021 | [GitHub](https://github.com/valeriopaolicelli/adageo-pytorch) | [BibTex](./citations/Berton_2021_svox.txt) |
| [Inside Out Visual Place Recognition](https://arxiv.org/abs/2111.13546) | Sarah Ibrahimi | BMVC 2021 | [GitHub](https://github.com/saibr/IOVPR) | [BibTex](./citations/Ibrahimi_2021_insideout_vpr.txt) |
| [Night-to-day image translation for retrieval-based localization](https://arxiv.org/abs/1809.09767) | Anoosheh Asha | ICRA 2019 | | [BibTex](./citations/Asha_2019_todaygan.txt) |
| [Attention-Aware Age-Agnostic Visual Place Recognition](https://arxiv.org/abs/1909.05163) | Ziqi Wang | ICCVW 2019 | | [BibTex](./citations/Wang_2019_ageAgnosticVPR.txt) |
| [Lazy Data Association For Image Sequences Matching Under Substantial Appearance Changes](https://www.ipb.uni-bonn.de/pdfs/vysotska16ral-icra.pdf) | Olga Vysotska | RAL 2016 | | [BibTex](./citations/Vysotska_2016_sequences_matching.txt) |
</details>
<details open>
<summary>Sequence-based VPR</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [Learning Sequence Descriptor based on Spatio-Temporal Attention for Visual Place Recognition](https://arxiv.org/abs/2305.11467) | Fenglin Zhang | ArXiv 2023 | | [BibTex](./citations/Zhang_2023_SpatioTemporalAttention.txt) |
| [MATC-Net: Learning compact sequence representation for hierarchical loop closure detection](https://www.sciencedirect.com/science/article/abs/pii/S0952197623009181) | Fuji Fu | 2023 | | [BibTex](./citations/Fu_2023_matc_net.txt) |
| [Learning Sequential Descriptors for Sequence-Based Visual Place Recognition](https://arxiv.org/abs/2207.03868) | Riccardo Mereu | RAL 2022 | [GitHub](https://github.com/vandal-vpr/vg-transformers) | [BibTex](./citations/Mereu_2022_SeqVLAD.txt) |
| [SeqNet: Learning Descriptors for Sequence-based Hierarchical Place Recognition](https://arxiv.org/abs/2102.11603) | Sourav Garg | RAL 2021 | [GitHub](https://github.com/oravus/seqNet) | [BibTex](./citations/Garg_2021_SeqNet.txt) |
| [Fast and Memory Efficient Graph Optimization via ICM for Visual Place Recognition](https://www.roboticsproceedings.org/rss17/p091.pdf) | Stefan Schubert | RSS 2021 | | [BibTex](./citations/Schubert_2021_graph_optimization.txt) |
| [SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights](https://ieeexplore.ieee.org/document/6224623) | Michael Milford | ICRA 2012 | | [BibTex](./citations/Milford_2012_SeqSLAM.txt) |
</details>
<details open>
<summary>Re-ranking</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition](https://arxiv.org/abs/2502.16601) | Feng Lu | | [GitHub](https://github.com/Lu-Feng/SelaVPR) | [BibTex](./citations/Lu_2025_SelaVPRpp.txt) |
| [Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers](http://arxiv.org/abs/2410.06614) | Stephen Hausler | IEEE RA-L 2025 | [GitHub](https://github.com/csiro-robotics/Pair-VPR) | [BibTex](./citations/Hausler_2025_PairVPR.txt) | [BibTex](./citations/Hausler_2025_PairVPR.txt)
| [Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition](https://arxiv.org/abs/2402.14505) | Feng Lu | ICLR 2024 | [GitHub](https://github.com/Lu-Feng/SelaVPR) | [BibTex](./citations/Lu_2024_SelaVPR.txt) |
| [Are Local Features All You Need for Cross-Domain Visual Place Recognition?](https://arxiv.org/abs/2304.05887) | Giovanni Barbarani | CVPRW 2023 | [GitHub](https://github.com/gbarbarani/re-ranking-for-VPR) | [BibTex](./citations/Barbarani_2023_reranking_benchmark.txt) |
| [R2former: Unified retrieval and reranking transformer for place recognition](https://arxiv.org/abs/2304.03410) | Sijie Zhu | CVPR 2023 | [GitHub](https://github.com/bytedance/R2Former) | [BibTex](./citations/Zhu_2023_R2Former.txt) |
| [TransVPR: Transformer-Based Place Recognition With Multi-Level Attention Aggregation](https://arxiv.org/abs/2201.02001) | Wang Ruotong | CVPR 2022 | | [BibTex](./citations/Wang_2022_TransVPR.txt) |
| [Correlation Verification for Image Retrieval](https://arxiv.org/abs/2204.01458) | Lee Seongwon | CVPR 2022 | [GitHub](https://github.com/sungonce/CVNet) | [BibTex](./citations/Lee_2022_cvnet.txt) |
| [Viewpoint Invariant Dense Matching for Visual Geolocalization](https://arxiv.org/abs/2109.09827) | Berton Gabriele | ICCV 2021 | [GitHub](https://github.com/gmberton/geo_warp) | [BibTex](./citations/Berton_2021_geowarp.txt) |
| [Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition](https://arxiv.org/abs/2103.01486) | Hausler Stephen | CVPR 2021 | [GitHub](https://github.com/QVPR/Patch-NetVLAD) | [BibTex](./citations/Hausler_2021_patch_netvlad.txt) |
| [LoFTR: Detector-Free Local Feature Matching with Transformers](https://arxiv.org/abs/2104.00680) | Sun Jiaming | CVPR 2021 | [GitHub](https://github.com/zju3dv/LoFTR) | [BibTex](./citations/Sun_2021_loftr.txt) |
| [Instance-level Image Retrieval using Reranking Transformers](https://arxiv.org/abs/2103.12236) | Fuwen Tan | ICCV 2021 | [GitHub](https://github.com/uvavision/RerankingTransformer) | [BibTex](./citations/Fuwen_2021_reranking_transformers.txt) |
| [DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation](https://arxiv.org/abs/2012.02366) | Liu Dongfang | AAAI 2021 | | [BibTex](./citations/Liu_2021_densernet.txt) |
| [Unifying Deep Local and Global Features for Image Search](https://arxiv.org/abs/2001.05027) | B. Cao | eccv 2020 | | [BibTex](./citations/Cao_2020_delg.txt) |
| [SuperGlue: Learning Feature Matching with Graph Neural Networks](https://arxiv.org/abs/1911.11763) | Paul-Edouard Sarlin and | CVPR 2020 | [GitHub](https://github.com/magicleap/SuperGluePretrainedNetwork) | [BibTex](./citations/Sarlin_2020_superglue.txt) |
| [R2D2: Repeatable and Reliable Detector and Descriptor](https://arxiv.org/abs/1906.06195) | Jerome Revaud | NIPS 2019 | [GitHub](https://github.com/naver/r2d2) | [BibTex](./citations/Revaud_2019_r2d2.txt) |
| [D2-Net: A Trainable CNN for Joint Detection and Description of Local Features](https://arxiv.org/abs/1905.03561) | Dusmanu Mihai | CVPR 2019 | [GitHub](https://github.com/mihaidusmanu/d2-net) | [BibTex](./citations/Dusmanu_2019_D2Net.txt) |
| [Large-Scale Image Retrieval with Attentive Deep Local Features](https://arxiv.org/abs/1612.06321) | Noh Hyeonwoo | ICCV 2017 | | [BibTex](./citations/Noh_2017_delf.txt) |
</details>
<details open>
<summary>Benchmarks</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [Are Local Features All You Need for Cross-Domain Visual Place Recognition?](https://arxiv.org/abs/2304.05887) | Giovanni Barbarani | CVPRW 2023 | [GitHub](https://github.com/gbarbarani/re-ranking-for-VPR) | [BibTex](./citations/Barbarani_2023_reranking_benchmark.txt) |
| [Deep Visual Geo-localization Benchmark](https://arxiv.org/abs/2204.03444) | Berton Gabriele | CVPR 2022 | [GitHub](https://github.com/gmberton/deep-visual-geo-localization-benchmark) | [BibTex](./citations/Berton_2022_benchmark_berton.txt) |
| [VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change](https://arxiv.org/abs/2005.08135) | Zaffar Mubariz | IJCV 2021 | [GitHub](https://github.com/MubarizZaffar/VPR-Bench) | [BibTex](./citations/Zaffar_2021_vprbench.txt) |
| [Benchmarking Image Retrieval for Visual Localization](https://arxiv.org/abs/2011.11946) | Pion Noe | 3DV 2020 | | [BibTex](./citations/Pion_2020_benchmark_VisLoc.txt) |
| [Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions](https://arxiv.org/abs/1707.09092) | T. Sattler | cvpr 2018 | | [BibTex](./citations/Sattler_2018_aachen_daynight.txt) |
</details>
<details open>
<summary>World-wide Geo-Localization</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [PIGEON: Predicting Image Geolocations](https://arxiv.org/abs/2307.05845) | Lukas Haas | CVPR 2024 | [GitHub](https://github.com/LukasHaas/PIGEON) | [BibTex](./citations/Haas_2024_pigeon.txt) |
| [OpenStreetView-5M: The Many Roads to Global Visual Geolocation](https://arxiv.org/abs/2404.18873) | Guillaume Astruc | CVPR 2024 | [GitHub](https://github.com/gastruc/osv5m) | [BibTex](./citations/Gastruc_2024_OSV5M.txt) |
| [GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization](https://arxiv.org/abs/2309.16020) | Vicente Vivanco Cepeda | NeurIPS 2023 | [GitHub](https://github.com/VicenteVivan/geo-clip) | [BibTex](./citations/Cepeda_2023_GeoCLIP.txt) |
| [Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes](https://arxiv.org/abs/2303.04249) | Brandon Clark | CVPR 2023 | [GitHub](https://github.com/AHKerrigan/GeoGuessNet) | [BibTex](./citations/Clark_2023_GeoGuessNet.txt) |
| [Interpretable Semantic Photo Geolocation](https://arxiv.org/abs/2104.14995) | Theiner Jonas | WACV 2022 | [GitHub](https://github.com/jtheiner/semantic_geo_partitioning) | [BibTex](./citations/Theiner_2022_WACV.txt) |
| [Where in the World is this Image? Transformer-based Geo-localization in the Wild](https://arxiv.org/abs/2204.13861) | Pramanick Shraman | ECCV 2022 | [GitHub](https://github.com/ShramanPramanick/Transformer_Based_Geo-localization) | [BibTex](./citations/Pramanick_2022_transformer_geoloc.txt) |
| [Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation](https://arxiv.org/abs/2105.07645) | Giorgos Kordopatis-Zilos | ICMR 2021 | [GitHub](https://github.com/mever-team/visloc-estimation) | [BibTex](./citations/Kordopatis_2021_EfficientNetGeoloc.txt) |
| [Geolocation Estimation of Photos Using a Hierarchical Model and Scene Classification](https://openaccess.thecvf.com/content_ECCV_2018/papers/Eric_Muller-Budack_Geolocation_Estimation_of_ECCV_2018_paper.pdf) | Muller-Budack Eric | ECCV 2018 | [GitHub](https://github.com/TIBHannover/GeoEstimation) | [BibTex](./citations/Muller_2018_hierarchical_geolocation.txt) |
| [CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps](https://arxiv.org/abs/1808.02130) | Paul Hongsuck Seo | ECCV 2018 | | [BibTex](./citations/Seo_2018_CPlaNet.txt) |
| [Revisiting IM2GPS in the Deep Learning Era](https://arxiv.org/abs/1705.04838) | Vo Nam | ICCV 2017 | | [BibTex](./citations/Vo_2017_revIm2GPS.txt) |
| [PlaNet - Photo Geolocation with Convolutional Neural Networks](https://arxiv.org/abs/1602.05314) | Tobias Weyand | ECCV 2016 | | [BibTex](./citations/Weyand_2016_PlaNet.txt) |
| [IM2GPS: estimating geographic information from a single image](http://graphics.cs.cmu.edu/projects/im2gps/im2gps.pdf) | James Hays | CVPR 2008 | | [BibTex](./citations/Hays_2008_im2gps.txt) |
</details>
<details open>
<summary>Others</summary>
| Title | First Author | Venue | Github | Bibtex |
|---|---|---|---|---|
| [MeshVPR: Citywide Visual Place Recognition Using 3D Meshes](https://arxiv.org/abs/2406.02776) | Gabriele Berton | ECCV 2024 | [GitHub](https://github.com/gmberton/MeshVPR) | [Bibtex](./citations/Berton_2024_meshvpr.txt)
| [Divide&Classify: Fine-Grained Classification for City-Wide Visual Geo-localization](https://arxiv.org/abs/2307.08417) | Gabriele Trivigno | ICCV 2023 | [GitHub](https://github.com/ga1i13o/Divide-and-Classify) | [BibTex](./citations/Trivigno_2023_DivideClassify.txt) |
| [Unifying Visual Localization and Scene Recognition for People With Visual Impairment](https://www.researchgate.net/publication/340327129_Unifying_Visual_Localization_and_Scene_Recognition_for_People_With_Visual_Impairment) | R. Cheng | IEEE Access 2020 | | [BibTex](./citations/Cheng_2020_Unifying.txt) |
| [Scalable Place Recognition Under Appearance Change for Autonomous Driving](https://arxiv.org/abs/1908.00178) | A. D. Doan | ICCV 2019 | | [BibTex](./citations/Doan_2019_scalable.txt) |
| [Multi-Process Fusion: Visual Place Recognition Using Multiple Image Processing Methods](https://arxiv.org/abs/1903.03305) | S. Hausler | ral 2019 | [GitHub](https://github.com/StephenHausler/Multi-Process-Fusion) | [BibTex](./citations/Hausler_2019_fusion.txt) |
| [Semantic–geometric visual place recognition: a new perspective for reconciling opposing views](https://www.researchgate.net/publication/332280264_Semantic-geometric_visual_place_recognition_a_new_perspective_for_reconciling_opposing_views) | S. Garg | IJRR 2019 | | [BibTex](./citations/Garg_2019_semanticGeometric.txt) |
| [Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?](https://openaccess.thecvf.com/content_cvpr_2017/papers/Sattler_Are_Large-Scale_3D_CVPR_2017_paper.pdf) | A. Torii | PAMI 2021 | | [BibTex](./citations/Torii_2021_large_scale3D.txt) |
</details>
## Talks
- [Benchmarking urban visual geo-localization](https://www.youtube.com/watch?v=snxP0yrabnQ), Carlo Masone, CVPR 2023
- [Urban visual geo-localization: towards large-scale applications](https://youtu.be/snxP0yrabnQ?si=2R_wH9SCE_90-P-b&t=1629), Gabriele Berton, CVPR 2023
- [Large-Scale Visual Localization](https://www.youtube.com/watch?v=RaVPiIGhdWk), Giorgos Tolias, Yannis Avrithis, Zuzana Kukelova, Torsten Sattler, Sudipta n: Sinha, Eric Brachmann, ICCV 2021
- [General Place Recognition Competition](https://www.youtube.com/watch?v=xpEKOyJ7OIU), Yimin Zhang, Luca Carlone, Michael Milford, Junyan Zhu, Ji Zhang, Sebastian Scherer, ICRA 2022
- [Tutorial : Large-Scale Visual Place Recognition and Image-Based Localization Part 1](https://www.youtube.com/watch?v=GDMLjzbEth8), Akihiko Torii, Giorgos Tolias, CVPR 2017
- [Tutorial : Large-Scale Visual Place Recognition and Image-Based Localization Part 2](https://www.youtube.com/watch?v=947W99gAvQ8), Torsten Sattler, Alex Kendall, CVPR 2017
## License
MIT
================================================
FILE: citations/Alibey_2022_gsvcities.txt
================================================
@article{Alibey_2022_gsvcities,
title={GSV-Cities: Toward appropriate supervised visual place recognition},
author={Ali-bey, Amar and Chaib-draa, Brahim and Giguere, Philippe},
journal={Neurocomputing},
year={2022},
}
================================================
FILE: citations/Alibey_2023_mixvpr.txt
================================================
@inproceedings{Alibey_2023_mixvpr,
title={MixVPR: Feature Mixing for Visual Place Recognition},
author={Ali-bey, Amar and Chaib-draa, Brahim and Giguere, Philippe},
booktitle={IEEE Winter Conference on Applications of Computer Vision (WACV)},
year={2023},
}
================================================
FILE: citations/Arandjelovic_2016_NetVLAD.txt
================================================
@inproceedings{Arandjelovic_2016_NetVLAD,
title={{NetVLAD}: {CNN} Architecture for Weakly Supervised Place Recognition},
author={{Arandjelović}, Relja and Gronat, Petr and Torii, Akihiko and Pajdla, Tomas and Sivic, Josef},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2016},
}
================================================
FILE: citations/Asha_2019_todaygan.txt
================================================
@inproceedings{Asha_2019_todaygan,
title={Night-to-day image translation for retrieval-based localization},
author={Anoosheh, Asha and Sattler, Torsten and Timofte, Radu and Pollefeys, Marc and Van Gool, Luc},
booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
year={2019},
}
================================================
FILE: citations/Barbarani_2023_reranking_benchmark.txt
================================================
@inproceedings{Barbarani_2023_reranking_benchmark,
title={Are Local Features All You Need for Cross-Domain Visual Place Recognition?},
author={Barbarani, Giovanni and Mostafa, Mohamad and Bayramov, Hajali and Trivigno, Gabriele and Berton, Gabriele and Masone, Carlo and Caputo, Barbara},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
year={2023},
}
================================================
FILE: citations/Barros_2022_Survey.txt
================================================
@article{Barros_2022_Survey,
title={Place recognition survey: An update on deep learning approaches},
author={Tiago Barros, Ricardo Pereira, Luís Garrote, Cristiano Premebida, Urbano J. Nunes},
journal={ArXiv},
year={2022},
}
================================================
FILE: citations/Berton_2021_geowarp.txt
================================================
@inproceedings{Berton_2021_geowarp,
title={Viewpoint Invariant Dense Matching for Visual Geolocalization},
author={Berton, Gabriele and Masone, Carlo and Paolicelli, Valerio and Caputo, Barbara},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2021},
}
================================================
FILE: citations/Berton_2021_svox.txt
================================================
@inproceedings{Berton_2021_svox,
title={Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach},
author={Berton, Gabriele and Paolicelli, Valerio and Masone, Carlo and Caputo, Barbara},
booktitle={IEEE Winter Conference on Applications of Computer Vision (WACV)},
year={2021},
}
================================================
FILE: citations/Berton_2022_benchmark_berton.txt
================================================
@inproceedings{Berton_2022_benchmark_berton,
title={Deep Visual Geo-localization Benchmark},
author={Berton, Gabriele and Mereu, Riccardo and Trivigno, Gabriele and Masone, Carlo and Csurka, Gabriela and Sattler, Torsten and Caputo, Barbara},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2022},
}
================================================
FILE: citations/Berton_2022_cosPlace.txt
================================================
@inproceedings{Berton_2022_cosPlace,
title={Rethinking Visual Geo-localization for Large-Scale Applications},
author={Berton, Gabriele and Masone, Carlo and Caputo, Barbara},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2022},
}
================================================
FILE: citations/Berton_2023_EigenPlaces.txt
================================================
@inproceedings{Berton_2023_EigenPlaces,
title={EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition},
author={Berton, Gabriele and Trivigno, Gabriele and Masone, Carlo and Caputo, Barbara},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2023},
}
================================================
FILE: citations/Berton_2024_meshvpr.txt
================================================
@InProceedings{Berton_2024_meshvpr,
author="Berton, Gabriele
and Junglas, Lorenz
and Zaccone, Riccardo
and Pollok, Thomas
and Caputo, Barbara
and Masone, Carlo",
editor="Leonardis, Ale{\v{s}}
and Ricci, Elisa
and Roth, Stefan
and Russakovsky, Olga
and Sattler, Torsten
and Varol, G{\"u}l",
title="MeshVPR: Citywide Visual Place Recognition Using 3D Meshes",
booktitle="Computer Vision -- ECCV 2024",
year="2025",
publisher="Springer Nature Switzerland",
address="Cham",
pages="321--339",
abstract="Mesh-based scene representation offers a promising direction for simplifying large-scale hierarchical visual localization pipelines, combining a visual place recognition step based on global features (retrieval) and a visual localization step based on local features. While existing work demonstrates the viability of meshes for visual localization, the impact of using synthetic databases rendered from them in visual place recognition remains largely unexplored. In this work we investigate using dense 3D textured meshes for large-scale Visual Place Recognition (VPR). We identify a significant performance drop when using synthetic mesh-based image databases compared to real-world images for retrieval. To address this, we propose MeshVPR, a novel VPR pipeline that utilizes a lightweight features alignment framework to bridge the gap between real-world and synthetic domains. MeshVPR leverages pre-trained VPR models and is efficient and scalable for city-wide deployments. We introduce novel datasets with freely available 3D meshes and manually collected queries from Berlin, Paris, and Melbourne. Extensive evaluations demonstrate that MeshVPR achieves competitive performance with standard VPR pipelines, paving the way for mesh-based localization systems. Data, code, and interactive visualizations are available at https://meshvpr.github.io/.",
isbn="978-3-031-72904-1"
}
================================================
FILE: citations/Berton_2025_MegaLoc.txt
================================================
@misc{Berton_2025_MegaLoc,
title={MegaLoc: One Retrieval to Place Them All},
author={Gabriele Berton and Carlo Masone},
year={2025},
eprint={2502.17237},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2502.17237},
}
================================================
FILE: citations/Cai_2022_stun.txt
================================================
@inproceedings{cai2022stun,
title={STUN: Self-teaching uncertainty estimation for place recognition},
author={Cai, Kaiwen and Lu, Chris Xiaoxuan and Huang, Xiaowei},
booktitle={2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
pages={6614--6621},
year={2022},
organization={IEEE}
}
================================================
FILE: citations/Cao_2020_delg.txt
================================================
@inproceedings{Cao_2020_delg,
title={Unifying Deep Local and Global Features for Image Search},
author={B. Cao and A. Araujo and J. Sim},
booktitle={European Conference on Computer Vision (ECCV)},
year={2020},
}
================================================
FILE: citations/CarlevarisBianco_2016_nclt.txt
================================================
@article{CarlevarisBianco_2016_nclt,
title={University of Michigan North Campus long-term vision and lidar dataset},
author={N. {Carlevaris-Bianco} and A. K. Ushani and R. M. Eustice},
journal={The International Journal of Robotics Research},
year={2016},
}
================================================
FILE: citations/Cepeda_2023_GeoCLIP.txt
================================================
@inproceedings{Cepeda_2023_GeoCLIP,
author = {Vivanco Cepeda, Vicente and Nayak, Gaurav Kumar and Shah, Mubarak},
booktitle = {Advances in Neural Information Processing Systems},
editor = {A. Oh and T. Naumann and A. Globerson and K. Saenko and M. Hardt and S. Levine},
pages = {8690--8701},
publisher = {Curran Associates, Inc.},
title = {GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization},
url = {https://proceedings.neurips.cc/paper_files/paper/2023/file/1b57aaddf85ab01a2445a79c9edc1f4b-Paper-Conference.pdf},
volume = {36},
year = {2023}
}
================================================
FILE: citations/Chen_2011_san_francisco_landmark.txt
================================================
@inproceedings{Chen_2011_san_francisco_landmark,
title={City-scale landmark identification on mobile devices},
author={D. M. {Chen} and G. {Baatz} and K. {Köser} and S. S. {Tsai} and R. {Vedantham} and T. {Pylvänäinen} and K. {Roimela} and X. {Chen} and J. {Bach} and M. {Pollefeys} and B. {Girod} and R. {Grzeszczuk},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2011},
}
================================================
FILE: citations/Cheng_2020_Unifying.txt
================================================
@article{Cheng_2020_Unifying,
title={Unifying Visual Localization and Scene Recognition for People With Visual Impairment},
author={R. {Cheng} and K. {Wang} and J. {Bai} and Z. {Xu},
journal={IEEE Access},
year={2020},
}
================================================
FILE: citations/Clark_2023_GeoGuessNet.txt
================================================
@INPROCEEDINGS{Clark_2023_GeoGuessNet,
author={Clark, Brandon and Kerrigan, Alec and Kulkarni, Parth Parag and Cepeda, Vicente Vivanco and Shah, Mubarak},
booktitle={2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
title={Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes},
year={2023},
volume={},
number={},
pages={23182-23190},
keywords={Visualization;Computer vision;Social networking (online);Urban areas;Feature extraction;Transformers;Pattern recognition;Scene analysis and understanding},
doi={10.1109/CVPR52729.2023.02220}}
================================================
FILE: citations/Cummins_2009_eynsham.txt
================================================
@inproceedings{Cummins_2009_eynsham,
title={Highly scalable appearance-only SLAM - FAB-MAP 2.0},
author={M. Cummins and P. Newman},
booktitle={Robotics: Science and Systems (RSS)},
year={2009},
}
================================================
FILE: citations/Doan_2019_scalable.txt
================================================
@inproceedings{Doan_2019_scalable,
title={Scalable Place Recognition Under Appearance Change for Autonomous Driving},
author={A. D. Doan and Y. Latif and T.-J. Chin and Y. Liu and T.-T. Do and I. Reid},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2019},
}
================================================
FILE: citations/Dusmanu_2019_D2Net.txt
================================================
@inproceedings{Dusmanu_2019_D2Net,
title={D2-Net: A Trainable CNN for Joint Detection and Description of Local Features},
author={Dusmanu, Mihai and Rocco, Ignacio and Pajdla, Tomas and Pollefeys, Marc and Sivic, Josef and Torii, Akihiko and Sattler, Torsten},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2019},
}
================================================
FILE: citations/Fu_2023_matc_net.txt
================================================
@article{Fu_2023_matc_net,
title={MATC-Net: Learning compact sequence representation for hierarchical loop closure detection},
author={Fuji Fu and Jinfu Yang and Jiahui Zhang and Jiaqi Ma},
journal={Engineering Applications of Artificial Intelligence},
year={2023},
}
================================================
FILE: citations/Fuwen_2021_reranking_transformers.txt
================================================
@inproceedings{Fuwen_2021_reranking_transformers,
title={Instance-level Image Retrieval using Reranking Transformers},
author={Fuwen Tan and Jiangbo Yuan and Vicente Ordonez},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2021},
}
================================================
FILE: citations/Garg_2019_semanticGeometric.txt
================================================
@article{Garg_2019_semanticGeometric,
title={Semantic–geometric visual place recognition: a new perspective for reconciling opposing views},
author={S. Garg and N. Suenderhauf and M. Milford},
journal={The International Journal of Robotics Research},
year={2019},
}
================================================
FILE: citations/Garg_2021_SeqNet.txt
================================================
@article{Garg_2021_SeqNet,
title={SeqNet: Learning Descriptors for Sequence-based Hierarchical Place Recognition},
author={Garg, Sourav and Milford, Michael},
journal={IEEE Robotics and Automation Letters},
year={2021},
}
================================================
FILE: citations/Garg_2021_Survey.txt
================================================
@article{Garg_2021_Survey,
title={Where is your place, Visual Place Recognition?},
author={Sourav Garg and Tobias Fischer and Michael Milford},
journal={ArXiv},
year={2021},
}
================================================
FILE: citations/Garg_2024_revisitanything.txt
================================================
@InProceedings{Garg_2024_revisitanything,
author="Garg, Kartik
and Puligilla, Sai Shubodh
and Kolathaya, Shishir
and Krishna, Madhava
and Garg, Sourav",
editor="Leonardis, Ale{\v{s}}
and Ricci, Elisa
and Roth, Stefan
and Russakovsky, Olga
and Sattler, Torsten
and Varol, G{\"u}l",
title="Revisit Anything: Visual Place Recognition via Image Segment Retrieval",
booktitle="Computer Vision -- ECCV 2024",
year="2025",
publisher="Springer Nature Switzerland",
address="Cham",
pages="326--343",
abstract="Accurately recognizing a revisited place is crucial for embodied agents to localize and navigate. This requires visual representations to be distinct, despite strong variations in camera viewpoint and scene appearance. Existing visual place recognition pipelines encode the whole image and search for matches. This poses a fundamental challenge in matching two images of the same place captured from different camera viewpoints: the similarity of what overlaps can be dominated by the dissimilarity of what does not overlap. We address this by encoding and searching for image segments instead of the whole images. We propose to use open-set image segmentation to decompose an image into `meaningful' entities (i.e., things and stuff). This enables us to create a novel image representation as a collection of multiple overlapping subgraphs connecting a segment with its neighboring segments, dubbed SuperSegment. Furthermore, to efficiently encode these SuperSegments into compact vector representations, we propose a novel factorized representation of feature aggregation. We show that retrieving these partial representations leads to significantly higher recognition recall than the typical whole image based retrieval. Our segments-based approach, dubbed SegVLAD, sets a new state-of-the-art in place recognition on a diverse selection of benchmark datasets, while being applicable to both generic and task-specialized image encoders. Finally, we demonstrate the potential of our method to ``revisit anything'' by evaluating our method on an object instance retrieval task, which bridges the two disparate areas of research: visual place recognition and object-goal navigation, through their common aim of recognizing goal objects specific to a place. Source code: https://github.com/AnyLoc/Revisit-Anything.",
isbn="978-3-031-73113-6"
}
================================================
FILE: citations/Gastruc_2024_OSV5M.txt
================================================
@INPROCEEDINGS{Gastruc_2024_OSV5M,
author={Astruc, Guillaume and Dufour, Nicolas and Siglidis, Ioannis and Aronssohn, Constantin and Bouia, Nacim and Fu, Stephanie and Loiseau, Romain and Nguyen, Van Nguyen and Raude, Charles and Vincent, Elliot and Xu, Lintao and Zhou, Hongyu and Landrieu, Loic},
booktitle={2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
title={OpenStreetView-5M: The Many Roads to Global Visual Geolocation},
year={2024},
volume={},
number={},
pages={21967-21977},
keywords={Training;Visualization;Computer vision;Geology;Computational modeling;Roads;Self-supervised learning;Geolocation;Street view;Open-access},
doi={10.1109/CVPR52733.2024.02074}}
================================================
FILE: citations/Ge_2020_sfrs.txt
================================================
@inproceedings{Ge_2020_sfrs,
title={Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization},
author={Ge, Yixiao and Wang, Haibo and Zhu, Feng and Zhao, Rui and Li, Hongsheng},
booktitle={European Conference on Computer Vision (ECCV)},
year={2020},
}
================================================
FILE: citations/Geiger_2013_kitti.txt
================================================
@article{Geiger_2013_kitti,
title={Vision meets robotics: The KITTI dataset},
author={A Geiger and P Lenz and C Stiller and R Urtasun},
journal={The International Journal of Robotics Research},
year={2013},
}
================================================
FILE: citations/Gronat_2013_cvpr_pitts.txt
================================================
@inproceedings{Gronat_2013_cvpr_pitts,
title={Learning and Calibrating Per-Location Classifiers for Visual Place Recognition},
author={Gronat, Petr and Obozinski, Guillaume and Sivic, Josef and Pajdla, Tomá},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2013},
}
================================================
FILE: citations/Haas_2024_pigeon.txt
================================================
@INPROCEEDINGS{Haas_2024_pigeon,
author={Haas, Lukas and Skreta, Michal and Alberti, Silas and Finn, Chelsea},
booktitle={2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
title={PIGEON: Predicting Image Geolocations},
year={2024},
volume={},
number={},
pages={12893-12902},
keywords={Computer vision;Accuracy;Geology;Semantics;Games;Benchmark testing;Internet;Image Geolocalization;Visual Place Recognition;Photo Geolocalization;Computer Vision;Semantic Geocells;Multi-Task Pretraining;Haversine;Location Refinement;Clustering;Voronoi;Multi-Modal;GeoGuessr;Retrieval;Blind Experiment;Multi-Task;Pretraining;CLIP},
doi={10.1109/CVPR52733.2024.01225}}
================================================
FILE: citations/Hausler_2019_fusion.txt
================================================
@article{Hausler_2019_fusion,
title={Multi-Process Fusion: Visual Place Recognition Using Multiple Image Processing Methods},
author={S. {Hausler} and A. {Jacobson} and M. {Milford},
journal={IEEE Robotics and Automation Letters (RAL)},
year={2019},
}
================================================
FILE: citations/Hausler_2021_patch_netvlad.txt
================================================
@inproceedings{Hausler_2021_patch_netvlad,
title={Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition},
author={Hausler, Stephen and Garg, Sourav and Xu, Ming and Milford, Michael and Fischer, Tobias},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2021},
}
================================================
FILE: citations/Hausler_2025_PairVPR.txt
================================================
@ARTICLE{Hausler_2025_PairVPR,
author={Hausler, Stephen and Moghadam, Peyman},
journal={IEEE Robotics and Automation Letters},
title={Pair-VPR: Place-Aware Pre-Training and Contrastive Pair Classification for Visual Place Recognition With Vision Transformers},
year={2025},
volume={},
number={},
pages={1-8},
keywords={Visual place recognition;Transformers;Decoding;Training;Image reconstruction;Computer vision;Benchmark testing;Image recognition;Data models;Visualization;Deep Learning for Visual Perception;Recognition;Localization},
doi={10.1109/LRA.2025.3546512}}
================================================
FILE: citations/Hays_2008_im2gps.txt
================================================
@inproceedings{Hays_2008_im2gps,
title={im2gps: estimating geographic information from a single image},
author={James Hays and Alexei A. Efros},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2008},
}
================================================
FILE: citations/Ibrahimi_2021_insideout_vpr.txt
================================================
@inproceedings{Ibrahimi_2021_insideout_vpr,
title={Inside Out Visual Place Recognition},
author={Sarah Ibrahimi and Nanne van Noord and Tim Alpherts and Marcel Worring},
booktitle={British Machine Vision Conference (BMVC)},
year={2021},
}
================================================
FILE: citations/Izquierdo_2024_cliquemining.txt
================================================
@InProceedings{Izquierdo_2024_cliquemining,
author="Izquierdo, Sergio
and Civera, Javier",
editor="Leonardis, Ale{\v{s}}
and Ricci, Elisa
and Roth, Stefan
and Russakovsky, Olga
and Sattler, Torsten
and Varol, G{\"u}l",
title="Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition",
booktitle="Computer Vision -- ECCV 2024",
year="2025",
publisher="Springer Nature Switzerland",
address="Cham",
pages="240--257",
abstract="Visual Place Recognition (VPR) plays a critical role in many localization and mapping pipelines. It consists of retrieving the closest sample to a query image, in a certain embedding space, from a database of geotagged references. The image embedding is learned to effectively describe a place despite variations in visual appearance, viewpoint, and geometric changes. In this work, we formulate how limitations in the Geographic Distance Sensitivity of current VPR embeddings result in a high probability of incorrectly sorting the top-k retrievals, negatively impacting the recall. In order to address this issue in single-stage VPR, we propose a novel mining strategy, CliqueMining, that selects positive and negative examples by sampling cliques from a graph of visually similar images. Our approach boosts the sensitivity of VPR embeddings at small distance ranges, significantly improving the state of the art on relevant benchmarks. In particular, we raise recall@1 from 75{\%} to 82{\%} in MSLS Challenge, and from 76{\%} to 90{\%} in Nordland. Models and code are available at https://github.com/serizba/cliquemining.",
isbn="978-3-031-73464-9"
}
================================================
FILE: citations/Izquierdo_2024_salad.txt
================================================
@InProceedings{Izquierdo_CVPR_2024_SALAD,
author = {Izquierdo, Sergio and Civera, Javier},
title = {Optimal Transport Aggregation for Visual Place Recognition},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
}
================================================
FILE: citations/Keetha_2023_AnyLoc.txt
================================================
@article{Keetha_2023_AnyLoc,
title={AnyLoc: Towards Universal Visual Place Recognition},
author={Nikhil Keetha and Avneesh Mishra and Jay Karhade and Krishna Murthy Jatavallabhula and Sebastian Scherer and Madhava Krishna and Sourav Garg},
journal={arXiv},
year={2023},
}
================================================
FILE: citations/Khaliq_2024_vladbuff.txt
================================================
@InProceedings{Khaliq_2024_vladbuff,
author="Khaliq, Ahmad
and Xu, Ming
and Hausler, Stephen
and Milford, Michael
and Garg, Sourav",
editor="Leonardis, Ale{\v{s}}
and Ricci, Elisa
and Roth, Stefan
and Russakovsky, Olga
and Sattler, Torsten
and Varol, G{\"u}l",
title="VLAD-BuFF: Burst-Aware Fast Feature Aggregation for Visual Place Recognition",
booktitle="Computer Vision -- ECCV 2024",
year="2025",
publisher="Springer Nature Switzerland",
address="Cham",
pages="447--466",
abstract="Visual Place Recognition (VPR) is a crucial component of many visual localization pipelines for embodied agents. VPR is often formulated as an image retrieval task aimed at jointly learning local features and an aggregation method. The current state-of-the-art VPR methods rely on VLAD aggregation, which can be trained to learn a weighted contribution of features through their soft assignment to cluster centers. However, this process has two key limitations. Firstly, the feature-to-cluster weighting does not account for over-represented repetitive structures within a cluster, e.g., shadows or window panes; this phenomenon is also referred to as the `burstiness' problem, classically solved by discounting repetitive features before aggregation. Secondly, feature to cluster comparisons are compute-intensive for state-of-the-art image encoders with high-dimensional local features. This paper addresses these limitations by introducing VLAD-BuFF with two novel contributions: i) a self-similarity based feature discounting mechanism to learn Burst-aware features within end-to-end VPR training, and ii) Fast Feature aggregation by reducing local feature dimensions specifically through PCA-initialized learnable pre-projection. We benchmark our method on 9 public datasets, where VLAD-BuFF sets a new state of the art. Our method is able to maintain its high recall even for {\$}{\$}{\{}12{\}}{\{}{\backslash}times {\}}{\$}{\$}12{\texttimes}reduced local feature dimensions, thus enabling fast feature aggregation without compromising on recall. Through additional qualitative studies, we show how our proposed weighting method effectively downweights the non-distinctive features. Source code: https://github.com/Ahmedest61/VLAD-BuFF/.",
isbn="978-3-031-72784-9"
}
================================================
FILE: citations/Kim_2017_crn.txt
================================================
@inproceedings{Kim_2017_crn,
title={Learned Contextual Feature Reweighting for Image Geo-Localization},
author={Kim, Hyo Jin and Dunn, Enrique and Frahm, Jan-Michael},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2017},
}
================================================
FILE: citations/Knopp_2010_geotagged_streetview.txt
================================================
@inproceedings{Knopp_2010_geotagged_streetview,
title={Avoiding confusing features in place recognition},
author={Knopp, J. and Sivic, J. and Pajdla, T.},
booktitle={European Conference on Computer Vision (ECCV)},
year={2010},
}
================================================
FILE: citations/Kordopatis_2021_EfficientNetGeoloc.txt
================================================
@article{Kordopatis_2021_EfficientNetGeoloc,
title={Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation},
author={Giorgos Kordopatis-Zilos and Panagiotis Galopoulos and S. Papadopoulos and Y. Kompatsiaris},
journal={ACM International Conference on Multimedia Retrieval (ICMR)},
year={2021},
}
================================================
FILE: citations/Lee_2022_cvnet.txt
================================================
@inproceedings{Lee_2022_cvnet,
title={Correlation Verification for Image Retrieval},
author={Lee, Seongwon and Seong, Hongje and Lee, Suhyeon and Kim, Euntai},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2022},
}
================================================
FILE: citations/Leyvavallina_2021_gcl.txt
================================================
@article{Leyvavallina_2021_gcl,
title={Data-efficient Large Scale Place Recognition with Graded Similarity Supervision},
author={María Leyva-Vallina and Nicola Strisciuglio and Nicolai Petkov},
journal={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2023},
}
================================================
FILE: citations/Liu_2019_sare.txt
================================================
@inproceedings{Liu_2019_sare,
title={Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization},
author={Liu, Liu and Li, Hongdong and Dai, Yuchao},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2019},
}
================================================
FILE: citations/Liu_2021_densernet.txt
================================================
@article{Liu_2021_densernet,
title={DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation},
author={Liu, Dongfang and Cui, Yiming and Yan, Liqi and Mousas, Christos and Yang, Baijian and Chen, Yingjie},
journal={Association for the Advancement of Artificial Intelligence (AAAI)},
year={2021},
}
================================================
FILE: citations/Lowry_2016_Survey.txt
================================================
@article{Lowry_2016_Survey,
title={Visual Place Recognition: A Survey},
author={Lowry, Stephanie and Saunderhauf, Niko and Newman, Paul and Leonard, John J. and Cox, David and Corke, Peter and Milford, Michael J.},
journal={IEEE Transactions on Robotics},
year={2016},
}
================================================
FILE: citations/Lu_2024_CricaVPR.txt
================================================
@InProceedings{Lu_CVPR_2024_CricaVPR,
author = {Feng, Lu and Xiangyuan, Lan and Lijun, Zhang and Dongmei, Jiang and Yaowei, Wang and Chun, Yuan},
title = {CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
}
================================================
FILE: citations/Lu_2024_SelaVPR.txt
================================================
@inproceedings{selavpr,
title={Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition},
author={Lu, Feng and Zhang, Lijun and Lan, Xiangyuan and Dong, Shuting and Wang, Yaowei and Yuan, Chun},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024}
}
================================================
FILE: citations/Lu_2025_SelaVPRpp.txt
================================================
@misc{Lu_2025_SelaVPRpp,
title={SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition},
author={Feng Lu and Tong Jin and Xiangyuan Lan and Lijun Zhang and Yunpeng Liu and Yaowei Wang and Chun Yuan},
year={2025},
eprint={2502.16601},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2502.16601},
}
================================================
FILE: citations/Maddern_2017_robotCar.txt
================================================
@article{Maddern_2017_robotCar,
title={1 Year, 1000km: The Oxford RobotCar Dataset},
author={W. Maddern and G. Pascoe and C. Linegar and P. Newman},
journal={The International Journal of Robotics Research},
year={2017},
}
================================================
FILE: citations/Masone_2021_survey.txt
================================================
@article{Masone_2021_survey,
title={A Survey on Deep Visual Place Recognition},
author={Masone, Carlo and Caputo, Barbara},
journal={IEEE Access},
year={2021},
}
================================================
FILE: citations/Mereu_2022_SeqVLAD.txt
================================================
@article{Mereu_2022_SeqVLAD,
title={Learning Sequential Descriptors for Sequence-Based Visual Place Recognition},
author={Mereu, Riccardo and Trivigno, Gabriele and Berton, Gabriele and Masone, Carlo and Caputo, Barbara},
journal={IEEE Robotics and Automation Letters},
year={2022},
}
================================================
FILE: citations/Milford_2008_st_lucia.txt
================================================
@article{Milford_2008_st_lucia,
title={Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System},
author={Michael Milford and G. Wyeth},
journal={IEEE Transactions on Robotics},
year={2008},
}
================================================
FILE: citations/Milford_2012_SeqSLAM.txt
================================================
@inproceedings{Milford_2012_SeqSLAM,
title={SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights},
author={Milford, Michael J. and Wyeth, Gordon. F.},
booktitle={2012 IEEE International Conference on Robotics and Automation},
year={2012},
}
================================================
FILE: citations/Muller_2018_hierarchical_geolocation.txt
================================================
@inproceedings{Muller_2018_hierarchical_geolocation,
title={Geolocation Estimation of Photos Using a Hierarchical Model and Scene Classification},
author={Muller-Budack, Eric and Pustu-Iren, Kader and Ewerth, Ralph},
booktitle={European Conference on Computer Vision (ECCV)},
year={2018},
}
================================================
FILE: citations/Noh_2017_delf.txt
================================================
@inproceedings{Noh_2017_delf,
title={Large-Scale Image Retrieval with Attentive Deep Local Features},
author={Noh, Hyeonwoo and Araujo, Andre and Sim, Jack and Weyand, Tobias and Han, Bohyung},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2017},
}
================================================
FILE: citations/Paolicelli_2022_Semantic_VPR.txt
================================================
@inproceedings{Paolicelli_2022_Semantic_VPR,
title={Learning Semantics for Visual Place Recognition through Multi-Scale Attention},
author={Paolicelli, Valerio and Tavera, Antonio and Masone, Carlo and Berton, Gabriele and Caputo, Barbara},
booktitle={International Conference on Image Analysis and Processing (ICIAP)},
year={2022},
}
================================================
FILE: citations/Peng_2021_appsvr.txt
================================================
@inproceedings{Peng_2021_appsvr,
title={Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition},
author={Peng, Guohao and Zhang, Jun and Li, Heshan and Wang, Danwei},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2021},
}
================================================
FILE: citations/Peng_2021_sralNet.txt
================================================
@inproceedings{Peng_2021_sralNet,
title={Semantic Reinforced Attention Learning for Visual Place Recognition},
author={Guohao Peng and Yufeng Yue and Jun Zhang and Zhenyu Wu and Xiaoyu Tang and Danwei Wang},
booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
year={2021},
}
================================================
FILE: citations/Pion_2020_benchmark_VisLoc.txt
================================================
@inproceedings{Pion_2020_benchmark_VisLoc,
title={Benchmarking Image Retrieval for Visual Localization},
author={Pion, Noe and Humenberger, Martin and Csurka, Gabriela and Cabon, Yohann and Sattler, Torsten},
booktitle={INTERNATIONAL CONFERENCE ON 3D VISION (3DV)},
year={2020},
}
================================================
FILE: citations/Pramanick_2022_transformer_geoloc.txt
================================================
@inproceedings{Pramanick_2022_transformer_geoloc,
title={Where in the World is this Image? Transformer-based Geo-localization in the Wild},
author={Pramanick, Shraman and Nowara, Ewa M and Gleason, Joshua and Castillo, Carlos D and Chellappa, Rama},
booktitle={European Conference on Computer Vision (ECCV)},
year={2022},
}
================================================
FILE: citations/Revaud_2019_r2d2.txt
================================================
@inproceedings{Revaud_2019_r2d2,
title={R2D2: Repeatable and Reliable Detector and Descriptor},
author={Jerome Revaud and Philippe Weinzaepfel and C{'{e}}sar Roberto de Souza and Martin Humenberger},
booktitle={Conference on Neural Information Processing Systems (NeurIPS)},
year={2019},
}
================================================
FILE: citations/Ros_2016_synthia.txt
================================================
@inproceedings{Ros_2016_synthia,
title={The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes},
author={G. Ros and L. Sellart and J. Materzynska and D. Vazquez and Lopez and M. Antonio},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2016},
}
================================================
FILE: citations/Sarlin_2020_superglue.txt
================================================
@inproceedings{Sarlin_2020_superglue,
title={SuperGlue: Learning Feature Matching with Graph Neural Networks},
author={Paul-Edouard Sarlin and Daniel DeTone and Tomasz Malisiewicz and Andrew Rabinovich},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2020},
}
================================================
FILE: citations/Sattler_2012_aachen.txt
================================================
@inproceedings{Sattler_2012_aachen,
title={Image retrieval for image-based localization revisited},
author={Torsten Sattler and Tobias Weyand and Bastian Leibe and Leif Kobbelt},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2012},
}
================================================
FILE: citations/Sattler_2018_aachen_daynight.txt
================================================
@inproceedings{Sattler_2018_aachen_daynight,
title={Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions},
author={T. Sattler and W. Maddern and C. Toft and A. Torii and L. Hammarstrand and E. Stenborg and D. Safari and M. Okutomi and M. Pollefeys and J. Sivic and F. Kahl and T. Pajdla},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2018},
}
================================================
FILE: citations/Schubert_2021_graph_optimization.txt
================================================
@inproceedings{Schubert_2021_graph_optimization,
title = {Fast and Memory Efficient Graph Optimization via ICM for Visual Place Recognition},
author = {Schubert, Stefan and Neubert, Peer and Protzel, Peter},
booktitle={Robotics: Science and Systems (RSS)},
year = {2021},
month = {07},
pages = {},
doi = {10.15607/RSS.2021.XVII.091}
}
================================================
FILE: citations/Schubert_2023_vpr_tutorial.txt
================================================
@inproceedings{Schubert_2023_vpr_tutorial,
title={Visual Place Recognition: A Tutorial},
author={Schubert, Stefan and Neubert, Peer and Garg, Sourav and Milford, Michael and Fischer, Tobias},
journal={IEEE Robotics \& Automation Magazine},
year={2023},
doi={10.1109/MRA.2023.3310859}
}
================================================
FILE: citations/Seo_2018_CPlaNet.txt
================================================
@inproceedings{Seo_2018_CPlaNet,
title={CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps},
author={Paul Hongsuck Seo and Tobias Weyand and Jack Sim and Bohyung Han},
booktitle={European Conference on Computer Vision (ECCV)},
year={2018},
}
================================================
FILE: citations/Sferrazza_2025_match.txt
================================================
@article{sferrazza2025match,
title={To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition},
author={Sferrazza, Davide and Berton, Gabriele and Trivigno, Gabriele and Masone, Carlo},
journal={arXiv preprint arXiv:2504.06116},
year={2025}
}
================================================
FILE: citations/Sun_2021_loftr.txt
================================================
@article{Sun_2021_loftr,
title={LoFTR: Detector-Free Local Feature Matching with Transformers},
author={Sun, Jiaming and Shen, Zehong and Wang, Yuang and Bao, Hujun and Zhou, Xiaowei},
journal={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2021},
}
================================================
FILE: citations/Sunderhauf_2013_nordland.txt
================================================
@inproceedings{Sunderhauf_2013_nordland,
title={Are we there yet? Challenging SeqSLAM on a 3000 km journey across all four seasons},
author={N. Suenderhauf and P. Neubert and P. Protzel},
booktitle={IEEE International Conference on Robotics and Automation Workshops},
year={2013},
}
================================================
FILE: citations/Theiner_2022_WACV.txt
================================================
@inproceedings{Theiner_2022_WACV,
title={Interpretable Semantic Photo Geolocation},
author={Theiner, Jonas and Muller-Budack, Eric and Ewerth, Ralph},
booktitle={IEEE Winter Conference on Applications of Computer Vision (WACV)},
year={2022},
}
================================================
FILE: citations/Torii_2015_pitts250k.txt
================================================
@article{Torii_2015_pitts250k,
title={Visual Place Recognition with Repetitive Structures},
author={A. {Torii} and J. {Sivic} and M. {Okutomi} and T. {Pajdla},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
year={2015},
}
================================================
FILE: citations/Torii_2018_tokyo247.txt
================================================
@article{Torii_2018_tokyo247,
title={24/7 Place Recognition by View Synthesis},
author={A. {Torii} and R. {Arandjelović} and J. {Sivic} and M. {Okutomi} and T. {Pajdla},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
year={2018},
}
================================================
FILE: citations/Torii_2021_large_scale3D.txt
================================================
@article{Torii_2021_large_scale3D,
title={Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?},
author={A. Torii and Hajime Taira and Josef Sivic and M. Pollefeys and M. Okutomi and T. Pajdla and Torsten Sattler},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
year={2021},
}
================================================
FILE: citations/Trivigno_2023_DivideClassify.txt
================================================
@inproceedings{Trivigno_2023_DivideClassify,
title={Divide&Classify: Fine-Grained Classification for City-Wide Visual Geo-localization},
author={Trivigno, Gabriele and Berton, Gabriele and Masone, Carlo and Caputo, Barbara},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2023},
}
================================================
FILE: citations/Vo_2017_revIm2GPS.txt
================================================
@inproceedings{Vo_2017_revIm2GPS,
title={Revisiting IM2GPS in the Deep Learning Era},
author={Vo, Nam and Jacobs, Nathan and Hays, James},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2017},
}
================================================
FILE: citations/Vysotska_2016_sequences_matching.txt
================================================
@article{Vysotska_2016_sequences_matching,
title={Lazy Data Association For Image Sequences Matching Under Substantial Appearance Changes},
author={Olga Vysotska and C. Stachniss},
journal={IEEE Robotics and Automation Letters},
year={2016},
volume={1},
pages={213-220},
url={https://api.semanticscholar.org/CorpusID:14752571}
}
================================================
FILE: citations/Wang_2019_ageAgnosticVPR.txt
================================================
@inproceedings{Wang_2019_ageAgnosticVPR,
title={Attention-Aware Age-Agnostic Visual Place Recognition},
author={Wang, Ziqi and Li, Jiahui and Khademi, Seyran and van Gemert, Jan},
booktitle={IEEE International Conference on Computer Vision (ICCV) Workshops},
year={2019},
}
================================================
FILE: citations/Wang_2022_TransVPR.txt
================================================
@inproceedings{Wang_2022_TransVPR,
title={TransVPR: Transformer-Based Place Recognition With Multi-Level Attention Aggregation},
author={Wang, Ruotong and Shen, Yanqing and Zuo, Weiliang and Zhou, Sanping and Zheng, Nanning},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2022},
}
================================================
FILE: citations/Warburg_2020_msls.txt
================================================
@inproceedings{Warburg_2020_msls,
title={Mapillary Street-Level Sequences: A Dataset for Lifelong Place Recognition},
author={Warburg, Frederik and Hauberg, Soren and Lopez-Antequera, Manuel and Gargallo, Pau and Kuang, Yubin and Civera, Javier},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2020},
}
================================================
FILE: citations/Warburg_2021_bayesian.txt
================================================
@inproceedings{warburg2021bayesian,
title={Bayesian triplet loss: Uncertainty quantification in image retrieval},
author={Warburg, Frederik and J{\o}rgensen, Martin and Civera, Javier and Hauberg, S{\o}ren},
booktitle={Proceedings of the IEEE/CVF International conference on Computer Vision},
pages={12158--12168},
year={2021}
}
================================================
FILE: citations/Weyand_2016_PlaNet.txt
================================================
@inproceedings{Weyand_2016_PlaNet,
title={PlaNet - Photo Geolocation with Convolutional Neural Networks},
author={Tobias Weyand and Ilya Kostrikov and James Philbin},
booktitle={European Conference on Computer Vision (ECCV)},
year={2016},
}
================================================
FILE: citations/Weyand_2020_gldv2.txt
================================================
@inproceedings{Weyand_2020_gldv2,
title={Google Landmarks Dataset v2 – A Large-Scale Benchmark for Instance-Level Recognition and Retrieval},
author={Tobias Weyand and A. Ara{'u}jo and Bingyi Cao and Jack Sim},
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2020},
}
================================================
FILE: citations/Yildiz_2022_AmsterTime.txt
================================================
@inproceedings{Yildiz_2022_AmsterTime,
title={AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift},
author={B. Yildiz and S. Khademi and R. Siebes and J. Van Gemert},
booktitle={International Conference on Pattern Recognition (ICPR)},
year={2022},
}
================================================
FILE: citations/Yin_2025_GeneralPR.txt
================================================
@Article{yin2025general,
title={General place recognition survey: Towards real-world autonomy},
author={Yin, Peng and Jiao, Jianhao and Zhao, Shiqi and Xu, Lingyun and Huang, Guoquan and Choset, Howie and Scherer, Sebastian and Han, Jianda},
journal={IEEE Transactions on Robotics},
year={2025},
publisher={IEEE}
}
================================================
FILE: citations/Zaffar_2021_vprbench.txt
================================================
@article{Zaffar_2021_vprbench,
title={VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change},
author={Zaffar, Mubariz and Garg, Sourav and Milford, Michael and Kooij, Julian and Flynn, David and McDonald-Maier, Klaus and Ehsan, Shoaib},
journal={International Journal of Computer Vision (IJCV)},
year={2021},
}
================================================
FILE: citations/Zaffar_2024_estimation.txt
================================================
@inproceedings{zaffar2024estimation,
title={On the estimation of image-matching uncertainty in visual place recognition},
author={Zaffar, Mubariz and Nan, Liangliang and Kooij, Julian FP},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={17743--17753},
year={2024}
}
================================================
FILE: citations/Zamir_2014_102k_streetview.txt
================================================
@article{Zamir_2014_102k_streetview,
title={Image Geo-localization Based on Multiple Nearest Neighbor Feature Matching using Generalized Graphs},
author={Zamir, A.R. and Shah, M.},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
year={2014},
}
================================================
FILE: citations/Zhang_2021_Survey.txt
================================================
@article{Zhang_2021_Survey,
title={Visual place recognition: A survey from deep learning perspective},
author={Xiwu Zhang and Lei Wang and Yan Su},
journal={Pattern Recognition},
year={2021},
}
================================================
FILE: citations/Zhang_2021_gated_netvlad.txt
================================================
@article{Zhang_2021_gated_netvlad,
title={Vector of Locally and Adaptively Aggregated Descriptors for Image Feature Representation},
author={Jian Zhang and Yunyin Cao and Qun Wu},
journal={Pattern Recognition},
year={2021},
}
================================================
FILE: citations/Zhang_2023_SpatioTemporalAttention.txt
================================================
@article{Zhang_2023_SpatioTemporalAttention,
title={Learning Sequence Descriptor based on Spatio-Temporal Attention for Visual Place Recognition},
author={Fenglin Zhang and Junqiao Zhao and Yingfeng Cai and Gengxuan Tian and Wenjie Mu and Chen Ye},
journal={ArXiv},
year={2023},
}
================================================
FILE: citations/Zhu_2018_apanet.txt
================================================
@inproceedings{Zhu_2018_apanet,
title={Attention-based Pyramid Aggregation Network for Visual Place Recognition},
author={Yingying Zhu and Jiong Wang and Lingxi Xie and Liang Zheng},
booktitle={ACM Multimedia},
year={2018},
}
================================================
FILE: citations/Zhu_2023_R2Former.txt
================================================
@inproceedings{Zhu_2023_R2Former,
title={R2former: Unified retrieval and reranking transformer for place recognition},
author={Zhu, Sijie and Yang, Linjie and Chen, Chen and Shah, Mubarak and Shen, Xiaohui and Wang, Heng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2023},
}
================================================
FILE: how-to-PR.md
================================================
1. Put the bibtex in `awesome-Visual-Place-Recognition/citations/<YOUR-PROJECT>.txt`.
2. Modify the `README.md` and follow the format - [TITLE](link to paper), AUTHOR, CONFERENCE YEAR | [github](optional_link) | [bibtex](link). For example:
- [NetVLAD: CNN architecture for weakly supervised place recognition](https://arxiv.org/abs/1511.07247), Relja Arandjelović et al., CVPR 2016 | [github](https://github.com/Relja/netvlad) | [bibtex](./citations/Arandjelovic_2018_NetVLAD.txt)
gitextract_gwxqzf2c/ ├── .gitignore ├── LICENSE ├── README.md ├── citations/ │ ├── Alibey_2022_gsvcities.txt │ ├── Alibey_2023_mixvpr.txt │ ├── Arandjelovic_2016_NetVLAD.txt │ ├── Asha_2019_todaygan.txt │ ├── Barbarani_2023_reranking_benchmark.txt │ ├── Barros_2022_Survey.txt │ ├── Berton_2021_geowarp.txt │ ├── Berton_2021_svox.txt │ ├── Berton_2022_benchmark_berton.txt │ ├── Berton_2022_cosPlace.txt │ ├── Berton_2023_EigenPlaces.txt │ ├── Berton_2024_meshvpr.txt │ ├── Berton_2025_MegaLoc.txt │ ├── Cai_2022_stun.txt │ ├── Cao_2020_delg.txt │ ├── CarlevarisBianco_2016_nclt.txt │ ├── Cepeda_2023_GeoCLIP.txt │ ├── Chen_2011_san_francisco_landmark.txt │ ├── Cheng_2020_Unifying.txt │ ├── Clark_2023_GeoGuessNet.txt │ ├── Cummins_2009_eynsham.txt │ ├── Doan_2019_scalable.txt │ ├── Dusmanu_2019_D2Net.txt │ ├── Fu_2023_matc_net.txt │ ├── Fuwen_2021_reranking_transformers.txt │ ├── Garg_2019_semanticGeometric.txt │ ├── Garg_2021_SeqNet.txt │ ├── Garg_2021_Survey.txt │ ├── Garg_2024_revisitanything.txt │ ├── Gastruc_2024_OSV5M.txt │ ├── Ge_2020_sfrs.txt │ ├── Geiger_2013_kitti.txt │ ├── Gronat_2013_cvpr_pitts.txt │ ├── Haas_2024_pigeon.txt │ ├── Hausler_2019_fusion.txt │ ├── Hausler_2021_patch_netvlad.txt │ ├── Hausler_2025_PairVPR.txt │ ├── Hays_2008_im2gps.txt │ ├── Ibrahimi_2021_insideout_vpr.txt │ ├── Izquierdo_2024_cliquemining.txt │ ├── Izquierdo_2024_salad.txt │ ├── Keetha_2023_AnyLoc.txt │ ├── Khaliq_2024_vladbuff.txt │ ├── Kim_2017_crn.txt │ ├── Knopp_2010_geotagged_streetview.txt │ ├── Kordopatis_2021_EfficientNetGeoloc.txt │ ├── Lee_2022_cvnet.txt │ ├── Leyvavallina_2021_gcl.txt │ ├── Liu_2019_sare.txt │ ├── Liu_2021_densernet.txt │ ├── Lowry_2016_Survey.txt │ ├── Lu_2024_CricaVPR.txt │ ├── Lu_2024_SelaVPR.txt │ ├── Lu_2025_SelaVPRpp.txt │ ├── Maddern_2017_robotCar.txt │ ├── Masone_2021_survey.txt │ ├── Mereu_2022_SeqVLAD.txt │ ├── Milford_2008_st_lucia.txt │ ├── Milford_2012_SeqSLAM.txt │ ├── Muller_2018_hierarchical_geolocation.txt │ ├── Noh_2017_delf.txt │ ├── Paolicelli_2022_Semantic_VPR.txt │ ├── Peng_2021_appsvr.txt │ ├── Peng_2021_sralNet.txt │ ├── Pion_2020_benchmark_VisLoc.txt │ ├── Pramanick_2022_transformer_geoloc.txt │ ├── Revaud_2019_r2d2.txt │ ├── Ros_2016_synthia.txt │ ├── Sarlin_2020_superglue.txt │ ├── Sattler_2012_aachen.txt │ ├── Sattler_2018_aachen_daynight.txt │ ├── Schubert_2021_graph_optimization.txt │ ├── Schubert_2023_vpr_tutorial.txt │ ├── Seo_2018_CPlaNet.txt │ ├── Sferrazza_2025_match.txt │ ├── Sun_2021_loftr.txt │ ├── Sunderhauf_2013_nordland.txt │ ├── Theiner_2022_WACV.txt │ ├── Torii_2015_pitts250k.txt │ ├── Torii_2018_tokyo247.txt │ ├── Torii_2021_large_scale3D.txt │ ├── Trivigno_2023_DivideClassify.txt │ ├── Vo_2017_revIm2GPS.txt │ ├── Vysotska_2016_sequences_matching.txt │ ├── Wang_2019_ageAgnosticVPR.txt │ ├── Wang_2022_TransVPR.txt │ ├── Warburg_2020_msls.txt │ ├── Warburg_2021_bayesian.txt │ ├── Weyand_2016_PlaNet.txt │ ├── Weyand_2020_gldv2.txt │ ├── Yildiz_2022_AmsterTime.txt │ ├── Yin_2025_GeneralPR.txt │ ├── Zaffar_2021_vprbench.txt │ ├── Zaffar_2024_estimation.txt │ ├── Zamir_2014_102k_streetview.txt │ ├── Zhang_2021_Survey.txt │ ├── Zhang_2021_gated_netvlad.txt │ ├── Zhang_2023_SpatioTemporalAttention.txt │ ├── Zhu_2018_apanet.txt │ └── Zhu_2023_R2Former.txt └── how-to-PR.md
Condensed preview — 104 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (75K chars).
[
{
"path": ".gitignore",
"chars": 28,
"preview": ".spyproject\n.idea\n.DS_Store\n"
},
{
"path": "LICENSE",
"chars": 1072,
"preview": "MIT License\n\nCopyright (c) 2023 Gabriele Berton\n\nPermission is hereby granted, free of charge, to any person obtaining a"
},
{
"path": "README.md",
"chars": 27986,
"preview": "# Awesome Visual Place Recognition [. The extraction includes 104 files (66.0 KB), approximately 22.9k tokens. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.
Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.