GNSS-free global localization in urban environments

Ma, Weixin

Full metadata record

DC Field	Value	Language
dc.contributor	Department of Mechanical Engineering	en_US
dc.contributor.advisor	Su, Zhongqing (ME)	en_US
dc.creator	Ma, Weixin	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/13745	-
dc.language	English	en_US
dc.publisher	Hong Kong Polytechnic University	en_US
dc.rights	All rights reserved	en_US
dc.title	GNSS-free global localization in urban environments	en_US
dcterms.abstract	Global localization provides robots with their current positions relative to the environment, enabling them to perform more complex downstream tasks such as path planning and navigation. One of the most widely adopted solutions, the Global Navigation Satellite System (GNSS), has demonstrated significant advantages over the past decades. Its robustness and accuracy may be compromised by weak signals, multipath effects, or signal obstruction, especially in densely built urban areas with tall buildings. We therefore identify a need to advance techniques on global localization in GNSS-denied environments. The goal of this thesis is to develop real-time solutions with high memory efficiency for GNSS-free global localization in urban environments, with a focus on autonomous vehicles.	en_US
dcterms.abstract	Our first contribution focuses on global localization in high-rise environments using a publicly available map, i.e., OpenStreetMap. We observe that the building roof outline captured by a sky-looking fish-eye camera shares similarities with the building outline featured in the OpenStreetMap. Based on this observation, we propose a descriptor that incorporates both topological (i.e., junction types) and geometric (i.e., building outline) information to bridge fish-eye camera images with OpenStreetMap data. To handle the challenge of similar or repeated building outlines across different images, we formulate our method as a Bayesian Filtering problem using Monte Carlo localization, which leverages multiple consecutive fish-eye images for robust localization. The sky-looking fish-eye camera also naturally avoids disturbances from dynamic objects such as vehicles and pedestrians, making it particularly suitable for urban environments.	en_US
dcterms.abstract	While our first solution enables real-time localization in high-rise environments without the need for manually collecting and maintaining reference maps, its long-term reliability can be impacted by the quality of captured images, which are sensitive to changes in illumination, weather, and seasons. In contrast, range sensors such as LiDAR and radar are more resilient to these environmental factors. To investigate the influence of these factors on range sensing based global localization methods, we conduct a comprehensive evaluation of current techniques in long-term scenarios with significant seasonal variations and adverse weather conditions. In addition, we design a novel metric to evaluate the influence of matching thresholds on place recognition performance for long term localization.	en_US
dcterms.abstract	Based on our previous evaluation study, LiDAR-based place recognition shows good robustness under long-term conditions. However, it typically only identifies whether the current place has been visited before. Determining the vehicle's pose relative to the environment requires additional point cloud or feature point based registration, which might demand significant memory to store these data. Our third contribution addresses this challenge by integrating pose estimation with place recognition to improve LiDAR-based global localization performance. Rather than relying on abstract environmental representations like 3D points, we use lightweight semantic features (e.g., traffic signs, trees, poles, and so on) to represent both on-vehicle LiDAR scans and reference scans in the databases, significantly reducing memory requirements. We propose a novel semantic histogram descriptor to represent each semantic instance, which is used for instance association in pose estimation and aggregated into a global descriptor for place recognition.	en_US
dcterms.abstract	While our previous contribution show competitive localization accuracy and memory efficiency, the scan-to-scan framework still redundantly stores some of semantic instances multiple times across keyframes. Meanwhile, its RANSAC-based pose estimation might fail when outliers dominate. Our fourth contribution shifts to a scan-to-map localization manner, where each semantic instance is stored only once in the reference map, further improving the memory efficiency. We improve the semantic histogram descriptor in our previous work to achieve more robust and effective instance-to-instance correspondences. In addition, we propose a novel Road Surface Normal (RSN) map to provide a prior rotational constraint, enhancing pose estimation. We then apply graph-theoretic outlier pruning to extract inlier correspondences for robust 6-DoF pose estimation.	en_US
dcterms.abstract	Finally, we develop a multi-robot localization system, building upon our previous scan-to-map localization approach. Unlike single-robot localization, multi-robot systems typically lack a prior map, and the initial relative poses between local reference frames of robots are unknown. To this end, we formulate the multi-robot localization as an optimization problem and incorporate one-shot registration-based localization into the optimization process. Specifically, lightweight semantic instances from different robots are transmitted to a central server, which constructs instance maps for inter-robot localization. We propose a dual-metric validation strategy to confirm the validity of pairwise localization results from our previous scan-to-map localization solution to reduce the risk of involving incorrect localization results. A pose averaging based optimization method is used to obtain reliable alignment estimations between local reference frames of different robots. A shortest transformation chain searching method is used to align all robots into a shared reference frame. Our preliminary results demonstrate the feasibility and effectiveness of the proposed system, as well as its promising potential in communication bandwidth-limited conditions.	en_US
dcterms.extent	xxi, 165 pages : color illustrations	en_US
dcterms.isPartOf	PolyU Electronic Theses	en_US
dcterms.issued	2025	en_US
dcterms.educationalLevel	Ph.D.	en_US
dcterms.educationalLevel	All Doctorate	en_US
dcterms.LCSH	Location-based services	en_US
dcterms.LCSH	Automated vehicles	en_US
dcterms.LCSH	Remote sensing	en_US
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	en_US
dcterms.accessRights	open access	en_US

Files in This Item:

File	Description	Size	Format
8181.pdf	For All Users	4.32 MB	Adobe PDF	View/Open

Copyright Undertaking

As a bona fide Library user, I declare that:

I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/13745