Large Multi-Modal Model Cartographic Map Comprehension for Textual Locality Georeferencing

OpenAI OpenAI
2025 Leibniz-Zentrum für Informatik (Schloss Dagstuhl) 2,084 citations

Abstract

Millions of biological sample records collected in the last few centuries archived in natural history collections are un-georeferenced. Georeferencing complex locality descriptions associated with these collection samples is a highly labour-intensive task collection agencies struggle with. None of the existing automated methods exploit maps that are an essential tool for georeferencing complex relations. We present preliminary experiments and results of a novel method that exploits multi-modal capabilities of recent Large Multi-Modal Models (LMM). This method enables the model to visually contextualize spatial relations it reads in the locality description. We use a grid-based approach to adapt these auto-regressive models for this task in a zero-shot setting. Our experiments conducted on a small manually annotated dataset show impressive results for our approach (∼1 km Average distance error) compared to uni-modal georeferencing with Large Language Models and existing georeferencing tools. The paper also discusses the findings of the experiments in light of an LMM’s ability to comprehend fine-grained maps. Motivated by these results, a practical framework is proposed to integrate this method into a georeferencing workflow.

Keywords

TransformerComputer scienceSecurity tokenProcess (computing)Scale (ratio)Artificial intelligenceMachine learningCartographyEngineeringElectrical engineeringOperating system

Related Publications

Hypergraph Neural Networks

In this paper, we present a hypergraph neural networks (HGNN) framework for data representation learning, which can encode high-order data correlation in a hypergraph structure....

2019 Proceedings of the AAAI Conference on... 1418 citations

Publication Info

Year
2025
Type
preprint
Citations
2084
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2084
OpenAlex

Cite This

OpenAI (2025). Large Multi-Modal Model Cartographic Map Comprehension for Textual Locality Georeferencing. Leibniz-Zentrum für Informatik (Schloss Dagstuhl) . https://doi.org/10.4230/lipics.giscience.2025.12

Identifiers

DOI
10.4230/lipics.giscience.2025.12

Data Quality

Data completeness: 77%