TY - GEN
T1 - Extracting interrogative intents and concepts from geo-analytic questions
AU - Xu, H.
AU - Hamzei, Ehsan
AU - Nyamsuren, E.
AU - Winter, Stephan
AU - Tomko, Martin
AU - Scheider, S.
PY - 2020
Y1 - 2020
N2 - Understanding the syntactic and semantic structure of geographic questions is a necessary step towards true geographic question-answering (GeoQA) machines. The empirical basis for the understanding of the capabilities expected from GeoQA systems are geographic question corpora. Available corpora have been mostly drawn from generic Web search logs or limited user studies, supporting the focus of GeoQA systems on retrieving factoids: factual knowledge about particular places and everyday processes. Yet, the majority of questions enquired about in the spatial sciences go beyond simple place facts, with more complex analytical intents informing the questions. In this paper, we introduce a new corpus of geo-analytic questions drawn from textbooks and scientific articles. We analyse and compare this corpus with two general-purpose GeoQA corpora in terms of grammatical complexity and semantic concepts, using a new parsing method that allows us to differentiate and quantify patterns of a question's intent.
AB - Understanding the syntactic and semantic structure of geographic questions is a necessary step towards true geographic question-answering (GeoQA) machines. The empirical basis for the understanding of the capabilities expected from GeoQA systems are geographic question corpora. Available corpora have been mostly drawn from generic Web search logs or limited user studies, supporting the focus of GeoQA systems on retrieving factoids: factual knowledge about particular places and everyday processes. Yet, the majority of questions enquired about in the spatial sciences go beyond simple place facts, with more complex analytical intents informing the questions. In this paper, we introduce a new corpus of geo-analytic questions drawn from textbooks and scientific articles. We analyse and compare this corpus with two general-purpose GeoQA corpora in terms of grammatical complexity and semantic concepts, using a new parsing method that allows us to differentiate and quantify patterns of a question's intent.
KW - Geo-analytic questions
KW - Geographic questions
KW - Information extraction
KW - Grammatical parser
KW - Concepts and intents
KW - Geographic question-answering systems
U2 - 10.5194/agile-giss-1-23-2020
DO - 10.5194/agile-giss-1-23-2020
M3 - Conference contribution
T3 - Lecture Notes in Geoinformation and Cartography
BT - Proceedings of the 23rd AGILE conference on Geographic Information Science
PB - Springer
ER -