Annotation uncertainty in the context of grammatical change

Research output: Contribution to journal › Article › Research › peer review

Authors

  • Marie Luis Merten
  • Marcel Wever
  • Michaela Geierhos
  • Doris Tophinke
  • Eyke Hüllermeier

External Research Organisations

  • Universität Zürich (UZH)
  • Universität der Bundeswehr München
  • Paderborn University
  • Ludwig-Maximilians-Universität München (LMU)

Details

Original language: English
Pages (from-to): 430-459
Number of pages: 30
Journal: International Journal of Corpus Linguistics
Volume: 28
Issue number: 3
Publication status: Published - 19 Jul 2023
Externally published: Yes

Abstract

This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

Keywords

    annotation, fuzziness, grammatical change, uncertainty

Cite this

Annotation uncertainty in the context of grammatical change. / Merten, Marie Luis; Wever, Marcel; Geierhos, Michaela et al.
In: International Journal of Corpus Linguistics, Vol. 28, No. 3, 19.07.2023, p. 430-459.

Merten ML, Wever M, Geierhos M, Tophinke D, Hüllermeier E. Annotation uncertainty in the context of grammatical change. International Journal of Corpus Linguistics. 2023 Jul 19;28(3):430-459. doi: 10.1075/ijcl.20113.mer
Merten, Marie Luis ; Wever, Marcel ; Geierhos, Michaela et al. / Annotation uncertainty in the context of grammatical change. In: International Journal of Corpus Linguistics. 2023 ; Vol. 28, No. 3. pp. 430-459.
@article{35cf913042ec4117aa6ec473f1e7450c,
title = "Annotation uncertainty in the context of grammatical change",
abstract = "This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.",
keywords = "annotation, fuzziness, grammatical change, uncertainty",
author = "Merten, {Marie Luis} and Marcel Wever and Michaela Geierhos and Doris Tophinke and Eyke H{\"u}llermeier",
note = "Publisher Copyright: {\textcopyright} 2023 John Benjamins Publishing Company.",
year = "2023",
month = jul,
day = "19",
doi = "10.1075/ijcl.20113.mer",
language = "English",
volume = "28",
pages = "430--459",
journal = "International Journal of Corpus Linguistics",
issn = "1384-6655",
publisher = "John Benjamins Publishing Company",
number = "3",

}

TY - JOUR

T1 - Annotation uncertainty in the context of grammatical change

AU - Merten, Marie Luis

AU - Wever, Marcel

AU - Geierhos, Michaela

AU - Tophinke, Doris

AU - Hüllermeier, Eyke

N1 - Publisher Copyright: © 2023 John Benjamins Publishing Company.

PY - 2023/7/19

Y1 - 2023/7/19

N2 - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

AB - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

KW - annotation

KW - fuzziness

KW - grammatical change

KW - uncertainty

UR - http://www.scopus.com/inward/record.url?scp=85168533774&partnerID=8YFLogxK

U2 - 10.1075/ijcl.20113.mer

DO - 10.1075/ijcl.20113.mer

M3 - Article

AN - SCOPUS:85168533774

VL - 28

SP - 430

EP - 459

JO - International Journal of Corpus Linguistics

JF - International Journal of Corpus Linguistics

SN - 1384-6655

IS - 3

ER -
