Clustering Semantic Predicates in the Open Research Knowledge Graph

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationFrom Born-Physical to Born-Virtual
Subtitle of host publicationAugmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings
EditorsYuen-Hsien Tseng, Marie Katsurai, Hoa N. Nguyen
PublisherSpringer Science and Business Media Deutschland GmbH
Pages477-484
Number of pages8
ISBN (electronic)978-3-031-21756-2
ISBN (print)9783031217555
Publication statusPublished - 2022
Externally publishedYes
Event24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022 - Hanoi, Viet Nam
Duration: 30 Nov 20222 Dec 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13636 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology divergence, while later phases of graph construction experience terminology convergence and reuse. In this paper, we describe our approach tailoring two AI-based clustering algorithms for recommending predicates (in RDF statements) about resources in the Open Research Knowledge Graph (ORKG) https://orkg.org/. Such a service to recommend existing predicates to semantify new incoming data of scholarly publications is of paramount importance for fostering terminology convergence in the ORKG. Our experiments show very promising results: a high precision with relatively high recall in linear runtime performance. Furthermore, this work offers novel insights into the predicate groups that automatically accrue loosely as generic semantification patterns for semantification of scholarly knowledge spanning 44 research fields.

Keywords

    Artificial intelligence, Clustering algorithms, Content-based recommender systems, Open research knowledge graph

ASJC Scopus subject areas

Cite this

Clustering Semantic Predicates in the Open Research Knowledge Graph. / Arab Oghli, Omar; D’Souza, Jennifer; Auer, Sören.
From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. ed. / Yuen-Hsien Tseng; Marie Katsurai; Hoa N. Nguyen. Springer Science and Business Media Deutschland GmbH, 2022. p. 477-484 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13636 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Arab Oghli, O, D’Souza, J & Auer, S 2022, Clustering Semantic Predicates in the Open Research Knowledge Graph. in Y-H Tseng, M Katsurai & HN Nguyen (eds), From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13636 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 477-484, 24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022, Hanoi, Viet Nam, 30 Nov 2022. https://doi.org/10.48550/arXiv.2210.02034, https://doi.org/10.1007/978-3-031-21756-2_39
Arab Oghli, O., D’Souza, J., & Auer, S. (2022). Clustering Semantic Predicates in the Open Research Knowledge Graph. In Y.-H. Tseng, M. Katsurai, & H. N. Nguyen (Eds.), From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings (pp. 477-484). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13636 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.48550/arXiv.2210.02034, https://doi.org/10.1007/978-3-031-21756-2_39
Arab Oghli O, D’Souza J, Auer S. Clustering Semantic Predicates in the Open Research Knowledge Graph. In Tseng YH, Katsurai M, Nguyen HN, editors, From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. Springer Science and Business Media Deutschland GmbH. 2022. p. 477-484. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2022 Dec 7. doi: 10.48550/arXiv.2210.02034, 10.1007/978-3-031-21756-2_39
Arab Oghli, Omar ; D’Souza, Jennifer ; Auer, Sören. / Clustering Semantic Predicates in the Open Research Knowledge Graph. From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. editor / Yuen-Hsien Tseng ; Marie Katsurai ; Hoa N. Nguyen. Springer Science and Business Media Deutschland GmbH, 2022. pp. 477-484 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{a9686f8ab6234bd7ae32af8d38212e4c,
title = "Clustering Semantic Predicates in the Open Research Knowledge Graph",
abstract = "When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology divergence, while later phases of graph construction experience terminology convergence and reuse. In this paper, we describe our approach tailoring two AI-based clustering algorithms for recommending predicates (in RDF statements) about resources in the Open Research Knowledge Graph (ORKG) https://orkg.org/. Such a service to recommend existing predicates to semantify new incoming data of scholarly publications is of paramount importance for fostering terminology convergence in the ORKG. Our experiments show very promising results: a high precision with relatively high recall in linear runtime performance. Furthermore, this work offers novel insights into the predicate groups that automatically accrue loosely as generic semantification patterns for semantification of scholarly knowledge spanning 44 research fields.",
keywords = "Artificial intelligence, Clustering algorithms, Content-based recommender systems, Open research knowledge graph",
author = "{Arab Oghli}, Omar and Jennifer D{\textquoteright}Souza and S{\"o}ren Auer",
note = "Funding Information: Supported by TIB Leibniz Information Centre for Science and Technology, the EU H2020 ERC project ScienceGRaph (GA ID: 819536).; 24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022 ; Conference date: 30-11-2022 Through 02-12-2022",
year = "2022",
doi = "10.48550/arXiv.2210.02034",
language = "English",
isbn = "9783031217555",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "477--484",
editor = "Yuen-Hsien Tseng and Marie Katsurai and Nguyen, {Hoa N.}",
booktitle = "From Born-Physical to Born-Virtual",
address = "Germany",

}

Download

TY - GEN

T1 - Clustering Semantic Predicates in the Open Research Knowledge Graph

AU - Arab Oghli, Omar

AU - D’Souza, Jennifer

AU - Auer, Sören

N1 - Funding Information: Supported by TIB Leibniz Information Centre for Science and Technology, the EU H2020 ERC project ScienceGRaph (GA ID: 819536).

PY - 2022

Y1 - 2022

N2 - When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology divergence, while later phases of graph construction experience terminology convergence and reuse. In this paper, we describe our approach tailoring two AI-based clustering algorithms for recommending predicates (in RDF statements) about resources in the Open Research Knowledge Graph (ORKG) https://orkg.org/. Such a service to recommend existing predicates to semantify new incoming data of scholarly publications is of paramount importance for fostering terminology convergence in the ORKG. Our experiments show very promising results: a high precision with relatively high recall in linear runtime performance. Furthermore, this work offers novel insights into the predicate groups that automatically accrue loosely as generic semantification patterns for semantification of scholarly knowledge spanning 44 research fields.

AB - When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology divergence, while later phases of graph construction experience terminology convergence and reuse. In this paper, we describe our approach tailoring two AI-based clustering algorithms for recommending predicates (in RDF statements) about resources in the Open Research Knowledge Graph (ORKG) https://orkg.org/. Such a service to recommend existing predicates to semantify new incoming data of scholarly publications is of paramount importance for fostering terminology convergence in the ORKG. Our experiments show very promising results: a high precision with relatively high recall in linear runtime performance. Furthermore, this work offers novel insights into the predicate groups that automatically accrue loosely as generic semantification patterns for semantification of scholarly knowledge spanning 44 research fields.

KW - Artificial intelligence

KW - Clustering algorithms

KW - Content-based recommender systems

KW - Open research knowledge graph

UR - http://www.scopus.com/inward/record.url?scp=85145007977&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2210.02034

DO - 10.48550/arXiv.2210.02034

M3 - Conference contribution

AN - SCOPUS:85145007977

SN - 9783031217555

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 477

EP - 484

BT - From Born-Physical to Born-Virtual

A2 - Tseng, Yuen-Hsien

A2 - Katsurai, Marie

A2 - Nguyen, Hoa N.

PB - Springer Science and Business Media Deutschland GmbH

T2 - 24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022

Y2 - 30 November 2022 through 2 December 2022

ER -

By the same author(s)