Knowledge-Driven Data Ecosystems Toward Data Transparency

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Sandra Geisler
  • Maria Esther Vidal
  • Cinzia Cappiello
  • Bernadette Farias Lóscio
  • Avigdor Gal
  • Matthias Jarke
  • Maurizio Lenzerini
  • Paolo Missier
  • Boris Otto
  • Elda Paja
  • Barbara Pernici
  • Jakob Rehof

External Research Organisations

  • RWTH Aachen University
  • Fraunhofer Institute for Applied Information Technology (FIT)
  • German National Library of Science and Technology (TIB)
  • Politecnico di Milano
  • Universidade Federal de Pernambuco
  • Technion-Israel Institute of Technology
  • Sapienza Università di Roma
  • Newcastle University
  • TU Dortmund University
  • IT University of Copenhagen
View graph of relations

Details

Original languageEnglish
Article number3
Pages (from-to)1-12
Number of pages12
JournalJournal of Data and Information Quality
Volume14
Issue number1
Early online date23 Dec 2021
Publication statusPublished - Mar 2022
Externally publishedYes

Abstract

A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.

Keywords

    data ecosystems, data quality, Data transparency, trustability

ASJC Scopus subject areas

Cite this

Knowledge-Driven Data Ecosystems Toward Data Transparency. / Geisler, Sandra; Vidal, Maria Esther; Cappiello, Cinzia et al.
In: Journal of Data and Information Quality, Vol. 14, No. 1, 3, 03.2022, p. 1-12.

Research output: Contribution to journalArticleResearchpeer review

Geisler, S, Vidal, ME, Cappiello, C, Lóscio, BF, Gal, A, Jarke, M, Lenzerini, M, Missier, P, Otto, B, Paja, E, Pernici, B & Rehof, J 2022, 'Knowledge-Driven Data Ecosystems Toward Data Transparency', Journal of Data and Information Quality, vol. 14, no. 1, 3, pp. 1-12. https://doi.org/10.1145/3467022
Geisler, S., Vidal, M. E., Cappiello, C., Lóscio, B. F., Gal, A., Jarke, M., Lenzerini, M., Missier, P., Otto, B., Paja, E., Pernici, B., & Rehof, J. (2022). Knowledge-Driven Data Ecosystems Toward Data Transparency. Journal of Data and Information Quality, 14(1), 1-12. Article 3. https://doi.org/10.1145/3467022
Geisler S, Vidal ME, Cappiello C, Lóscio BF, Gal A, Jarke M et al. Knowledge-Driven Data Ecosystems Toward Data Transparency. Journal of Data and Information Quality. 2022 Mar;14(1):1-12. 3. Epub 2021 Dec 23. doi: 10.1145/3467022
Geisler, Sandra ; Vidal, Maria Esther ; Cappiello, Cinzia et al. / Knowledge-Driven Data Ecosystems Toward Data Transparency. In: Journal of Data and Information Quality. 2022 ; Vol. 14, No. 1. pp. 1-12.
Download
@article{e43a2c2d6a0344ceb90cce21cfbe1c8f,
title = "Knowledge-Driven Data Ecosystems Toward Data Transparency",
abstract = "A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.",
keywords = "data ecosystems, data quality, Data transparency, trustability",
author = "Sandra Geisler and Vidal, {Maria Esther} and Cinzia Cappiello and L{\'o}scio, {Bernadette Farias} and Avigdor Gal and Matthias Jarke and Maurizio Lenzerini and Paolo Missier and Boris Otto and Elda Paja and Barbara Pernici and Jakob Rehof",
note = "Funding Information: A. Gal was supported by the Benjamin and Florence Free Chair M. Lenzerini was supported by the MUR-PRIN project “HOPE” (grant 2017MMJJRE) and the EU under the H2020-EU.2.1.1 project TAILOR (grant 952215). M.-E. Vidal was supported by the EU H2020 project iASiS (grant 727658) and CLARIFY (grant 875160). S. Geisler was supported by the German Innovation Fund project SALUS (grant 01NVF18002). This work has also supported by the German Federal Ministry of Education and Research (BMBF) in the context of the InDaSpacePlus project (grant 01IS17031), Fraunhofer Cluster of Excellence “Cognitive Internet Technologies” (CCIT), and by the Deutsche Forschungsgemeinschaft (DFG) under Germany{\textquoteright}s Excellence Strategy - EXC-2023 Internet of Production - 390621612. Pernici acknowledges the support of the EU H2020 Crowd4SDG project, grant id 872944. Authors{\textquoteright} addresses: S. Geisler, Fraunhofer FIT, Germany, Schloss Birlinghoven, Sankt Augustin, 53757 and RWTH Aachen University, Germany, Ahornstrasse 55, Aachen, 52056 Schloss Birlinghoven, Sankt Augustin, 53757; email: geisler@cs.rwth-aachen.de; M.-E. Vidal, TIB-Leibniz Information Centre for Science and Technology, Gerrmany, Welfengarten 1B, Hannover, 30167; email: maria.vidal@tib.eu; C. Cappiello, Politecnico di Milano, Italy, piazza Leonardo da Vinci 32, Milano, 20133; email: cinzia.cappiello@polimi.it; B. F. L{\'o}scio, Federal University of Pernambuco, Brazil, Cidade Universitaria, Recife/PE, 50740-560; email: bfl@cin.ufpe.br; A. Gal, Technion Israel Institute of Technology, Israel, Technion City, Haifa, 32000; email: avigal@ie.technion.ac.il; M. Jarke, RWTH Aachen University and Fraunhofer FIT, Germany, Ahornstrasse 55, Aachen, 52056; email: jarke@dbis.rwth-aachen.de; M. Lenzerini, Sapienza Universit{\`a} di Roma, Italy, via Ariosto 25, Roma, I-00185; email: lenzerini@diag.uniroma1.it; P. Missier, Newcastle University, United Kingdom, Firebrick Avenue, Newcastle upon Tyne, NE4 5TG; email: paolo.missier@ncl.ac.uk; B. Otto and J. Rehof, TU Dortmund University, Germany, Otto-Hahn-Str. 12, Dortmund, 44227, Fraunhofer ISST, Germany, Emil-Figge-Stra{\ss}e 91, Dortmund, 44227; emails: {boris.otto, jakob.rehof}@cs.tu-dortmund.de; E. Paja, IT University of Copenhagen, Denmark, Rued Langgaards Vej 7, Copenhagen S, DK-2300; email: elpa@itu.dk; B. Pernici, Politecnico di Milano, Italy, piazza Leonardo da Vinci 32, Milano, 20133; email: barbara.pernici@polimi.it. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. {\textcopyright} 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM. 1936-1955/2021/12-ART3 $15.00 https://doi.org/10.1145/3467022 ",
year = "2022",
month = mar,
doi = "10.1145/3467022",
language = "English",
volume = "14",
pages = "1--12",
number = "1",

}

Download

TY - JOUR

T1 - Knowledge-Driven Data Ecosystems Toward Data Transparency

AU - Geisler, Sandra

AU - Vidal, Maria Esther

AU - Cappiello, Cinzia

AU - Lóscio, Bernadette Farias

AU - Gal, Avigdor

AU - Jarke, Matthias

AU - Lenzerini, Maurizio

AU - Missier, Paolo

AU - Otto, Boris

AU - Paja, Elda

AU - Pernici, Barbara

AU - Rehof, Jakob

N1 - Funding Information: A. Gal was supported by the Benjamin and Florence Free Chair M. Lenzerini was supported by the MUR-PRIN project “HOPE” (grant 2017MMJJRE) and the EU under the H2020-EU.2.1.1 project TAILOR (grant 952215). M.-E. Vidal was supported by the EU H2020 project iASiS (grant 727658) and CLARIFY (grant 875160). S. Geisler was supported by the German Innovation Fund project SALUS (grant 01NVF18002). This work has also supported by the German Federal Ministry of Education and Research (BMBF) in the context of the InDaSpacePlus project (grant 01IS17031), Fraunhofer Cluster of Excellence “Cognitive Internet Technologies” (CCIT), and by the Deutsche Forschungsgemeinschaft (DFG) under Germany’s Excellence Strategy - EXC-2023 Internet of Production - 390621612. Pernici acknowledges the support of the EU H2020 Crowd4SDG project, grant id 872944. Authors’ addresses: S. Geisler, Fraunhofer FIT, Germany, Schloss Birlinghoven, Sankt Augustin, 53757 and RWTH Aachen University, Germany, Ahornstrasse 55, Aachen, 52056 Schloss Birlinghoven, Sankt Augustin, 53757; email: geisler@cs.rwth-aachen.de; M.-E. Vidal, TIB-Leibniz Information Centre for Science and Technology, Gerrmany, Welfengarten 1B, Hannover, 30167; email: maria.vidal@tib.eu; C. Cappiello, Politecnico di Milano, Italy, piazza Leonardo da Vinci 32, Milano, 20133; email: cinzia.cappiello@polimi.it; B. F. Lóscio, Federal University of Pernambuco, Brazil, Cidade Universitaria, Recife/PE, 50740-560; email: bfl@cin.ufpe.br; A. Gal, Technion Israel Institute of Technology, Israel, Technion City, Haifa, 32000; email: avigal@ie.technion.ac.il; M. Jarke, RWTH Aachen University and Fraunhofer FIT, Germany, Ahornstrasse 55, Aachen, 52056; email: jarke@dbis.rwth-aachen.de; M. Lenzerini, Sapienza Università di Roma, Italy, via Ariosto 25, Roma, I-00185; email: lenzerini@diag.uniroma1.it; P. Missier, Newcastle University, United Kingdom, Firebrick Avenue, Newcastle upon Tyne, NE4 5TG; email: paolo.missier@ncl.ac.uk; B. Otto and J. Rehof, TU Dortmund University, Germany, Otto-Hahn-Str. 12, Dortmund, 44227, Fraunhofer ISST, Germany, Emil-Figge-Straße 91, Dortmund, 44227; emails: {boris.otto, jakob.rehof}@cs.tu-dortmund.de; E. Paja, IT University of Copenhagen, Denmark, Rued Langgaards Vej 7, Copenhagen S, DK-2300; email: elpa@itu.dk; B. Pernici, Politecnico di Milano, Italy, piazza Leonardo da Vinci 32, Milano, 20133; email: barbara.pernici@polimi.it. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. © 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM. 1936-1955/2021/12-ART3 $15.00 https://doi.org/10.1145/3467022

PY - 2022/3

Y1 - 2022/3

N2 - A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.

AB - A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.

KW - data ecosystems

KW - data quality

KW - Data transparency

KW - trustability

UR - http://www.scopus.com/inward/record.url?scp=85124698785&partnerID=8YFLogxK

U2 - 10.1145/3467022

DO - 10.1145/3467022

M3 - Article

AN - SCOPUS:85124698785

VL - 14

SP - 1

EP - 12

JO - Journal of Data and Information Quality

JF - Journal of Data and Information Quality

SN - 1936-1955

IS - 1

M1 - 3

ER -