HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Hao Huang
  • Maria Esther Vidal

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationWSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
Pages1104-1105
Number of pages2
ISBN (electronic)9798400713293
Publication statusPublished - 10 Mar 2025
Event18th ACM International Conference on Web Search and Data Mining, WSDM 2025 - Hannover, Germany
Duration: 10 Mar 202514 Mar 2025

Abstract

Predictive models are gaining attention as powerful tools for aiding clinicians in diagnosis, prognosis, and treatment recommendations. However, their reliance on associative patterns may raise concerns about reliability of decision support, as association does not necessarily imply causation. To address this limit, we propose HyKG-CF, a hybrid approach to counterfactual prediction that leverages data and domain knowledge encoded in knowledge graph (KG). HyKG-CF integrates symbolic reasoning (on knowledge) with numerical learning (on data) using large language models (LLMs) and statistical models to learn causal Bayesian networks (CBNs) for accurate counterfactual prediction. Using data and knowledge, HyKG-CF improves the accuracy of causal discovery and counterfactual prediction. We evaluate HyKG-CF on a non-small cell lung cancer (NSCLC) KG, demonstrating that it outperforms other baselines. The results highlight the promise of combining domain knowledge with causal models to improve counterfactual prediction.

Keywords

    causality, counterfactual prediction, knowledge graphs

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge. / Huang, Hao; Vidal, Maria Esther.
WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. p. 1104-1105.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Huang, H & Vidal, ME 2025, HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge. in WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. pp. 1104-1105, 18th ACM International Conference on Web Search and Data Mining, WSDM 2025, Hannover, Lower Saxony, Germany, 10 Mar 2025. https://doi.org/10.1145/3701551.3708813
Huang, H., & Vidal, M. E. (2025). HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge. In WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining (pp. 1104-1105) https://doi.org/10.1145/3701551.3708813
Huang H, Vidal ME. HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge. In WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. p. 1104-1105 doi: 10.1145/3701551.3708813
Huang, Hao ; Vidal, Maria Esther. / HyKG-CF : A Hybrid Approach for Counterfactual Prediction using Domain Knowledge. WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. pp. 1104-1105
Download
@inproceedings{b6c9023273b541e6a9cc9d1a16795bbc,
title = "HyKG-CF: A Hybrid Approach for Counterfactual Prediction using Domain Knowledge",
abstract = "Predictive models are gaining attention as powerful tools for aiding clinicians in diagnosis, prognosis, and treatment recommendations. However, their reliance on associative patterns may raise concerns about reliability of decision support, as association does not necessarily imply causation. To address this limit, we propose HyKG-CF, a hybrid approach to counterfactual prediction that leverages data and domain knowledge encoded in knowledge graph (KG). HyKG-CF integrates symbolic reasoning (on knowledge) with numerical learning (on data) using large language models (LLMs) and statistical models to learn causal Bayesian networks (CBNs) for accurate counterfactual prediction. Using data and knowledge, HyKG-CF improves the accuracy of causal discovery and counterfactual prediction. We evaluate HyKG-CF on a non-small cell lung cancer (NSCLC) KG, demonstrating that it outperforms other baselines. The results highlight the promise of combining domain knowledge with causal models to improve counterfactual prediction.",
keywords = "causality, counterfactual prediction, knowledge graphs",
author = "Hao Huang and Vidal, {Maria Esther}",
note = "Publisher Copyright: {\textcopyright} 2025 Copyright held by the owner/author(s).; 18th ACM International Conference on Web Search and Data Mining, WSDM 2025, WSDM 2025 ; Conference date: 10-03-2025 Through 14-03-2025",
year = "2025",
month = mar,
day = "10",
doi = "10.1145/3701551.3708813",
language = "English",
pages = "1104--1105",
booktitle = "WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining",

}

Download

TY - GEN

T1 - HyKG-CF

T2 - 18th ACM International Conference on Web Search and Data Mining, WSDM 2025

AU - Huang, Hao

AU - Vidal, Maria Esther

N1 - Publisher Copyright: © 2025 Copyright held by the owner/author(s).

PY - 2025/3/10

Y1 - 2025/3/10

N2 - Predictive models are gaining attention as powerful tools for aiding clinicians in diagnosis, prognosis, and treatment recommendations. However, their reliance on associative patterns may raise concerns about reliability of decision support, as association does not necessarily imply causation. To address this limit, we propose HyKG-CF, a hybrid approach to counterfactual prediction that leverages data and domain knowledge encoded in knowledge graph (KG). HyKG-CF integrates symbolic reasoning (on knowledge) with numerical learning (on data) using large language models (LLMs) and statistical models to learn causal Bayesian networks (CBNs) for accurate counterfactual prediction. Using data and knowledge, HyKG-CF improves the accuracy of causal discovery and counterfactual prediction. We evaluate HyKG-CF on a non-small cell lung cancer (NSCLC) KG, demonstrating that it outperforms other baselines. The results highlight the promise of combining domain knowledge with causal models to improve counterfactual prediction.

AB - Predictive models are gaining attention as powerful tools for aiding clinicians in diagnosis, prognosis, and treatment recommendations. However, their reliance on associative patterns may raise concerns about reliability of decision support, as association does not necessarily imply causation. To address this limit, we propose HyKG-CF, a hybrid approach to counterfactual prediction that leverages data and domain knowledge encoded in knowledge graph (KG). HyKG-CF integrates symbolic reasoning (on knowledge) with numerical learning (on data) using large language models (LLMs) and statistical models to learn causal Bayesian networks (CBNs) for accurate counterfactual prediction. Using data and knowledge, HyKG-CF improves the accuracy of causal discovery and counterfactual prediction. We evaluate HyKG-CF on a non-small cell lung cancer (NSCLC) KG, demonstrating that it outperforms other baselines. The results highlight the promise of combining domain knowledge with causal models to improve counterfactual prediction.

KW - causality

KW - counterfactual prediction

KW - knowledge graphs

U2 - 10.1145/3701551.3708813

DO - 10.1145/3701551.3708813

M3 - Conference contribution

AN - SCOPUS:105001669150

SP - 1104

EP - 1105

BT - WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining

Y2 - 10 March 2025 through 14 March 2025

ER -