Explain and Predict, and then Predict Again

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Zijian Zhang
  • Koustav Rudra
  • Avishek Anand

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationWSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
Pages418-426
Number of pages9
ISBN (electronic)9781450382977
Publication statusPublished - 8 Mar 2021
Event14th ACM International Conference on Web Search and Data Mining, WSDM 2021 - online, Virtual, Online, Israel
Duration: 8 Mar 202112 Mar 2021

Abstract

A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.

Keywords

    explanation, interpretable by design, learning with rationales, machine learning interpretation, multitask learning

ASJC Scopus subject areas

Cite this

Explain and Predict, and then Predict Again. / Zhang, Zijian; Rudra, Koustav; Anand, Avishek.
WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 418-426.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Zhang, Z, Rudra, K & Anand, A 2021, Explain and Predict, and then Predict Again. in WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. pp. 418-426, 14th ACM International Conference on Web Search and Data Mining, WSDM 2021, Virtual, Online, Israel, 8 Mar 2021. https://doi.org/10.1145/3437963.3441758
Zhang, Z., Rudra, K., & Anand, A. (2021). Explain and Predict, and then Predict Again. In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining (pp. 418-426) https://doi.org/10.1145/3437963.3441758
Zhang Z, Rudra K, Anand A. Explain and Predict, and then Predict Again. In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 418-426 doi: 10.1145/3437963.3441758
Zhang, Zijian ; Rudra, Koustav ; Anand, Avishek. / Explain and Predict, and then Predict Again. WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. pp. 418-426
Download
@inproceedings{2250670cb9aa4997bb9925f651b2eebb,
title = "Explain and Predict, and then Predict Again",
abstract = "A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.",
keywords = "explanation, interpretable by design, learning with rationales, machine learning interpretation, multitask learning",
author = "Zijian Zhang and Koustav Rudra and Avishek Anand",
note = "Funding Information: Funding for this project was in part provided by the European Union's Horizon 2020 research and innovation program under grant agreement No 832921, and No 871042.; 14th ACM International Conference on Web Search and Data Mining, WSDM 2021 ; Conference date: 08-03-2021 Through 12-03-2021",
year = "2021",
month = mar,
day = "8",
doi = "10.1145/3437963.3441758",
language = "English",
pages = "418--426",
booktitle = "WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining",

}

Download

TY - GEN

T1 - Explain and Predict, and then Predict Again

AU - Zhang, Zijian

AU - Rudra, Koustav

AU - Anand, Avishek

N1 - Funding Information: Funding for this project was in part provided by the European Union's Horizon 2020 research and innovation program under grant agreement No 832921, and No 871042.

PY - 2021/3/8

Y1 - 2021/3/8

N2 - A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.

AB - A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, which uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. Next, we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets - sentiment classification, fact-checking, and question answering - and find that we substantially outperform existing approaches.

KW - explanation

KW - interpretable by design

KW - learning with rationales

KW - machine learning interpretation

KW - multitask learning

UR - http://www.scopus.com/inward/record.url?scp=85103007189&partnerID=8YFLogxK

U2 - 10.1145/3437963.3441758

DO - 10.1145/3437963.3441758

M3 - Conference contribution

AN - SCOPUS:85103007189

SP - 418

EP - 426

BT - WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining

T2 - 14th ACM International Conference on Web Search and Data Mining, WSDM 2021

Y2 - 8 March 2021 through 12 March 2021

ER -