Simultaneous confidence intervals for comparisons of several multinomial samples

Research output: Contribution to journalArticleResearchpeer review

Authors

Research Organisations

External Research Organisations

  • University of Canterbury
View graph of relations

Details

Original languageEnglish
Pages (from-to)65-76
Number of pages12
JournalComputational Statistics and Data Analysis
Volume106
Publication statusPublished - 10 Sept 2016

Abstract

Multinomial data occur if the major outcome of an experiment is the classification of experimental units into more than two mutually exclusive categories. In experiments with several treatment groups, one may then be interested in multiple comparisons between the treatments w.r.t several definitions of odds between the multinomial proportions. Asymptotic methods are described for constructing simultaneous confidence intervals for this inferential problem. Further, alternative methods based on sampling from Dirichlet posterior distributions with vague Dirichlet priors are described. Monte Carlo simulations are performed to compare these methods w.r.t. their frequentist simultaneous coverage probabilities for a wide range of sample sizes and multinomial proportions: The methods have comparable properties for large samples and no rare events involved. In small sample situations or when rare events are involved in the sense that the expected values in some cells of the contingency table are as low as 5 or 10, the method based on sampling from the Dirichlet posterior yields simultaneous coverage probabilities closest to the nominal confidence level. The methods are provided in an R-package and their application is illustrated for examples from developmental toxicology and differential blood counts.

Keywords

    Baseline logit, Coverage probability, Dirichlet, Multiple comparisons, Polytomous data

ASJC Scopus subject areas

Cite this

Simultaneous confidence intervals for comparisons of several multinomial samples. / Schaarschmidt, Frank; Gerhard, Daniel; Vogel, Charlotte.
In: Computational Statistics and Data Analysis, Vol. 106, 10.09.2016, p. 65-76.

Research output: Contribution to journalArticleResearchpeer review

Download
@article{ca5ea7642324421e94f86533d01aa6ae,
title = "Simultaneous confidence intervals for comparisons of several multinomial samples",
abstract = "Multinomial data occur if the major outcome of an experiment is the classification of experimental units into more than two mutually exclusive categories. In experiments with several treatment groups, one may then be interested in multiple comparisons between the treatments w.r.t several definitions of odds between the multinomial proportions. Asymptotic methods are described for constructing simultaneous confidence intervals for this inferential problem. Further, alternative methods based on sampling from Dirichlet posterior distributions with vague Dirichlet priors are described. Monte Carlo simulations are performed to compare these methods w.r.t. their frequentist simultaneous coverage probabilities for a wide range of sample sizes and multinomial proportions: The methods have comparable properties for large samples and no rare events involved. In small sample situations or when rare events are involved in the sense that the expected values in some cells of the contingency table are as low as 5 or 10, the method based on sampling from the Dirichlet posterior yields simultaneous coverage probabilities closest to the nominal confidence level. The methods are provided in an R-package and their application is illustrated for examples from developmental toxicology and differential blood counts.",
keywords = "Baseline logit, Coverage probability, Dirichlet, Multiple comparisons, Polytomous data",
author = "Frank Schaarschmidt and Daniel Gerhard and Charlotte Vogel",
note = "Publisher Copyright: {\textcopyright} 2016 Elsevier B.V. Copyright: Copyright 2017 Elsevier B.V., All rights reserved.",
year = "2016",
month = sep,
day = "10",
doi = "10.1016/j.csda.2016.09.004",
language = "English",
volume = "106",
pages = "65--76",
journal = "Computational Statistics and Data Analysis",
issn = "0167-9473",
publisher = "Elsevier",

}

Download

TY - JOUR

T1 - Simultaneous confidence intervals for comparisons of several multinomial samples

AU - Schaarschmidt, Frank

AU - Gerhard, Daniel

AU - Vogel, Charlotte

N1 - Publisher Copyright: © 2016 Elsevier B.V. Copyright: Copyright 2017 Elsevier B.V., All rights reserved.

PY - 2016/9/10

Y1 - 2016/9/10

N2 - Multinomial data occur if the major outcome of an experiment is the classification of experimental units into more than two mutually exclusive categories. In experiments with several treatment groups, one may then be interested in multiple comparisons between the treatments w.r.t several definitions of odds between the multinomial proportions. Asymptotic methods are described for constructing simultaneous confidence intervals for this inferential problem. Further, alternative methods based on sampling from Dirichlet posterior distributions with vague Dirichlet priors are described. Monte Carlo simulations are performed to compare these methods w.r.t. their frequentist simultaneous coverage probabilities for a wide range of sample sizes and multinomial proportions: The methods have comparable properties for large samples and no rare events involved. In small sample situations or when rare events are involved in the sense that the expected values in some cells of the contingency table are as low as 5 or 10, the method based on sampling from the Dirichlet posterior yields simultaneous coverage probabilities closest to the nominal confidence level. The methods are provided in an R-package and their application is illustrated for examples from developmental toxicology and differential blood counts.

AB - Multinomial data occur if the major outcome of an experiment is the classification of experimental units into more than two mutually exclusive categories. In experiments with several treatment groups, one may then be interested in multiple comparisons between the treatments w.r.t several definitions of odds between the multinomial proportions. Asymptotic methods are described for constructing simultaneous confidence intervals for this inferential problem. Further, alternative methods based on sampling from Dirichlet posterior distributions with vague Dirichlet priors are described. Monte Carlo simulations are performed to compare these methods w.r.t. their frequentist simultaneous coverage probabilities for a wide range of sample sizes and multinomial proportions: The methods have comparable properties for large samples and no rare events involved. In small sample situations or when rare events are involved in the sense that the expected values in some cells of the contingency table are as low as 5 or 10, the method based on sampling from the Dirichlet posterior yields simultaneous coverage probabilities closest to the nominal confidence level. The methods are provided in an R-package and their application is illustrated for examples from developmental toxicology and differential blood counts.

KW - Baseline logit

KW - Coverage probability

KW - Dirichlet

KW - Multiple comparisons

KW - Polytomous data

UR - http://www.scopus.com/inward/record.url?scp=84988672433&partnerID=8YFLogxK

U2 - 10.1016/j.csda.2016.09.004

DO - 10.1016/j.csda.2016.09.004

M3 - Article

AN - SCOPUS:84988672433

VL - 106

SP - 65

EP - 76

JO - Computational Statistics and Data Analysis

JF - Computational Statistics and Data Analysis

SN - 0167-9473

ER -

By the same author(s)