A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task

Nicolas Steiner; Ziteng Li; Omid Vosoughi; Johanna Schrader; Soumyadeep Roy; Wolfgang Nejdl; Ming Tang

doi:10.1145/3701551.3708811

Details

Original language	English
Title of host publication	WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
Pages	1112-1113
Number of pages	2
ISBN (electronic)	9798400713293
Publication status	Published - 10 March 2025
Event	18th ACM International Conference on Web Search and Data Mining, WSDM 2025 - Hannover, Germany. Duration: 10 March 2025 to 14 March 2025

Abstract

This study presents a comprehensive benchmarking of three state-of-the-art single-cell foundation models, scGPT, Geneformer, and scFoundation, on cell-type classification tasks. We evaluate the models on three datasets: myeloid, human pancreas, and multiple sclerosis, examining both standard fine-tuning and few-shot learning scenarios. Our work reveals that scFoundation consistently achieves the best performance while Geneformer performs poorly, yielding results sometimes even worse than those of the baseline models. Additionally, we demonstrate that a good foundation model can generalize well even when fine-tuned with out-of-distribution data, a capability that the baseline models lack. Our work highlights the potential of foundation models for addressing challenging biomedical questions, particularly in contexts where models are trained on one population but deployed on another.

ASJC Scopus subject areas

Computer Science (all)
Computer Networks and Communications
Computer Science (all)
Computer Science Applications
Computer Science (all)
Software

Cite this

A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task. / Steiner, Nicolas; Li, Ziteng; Vosoughi, Omid et al.
WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. p. 1112-1113.

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer-reviewed

Steiner, N, Li, Z, Vosoughi, O, Schrader, J, Roy, S, Nejdl, W & Tang, M 2025, A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task. in WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. pp. 1112-1113, 18th ACM International Conference on Web Search and Data Mining, WSDM 2025, Hannover, Lower Saxony, Germany, 10 March 2025. https://doi.org/10.1145/3701551.3708811

Steiner, N., Li, Z., Vosoughi, O., Schrader, J., Roy, S., Nejdl, W., & Tang, M. (2025). A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task. In WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining (pp. 1112-1113). https://doi.org/10.1145/3701551.3708811

Steiner N, Li Z, Vosoughi O, Schrader J, Roy S, Nejdl W et al. A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task. in WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. p. 1112-1113. doi: 10.1145/3701551.3708811

Steiner, Nicolas ; Li, Ziteng ; Vosoughi, Omid et al. / A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task. WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining. 2025. p. 1112-1113.

@inproceedings{903cef16bb1e4f49a8829429742fac45,
  title = "A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task",
  abstract = "This study presents a comprehensive benchmarking of three state-of-the-art single-cell foundation models scGPT, Geneformer, and scFoundation, on cell-type classification tasks. We evaluate the models on three datasets: myeloid, human pancreas, and multiple sclerosis, examining both standard fine-tuning and few-shot learning scenarios. Our work reveals that scFoundation consistently achieves the best performance while Geneformer performs poorly, yielding results sometimes even worse than those of the baseline models. Additionally, we demonstrate that a good foundation model can generalize well even when fine-tuned with out-of-distribution data, a capability that the baseline models lack. Our work highlights the potential of foundation models for addressing challenging biomedical questions, particularly in contexts where models are trained on one population but deployed on another.",
  keywords = "cell-type classification, few-shot learning, foundation models, out-of-distribution data",
  author = "Nicolas Steiner and Ziteng Li and Omid Vosoughi and Johanna Schrader and Soumyadeep Roy and Wolfgang Nejdl and Ming Tang",
  note = "Publisher Copyright: {\textcopyright} 2025 Copyright held by the owner/author(s).; 18th ACM International Conference on Web Search and Data Mining, WSDM 2025, WSDM 2025 ; Conference date: 10-03-2025 Through 14-03-2025",
  year = "2025",
  month = mar,
  day = "10",
  doi = "10.1145/3701551.3708811",
  language = "English",
  pages = "1112--1113",
  booktitle = "WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining",
}

TY  - GEN
T1  - A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task
AU  - Steiner, Nicolas
AU  - Li, Ziteng
AU  - Vosoughi, Omid
AU  - Schrader, Johanna
AU  - Roy, Soumyadeep
AU  - Nejdl, Wolfgang
AU  - Tang, Ming
PY  - 2025/3/10
Y1  - 2025/3/10
N2  - This study presents a comprehensive benchmarking of three state-of-the-art single-cell foundation models scGPT, Geneformer, and scFoundation, on cell-type classification tasks. We evaluate the models on three datasets: myeloid, human pancreas, and multiple sclerosis, examining both standard fine-tuning and few-shot learning scenarios. Our work reveals that scFoundation consistently achieves the best performance while Geneformer performs poorly, yielding results sometimes even worse than those of the baseline models. Additionally, we demonstrate that a good foundation model can generalize well even when fine-tuned with out-of-distribution data, a capability that the baseline models lack. Our work highlights the potential of foundation models for addressing challenging biomedical questions, particularly in contexts where models are trained on one population but deployed on another.
AB  - This study presents a comprehensive benchmarking of three state-of-the-art single-cell foundation models scGPT, Geneformer, and scFoundation, on cell-type classification tasks. We evaluate the models on three datasets: myeloid, human pancreas, and multiple sclerosis, examining both standard fine-tuning and few-shot learning scenarios. Our work reveals that scFoundation consistently achieves the best performance while Geneformer performs poorly, yielding results sometimes even worse than those of the baseline models. Additionally, we demonstrate that a good foundation model can generalize well even when fine-tuned with out-of-distribution data, a capability that the baseline models lack. Our work highlights the potential of foundation models for addressing challenging biomedical questions, particularly in contexts where models are trained on one population but deployed on another.
KW  - cell-type classification
KW  - few-shot learning
KW  - foundation models
KW  - out-of-distribution data
UR  - http://www.scopus.com/inward/record.url?scp=105001669179&partnerID=8YFLogxK
U2  - 10.1145/3701551.3708811
DO  - 10.1145/3701551.3708811
M3  - Conference contribution
AN  - SCOPUS:105001669179
SP  - 1112
EP  - 1113
BT  - WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
T2  - 18th ACM International Conference on Web Search and Data Mining, WSDM 2025
Y2  - 10 March 2025 through 14 March 2025
ER  -
