Details
| Original language | English |
|---|---|
| Title of host publication | The Semantic Web |
| Subtitle of host publication | 22nd European Semantic Web Conference, ESWC 2025, Proceedings |
| Editors | Edward Curry, Maribel Acosta, Maria Poveda-Villalón, Marieke van Erp, Adegboyega Ojo, Katja Hose, Cogan Shimizu, Pasquale Lisena |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 244-261 |
| Number of pages | 18 |
| ISBN (electronic) | 978-3-031-94578-6 |
| ISBN (print) | 9783031945779 |
| Publication status | Published - 31 May 2025 |
| Event | 22nd European Semantic Web Conference, ESWC 2025 - Portoroz, Slovenia Duration: 1 Jun 2025 → 5 Jun 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15719 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (electronic) | 1611-3349 |
Abstract
Extracting structured information from unstructured text is crucial for modeling real-world processes, but traditional schema mining relies on semi-structured data, limiting scalability. This paper introduces schema-miner, a novel tool that combines large language models with human feedback to automate and refine schema extraction. Through an iterative workflow, it organizes properties from text, incorporates expert input, and integrates domain-specific ontologies for semantic depth. Applied to materials science—specifically atomic layer deposition—schema-miner demonstrates that expert-guided LLMs generate semantically rich schemas suitable for diverse real-world applications.
Keywords
- Human-in-the-loop Workflow, Large Language Models, Schema Discovery, Schema Mining, Scientific Schemas
ASJC Scopus subject areas
- Mathematics(all)
- Theoretical Computer Science
- Computer Science(all)
- General Computer Science
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
The Semantic Web : 22nd European Semantic Web Conference, ESWC 2025, Proceedings. ed. / Edward Curry; Maribel Acosta; Maria Poveda-Villalón; Marieke van Erp; Adegboyega Ojo; Katja Hose; Cogan Shimizu; Pasquale Lisena. Springer Science and Business Media Deutschland GmbH, 2025. p. 244-261 (Lecture Notes in Computer Science; Vol. 15719 LNCS).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - LLMs4SchemaDiscovery
T2 - 22nd European Semantic Web Conference, ESWC 2025
AU - Sadruddin, Sameer
AU - D’Souza, Jennifer
AU - Poupaki, Eleni
AU - Watkins, Alex
AU - Babaei Giglou, Hamed
AU - Rula, Anisa
AU - Karasulu, Bora
AU - Auer, Sören
AU - Mackus, Adrie
AU - Kessels, Erwin
N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025/5/31
Y1 - 2025/5/31
N2 - Extracting structured information from unstructured text is crucial for modeling real-world processes, but traditional schema mining relies on semi-structured data, limiting scalability. This paper introduces schema-miner, a novel tool that combines large language models with human feedback to automate and refine schema extraction. Through an iterative workflow, it organizes properties from text, incorporates expert input, and integrates domain-specific ontologies for semantic depth. Applied to materials science—specifically atomic layer deposition—schema-miner demonstrates that expert-guided LLMs generate semantically rich schemas suitable for diverse real-world applications.
AB - Extracting structured information from unstructured text is crucial for modeling real-world processes, but traditional schema mining relies on semi-structured data, limiting scalability. This paper introduces schema-miner, a novel tool that combines large language models with human feedback to automate and refine schema extraction. Through an iterative workflow, it organizes properties from text, incorporates expert input, and integrates domain-specific ontologies for semantic depth. Applied to materials science—specifically atomic layer deposition—schema-miner demonstrates that expert-guided LLMs generate semantically rich schemas suitable for diverse real-world applications.
KW - Human-in-the-loop Workflow
KW - Large Language Models
KW - Schema Discovery
KW - Schema Mining
KW - Scientific Schemas
UR - http://www.scopus.com/inward/record.url?scp=105007760723&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-94578-6_14
DO - 10.1007/978-3-031-94578-6_14
M3 - Conference contribution
AN - SCOPUS:105007760723
SN - 9783031945779
T3 - Lecture Notes in Computer Science
SP - 244
EP - 261
BT - The Semantic Web
A2 - Curry, Edward
A2 - Acosta, Maribel
A2 - Poveda-Villalón, Maria
A2 - van Erp, Marieke
A2 - Ojo, Adegboyega
A2 - Hose, Katja
A2 - Shimizu, Cogan
A2 - Lisena, Pasquale
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 1 June 2025 through 5 June 2025
ER -