Jump to main content Hotkeys
Distributed and Self-organizing Systems
Distributed and Self-organizing Systems

PUBLICATION

ORKG Properties Ontology Consolidated: LLM-Driven Refinement of Crowdsourced Knowledge for Machine-Actionability

Type

Conference Paper

Year

2026

Authors

Sandra Schaftner

gaedke

Research Area

Web Engineering

Event

"SemTech 2026: 4th International Workshop on AI and Semantic Technologies for the Scientific, Technical, and Legal Web"

Published in

WWW Companion '26: Companion Proceedings of the ACM Web Conference 2026

ISBN/ISSN

979-8-4007-2308-7/2026/04

Download

DOI PDF

Abstract

In the era of Generative AI, Scientific Knowledge Graphs (SKGs) have become instrumental for grounding Retrieval-Augmented Generation (RAG), especially for mitigating hallucinations. As a prominent example, the Open Research Knowledge Graph (ORKG) serves as a foundational element of emerging research data infrastructures. It distinguishes itself by modeling not only metadata but also the semantic content of research publications as fine-grained triples. However, the quality and utility of these triples heavily depend on the predicates defined in the ORKG ontology. Currently, the ontology suffers from quality degradation inherent to uncontrolled crowdsourcing, such as widely duplicated properties, inconsistent naming conventions, and ambiguous semantics, thereby inhibiting machine-actionability. In this paper, we introduce the Consolidated ORKG Properties Ontology (OPO-Consolidated), a consolidated schema designed to resolve these inconsistencies. We present a semi-automated workflow that combines Large Language Models (LLMs) with human engineering to systematically clean and restructure the existing property set. Our evaluation, utilizing an established Gold Standard dataset of 153 research papers, demonstrates that OPO-Consolidated substantially improves schema conciseness (by resolving synonymy and redundancy) and semantic consistency (by enforcing uniform naming conventions) while maintaining the valuable semantic coverage of the data. Furthermore, we show that OPO-Consolidated ensures backward compatibility with existing data, providing a seamless migration path while establishing a machine-actionable foundation for established ORKG comparisons and future downstream tasks.

Reference

TBA



Powered by DGS
Edit list (authentication required)

Press Articles