5th International Workshop on Scientific Knowledge: Representation, Discovery, and Assessment
2 November 2025 - Nara, Japan
co-located with The
24th International Semantic Web Conference, ISWC 2025
Recently, we have experienced a massive increase in the volume of scientific articles and research artefacts (e.g., datasets, models, software packages). This trend is expected to continue and opens up challenges including the development of large-scale machine-readable representations of scientific knowledge, making scholarly data and knowledge discoverable and accessible, and designing reliable and comprehensive metrics to assess scientific impact. Sci-K provides a forum for researchers and practitioners from different disciplines to present, educate, and guide research related to scientific knowledge. Three themes cover the most important challenges in this field: representation, discoverability, and assessment.
There is a need for flexible, context-sensitive, fine-grained, and machine-actionable representations of scholarly knowledge that are, at the same time, structured, interlinked, and semantically rich: Scientific Knowledge Graphs (SKGs), also known as Research Knowledge Graphs (RKGs). ...SKGs/RKGs can power data-driven services for navigating, analysing, and making sense of research dynamics. Current challenges are related to the design of ontologies or alternative representation methods able to conceptualise scholarly knowledge, model its representation, and enable exchange.
Scholarly information should be easily findable, discoverable, and visible so that it can be mined and organised within SKGs/RKGs. Discovery tools should be able to crawl the Web and identify scholarly data, whether on a publisher’s website or elsewhere – institutional repositories ... , preprint servers or open-access repositories. This is challenging and requires a deep understanding of both the scholarly communication landscape and the needs of a variety of stakeholders: researchers (of different fields and sub-fields), publishers, funders, and the general public. Other challenges are related to the discovery and extraction of entities and concepts, integration of information from heterogeneous sources, identification of duplicates, finding connections between entities, and identifying conceptual inconsistencies. We are particularly interested in modern systems that integrate AI, NLP, and LLM technologies.
Due to the continuous growth in the volume of research output, rigorous approaches for the evaluation and assessment of research impact are now more relevant than ever. There is a need for reliable, comprehensive, and equitable metrics and indicators of the scientific impact and merit of publications ... , datasets, research institutions, individual researchers, and other relevant entities.
Sci-K is calling for high-quality submissions around the three main themes of research related to
scientific knowledge: representation, discoverability, and assessment.
Topics of interest include, but are not limited to:
Associate Professor at Old Dominion University
Abstract: Since 2023, there has been a surge of public and research interest in large language models (LLMs) and recently vision language models, which significantly shifted the paradigm of mining scholarly big data, bringing both challenges and opportunities for this ever-growing field. This paradigm shift not only significantly improves the performance of traditional metadata-centered pipelines for knowledge extraction, classification, and downstream tasks, which usually served as core components for academic digital libraries, but it also opens doors to the content-centered tasks, mining fine-grained knowledge and data, which provides deeper insights and wider applications of scholarly publications for a broader audience beyond scientific researchers. We explore LLM-based solutions for several content-centered tasks related to knowledge and data from scholarly publications, and prospect how these solutions can shed light on supporting advanced services, such as data preservation, scholarly comparison, review generation, and science dissemination. We share preliminary work in this direction, including open-access datasets and software extraction, complex table data extraction, scientific claim verification, and research reproducibility assessment.
Bio: Dr. Jian Wu is an associate professor of Computer Science at ODU. Dr. Wu obtained his Ph.D. degree at Pennsylvania State University (Penn State) in 2011 and worked as a postdoctoral fellow with Dr. C. Lee Giles before joining ODU in 2018. Since then, his research has been supported by NSF, IMLS, DARPA, Los Alamos National Laboratory, Virginia Commonwealth, and the Open Philanthropy. Dr. Wu’s research interests include natural language processing, scholarly big data, information retrieval, digital libraries, and the science of science. He has published more than 90 peer-reviewed papers in ACM, IEEE, and AAAI venues, with best papers and nominations, in addition to his earlier publications in Astronomy and Astrophysics. Dr. Wu shared the British Computer Society Award 2021 for the Best Open Source Project with Dr. C. Lee Giles.
This program is aligned with The 24th International Semantic Web Conference (ISWC 2025) program.
1st Session | |
- | Opening and Welcome |
- | Keynote by Dr. Jian Wu, Deep Mining Scholarly Big Data in the Large Language Model Era |
- | Knowledge Representation and Discovery for Cultural Heritage Research Data with CTO and SHMARQL. Tabea Tietz, Etienne Posthumus, Linnaea Söhn, Jonatan Jalle Steller, Oleksandra Bruns, Joerg Waitelonis, Torsten Schrade and Harald Sack (Discoverability) |
- | Deep Research in the Era of Agentic AI: Requirements and Limitations for Scholarly Research. Mohamad Yaser Jaradeh and Sören Auer (Assessment - Short Paper) |
- | Coffee Break |
2nd Session | |
- | A Survey on Metadata for Machine Learning Models and Datasets: Standards, Practices, and Harmonization Challenges. Genet Asefa Gesese, Zongxiong Chen, Oussama Zoubia, Fidan Limani, Kanishka Silva, Muhammad Asif Suryani, Benjamin Zapilko, Leyla Jael Castro, Ekaterina Kutafina, Dhwani Solanki, Heike Fliegl, Sonja Schimmler, Zeyd Boukhers and Harald Sack (Representation) |
- | KONDA: An LLM-based Tool for Semantic Annotation and Knowledge Graph Creation Using Ontologies for Research Data. Soo-Yon Kim, Martin Görz and Sandra Geisler (Representation) |
- | COPE: Chronic Observation and Progression Events Ontology. Asara Senaratne, Oshani Seneviratne, Hon Zent Lim and Leelanga Seneviratne (Representation) |
- | Ontologies in Motion: A BFO-Based Approach to Knowledge Graph Construction for Motor Performance Research Data in Sports Science. Sarah Rebecca Ondraszek, Jörg Waitelonis, Katja Keller, Claudia Niessner, Anna M. Jacyszyn and Harald Sack (Representation - Short Paper) |
- | From Philosophy to NLU: Evolving Definitions of Research Hypotheses. Jian Wu and Sarah Rajtmajer (Representation - Short Paper) |
- | Lunch |
3rd Session | |
- | Are Scientific Annotations Consistently Represented across Science Knowledge Graphs? Jenifer Tabita Ciuciu-Kiss and Daniel Garijo (Representation) |
- | MOP: Augmenting and Standardizing Heterogeneous Knowledge Graph Data Sources. Julia Evans, Mirjan Hoffmann, Sophie Matter and Axel Klinger (Representation) |
- | AI4DiTraRe: Building the BFO-Compliant Chemotion Knowledge Graph. Ebrahim Norouzi, Nicole Jung, Anna M. Jacyszyn, Joerg Waitelonis and Harald Sack (Representation) |
- | Towards AI-Supported Research: a Vision of the TIB AIssistant. Sören Auer, Allard Oelen, Mohamad Yaser Jaradeh, Mutahira Khalid, Farhana Keya, Sasi Kiran Gaddipati, Jennifer D'Souza, Lorenz Schlüter, Amirreza Alasti, Gollam Rabby, Azanzi Jiomekong and Oliver Karras (Assessment - Short Paper) |
- | ClimaFactsKG: Towards an Interlinked Knowledge Graph of Scientific Evidence to Fight Climate Misinformation. Grégoire Burel and Harith Alani (Discoverability - Short Paper) |
- | Coffee Break |
4th Session | |
- | Interlinking Research Data and Services in the Historical Sciences with MemO and the NFDI4Memory Knowledge Graph. Sarah Rebecca Ondraszek, Tabea Tietz, Jörg Waitelonis and Harald Sack (Discoverability - Short Paper) |
- |
Discussion Session (Panel / Fishbowl) |
- | Closing and Outlook |
July 11th, 2025 July 25th, 2025 (23:59, AoE timezone)
August 8th,
2025 August 30th,
2025
August 28th,
2025 September 16th,
2025 (tentative)
November 2nd, 2025
Submissions are welcome in the following categories:
The workshop calls for full research papers, describing original work on the listed
topics, and short papers, on early research results, new results on previously
published works, demos, and projects. In accordance with Open Science principles,
research papers may also be in the form of data or software papers (short or long
papers). Data papers present the motivation and methodology behind the creation of
data sets that are of value to the community, e.g., annotated corpora, benchmark
collections, and training sets. Software papers present software functionality, its
value for the community, and its application. To enable reproducibility and
peer-review, authors are requested to share the DOIs of datasets and software
products described in the articles.
The workshop also calls for vision/position papers providing insights towards new or
emerging areas, innovative or risky approaches, or emerging applications that will
require extensions to the state of the art. Vision papers do not necessarily have to
present results but should carefully elaborate on the motivation and ongoing
challenges of the described area.
Submissions must adhere to the CEURART
template. Please use the latest version of the template in
single-column format to prepare your submissions. You can download an offline version with the
style
files from http://ceur-ws.org/Vol-XXX/CEURART.zip.
It also contains DOCX template files. Overleaf users may want to use the
CEURART template available in Overleaf. Please adhere also to the CEUR-WS Policy on AI-Assisting Tools.
Submissions for review must be in PDF format. They must be self-contained and written in English.
Submissions that do not follow these guidelines, or do not view or print properly, will be rejected
without review.
Sci-K will adopt a single-anonymous review process, and each paper will be reviewed by at least three
Program Committee members.
The proceedings of the workshops will be published on CEUR either as a standalone volume or in companion
proceedings of ISWC 2025.
Submit your contributions to Sci-K 2025 Easychair page:
Contact: scik2025@easychair.org
We invite you to submit your paper to Sci-K 2025 if it was rejected from the main tracks
(Research, Resource, In-Use),
provided that it is in scope of the workshop.
Here is what you need to do:
For any question on the process, please contact: scik2025@easychair.org
ISWC2025 will be an in-person conference. All Sci-K papers that will be presented at the workshop and at
least one author per accepted paper must register to the conference.
The registration system is now live here, and the early bird rate
is available until September 16th.
You can find all the useful information about the venue, travel, and accommodation on a dedicated page
here.
Please check your VISA requirements to enter Japan. You can find helpful information on the ISWC website
here.
If you are a student, we encourage you to apply for a student grant
here.
PC members in alphabetical order.
Co-chairs for Sci-K 2025 (alphabetically)
Alphabetically ordered
Italian Research Council (CNR), Pisa (IT)
Université Libre de Bruxelles (ULB), Brussels (BE)
“Athena” RC, Athens (GR)