4th International Workshop on Scientific Knowledge: Representation, Discovery, and Assessment
12 November 2024 - Baltimore, MD, USA
co-located with The
23rd International Semantic Web Conference, ISWC 2024
In the last decades, we have experienced a substantial increase in the volume of published scientific articles and research artefacts (e.g., data sets, software packages); this trend is expected to continue and opens up challenges including the development of large-scale machine-readable representations of scientific knowledge, making scholarly data discoverable and accessible, and designing reliable and comprehensive metrics to assess scientific impact. The main objective of Sci-K is to provide a forum for researchers and practitioners from different disciplines to present, educate, and guide research related to scientific knowledge. We foresee three themes that cover the most important challenges in this field: representation, discoverability, and assessment.
There is an urge for flexible, context-sensitive, fine-grained, and machine-actionable representations of scholarly knowledge that at the same time are structured, interlinked, and semantically rich. Scientific Knowledge Graphs (SKGs) are becoming ... increasingly popular as infrastructures for representing scholarly knowledge. They are large networks describing the actors (e.g., authors, organisations), documents (e.g., publications, patents), ancillary material (e.g., research data, software), contextual information (e.g., projects, fundings), and research knowledge (e.g., research topics, tasks, technologies) in this space as well as their reciprocal relationships. These resources provide substantial benefits to researchers, companies, and policymakers by powering several data-driven services for navigating, analysing, and making sense of research dynamics. Some SKGs examples include OpenAlex, AMiner, Open Academic Graph, ScholarlyData.org, Semantic Scholar, PID Graph, Open Research Knowledge Graph, OpenCitations, and the OpenAIRE research graph. Regarding this aspect, the main challenge is related to the design of ontologies able to conceptualise scholarly knowledge, model its representation, and enable its exchange across different SKGs.
It is important that scholarly information is easily findable, discoverable, and visible, so that it can be mined and organised within SKGs. To this end, we need discovery tools able to crawl the Web and identify scholarly data, whether on a publisher’s ...website or elsewhere – institutional repositories, pre-print servers, open-access repositories, and others. This is a particularly challenging endeavour because it requires a deep understanding of both the scholarly communication landscape and the needs of a variety of stakeholders: researchers, publishers, funders, and the general public. Typically, in addition to the journal landing page, a paper's open version, pieces of software as well as data sets are often shared via alternative channels that are disconnected from the journal page. Currently, this is a major obstacle that the community of practitioners is facing for creating comprehensive knowledge graphs. In brief, the challenges are related to the discovery and extraction of entities and concepts, integration of information from heterogeneous sources, identification of duplicates, finding connections between entities, and identifying conceptual inconsistencies.
Due to the continuous growth in the volume of research output, rigorous approaches for the assessment of research impact are now more valuable than ever. In this context, we urge reliable, comprehensive, and equitable metrics and indicators of the scientific ... impact and merit of publications, data sets, research institutions, individual researchers, and other relevant entities. Scientific impact refers to the attention a research work receives inside its respective and related disciplines, the social/mass media, etc. Scientific merit, on the other hand, relates to the quality aspects of a work, such as its novelty, reproducibility, FAIR-ness, and readability. Nowadays, due to the growing popularity of Open Science initiatives, a large number of useful science-related data sets have been made openly available, paving the way for the synthesis of more sophisticated indicators of scientific impact and merit and, consequently, more rigorous research assessment. For instance, in recent years, we observed a surge of large SKGs, which providevery rich and relatively clean sources of information about academics, their publications and relevant metadata. These SKGs can be used for the development of novel research assessment approaches.
Sci-K is calling for high-quality submissions around the three main themes of research related to
scientific knowledge: representation, discoverability, and assessment.
Topics of interest include, but are not limited to:
Research Scientist at Google Research
Abstract: Datasets constitute a key part of scientific knowledge: they are described in publications and referenced in them; datasets themselves connect with one another in intricate ways. I will build on our experience with Google Dataset Search and discuss the different types of these connections, focusing on datasets. I will discuss the complementary roles of community and tooling to build the connections.
Bio: Natasha Noy is a scientist at Google Research where she works on making structured data accessible and useful. She leads the team building Dataset Search, a search engine for all the datasets on the Web. Prior to joining Google, she worked at Stanford Center for Biomedical Informatics Research where she made major contributions in the areas of ontology development and alignment, and collaborative ontology engineering. Dr. Noy is a Fellow of AAAI and ACM.
This program is aligned with The 23rd International Semantic Web Conference (ISWC 2024) program.
1st Session | |
- | Workshop Opening - Welcome |
- | Knowledge Graph Enabled Scientific Data Repositories. Paulo Pinheiro, Henrique Santos, James Masters, Matthew Johnson, Jeanette A. Stingone, Sofia Bengoa, Marcello Bax and Deborah L. McGuinness (Representation) |
- | DiTraRe: AI on a Spider’s Web. Interweaving Disciplines for Digitalisation. Anna Jacyszyn, Harald Sack, Ditrare-Study Group, Matthias Razum and Felix Bach (Discoverabilty - Short Paper) |
- | Ensuring FAIRness in Machine Learning Projects. Sefika Efeoglu, Zongxiong Chen and Sonja Schimmler (Assessment - Short Paper) |
- | Identifying Semantic Relationships Between Research Topics Using Large Language Models in a Zero-Shot Learning Setting. Tanay Aggarwal, Angelo Salatino, Francesco Osborne and Enrico Motta (Representation) |
- | Coffee Break |
2nd Session | |
- | Enhancing Scientific Discovery and Decision-Making: A Knowledge Graph-based Research Support System. Liubov Kovriguina, Linn Aung, Peter Haase, Nicolas Heist and David Lamprecht (Discoverabilty) |
- | Skills and Expertise in Large Organizations: An Enterprise Knowledge Graph Approach. Blerina Spahiu, Anna Lisa Gentile, Chad De Luca and Andrea Maurino (Assessment) |
- | Enhancing Scientific Knowledge Graph Generation Pipelines with LLMs and Human-in-the-Loop. Stefani Tsaneva, Danilo Dessì, Francesco Osborne and Marta Sabou (Representation) |
- | Federated Querying of Scholarly Communication Infrastructures. Muhammad Haris, Sören Auer and Markus Stocker (Discoverabilty) |
- | Lunch |
3rd Session | |
- | Assessing the Reliability and Scientific Rigor of References in Wikidata. Hannah Schuster, Amin Anjomshoaa and Axel Polleres (Assessment) |
- | Keynote by Natasha Noy |
- | Closing |
- | Coffee Break |
Pictures shot at the end of the workshop. On the left, the attending organisers (Angelo, Francesco, and Sonja) with Natasha Noy (keynote spearker). On the right, the audience.
Proceedings have been published on CEUR-WS.org and are available here: https://ceur-ws.org/Vol-3780/
July 11th, 2024 July 18th, 2024 (23:59, AoE timezone)
August 8th, 2024 August 15th,
2024 (tentative)
August 15th, 2024 September 15th,
2024
November 12th, 2024
Submissions are welcome in the following categories:
The workshop calls for full research papers (up to 8 pages + 2 pages of appendices + 2 pages of
references), describing original work on the listed topics, and short papers (up to 4 pages + 2 pages of
appendices + 2 pages of references), on early research results, new results on previously published
works, demos, and projects. In accordance with Open Science principles, research papers may also
be in the form of data or software papers (short or long papers). Data papers present the
motivation and methodology behind the creation of data sets that are of value to the community, e.g.,
annotated corpora, benchmark collections, and training sets. Software papers present software
functionality, its value for the community, and its application. To enable reproducibility and
peer-review, authors are requested to share the DOIs of datasets and software products described in the
articles.
The workshop also calls for vision/position papers (up to 4 pages + 2 pages of appendices + 2
pages of references) providing insights towards new or emerging areas, innovative or risky approaches,
or emerging applications that will require extensions to the state of the art. Vision papers do not
necessarily have to present results but should carefully elaborate on the motivation and ongoing
challenges of the described area.
Submissions must adhere to the CEURART
template. Please use the template in
single-column format to prepare your submissions. You can download an offline version with the
style
files from http://ceur-ws.org/Vol-XXX/CEURART.zip.
It also contains DOCX template files. Overleaf users may want to use the
CEURART template available in Overleaf.
Submissions for review must be in PDF format. They must be self-contained and written in English.
Submissions that do not follow these guidelines, or do not view or print properly, will be rejected
without review.
Sci-K will adopt a single-blind review process, and each paper will be reviewed by at least three
Program Committee members.
The proceedings of the workshops will be published on CEUR.
Submit your contributions to Sci-K 2024 Easychair page: https://easychair.org/conferences/?conf=scik2024
Contact: scik2024@easychair.org
ISWC2024 will be an in-person conference. All Sci-K papers that will be presented at the workshop and at
least one author per accepted paper must register to the conference.
Should be there any change, we will make any possible effort to inform you on time.
Tentative list and in alphabetical order.
Co-chairs for Sci-K 2024 (alphabetically)