courses:wshop:topics:tematy2025zima

Tematy projektów WSHOP -- zima 2025/2026

  • : Possibility of extending it to master thesis
  • : Quick project
  • : Linked to international scientific project
  • : Linked to JU-internal scientific project
  • Student: FIXME
  • Namespace in the wiki: FIXME
  • The goal of the project: FIXME
  • Technology: FIXME
  • Description: FIXME
  • Links:
    • FIXME
  • Student: Bartłomiej Błoniarz, Michał Gniadek, Inka Sokołowska
  • Namespace in the wiki: FIXME
  • The goal of the project: Analysis of player behavior based on affective events in the game.
  • Technology: Python, psychology basis, experimental data
  • Description: Identification of characteristic points in the game, linking emotional reactions based on in-game actions, information about eye movements, player movements, and psychological profile. Real data, opportunity to participate in the experiment from the researcher's perspective, further analysis in the next semester, and potential use in a master's thesis.
  • Links:
  • Student: Mateusz Dyszewski
  • Namespace in the wiki:
  • The goal of the project: Implementing automatic label generation with LLM + CIDOC-CRM
  • Technology: RDF, Python, LLM
  • Description: The main goal is to use LLMs (or SLMs) to generate labels for events in CIDOC-CRM. Instances of the event class in CIDOC-CRM connect people, places, and properties, but when imported from other datasets they often lack labels. The aim is to generate descriptive labels based on linked entities. For example: given an event typed as matriculation, with institution Cracow Academy and participant Mikolaj Kopernik, the generated label could be “Copernicus matriculation at the Cracow Academy.”
  • Links:
  • Student:
  • Namespace in the wiki:
  • The goal of the project: Applying KG discovery algorithms to digital humanities KG
  • Technology: RDF, Python
  • Description: This project applies KG discovery algorithms to find interesting and non-obvious relations within digital humanities KGs, using the CIDOC-CRM ontology as a case study. It explores serendipitous discovery, path evaluation, and pattern identification. Graph-based models enable this because they facilitate algorithmic exploration of linked cultural heritage data.
  • Links:
  • Student:
  • Namespace in the wiki:
  • The goal of the project: Applying network analysis to digital humanities KG
  • Technology: RDF, Python
  • Description: This project uses network analysis methods (degree, eigenvector, PageRank centrality, community detection) to reveal patterns, central entities, and relationships in cultural heritage KGs based on CIDOC-CRM. The aim is to identify structural properties and key nodes to aid interpretation and support research within cultural heritage contexts.
  • Links:
  • Student:
  • Namespace in the wiki:
  • The goal of the project: Implementing a tool for automatic KG shortcut generation
  • Technology: RDF, Python
  • Description: This project automates the creation of KG “shortcuts,” which are derived relations that simplify navigation, querying, and graph algorithm usage in complex ontologies like CIDOC-CRM. Shortcuts may omit certain reifications or events, making knowledge graphs easier to traverse and analyse. The approach should be generalizable beyond CIDOC-CRM.
  • Links:
  • Student:Justyna Gargula

  • Student: Bartosz Mierzwa, Piotr Wójtowicz
  • Namespace in the wiki: polyvocal
  • The goal of the project: How to model and process various (sometimes even contradictory) opinions on the same topic in knowledge graphs
  • Technology: literature studies, prototypes evaluation, RDF/Semantic Web
  • Description: Classically, knowledge bases strive to show one universal truth ( same for machine learning models). Therefore, usually methods in knowledge bases revolve around concepts such as “consistency”, “conflict resolution”, “lack of redundancy”. But reality is different, and there are various parallel opinions on the same facts or artifacts, e.g. a painting exhibited in the Louvre may be understood quite differently by a person educated in Western European culture and a Japanese person - on the one hand they may refer to other cultural symbols in their interpretations, and on the other hand they may not understand something if the creator came from outside of their culture and may need additional contextual information. In a sense, an analogous situation occurs in filter bubbles, where people “locked” in their bubble have a shared body of knowledge that may be specific only to them, and which one needs to know in order to understand the information they are conveying. The goal of the project is to study the available literature on such approaches, prepare a catalog of situations/problems in which such polyvocality can take place, and evaluate prototypes (if they exist)/prepare our own prototypes of such knowledge bases in the form of knowledge graphs.
  • Links:
  • Student: Tomasz Pakuła, Szymon Pietrzak
  • Namespace in the wiki: enrichment
  • The goal of the project: Determine ways to automatically enrich metadata in the field of cultural heritage and which of these methods actually work
  • Technology: literature studies, prototypes evaluation, RDF/Semantic Web
  • Description: The description of cultural heritage objects is very minimal and still relies on the same simple metadata as in card catalogs. Detailed metadata appeared only for selected collections and was created manually. But over the last few decades, the semantic web, machine learning, and other areas of AI have been developing. Has this made it possible to create metadata automatically? Yes. We have LLM, which creates descriptions or keywords, we have object detection methods (e.g., YOLO), which indicate what objects are in an image, we have handwriting recognition methods that can read manuscripts. Are there other methods? Are there tools ready for this? Do they work? Is this actually used in the field of cultural heritage? To what extent? Does it utilize knowledge graphs? This is to be determined within the project.
  • Student: Paweł Jasiński
  • Namespace in the wiki: incunabula
  • The goal of the project: Create an incunabula owners catalogue and integrate it with CHExRISH ontology
  • Technology: Python, RDF/Semantic Web
  • Description: (1) export the metadata from the Jagiellonian Library catalogue, (2) create a knowledge graph schema according to good practices in the domain (to check as a part of the project), (3) transfer all exported metadata into such a knowledge graph, (4) validate with the domain experts, (5) repeat, if needed, (6) integrate the final graph with the CHExRISH ontology
  • Student: Hubert Musiał, Jakub Bednarz
  • Namespace in the wiki: chcloud
  • The goal of the project: Validate the usability of Wikidata as a central point of cultural heritage data cloud
  • Technology: RDF/Semantic Web, some programming (Python preferred)
  • Description: Wikidata has many links to external sources of information (URIs/foreign keys), but are these links correct? Can additional information be extracted from these sources, or do these pages not contain any data that can be processed automatically? The aim of this project is: (a) to check whether these links are actually valid, (b) check whether/how and what data can be extracted from these external sources and whether this process can be automated, © prepare a knowledge graph that will combine this data into a single cultural heritage data cloud [for the purposes of the project, we will probably limit it to individuals associated with the University], (d) evaluate such a graph (including comparison with a graph containing only data from Wikidata)
  • Student: Jakub Chmura, Aleksandra Stępień
  • Namespace in the wiki: dockerpatchcore
  • The goal of the project: The goal is to provide docker image with REST API (based on Flask) as a service which later could be connected to website for visual anomaly detection.
  • Technology: Docker, Flask, Python
  • Description: The code that implements anomaly detection will be provided in a form of a parametrized python script. The goal of the project would be to design endpoints for REST API and architecture of the service (e.g. how datasets will be loaded, how concurrency will be implemented (celery, redis server?)) and implementation of proposed service along with docker configuration.
  • Links:
  • Student: Maciej Wójcki, Jakub Kręcisz
  • Namespace in the wiki: dockeropenai
  • The goal of the project: The goal of the project it to prepare and test docker image for custom hugginface model that provides OpenAI REST API
  • Technology: Docker, Python, Flask, FastAPI
  • Description: The goal is to use one of the ready-to-use docker images and test it for the sake of hosting custom hugginface model (including LLMs, multimodal models such as Phi.3.5, etc.)
  • Links:
  • Student: Paweł Wacławik
  • Namespace in the wiki: xaihyperopt
  • The goal of the project: The goal of the project it to use counterfactual generation and surrogate model learining to implement counterfactual-based sampler for hyperparameter optimization
  • Technology: Optuna, Python
  • Description:
  • Links:
  • Student: Kamil Kochańczyk, Jacek Gołębiowski
  • Namespace in the wiki: anomalycf
  • The goal of the project: How to generate visually correct counterfactual that is at the same time valid
  • Technology: Python, Pytorch, Tensorflow
  • Description: The work will be based on the MsC of Jakub Siwy, who tested various inpaiting methods for genreation of valid normal samples in anomaly-detection task. In this project, the goal is to extend the Future works by performing more experiments on all sampels from the provided datasets and to focus mainly on how newly generated samples can improve anomaly detection process. I.e. implement Counterfactual-based augmentation of dataset for more reliable anomaly detection.
  • Links:
  • Student: Błażej Torbus
  • Namespace in the wiki: openmlrag
  • The goal of the project: The goal is to use RAG systems to embed knowledge about ML pipelines obtained from OpenML
  • Technology: Python, R2R
  • Description: The main idea is to build a system that will encode knowledge about pipelines that is present in the OpenML reposiotry and allow for querying it to obtain most promising set of pipelines/hyperparameters for a specified dataset/task. This could be later used to improve the fyperparameetr optimization and explainability by constructign setup for optimizer based on data charactersitics, task, and other constrains that are available online and can server as an expert/background knoweldge.
  • Links:
  • Student: Szymon Fortuna, Tair Yerniyazov
  • Namespace in the wiki: decboundcomp
  • The goal of the project: Research on different methods that can be used to describe/measure the differences in the decision boundary of a classifier.
  • Technology: Python
  • Description: Having two models, trained on the same task how to explain the differences between them. This refers to the problem of capturing, measuring and describing the Rashomon Effect.
  • Links:
  • Student: Stanisław Mnich, Angelo Norelli
  • Namespace in the wiki: gnnexplanations
  • The goal of the project: Combine Logical neural netowrks with graph neural networks for better reasonin and explainability (healthcare domain)
  • Technology: GNN, Python, Pytorch
  • Description: Combining GNN Insights with Domain Knowledge. Exploring is using the GNN model to identify key features or temporal patterns associated with abnormal breathing. These could then be integrated with domain expertise, similar to our previous work where we combined Bayesian networks with a counterfactual generation mechanism to produce more plausible explanations. This could be ideally approached win LNN
  • Links:
    • LNN: LNN
    • (Access will be granted only to project team): GNN-TS-ANO
  • Student: Jan Zioło, Wojciech Szymańskim
  • Namespace in the wiki: xaitabullshit
  • The goal of the project: Investigate how much value existing rule-based approaches give in comparison to glassbox models
  • Technology: Python
  • Description: We have already prepared a script, where we test 4 explainers on around 55 datasets. We would like to extend this to more explainers and more datasets and confrim our observation, that complex rule-based approaches are not useful for tabular data, as good old fashion decision tree can easily compete with them.
  • Links:
  • courses/wshop/topics/tematy2025zima.txt
  • Last modified: 5 days ago
  • by kkt