C. Lofi
48 records found
1
In forensic investigations, an increasing amount of evidence is retrieved from digital devices. This evidence is often extracted from devices using digital forensic platforms. The platforms are able to extract digital traces from several types of files, originating from different
...
Improving research data reusability through data conversations
Bridging gaps in metadata supply and demand
Efficient and inclusive data reuse across research disciplines is based on high quality metadata that bridges the gap between data producers and consumers. This gap, referred to as the metadata gap, arises when the metadata provided by producers do not meet the needs of consumers
...
With the advent of large language models (LLMs), developing solutions for Natural Language Processing (NLP) tasks has become more approachable. However, these models are opaque, which presents several challenges, such as prompt engineering, quality assessment, and error analysis.
...
Large language models have achieved breakthroughs in many natural language processing tasks. One of their main appeals is the ability to tackle problems that lack sufficient training data to create a dedicated solution. Manga translation is one such task, a still budding and un
...
In recent years, the rapid advancements in big data, machine learning, and artificial intelligence have led to a corresponding rise in privacy concerns. One of the solutions to address these concerns is federated learning. In this thesis, we will look at the setting of vertical f
...
Music annotation and transcription of music sheets are traditionally performed by experts. Although these processes result in high quality data, the scope of each effort is relatively narrow resulting in highly specialised and specific datasets of annotated music compositions, wh
...
Schema matching is a critical data integration process, which aims at capturing relevance between elements of different datasets; when datasets are tabular, it translates to the process of discovering related columns among them. Accurately discovering column matches is integral f
...
Despite the low adoption rates of artificial intelligence (AI) in respiratory medicine, its potential to improve patient outcomes is substantial. To facilitate the integration of AI systems into the clinical setting, it is essential to prioritise the development of explainable AI
...
Colorectal cancer is a widespread disease that significantly impacts the health of individuals worldwide. Understanding the needs and concerns of those affected by this disease is crucial for improving patient outcomes and enhancing the quality of care. Patient web forums have em
...
The goal of this research is to model and understand the effects of tourism demand on air quality by performing data integration on multi-source data. This research is aimed at researchers and practitioners aiming to perform multidisciplinary research in the fields of data scienc
...
Gene similarity has been an area of great interest in numerous fields for decades, as it can provide insights into the evolutionary relationships among different species. This knowledge is particularly useful for advancing biotechnologies, discovering new drugs and treatments for
...
There are an estimated 253 million blind and visually impaired people in the world. To grant them access to text publications that contain images, experts are employed to write image descriptions. There is both a societal and a legislative pressure to supply image descriptions to
...
Over time Linked Data collections are continuously subject to change because of numerous reasons. Users could insert new observations, or they could rectify erroneous statements in these knowledge graphs. In order not to lose historically import information, this trend of evolvin
...
Public technology has been shown to have a strong dependence on physical touch, which increases the transmission of diseases. Gesture recognition helps to reduce this transmission, as the dependence on physical touch is removed. Furthermore, the use of visible light for gesture r
...
Dataset discovery techniques originally required datasets to have the same domain which made them unsuitable to be used on a larger scale. To avoid this requirement, newer techniques use additional information, aside from the datasets being processed, to better understand the dat
...
Binary Neural Networks (BNNs) are compact and efficient by using binary weights instead of real-valued weights. Current BNNs use latent real-valued weights during training, where several training hyper-parameters are inherited from real-valued networks. The interpretation of seve
...
Investigating changes in forest cover has been an area of intense research for decades. From manual surveys to remote sensing we have come a long way in mapping the world around us. Machine learning and its' younger sibling, deep learning, have emerged as highly useful tools on t
...
Short Duration ECG into Autoencoder Followed By Clustering
An Explorational Study
Electrocardiography is the craft of producing electrocardiograms. These graphs give physicians insight into the potential pathology of the heart. In order to come to a diagnosis, physicians use electrocardiograms in combination with follow-up physical examinations. There has been
...
Multi-Level Fairness Framework
A Socio-Technical framework for Fairness Requirements Engineering in Machine Learning
Machine Learning models are begin increasingly used within the industry such as by financial institutions, governments and commercial companies. In the past few years, there have been several incidents where these ML models show discriminatory behavior towards particular groups o
...
Database entity type recognition is the practice of recognizing conceptual entity types for which given data sets contain data. In big data or data lake settings, it is not always known which conceptual entity types are represented in each data set, making it difficult to extract
...