RH
R. Hai
Authored
6 records found
Dynamic Digital Twin
Diagnosis, Treatment, Prediction, and Prevention of Disease During the Life Course
A digital twin (DT), originally defined as a virtual representation of a physical asset, system, or process, is a new concept in health care. A DT in health care is not a single technology but a domain-adapted multimodal modeling approach incorporating the acquisition, management
...
Data Lakes
A Survey of Functions and Systems
Data lakes are becoming increasingly prevalent for Big Data management and data analytics. In contrast to traditional 'schema-on-write' approaches such as data warehouses, data lakes are repositories storing raw data in its original formats and providing a common access interface
...
Amalur
Data Integration Meets Machine Learning
Machine learning (ML) training data is often scattered across disparate collections of datasets, called <italic>data silos</italic>. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different sources dema
...
Amalur
Data Integration Meets Machine Learning
Machine learning (ML) training data is often scattered across disparate collections of datasets, called <italic>data silos</italic>. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different sources dema
...
Amalur
Next-generation Data Integration in Data Lakes
Data science workflows often require extracting, preparing and integrating data from multiple data sources. This is a cumbersome and slow process: most of the times, data scientists prepare data in a data processing system or a data lake, and export it as a table, in order for it
...
Amalur
Data Integration Meets Machine Learning
Machine learning (ML) training data is often scattered across disparate collections of datasets, called data silos. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different sources demand a lot of manua
...
Contributed
14 records found
Dealing with conflicting trains
Effectively avoiding and resolving conflicts during shunting
A shunting yard is used to store trains between arrival and departure. A conflict arises in a shunting yard when one train obstructs another from leaving. Resolving a conflict is done by re-allocating the trains obstructing the departing train to other tracks in the shunting yard
...
Comparing planners for rail planning in PDDL
How multiple shunting yard layouts can be created in PDDL to replicate real-world scenarios
This paper explored the usage of the Planning Domain Definition Language in the Train Unit Shunting Problem or TUSP, an NP-hard problem that occurs in train storage. This paper focuses on the evaluation and improvement of existing planners to solve TUSP with multiple shunting yar
...
Optimizing the PDDL domain of TUSP to improve planner performance
Modifying the domain to improve planner execution time, plan quality, and problem solvability
It is possible to improve the performance of planners by modifying the PDDL domain of a problem. The goal of this research is to implement this to the domain of the Train Unit Shunting Problem (TUSP). The research question we attempt to answer is: To what extent can we improve pl
...
Mixed-direction train shunting with numerical planning
Approach to support train departures at any time during the shunting plan
In this paper, we discuss our novel approach to support mixed-direction shunting in the planning domain for tree-like shunting yards. We explain how we used numeric fluents to improve an existing PDDL domain to support such actions. We elaborate on the underlying conceptual model
...
Enriching Machine Learning Model Metadata
Collecting performance metadata through automatic evaluation
As the sharing of machine learning (ML) models has increased in popularity, more so-called model zoos are created. These repositories facilitate the sharing of models and their metadata, and other people to find and re-use an existing model. However, the metadata provided for mod
...
Adaptable Resource Generation Protocols For Quantum Networks
Reinforcement Learning For Fast Quantum Resource Generation Policies
Quantum networks allow quantum processors to communicate over large distances. These networks often require simultaneously existing multiple entangled pairs of quantum bits (entangled links) as a fundamental resource for communication. Link generation is a sequential and probabil
...
Adaptive Algorithm for Resource Generation in a Quantum Network
Using a Markov Decision Process Approach to Optimize the Quantum Resource Generation Algorithm
Entangled links can be seen as a connection between two parties that persists remotely, but whose quality (fidelity) decreases over time. Many quantum applications rely on having a certain number of simultaneously active entangled links for their execution. Every application has
...
Time's Up!
Robust Watermarking in Large Language Models for Time Series Generation
The advent of pretrained probabilistic time series foundation models has significantly advanced the field of time series forecasting. Despite these models’ growing popularity, the application of watermarking techniques to them remains underexplored. This paper addresses this rese
...
Time's Up!
Robust Watermarking in Large Language Models for Time Series Generation
The advent of pretrained probabilistic time series foundation models has significantly advanced the field of time series forecasting. Despite these models’ growing popularity, the application of watermarking techniques to them remains underexplored. This paper addresses this rese
...
Optimizing Database Joins
Cost Models and Benchmarking for CPU and GPU Systems
Optimizing SQL query execution through effective cost models is a critical challenge in database management systems (DBMS). This thesis introduces a modular benchmarking system for cost models, with a pluggable architecture for both cost models and execution engines, enabling com
...
Optimizing Database Joins
Cost Models and Benchmarking for CPU and GPU Systems
Optimizing SQL query execution through effective cost models is a critical challenge in database management systems (DBMS). This thesis introduces a modular benchmarking system for cost models, with a pluggable architecture for both cost models and execution engines, enabling com
...
When writing functional code that composes multiple recursive functions that operate on a datastrcuture, we often incur a lot of computational overhead allocating memory, only to later read, use, and discard this information.
This can be alleviated using fusion, a technique that
...
When writing functional code that composes multiple recursive functions that operate on a datastrcuture, we often incur a lot of computational overhead allocating memory, only to later read, use, and discard this information.
This can be alleviated using fusion, a technique that
...
Optimising Adaptive Resource Generation in Near-Term Quantum Networks
A Markov Decision Process Model to Produce an Optimal Resource Generation Policy
A quantum network allows us to connect quantum information processors to achieve capabilities that are not possible using classical computation. Quantum network protocols typically require several entangled states available simultaneously. Previously, an entanglement generation p
...