Authored

10 records found

Corrnet

Fine-grained emotion recognition for video watching using wearable physiological sensors

Recognizing user emotions while they watch short-form videos anytime and anywhere is essential for facilitating video content customization and personalization. However, most works either classify a single emotion per video stimuli, or are restricted to static, desktop environmen ...

Towards user-oriented privacy for recommender system data

A personalization-based approach to gender obfuscation for user profiles

In this paper, we propose a new privacy solution for the data used to train a recommender system, i.e., the user–item matrix. The user–item matrix contains implicit information, which can be inferred using a classifier, leading to potential privacy violations. Our solution, calle ...

Joint Feature Synthesis and Embedding

Adversarial Cross-Modal Retrieval Revisited

Recently, generative adversarial network (GAN) has shown its strong ability on modeling data distribution via adversarial learning. Cross-modal GAN, which attempts to utilize the power of GAN to model the cross-modal joint distribution and to learn compatible cross-modal features ...

Leave No User Behind

Towards Improving the Utility of Recommender Systems for Non-mainstream Users

In a collaborative-filtering recommendation scenario, biases in the data will likely propagate in the learned recommendations. In this paper we focus on the so-called mainstream bias: the tendency of a recommender system to provide better recommendations to users who have a mains ...

From Deterministic to Generative

Multimodal Stochastic RNNs for Video Captioning

Video captioning, in essential, is a complex natural process, which is affected by various uncertainties stemming from video content, subjective judgment, and so on. In this paper, we build on the recent progress in using encoder-decoder framework for video captioning and address ...

Towards Seed-Free Music Playlist Generation

Enhancing collaborative Filtering with Playlist Title Information

In this paper, we propose a hybrid Neural Collaborative Filtering (NCF) model trained with a multi-objective function to achieve a music playlist generation system. The proposed approach focuses particularly on the cold-start problem (playlists with no seed tracks) and uses a tex ...

Statistical Significance Testing in Information Retrieval

An Empirical Analysis of Type I, Type II and Type III Errors

Statistical significance testing is widely accepted as a means to assess how well a difference in effectiveness reflects an actual difference between systems, as opposed to random noise because of the selection of topics. According to recent surveys on SIGIR, CIKM, ECIR and TOIS ...

The influence of personal values on music taste

Towards value-based music recommendations

The feld of recommender systems has a lot to gain from the feld of psychology. Indeed, many psychology researchers have investigated relations between models that describe humans and consumption preferences. One example of this is personality, which has been shown to be a valid c ...

Are Nearby Neighbors Relatives?

Testing Deep Music Embeddings

Deep neural networks have frequently been used to directly learn representations useful for a given task from raw input data. In terms of overall performance metrics, machine learning solutions employing deep representations frequently have been reported to greatly outperform tho ...

S2IGAN

Speech-to-Image Generation via Adversarial Learning

An estimated half of the world’s languages do not have a written form, making it impossible for these languages to benefit from any existing text-based technologies. In this paper, a speech-to-image generation (S2IG) framework is proposed which translates speech descriptions to p ...

Contributed

10 records found

Side-Channel Attacks using Convolutional Neural Networks

A Study on the performance of Convolutional Neural Networks on side-channel data

Side-Channel Attacks, are a prominent type of attacks, used to break cryptographic implementations on a computing system. They are based on information "leaked" by the hardware of a computing system, rather than the encryption algorithm itself. Recent studies showed that Side-Cha ...
Images possess the ability to convey a wide range of emotions, and extracting affective information from images is crucial for affect prediction systems. This process can be achieved through the application of machine learning algorithms. Categorical Emotion States (CES) and Dime ...

Live streaming via WiFi

Monitoring premature babies

This report describes the implementation of a custom streaming solution from an IP camera to a web browser. The system aims to both provide live video and Video on Demand. This will be used to monitor premature babies in incubators.

HoloNav: HoloLens as a Surgical Navigation System

Detecting optical reflective spheres using YOLOv5 and the Hololens' grayscale cameras

Surgical navigation is a tool that surgeons rely on everyday to perform accurate surgeries all over the world. However, this technology requires good hand-eye coordination and a high level of concentration. HoloNav is a project that inquires to see if using the HoloLens and augme ...

Recommender Systems with Evolutionary Algorithms: Many­-Objective Optimization for Large­-Scale Music Recommendation

A demonstration showing the reliability of serving users recommendations with trade-­off for large music collections, by leveraging diverse Recommender Systems and Evolutionary Algorithms

Using Recommender Systems with Evolutionary Algorithms is an extremely niche domain. It holds the key to enabling new user interaction designs, where users can effectively configure their experience with a Recommender System. This thesis answers important questions about the scie ...

More than a feeling?

Reliability and robustness of high-level music classifiers

High-level music classification tasks such as automatic music mood annotation impose several challenges, both from a psychological and a machine learning point of view. Ground truth labels for these tasks at hand are hard to define due to the abstract and aesthetic nature of the ...

DeepSleep

A sensor-agnostic approach towards modelling a sleep classification system

Sleep is a natural state of our mind and body during which our muscles heal and our memories are consolidated. It is such a habitual phenomenon that we have been viewing it as another ordinary task in our day-to-day life. However, owing to the current fast-paced, technology-drive ...

Music in Use

Novel perspectives on content-based music Retrieval

Music consumption has skyrocketed in the past few years with advancements in internet and streaming technologies. This has resulted in the rapid development of the inter-disciplinary field of Music Information Retrieval (MIR), which develops automatic methods to efficiently and e ...
Machine learning (ML) has become a core technology for many real-world applications. Modern ML models are applied to unprecedentedly complex and difficult challenges, including very large and subjective problems. For instance, applications towards multimedia understanding have be ...

On Fine-grained Temporal Emotion Recognition in Video

How to Trade off Recognition Accuracy with Annotation Complexity?

Fine-grained emotion recognition is the process of automatically identifying the emotions of users at a fine granularity level, typically in the time intervals of 0.5s to 4s according to the expected duration of emotions. Previous work mainly focused on developing algorithms to r ...