Search results | TU Delft Repositories

Searched for: +

(1 - 5 of 5)

document: MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Chua, Victoria Y.H. (author), Liu, Hexin (author), Perera, Leibny Paola Garcia (author), Woon, Fei Ting (author), Wong, Jinyi (author), Zhang, X. (author), Khudanpur, Sanjeev (author), Khong, Andy W.H. (author), Dauwels, J.H.G. (author), Styles, Suzy J. (author)
To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns. We present the MERLIon CCS challenge, featuring a...
journal article 2023

document: Learning to Solve Multiple-TSP With Time Window and Rejections via Deep Reinforcement Learning
Zhang, Rongkai (author), Zhang, Cong (author), Cao, Zhiguang (author), Song, Wen (author), Tan, Puay Siew (author), Zhang, Jie (author), Wen, Bihan (author), Dauwels, J.H.G. (author)
We propose a manager-worker framework (the implementation of our model is publically available at: https://github.com/zcaicaros/manager-worker-mtsptwr) based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP), i.e. multiple-vehicle TSP with time window and rejections (mTSPTWR), where...
journal article 2023

document: Image Search Engine by Deep Neural Networks
Yao, Y. (author), Zhang, Q. (author), HU, Y. (author), Meo, C. (author), Wang, Y. (author), Nanetti, Andrea (author), Dauwels, J.H.G. (author)
We typically search for images by keywords, e.g., when looking for images of apples, we would enter the word “apple” as query. However, there are limitations. For example, if users input keywords in a specific language, then they may miss results labeled in other languages. Moreover, users may have an image of the object they want to obtain more...
conference paper 2022

document: End-to-end language diarization for bilingual code-switching speech
Liu, Hexin (author), Perera, Leibny Paola Garcia (author), Zhang, Xinyi (author), Dauwels, J.H.G. (author), Khong, Andy W.H. (author), Khudanpur, Sanjeev (author), Styles, Suzy J. (author)
We propose two end-to-end neural configurations for language diarization on bilingual code-switching speech. The first, a BLSTM-E2E architecture, includes a set of stacked bidirectional LSTMs to compute embeddings and incorporates the deep clustering loss to enforce grouping of languages belonging to the same class. The second, an XSA-E2E...
conference paper 2021

document: R3L: Connecting Deep Reinforcement Learning To Recurrent Neural Networks For Image Denoising Via Residual Recovery
Zhang, Rongkai (author), Zhu, Jiang (author), Zha, Zhiyuan (author), Dauwels, J.H.G. (author), Wen, Bihan (author)
State-of-the-art image denoisers exploit various types of deep neural networks via deterministic training. Alternatively, very recent works utilize deep reinforcement learning for restoring images with diverse or unknown corruptions. Though deep reinforcement learning can generate effective policy networks for operator selection or architecture...
conference paper 2021

Searched for: +

(1 - 5 of 5)