Searched for: +
(1 - 5 of 5)
document
Chua, Victoria Y.H. (author), Liu, Hexin (author), Perera, Leibny Paola Garcia (author), Woon, Fei Ting (author), Wong, Jinyi (author), Zhang, X. (author), Khudanpur, Sanjeev (author), Khong, Andy W.H. (author), Dauwels, J.H.G. (author), Styles, Suzy J. (author)
To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns. We present the MERLIon CCS challenge, featuring a...
journal article 2023
document
Zhang, Rongkai (author), Zhang, Cong (author), Cao, Zhiguang (author), Song, Wen (author), Tan, Puay Siew (author), Zhang, Jie (author), Wen, Bihan (author), Dauwels, J.H.G. (author)
We propose a manager-worker framework (the implementation of our model is publically available at: https://github.com/zcaicaros/manager-worker-mtsptwr) based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP), i.e. multiple-vehicle TSP with time window and rejections (mTSPTWR), where...
journal article 2023
document
Yao, Y. (author), Zhang, Q. (author), HU, Y. (author), Meo, C. (author), Wang, Y. (author), Nanetti, Andrea (author), Dauwels, J.H.G. (author)
We typically search for images by keywords, e.g., when looking for images of apples, we would enter the word “apple” as query. However, there are limitations. For example, if users input keywords in a specific language, then they may miss results labeled in other languages. Moreover, users may have an image of the object they want to obtain more...
conference paper 2022
document
Liu, Hexin (author), Perera, Leibny Paola Garcia (author), Zhang, Xinyi (author), Dauwels, J.H.G. (author), Khong, Andy W.H. (author), Khudanpur, Sanjeev (author), Styles, Suzy J. (author)
We propose two end-to-end neural configurations for language diarization on bilingual code-switching speech. The first, a BLSTM-E2E architecture, includes a set of stacked bidirectional LSTMs to compute embeddings and incorporates the deep clustering loss to enforce grouping of languages belonging to the same class. The second, an XSA-E2E...
conference paper 2021
document
Zhang, Rongkai (author), Zhu, Jiang (author), Zha, Zhiyuan (author), Dauwels, J.H.G. (author), Wen, Bihan (author)
State-of-the-art image denoisers exploit various types of deep neural networks via deterministic training. Alternatively, very recent works utilize deep reinforcement learning for restoring images with diverse or unknown corruptions. Though deep reinforcement learning can generate effective policy networks for operator selection or architecture...
conference paper 2021
Searched for: +
(1 - 5 of 5)