Search results | TU Delft Repositories

Searched for: +

(1 - 5 of 5)

document: A Mechanically Tunable Quantum Dot in a Graphene Break Junction
Caneva, S. (author), Hermans, Matthijs (author), Lee, M. (author), García-Fuente, Amador (author), Watanabe, Kenji (author), Taniguchi, Takashi (author), Dekker, C. (author), Ferrer, Jaime (author), van der Zant, H.S.J. (author), Gehring, P. (author)
Graphene quantum dots (QDs) are intensively studied as platforms for the next generation of quantum electronic devices. Fine tuning of the transport properties in monolayer graphene QDs, in particular with respect to the independent modulation of the tunnel barrier transparencies, remains challenging and is typically addressed using...
journal article 2020

document: The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results
Chen, Hang (author), Zhou, Hengshun (author), Du, Jun (author), Lee, Chin-Hui (author), Chen, Jingdong (author), Watanabe, Shinji (author), Siniscalchi, Sabato Marco (author), Scharenborg, O.E. (author), Liu, Di-Yuan (author)
In this paper we discuss the rational of the Multi-model Information based Speech Processing (MISP) Challenge, and provide a detailed description of the data recorded, the two evaluation tasks and the corresponding baselines, followed by a summary of submitted systems and evaluation results. The MISP Challenge aims at tack-ling speech processing...
conference paper 2022

document: Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis
Zhou, Hengshun (author), Du, Jun (author), Zou, Gongzhen (author), Nian, Zhaoxu (author), Lee, Chin Hui (author), Siniscalchi, Sabato Marco (author), Watanabe, Shinji (author), Scharenborg, O.E. (author), Chen, Jingdong (author)
In this paper, we describe and release publicly the audio-visual wake word spotting (WWS) database in the MISP2021 Challenge, which covers a range of scenarios of audio and video data collected by near-, mid-, and far-field microphone arrays, and cameras, to create a shared and publicly available database for WWS. The database and the code ...
journal article 2022

document: Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis
Chen, Hang (author), Du, Jun (author), Dai, Yusheng (author), Lee, Chin Hui (author), Siniscalchi, Sabato Marco (author), Watanabe, Shinji (author), Scharenborg, O.E. (author), Chen, Jingdong (author), Yin, Bao Cai (author), Pan, Jia (author)
In this paper, we present the updated Audio-Visual Speech Recognition (AVSR) corpus of MISP2021 challenge, a large-scale audio-visual Chinese conversational corpus consisting of 141h audio and video data collected by far/middle/near microphones and far/middle cameras in 34 real-home TV rooms. To our best knowledge, our corpus is the first...
journal article 2022

document: The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition
Wang, Zhe (author), Wu, Shilong (author), Chen, Hang (author), He, Mao-Kui (author), Du, Jun (author), Lee, Chin-Hui (author), Chen, Jingdong (author), Watanabe, Shinji (author), Siniscalchi, Sabato Marco (author), Scharenborg, O.E. (author), Liu, Diyuan (author)
The Multi-modal Information based Speech Processing (MISP) challenge aims to extend the application of signal processing technology in specific scenarios by promoting the research into wake-up words, speaker diarization, speech recognition, and other technologies. The MISP2022 challenge has two tracks: 1) audio-visual speaker diarization (AVSD),...
conference paper 2023

Searched for: +

(1 - 5 of 5)