Visually based speech onset/ofset detection

More Info
expand_more