Periodic Reporting for period 1 - TalkingHeads (TalkingHeads: Audiovisual Speech Recognition in-the-wild)

Summary
Audio-visual (AV) Automatic Speech Recognition (ASR) in unconstrained (in-the-wild) videos collected from real-world multimedia databases (outdoor conversation/interviews, TV shows with multiple speakers) using novel deep learning methodologies and architectures.IMPORTANCE FOR...
More information & hyperlinks