Audio-visual event detection using duration dependent input output Markov models. M. Naphade, A. Garg, et al. 2001. CBAIVL 2001.