Audio-visual algorithms for person tracking and characterization (upgrade)

Summary
Corresponds to all tasks.