Audio-visual framework for generating sound