Sound and Visual Synchronization

Check out this really interesting study over at Cognitive Daily, which explores the differences in acoustic and visual processing times. The authors of the study used a very elegant, simple protocol to demonstrate how accurate people are at reporting synchrony and "dis-synchrony."

One side note was that raw auditory processing times are faster than visual processing times. This may have to do with the levels and depth of processing that visual stimuli undergo, and the amount of information (color, depth, size, position, movement, distance, etc.) that must be integrated into a coherent "picture" of the world. I would bet that when an auditory signal carries more information (such as speech or music) the depth of processing also increases, and reaction times are a bit longer. Interestingly, auditory conduction velocity in humans appears to be quite a bit slower than in sonar-dependent dolphins, where huge auditory fibers appear to have evolved for especially rapid conduction (1).

(1) Ridgway, S. H., Bullock, T. H., Carder, D. A., Seeley, R. L., and Galambos, R. Auditory brainstem response in dolphins. Proceedings of the National Academy of Sciences, 1981, 78: 1943-47.

This is completely anecdotal and unscientific, but my experience in doing VAPP (video-audio post production) work was that I needed a two-frame (about 67 ms) offset between sound and picture to be really certain that the sync was off, at least for typical match-the-lips-to-the-words stuff. For something more "hard-edged", like a drummer hitting a cymbal, I could spot a sync offset of about 1-1.5 frames.

Better engineers than I could spot a 1-frame offset on any program material.

It's way easier to detect a problem with synchronization between two audio recordings than it is for an audio recording and a video recording. With two audio tapes, the drums will start to flam obviously at no more than 10 ms of offset.
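
For anyone who wants to convert the frame counts above into milliseconds, here is a minimal sketch. It assumes a nominal frame rate of 30 fps, which is the rate implied by "two frames is about 67 ms"; the function name `frames_to_ms` is made up for illustration, and other frame rates will give different numbers.

```python
def frames_to_ms(frames, fps=30.0):
    """Convert a sync offset in video frames to milliseconds.

    fps defaults to 30.0, an assumed nominal rate; pass the real
    frame rate of the material if it differs.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frames * 1000.0 / fps


# Offsets mentioned in the comment: 1, 1.5, and 2 frames.
for frames in (1, 1.5, 2):
    print(f"{frames} frame(s) at 30 fps = {frames_to_ms(frames):.1f} ms")
```

Under that 30 fps assumption, a single frame is about 33 ms, so the 10 ms at which two audio recordings start to flam is roughly a third of a frame.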

By Ktesibios (not verified) on 07 Nov 2006