Interpreting hybrid images

By neurophilosophy on July 18, 2007.

How the brain interprets complex visual scenes is an enduring mystery for researchers. This process occurs extremely rapidly - the "meaning" of a scene is interpreted within 1/20th of a second, and, even though the information processed by the brain may be incomplete, the interpretation is usually correct.

Occasionally, however, visual stimuli are open to interpretation. This is the case with ambiguous figures - images which can be interpreted in more than one way. When an ambiguous image is viewed, a single image impinges upon the retina, but higher order processing in the visual cortex leads to a number of different interpretations of that image.

Only one of these interpretations is available to our conscious awareness at any one time. Repeated viewing of the image leads to perceptual reversal, whereby first one, and then the other, interpretation is perceived. For psychologists and neuroscientists, ambiguous figures provide a means by which the functioning of the human visual system can be investigated.

Salvador Dali's 1940 painting Slave Market with the Disappearing Bust of Voltaire (top) is an example of an ambiguous figure. In this painting, the two nuns just left of centre can also be perceived as the bust of the French writer and philosopher Voltaire. When looking at the painting, our perception of the painting switches from one interpretation to the other.

In a study published in 2002, Lizann Bonnar, then at the University of Glasgow, and her colleagues, investigated the stimuli which drive perception of the visual scene depicted in Dali's painting. Participants were presented with a cropped greyscale version of the painting, consisting solely of the area containing the nuns. A "bubble" filter was used to enhance or obscure certain features of that part of the painting. They found that the participants reported seeing the bust of Voltaire when the finer details of the painting were obscured, and reported seeing the nuns when large scale features were obscured.

This experiment showed the importance of scale information in perception. The researchers specifically manipulated the spatial resolution of the painting (that is, the periodicity with which image intensity changes). Large scale features change little over a given distance, and therefore have a low spatial resolution, while fine-grained features change much more over the same distance, and so have a high spatial resolution.

In a second experiment, the participants were shown random noise patterns before the cropped greyscale painting. One group was shown a pattern with a high spatial resolution, the other a pattern with a low spatial resolution. Afterwards, the former reported seeing the bust of Voltaire, while the latter reported seeing the nuns. This showed that previous experience is an important factor in perception. The participants had selectively perceived the frequency channels presented to them before they viewed the image.

Aude Oliva, head of the Computational Visual Cognition Laboratory at the Massachusettes Institute of Technology, has been using a similar approach to gain a better understanding of the processing of information in the visual cortex.

For more than 10 years, Oliva and her colleagues have been creating and using hybrid images that consist of two superimposed images, both of which have been altered with specialized filtering software.

Using these filters, sharp facial features, such as wrinkles and other blemishes, are removed from one image, and coarse features, such as the shape of the mouth or nose, are removed from the other. The two images are then superimposed; because features with a high spatial frequency are visible only from up close, and those with low spatial frequencies are only visible from further away, superimposition of the two produces a single image whose perception changes as a function of viewing distance.

Thus, the hybrid is a single image with two stable percepts; at a given distance, only one of the images is visible, and it is this image that dominates processing in the visual system; the other image is perceived as something lacking internal organization (noise).

Above is an example of the hybrid images created by Oliva's group. From up close, the image is perceived as Albert Einstein, because only the sharp features are visible; but if you step a few metres away from the monitor, the blurred features become visible, and the image of Marilyn Monroe emerges.

Oliva's group has been using this and similar images to investigate the role of different frequency channels for image recognition, and the time course over which this process occurs. What they have found is that when participants are shown hybrid images for durations of 30 milliseconds, they only recognized the low spatial resolution component of the image; when the images were displayed for 150 milliseconds, they only recognized the high spatial resolution component; In both cases, the participants were oblivious to the other interpretation of the image.

Participants were also shown hybrid images consisting of sad and angry faces (high and low spatial resolution, respectively) of superimposed male and female faces. When the images were displayed for 50 milliseconds, and the participants were asked to determine the emotion of the face they had seen, they always reported seeing an angry face; but when asked to determine the sex of the person in the image, they reported seeing a male as often as they reported seeing a female, although the two faces had different spatial resolutions.

Thus, selection of frequency bands during fast image recognition appears to be flexible - in some cases, the brain picks out characteristics with a low spatial resolution, while in others, it discriminates those with a high resolution. It seems that the brain is adept at selecting the frequncy band containing the most information relevant to a particular task. Again, the participants were unaware that the images they viewed contained information in the other frequency range.

The work carried out by Oliva's group shows that the brain extracts large-scale features slightly earlier than fine-grained features. Large scale features are processed within 50 milliseconds, giving an overall impression of the visual scene. The processing of fine-grained details begins slightly later, at around 100 milliseconds. The fine- and coarse-grained features are extracted separately, and processed in parallel through different channels, in successively higher order areas of the visual cortex. In a process called perceptual grouping, the information from the channels is then seamlessly recombined at visual cortical areas of the highest order to produce a coherent, and usually unambiguous, image.

More like this

best for you, http://www.crunchyroll.com/user/bbwsex Cheapest bbw sex, =-((,

Very nice.

I will note that squinting is sufficient to bring out the face of Marilyn Monroe in the above image. The effect is particularly striking around the eyes.

I was also able to bring Marilyn into view by relaxing the focus in my eyes. My brain is now hopelessly twisted. A great post -- thanks!

I saw Abraham Lincoln in the Einstein picture.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

The Hidden Burnout Crisis Facing SEO Social Media Marketers

More by this author

Neurophilosophy now hosted by The Guardian

August 11, 2011

AFTER four years here at ScienceBlogs.com, Neurophilosophy is moving to a new home. As of today, it will be hosted by The Guardian. During its time here, the blog has grown from strength to strength. It has received over 2.5 million page views, was featured regularly on the New York Times science…

Human echolocation activates visual parts of the brain

May 25, 2011

WE all know that bats and dolphins use echolocation to navigate, by producing high frequency bursts of clicks and interpreting the sound waves that bounce off objects in their surroundings. Less well known is that humans can also learn to echolocate. With enough training, people can use this…

A whiff of early brain evolution

May 19, 2011

Skull of Hadrocodium wui. (Image courtesy of Mark Klinger and Zhe-Xi Luo, Carnegie Museum of Natural History) THE question of how mammals evolved their exceptionally large brains has intrigued researchers for years, and although many ideas have been put forward, none has provided a clear answer.…

Sleepy brain waves predict dream recall

May 10, 2011

THE patterns of brain waves that occur during sleep can predict the likelihood that dreams will be successfully recalled upon waking up, according to a new study published in the Journal of Neuroscience. The research provides the first evidence of a 'signature' pattern of brain activity …

US military planned using spy crows to find Osama bin Laden

May 8, 2011

THE United States military funded research into using networks of 'spy crows' to locate soldiers who are missing in action, and extended the work to see if the birds might be useful in helping them to find Osama bin Laden. The idea may seem far-fetched, but unlike some…