Google in Your Brain? PageRank As a Semantic Memory Model

By developinginte… on November 29, 2007.

The world wide web can be understood as a giant matrix of associations (links) between various nodes (web pages). At an abstract level, this is similar to human memory, consisting of a matrix of associations (learned relationships, or neuronal connections) between various nodes (memories, or the distributed representations constituting them). In the new issue of Psych. Science, Griffiths et al. ask whether Google's famously accurate and fast PageRank algorithm for internet search might behave similarly to the brain's algorithm - whatever that might be - for searching human memory.

About PageRank

The PageRank algorithm is based on the assumption that the most important nodes in a network contain a large number of associations with other nodes, which themselves contain a large number of associations with other nodes, which themselves... and so on. This "recursive definition of importance" is formalized in Google's algorithm to efficiently calculate the rankings of different web pages, and to return those web pages which are mostly highly ranked that also fit a certain search term.

Search in Human Memory

One way of graphing the associative structure of human memory is simply to ask human subjects to generate words which are strongly associated with other words. Averaged across many subjects, the frequency of those generated words reflects the "associate frequency" of the words in human memory. You might think of this result as "MemoryRank" instead of PageRank.

How well does PageRank account for human memory?

Griffiths et al note one critical difference between PageRank and the "associate frequency" measure of human memory: the latter doesn't account for the fact that some cues are strongly associated with more words than others. This is captured by PageRank's more recursive definition of importance.

To evaluate which ranking scheme better predicts human data, the two methods were used on a large set of verbal associations, all generated by humans in response to each of over 5,000 words. The result of this process was two ranks for each word in the set - one generated according to PageRank and one according to associate frequency. This list was then culled to include only those words generated by a set of 50 adults, each of whom had been asked to generate the first word that came to mind in response to each letter of the alphabet (excluding 5 low-frequency letters).

If PageRank or associate frequency were perfect models of human memory, then the human data should be completely predictable: humans should always pick those words which have the highest rank and start with the desired letter.

The result:

"PageRank outperformed both associate frequency and word frequency as a predictor" of those words generated by humans in response to each letter of the alphabet. And this wasn't due merely to the training set - Griffiths et al. manipulated the training set in various ways, and in all cases, PageRank came out on top (relative to associate frequency and word frequency).

What does this mean?

It turns out that PageRank is mathematically equivalent to a large number of other formalisms that are used in cognitive science. For example, severely limited connectionist networks (limited insofar as connection weights are equalized across all projections from a certain node) are mathematically equivalent to PageRank: the activation in such a network should ultimately settle on those nodes in proportion to their PageRank. Likewise, PageRank can also be considered an estimate of "priors" in a Bayesian network (with some simplifying assumptions about likelihood).

So Google's PageRank may accomplish network search in ways that can also be implemented in other frameworks widely used in cognitive psychology. However, PageRank (at least as it is known in the public domain) makes the strongly simplifying assumption that all associations from a particular node equally contribute to the importance of each of the connected nodes.

Although this assumption may be necessary for Google's purposes, it is extremely clear that no such limitation exists in the brain. After all, the most widely recognized algorithm for neural computation - Hebbian learning - works precisely because it modifies the relative weights of one node to another independently from the weights of that node to all other nodes.

Is Google in my brain?

No one is suggesting that Larry Page has discovered the secret to the organization of human memory. In fact, it's clear that some of PageRank's (public) assumptions about the structure of networks do not hold - for example, the idea that the importance of a single node is distributed equally through all its connections. Much better models of verbal processing abound in cognitive psychology (see, for example, LSA). Still, Griffiths et al. compellingly demonstrate that the advantageous qualities of PageRank do indeed generalize from the world wide web to the semantic networks present in the brain.

More like this

Google Predicts Memory, and Probably Everything Else

There's a paper in the December 2007 issue of Psychological Science titled "Google and the Mind: Predicting Fluency With PageRank." Here's the abstract: Griffiths, T.L., Steyvers, M., & Firl, A. (2007). Google and the mind: Predicting fluency with PageRank. Psychological Science, 18(12), 1069-…

Uncertainty Reduction: Ambiguity Resolution Mechanisms in Language

Ambiguity is a constant problem for any embodied cognitive agent with limited resources. Decisions need to be made, and their consequences understood, despite the probabilistic veil of uncertainty enveloping everything from sensory input to action execution. Clearly, there must be mechanisms for…

Strategies In Memory: Temporal Dissociations in Prefrontal Activity In Long- & Short-term Memory

Early neuropsychology research indicated that long-term memory and short-term memory were separable - in other words, long-term memory could be impaired by damage to the hippocampus without any corresponding deficits in short-term memory. However, this idea has come under scrutiny in recent years…

10 Important Differences Between Brains and Computers

"A good metaphor is something even the police should keep an eye on." - G.C. Lichtenberg Although the brain-computer metaphor has served cognitive psychology well, research in cognitive neuroscience has revealed many important differences between brains and computers. Appreciating these…

I think that to get the personal loans from creditors you should have a firm motivation. Nevertheless, one time I have received a commercial loan, just because I wanted to buy a house.

Me agrada la noticia de que empresas como Developing Intellig luchen por expandirse por nuestro paÃs, EspaÃ±a. Espero que tengan una gran evoluciÃ³n y apuesto que asÃ sera, el trabajo bien organizado y solido, que dan sus frutos. Invito y animo a que mÃ¡s empresas pierdan el miedo a expandirse por nuestro planeta.

Patricia Gonzalez Vargas
http://www.hotsale.es
Centro comercial online

It's understandable that money can make us autonomous. But what to do when someone does not have cash? The one way only is to try to get the credit loans and just financial loan.

I will recommend not to hold off until you get enough money to buy all you need! You can take the mortgage loans or credit loan and feel yourself free

Google always in my brain. Google is interesting and challenging. It is important to have a page rank because most of the people look on the page rank of the site. For me page rank is just like a reputation of the site.

That's a great article. I've been computing Wikipedia's PageRank as part of my Information Retrieval course. I've got the Irish Wikipedia going right now, in fact. I'm doing it in this language because, due to its size, it seems impossible to compute the English Wikipedia's PR on a standalone machine (e.g., I'd have to write a distributed version.)

Back to the brain, the English Wikipedia has 2 million vertices (articles) and 63 million edges (links). The complexity of computing the PageRank of a graph is O(|H|log(1/e)) floating point operations, where H is the number of edges and e is the precision required, usually taken to be 10-8. The complexity is independent of the number of vertices (Bianchini et al., 2005). This means that I can use a theoretical algorithm to compute the PR of Wikipedia on IBM's BlueGene/P, which can compute .5 quadrillion floating point operations per second, in 1/106 seconds. (it would actually be much faster than this because the distributed versions have decent speedup).

Back to the brain (*ahem*), computing the PR of a semantic network is cute and all, but what about the whole brain? I'll punt on that since I've got good numbers for just cortex: It's O(.15 quadrillion*log(1/10-8)), which is just larger than a quadrillion, which will only take a couple of seconds. Of course, our algorithms aren't perfect - if they were I could crunch these numbers on my desktop in a limited amount of time. But we can make up for it in hardware - BlueBrain anticipates a human scale brain simulation (without relevant connectivity - just scale) within the next ten years. Did I mention that backprop has complexity O(n)? Of course, they are using fancy pants differential equations :)

This is really interesting. I am getting lots of fluctuation on my different sites, and some of the pages I am linking to are claiming high PR, when the current Google PR is low or zero. Thanks for the post.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

EPA Reconsiders Its Biden Ban On Asbestos Everywhere

More by this author

Performance Improves with Transcranial Random Noise Stimulation

November 21, 2011

Stimulating the brain with high frequency electrical noise can supersede the beneficial effects observed from transcranial direct current stimulation, either anodal or cathodal (as well as those observed from sham stimulation), in perceptual learning, as newly reported by Fertonani, Pirully &…

Attractors All the Way Up: Metastability, Rostrocaudal Hierarchies, and Synaptic Facilitation

November 18, 2011

In their wonderful Neuroimage article, Braun & Mattia present a comprehensive introduction to the possible neuronal implementations and cognitive sequelae of a particular dynamical phenomenon: the attractor state. In another excellent paper, just recently out in Frontiers, Itskov, Hansel and…

Architecture of the VLPFC and its Monkey/Human Mapping

November 17, 2011

If you ever said to yourself, "I wonder whether the human mid- and posterior ventrolateral prefrontal cortex has a homologue in the monkey, and what features of its cytoarchitecture or subcortical connectivity may differentiate it from other regions of PFC" then this post is for you. Otherwise,…

Modus Tollens, Modus Shmollens! When people commit a fallacy so absurd that it's only recently been given a name.

November 16, 2011

Suppose - rather reasonably - that soups which taste like garlic have garlic in them. You observe two people eating soup; one of them says to the other, "There is no garlic in this soup." Do you think it's likely that the soup taste like garlic? If you said yes, then congratulations! You've just…

Greater Performance Improvements When Quick Responses Are Rewarded More Than Accuracy Itself.

November 8, 2011

Last month's Frontiers in Psychology contains a fascinating study by Dambacher, HuÌbner, and SchlÃ¶sser in which the authors demonstrate that the promise of financial reward can actually reduce performance when rewards are given for high accuracy. Counterintuitively, performance (characterized as…