Notebooks on Language: Kuperberg: "Neural mechanisms of language comprehension" (2006)

Wednesday, June 12, 2013

Kuperberg: "Neural mechanisms of language comprehension" (2006)

Suppose I put you in front of a computer screen that flashes a word every second:

The … cheese … was … eaten … by …

While you're reading, I record the electrical activity off the scalp of your head with an EEG scanner. Possibly, I also ask you to do something with the sentence when you're done reading, like judge its plausibility, or answer a question about it.

Once you've gotten used to this task, I perform a manipulation: Without warning, I insert a weird or unexpected word:

The … cheese … was … eaten … by … the … cat …

When this happens, you'll obviously have to work harder than usual to make sense of this unexpected stimulus. This means more brain activity, and more brain activity means more electrical charge.

Typical Responses: The N400 and the P600

There are, in particular, two specific ways in which the electrical activity at your scalp changes measurably when this happens: You may exhibit an excess of negative electrical potential about 400 milliseconds after the unexpected word, or an excess of positive electrical potential about 600 milliseconds after it.

These two events are called the N400 and the P600. They can occur together, separately, or not at all.
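
To make the measurement concrete, here is a minimal sketch of how such an effect can be quantified: take the mean voltage in a time window after the word, and subtract the same measure from a baseline condition. Everything in it is an assumption for illustration. The waveforms are synthetic, and the sampling rate, the window boundaries (300 to 500 ms and 500 to 800 ms), and the function names are made up for this example rather than taken from any of the papers discussed here. Real studies also average over many trials and electrodes, which this toy version skips.

```python
import math

SAMPLE_RATE_HZ = 250   # assumed sampling rate
N_SAMPLES = 250        # one second of signal after word onset


def gaussian_bump(t_ms, peak_ms, width_ms, amplitude_uv):
    """A smooth deflection centered at peak_ms (purely synthetic)."""
    return amplitude_uv * math.exp(-((t_ms - peak_ms) ** 2) / (2 * width_ms ** 2))


def synthetic_trial(n400_uv=0.0, p600_uv=0.0):
    """Return a made-up single-channel waveform in microvolts.

    n400_uv is the size of a negative bump near 400 ms,
    p600_uv the size of a positive bump near 600 ms.
    """
    wave = []
    for i in range(N_SAMPLES):
        t_ms = 1000.0 * i / SAMPLE_RATE_HZ
        wave.append(
            gaussian_bump(t_ms, 400, 50, -n400_uv)
            + gaussian_bump(t_ms, 600, 80, p600_uv)
        )
    return wave


def mean_amplitude(wave, start_ms, end_ms):
    """Mean voltage between start_ms and end_ms after word onset."""
    start = int(start_ms * SAMPLE_RATE_HZ / 1000)
    end = int(end_ms * SAMPLE_RATE_HZ / 1000)
    window = wave[start:end]
    return sum(window) / len(window)


def effect(anomalous, baseline, start_ms, end_ms):
    """Difference in mean amplitude: anomalous minus baseline."""
    return mean_amplitude(anomalous, start_ms, end_ms) - mean_amplitude(
        baseline, start_ms, end_ms
    )
```

With these definitions, `effect(synthetic_trial(n400_uv=4.0), synthetic_trial(), 300, 500)` comes out negative, and `effect(synthetic_trial(p600_uv=4.0), synthetic_trial(), 500, 800)` comes out positive, which is all that the labels "N400" and "P600" in the examples below are meant to convey.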

The N400 was first described in 1980 by Marta Kutas and Steven A. Hillyard. They explained it as a kind of "second look" effect and found that it was provoked by "semantic incongruity" (p. 204).

The P600 was described in 1992 by Lee Osterhout and Phillip J. Holcomb. They were explicitly interested in teasing apart syntactic from semantic effects, and they found that the P600 appeared specifically after syntactic anomalies like The librarian trusted to buy the books.

This was great news for the Chomskyan theory of language: At last, solid evidence that semantics and syntax are independent. And what could be more convincing to a linguist than "brain stuff"?

Delineation Problems

But of course, the story is a bit more complicated than that. In a wonderful paper from 2006, Gina R. Kuperberg reviews the large and growing pool of experimental findings related to the N400 and the P600.

Her conclusion is that the two electrical responses are the trace of two different processes, "one that links incoming semantic information with existing information stored in semantic memory, and another that combines relationships between people, objects and actions to construct new meaning" (p. 45).

If we want to evaluate a synthesis like that, we need to keep two separate issues apart: First, can we predict when the two different waveforms will come up? And second, if we can predict this, by what cues?

This dichotomy reflects the familiar problem of, on one hand, assessing whether people have stable intuitions about grammaticality, and, on the other hand, trying to articulate those intuitions in an adequate grammatical formalism. We can get both of these tasks wrong, independently of each other.

So here's what I want to do: I'll just give you a huge list with examples, and then you'll get a sense of where the N400 shows up, and where the P600 shows up. If it looks as if there is a system to this, then we can throw some grammatical vocabulary at this system; but first we need to get a sense of what the system is.

A Bunch of Examples

This section contains all of the examples that Kuperberg cites in her review. They come from a wide range of different sources, so I can't appropriately cite every one of them. I'll just repeat her examples without attribution.

The N400 was originally described as a reaction to semantic anomalies. It shows up in contrastive pairs like the following:

It was his first day at work (baseline)
He spread the warm bread with socks (strong N400)

It shows up strongly when you read sentences that are completely unambiguous as to what they are saying, but just say something really weird:

The honey is being murdered (strong N400)

It's also visible when words are semantically permissible, but less expected:

He mailed the letter without a stamp (baseline)
He mailed the letter without a thought (moderate N400)

In fact, the N400 also appears when a sentence expresses completely legitimate assertions which just happen to be inconsistent with experience:

Dutch trains are white (strong N400; they are in fact yellow)

The P600, on the other hand, was originally described as sensitive to syntactic violations. This conclusion was based on contrasts like the following:

The broker hoped to sell the stock (baseline)
The broker persuaded to sell the stock (strong P600)

Similarly, we find contrasts such as these:

The doctor believed the patient was lying (baseline)
The doctor charged the patient was lying (strong P600)

This morphosyntactic account of the causes of the P600 is also consistent with the fact that it responds to grammatical incongruence and weird word orders:

The spoiled child throw the toys on the floor (strong P600)
The expensive very tulip (strong P600)
Jennifer rode a gray huge elephant (P600; compare huge gray)

Somewhat strangely, though, the predictability of the word that carries the incongruence can affect how strong the P600 effect is:

Sie bereist den Land … (strong P600; Land is expected, but should be das)
Sie befährt den Land … (milder P600)

The P600 can also be provoked by sentences in which the subject and object seem to be swapped or replaced by a wrong word:

Every morning at breakfast the eggs would eat … (P600)
Every morning at breakfast the eggs would plant … (P600)

This contrasts with the N400 effect that is visible when the word is merely unexpected:

Every morning at breakfast the boys would plant … (N400)

A similar example is the following:

The hearty meal was devoured … (baseline)
The hearty meal was devouring … (P600)
The dusty tabletops were devouring … (N400)

Or, again:

Tyler cancelled the subscription (baseline)
Tyler cancelled the birthday (N400)
Tyler cancelled the tongue (N400 + P600)

This also has the consequence that when a cat flees a mouse, or when a javelin throws an athlete, you see a strong P600 effect rather than an N400:

De kat die voor de muizen vluchtte … (P600)
De speer heeft de athleten geworpen … (P600)

However, if the javelin summarizes the athletes, you get both a P600 and an N400 effect:

De speer heeft de athleten opgesomd … (P600)

So it seems that when there is some sort of normal relationship between verb and object, but the sentence expresses the wrong one, both effects occur simultaneously:

The trees that in the park played … (P600 + N400)
The apple that in the tree climbed … (P600 + N400)

This doesn't depend on the distinction between subject and object, as can be seen by using a passive alternation:

To make good documentaries cameras must interview … (P600)
To make good documentaries cameras must be interviewed … (P600)

Another example of this contrast comes up if you let an elephant do various things to a tree: Topple it, prune it, or spoil it like a child:

… dat de olifanten de bomen omduwden … (baseline)
… dat de olifanten de bomen snoeiden … (P600)
… dat de olifanten de bomen verwenden … (N400)

When a detective "stains" a banker instead of interrogating him, both effects also occur:

… dass der Kommissar den Banker abhörte … (baseline)
… dass der Kommissar den Banker abbeizte … (N400 + P600)

Also, when the verb understood gets an inanimate object as its agent, a P600 effect is visible:

At long last, the man's pain was understood by the doctor (baseline)
At long last, the man's pain was understood by the hypochondriac (weak N400)
At long last, the man's pain was understood by the violinist (N400)
At long last, the man's pain was understood by the medicine (strong N400 + P600)
At long last, the man's pain was understood by the pens (strong N400 + P600)

A very nice example that also brings out the nature of the P600 waveform is the following:

The novelist that the movie inspired … (P600)

This sentence is, strictly speaking, perfectly grammatical: A movie can inspire a novelist. However, it's much more probable to hear someone talk about a novel that inspired a movie, and something thus seems to have gone wrong with the sentence.

Interestingly, context can also heavily influence whether something counts as bizarre or as scrambled:

[In a story about traveling] … the woman told the suitcase … (P600)
[In a story about something else] … the woman told the suitcase … (N400)

Misspellings also seem to trigger a P600 effect if the context offers an obvious candidate for the correct word:

In that library the pupils borrow bouks … (P600)
The pillows were stuffed with bouks … (no P600)

I don't know whether such examples also cause an N400 effect, but presumably, they do.

Sense-Making and Decoding

As may be apparent from the way I presented these examples, I'm not quite comfortable with the way that both the traditional accounts and Kuperberg's paper talk about "what the brain does" and "what the P600 picks up on." I think that we can largely make sense of the N400 and the P600 waveforms in terms of what kind of repair practices the sentences suggest.

More precisely, you can react to an unexpected sentence in two ways: Either you can suspect that it was scrambled or otherwise corrupted due to noise, or you can believe that it came over uncorrupted, but just expresses a really weird idea. You might be pushed into the first hypothesis if there is a really obvious nearby expression that makes much more sense, and into the second if there isn't.

Looking at it this way quite accurately explains the differences between the two effects, I think, and essentially doesn't invoke any grammatical notions. Instead, we can think of comprehension as a kind of Bayesian decoding process along the following lines:

1. Recover the intended message from the received codeword.
2. Find a reasonable interpretation of the intended message.

When the received codeword corresponds unproblematically to a message, the first step is fast, and we can proceed directly to the interpretation within the first 500 milliseconds.

If you then afterwards find that the intended message is really weird, you can either go back and check whether the codeword really wasn't corrupted, or you can just work harder trying to interpret it. The first response will yield a P600 effect, and the second an N400.
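
The two-way choice can be sketched as a toy noisy-channel decoder. This is only an illustration of the idea, not a model from Kuperberg's paper: the prior weights, the per-word noise probability, the "weirdness" threshold, and all function names are assumptions invented for the example. The channel is assumed to corrupt a sentence one word edit at a time, so a candidate message that is k word edits away from what was received is penalized by a factor of `noise ** k`.

```python
def word_edit_distance(a, b):
    """Levenshtein distance between two sentences, counted in words."""
    a, b = a.split(), b.split()
    prev = list(range(len(b) + 1))
    for i, wa in enumerate(a, start=1):
        curr = [i]
        for j, wb in enumerate(b, start=1):
            cost = 0 if wa == wb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def posterior(received, priors, noise=0.1):
    """P(message | received) under the toy channel described above.

    priors maps candidate messages to unnormalized plausibility weights.
    """
    scores = {
        msg: weight * noise ** word_edit_distance(msg, received)
        for msg, weight in priors.items()
    }
    total = sum(scores.values())
    return {msg: score / total for msg, score in scores.items()}


def predicted_response(received, priors, weird_below=0.01):
    """Map the decoder's state onto the two repair strategies."""
    post = posterior(received, priors)
    best = max(post, key=post.get)
    if best != received:
        # A nearby message is a better explanation than the literal one.
        return "P600-like: suspect corruption, re-check the codeword"
    if priors.get(received, 0.0) < weird_below:
        # The literal reading wins, but it is an implausible thing to say.
        return "N400-like: accept the codeword, work harder to interpret it"
    return "baseline"


# Made-up plausibility weights for two of the examples above.
MEAL = {
    "the hearty meal was devoured": 0.2,
    "the hearty meal was devouring": 0.0001,
}
TABLETOPS = {
    "the dusty tabletops were devoured": 0.0005,
    "the dusty tabletops were devouring": 0.0001,
}

print(predicted_response("the hearty meal was devoured", MEAL))
print(predicted_response("the hearty meal was devouring", MEAL))
print(predicted_response("the dusty tabletops were devouring", TABLETOPS))
```

With these hand-picked weights, the three calls reproduce the baseline, P600, and N400 labels from the "hearty meal" examples. The outcome depends entirely on the numbers chosen, so the sketch shows only that the story is coherent, not that it is correct.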

An advantage of this story is also that it accounts for the strange fact that "semantics" appears to be processed before "syntax." The order of the N400 and the P600 may be due to the fact that reconstructing a likely message (like so many backwards-reasoning tasks) is much more computationally expensive than interpreting one. Kuperberg also hints at this possibility by attributing the P600 to "combinatory processing."

Of course, I also like this story because it doesn't drive a wedge between syntax and semantics when there doesn't have to be one. But you can disagree with me on that.
