Dual-Coding,
Context-Availability, and Concreteness Effects in Sentence Comprehension: An
Electrophysiological Investigation
Phillip J. Holcomb
Tufts
University
John Kounios
University
of Pennsylvania
Jane E. Anderson
Harvard
Medical School
W. Caroline West
MGH
Event-related potentials were
recorded in two experiments while participants read sentences in a word-by-word
congruency judgment task. Sentence final words were either congruent,
semantically anomalous (Experiments 1 and 2) or neutral (Experiment 2) with
respect to sentence context. Half of all final words referred to concrete and
half to abstract concepts. A different scalp distribution of the N400 to
concrete and abstract final words was found for anomalous and neutral, but not
congruent sentences. While the interaction of context and concreteness is
consistent with context-availability model, the differential scalp distribution
of effects for concrete and abstract words, as well as larger context effects
for concrete words was interpreted as being more consistent with an extended
dual-code account of semantic processing.
Theories of how knowledge is stored and processed in the brain have
generally fallen into one of two camps.
The first proposes that all meanings for objects, events, and concepts
are stored and processed by a common a-modal semantic system (e.g., Caramazza,
Hillis, Rapp, & Romani, 1990; Gernsbacher, 1985; Kroll & Potter, 1984; Pylyshyn, 1984). Conversely, the second
class of theories posits that multiple semantic systems independently store and
process semantic information, often redundantly (e.g., Paivio, 1971, 1986,
1991; Shallice, 1988, 1993).
A salient example of this
distinction, and the focus of the present study, has been the debate over the
origin of concreteness effects, which is the observation that words
representing concrete concepts (e.g., table) are processed more quickly and
accurately than words representing comparatively abstract concepts (e.g.,
aptitude). Unfortunately, in spite of
the relative consistency of the experimental findings, there has been
considerable disagreement concerning the source of concreteness effects, the
two major theoretical contenders being dual-coding theory and the context-availability
model (see reviews by Paivio, 1991; Schwanenflugel, 1991).
Dual-coding theory
Dual-coding theory (Paivio, 1986, 1991)
explains concreteness effects by recourse to modality-specific systems for
representation and processing.
According to this theory, a variant of the multiple semantic-systems
view, concrete words are associated with information stored in both a verbal
“linguist” semantic system as well as a nonverbal “imagistic” semantic system.
Abstract words, however, are associated primarily with information stored in
the linguistic system. When one
encounters a concrete word it initially activates linguistic information, but
shortly thereafter it also begins to activate imagistic information via
referential links that interconnect the linguistic and image systems. Abstract words, on the other hand, lack or
have many fewer referential connections between systems and predominantly
activate linguistic representations. Concrete
words have distinct processing advantages over abstract words because they have
access to information from multiple systems. So, for example, in a lexical
decision task participants can classify “hand” as a word faster than
“idea” because “hand” is processed and
represented in both systems while “idea” is processed/represented only in the
linguistic system. This additional semantic activity from dual systems allows
participants to quickly differentiate concrete words from pseudowords
(pseudowords presumably generate little semantic activation). The relatively
lower semantic activity from a single system makes abstract words more
difficult to differentiate from pseudowords resulting in relatively slower
reaction times (RT).
Context-availability model
In contrast, the
context-availability model (Bransford & McCarrell, 1974; Kieras, 1978)
denies the existence of different types of informational codes or processing
systems as determinants of concreteness effects. This theory, a variation of the single semantic-system view,
argues that comprehension is heavily reliant on context supplied by either the
preceding discourse or the comprehender’s own mental knowledge base (semantic
memory). Concrete words are thought to
be more closely associated to relevant “contextual” knowledge in semantic
memory than are abstract words, because concrete words exhibit stronger or more
extensive associative links to this stored material. However, the underlying
nature of the representations for the two word types and the processes that
operate on these representations do not differ according to this account. So,
participants can classify “hand” as a word faster than “idea” because “hand”
activates more semantic information. Where context-availability differs from
dual-coding is in how and where this additional information is stored and
processed. Context-availability argues for a simpler quantitative difference
between word types within a single system, while dual-coding argues for a
qualitative difference based on activity in different systems.
Numerous studies have sought
to empirically invalidate one or the other of these explanations (Paivio, 1986,
1991; Schwanenflugel, 1991).
Unfortunately, many of these experiments exhibit what hindsight suggests
to be methodological and theoretical limitations. At least in the realm of semantic processing (the focus of this
report), neither view can claim, based on previous findings, to be the complete
explanation of concreteness effects. Below, we briefly discuss some of the
relevant research supporting both accounts. Following this, we describe a
paradigm and methodology designed to circumvent some of the difficulties
inherent in previous studies with the aim of providing data capable of
discriminating between the competing theories.
Context and Concreteness
Schwanenflugel
and her colleagues have been the most outspoken proponents of the
context-availability interpretation of concreteness effects in semantic
processing (e.g., Schwanenflugel, Harnishfeger, & Stowe, 1988;
Schwanenflugel & Shoben, 1983; Schwanenflugel & Stowe, 1989; Schwanenflugel,
1991). They have presented two sources of evidence which they argue favor this
model. First, Schwanenflugel and colleagues (e.g., Schwanenflugel, &
Shoben, 1983; Schwanenflugel, et al., 1988) found that participants’ estimates
of the relative difficulty of retrieving associated contextual information for
isolated abstract and concrete words (context availability ratings) were
correlated with concreteness ratings. Moreover, they found that these context
availability ratings were a better predictor of lexical decision performance
than rated concreteness or imageability. When concrete and abstract words were
equated on this factor, the advantage normally seen for concrete words was no
longer significant. However, one potentially serious problem with these studies
is that it is not clear how participants actually made context-availability
ratings. In particular, the authors of these studies apparently did not check
to see if participants might have sometimes used some type of imagery strategy.
For example, it might be that for concrete words and even some abstract words
that many participants used mental images to help determine how easy or how
many different contexts a word can be used in. Thus partialing out rated
concreteness might have missed an important residual dimension of concreteness
or imagery. To eliminate this
possibility participants’ actual generated contexts would have to be monitored
and controlled for image based intrusions.
In a second series of
experiments Schwanenflugel and colleagues (e.g., Schwanenflugel, Harnishfeger,
& Stowe, 1988; Schwanenflugel & Shoben, 1983; Schwanenflugel &
Stowe, 1989) have more convincingly demonstrated that when sufficient
supportive context is provided, either in the form of several or even a single
prior sentence, concreteness effects on accuracy and RT diminish or even vanish
in a variety of tasks including lexical decision, naming and sentence
meaningfulness judgments. This effect takes the form of context producing large
changes in performance on abstract items and little or no change in performance
on concrete items. Schwanenflugel and colleagues argue that this implies that
concreteness effects are reducible to differences in the availability of
context.
In other words, when abstract words are provided with an external
context, such as a supportive sentence stem, of equivalent potency to that
normally available to concrete words from within semantic memory, then they are
processed as efficiently as concrete words. Concrete words do not benefit as
much from an external context because they already have strong built-in
contexts, so an external context does little to change how these items are
processed. Therefore, according to this
view, there is no need to postulate a more architecturally complex separate
system for representing and processing imagistic information.
We believe that the above
conclusion may be premature. This is because the basic dual-coding theory as
proposed by Paivio (1984) does not argue that context cannot facilitate semantic
processing, nor does it argue that such contextual facilitation could not
supersede or mask concreteness effects.
It simply states that there are separate imaginal and linguistic
systems, and that the consequences of this structural configuration can sometimes
be manifested in behavior. With minor augmentation dual-coding theory could be
made to account for context effects such as those reported by Schwanenflugel
and colleagues, while retaining its central feature of multiple systems. One
possibility is that a supportive sentence frame works by “priming” relevant
sentence final words (e.g., Stanovich and West, 1983). The resulting semantic
facilitation within the linguistic system may be sufficient to overcome the
added benefit concrete words normally exhibit in isolation due to referential
processing via the imagistic system.1 This could happen, for
example, because context works faster or earlier than concreteness.
Note that at one level the
above explanation is actually very similar to that offered by
context-availability; external context compensates for richer/stronger internal
associations via priming in the linguistic system. However, upon closer
inspection the two theories remain quite distinct. While context availability
argues that the biggest effects of linguistic context should occur for abstract
words, context-extended dual-coding predicts similar effects of linguistic
context for abstract and concrete words within the verbal system. However, it
also predicts larger effects of context for concrete than abstract words
within the imagistic system. We will return to the discussion of the
predictions of each model in the Discussion section, but for now, suffice it to
say that the finding of a context by concreteness interaction does not
constitute evidence against the existence of multiple systems. Rather it simply
implies that there are two different factors that affect one or more processing stages in common (McClelland, 1979; Sternberg, 1969). If this is true, then one of the major
tenants of the context-availability
critique of dual-coding research would be rendered nugatory.
Event-Related Potentials
The ERP technique involves
measuring the brain's electrical activity (electroence-phalograms or EEGs)
detectable by scalp electrodes after specific stimulus events. These individual EEG waveforms are then
averaged across stimulus presentations to yield a waveform characterizing the
measurable part of the brain's electrical response to the stimulus (i.e., minus
the "noise" representing other brain activity). Researchers have linked the various
components (roughly the peaks) in the ERP waveform to a number of cognitive
processes (see Coles & Rugg [1995] for a recent introduction and review).
The ERP component of most
interest for present purposes is the N400, which is a negative-going wave
peaking at approximately 400 ms after the onset of a stimulus. Numerous studies
have implicated the N400 in some aspect of semantic processing. For example, Kutas and Hillyard (1980,
1984), in what is now the classic N400 paradigm, were the first to show that
this component is larger in amplitude in response to sentence final words that
are semantically anomalous (e.g., "He takes cream and sugar in his attention."),
and is greatly reduced or even absent to high probability congruent sentence
endings (e.g., "He takes cream and sugar in his coffee."). In contrast, the N400 has been shown to be
unaffected by physically anomalous but semantically congruent stimuli (e.g., a
congruent final word in a different type font). Subsequent studies revealed
that semantic anomalies throughout a sentence elicit N400s (e.g., Kutas &
Hillyard, 1983) as do anomalies occurring in sentences presented at very rapid
rates (up to 10 words/sec, Kutas, 1987).
An important study by Kutas,
Lindamood, and Hillyard (1984) showed that robust N400s could be obtained
without the occurrence of a semantic anomaly and that under these conditions
its amplitude is a monotonic, decreasing function of the cloze probability of
the final words of sentences. For
example, N400 amplitude was large in amplitude to less predictable, but
non-anomalous words (e.g., "Captain Sheir wanted to stay with the sinking raft.")
and was smaller in response to more predictable words (e.g., "She called
her husband at his office."). Based on findings such as these and
others it has been argued that the N400 reflects the process of integrating
semantic information into a relatively high level discourse or mental model
type of representation (e.g., Brown & Hagoort, 1993; Holcomb, 1993). In
this formulation larger N400s are taken as being indicative of a more effortful
or involved integration process. A common thread in virtually every account of
the N400 is that it is sensitive to contextual and semantic manipulations which
would appear to make it an ideal choice of dependent variables for use in
searching for the locus of interactions between concreteness and
context-availability.
Experiment 1
Kounios and Holcomb (1994)
demonstrated that ERPs are sensitive to concreteness, particularly when the
task involves deep semantic processing. Specifically, concrete words presented
in a list elicited a more negative ERP between 300 and 500 ms after stimulus
onset than did abstract words, and this difference was larger in a semantic categorization
task (Experiment 2) than in a lexical decision task (Experiment 1). This
suggests that the effects were at least partially mediated by semantic
properties of the words. The ERP
negativity coincided temporally with the classic N400, although the topographic
distribution of the concreteness effect differed from the typical N400 effect
in that it was maximal at anterior scalp locations, whereas the typical N400
effect tends to be centro-parietal (e.g., Kutas & Van Petten, 1988). However, more “anterior N400” distributions
have been observed previously in single-word tasks (e.g., Boddy, 1986; Nobre
& McCarthy, 1994) and in picture-processing tasks (e.g., Holcomb &
McPherson, 1993; Ganis, Kutas, & Sereno, 1996; McPherson & Holcomb, in
press). The most likely explanation for these differing distributions is that
the N400 is actually the product of several underlying neural generators whose
weights vary according to task demands and stimulus properties (Kounios, 1996;
Nobre & McCarthy, 1994). In other
words, the N400 probably does not reflect a unitary process, but rather several
functionally related, but neurally distinct ones. Regardless of whether this or
some other explanation of the differing scalp distributions for the N400 is
correct, the differential scalp distributions of ERPs reported by Kounios and
Holcomb (1994) for concrete and abstract words suggests the involvement of
non-identical neural and cognitive processing systems, which if correct, would
be inconsistent with single-code theories such as the context-availability
model.
The logic of the above
conclusion is based on what we will call the “spatial distinctiveness
principle”, which assumes that two or more different cognitive systems will
tend to be more spatially distinct within the brain than will a single
cognitive system. For example, if concrete and abstract words activate the same
basic cognitive processes and/or representations (a common semantic system)
then, on average, the same population of neurons should be active when these
two types of words are processed and these neurons should produce, on average,
a similar spatial pattern of electrical activity for both types of events. With
electrodes placed on the scalp this activity should show up as a difference in
the size of potentials (i.e., a significant main effect of word type), but not
in a difference in the distribution of potentials across the scalp (i.e., an
absence of a word type by scalp site interaction) . If, however, the two types
of words influence somewhat different cognitive processes/representations
(separate semantic systems) then, on average, somewhat distinct populations of
neurons should be active when these two types of events are processed and these
different populations should produce electrical activity with different scalp
distributions (a word type by scalp site interaction).
The spatial distinctiveness
principle is quite robust and, in theory, can differentiate between a variety
of possible neural organizations.2 For example, a single system
which encompasses an expansive cortical area or even multiple areas could be
differentiated from separate systems with equivalent spatial distributions.
This is because a large or multi-area single system need only be spatially
homogeneous with respect to the factor of interest. Such a single large system
would produce main effects for differences in strength of activity due to the
factor of interest (e.g., word type) that extend across a relatively wide
region of scalp, but no interaction between factor and site. Separate systems
of equivalent spatial distribution would produce the factor by site
interaction. This logic works for much smaller brain regions as well, although
an adequate number of electrodes placed close enough together to differentiate
the systems is necessary.
Another important finding in
the Kounios and Holcomb study was that repetition had a greater effect on the
ERPs to concrete than abstract words.
Furthermore, large repetition effects (decreased N400 amplitude to repeated
words) were observed over both hemispheres in both experiments for concrete
words, but were not observed for abstract words in Experiment 1 and only over
the left hemisphere in Experiment 2.
These results are in direct contrast with the context-availability model
which predicts that added context (in this case in the form of repetition)
should make a bigger difference in the processing of abstract words than
concrete words (Schwanenflugel & Shoben, 1983). This finding could be
interpreted as being consistent with the context-extended dual-coding theory
described above. This model predicts similar effects of linguistic context for
concrete and abstract words within the linguistic system, and larger effects of
context for concrete than abstract words within the imagistic system.
The authors did, however,
acknowledge some caveats concerning the interpretation of their results. First, the repetition effects reported in
Kounios and Holcomb may not have tapped the same type of contextual processes
as the sentence level context effects studied by Schwanenflugel and colleagues
(e.g., Schwanenflugel & Shoben, 1983).
Second, the frontal 300-500 ms negativity that differentiated concrete
and abstract words may not have been a “true” N400 effect (neither of the tasks
used by Kounios & Holcomb were the classic N400 paradigm) and thus may
reflect something other than semantic processing (see Neville, Kutas, Chesney,
& Schmidt, 1986).
The present study was an
attempt to address both of these issues and to replicate the word type by scalp
site interaction that Kounios and Holcomb interpreted as evidence for
dual-coding theory and against the context-availability model. To ensure that effects of context were being measured, a sentence
processing task was utilized rather than a single word task. Furthermore, to
ensure that what was being observed was, in fact, an N400, the classic
anomalous sentence paradigm used to elicit an N400 response (Kutas &
Hillyard, 1980) was employed.
Sentences in which the final
word was either congruent or incongruent (anomalous) were presented to
participants whose task was to decide if each sentence made sense. In the present experiment, the concreteness
of the sentence final words was manipulated in addition to their congruency, thus,
it was designed to examine effects of both concreteness and context. The design is much like one used by
Schwanenflugel and Stowe (1989) in which they found clear differences in
meaningfulness-judgment RTs between concrete and abstract items when these
words formed an anomalous ending of a sentence (concrete faster than abstract),
but no differences in RT when the two word types were congruent final words.
They interpreted this finding as further evidence for the context-availability
model.
The results of Kounios and
Holcomb (1994) suggest the following predictions with regard to ERPs in the
current experiment. First, concrete words should produce ERPs which are more
negative going between 300 and 500 ms (the N400 window) than abstract words.
Second, this difference should be due, at least in part, to the modulation of
the traditional N400 component, which should be revealed by an interaction
between context (a variable known to influence the N400) and concreteness.
Third, supporting the dual-code model, but not the context-availability model,
the distribution across the scalp of the context sensitive N400 effect should
be different for concrete and abstract words. Fourth, consistent with extended
dual-coding theory, but not with the context-availability model, the biggest
effects of context on the N400 should be for concrete words.
Method
Participants. Sixteen students (9 female, 7 male) between 18 and 20 years
(mean: 18.63) of age from the Tufts University community served as
participants. All were right-handed
native speakers of English with normal or corrected-to-normal visual acuity.
Stimuli and procedure. The stimuli for this experiment were 160 sentences, 80 of which
ended in an abstract word (mean concreteness rating: 2.52), the remaining 80 of
which ended in a concrete word (mean concreteness rating: 6.19). The sentences for the two final-word types
did not differ in cloze probability (concrete: .61 (SD .22), abstract: .59 (SD
.27)) and the final words did not differ in frequency (concrete: 65.1 per
million (SD 77), abstract: 69.7 per million (SD 92); Francis & Kucera,
1982) or length (concrete: 5.8 letters (SD 1.75), abstract: 5.9 letters (SD
1.65)). Concreteness and cloze
probability were assessed by having separate groups of participants rate the
materials. For concreteness 35 participants rated the pool of 160 final words
on a scale from 1 (very abstract) to 7 (very concrete). Several examples of each category were given
prior to the test. Cloze probability was calculated by having another group of
15 participants fill in what they thought was the most appropriate final word
for each of the 160 sentence stems. These participants were told to read each
sentence stem quickly and to fill into the final word position the first word
that came to mind. No word generated by any of the 15 cloze participants was
utilized in forming the anomalous sentence endings, thus effectively making the
cloze values for the anomalies zero.
From these sentences, four
lists were formed such that half of the sentences in each list (80) ended in an
appropriate or semantically congruent final word, while the other half
ended in a semantically anomalous final word (See Table 1). The anomalous sentences were counterbalanced
such that half of the 40 sentences that predicted concrete final words were
completed with anomalous abstract words, and half were completed with anomalous
concrete words. For the 40 anomalous
sentences that predicted abstract final words, half were completed with
anomalous concrete words, and the other half were completed with anomalous
abstract words. So, each of the four
lists had 40 sentences with semantically congruent concrete final words, 40
sentences with semantically congruent abstract final words, 40 sentences with
semantically anomalous concrete final words (20 predicting an abstract word and
20 predicting a concrete word), and 40 sentences with semantically anomalous
abstract final words (20 predicting an abstract word and 20 predicting a
concrete word). Each participant read
each of the 160 sentence frames and each of the 160 final words only once. However, across participants, each final
word and each sentence frame appeared in both the congruent and anomalous
conditions.
Table 1
Sample Sentences
|
A. Concrete final word, CONGRUENT |
|
Armed
robbery implies that the thief used a weapon. |
|
|
|
B. Abstract final word, CONGRUENT |
|
Lisa
argued that this had not been the case in one single instance. |
|
|
|
C. Concrete final word, ANOMALOUS |
|
Armed
robbery implies that the thief used a rose. |
|
Lisa
argued that this had not been the case in one single rose. |
|
|
|
D. Abstract final word, ANOMALOUS |
|
Armed
robbery implies that the thief used a fun. |
|
Lisa
argued that this had not been the case in one single fun. |
|
|
|
E. Concrete final word, NEUTRAL |
|
They
said it was because of the rose. |
|
Robert
said it was due to the weapon. |
|
|
|
F. Abstract final word, NEUTRAL |
|
They
said it was because of the fun. |
|
Robert
said it was due to this instance. |
The experiment was
self-paced, each trial beginning after the participant responded to the final
word from the previous sentence. Participant
responses (YES/NO) were registered by a small two-button panel resting in their
lap. Three seconds following the
participant's response, a fixation cross was displayed for 500 ms in the center
of the computer monitor. This served as
a warning that the next trial was about to begin. This was followed by a 300 ms blank screen, after which the first
word of the sentence was displayed. The
first word and each subsequent word in the sentence was sequentially displayed
for 200 ms each. Consecutive words
were separated by a 300 ms blank-screen inter-stimulus-interval (ISI) for a
total word-to-word onset interval of 500 ms.
The final word of each sentence was displayed with a punctuation period
to indicate that it concluded the sentence, and was followed by a 1300 ms
blank-screen ISI. All words were
centered on the display monitor. After
the final-word ISI, the message “RESPOND NOW” was presented in the center of
the display until the participant made his/her response. Participants were instructed to press the
“YES” button if the sentence made sense and the “NO” button if it did not. They were also told not to move or blink
their eyes from the onset of the fixation cross until the “RESPOND NOW” message
was presented. Response hand was counterbalanced across participants. Each participant was given 10 practice
trials prior to the run of 160 experimental sentences.
EEG procedure. Tin electrodes (Electro-Cap International, Inc., Eaton, OH) were placed
at several scalp sites (see the head schematic in Figure 1 for the relative
locations on the head), on the right mastoid bone, below the left eye, and to
the right of the right eye (all referenced to the left mastoid). The scalp sites included standard
International 10‑20 System locations: occipital left (O1) and right (O2);
and frontal left (F7) and right (F8).
Six electrodes were also placed at nonstandard locations over left and
right temporo-parietal cortex (30% of the interaural distance lateral to a
point 13% of the nasion-inion distance posterior to Cz: Wernicke’s left [WL]
and right [WR], right and left temporal cortex (33% of the interaural distance
lateral to Cz: TL and TR), and right and left anterior temporal cortex (50% of
the distance from T3/4 to F7/8: ATL and ATR).
The EEG was amplified with
Grass Model 12 amplifiers (3db cutoffs at .01 and 30 Hz) and was digitized
on-line at 200 Hz. Average ERPs were
formed off-line from correct-response trials free of ocular and movement artifacts
(which averaged less than 10% across conditions).
Data analysis. The ERP data to sentence
final words were quantified by calculating the mean amplitudes (relative to a
100 ms prestimulus baseline) in three latency windows (150-300 ms, 300-500 ms,
and 500-800 ms). These windows were
chosen because they roughly correspond to the latency ranges of the P2, N400,
and the late positivity reported in previous language studies (see Kutas &
Van Petten, 1988; Osterhout & Holcomb, 1995) and because they were also used
in our earlier study comparing concrete and abstract words (Kounios &
Holcomb, 1994).
The approach to statistical analysis involved the
use of a repeated measures analysis of variance (ANOVA) followed by simple
effects tests in the case of significant interactions. There were two levels of Sentence-Type
(congruent vs. anomalous) and two levels of final Word-Type (concrete
vs. abstract). ERPs at midline and
lateral sites were analyzed in separate ANOVAs so that laterality effects could
be assessed. Midline-site analyses
included an Electrode-Site factor (Fz vs. Cz vs. Pz). Lateral-site analyses included the factors
of Electrode-Site (frontal vs. anterior-temporal vs. temporal vs.
Wernicke’s vs. occipital), and Hemisphere (right vs. left). The Geisser-Greenhouse
correction (Geisser & Greenhouse, 1959) was applied to all repeated
measures containing more than one degree of freedom in the numerator. Finally,
analyses with significant interactions of stimulus variables with a topographic
factor (e.g., Electrode-Site or Hemisphere) were repeated after amplitude
values were normalized (using z-scores) separately within each level of the
Word- or Sentence-Type factor (see McCarthy & Wood, 1985). Only
interactions significant after normalization are reported.
Results
Accuracy Data. Subjects were very accurate in deciding if sentences made sense,
averaging 97.5% correct responses (Table 2).
Overall, they were significantly more accurate in their decisions about
anomalies than about congruent sentences (main effect of sentence-type: F[1,
15] = 13.71, MSe = .01, p
< .01). Furthermore, subjects were
more accurate in their responses to concrete than abstract final words in the
congruent-sentence condition, but performed equally well for the two word types
in the anomalous-sentence condition (Sentence-Type by Word-Type interaction: F[1,
15] = 4.87, MSe = .01, p < .05).
Table 2
Experiment
1 -- Percent Correct (Standard Deviation)
Final-Word
Type
|
|
Sentence-Type Concrete Abstract |
|
|
|
Congruent 97.3 (3.5) 95.2
(3.9) |
|
Anomalous 98.8
(1.6) 98.8 (1.6) |
Overview of ERPs. The grand-mean ERPs (averaged
across all 16 subjects) for congruent and anomalous final words are plotted in
Figure 1. The ERPs in this figure show
that there were several relatively early (less than 400 ms) components elicited
in both conditions. These potentials
are generally thought to reflect sensory and early perceptual processes (e.g.,
Rugg & Coles, 1995). They included a broadly-distributed early negativity
(N1) that peaked around 125 ms at all but the most posterior sites (i.e., O1,
O2). At the posterior sites, there was
an early positivity between 100 and 125 ms (P1) followed by a later N1 with a
peak near 200 ms. At most sites, the N1
was followed by a positivity between 200 and 300 ms (P2). Note that, with the possible exception of
the P2, none of the early components appeared to be differentiated according to
sentence-type.

Figure 1. Grand mean ERPs from 13 scalp sites for congruent and anomalous final words in Experiment 1. In this and all subsequent figures, the ERP waveforms for each electrode site are plotted separately, with anterior sites shown at the top of each figure, posterior sites at the bottom, left-hemisphere sites on the left, and right-hemisphere sites on the right. Three sites down the middle of the head are plotted in the middle of the figure. The approximate locations of these sites can be seen in the head schematic located at the top of the figure (note that this is a view looking down at the top of the head with the nose pointed toward the top of the figure). The x-axes display time with the vertical calibration bar placed at the time of the onset of the stimulus. Each x-axis tic represents 100 ms. Note that 100 ms of activity prior to stimulus onset is displayed. This was used as a baseline for equating the post-stimulus portion of each waveform. The y-axes represent voltages on a microvolt scale, with negative voltages plotted up, according to convention. Note that the peaks of the N1, P2, N400 and LPC components have been labeled at representative sites. Finally, the filled area between the ERPs for the congruent and anomalous final words represents the area used to quantify the N400 component (300 to 500 ms).
There were also several
later ERP components visible in the waveforms (see Figure 1). Following the P2, there was a negative-going
wave which peaked around 400 ms. This
negativity, which exhibited a broad scalp distribution, was clearly larger
(i.e., was more negative-going) for anomalous than congruent final words,
suggesting that it was a traditional N400 wave (cf. Kutas & Hillyard,
1980). Following the N400, there was a
late positive component (LPC), which
peaked between 600 and 800 ms over central and posterior sites. At these sites, the LPC was slightly larger
and peaked somewhat later for anomalous final words than for congruent ones.
Analyses by epoch. Differences between anomalous and congruent final words started
in the 150 to 300 ms time window, with anomalous endings producing more
negative-going ERPs than congruent endings (main effect of Sentence-Type, midline:
F[1, 15] = 13.60, MSe = 10.99, p < .01; lateral: F[1,
15] = 14.49, MSe = 9.25, p < .01). However, there was not a
significant main effect for Word Type or interaction between Word Type and
Electrode Site in this epoch.
In the 300 to 500 ms epoch, which
typically encompasses most of the activity of the N400 component (e.g., Kutas
& Hillyard, 1980), anomalous sentences continued to elicit more negative
ERPs than congruent sentences (main effect of Sentence-Type, midline: F[1,
15] = 67.65, MSe = 20.42, p < .0001; lateral: F[1, 15]
= 70.55, MSe = 20.33, p < .0001), and concrete final words
elicited more negative ERPs than abstract words (main effect of Word-Type,
midline: F[1, 15] = 10.46, MSe = 10.06, p < .01; lateral: F[1, 15] = 24.32, MSe =
7.62, p < .0001). As can be
seen in Figure 1 the difference between congruent and anomalous sentences was
largest at centro-parietal scalp sites (Sentence-Type X Electrode-Site interaction, midline: F[2,30] =
11.86, MSe = .08, p < .002;
lateral: F[4,60] = 8.67, MSe
= .33, p < .01), and was also larger over the right hemisphere than
the left (Sentence-Type X Hemisphere interaction: F[1,15] = 13.47, MSe = .41, p < .01).
The difference between
concrete and abstract final words was larger over anterior lateral sites than
posterior lateral sites (Word-Type X Electrode-Site interaction, F[4,60] =14.72, MSe =
.16, p < .0001). There were
also significant Sentence-Type X Word-Type X Electrode-Site interactions
(midline: F[2,30] = 4.64, MSe
= .07, p < .05; lateral: F[4,60]
= 7.36, MSe = .26, p < .01), indicating that the effects of
concreteness (Word-Type) were different for congruent and anomalous final words
(see the difference waves in Figure 2).
Therefore, this latter interaction was followed up by separate ANOVAs for
the two types of sentences. Analyses of the anomalous sentences produced a
significant main effect of Word-Type (midline: F[1,15] = 6.38, MSe
= 20.63, p < .05; lateral: F[1,15] = 13.88, MSe =
21.27, p < .01) with concrete words more negative than abstract (see
Figure 3). Importantly, there was also a significant Word-Type X Electrode-Site
interaction (midline: F[2,30] = 7.09, MSe = .09, p <
.01; lateral: F[4,60] = 18.50, MSe = .22, p < .0001),
indicating that the difference between concrete and abstract words was larger
at more anterior sites. In contrast, for congruent sentences (Figure 4), there
were no significant main effects or interactions involving Word-Type (p’s
> .5), indicating that the concreteness effects revealed by the overall
analyses in the 300 to 500 ms window were due almost exclusively to the
anomalous sentences (compare Figures 3 and 4).

Figure 2. Plotted in this figure are difference waves which were produced by subtracting ERPs to congruent sentences from ERPs to anomalous sentences, for concrete and abstract final words in Experiment 1. The area under the large negative deflection between 200 and 600 ms represents the N400 effect (the difference between the anomalous and congruent final words). Note that this effect is larger for concrete and abstract words, especially at the most anterior sites.

Figure 3. Grand mean ERPs for congruent final words that were concrete or abstract (Experiment 1).

Figure 4. Grand mean ERPs for anomalous final words that were concrete or abstract (Experiment 1).
As can be seen in Figure 1
the effects of Sentence-Type had dissipated by the 500 to 800 ms temporal
window at most electrode sites. There was, however, a small residual effect at right
anterior sites which was revealed in the Sentence-Type X Electrode-Site X
Hemisphere interaction ( F[4,60]
= 4.97, MSe = .07, p. < 05).
However, as in the previous epoch, there continued to be robust effects
of concreteness (Word-Type – see Figure 3) at all anterior sites for anomalous
sentences (Sentence-Type X Word-Type X Electrode-Site interaction midline: F[2,30]
= 13.25, MSe = .05, p < .001; lateral: F[4,60] = 7.33, MSe = .14, p < .01). This was confirmed by separate follow up
analyses on the two types of sentences.
For the anomalous sentences, there was a significant Word-Type X
Electrode-Site interaction (midline: F[2,30] = 9.38, MSe = .06, p
< .01; lateral: F[4,60] = 11.71, MSe = .10, p <
.0001). For the congruent sentences,
again, there were no significant main effects or interactions involving
Word-Type (compare Figures 3 and 4).
Discussion
In this experiment,
participants verified whether congruent and anomalous sentences with either
concrete or abstract final words made sense.
As in previous studies (e.g., Kutas & Hillyard, 1984) ERPs
time-locked to the onset of final words revealed large effects of sentence
context. Sentences with anomalous endings produced more negative-going ERPs
than sentences with congruent endings, particularly in the 300 to 500 ms
window. These effects of context had a central-posterior distribution and were
slightly larger over the right than left hemisphere, which is consistent with
the known distribution of the N400 component (Kutas & Hillyard, 1984). The
most widely accepted view of the N400 is that it reflects differences in the
ease of integrating semantic information into a higher level discourse or
mental model type of representation (e.g., Brown & Hagoort, 1994; Holcomb,
1993); the larger the N400, the more effortful or involved the integration
process. According to this view participants in the current study had more
difficulty integrating the meaning of anomalous final words into the
representation established by the prior sentence context.
As predicted, there were
also clear effects of concreteness. Concrete final words elicited more
negative-going ERPs than abstract final words, most notably within the window
of the N400 (300 to 500 ms), but also extending into the later measurement
epoch (500 to 800 ms). Also as predicted, the concreteness effect had a more
anterior scalp distribution than the overall effects of context. However, of
most importance to the goals of this study, in the 300 to 500 ms window, there
was a clear interaction of concreteness and context which strongly suggests
modulation of at least some of the neural generators associated with the N400
component. Given the large literature linking the N400 to the processing of
meaning (see Osterhout & Holcomb, 1995 for a review), this finding adds
credence to the argument that the observed effects of concreteness and context
are indeed semantic.
Interestingly, there were no
discernible ERP differences in any window between concrete and abstract words
in the congruent sentences. In other words,
a supportive sentence context wiped out all ERP evidence of concreteness
effects, a finding that would appear to be consistent with the central
prediction of the context-availability model. However, two other findings,
which replicate effects reported by Kounios and Holcomb (1994), are
inconsistent with the predictions of this theory. First, in the anomalous
sentences, the differences between concrete and abstract words varied
systematically across the scalp. This concreteness effect was largest at the
most anterior scalp sites, with concrete words producing much larger
negativities than abstract words. This difference gradually decreased moving
towards the back of the head until at the most posterior sites (O1/O2) there
were virtually no concreteness amplitude differences (see Figure 4). Under the
assumptions of the spatial distinctiveness principle outlined in the
Introduction, this pattern of results strongly suggests that non-identical
neural/cognitive systems are responsible for processing the two word types, at
least when they are read outside of a supportive context. In this case, the
labile topography of the N400 also supports the notion that this ERP component
is generated by at least two different neural sources whose relative
contributions to scalp topography vary across tasks and materials (Kounios,
1996; Nobre & McCarthy, 1994).
The above pattern of results
is most consistent with multiple system theories such as dual-coding theory
(Paivio, 1986) which argue that semantic information is represented and
processed by two separate systems, one linguistic and one image based, and that
while concrete words are processed in both systems, abstract words are
processed primarily in the linguistic system. On the other hand, this pattern
of results is clearly at odds with the predictions of single system accounts of
concreteness, such as the context-availability model (Schwanenflugel, 1991),
which argue that both word types are processed and represented within a single
system. This latter type of model predicts only main effects of concreteness
and a flat distribution for the N400 across the scalp (i.e., no interaction of
scalp site and concreteness).
A second finding that was at
odds with the predictions of the context-availability model and data reported
to support it, was that the effects of context in the N400 window were larger
for concrete than abstract words. In other words, when placed in a supportive
context, concrete words revealed a more dramatic decline in N400 amplitude than
abstract words. The context-availability model predicts larger context effects
for abstract words.
In contrast to the above
findings, prior findings using RT as the dependent measure have been
interpreted as supporting the context-availability predictions. For example, in
lexical decision and meaningfulness judgments, abstract words have been shown
to produce large declines in RT going from a non-supportive to a supportive
sentence context, while concrete words, which produce relatively fast
non-supportive context RTs to begin with, show only small or no changes in RT
when placed in a supportive context (Schwanenflugel & Shoben, 1983;
Schwanenflugel & Stowe, 1989). Context-availability proposes that the same
process that underlies faster out-of-context concrete word RTs (built-in
context), also accounts for concrete words’ fast responses within a supportive
sentence (external context) and that it is only the source of the context that
differs. In essence, participants simply trade one source of context for
another, and thus, RTs remain relatively unchanged.
This apparent contradiction
in findings between behavioral measures and ERPs is not without
precedence. Kounios and Holcomb (1992)
also found that RT and N400 amplitude were not correlated in a sentence
verification task and concluded that this is because these measures are not
necessarily sensitive to the exact same set of underlying cognitive operations.
In particular, they argued that RT is much more sensitive to participant’s
decision processes and task dependent strategies than is N400 amplitude. Accordingly, one way to explain the apparent
discrepancy between the RT and N400 concreteness by context interactions is to
assume that the behavioral judgments in the Schwanenflugel studies were
relatively more sensitive to
participant’s decision strategies (in particular a two step decision
process), while the N400s in the current study and in the Kounios and Holcomb
(1994) study were primarily sensitive to changes in a single semantic
integration process and were relatively immune to decision processes. In this
view N400s were larger to out-of-context concrete than abstract words because
concrete words have relatively more semantic information (linguistic and
imagistic) that readers must attempt to integrate into an unrelated context and
therefore integration was more effortful.3 N400s were equivalently
small to both word types following supportive contexts because semantic
information from the final word was easily integrated into the representation
established by the context. The observation that the N400s were equally small
for congruent sentences fits with the fact that the concrete and abstract
sentence stems were matched to produce equivalent levels of contextual support.
In the case of RT, this view
proposes that when participants make lexical decisions or meaningfulness
judgments they first use the strategy of determining if a target item can
easily be integrated into the prior sentence context. If it can, then it’s
likely a word (pseudowords cannot be semantically integrated) or a meaningful
item (non-meaningful items cannot easily be integrated) and RT is similarly
fast for both concrete and abstract words. If, however, the context is
non-supportive and integration does not easily occur, then participants must
fall back on a second strategy to make their judgments. Here it is proposed
that participants now use integration difficulty to aid their decision process.
For example, in a meaningfulness judgment task they might respond to concrete
words in an anomalous context more quickly than abstract words, because they
detect the relatively greater effort involved in the semantic integration
process for concrete words. A similar strategy would work for lexical decision
as it should be relatively easier to discriminate out-of-context concrete words
from pseudowords than out-of-context abstract words from pseudowords due to the
greater effort involved in integrating concrete words into a non-supportive
context. This type of explanation is similar to one offered by Neely and
colleagues (e.g., Neely, 1991) to explain the pattern of RT effects in semantic
priming lexical decision tasks.4
One difference between the
current results and those of Kounios and Holcomb (1994) is that the earlier
study reported a lateral asymmetry for the N400 effect between concrete and
abstract words (larger differences over the right hemisphere) in their second
experiment. No such asymmetry was found here, although there was a slight
(non-significant) trend in this direction (compare the WL and WR sites in
Figure 2). The most likely reason for this discrepancy are differences in the
task used in the two studies. Kounios and Holcomb found the asymmetry in a
concreteness judgment task with single word contexts. No asymmetry was found
here in a sentence meaningfulness judgment task. One possibility is that the
larger right hemisphere effect was due to a more explicit use of imagery in the
concreteness judgment task, although West and Holcomb (submitted) also failed
to find a hemispheric asymmetry between word types in a task where participants
were asked about the ease of imageability.
In summary, the findings of
this experiment support and extend Kounios and Holcomb (1994) showing that the
concreteness factor yields a topographic distribution of ERPs consistent with
the structural assumptions of extended dual-coding theory, but which is not
reconcilable with the notion that concreteness effects are reducible to the
availability of supportive context.
Experiment 2
In Experiment 1, no
difference was observed in the ERPs elicited by congruent concrete and
congruent abstract words. However,
anomalous concrete words produced substantially more negative values in the
time window of the N400 than did abstract words. Dual-coding theory
proposes that this larger effect for concrete words is due to the
activity of two qualitatively different systems (verbal and imagistic)
processing the concrete words in parallel while only one system (the verbal
system) processes abstract words. There
is, however, an alternate explanation, namely, that in the case of the
anomalies, the concrete final words were somehow more anomalous than the
analogous abstract final words. Perhaps the abstract words were inherently more
vague and seemed less anomalous than their concrete counterparts. By this same logic, the concrete words can
be readily identified as sharply incongruent, since they refer to more specific
entities. According to this view, the
larger N400 seen for the concrete words in Experiment 1, and perhaps the more
anterior distribution of this effect may have had nothing specifically to do
with the concreteness of the word, but rather to the degree of anomaly. This is
possible, because while the congruent sentences were carefully matched for
cloze probability across the two word types, anomalies were formed by attaching
final words from high-cloze sentences to another sentence stem. In forming
anomalies two rules were followed. First, none of the 15 individuals that
participated in the cloze procedure filled in the “target” word for the
anomalous sentence stem (i.e., cloze probability = 0). Second, in selecting
items for anomalous sentence stems the experimenters used their best intuitions
in attempting to pair anomalous ending with sentence stems so that each ending
was not related to any acceptable ending for the sentence. This is probably not the best way to equate
level of anomaly between the two word types. Ideally the concrete and abstract
anomalies should have been equated based on some more objective method of
assessment.
Experiment 2 was designed to
circumvent the possibility that the concrete anomalous endings were somehow more anomalous than the
abstract anomalies and that this difference rather than differences in
concreteness was responsible for the N400 concreteness findings in Experiment
1. However, rather than attempting to match the anomalous concrete and abstract
final words on degree of anomaly, a “neutral” condition was added instead. This
was done for two reasons. First, equating for degree of anomaly would
undoubtedly require using a different set of sentences than those used in
Experiment 1 making any between-experiment differences difficult to interpret.
Second, anomaly ratings and ERPs would have had to have been collected in
different groups of participants performing somewhat different tasks (anomaly
detection vs. anomaly rating). This would make it difficult to know if degree
of anomaly had been adequately controlled in the ERP participants. To
circumvent these problems a different approach was taken. Rather than equating
anomaly levels a neutral condition was added to the design. In this condition
concrete and abstract words occurred at the ends of sentences where they were
not anomalous, but also where they would not have been easily predicted.
Schwanenflugel and Stowe (1989) used neutral sentences (e.g., The next word
will be …) in the out-of-context condition of one of their experiments
contrasting concrete and abstract words. These sentences had the desired effect
in their study of producing large
differences in RT between concrete and abstract words. Similar low cloze
probability sentences have been demonstrated by Kutas and colleagues (Kutas et
al., 1984) to produce robust N400s. If, under this condition, concrete words do
not elicit a larger N400 than abstract words, it could be concluded that the
results observed in Experiment 1 may have been due to the concrete words having
been perceived as more anomalous than the abstract words. Conversely, if concrete words still elicit a
larger negativity in the neutral condition, then it can be concluded that the
results of Experiment 1 were not simply due to the degree of anomaly, but
rather reflect inherent differences in the processing of the two word types.
Neutral sentences were
constructed such that the final words were congruent in that they fit into the
sentences, but were of low cloze probability in that the sentence context
provided no evidence on which to predict the final word or even to predict its
concreteness (e.g., “Larry said it must have been the wine.”; “It
happened because of her mood.”). Consistent with the results of
Experiment 1 it was predicted that concrete words at the ends of neutral and
anomalous sentences would produce larger N400s with a more anterior
distribution than abstract words at the ends of these same sentences.
Method
Participants. Twenty-four naïve undergraduates (21 female, 3 male) between 18
and 22 years of age (mean: 18.67) from Tufts University served as
participants. All were right-handed native
speakers of English with normal or corrected-to-normal visual acuity.
Stimuli and procedure. Stimulus materials for congruent and anomalous sentences were
taken from Experiment 1. However, half
of the congruent sentence stems were replaced with neutral stems. Thus,
Experiment 2 consisted of 80 anomalous sentences, 40 congruent sentences, and
40 neutral sentences. Neutral sentence frames were composed in such a way as to
be semantically acceptable but contextually ambiguous (see Table 1 for an example).
To assure that this was the case a separate group of 15 participants was used
to assess the cloze probabilities of these materials. These participants read
the neutral stems and filled in the first sentence ending word that came to
mind. The mean cloze value for neutral sentences was .007 (one participant each
filled in the appropriate final word for each of four of the 40 neutral
sentences).
As in Experiment 1
participants were instructed to press the “YES” button if the sentence made
sense (congruent and neutral) and the “NO” button if it did not (anomalies).
Four lists were created so that anomalous and congruent/neutral sentence frames
were each completed with both a concrete and an abstract final word. The lists were counterbalanced such that each
word appeared in each type of sentence frame.
Thus, each participant saw eight sentence types. These consisted of the same four anomalous
sentence types from Experiment 1, congruent sentences with concrete or abstract
final words, and neutral sentences with concrete or abstract final words.
Both the experimental procedure and the ERP
procedure were identical to that in Experiment 1. Data analysis was also identical to that in Experiment 1, except
for the inclusion of three levels of Sentence-Type (congruent vs. anomalous vs.
neutral). These conditions were compared in a single omnibus ANOVA and were
followed up with separate pair-wise analyses were warranted.
Results
Accuracy data. In this experiment, participants were generally quite accurate in
deciding whether sentences made sense, averaging 91.3% correct responses (Table
3). However, participants were less
accurate in correctly judging the neutral sentences than the congruent or
anomalous sentences (main effect of Sentence-Type, F[2, 46] = 19.36, MSe
= 104.57 , p < .0001). They
also responded more accurately to sentences with abstract final words than to
sentences with concrete final words (main effect of Word-Type, F[1, 23]
= 5.58, MSe = 30.87 , p < .05), but only in the
neutral-sentence condition (Sentence-Type X Word-Type interaction, F[2,
46] = 5.05, MSe = 33.01, p < .05).
Table
3
Experiment
2 -- Percent Correct (Standard Deviation)
|
Final-Word
Type |
|
Sentence-Type Concrete Abstract |
|
|
|
Congruent 95.8 (7.2) 95.5 (7.0) |
|
Anomalous 97.5
(3.0) 96.3 (4.0) |
|
Neutral 82.3 (15.3) 88.5 (10.9) |
Overview of ERPs. The grand-mean ERPs (averaged across all 24 participants) for all
congruent, anomalous, and neutral final words are plotted in Figure 5. In all conditions, the ERPs show early
components (P1, N1, P2) similar to those observed in Experiment 1 (Figure 1). Again, only the P2 component appeared to be
affected by sentence type.

Figure 5. Grand mean ERPs for congruent, anomalous, and neutral final words in Experiment 2.
The later ERP components
were also similar to those in the previous experiment. There was again a negative-going wave which
peaked at approximately 400 ms (N400) that was broadly distributed. This component was clearly larger for
anomalous and neutral final words than for congruent final words. Following the N400, there was again a
positive-going wave (LPC) peaking at 600-800 ms over central and posterior
sites. This late positivity was larger
for anomalous final words and smallest for neutral final words.
Analyses by epoch. In the 150 to 300 ms
time-window, there was a main effect of Sentence-Type (midline: F[2, 46]
= 5.46, MSe = 6.4, p < .01; lateral: F[2, 46] = 5.83, MSe
= 10.3, p < .01). Follow-up analyses indicated that anomalous final
words produced more negative ERPs than either congruent or neutral final words
(Congruent vs. Anomalous, midline: F[1,23] = 10.54, MSe = 6.6, p
< .01; lateral: F[1,23] = 5.41, MSe = 9.2, p < .05;
Anomalous vs. Neutral, midline: F[1,23] = 4.43, MSe = 4.9, p
< .05; lateral: F[1,23]= 14.96, MSe = 7.8, p <
.001).
The main effect of
concreteness was not significant in the 150 to 300 epoch, but there was a
significant interaction between Word-Type and Sentence-Type (midline: F[2,46]
= 6.09, MSe = 13.1, p < .01; lateral: F[2,46] = 3.42, MSe
= 15.8, p < .05). Follow-up analyses indicated that concrete words
yielded more negative ERPs than did abstract words for both anomalous and
neutral sentences when compared to congruent sentences (Anomalous vs.
Congruent, midline: F[1,23] = 12.12, MSe = 10.5, p < .002;
lateral: F[1,23] = 5.96, MSe = 14.1, p < .03; Neutral vs.
Congruent, midline: F[1,23] = 5.60, MSe = 19.7, p < .03; lateral:
F[1,23] = 3.79, MSe = 20.7, p < .06; Anomalous vs. Neutral, p’s
> .7).
In the 300 to 500 ms
time-window there was a main effect of Sentence-Type (midline: F[2, 46]
= 34.57, MSe =25.8, p < .0001; lateral: F[2, 46] =
28.50, MSe = 33.9, p < .0001) and a Sentence-Type X
Electrode-Site interaction (midline: F[2,46] = 11.2, MSe = .10, p
< .0001; lateral: F[4,92] = 25.38, MSe = .31, p <
.0001). Follow-up analyses indicated that both anomalous and neutral final
words elicited more negative ERPs than did congruent final words, and that
these effects tended to be larger over more posterior sites (Sentence-Type X
Electrode-Site interaction: Congruent vs. Anomalous, midline: F[2,46] =
5.52, MSe = .1, p < .05, lateral: F[4,92] = 7.01, MSe
= .3, p < .01; Congruent vs. Neutral, midline: F[2,46] =
19.60, MSe = .1, p < .0001; lateral: F[4,92] = 43.95, MSe
= .3, p < .0001 – see Figure 5).

Figure 6. Grand mean ERPs for congruent final words that were concrete or abstract (Experiment 2).

Figure 7. Grand mean ERPs for anomalous final words that were concrete or abstract (Experiment 2).

Figure 8. Grand mean ERPs for neutral final words that were concrete or abstract (Experiment 2).
During
the 300 to 500 ms epoch, there was also a main effect of Word-Type (midline: F[1,
23] = 23.48, MSe = 19.0, p < .001; lateral: F[1, 23] =
36.28, MSe = 23.7, p < .0001)], a Sentence Type X Word-Type
interaction (midline: F[2, 46] = 9.30, MSe = 11.8, p <
.001; lateral: F[2, 46] = 9.35, MSe = 23.6, p < .0001), and a three-way
interaction between Word-Type, Sentence-Type and Electrode-Site (midline: F[4,
92] = 2.96, MSe = .1, p
< .05; lateral: F[8, 184] = 9.03, MSe = .2, p <
.0001). Follow-up analyses indicated that concrete words produced significantly
more negative-going ERPs than abstract words, but only for the anomalous and
neutral conditions (main effect of Word-Type: Anomalous, midline: F[1,
23] = 32.33, MSe = 8.7, p < .0001; lateral: F[1, 23] =
58.17, MSe = 10.5, p < .0001; Neutral, midline: F[1, 23]
= 27.91, MSe = 13.8, p < .0001; lateral: F[1, 23] =
28.52, MSe = 18.9, p < .0001; Congruent, p’s > .5 –
compare Figures 6, 7 and 8).
Furthermore, for Anomalous and Neutral endings, these effects of
concreteness tended to increase in magnitude toward more anterior sites
(Word-Type X Electrode-Site interaction, Anomalous midline: F[2, 46] =
5.13, MSe = .1, p <
.05; lateral: F[4, 92] = 16.31, MSe = .2, p < .0001; Neutral midline: F[2,46]
= 3.43, MSe = 1.0, p < .05, lateral, p < .1).
However, for Congruent endings, there were differences between the two word
types only at the most posterior lateral sites (O1/O2) where Concrete endings
were more negative-going than Abstract endings (Word-Type X Electrode-Site
interaction, Congruent lateral, F[4,92] = 5.68, MSe = .2, p
< .01).
In the 500 to 800 ms epoch,
there was again a main effect of Sentence-Type (midline: F[2, 46] =
8.58, MSe = 19.53, p < .001; lateral: F[2, 46] = 13.88,
MSe = 30.17, p < .001) and an interaction between
Sentence-Type and Electrode-Site (midline: F[4,92] = 10.10, MSe =
.10, p < .0001; lateral: F[8,184] = 19.54, MSe = .30, p
< .0001). Follow-up analyses revealed that Neutral sentence final words
elicited more negative ERPs than either Congruent or Anomalous final words (Sentence-Type,
Neutral vs. Congruent, midline: F[1,23] = 6.71, MSe = 21.81, p
< .05; lateral: F[1,23] = 12.66, MSe = 41.10, p <
.01; Neutral vs. Anomalous, midline: F[1,23] = 18.16, MSe =
17.71, p < .001, lateral: F[1,23] = 23.38, MSe = 30.79,
p < .0001), and these differences tended to have a posterior
distribution [Sentence-Type X Electrode-Site interaction, Neutral vs.
Congruent, midline: F[2, 46] = 12.62, MSe = .11, p < .001; lateral: F[4, 92]
= 20.44, MSe = .31, p
< .0001; Neutral vs. Anomalous, midline: F[2,46] = 18.14, MSe
= .09, p < .0001; lateral: F[4,92] = 39.51, MSe = .19, p
< .0001).
During the 500 to 800 ms epoch, there was also a main effect of Word-Type (midline: F[1, 23] = 10.75, MSe = 19.5, p < .01; lateral: F[1, 23] = 17.20, MSe = 28.0, p < .001), a Sentence-Type by Word-Type interaction (midline: F[2, 46] = 11.94, MSe = 11.9, p < .001; lateral: F[2, 46] = 11.50, MSe = 15.7, p < .001) and at lateral sites a three-way interaction between Sentence-Type, Word-Type and Electrode Site (F[8,184] = 5.28, MSe = .12, p < .01). Follow-up analyses indicated that concrete words were more negative going than abstract words for anomalous and neutral, but not for congruent final sentences (Word-Type main effect, Anomalous midline: F[1, 23] = 18.83, MSe = 6.24, p <.001; lateral: F[1, 23] = 33.60, MSe = 9.75, p < .0001; Neutral midline: F[1,