Anticipatory Coarticulation and Aphasia: Implications for Connectionist Models of Speech Production

William F. Katz

Department of Psychiatry, UCSD

_____________________________________________________________

NOTE: In the email version, there are no special characters
for IPA symbols. Thus, phonetic transcriptions include symbols
which should be translated into the following:

[*] = schwa
[E] =  epsilon (lax, central vowel)
[o:] = mid front rounded vowel, half-close
--------------------------------------------------------------

Introduction

     Much of human behavior may be  described  as  involving
serially  ordered  processes.  This  is  true for both motor
behavior as well as perception and comprehension. What makes
serially  ordered  aspects of behavior difficult to model is
that although discrete stages of activity may be psychologi-
cally  perceptible,  actual behavior is rarely context-free.
That is, we have the ability to perform the "same" types  of
motor  movements  in vastly differing real world situations,
as well as to decode "single" units of meaning out of highly
parallel information streams.

     Nowhere has this phenomenon been more apparent than  in
speech  production and perception. Even within the component
of  language  traditionally  thought  to  involve  the  most
"bottom-up"  information, i.e., phonology, one is confronted
with information present in a highly context-dependent form.
Speech  has  remained one of the more problematic aspects of
human communication to study because it has  been  extremely
difficult  to identify the units corresponding to components
of phonological analysis in either articulatory gestures  or
in  the  acoustic waveform. At the core of this problem is a
phenomenon known as  "coarticulation"  (or  "co-production",
Fowler,  1980). This refers to the fact that speakers do not
string together  discrete  sound  segments  as  beads  on  a
string,  but rather overlap speech sounds in a graded, time-
compressed manner.

     Coarticulation may  be  generally  classified  as  per-
severatory  ("backwards",  "left-to-right")  or anticipatory
("forwards, "right- to-left").  Anticipatory  coarticulation
is  of  special interest to speech researchers because it is
considered to be a  measure  of  the  planning  of  upcoming
speech  segments.  By examining speakers' ability to antici-
pate articulatory configurations it  has  been  possible  to
gain  insight  into the nature of the speech sequencing pro-
cess.

     Speech production has traditionally been  divided  into
planning  and execution processes. These processes have been
commonly given names borrowed from linguistic theory,  i.e.,
"phonemic"  (or selectional), and "phonetic" (motor output).
Because coarticulation involves the translation of  linguis-
tic  targets  into speech articulator movements, the process
entails both  phonemic  and  phonetic  information.  It  has
therefore  remained controversial whether these regularities
should be captured in a linguistic grammar  (cf.  Browman  &
Goldstein,  1986;  Keating,  in  press;  Fowler,  in press).
Importantly, the phonemic/phonetic nature of  coarticulation
has also made it necessary to study cognitive and linguistic
representations, as well as a broad range of  physical  data
(e.g.,  the kinematic properties of the speech articulators)
in order to best capture the facts.

     A recent breakthrough in modeling cognitive aspects  of
coarticulation has been made by Jordan (1986), who simulated
anticipatory rounding and nasalization in a small corpus  of
French and English words and phrases. Jordan proposed a con-
nectionist model which receives as input a  "plan"  (in  the
form of featural specifications for individual phonemes) and
yields  as  output  a  list  of  coarticulatory  constraints
presented  occurring serially across time. Jordan's model is
based  upon  previous  PDP   frameworks,   incorporating   a
recurrent  network  of  processing  units, "hidden units" to
capture nonlinear response  patterns  (Hinton  &  Sejnowski,
1983),  and a back propagation learning algorithm to provide
training (Rumelhart, Hinton, & Williams, 1986). This  model,
however,  has  provided  two  key innovations: First of all,
input was separated into state and plan units. This  permit-
ted  nominally  serial  properties  to  be  modelled without
recall to  explicit  temporal  order  or  action  sequences.
Secondly,  plan  units  (representing  phoneme strings) were
designed to include "don't care" conditions. This  permitted
anticipated features (e.g., rounding) to spread over certain
phonemic segments and not others.

     Jordan was careful to point out that the model was  not
intended to be a "realistic model of speech production", due
to the fact that complex lower-level (motoric) processes are
also  involved in actual speech. Moreover, it was noted that
the same regularities captured in the connectionist approach
could  also  be  explained  by means of "traditional" models
(e.g.,  feature  spreading,  Henke,   1966).   Nevertheless,
Jordan's  model  was  able to predict the degree of featural
spread (i.e., the "boundary conditions" for articulators) in
a  manner consistent with the empirical data, and was there-
fore claimed to provide a more parsimonious account of coar-
ticulation.  The  success  of  this initial attempt suggests
that more elaborated connectionist models might  be  greatly
useful in coarticulation research.

     Numerous empirical studies have been conducted  analyz-
ing the acoustic, perceptual, and kinematic details of anti-
cipatory coarticulation in the speech of normal adults  (see
Sharf  and  Ohde,  1981; Lubker and Gay, 1982, for reviews).
These  studies  have  addressed   important   claims   about
language-dependent  features  of  speech  planning, and have
yielded  important  information  about  the  properties   of
individual  speech  articulators.  In  addition, researchers
have recently begun to examine the development of  coarticu-
lation  in normal children (Repp, 1986; Sereno et al., 1987;
Nittrouer, 1989a; 1989b; Katz, Kripke, and  Tallal,  1989a),
as  well  as in children presenting with language impairment
(Hewlett, 1988; Katz, Kripke, and Tallal, 1989b).

     This article will focus primarily upon studies of adult
aphasic  subjects  (i.e.,  subjects presenting with specific
damage  to  brain  regions  known  to  subserve  speech  and
language function).  This research has sought to make use of
"experiments in nature" to analyze  a  number  of  important
questions  about  speech  behavior.   First of all, the data
address whether the sound sequencing capabilities  necessary
for   naturally  coarticulated  speech  are  linked  to  the
integrity of particular neural structures in the  brain.  It
is  possible to investigate, for example, hypotheses stating
that anterior brain regions (esp. "Broca's Area")  are  pre-
ferentially  involved  in  speech  motor  planning (Mlcoch &
Noll, 1980, Kimura and Watson, 1989). In addition, the coar-
ticulatory  patterns of aphasic speech have been viewed as a
possible means of determining the extent  of  normal  speech
motor  planning  processes.   That is, by observing the con-
straints governing aphasic speech breakdown, it may be  pos-
sible  to  infer  specific planning processes at work in the
normal brain. Finally, it  has  also  been  of  interest  to
determine  whether  coarticulatory  data  comport with known
patterns of  "phonemic"  and  "phonetic"  disintegration  in
aphasic speech.

     In this paper, I shall first provide a brief summary of
the facts known about coarticulation in normal adult speech.
This will be followed by  a  short  description  of  aphasic
speech  characteristics, including facts known about coarti-
culation. The body of this report will be concerned with new
research  addressing labial, lingual, and velar anticipatory
coarticulation in normal and  aphasic  German-speaking  sub-
jects.  The  results  of  these investigations indicate that
anterior aphasic subjects show essentially intact  anticipa-
tory  coarticulation.  These data will be discussed in light
of current models of speech production.

Coarticulation in normal, adult speech

     The most widely studied forms of anticipatory coarticu-
lation  involve  the  motion of the lips, tongue, and velum.
Anticipatory labial coarticulation typically  involves  ini-
tiation  of  a  rounding gesture during consonant production
preceding a rounded vowel. For example, a speaker  producing
the  English  syllable [su] will round his lips at the start
of the [s] in anticipation of the rounded vowel [u]. In con-
trast,  no rounding occurs when he is producing the syllable
[si].

     Studies of anticipatory  lingual  coarticulation  typi-
cally  focus  on the front/back positioning of the tongue in
velar stop closure as a function of the  feature  specifica-
tion  of  the  following  vowel.   For instance, the English
phoneme /k/ has a front allophone with a relatively anterior
vocal  tract constriction, and a back allophone with a rela-
tively posterior constriction.

     Anticipatory  velar  coarticulation  describes  effects
upon  velar height as a function of the nasality features of
upcoming phones.  For example, the presence of the nasalized
consonant  [n]  in the English word "pent" may begin shortly
after the initial [p], effectively nasalizing the vowel [E].

     Coarticulation has been studied by means  of  acoustic,
perceptual  and kinematic analyses. These studies have indi-
cated that the timing and extent of anticipatory coarticula-
tion is language-particular. Thus, one finds that lip round-
ing is anticipated earlier and  with  greater  precision  in
Swedish  (a  language  having  an  elaborate  set of rounded
vowels) than in English (which has relatively  little  vowel
rounding).  It  has  also  been  shown  that  within a given
language there may be substantial individual variation, with
some  subjects  showing  more  "feature- based" anticipatory
patterns, and others showing evidence of anticipation over a
"phase-locked",  single  window of time (see Lubker and Gay,
1982, for details).

Speech production in aphasia

     The major  subdivision  between  aphasic  syndromes  is
based  upon  the  character  of speech output (Goodglass and
Kaplan, 1982).  Aphasic subjects with lesions in  the  ante-
rior  portion  of  the brain generally present speech marked
with difficulties in the initiation and sequencing of  arti-
culatory movements.  These symptoms are generally considered
to be an integral part of a nonfluent aphasia,  referred  to
as  "Broca's  aphasia." There may also be agrammatism, which
is a reduction  and  simplification  of  grammatical  forms,
including  the  loss of small function words.  These factors
add up to a speech pattern which is halting  and  effortful,
generally termed "dysfluent."

     In contrast, aphasics with posterior lesions  generally
present  with  speech  which is fluent and well-articulated,
though semantically impoverished. These  deficits  are  gen-
erally  considered  to  be an integral part of two posterior
aphasia syndromes (Wernicke's aphasia, Conduction  aphasia).
Wernicke's  aphasia involves a severe impairment of auditory
and written comprehension, and speech which is  semantically
"empty"  and  difficult to understand. There is an abundance
of high-frequency, low-content words (e.g., "thing",  "it"),
and  a  reduction  of  substantive nouns and verbs to convey
meaning. The speech of these patients typically shows verbal
and phonemic paraphasias. There may be a phenomenon known as
"press for speech" (or "logorrhea") in  which  the  patient,
even  in a conversation setting, produces copious amounts of
speech without stopping. The syndrome of Conduction  aphasia
is  qualitatively  similar to Wernicke's aphasia in a number
of respects, however, the chief difference is  that  repeti-
tion is greatly impaired in relation to the level of fluency
of spontaneous speech.

     Traditional clinical descriptions of  aphasia  consider
the errors in speech produced by anterior (Broca's) aphasics
to reflect phonetic  or  articulatory  errors,  whereas  the
errors  of posterior (e.g., Wernicke's) aphasics are thought
to originate at the phoneme planning level. In recent years,
fine-grained  acoustic  analyses  have  uncovered additional
data which generally support this dichotomy. With respect to
anterior aphasic subjects, these data all suggest impairment
in the timing or integration of movements of  the  articula-
tory  system. These anterior aphasic impairments do not seem
to reflect a global weakness or discoordination of the arti-
culators.  Rather,  articulatory  disabilities  appear to be
best characterized as affecting two "independent"  articula-
tors,  e.g.,  coordinating  vocal  fold  vibration  with the
tongue tip release of stop consonants.

Coarticulation in aphasia

     Ziegler and von Cramon (1985, 1986, 1986b), based  upon
acoustic  and perceptual studies of German-speaking anterior
aphasic subjects, argue that anterior aphasia may involve  a
"delayed  onset  of  anticipatory vowel gestures relative to
the labial occlusion." A similar conclusion was  tentatively
reached  by Tuller and Story (1986), who conducted an acous-
tic analysis of coarticulatory information  present  in  the
speech of English-speaking anterior aphasic subjects.

     In contrast, Katz (1987, 1988)  conducted  an  acoustic
and  perceptual  study  of  English-speaking  aphasics which
showed a "mixed" pattern of  results  for  anterior  aphasic
subjects.  The  acoustic  data showed no differences between
the coarticulation present in normal  and  anterior  aphasic
speech,   while   the  perceptual  data  indicated  somewhat
degraded coarticulatory information in the speech  of  ante-
rior  aphasic  subjects.  Moreover, group differences in the
perceptual data varied as a function of  stimulus  type.  It
was  concluded that a "uniform delay" in coarticulation does
not adequately characterize anterior aphasic speech. Rather,
coarticulatory "planning" was considered to be intact, while
the degree of actual coarticulatory behavior was  considered
to  vary  as  a function of the complexity of the motor ges-
tures involved.

     Little direct kinematic exists concerning  this  issue.
Sussman,  Marquardt,  MacNeilage,  &  Hutchison  (1988) have
reported kinematic findings concerning labial coarticulation
in  aphasia. It was found found that for correct productions
of CV, CCV, and VC CCV stimuli,  anterior  aphasic  subjects
exhibited  lip  and  jaw  coarticulatory behavior similar to
normal subjects.

     With respect  to  velar  coarticulation,  a  series  of
kinematic  experiments  have been conducted by Itoh and col-
leagues in Japan.  Using both fibroscopic  (Itoh,  Sasanuma,
and  Ushijima,  1979)  and X- ray microbeam (Itoh, Sasanuma,
Hirose, Yoshioka,  and  Ushijima,  1980)  techniques,  these
authors  analyzed  the speech of a Japanese- speaking, ante-
rior aphasic subject. The results showed  a  great  deal  of
variability in the apraxic patient's speech kinematics, par-
ticularly with respect to the successional patterns of velar
movement.  However,  despite  occasional  deviations, it was
concluded that anticipatory coarticulation was intact.

     As the preceding, somewhat equivocal pattern of results
indicates,  it  is difficult to obtain a coherent picture of
coarticulation in aphasic speech based solely upon  acoustic
or  perceptual  data.  Rather,  it  is  essential to combine
acoustic and perceptual investigations with direct kinematic
measurement.   Moreover,  studies  which provide information
about the simultaneous motion of several articulators  offer
a greater advantage for understanding coarticulatory impair-
ment than do analyses of  individual  articulator  movement.
For  these  reasons,  Katz,  Machetanz,  Orth,  and Schoenle
(1989c, 1989d) conducted kinematic and acoustic analyses  of
the  speech  of  German-speaking  anterior aphasic subjects.
This work made use of the recent technology  of  electromag-
netic  articulography, which affords real-time, simultaneous
tracking of several articulators in the vocal tract.   Also,
by  using German-speaking subjects, it was possible to study
lip-rounding contrasts which  did  not  involve  changes  in
tongue position (as found in English).

Experiment

                           METHOD

Subjects

     Subjects included two anterior aphasic and two  normal,
control  speakers.  All  subjects  were  adult, right-handed
native speakers of German,  from  similar  dialect  regions.
Aphasic  subjects  presented  with  single,  clearly defined
anterior lesions (see Katz et al., 1989c for  details),  and
were  classified based upon clinical exam and speech pathol-
ogy assessment. Control subjects had no history of neurolog-
ical disease, and no speech or language difficulties.

Procedure

     Speech kinematics were measured using  the  electromag-
netic  articulography  system developed at the University of
Goettingen Department of Clinical Neurophysiology (Schoenle,
Grabe,  Wenig, Hohne, Schrader, & Conrad, 1987). This device
allows for the simultaneous recording of multiple points  in
and  outside  of  the vocal tract. Subjects were seated in a
quiet testing room and were fitted with a helmet  containing
three  magnetic  transmitter  coils.   Minute receiver coils
were attached to the upper lip (UL) and lower lip (LL)  (for
the  labial  study) and to the tongue tip and velum (for the
nasal study). A computer was used to sample positional  data
for two receiver coils, and to record acoustic data.

Speech material

     Speakers  were  asked  to  produce  real  word  stimuli
designed  to  probe the timing of anticipatory labialization
during consonant production  and  anticipatory  nasalization
during  vowel  production.  There was a total of 10 stimulus
items   (4   labial,   6   nasal).   The   labial    stimuli
([g*li:g*]/[g*ly:g*];  [g*lez*]/[g*lo:z*])  consist  of word
pairs contrasting minimally in the rounding feature  of  the
vowel   following  the  consonant  [l].  The  nasal  stimuli
([ti:g*]/[tingl]; [ti:d*]/[ti:n*]; [ti:b*]/[ti:m*])  consist
of  word  pairs  differing  in the nasality features of each
word's initial vowel and middle consonant. The nasal stimuli
were  also selected to represent velar, alveolar, and labial
place of articulation for middle (word-internal) consonants.
All  stimuli were embedded in the carrier phrase: "Ich sagte
_______ zweimal" ("I said ________ twice").

     Subjects' productions were perceptually screened by the
researchers,  and  speech  errors  were identified. Detailed
classification of speech errors are listed in  Katz  et  al.
(1989c).   For the normal control subjects, no speech errors
were noted. For the aphasic subjects, a total of  22  speech
errors were detected (= 7.3% of the 300 total repetitions by
aphasic subjects). Errors  were  separately  classified  for
further  analysis.  From  a total of 15 repetitions recorded
for each stimulus type (per speaker), the first ten  correct
repetitions were used for kinematic and acoustic analyses.

Analysis

     Kinematic data were analyzed from  graphic  representa-
tions  of  articulator  position and tangential velocity. In
addition,  quantitative  records  of  speech   timing   were
obtained  using  interactive  software  designed  for speech
movement analysis.

     Speech acoustics were analyzed using the a speech  pro-
cessing  program for microcomputer. Speech samples were low-
pass filtered and digitized.  Speech  segment  regions  were
identified  in  the waveform from an oscillographic display,
and segment durations were recorded. Due to the known diffi-
culty  of  analyzing  cues for vowel nasalization present in
prevocalic consonantal  spectra  only  labial  stimuli  were
analyzed.  For the labial [g*lVg*] stimuli, 5 segments ([g],
schwa, [l], vowel, final [g*]) were delineated (see Katz  et
al., 1989c for details).

     For each stimulus, an analysis window was  placed  over
specific  areas  of each segment. Spectra were then obtained
using Fourier analysis and linear predictive  coding  (LPC).
Analysis of anticipatory lip-rounding focussed upon spectral
peaks in the liquid portion of the waveform anticipating the
second formant of the vowel. Additional details are provided
in Katz et al., 1989c.

Kinematic Results

I. Correct productions

A. Labial stimuli

i. Articulator displacement

     Lip rounding may be  characterized  in  terms  of  both
extension   and  vertical  movement  (raising  or  lowering,
depending upon the region of  the  lip  examined,  and  upon
individual subject characteristics).  The two control speak-
ers produced robust UL protrusion (extension  and  lowering)
for rounded (as compared with unrounded) stimuli.  These lip
protrusion gestures were rapid, concise,  and  were  clearly
related to production of the rounded vowel in the utterance.
In addition, speaker PS  showed  substantial  extension  and
lowering  of  LL  (and  jaw)  for both stimuli series, while
speaker EO demonstrated only slight LL (and jaw)  protrusion
for [g*ly:g*] as compared with [geli:ge].

     Considering next the anterior aphasic data, UL  protru-
sion was also found for rounded (as compared with unrounded)
stimuli. The overall time course of these gestures  differed
somewhat between the two aphasic subjects. Speaker AW showed
rapid UL movements corresponding with production of  rounded
vowel  segments  (i.e.,  resembling  the data of the control
subjects). In contrast, speaker EG showed  more  gradual  UL
movements, with a much lesser amount of net displacement. In
terms of LL (and jaw) movement, both aphasic speakers showed
articulator  protrusion  (extension and raising) for rounded
(as compared with unrounded) stimuli.

     In order to determine the extent to which speakers dif-
fered  in  the regularity of articulator movement, a measure
of item-to- item displacement variation  was  computed  (see
Katz  et  al.,  1989c).  The aphasic speakers showed greater
overall variation than the control speakers,  with  particu-
larly  high  variation  for  LL  (and jaw) displacement. The
results of two-way (Group x Articulator) analyses  of  vari-
ance  (ANOVA)  confirmed  that these patterns were statisti-
cally significant.

ii. Articulator timing

     Because it was  considered  important  to  examine  the
extent  to  which  anticipatory  coarticulation  varied with
speaking rate, the overall duration of speakers'  utterances
were  analyzed.  It was found that the aphasic speakers pro-
duced slower speech, with greater variation in segment  tim-
ing,  than  is  found  in the speech of normal, control sub-
jects. Further analysis of aphasic speakers'  temporal  pat-
terns  indicated  that these subjects showed a prevalence of
intersyllabic pauses, as well as occasional segment  prolon-
gations  (particularly  for  vowels)  in their speech. These
findings are in accord with previous descriptions of  verbal
apraxic speech.

     In order to analyze the point of anticipatory coarticu-
lation  onset,  individual  repetitions  were inspected with
attention paid to the exact point in the  acoustic  waveform
at  which  kinematic, coarticulatory effects could be noted.
The results demonstrated that lip protrusion  was,  for  all
subjects,  confined  to  a  region  proximate to the rounded
vowel. The beginning of labial  protrusion  generally  began
either  shortly before, or during production of the syllable
[g*], i.e.,  the  syllable  preceding  that  containing  the
rounded vowel.

     Although speakers were quite consistent across  repeti-
tions  (with  control  subjects  showing greater consistency
than aphasics),  there  were  notable  Speaker-  and  Group-
dependent differences in coarticulation onset. These indivi-
dual patterns  are  discussed  at  length  in  Katz  et  al.
(1989c).  In  general, the data may be summarized as showing
that anterior aphasic subjects show more variation in  their
onset positions, although this does not fit the pattern of a
uniform delay. Rather, aphasic speakers' variation generally
involved  unusually  early  lip  protrusion in comparison to
that found for control subjects.

B. Nasal stimuli

i. Articulator displacement

     The two control speakers showed little movement of  the
velum   during  production  of  the  non-nasalized  stimuli,
whereas robust velar port opening (i.e., velar extension and
lowering)  was  observed  for  productions  of the nasalized
stimuli. In comparison with the normal speakers, the aphasic
speakers  showed  highly impaired patterns of velar movement
(detailed in Katz et al.,  1989c).  These  impairments  were
more  marked  for  aphasic  subject  EG than for subject AW.
However, despite these imprecise movement patterns,  aphasic
subjects  showed  clear  evidence  of  correct, anticipatory
velar port opening before nasalized consonants.

     As with the labialization stimuli, variation  in  velum
displacement  was quantified by calculating the RMS distance
between  individual  utterances  and  averaged  displacement
waveforms.    Statistical  analysis  revealed  that  aphasic
speakers produced greater overall  variation  in  velum  and
tongue displacement than the control speakers. This was par-
ticularly true for the velar movement of aphasic subject EG,
who  showed  the  opposite pattern (i.e., greater velar than
lingual variation). Taken together, these data indicate that
the two aphasic speakers, considered as a "group", were more
variable than controls, and that aphasic subject EG showed a
particularly high degree of velar movement variation.

ii. Articulator timing

     The time course of correct  velar  stimuli  productions
was  investigated  by conducting comparisons of the acoustic
and kinetic data. As with the  labialization  results,  sub-
jects  were  found  to be quite consistent from utterance to
utterance, though aphasics  showed  greater  variability  in
onset   position.  Both  control  speakers  showed  context-
dependent differences at a region  of  the  speech  waveform
located  between the end of the aspiration following the [t]
segment, and the first 40 ms of  vowel  pulsing.  Similarly,
the  first  noticeable  context-dependent  difference in the
movement traces  of  the  aphasic  subjects  (i.e.,  visible
changes  in  the  rates  of  velar lowering) was noted to be
within this same  temporal  region.  In  other  words,  even
though  the  displacement  of velar position was noted to be
qualitatively different and more variable for aphasic speak-
ers,  the aphasic subjects appeared to initiate nasalization
gestures at approximately the  same  point  in  time  during
speech as normal controls. These data suggest that, although
the overall ability to control spatial  positioning  of  the
velum was clearly compromised for aphasic speakers, temporal
aspects of velar,  coarticulatory  movement  appeared  rela-
tively preserved.

II. Error-prone productions

     An investigation was made into the  claim  that  speech
production  errors  perceptually  resembling phone substitu-
tions might in fact be the result of discrete,  interarticu-
latory  phasing  difficulties  (e.g.,  Mlcoch  & Noll, 1980;
Ziegler and von Cramon, 1985). There were, however, very few
speech  errors  in the database containing the target struc-
tures of interest. These consisted  of  instances  in  which
aphasic  subject  EG  produced substitutions of "[tid*]" for
[tin*] targets.  For these cases, there were found to be [d]
stop  consonant  bursts in the acoustic waveforms. Kinematic
analyses showed that in two of  the  cases,  velar  movement
more  closely resembled correctly-produced [d] than [n] ges-
tures, while in the third  case  the  displacement  patterns
fell  midway  between  those  typical of nasal and non-nasal
consonants. In  sum,  these  data  do  not  suggest  that  a
slightly  mistimed [n] production resulted in a [d] percept,
but rather indicate that these errors likely  resulted  from
phone selectional ("phonemic") difficulties.

Acoustic Results

I. Correct productions (labial stimuli)

Vowel-anticipatory peaks in [l] spectra

     Data for the vowel pairs [i:]/[y:] and  [e:]/[o:]  were
grouped  for  comparison of coarticulatory effects. For both
sets of stimuli, the distribution of peaks  for  productions
by aphasic subjects was more variable than for those of nor-
mal subjects. This was particularly true for the higher fre-
quency  regions  (above 2 kHz).  Aphasia-related generaliza-
tions about coarticulatory shift were difficult to make  for
[i:]/[y:],  because  the  effect  did not seem clearly esta-
blished for the two normal subjects in  the  F2  range,  and
because  data  from  the aphasic speakers showed substantial
variation in the F3 frequency range. For [e:]/[o:], spectral
peaks in the F2 frequency range showed evidence of coarticu-
latory effects for both aphasic and control  subjects.  Con-
sidering both stimulus sets together, aphasic speakers' pro-
ductions appear  to  provide  evidence  for  vowel  context-
dependent  spectral shift in a manner similar to (or exceed-
ing that of) productions by normal subjects.

Discussion

     Although the present data must  be  considered  prelim-
inary  because of the small number of subjects investigated,
the findings address a number of important issues concerning
the neurological bases of speech production. To briefly sum-
marize the results, the kinematic data showed that for  both
labial  and  nasal  (correct) productions, aphasic speakers'
coarticulatory patterns were more highly variable than those
of  control subjects. These differences, however, were noted
chiefly for spatial displacement characteristics, while  the
temporal aspects of articulator movement involved in antici-
patory coarticulation appeared largely intact. It  was  also
found that velum mistiming did not appear to explain a small
corpus of stop/nasal substitution errors produced by one  of
the aphasic speakers.

     The acoustic data  largely  agree  with  the  kinematic
findings.  To the extent that vowel formant frequency energy
could be traced  back  into  the  portion  of  the  waveform
corresponding  to the prevocalic consonant, [l], the correct
productions of the anterior aphasic subjects showed patterns
of  labial  anticipation  similar  to (or exceeding that of)
normal speakers.

     A major empirical issue which these data  address  con-
cerns  whether  anterior  aphasic  subjects  show systematic
delays in  the  production  of  anticipatory  coarticulation
information  during speech. With respect to labial anticipa-
tion, uniform coarticulatory delays on the order of 20-30 ms
have  been  proposed  as a possible speech characteristic of
German (Ziegler and von Cramon,  1985;  Ziegler,  1989)  and
English-speaking  (Tuller  and Story, 1987) anterior aphasic
subjects. In contrast, Katz (1987, 1988) and Sussman et  al.
(1988)  have provided evidence suggesting that temporal con-
trol of anticipatory labial coarticulation is largely intact
in  anterior aphasic speakers' correct productions. The data
from the present experiment replicate the findings of  Suss-
man  et  al.  (1988),  and support the acoustic data of Katz
(1987, 1988), in that anterior aphasic subjects  were  found
to  produce  coarticulatory gestures as early as (or earlier
than) matched control speakers.  These data suggest that  if
listeners  show  uniform delays in picking up coarticulatory
information present in anterior aphasic speech, this may  be
due to complicating factors other than actual coarticulatory
cues (see Katz et al., 1989c for discussion).

     With  respect  to  anticipatory  velar  movement,   the
present study addresses the question of whether discoordina-
tions in velar movement might correspond  to  coarticulatory
impairments  (e.g.,  Mlcoch and Noll, 1980) and whether such
discoordinations  might  also   account   for   perceptually
apparent  "substitution"  errors  in  the speech of anterior
aphasic subjects (Itoh et al., 1979, 1980; Ziegler  and  von
Cramon,  1986b).  The present data replicate the findings of
Itoh et al. (1979, 1980), in that  they  demonstrate  essen-
tially  intact  anticipatory  coarticulation  in the correct
speech of anterior aphasic subjects.

     As for theories concerning aphasics'  error-prone  pro-
ductions, the present findings do not rule out the possibil-
ity that interarticulatory phasing difficulties may  account
for occasional substitution errors. In the present database,
however, there was very little evidence  for  this.  Of  the
three stop/nasal substitution errors examined, all contained
clearly  identifiable  stop  consonant  bursts,   suggesting
selectional   ("phonemic")   rather  than  interarticulatory
discoordination in motor output. Only one of the three  sub-
stitution  errors showed kinematic patterns allowing for the
possibility  of  interarticulatory  discoordination   (i.e.,
velar  displacement patterns midway between those typical of
nasal and non-nasal consonants).

     A key theoretical aim of this research  is  to  explore
how  patterns of coarticulation in aphasic speech can inform
models of normal speech production. With respect to  locali-
zation  of  function  issues,  the present data suggest that
anticipatory coarticulation capabilities of the adult  brain
do  not  critically  rely  upon  anterior  structures (e.g.,
Broca's area). Rather, it appears that anterior regions  are
involved  in coordinating the timing of the articulators for
producing individual phones (e.g., VOT values for stop  con-
sonants)  and possibly single syllables (see Kimura and Wat-
son, 1989), but not for effecting  anticipatory  transitions
between phones. If future experimentation confirms the find-
ing that highly automatized behavior  such  as  anticipatory
coarticulation  does  not  require the integrity of anterior
structures  in  adult  subjects,  then  models   implicating
specific  anterior  regions  (e.g., Broca's area) as general
"speech programming" centers will  certainly  require  revi-
sion. It may instead be the case that anticipatory coarticu-
lation is a property more  globally  represented  throughout
the  language  "zone" of the brain (i.e, dominant peri- Syl-
vian cortex and subcortical structures). This type  of  dif-
fuse  neural representation has been proposed for other pro-
perties of language, such as the representation  of  indivi-
dual lexical items (Ojemann, 1983).

     The present findings agree with  the  traditional  view
that  anterior  aphasics demonstrate problems chiefly at the
"phonetic" level, while posterior  aphasic  evidence  mainly
"phonemic"   (selectional)   deficits.   That  is,  anterior
aphasics appear  to  have  difficulty  in  interarticulatory
coordination,  which  impinges  on their ability to initiate
and produce a variety of speech sounds.  The problem is  not
in  phoneme  selection, it is in outputting selected sounds.
In a similar  fashion,  it  may  be  assumed  that  anterior
aphasics retain representations containing information about
the coarticulatory  spread  of  featural  information.  Such
representations would be qualitatively similar to the "boun-
dary conditions" yielded as the output of the Jordan  (1986)
model.  In  keeping  with  this  view, one could reason that
where the system fails for anterior aphasics is in the  map-
ping of coarticulated representations into motor output.

     Future investigations might  examine  these  issues  by
"lesioning"  models  of coarticulation in speech production,
and observing the  manner  in  which  feature  spreading  is
affected.  Results from "lesioned" connectionist models have
recently been used by investigators  to  simulate  cognitive
breakdown  in  adult  aphasia  (Gigely,  1988)  and acquired
dyslexia (Hinton & Shallice, in press).  The  present  data,
however,  suggest  that  "lesion"  experiments  using models
similar to Jordan (1986) would be most relevant to  deficits
in   posterior  aphasic  speech,  i.e.,  deficits  involving
"phonemic" (selectional) errors.  In  order  to  best  model
anterior  aphasic  speech,  it is essential to first develop
connectionist models which incorporate information about the
kinematic properties of the articulators. Models designed to
capture the "lower-level" inertial properties of the articu-
lators  are  currently  under  development  by  a  number of
researchers  (e.g.,  Browman  and  Goldstein,  1985;  Kelso,
Saltzman, and Tuller, 1989). [1]
____________________
  1  These models  assume  that  the  articulators  may  be
viewed as a series of mass-spring oscillators, whose "dynam-
ic" patterns may be described  mathematically  in  terms  of
system  constraints  upon  oscillatory  properties.  Because
"dynamic" models offers a high degree of mathematical rigor,
they have been viewed favorably by a number of connectionist
researchers.  However,  these  models  remain  controversial
within the speech research community. For example, see Kell-
er (in press) for an alternative view.
_____________________


     Additional information about the role of  brain  struc-
tures  in speech motor programming might be obtained by com-
paring the present results with data concerning the develop-
ment of coarticulatory patterns in children. Recent findings
have suggested that the ability  to  sequence  intersyllabic
anticipatory  coarticulation  information is present from an
early age, and may be more extensive in young children  than
in  older  children  (Nittrouer,  1989a; 1989b; Katz et al.,
1989a). These data suggest that intrasyllabic coarticulation
patterns might develop during early stages of brain develop-
ment, at which point  they  are  relatively  susceptible  to
disruption. However, once mature coarticulatory capabilities
are established, they may be far less susceptible to disrup-
tion,  even  in the face of massive damage to anterior brain
structures.


                      Acknowledgments

This research was sponsored by grant NS 08176-01A1  to  Wil-
liam  Katz,  University  of  Goettingen/DFG  grant to Jochen
Machetanz, and BMFT grant 0706839A to Paul-Walter  Schoenle.
The author thanks  Karen  Yummier,  Virginia  Marchman,  and
Teenie Matlock for their helpful comments.

                       References

Browman, C. & Goldstein, L. (1985). Dynamic modeling of phonetic
     structure.In V. Fromkin (Ed.) Phonetic Linguistics. New York: 
     Academic Press.

Browman, C. & Goldstein, L. (1986). Towards an  articulatory
     phonology.  Phonology Yearbook 3, 219-252.

Fowler, C.A. (1980).  Coarticulation  and  theories  of
     extrinsic timing. J. Phonetics 8, 113-133.

Fowler, C. (in press). Some regularities in speech  are not 
     consequences of formal rules. A commentary on Keating. In J.   
     Kingston  and M.  Beckman  (Eds.)  Papers in Laboratory Phonetics, 
     I. London: Cambridge University Press.

Gigely, H. Process synchronization, lexical  ambiguity, and ambiguity
     and aphasia.  In S. Small, G. Cottrell, M. Tannenhaus (Eds.) Lex- 
     ical ambiguity  resolution: Perspectives  from  psycholinguistics,  
     neuropsychology,  and  artificial  intelligence.  San  Mateo,
     Calif.: Morgan Kaufmann Publishers, Inc.

Goodglass, H. and Kaplan, E. (1982). The assessment  of aphasia and
     related disorders. Philadelphia: Lea and Febinger.

Hinton, G. and Shallice, T. (1989). Lesioning a connectionist network:
     Investigations of acquired dyslexia.  CRG  Technical  Report
     89-3,  University of Toronto, Canada.

Hinton, G. & Sejnowski, T. (1983).  Optimal  perceptual inference.  
     Proceedings of the IEEE Computer Society Conference on  Computer  
     Vision and Pattern Recognition, 448-453.

Hewlett, N. (1988). Acoustic properties of /k/ and  /t/ in normal and
     phonologically disordered speech. Clinical Linguistics and  Phonetics,  
     Vol. 2., No. 1, 29-45.

Itoh, M., Sasanuma,  S.,  Ushijima,  T.  (1979).  Velar movements during
     speech in a patient with apraxia of speech. Brain and Language,  7,
     227-239.

Itoh, M., Sasanuma, S., Hirose, H., Yoshioka, H., Ushijima, T. (1980).   
     Abnormal articulatory dynamics in a patient with apraxia  of  speech:
     X-ray  microbeam  observation.  Brain and Language, 11,
     66-75.

Jordan, M. (1986). Serial order: A parallel distributed processing
     approach.(Tech. Rep. 8604). La Jolla: University  of  California  
     San Diego, Institute for Cognitive Science.

Katz, W.F.  (1987).  Anticipatory  labial  and  lingual coarticulation in  
     aphasia. In J. Ryalls (Ed.), Phonetic Approaches to Speech Production
     in  Aphasia  and Related Disorders. San Diego: College-Hill Press.

Katz,  W.F.  (1988).  Anticipatory  coarticulation   in aphasia: 
     Acoustic and perceptual data. Brain and Language, 35, 340-368.

Katz, W.F., Kripke, C., and Tallal, P. (1989a). Anticipatory labial
     coarticulation in the speech of young children (age 3 to 8).
     Submitted.

Katz, W.F., Kripke, C., and Tallal, P. (1989b). Anticipatory labial
     coarticulation in the speech of normal and language-impaired
     children. Submitted.

Katz, W.F., Machetanz, J., Orth, U.  and  Schoenle,  P. (1989c) A
     kinematic analysis of  anticipatory  coarticulation  in  the
     speech  of  anterior aphasic subjects using electromag-
     netic articulography. Submitted.

Katz, W.F., Machetanz, J., Orth, U.  and  Schoenle,  P. (1989d)
     Anticipatory labial coarticulation  in  two  German-speaking
     anterior    aphasic    subjects:   Acoustic   analyses.
     Manuscript.

Keating, P. (in press). The window model of coarticulation:
     Articulatory evidence. In J. Kingston and M. Beckman  (Eds.)
     Papers  in  Laboratory  Phonetics, I. London: Cambridge
     University Press.

Keller, E. (in press). Speech  motor  timing.  In  W.J. Hardcastle &
     Marchal (Eds.) Speech production and speech modelling.

Kelso,  J.A.,   Saltzman,  E.L.,  and  Tuller,  B.  (in press).  The
     dynamical perspective on speech production: Data and theory.
     Journal of Phonetics 14, 29-59.

Kimura, D., & Watson, N. (1989). The  relation  between oral movement
     control and speech. Brain and Language, 37 (4), 565-590.

Lubker, J.F. and Gay, T. (1982). Anticipatory labial coarticulation:
     Experimental, biological, and  linguistic variables.
     Journal of the Acoustical Socety of America, 17(2), 437-447.

Mlcoch and Noll, J.D. (1980). Speech production  models as related 
     to the concept of apraxia of speech.  In  N.J.  Lass  (Ed.),
     Speech  and  language:  Advances  in basic research and
     practice, Vol. 4, 201-239.

Nittrouer, S., Studdert-Kennedy, M. and  McGowan,  R.S. (1989a).  
     The emergence of phonetic segments: Evidence from  the  
     spectral structure  of fricative vowel syllables spoken by 
     children and adults. Journal of Speech and Hearing Reseach, 
     32,  No.  1,  pg. 120-132.

Nittrouer, S. and  Whalen  D.  (1989).  The  perceptual effects of 
     child-adult differences in  fricative-vowel  coarticulation.
     J.  Acoust. Soc. Am. 86 (4), 1266-1276.

Ojemann, G. (1983).  Brain  organization  for  language from the
     perspective of electrical  stimulation  mapping.  Behavioral
     and Brain Sciences, 189-230.

Repp, B. (1986). Some observations on  the  development of
     anticipatory coarticulation. J. Acoust.  Soc.  Am.  79  (5),
     1616-1619.

Rumelhart, D.E., Hinton, G.E., & Williams, R.J. (1985). Learning
     internal representations by error  propagation  (Tech,  Rep.
     8506).   La  Jolla: University of California San Diego,
     Institute for Cognitive Science.

Sharf, D. and Ohde, R. (1981). Physiological,  acoustic and 
     perceptual aspects of coarticulation: Implications  for  
     the remediation  of  articulatory  disorders. In N. J. Lass
     (Ed.), Speech and language: Advances in basic research,
     Vol. 5, New York: Academic Press.

Schoenle,  P.W.,  Grabe,  K.,  Wenig,  P.,  Hohne,  J., Schrader, J., 
     Conrad, B. (1987). Electromagnetic  Articulography:  Use  of
     alternating  magnetic  fields for tracking movements of
     multiple points inside and  outside  the  vocal  tract.
     Brain and Language, 31, 26-35.

Sereno, J.A., Baum, S.R., Marean, G.C., and P.  Lieberman (1987).
     Acoustic analyses and perceptual data on anticipatory labial
     coarticulation  in  adults and children. J. Acous. Soc.
     Am. 81, 512-519.

Sussman, H., Marquardt, T., MacNeilage, P.,  Hutchison, J. (1988).
     Anticipatory coarticulation in aphasia: Methodological  con-
     siderations. Brain and Language, 35, 369-379.

Tuller, B. and Story, R. (1987). Anticipatory coarticulation in
     aphasia. In J. Ryalls (Ed.), Phonetic Approaches  to  Speech
     Production in Aphasia and Related Disorders, San Diego:
     College-Hill Press.

Ziegler, W., von Cramon, D. (1985) Anticipatory coarticulation in
     a patient with apraxia of speech.  Brain  and  Language  26,
     117-130.

Ziegler and Von Cramon (1986a).  Disturbed  coarticulation in apraxia
     of speech: Acoustic evidence. Brain and Language, 29, 34-47.

Ziegler and Von  Cramon  (1986b).  Timing  deficits  in apraxia of
     speech. European  Archives of  Psychiatry  and  Neurological
     Sciences, 236, 44-49.

[CRL Newsletter Home Page]
[CRL Home Page]
Center for Research in Language
CRL Newsletter
Article 4-1