|
|
|
|||
| Home Help Feedback Subscriptions Archive Search Table of Contents | ||||
First published online May 30, 2008
Journal of Experimental Biology 211, 1978-1991 (2008)
Published by The Company of Biologists 2008
doi: 10.1242/jeb.014092
Two-voice complexity from a single side of the syrinx in northern mockingbird Mimus polyglottos vocalizations
1 Department of Biology, Jordan Hall, Indiana University, Bloomington, IN 47405,
USA
2 School of Medicine, Jordan Hall, Indiana University, Bloomington, IN 47405,
USA
3 National Center for Voice and Speech, 1101 13th Street, Denver, CO 80204,
USA
4 Program in Neural Science, Jordan Hall, Indiana University, Bloomington, IN
47405, USA
* Author for correspondence (e-mail: szolling{at}indiana.edu)
Accepted 9 April 2008
| Summary |
|---|
|
|
|---|
Key words: birdsong, mockingbird, nonlinear phenomena, song learning, vocal production
| INTRODUCTION |
|---|
|
|
|---|
Understanding the vocal mechanisms responsible for the spectral and temporal complexity of bird song can provide a valuable insight into the performance constraints and evolution of song diversity. A major source of acoustic complexity in songbirds comes from the versatility of motor control of their duplex vocal organ, but in some species the intrinsic biomechanical properties of the vibratory sound generating structures in the vocal organ may also contribute significantly to song complexity. In this paper, we attempt to assess the contribution each of these sources makes to the vocal diversity of song by the northern mockingbird.
Sources of acoustic complexity
The bipartite syrinx
The songbird's vocal organ, the syrinx, is composed of modified cartilages
at the cranial end of each primary bronchus and the caudal end of the trachea.
Each bronchus contains a pair of fleshy pads, the medial and lateral labia
(King, 1989
), which vibrate
when adducted into the respiratory airflow
(Goller and Larsen, 2002
) and
provide the bird with two independently controlled sound sources, one in each
bronchus (Suthers, 1990
). The
acoustic properties of song generated by each pair of labia are controlled by
the activity of ipsilateral syringeal muscles innervated by the
tracheosyringeal branch of the hypoglossal nerve
(Goller and Suthers, 1996a
;
Goller and Suthers,
1996b
).
Spectrographic analyses of birdsong
(Borror and Reese, 1956
;
Greenewalt, 1968
;
Stein, 1968
;
Thorpe, 1961
) revealed the
presence in many species of two, simultaneous non-harmonically related
frequencies. Greenewalt (Greenewalt,
1968
) referred to these as `two-voice' phenomena and hypothesized
the voices originated on opposite sides of the syrinx, a view supported by
subsequent experiments showing varying degrees of song lateralization
following unilateral section of the tracheosyringeal branch of the hypoglossal
nerve (Floody and Arnold,
1997
; Lemon, 1973
;
Nottebohm, 1971
;
Nottebohm and Nottebohm, 1976
;
Suthers, 1990
;
Suthers et al., 2004
;
Williams et al., 1992
).
A more detailed understanding of the acoustic contribution each sound
source makes to oscine song has come from techniques for recording airflow
through each side of the syrinx, together with syringeal and respiratory motor
activity during spontaneous song with both sides of the syrinx intact
(Suthers, 1990
). These data
show that songbirds exploit their dual sound source to increase vocal
virtuosity in multiple ways, including the production of two-voice elements,
switching phonation between sides to produce abrupt frequency steps between
notes and taking advantage of lateralized functional specializations in the
acoustic properties of each sound source (reviewed in
Suthers, 1999
;
Suthers and Goller, 1997
;
Suthers and Zollinger,
2004
).
Syringeal nonlinear dynamics
In addition to vocal production by coordinated neuromuscular control of the
syrinx and respiratory system, there is evidence that intrinsic, passive,
biomechanical properties of syrinx, particularly the dynamic vibratory
properties of the paired oscillators (the medial and lateral labia) comprising
each sound source, can also result in the production of complex sounds
(Fee, 2002
;
Fee et al., 1998
;
Mindlin and Laje, 2005
).
Recent experiments have demonstrated that the labia, much like the vocal folds
in the mammalian larynx (Berry et al.,
1996
; Herzel et al.,
1994
), comprise a nonlinear physical system
(Fee, 2002
;
Fee et al., 1998
). As in any
such system, the oscillating masses predictably exhibit certain traits or
behaviors including abrupt bifurcations between different vibratory modes.
Four acoustic phenomena are associated with nonlinear systems
(Wilden et al., 1998
). These
`nonlinear phenomena' (NLP) include frequency jumps, subharmonics, biphonation
and deterministic chaos. In addition to the NLP recorded from excised syrinxes
of zebra finches (Fee et al.,
1998
), NLP have been identified in the natural vocalizations of a
diverse set of species, including non-songbirds such as doves
(Beckers and ten Cate, 2006
)
and parrots (Fletcher, 2000
),
frogs (Suthers et al., 2006
),
terrestrial and marine mammals (Herzel et
al., 1995
; Riede et al.,
2007
; Riede et al.,
2000
; Riede et al.,
2004
; Riede et al.,
1997
; Titze et al.,
1993
; Tokuda et al.,
2002
; Tyson et al.,
2007
; Wilden et al.,
1998
).
Identifying the source of acoustic complexity in a two-voice vocal system
Because their vocal organ contains two independent sound sources,
spectrographic analyses alone in songbirds cannot always reliably distinguish
between two voices and some kinds of NLP. Frequency jumps produced by rapidly
switching phonation from one side of the syrinx to the other (see
Allan and Suthers, 1994
) may be
indistinguishable from frequency jumps resulting from bifurcations due to the
nonlinear dynamics of a single pair of oscillators.
In mammals, biphonation is traditionally defined as the simultaneous
appearance of two independent frequencies
(Berry et al., 1996
;
Wilden et al., 1998
). The same
definition was applied by Greenewalt
(Greenewalt, 1968
) to describe
standard two-voice phenomena in songbirds, but it is now clear that though
both examples contain simultaneous harmonically unrelated frequencies, their
physical basis is quite different. In two-voice phenomena each voice is
generated by a separate set of paired oscillators whereas in biphonation
harmonically unrelated frequencies are generated by the nonlinear properties
of a single set of oscillators (Suthers et
al., 2005
).
Additional problems in discerning NLP from two-voice phenomena based solely
on emitted vocalizations were described by Laje, Mindlin and colleagues
(Laje and Mindlin, 2005
;
Laje et al., 2008
), whose
models of source–source and source–tract interactions in the
oscine syrinx demonstrate that interactions of the two sides of the syrinx can
produce acoustic effects resembling those commonly associated with nonlinear
theory, such as subharmonics and biphonation. Their models support earlier
evidence of source–source coupling in the oscine syrinx
(Nowicki and Capranica, 1986
)
and demonstrate that certain complex sounds typically associated with
nonlinear dynamics, such as frequency jumps, subharmonics or biphonation in
birdsong, might also result from acoustic interactions within the syrinx and
trachea of sounds produced on the two sides.
Here we investigate the occurrence of the nonlinear characteristics in spontaneous song of the northern mockingbird, a vocal mimic. By monitoring subsyringeal pressure and airflow through each side of the syrinx we can determine if these nonlinear features are lateralized to one side of the syrinx or produced bilaterally and distinguish them from superficially similar two-voice vocalizations, in order to more accurately estimate their contribution to song complexity.
| MATERIALS AND METHODS |
|---|
|
|
|---|
90 and 120 syllable types at the
time of the experiment. Tutor sounds were chosen or designed for an experiment
on song production [tutoring details are given elsewhere
(Zollinger and Suthers,
2004
|
|
Surgery and data acquisition
Birds were anesthetized by injection of chloropent (4.1
µlg–1 body mass; recipe from Fort Dodge Animal Health,
Overland Park, KS, USA) into the pectoral muscle. A silastic cannula (Dow
Corning Corp, Midland, MI, USA; i.d. 1.02mm, o.d. 2.16mm) was inserted into a
cranial thoracic air sac for measurement of subsyringeal pressure. The cannula
was attached to a miniature piezoresistive pressure transducer (Fujikura
FPM-02PG, Marietta, GA, USA) mounted on a small backpack attached to an
elastic belt fitted around the bird's thorax.
A mid-ventral incision was made between the clavicles in order to expose
the syrinx through an opening in the interclavicular membrane. The rate of
airflow was recorded by a heated microbead thermistor (Thermometrics, Edison,
NJ, USA, BB05JA202) inserted into each bronchus a few semi-rings caudal to the
syrinx. The interclavicular membrane was sealed around the thermistor leads,
which were routed under the skin to the backpack. Surgical methods are
described in more detail elsewhere
(Suthers et al., 1994
;
Zollinger and Suthers, 2004
).
Pressure and airflow signals were transmitted from the backpack on leads that
exited through the top of the cage to signal conditioning instruments (Hector
Engineering, Elletsville, IN, USA) and a multi-channel digital data recorder
(Metrum DataTape RSR512, Littleton, CO, USA). Four signals (vocalization, rate
of airflow through the right and left bronchi, and air sac pressure) were
recorded digitally (40 000 samples s–1
channel–1) onto separate tracks on S-VHS 1/2'' magnetic
tape cassettes (Maxell ST-31BQ SVHS, Fair Lawn, NJ, USA) using the Metrum
recorder. Signals were transferred from tape to microcomputer using a Data
Translation board (DT-2821G) and an antialiasing filter (TTE, J87, St Pete
Beach, FL, USA; 8kHz high cut-off, stopband attenuation 60 dB per 1/3 octave).
An experiment lasted 7–10 days, during which the bird could move freely
about its cage. Vocalizations during experiments were recorded with a
directional condenser microphone (Audio-technica AT835b, Stow, OH, USA)
positioned approximately 50 cm in front of the cage.
Two methods were used to determine what each side of the syrinx contributed
to the song. The first method measured airflow through the syrinx; any air
flowing through one side of the syrinx while the other side was closed
indicated that the sound was produced entirely with the open side. In some
recordings, the thermistors responded to air oscillations up to
2 or 3kHz
produced by the acoustic signal from the ipsilateral side of the syrinx. In
these cases it was possible to determine the sound generated by each side when
both sides were phonating (Suthers,
1990
). The low frequency components of bronchial signals related
to respiratory or phonatory motor patterns were removed post-recording with a
digital 100 Hz Hanning high-pass filter.
Signals were analyzed using Igor Pro v. 5 (WaveMetrics Inc., Lake Oswego, OR, USA) and Adobe Audition v. 1.5 (Adobe Systems Inc., San Jose, CA, USA). Statistical analysis was conducted using Igor Pro v. 5 and SigmaStat v. 2.03 (SPSS Inc., Chicago, IL, USA). Preoperative song (100–300 min per bird) was recorded from adult birds (>300 days post-hatching) for comparison with pre- and post-surgery repertoires (Avisoft-Recorder v. 1.7, Avisoft Bioacoustics, Berlin, Germany).
For each syllable that spectrographically resembled one of the four NLP, we examined the airflow and pressure recordings, and noted the acoustic contribution of each side of the syrinx. A syllable was defined as a sound in which the air sac pressure was negative prior to the sound, positive during the sound production, and negative after completion of the sound (a single expiratory pulse). Vocalizations were first examined for the occurrence of NLP through visual inspection of narrowband spectrograms (1024 points at 40 000 samples s–1, frame duration 25.6 ms, window duration 75%, Hanning window type) and associated power spectra. Each syllable was scored for the presence of the four NLP described above. If a syllable contained more than one type of NLP it was counted in each category.
Identifying nonlinear phenomena
Spectrographic evaluation
We examined 1000 syllables each from four subjects for acoustic evidence of
NLP (frequency jumps, subharmonics, biphonation or deterministic chaos). This
initial sorting of sounds by visual spectrographic examination is not
sufficient to determine the mechanism of production, but was done to identify
potential NLP for further investigation.
Frequency jumps are sudden changes in fundamental frequency (f0) to a higher or lower f0 (Fig. 2A). A frequency jump was defined as a visible, instantaneous (<5 ms silent interval between adjacent frequencies, as measured from the time waveform) step-change in f0.
|
Deterministic chaos (low-dimensional noise) is technically distinguishable
from stochastic noise (high-dimensional chaos) by the number of dimensions
needed to describe it (Tokuda et al.,
2002
). However, the distinction can also be made based on telltale
characteristics visible in narrowband spectrograms
(Wilden et al., 1998
),
including preceding subharmonics (e.g. Fig.
2B) and the presence of harmonic `windows' in otherwise noisy
segments. We classified sounds as `deterministic chaos' if we observed a
broadband, noisy segment in the spectrogram
(Fig. 2C), plus at least two
additional indications, such as a sudden onset of the noise, preceding
subharmonics or harmonic windows within the noise.
|
Mechanism of vocal production
For each putative instance of NLP observed spectrographically, we then
examined concurrent airflow through the right and left sides of the syrinx,
along with subsyringeal air sac pressure. This analysis allowed us to
determine if the occurrence of such sounds could be explained by independent
phonation on the two sides of the syrinx, or if they were produced by a single
source.
In order to test whether the relationships between the occurrence of frequency jumps or subharmonics and changes in either air sac pressure or bronchial airflow were significant, we investigated changes in rates of air flow immediately prior to the bifurcation and at an earlier point in the same syllables. Temporal resolution of the time series was 25 µs. The signal was examined over two 5 ms time intervals (20–25 and 5–0 ms) prior to the bifurcation and 0–5 ms after the bifurcation. Air sac pressure and rate of airflow through the syrinx were normalized to a percentage of maximum flow rate or pressure. The normalized pressure or rate was then regressed against distance (time) to the point of bifurcation. Slopes of these regression lines were tested for differences in variance between groups using SigmaStat 3.11 (Systat Software Inc., San Jose, CA, USA). Because most examples of chaos occurred either for the entire duration of the syllable, or immediately following a period of subharmonics, similar analyses of flow and pressure prior to the onset of chaos were not conducted. Similarly, biphonation usually had a gradual onset or lasted the entire duration of the syllable, precluding a meaningful analysis of flow and pressure fluctuation associated with the bifurcation in these cases.
| RESULTS |
|---|
|
|
|---|
Frequency jumps
Occurrences of frequency jumps
Unilaterally produced frequency jumps
(Fig. 3) occurred in 1.2% of
syllables overall (mockingbird m108, 0.8%; m123, 1.2%; m130, 1.5%; m152, 1.3%)
bilateral frequency jumps were observed in 4.9% of all syllables (m108, 3.3%;
m123, 3.0%; m130, 6.8%; m152, 6.5%). In unilaterally produced frequency jumps,
the fundamental frequency shifts abruptly up or down with a silent interval
<5 ms between adjacent frequencies. During the jumps there is airflow
through only one side of the syrinx, the other side being closed. The change
in frequency due to the jumps ranged from
45 to 450 Hz. Jumps from both a
higher to lower frequency and lower to higher frequency were observed (47.9%
down-jumps, 52.1% up-jumps). Mimicked copies of tutored frequency jumps
between tone-pairs could spectrographically resemble frequency jumps resulting
from nonlinear dynamics; however, these large steps in frequency
(
500–4500 Hz) were always produced bilaterally, by alternating
phonation between the two sides of the syrinx, rather than unilaterally
(Zollinger and Suthers,
2004
).
|
0).
|
|
Physiology and production of subharmonics
The occurrence of subharmonics in mockingbird song did not correlate with
predictable changes in rates of bronchial airflow or pressure
(Fig. 4B). Rates of airflow
were not significantly different, either in their mean or variance, just prior
to the onset of subharmonics, from these values 5 ms after onset or 25 ms
prior to onset, (25 ms prior, mean 0.33±0.929 s.d., variance 0.86; 5 ms
prior, mean 0.79±1.287 s.d., variance 1.66; 5 ms post, mean
0.41±0.736 s.d., variance 0.54). Differences between groups were not
significant (one-way ANOVA, d.f.=2, SS=2.74, MS=1.37, F=1.34,
P=0.268). Airflow could either increase or decrease and the magnitude
of these changes varied. Whereas many of the instances of subharmonics
coincided with an increasing rate of airflow (43.5% of cases), subharmonics
also occurred during periods of decreasing or constant flow rates (30.4% and
26.1% of cases, respectively).
Biphonation and two-voice phenomena
Occurrences of biphonation and two-voice phenomena
The simultaneous presence of two or more independent frequencies was very
common in the acoustic signals of these mockingbirds (68.0% of all syllables;
individually: m108, 69.6%; m123, 62.1%; m130, 94.4%; m152, 45.8%). Sounds
containing two independent frequencies were often produced during bilateral
vocalization indicated by simultaneous airflow through both sides of the
syrinx (63.7% of total syllables, 93.7% of sounds with
2 independent
frequencies), and therefore meet the traditional definition of two-voice
phenomena. When a single f0 with sidebands was observed
concurrent with bilateral airflow, we did not count these as NLP since the
contribution of the two voices to the sound was not clear
(Table 2, SB-2VC column).
Mockingbirds also produced biphonic sounds, i.e. two harmonically unrelated
sounds using one side of the syrinx, the other side being closed (4.3% of
total syllables, 6.3% of sounds with
2 independent frequencies). We found
unilateral biphonation of both types illustrated in the synthesized examples
(Fig. 1A,B). Examples of `type
A' biphonation (Fig. 6) were
less common than `type B' biphonation (Fig.
7, arrow a), but both were observed in mockingbird
vocalizations.
|
|
Since oscines have two theoretically independent sound sources (a pair of vibratory tissues within each side of the syrinx) we looked for examples of two simultaneous biphonic sounds, or `dual biphonation' such as illustrated in Fig. 1C,D. Mockingbirds did produce dual biphonation, or two-voice syllables in which each `voice' was biphonic (Fig. 7, arrow b and Fig. 8). Type B `dual biphonation' (e.g. each side producing a different biphonic sound, each consisting of a fundamental frequency and an independent modulating frequency) represented between 0.8 and 15.1% of syllables examined per bird (7.1±6.35%, mean ± s.d.). Interestingly, in every case of bilateral biphonation, the modulating frequency (m0) of the two voices was the same for both right- and left-produced carrier frequencies (i.e. voices), even when m0 was itself frequency modulated. We did not find any examples of simultaneous bilateral production of type A biphonation.
|
Physiology and production of biphonation and two-voice phenomena
Examples of type A biphonation produced with airflow through only a single
side of the syrinx are shown in Fig.
6, arrows a and b. A power spectrum of the sound at arrow b shows
the first fundamental frequency (f0) at 946 Hz and a
second fundamental frequency (g0) at 1505 Hz. Second
harmonics of f0 and g0 are visible at
1892 Hz and 3010 Hz, respectively. Other peaks correspond to linear
combination products of f0 and g0, as
labelled. At arrow a, the lower frequency (f0) is 990 Hz,
and the higher frequency is 1505 Hz.
An example of type B unilateral biphonation is shown in Fig. 7 (first syllable). Across our sample, the modulating frequencies (m0) in type B biphonation ranged from 57 to 540Hz. For type B unilateral biphonation, the spacing of the sidebands in the frequency domain corresponds to the rate of AM in the sound waveform. Examination of the waveform during the first syllable (top trace, Fig. 7B) shows the period of the AM pattern is approximately 11.5 ms, and sidebands 115 Hz above and below f0 in a power spectrum (Fig. 7C).
Examples of two-voice biphonation are shown in Fig. 7 (second syllable) and Fig. 8. At the start of the second syllable in Fig. 7 the bird opens both sides of the syrinx to produce a two-voice syllable in which each side generates independent fundamental frequencies, and both fundamentals (f0 and g0) are flanked by sidebands. In this second syllable, the sidebands are equal distance from both f0 and g0, and the modulating frequency (m0) is the same for both. Comparing 35 ms segments from the center of each syllable in the pair (Fig. 7B), a 115 Hz modulation pattern is apparent in both. However, compared with the waveform of the unilaterally produced syllable (Fig. 7B, top trace), addition of a second fundamental (g0), and airflow through the second side of the syrinx, results in an additional AM pattern on the sound waveform of 550 Hz (Fig. 7B, bottom trace). This 550 Hz modulation during the two-voice syllable is likely the result of beating (sinusoidal oscillations in amplitude resulting from the linear interaction of two similar frequencies). The observed frequency of 550 Hz amplitude modulation is equal to the difference in frequency of g0–f0 (2695–2145 Hz in Fig. 7).
In another example of two-voice biphonation (Fig. 8), the two carrier frequencies (f0 and g0; representing the left and right voice, respectively) are frequency modulated (FM) in opposite directions. During the first 50 ms of the syllable in Fig. 8A, both the right and left sides of the syrinx are open and the bird sings two converging FM sounds, each with sidebands. Unlike the second syllable in Fig. 7, the two-voice biphonation during the first part of the syllable in Fig. 8 does not result in the appearance of a pronounced beating pattern in the waveform. However, in all cases of two-voice biphonation, m0 was always the same on the two sides of the syrinx, even if the FM pattern of f0 and g0 were opposite (upsweeping or downsweeping, respectively). The source of m0 in mockingbirds is not known.
Deterministic chaos
Occurrences of deterministic chaos
Syllables containing apparent deterministic chaos were present in 8.3% of
syllables (individually, m108, 6.3%; m123, 5.0%; m130, 6.7%, m152; 15.3%).
Chaotic sounds occurred in vocalizations accompanied by unilateral airflow
(Fig. 5) as well as those
produced during bilateral airflow (Fig.
9). Most examples of chaos in the repertoires of these birds were
produced with bilateral syringeal airflow (m108, 2.9%; m123, 4.5%; m130, 5.9%;
m152, 13.2% of syllables sampled), however, examples of unilaterally-produced
aperiodic sounds were also present (1.7% of all syllables sampled for all
birds. Individually: m108, 3.4%; m123, 0.5%; m130, 0.8%; m152, 2.1% of
syllables sampled). In each case, the apparent chaos took the form of
broadband noise with a sudden onset. Chaotic segments were sometimes preceded
by or followed by subharmonics (Fig.
5), and harmonic windows within the `noisy' segment (e.g.
Fig. 9A, arrow b) were commonly
observed.
|
All four of the birds in this study included `loud hew' calls in their song repertoires. We examined the rate of airflow and sound recorded in each bronchus for 67 loud hew calls from the four birds in this study (to increase sample size, we used an additional 50 loud hews that were not in the 4000 syllables used for NLP count). Although always produced with airflow through both sides of the syrinx, the chaotic nature of the hew calls often appeared to be the result of chaotic oscillations from only a single side of the syrinx. Fig. 9 shows the contribution to sound from the right and left sides during hew calls produced by mockingbird m130, as recorded in the right and left bronchus by the thermistors. The frequency response of the thermistors rolls off at about 3 or 4 kHz, so it is not possible to completely rule out a chaotic contribution at higher frequencies on the left side. However, the presence of rapid fluctuations in the air flow rate through the right side (FR in Fig. 9A,B) and their absence in the flow rate on the left side (FL in Fig. 9A,B) suggests that the two sides are behaving differently when producing these chaotic sounds. In most hew calls (51 of 67) the right side alone appeared responsible for most of the aperiodicity, while the left side's contribution was more tonal. In these 51 cases, the rate of airflow in the right side shows an aperiodic, rapid modulation, which is not present in the flow on the left. In 5 of 67 calls, the left side appeared responsible for most of aperiodicity, and a strong aperiodic fluctuation in left flow rate throughout the duration of the call, while the flow rates on the right side were more constant. In 11 of 67 calls, the two sides appeared to contribute equally, or the relative contribution of the right and left sides could not be determined from the bronchial flow signals.
| DISCUSSION |
|---|
|
|
|---|
We found unilateral biphonation of two types in mockingbird song: as two
harmonically unrelated fundamental frequencies, each independently modulated
(Fig. 6), and as one
f0 modulated by a second, lower frequency, appearing as
sidebands parallel to the f0 in a narrowband spectrogram
(Fig. 7). It is important to
note that while sidebands indicate the occurrence of two independent
frequencies, there are several ways in which they may be produced. In
biphonation, sidebands result from one f0 modulated by a
second, lower frequency (Greenewalt,
1968
; Lavenex,
1999
). Experiments with an excised larynx and computer modelling
have shown that the mammalian vocal folds, vibrating asymmetrically, can
produce AM with accompanying sidebands that are the result of linear
combinations of the two fundamentals (e.g.
2g0–f0,
2f0–g0)
(Giovanni et al., 1999
;
Herzel et al., 1995
;
Mergell and Herzel, 1997
;
Neubauer et al., 2001
). In
such cases, the two vocal folds oscillate at slightly different frequencies,
each producing a separate sound pressure wave that is close to the other in
fundamental frequency. If the same phenomenon can occur during oscillation of
the medial and lateral labia in songbirds, biphonic vocalizations might be the
result of either a higher f0 with a lower modulating
frequency (e.g. Fig. 1B), or
the interaction of two higher tones, which are close in fundamental frequency
(Fig. 1A). In either case,
neither unilateral biphonation nor dual (two-voice) biphonation, together with
their syringeal motor correlates, has been previously described in intact
songbirds.
Production and control of nonlinear phenomena
We hypothesize that although production of NLP is a passive biomechanical
process, birds may sometimes exert voluntary control over the conditions under
which it occurs. However, in other cases NLP might be `unintentional,'
resulting from instabilities of the vocal system. The occurrence of NLP in
mammalian vocalizations increases as sound level and frequency increase,
suggesting it may be produced by driving the vocal system to its performance
limit (Riede et al., 2007
) and
past threshold points that mark boundaries between stable vibratory modes.
In a zebra finch excised syrinx preparation, it was demonstrated that a
linear increase in subsyringeal pressure results in bifurcations between
different vibratory modes (Fee et al.,
1998
). The absence of a consistent relationship between the
occurrence of NLP and a detectable increase in rates of bronchial air flow,
subsyringeal air sac pressure, or f0 in our mockingbirds
may reflect the availability of respiratory, syringeal and upper vocal tract
neuromuscular control in our live birds. Although we did not observe a
consistent pattern in the direction or rate of bronchial airflow or in
subsyringeal pressure associated with the occurrence of NLP, the fact that
frequency jumps were significantly (but not always) correlated with increased
variability in the rate of airflow is consistent with the hypothesis that at
least some NLP reflects a failure in vocal motor control. Nonlinear theory
suggests that even very small changes in control parameters such as subglottal
pressure can result in production of NLP. There are many potential control
parameters that we did not measure, such as syringeal resistance, the tension
of the oscillating labia or the pressure and flow profiles across their
surface.
Source–source and source–tract interactions
We found several cases of airflow through both sides of the syrinx
generating three independent frequencies (f0,
g0 and m0). Nowicki and Capranica
(Nowicki and Capranica, 1986
)
reported evidence for a source–vocal tract coupling in the
amplitude-modulated calls of black-capped chickadees (Parus
atricapillus), and speculated on possible source–source (acoustic
or mechanical) coupling. Our observation that in each case of bilateral
biphonation the frequency of the amplitude modulation (m0)
was the same for both the left- and right-produced fundamentals supports their
hypothesis of source–source coupling in the oscine syrinx.
Possible mechanisms of source–filter and source–source
interactions were investigated recently using computational models
(Laje and Mindlin, 2005
;
Laje et al., 2008
), which
demonstrate that such interactions could indeed produce the complex,
multi-frequency sounds observed in birdsong. Laje and Mindlin's results are
relevant to our observations, because they demonstrate how sounds, such as
subharmonics, previously presumed to be NLP in intact songbirds such as zebra
finches (e.g. Fee et al.,
1998
), could alternatively be the result of acoustic interactions
between the two sides. In addition, Nelson
(Nelson, 2004
) describes how
rapid FM within a song element in eastern towhees can produce an AM-like
sideband pattern in narrowband spectrograms, and further speculated on an
as-yet-unidentified third independent modulator in the syrinx, but further
evidence for the identity or existence of such a structure is lacking.
These simulations and models demonstrate that crosstalk between a
fundamental frequency and resonance frequencies may affect the vibration of
the source. The likelihood of an interaction of sound source and vocal tract
filter increases if vocal tract impedance is adjusted to match the impedance
of the source (Titze, 2008
). A
highly variable vocal tract system with complex motor patterns has been
demonstrated in songbirds (Fletcher et
al., 2006
; Riede et al.,
2006
), and thus one may expect that songbirds, like humans, have a
variety of vocal `tools' at their disposal to avoid or minimize the occurrence
of involuntary NLP during song.
Deterministic chaos in calls and song
Periods of deterministic chaos can theoretically occur in any system of
coupled oscillators, such as the paired labia within each side of a songbird
syrinx. Whereas chaos may or may not be a common feature of learned song, the
unlearned calls of many songbirds, such as alarm calls, contact calls and
aggressive calls, are often characterized by broadband, noisy, `buzzy' or
`harsh' sounds (Marler,
2004
).
Although our data show that mockingbirds may have some control over the
respiratory and syringeal parameters necessary to allow the production of
chaotic sounds, as evidenced by the chaotic nature of three very common call
types, chaotic vibratory modes are also presumably induced involuntarily at
times. Broadband, aperiodic sounds were found within a small number of
otherwise pure-tonal song elements. These erratic occurrences of chaos may be
more akin to those described in the mammalian literature, which are often
attributed to pathologies (Herzel et al.,
1994
; Mende et al.,
1990
), instabilities in the vocal system
(Mergell et al., 2000
),
increasing or maximizing frequency or amplitude levels
(Berry, 2001
;
Brown et al., 2003
;
Riede et al., 2007
), or abrupt
desynchronization of the vibrating labia
(Neubauer et al., 2001
).
Instances of chaotic sounds within more tonal vocalizations were likely
involuntary, as they rarely occurred more than once in the same syllable type
from the same bird.
Possible communicative roles of nonlinear phenomena
Several communicative functions of NLP have been hypothesized. For
instance, it has been suggested that increased vocal `roughness' might be an
honest indicator of poor reproductive fitness
(Goller, 1998
) or health
status (Herzel et al., 1994
;
Riede et al., 1997
). A
preference for pure-tonal over harmonic or aperiodic vocalizations has been
shown for some songbirds (Strote and
Nowicki, 1996
). An alternative, seemingly contradictory,
hypothesis suggests that animals might exploit the nonlinear properties of
their vocal systems to increase vocal complexity, and that NLP could aid in
individual recognition in some species
(Fee et al., 1998
;
Fitch et al., 2002
;
Volodina et al., 2006
;
Wilden et al., 1998
).
Additionally, NLP may function to increase the auditory impact of calls
(Owren, 2003
;
Owren and Rendall, 2001
), and
so could be useful both for attracting allies or mates, as well as for
signalling status or physical condition. While it is still not clear what, if
any, adaptive advantages or selective pressures are associated with NLP in
animal vocalizations, the literature is becoming increasingly rich with
examples of these phenomena in an ever-widening range of taxa. The extent to
which birds use nonlinear phenomena as salient features in vocal communication
varies significantly between species. Species differences in the inclusion of
NLP in their vocal signals may reflect the need, in part, to achieve an
appropriate balance between syllable stereotypy and syllable diversity.
LIST OF ABBREVIATIONS
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Allan, S. E. and Suthers, R. A. (1994). Lateralization and motor stereotypy of song production in the brown-headed cowbird. J. Neurobiol. 25,1154 -1166.[CrossRef][Medline]
Ballentine, B., Hyman, J. and Nowicki, S.
(2004). Vocal performance influences female response to male bird
song: an experimental test. Behav. Ecol.
15,163
-168.
Baptista, L. F. and Trail, P. W. (1992). The role of song in the evolution of passerine diversity. Syst. Biol. 41,242 -247.
Beckers, G. J. L. and ten Cate, C. (2006). Nonlinear phenomena and song evolution in Streptopelia doves. Acta Zool. Sin. 52,482 -485.
Beckers, G. J. L., Suthers, R. A. and ten Cate, C.
(2003). Mechanisms of frequency and amplitude modulation in ring
dove song. J. Exp. Biol.
206,1833
-1843.
Berry, D. A. (2001). Mechanisms of modal and nonmodal phonation. J. Phon. 29,431 -450.[CrossRef]
Berry, D. A., Herzel, H., Titze, I. R. and Story, B. H. (1996). Bifurcations in excised larynx experiments. J. Voice 10,129 -138.[CrossRef][Medline]
Borror, D. J. and Reese, C. R. (1956). Vocal gymnastics in wood thrush songs. Ohio J. Sci. 56,177 -182.
Brown, C. H., Alipour, F., Berry, D. A. and Montequin, D. (2003). Laryngeal biomechanics and vocal communication in the squirrel monkey (Saimiri boliviensis). J. Acoust. Soc. Am. 113,2114 -2126.[CrossRef][Medline]
Derrickson, K. C. and Breitwisch, R. (1992). Northern mockingbird. In The Birds of North America. Vol. 7 (ed. P. S. A. Poole and F. Gill), pp.1 -24. Philadelphia, PA: Smith, Edwards, Dunlop.
Fee, M. S. (2002). Measurement of the linear and nonlinear mechanical properties of the oscine syrinx: implications for function. J. Comp. Physiol. A 188,829 -839.[CrossRef][Medline]
Fee, M. S., Shraiman, B., Pesaran, B. and Mitra, P. P. (1998). The role of nonlinear dynamics of the syrinx in the vocalizations of a songbird. Nature 395, 67-71.[CrossRef][Medline]
Fitch, W. T., Neubauer, J. and Herzel, H. (2002). Calls out of chaos: the adaptive significance of nonlinear phenomena in mammalian vocal production. Anim. Behav. 63,407 -418.[CrossRef]
Fitzpatrick, J. W. (1988). Why so many passerine birds? A response to Raikow. Syst. Zool. 37, 71-76.
Fletcher, N. H. (2000). A class of chaotic bird calls? J. Acoust. Soc. Am. 108,821 -826.[CrossRef][Medline]
Fletcher, N. H., Riede, T. and Suthers, R. A. (2006). Model for vocalization by a bird with distensible vocal cavity and open beak. J. Acoust. Soc. Am. 119,1005 -1011.[CrossRef][Medline]
Floody, O. R. and Arnold, A. P. (1997). Song lateralization in the zebra finch. Horm. Behav. 31, 25-34.[CrossRef][Medline]
Forstmeier, W., Kempenaers, B., Meyer, A. and Leisler, B. (2002). A novel song parameter correlates with extra-pair paternity and reflects male longevity. Proc. R. Soc. Lond. B Biol. Sci. 269,1479 -1485.[Medline]
Giovanni, A., Ouaknine, M., Guelfucci, B., Yu, P., Zanaret, M. and Triglia, J. M. (1999). Nonlinear behavior of vocal fold vibration: the role of coupling between the vocal folds. J. Voice 13,465 -476.[CrossRef][Medline]
Goller, F. (1998). Vocal gymnastics and the bird brain. Nature 395,11 -12.[CrossRef]
Goller, F. and Larsen, O. N. (2002). New perspectives on mechanisms of sound generation in songbirds. J. Comp. Physiol. A 188,841 -850.[CrossRef][Medline]
Goller, F. and Suthers, R. A. (1996a). Role of
syringeal muscles in controlling the phonology of bird song. J.
Neurophysiol. 76,287
-300.
Goller, F. and Suthers, R. A. (1996b). Role of
syringeal muscles in gating airflow and sound production in singing brown
thrashers. J. Neurophysiol.
75,867
-876.
Greenewalt, C. H. (1968). Bird Song: Acoustics and Physiology. Washington, DC: Smithsonian Institution Press.
Herzel, H., Berry, D., Titze, I. R. and Saleh, S. (1994). Analysis of vocal disorders with methods from nonlinear dynamics. J. Speech Hear. Res. 37,1008 -1019.[Medline]
Herzel, H., Berry, D., Titze, I. R. and Steinecke, I. (1995). Nonlinear dynamics of the voice: signal analysis and biomechanical modelling. Chaos 5, 30-34.[CrossRef][Medline]
King, A. S. (1989). Functional anatomy of the syrinx. In Form and Function in Birds. Vol.4 (ed. A. S. King and J. McLelland), pp.105 -192. London: Academic Press.
Laje, R. and Mindlin, G. B. (2005). Modeling source-source and source-filter acoustic interaction in birdsong. Phys. Rev. E 72,036218 .[CrossRef]
Laje, R., Sciamarella, D., Zanella, J. and Mindlin, G. B. (2008). Bilateral source acoustic interaction in a syrinx model of an oscine bird. Phys. Rev. E 77, 011912.[CrossRef]
Lavenex, P. B. (1999). Vocal production mechanisms in the budgerigar (Melopsittacus undulatus): the presence and implications of amplitude modulation. J. Acoust. Soc. Am. 106,491 -505.[CrossRef][Medline]
Lemon, R. E. (1973). Nervous control of the syrinx in white-throated sparrows (Zonotrichia albicollis). J. Zool. Lond. 171,131 -140.
Marler, P. (2004). Bird calls – their potential for behavioral neurobiology. Ann. N. Y. Acad. Sci. 1016,31 -44.[CrossRef][Medline]
Mende, W., Herzel, H. and Wermke, K. (1990). Bifurcations and chaos in newborn-infant cries. Phys. Lett. A 145,418 -424.[CrossRef]
Mergell, P. and Herzel, H. (1997). Modelling biphonation – the role of the vocal tract. Speech Commun. 22,141 -154.[CrossRef]
Mergell, P., Herzel, H. and Titze, I. R. (2000). Irregular vocal-fold vibration – high-speed observation and modeling. J. Acoust. Soc. Am. 108,2996 -3002.[CrossRef][Medline]
Mindlin, G. B. and Laje, R. (2005). The Physics of Birdsong. Berlin: Springer-Verlang.
Nelson, B. S. (2004). Dynamics of frequency and amplitude modulations in vocalizations produced by eastern towhees, Pipilo erythrophthalmus. J. Acoust. Soc. Am. 115,1333 -1344.[CrossRef][Medline]
Neubauer, J., Mergell, P., Eysholdt, U. and Herzel, H. (2001). Patio-temporal analysis of irregular vocal fold oscillations: biphonation due to desynchronization of spatial modes. J. Acoust. Soc. Am. 110,3179 -3192.[CrossRef][Medline]
Nottebohm, F. (1971). Neural lateralization of vocal control in a passerine bird. I. Song. J. Exp. Zool. 177,229 -262.[CrossRef][Medline]
Nottebohm, F. and Nottebohm, M. E. (1976). Left hypoglossal dominance in the control of canary and white-crowned sparrow song. J. Comp. Physiol. 108,171 -192.[CrossRef]
Nowicki, S. and Capranica, R. R. (1986). Bilateral syringeal coupling during phonation of a songbird. J. Neurosci. 6,3595 -3610.[Abstract]
Nowicki, S. and Searcy, W. A. (2004). Song function and the evolution of female preferences – why birds sing, why brains matter. Ann. N. Y. Acad. Sci. 1016,704 -723.[CrossRef][Medline]
Owren, M. J. (2003). Vocal production and perception in nonhuman primates provide clues about early hominids and speech evolution. ATR Sympos HIS Series 1, 1-19.
Owren, M. J. and Rendall, D. (2001). Sound on the rebound: bringing form and function back to the forefront in understanding nonhuman primate vocal signaling. Evol. Anthropol. 10, 58-71.[CrossRef]
Raikow, R. J. (1986). Why are there so many kinds of passerine birds? Syst. Zool. 35,255 -259.
Riede, T., Wilden, I. and Tembrock, G. (1997). Subharmonics, biphonations, and frequency jumps – common components of mammalian vocalization or indicators for disorders? Z. Saugetierkd. 62,198 -203.
Riede, T., Herzel, H., Mehwald, D., Seidner, W., Trumler, E., Bohme, G. and Tembrock, G. (2000). Nonlinear phenomena in the natural howling of a dog-wolf mix. J. Acoust. Soc. Am. 108,1435 -1442.[CrossRef][Medline]
Riede, T., Owren, M. J. and Arcadi, A. C. (2004). Nonlinear acoustics in pant hoots of common chimpanzees (Pan troglodytes): frequency jumps, subharmonics, biphonation, and deterministic chaos. Am. J. Primatol. 64,277 -291.[CrossRef][Medline]
Riede, T., Suthers, R. A., Fletcher, N. H. and Blevins, W.
E. (2006). Songbirds tune their vocal tract to the
fundamental frequency of their song. Proc. Natl. Acad. Sci.
USA 103,5543
-5548.
Riede, T., Arcadi, A. C. and Owren, M. J. (2007). Nonlinear acoustics in the pant hoots of common chimpanzees (Pan troglodytes): vocalizing at the edge. J. Acoust. Soc. Am. 121,1758 -1767.[CrossRef][Medline]
Stein, R. C. (1968). Modulation in bird sound. Auk 94,229 -243.
Strote, J. and Nowicki, S. (1996). Responses to songs with altered tonal quality by adult song sparrows (Melospiza melodia). Behaviour 133, pp.161 -172.[CrossRef]
Suthers, R. A. (1990). Contributions to birdsong from the left and right sides of the intact syrinx. Nature 347,473 -477.[CrossRef]
Suthers, R. A. (1999). The motor basis of vocal performance in songbirds. In The Design of Animal Communication (ed. M. Hauser and M. Konishi), pp.37 -62. Cambridge, MA: MIT Press.
Suthers, R. A. and Goller, F. (1997). Motor correlates of vocal diversity in songbirds. In Current Ornithology. Vol. 14 (ed. V. Nolan, Jr, E. Ketterson and C. F. Thompson), pp. 235-288. New York: Plenum Press.
Suthers, R. A. and Zollinger, S. A. (2004). Producing song: the vocal apparatus. Ann. N. Y. Acad. Sci. 1016,109 -129.[CrossRef][Medline]
Suthers, R. A., Goller, F. and Hartley, R. S. (1994). Motor dynamics of song production by mimic thrushes. J. Neurobiol. 25,917 -936.[CrossRef][Medline]
Suthers, R. A., Vallet, E., Tanvez, A. and Kreutzer, M. (2004). Bilateral song production in domestic canaries. J. Neurobiol. 60,381 -393.[CrossRef][Medline]
Suthers, R. A., Beckers, G. J. L. and Nelson, B. S. (2005). Vocal mechanisms for avian communication. In Behavior and Neurodynamics for Auditory Communication (ed. J. S. Kanwal and G. Ehret), pp. 3-35. Cambridge: Cambridge University Press.
Suthers, R. A., Narins, P. M., Lin, W. Y., Schnitzler, H. U.,
Denzinger, A., Xu, C. H. and Feng, A. S. (2006). Voices of
the dead: complex nonlinear vocal signals from the larynx of an ultrasonic
frog. J. Exp. Biol. 209,4984
-4993.
Thorpe, W. H. (1961). Bird-Song. Cambridge: Cambridge University Press.
Titze, I. R. (2008). Nonlinear source-filter coupling in phonation. Theory J. Acoust. Soc. Am. 123,2733 -2749.[CrossRef]
Titze, I. R., Baken, R. and Herzel, H. (1993). Evidence of chaos in vocal fold vibration. In Vocal Fold Physiology (ed. I. R. Titze), pp.143 -188. San Diego: Singular Publishing Group.
Tokuda, I., Riede, T., Neubauer, J., Owren, M. J. and Herzel, H. (2002). Nonlinear analysis of irregular animal vocalizations. J. Acoust. Soc. Am. 111,2908 -2919.[CrossRef][Medline]
Tyson, R. B., Nowacek, D. P. and Miller, P. J. O. (2007). Nonlinear phenomena in the vocalizations of North Atlantic right whales (Eubalaena glacialis) and killer whales (Orcinus orca). J. Acoust. Soc. Am. 122,1365 -1373.[CrossRef][Medline]
Vallet, E., Beme, I. and Kreutzer, M. (1998). Two-note syllables in canary songs elicit high levels of sexual display. Anim. Behav. 55,291 -297.[CrossRef][Medline]
Vermeij, G. J. (1988). The evolutionary success of passerines: a question of semantics? Syst. Zool. 37, 69-71.
Volodina, E. V., Volodin, I. A., Isaeva, I. V. and Unck, C. (2006). Biphonation may function to enhance individual recognition in the dhole, Cuon alpinus. Ethology 112,815 -825.[CrossRef]
Wilden, I., Herzel, H., Peters, G. and Tembrock, G. (1998). Subharmonics, biphonation, and deterministic chaos in mammal vocalization. Bioacoustics 9, 171-196.
Williams, H., Crane, L. A., Hale, T. K., Esposito, M. A. and Nottebohm, F. (1992). Right-side dominance for song control in the Zebra Finch. J. Neurobiol. 23,1006 -1020.[CrossRef][Medline]
Zollinger, S. A. and Suthers, R. A. (2004). Motor mechanisms of a vocal mimic: implications for birdsong production. Proc. R. Soc. Lond. B Biol. Sci. 271,483 -491.[Medline]
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati
Twitter What's this?
This article has been cited by other articles:
![]() |
A. N. Rice and A. H. Bass Novel vocal repertoire and paired swimbladders of the three-spined toadfish, Batrachomoeus trispinosus: insights into the diversity of the Batrachoididae J. Exp. Biol., May 1, 2009; 212(9): 1377 - 1391. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||