Phonology 1.5: Consonants
In this lesson, we will learn about the particulars of Japanese consonant pronunciation and the rules behind it in the context of Japanese phonology. Terms will be defined as necessary, and connections to English will be made occasionally, though this will be limited due to the overwhelming dialectical diversity found across the English speaking world.
Lesson Note: This lesson assumes that you have basic knowledge of IPA symbols, but IPA symbols and linguistic terminology are defined when appropriate.
[] encases phonetic descriptions, whereas // encases phonemic1 descriptions.
Consonants 子音
stops/Plosives 閉鎖音・破裂音
A stop is a sound that stops air flow completely. The Japanese non-nasal stops are [p, b, t, d, k, g]. If we were to organize these by voicing and place of articulation, we would get the following chart.
| Bilabial 両唇音 | Alveolar 歯茎音 | Velar 軟口蓋音 | |
| [-V] | [p] | [t] | [k] |
| [+V] | [b] | [d] | [g] |
The bilabial sounds [p] and [b] are made by compressing the lips. Though the Japanese [t] and [d] are alveolar like in English, they are pronounced with the tongue tip almost touching the back of the upper teeth (舌端音). For some speakers, they will be fully dental (歯音).
The voiced velar [g] has a [+nasal] allophone [ŋ] word medially and after the moraic uvular nasal /ɴ/. The two allophones of /g/, [g, ŋ] are interchangeable in between vowels.
In English, unvoiced stops are heavily aspirated, especially in word-initial position. In Japanese, unvoiced stops are aspirated but not to the same degree as in English but greater than as in Spanish.
Fricatives 摩擦音
Fricatives are made via turbulence made by forcing air to flow narrowly through the oral cavity, or in other words, by making friction with the tongue and mouth.
| Bilabial 両唇音 | Alveolar 歯茎音 | Alveolo-Palatal 歯茎硬口蓋音 | Palatal 硬口蓋音 | Velar 軟口蓋音 | Glottal 声門音 | |
| [-V] | ([ɸ]) | [s] | [ɕ] | ([ç]) | [h] | |
| [+V] | ([β]) | [z] | [ʑ] | ([ɣ]) |
Chart Note: In this chart, we include phonemes and allophones of phonemes that happen to be pronounced as fricatives. To distinguish them in the chart, allophones of phonemes pronounced as fricatives are shown in parentheses.
Alveolo-palatal sounds in Japanese involve no rounding of the lips with the blade of the tongue behind the alveolar ridge and the body of the tongue raised toward the palate. This is unlike alveolar sounds which are made on or slightly before the alveolar ridge.
Phonemically, there are only five fricatives in Japanese, but we see the rest of the space is nearly filled up when we include allophones of other (non-)fricative phonemes. In the first column, we see [ɸ] and [β]. The first is an allophone of /h/ when it is followed by the back unrounded vowel [ɯᵝ]. The latter is an optional allophone of /b/ in rapid speech in word medial position, but it is not that common, and even in such rapid and or vulgar speech in which it would appear, it is not a guarantee that it will even appear.
[s] and [z] are fundamental fricatives of Japanese. [s]’s hissing effect is slightly more amplified in Japanese to many English ears, but this is minute and would not be a feature shared by all Japanese speakers. [z] is often restricted by speakers in word medial position, and in Standard Japanese, it is not supposed to appear word initially. /z/ when word initial or after the uvular [ɴ] is pronounced with the affricate allophone [dz]. We will learn more about affricates in the next section. We will say that because this splicing is so dependent on speaker variation that these allophones are in free variation with [dz] being the most frequent pronunciation.
The alveolopalatal fricatives [ɕ] and [ʑ] have an odd status in Japanese phonology. Some reconstructions of older forms of Japanese suggest that [s] and [z] were in fact [ɕ] and [ʑ] or at least such with [-low] vowels such as [i] and [e] because to this day, there are dialects in which [si] and [se] are realized as [ɕi] and [ɕe] respectively. These sounds became phonemes in Japanese through the introduction of loanwords from Chinese.
In Standard Japanese, /s/ has the obligatory allophone [ɕ] before the high front vowel [i]. This makes /s/ partially neutralized with [ɕ]. This allophonic variation existed before the introduction of [ɕ] as a phoneme. The same came be said for /z/ which must become [ʑ] before [i].
Unlike [z] and [dz] which are still in free variation, [ʑ] is disappearing. Regardless of location, it is replaced entirely by the allophone [dʑ].
The palatal [ç] is an obligatory phoneme of [h] before the high vowel [i] in Standard Japanese, but it does not exist, traditionally, outside Eastern Japanese dialects.
Like the bilabial fricative, the velar fricative [ɣ] is an optional allophone of /g/ in rapid and or vulgar speech, but this is somewhat more common. As it is not an obligatory feature, many speakers cannot even pronounce it. This allophone only appears word medially.
The glottal fricative [h] is very similar to the English [h] and can be accompanied by vocal cord vibration intervocalically just like in English. To many English speakers, it may sound slightly more tense than the English [h] but relatively weak before the low-mid vowel [a].
Affricates 破擦音
An affricate is the combination of a stop and a fricative. In Japanese, the voiced alveopalatal fricatives and voiced affricatives have collapsed together, with the alveopalatal fricative pronunciations lingering on as rare allophones.
| Alveolar 歯茎音 | Alveolopalatal 歯茎硬口蓋音 | |
| [-V] | [ts] | [tɕ] |
| [+V] | [dz] | [dʑ] |
[ts] is an obligatory allophone of /t/ before the back unrounded vowel [ɯᵝ], but due to the introduction of loanwords from modern foreign languages, it has now become phonemic, but it has also existed in this capacity in non-standard dialects.
[tɕ] has traditionally been an allophone of /t/ before the high front vowel [i] but became a phoneme of Japanese through borrowings from Chinese, and thanks to borrowings from modern foreign languages, it may now be used with all of the vowels of Japanese. It has traditionally been not paired with the mid-vowel [e].
The voiced affricates [dʑ] and [dz] initially came in the language via Chinese borrowings in Middle Japanese. They were maintained as distinct allophones of separate sounds from the voiced fricatives, but the voiced fricatives began to partially neutralize with the voiced affricatives before high vowels. Complete collapse has occurred for many speakers in which all voiced fricatives become voiced affricatives. Once [z] disappears, we will then be able to say complete collapse has occurred for all speakers. In some regions, [dz] has become [d]. This means that /d/, /z/, and /dz/ are completely neutralized for some speakers.
Approximants 接近音
Approximants is not the best term in the world, but it refers to liquids and glides which are all sonorants. Sonorants are sounds created with no obstruction to air flow and are continuant. Liquids differ from glides in that liquids are [+consonantal] and glides are not. Glides are treated phonologically as consonants in Japanese, but their articulation most resembles a vowel.
| Alveolar 歯茎音 | Palatal 硬口蓋音 | Velar 軟口蓋音 | |
| liquid [+V] | [ɾ] | ||
| glide [+V] | [j] | [w] |
The alveolar tap/flap [ɾ] does have allophones. It is neither the English [l] nor [ɻ]. It is often described as sounding like the English [d], though the English [d] is semi-voiced whereas the Japanese [ɾ] is fully voiced and created by merely tapping the alveolar ridge. Before [i] and [j], it usually sounds like the tap [ɾ], but before [o], it often sounds like the alveolar approximant [ɺ], It may also be a trill, [r], in vulgar or casual speech.
The palatal glide [j] is seen only with [a, o, ɯᵝ] in native and Sino-Japanese vocabulary. [je] has been introduced in loans, but it is frequently replaced with [i.e]. [yi] has not been successfully introduced, though a Katakana diglyph does exist for it – (イィ). All Japanese stops and fricatives are palatalized to enlarge the Japanese phonemic inventory.
| Bilabial 両唇音 | Alveolar 歯茎音 | Velar 軟口蓋音 | |
| Stop [-V] | [pj] | [tj= tɕ] | [kj] |
| Stop [+V] | [bj] | [dj= dʑ] | [gj] |
| Fricative [-V] | [sj=ɕ] | ||
| Fricative [+V] | [zj= ʑ] |
Phonologically, the sounds in the alveolar column are combinations of a stop/fricative with a palatal glide as the underlining representation. However, this underlining representation does not reflect the surface pronunciation, which is shown to the right. [ɕ] and [ʑ] with [i] is the result of palatalization, but it is not resultant from the juxtaposition of a palatal glide. After palatalized consonants, traditionally only the vowels [a, i, ɯᵝ] would follow. The vowel [e] has only appears after palatalized consonants in modern loanwords.
The Japanese [w] is not accompanied with the large protrusion and rounding of the lips like in English and is compressed like its true vowel counterpart [ɯᵝ]. An ad hoc IPA representation of this is a double arrow ⇔ below w. This phoneme is now mostly restricted to [a] in native words. It survives with [o] among a decent minority in but one morpheme, [-(w)o] (accusative marker). In loans, it is seen with all vowels but [ɯᵝ]. When wu is transcribed, it is typically spelled out as [ɯᵝ:], but attempts are being made to somehow introduce it as seen in the Katakana diglyph ウゥ.
Labialization used to be a secondary feature of pronunciation in the past. [kw] and [gw] were once phonemes of Japanese but have since collapsed completely with [k] and [g] respectively. These phonemes have arguably been re-introduced via modern loanwords, but most speakers would pronounce something like kwo as [kɯᵝ.o] instead.
Nasals 鼻音
There were two stops which we did not see in our discussion above, [m, n]. These sounds are undoubtedly phonemes of Japanese, but the one nasal sound that causes headaches is the moraic uvular nasal /ɴ/. In totality, it has at least seven allophones, which are in complementary distribution under normal circumstances. This sound coarticulates with the following sound, making this progressive nasal assimilation.
| Bilabial | Alveolar | Alveolopalatal | Palatal | Velar | Uvular | |
| nasal [+V] | [m] | [n] | [nj] | [ɲ] | [ŋ] | [ɴ] |
Before bilabials, /ɴ/ becomes [m]. Before non-approximant alveolars, it becomes [n]. These two cases are not examples of partial neutralization because they are [+syllabic] whereas the phonemes /m/ and /n/ are [-syllabic]. They are different by one feature. Before velars, it becomes [ŋ]. Before alveo-palatal and palatal sounds, it is respectively an alveo-palatal [nj] and palatal [ɲ] respectively. Before approximants and vowels, it is either [ĩ,ɯ̃ᵝ]. The first appears before [i], but the latter occurs before everything else. This shows us that [a], despite being a central vowel, is treated like a back vowel in Japanese phonology.
Consonant Gemination 促音化
Japanese arguably has long consonants/consonant gemination. Long consonants can be transcribed like vowels with a colon (technically a symbol that looks like a colon with triangles on top of each other instead of circles), or by doubling the consonant letter. One can interpret this as consonant fortition or glottal stop inserting before the consonant or something like it because the result is a consonant that is arguably usually two morae (though it is internalized by natives as two morae regardless if it is truly phonetically uttered as such or not).
Thus, the symbol Q has been used by some Japanese phonologists who believe it is a moraic obstruent. At the end of vowels in abrupt utterances, a glottal stop is realized, and because the Kana scripts treat these two things as the same sound, some have argued that underlining, a phonemic glottal stop precedes a consonant to make it a geminate in Japanese. The argument, though, that Q is an archiphoneme which realizes as the sound that follows next is more plausible.
Aside from rare loans and geminate nasals from the juxtaposition of a nasal stop and the syllabic /ɴ/, gemination is restricted to unvoiced consonants. According to Kawahara (2006), Japanese has a suffix -ɾi that contains a “floating mora” that triggers gemination in certain cases (e.g. |tap| +|ri| > [tappɯᵝɾi] (‘a lot of’). When this leads to a geminated voiced obstruent, a moraic nasal appears instead as a sort of “partial gemination” (e.g. |zabu| + |ri| > [zambɯᵝɾi] (‘splashing’).
Summary Chart of All Japanese Sounds including those restricted as Allophones
| Bilabial | Alveolar | Alveolo-palatal | Palatal | Velar | Uvular | Glottal | |
| Nasal | m | n | n̠ʲ | ɲ | ŋ | ɴ | |
| Plosive | b, p | t, d, ts | tɕ, dʑ | k, g | ʔ | ||
| Fricative | ɸ, β | s, z, dz | ɕ, (ʑ) | ç | (Ɣ) | h | |
| Trill | r | ||||||
| Liquid | ɽ, ɾ, ɺ | ||||||
| Glide | j | w͍ | |||||
| Moraic Obstruent | Q |
参照:
1. http://en.wikipedia.org/wiki/Japanese_phonology
2. An Introduction To Japanese Linguistics, Natsuko Tsujimura (1996)
3. Kawahara, Shigeto (2006). “A Faithfulness ranking projected from a perceptibility scale: The case of [+ Voice] in Japanese”. Language 82
- “Phonemic” as in each “sound” present constitutes a “phoneme,” but a “phoneme” may have more than one pronunciation. Writing phonemes phonetically helps distinguish which pronunciation is meant, but writing phonemes purely phonemically also aids in knowing what constitutes a single “sound (with potential variants)” in the language. ↩︎
