Japanese Phonemes and Phones
Youâd be surprised how hard it is to get a straight answer on how many phonemes exist in Japanese. Most of the time, what you get are the phones of Japanese, which, of course, are quite different. Today weâll try to settle the question once and for all:
If youâve ever seen a Kana chart, then youâve seen the Gojuuon, of the 50 sounds. This in itself gives us a really great starting point to finding the answer.
(Courtesy of myjapaneseprofessor.com)
From here, we can guess that there have to be at least 5 vowels and 9 consonants, assuming that the solo ă and ăȘ,ă«,ăŹ,ă, and ăź use the same /n/.
/k/ /s/ /t/ /n/ /h/ /m/ /y/ /r/ /w/ || Â /a/ /i/ /u/ /e/ /o/
But wait, thereâs more!Â
We also know that Japanese voices various columns, through the dakuten, those being the ămarks you see after a kana.
With the dakuten, you get voiced /k/ /s/ and /t/; and you also get a voiced bilabial plosive of /p/, which is what /h/ was once upon a time. So we have /g/ /z/ /d/ and /b/. And we also have the handakuten, which is the ăyou sometimes see after the kana starting with /h/, which make /h/ into /p/.Â
/k/ /g/ /s/ /z/ /t/ /d/ /n/ /h/ /b/ /p/ /m/ /y/ /r/ /w/ || Â /a/ /i/ /u/ /e/ /o/
But wait, thereâs more!
We also have the phenomenon of palatalization, which is represented by the ăă
ă you can see after ă,ă, ă, ă, ăĄ, ă«, ăČ, ăł, ăŽand ăż, effectively making digraphs.
For the sake of simplification and, we will call these digraphs /ky/ /gy/ /sy/ /zy/ /ty/ /ny/ /hy/ /by/ /py/ and /my/. This does not correspond with the romanization system we use, but thatâs okay. When weâre just transliterating, weâre not terribly concerned about phonemes as such.
Okay, so hereâs our answer
/k/ /g/ /ky/ /gy/ /s/ /sy/ /zy/ /z/ /t/ /ty/ /d/ /n/ /ny/ /h/ /hy/ /b/ /by/ /p/ /py/ /m/ /my/ /y/ /r/ /w/ || Â /a/ /i/ /u/ /e/ /o/
24 consonants, and 5 vowels. (Weâll eventually concede them an extra nasal phoneme, but letâs pretend all non-fontal nasals are the same.)
Now letâs figure out the phones.
In a perfect world, one would have one phone for one phoneme. But thatâs really never the case. Fair warning, phonetics is a hotly debated topic, so if you talk to 5 different people, youâll get 5 different answers.
The crossed out phonemes are the ones that we wonât worry much about because they do not have allophones.
/k/ /g/ /ky/ /gy/ /s/ /sy/ /zy/ /z/ /t/ /ty/ /d/ /n/ /ny/ /h/ /hy/ /b/ /by/ /p/ /py/ /m/ /my/ /y/ /r/ /w/ || Â /a/ /i/ /u/ /e/ /o/
Allophones are two phones that correspond to the same phoneme under different distributions. Youâll see what I mean in a minute.
Distribution refers to the surroundings of a phone, i.e. what comes before and after it. Depending on itâs distribution, the sound a phoneme is will change. For example, take the words âhouseâ and âHugh.â Notice how the /h/ changes from an open sound to a constrained sound. The sound changes because of whatâs coming in front of it, itâs distribution.
In Japanese, the most important factor in phoneme distribution is what comes after it, meaning, in most cases, what vowel comes after the consonant. There are three instances where what comes before the phoneme is important: /g/, and /u/ and /n/.
The easiest case to talk about is /u/, because everyone becomes savvy to this very quickly.
Letâs take two anime names: âUzumakiâ and âSasuke.â Note that those two middle /u/âs are pronounced different. In the first case, itâs pronounced; and in the second it isnât.Â
When /u/ is between two voiceless consonants, it is silent.
Letâs look at once more case, that of verbs: âdesu,â âkimasu,â âkorosu.â Here /u/ is also silent. But if you look at verbs such as âkikuâ and ânaku,â the /u/ is pronounced.
In final position, if preceded by /s/, /u/ is silent.
In the case of /g/, this is a dialectical thing, but itâs common enough to talk about. (Iâd even say that itâs exaggerated in some instances where people are trying to speak clear, Standard Japanese.)
With /g/, there is a chance it will turn into the velar nasal: [Ć]
[Ć] is the same sound that Spanishâs âñâ represents. Itâs the same /n/ in sing.Â
Youâd be surprised by how difficult it is to pin down when this phenomenon happens. It has probably changed a lot in the past 50 years. We have heard from people who learned Japanese 30 years ago that itâs supposed to be a posh feminine thing, but weâve heard audio of men doing it all the time. Just keep in mind that it does happen.
There seems to be some consensus that the /n/âs in ăȘ, ă«, ăŹ, ă, and ăź are all the same. Itâs the alveolar nasal [n].
In the case of ă, where the thing following it is not a vowel, it will assimilate in position according to the features of the consonant. If itâs followed by a vowel, however, it will be [n], which is why weâll try to keep it one phoneme. (Some linguists believe that this is actually a nasalized vowel, but we cannot quite see it.)
The most famous case of this kind of assimilation happening is with the word ć
茩 (ăăă±ă/senpai), where it is pronounced with an [m], âsempai.â [m] is the bilabial nasal, and [p] is a bilabial plosive. Â
The same thing happens with velar consonants [k] and [g]. çŁäŒ (ăăăă
ă/sankyuu) will be pronounced as our friend [Ć].
In cases where it is at the end of the sentence, it will be pronounced as the uvular nasal [ÉŽ].
When /n/ precedes a bilabial consonant, it is [m].
When /n/ precedes a velar consonant, it is [Ć].
When /n/ is in final position, it is [Ɏ].
In all other cases, /n/ is [n].
/r/ is another famous case. This is a case of two allophones having âequal distribution,â i.e. you can exchange one for another and youâll never really have a problem.
/r/ is both a flap and a lateral. To put it colloquially, you can pronounce either as an r or as an l. Some people have developed rationales for when a consonant is pronounced either as an the flap [ÉŸ] or as the lateral [l], but there is a lot of inconsistency between them.
There exist a couple of variations with /h/ (and aspirations with certain plosives), but the one case we need to talk about is when /h/ precedes /u/.
The tendency is to place /h/ generally in the glottal area, so the glottal fricative [h].Â
Before /u/, /h/ can be pronounced as a bilabial fricative [Éž], the labiodental fricative [f], or the glottal fricative [h].
The standard pronunciation, to our understanding, is [Éž], but the others are heard.Â
/w/ is a dying phoneme. There originally existed four Kana for it (which can still be heard in the Iroha poem): ă, ă (wi), ă (we), and ă. Now only two are in use: ă and ă; and ă is dying out. If it were not for its functional use, for clearly marking the accusative case particle, it would be gone as well.
We bring up /w/ because there are those who pronounce the /w/ before /o/ and those who donât. Both seem to be fine, though some may claim that this is an abnormality known as hypercorrection, where a sound is produced unnaturally for the same of some kind of regularity (this is common in cases of assimilation, such as pronouncing the final /s/ in âringsâ as an [s] and not as a [z]).Â
Regardless, there is an older generation that did learn to pronounce ă with the /w/.
When /w/ precedes /o/, it can either be silent or pronounced.
These four consonants undergo a similar transformation before /i/.
The consensus is that they become post-alveolar affricates.
/t/ before /i/ becomes [tÉ]
/d/ before /i/ becomes [dÊ]
/s/ before /i/ becomes [É]
/z/ before /i/ becomes [Ê]
Itâs worth noting here that the palatal phonemes that /ty/, /gy/, /sy/, and /zy/ have these very same phones.
Then, in the case of /t/ and /d/, when they precede /u/, they become affricates, but alveolar and not post-alveolar, so [tÍĄs] [dÍĄz].
/t/ before /u/ becomes [tÍĄs]
/d/ before /u/ becomes [dÍĄz]
And that, friends, covers most of everything.Â
There are some things involving loanwords and nuances, such as /s/ remaining [s] before /i/ and there being a [v] and not just a [b], but thatâs secondary to all this.