Most of the time I use vulgarlang to generate languages, since I can stay phonologically consistent across a culture while being ratshit at making up words. Staring at its blessedly low-effort output, I got curious about Mandoâa, one of Star Warsâ most famous conlangs. The easiest thing to do seemed to be to grab the list of words from mandoa.org, so I did. After scrubbing it a bit for punctuation marks - not the comma - I ran it through a couple of analysis tools to see how itâs different from English, and what other characteristics it might have. If anyoneâs got any longer Mandoâa texts, please flick me a link even if it contains user invented words. Iâd be interested to see how different they are.
The first thing was a frequency analysis. For instance, in a plain English text, 12.7% of all letters will be an E, while very few will be an X. In Mandoâa, 17.14% of all letters will be an A, and very few will be a W.
The twelve most frequent letters in English, in order, are: ETAOIN SHRDLU. In Mandoâa, theyâre ARIEUâT ONSLY. Until now, I hadnât noticed that Travissâs Mandoâa doesnât actually contain an F, and I think but canât find a reference for, that she said Fett is derived from vhett, meaning farmer. Mandoâa has no Z either, which could be a nice bit of world-building regarding Satine Kryze perhaps coming from another population altogether (like the Norman French after 1066) but almost certainly isnât. There are also no letters for X or Q.
After that, I did a sliding bigram analysis. This scans through the text looking for which two letters are most likely to appear next to each other. In English, the six most common are TH, HE, IN, ER, AN and RE. Interestingly, for a language created by someone called Karen, Mandoâa has AR, IR, AA, RA, KA, and AN.
K itself is nearly seven times more likely to appear in Mandoâa than it is in English. People do tend to prefer the letters in their name over other letters in the alphabet, so this isnât quite as egotistical as you might assume. Itâs the thirteenth most common letter in Mandoâa, half way down the frequency list.
Even though the letters donât appear in written Mandoâa, the sounds for F, Q, X, and Z might be present in spoken Mandoâa. However, I tend to headcanon them as too pragmatic to put up with an orthography as horrifying as Englishâs, so for me this now means that someone with a heavy Mando accent would curse vhiervhek about the vhekking banta vhodder. Thatâs not a typo in bantha; Mandoâa is missing TH as a bigram and presumably as a phoneme as well.
I have no idea why she chose these letters to leave out while leaving the rhotics in. F itself occurs about 2.2% of the time in English, the 15th most common letter, but the bigram VH that she implies replaced it is only the 171st most frequent one in Mandoâa, showing up 0.16% of the time. Odd, since itâs in the name belonging to the most famous Mandos of them all.
Although this analysis is based on Mandoâa orthography, it suggests a restricted number of phonemes. The pronunciation guide limits that even more. I couldnât find any Mando written in IPA, although some poor sod has probably tried it. At some point in the future I might try and figure out how many phonemes it has, compared to real-world languages and other conlangs, like Klingon and Dothraki.
Thereâs no real conclusion here, except that it hasnât escaped my notice that the seven most common letters in Mandoâa, ARIEUâT, are similar to the word for traitor, foreigner, outsider: aruetii, which could be a bemusing in-universe cultural artifact, and that double letters donât seem to mean anything, pronunciation-wise, so why is the third most common bigram AA when it doesnât denote a long vowel or anything useful?
(and if Etainâs name came from ETAOIN?)