In one discussion with neighbours over the festive period, the question was raised as to whether Ulster Scots is a language and another question was raised soon afterwards about how true it is that all [European] languages have a common ancestor. I tried to explain briefly that these are ultimately the same question – although I failed, because the explanation is not brief!
To be honest, it would be easier to answer this question on a podcast rather than in writing and, if you’d like to hear me try that at some stage in 2024, do buy me a coffee and I’ll actually put it towards the relevant equipment (as I’m not really supposed to drink too much coffee on doctor’s orders!)
In the meantime, I will do my best in writing, using as my corpus, quite simply, the numbers from one to ten.
Spanish/German
Let us look at the numbers from one to ten in Spanish and German:
- uno / eins; 2. dos / zwei; 3. tres / drei; 4. cuatro / vier; 5. cinco / fünf; 6. seis / sechs; 7. siete / sieben; 8. ocho / acht; 9. nueve / neun; 10. diez / zehn.
This would be easier read out, as it would clarify the actual differences in speech, although of course that has its drawbacks because different varieties of a language pronounce things different ways: for example, cinco is predominantly pronounced as if “thin-ko” in Spain by “sin-ko” in most of the rest of the Spanish-speaking world.
Fundamentally, however, what we can see here (and could hear here) is that Spanish and German are clearly two different languages; yet there are also clearly parallels. For example, it is not a coincidence that:
- the number ‘one’ contains a nasal in both languages (and in English);
- the numbers ‘two’ and ‘ten’ start with the same letter (as they do in English);
- the start of the numbers ‘four’ and ‘five’ and also ‘six’ and ‘seven’ is the same in Spanish and in German (and in English; even though the initial letter of ‘four’ and ‘five’ is now pronounced differently in Spanish and written different in German);
- the number ‘eight’ contains a middle bit with a variation of what was obviously originally a ‘k’ sound; and
- the number ‘nine’ is close in both languages to the word for ‘new’ (nuevo and neu respectively).
Spanish/Italian
Let us try that again with Spanish, but now compare Italian:
- uno / uno; 2. dos / due; 3. tres / tre; 4. cuatro / quattro; 5. cinco / cinque; 6. seis / sei; 7. siete / sette; 8. ocho / otto; 9. nueve / nove; 10. diez / dieci.
These are still, I think we would agree, still clearly separate languages; yet in fact they are obviously considerably closer than Spanish and German are. We even have the precise same spelling (and pretty much the same pronunciation) for ‘one’, and a very similar pronunciation (though actually not quite identical, as the ‘t’ is pronounced double in Italian) for ‘four’. The pronunciation of ‘five’ is quite distinct in reality (Italian begins as if “chin-” rather than “thin-” or “sin-“). Otherwise, however, if we further take into account the Italian rule that all except some very common short words must end in a vowel (hence the removal of the final consonant in ‘three’ and ‘six’ and the addition of one in ‘ten’), these are really quite similar – and again, the word for ‘nine’ in Italian is indeed close to novo ‘new’.
Dutch/German
Given Spanish is closer to Italian than it is to German, it is worth noting that German is, for example, obviously closer to Dutch than it is to Spanish (Dutch first below):
- een / eins; 2. twee / zwei; 3. drie / drei; 4. vier / vier; 5. vijf / fünf; 6. zes / sechs; 7. zeven / sieben; 8. acht / acht; 9. negen / neun; 10. tien / zehn.
What do you think? Are these roughly as close to each other as Spanish and Italian are? Certainly, they are closer to each other than Spanish was to German and, interestingly, again we have two forms which are pretty much identical in speech (and indeed, in this case, in writing: ‘four’ and ‘eight’). Again we have the same initial letter in Dutch both for ‘four’ and ‘five’ and for ‘six’ and ‘seven’; we have the nasal in ‘one’; and we have some fairly obvious common sound changes (e.g. ‘t-‘ in Dutch and English goes to ‘z-‘ in German). We also have evidence of removal of nasals after a vowel with compensatory lengthening, which occurs in ‘five’ in English also (the German, containing the nasal, is closer to the original – we will come back to that).
Dutch/Afrikaans
Dutch has an even closer partner, however, namely Afrikaans. Here we run into some trouble writing out the numbers but, in reality, they are identical in speech with three marginal differences:
- the long <e> which appears in all of twee, zes, zeven and negen (written in Afrikaans twee, ses, sewe and nege) is pronounced higher in Afrikaans – in Dutch the first syllable of zeven sounds close to English “safe”, but in Afrikaans it is closer to English “sieve”;
- the vowel in ‘five’ (written vyf in Afrikaans) is lower in Afrikaans, with no obvious English approximation (it is maybe closer to “fave” than “five”);
- the end vowel of ‘seven’ and ‘nine’ (unwritten in Afrikaans, as noted above) is barely pronounced in Dutch but definitely not at all in Afrikaans (though it does re-appear in derived terms such as negentien ‘nineteen’).
Fundamentally, the difference between Dutch and Afrikaans is a fairly straightforward phonological one which still leaves them closer to each other in counting than would frequently be the case for dialects of the same language.
Portuguese
Similarly to Dutch and Afrikaans, the differentiation in numbers between European Portuguese and Brazilian is to do with pronunciation; they are in fact written the same way on either side of the Atlantic (at least, they are as far as ten). Yet there are clear differences in pronunciation:
- um ‘one’ is pronounced similarly (note there is now an ‘m’ rather than an ‘n’ in every other language we have looked at, but that is primarily a spelling convention – it is a nasal, still);
- dois ‘two’ and seis ‘six’ are pronounced as if ending in a ‘sh’ or even ‘zh’ (close to the ‘s’ in English “leisure”) in Portugal, but may shift towards more of a ‘z’ in parts of Brazil;
- três ‘three’ has a much more open vowel in Brazil (the difference here is not dissimilar in scale, though not quite the same in quality, as between Dutch and Afrikaans in, for example, twee);
- quatro ‘four’, cinco ‘five’ and actually also oito ‘eight’ end in what to English speakers would sound more of a ‘-u’ or ‘-oo’ sound, but this is held longer in Brazil;
- sete ‘seven’ is quite distinct, pronounced as if “set” with just a hint of a final vowel in Portugal but to rhyme almost with “itchy” in much of Brazil;
- nove ‘nine’ also has a much clearer final vowel in Brazil, which is not far from rhyming with English “coffee” (but with a voiced consonant v); and
- dez ‘ten’ is also quite distinct, pronounced in Portugal with the ‘zh’ ending noted above which is retained in parts of Brazil but not in others (which are more like English ‘z’) but, conversely, with the initial sound made affricate (so similar to the first syllable of English “gestate”) across much of Brazil but not in Portugal.
Upon hearing these, I suspect most people would draw the instinctive conclusion that they were in fact farther apart from each other than Dutch is from Afrikaans even though, in this case, they are deemed to be the same language – which brings us back to our Ulster Scots question…
English / Scots
Let us, therefore, compare Standard British English (based on educated Oxford/Cambridge pronunciation) with Traditional Scots (based on a fairly conservative pronunciation in rural Antrim, for the sake of argument):
- one (“wun”) / ane (“yin”); 2. two / twa; 3. three / three; 4. four /fouwer; 5. five / five; 6. six / sax; 7. seven / sieven; 8. eight / echt; 9. nine / nine; 10. ten / ten.
These are clearly pretty similar, but are they any more similar in practice than Dutch and Afrikaans, or European and Brazilian Portuguese?
Three of the first four in particular exhibit marked differentiation: “wun” was a dialect form which has taken over from “own” (as in “alone”, “lonely”) as the common pronunciation since Shakespeare’s time; likewise “yin” competes with “wan” in Scots. A different form of ‘two’ was taken forward in Scots ‘twa’ (‘w’ still pronounced followed typically by a long ‘a’) as it was Dutch twee. In traditional speech at least, Scots also retains a medial semi-consonant sound in fouwer.
There are other obvious differences too, of course, in the vowel of ‘six’ and ‘seven’; ‘eight’ is a tricky one as, in practice, few speakers now retain the medial ‘ch’ in echt (to sound like the medial consonant in Dutch/German acht) although it may be retained when reading literature aloud, for example.
We can see, therefore, that language versus dialect is a difficult call, at least linguistically. This is why it is rarely a linguistic decision!
What about different languages in time and space, then?
Latin/Italian
Spanish and Italian were quite close because they derive from Latin, of course. So if we go back to Golden Age Latin (around 2100 years ago) and compare it to modern Italian, what do we find?
- ūnus / uno; 2. duō / due; 3. trēs / tre; 4. quattuor / quattro; 5. quīnque / cinque; 6. sex / sei; 7. septem / sette; 8. octō / otto; 9. novem / nove; 10. decem / dieci.
It is worth assessing each of these sound changes going forward to Italian (but then doing something else…)
- ūnum was in fact the accusative form (feminine ūnam) taken down into daughter languages, in which the final -m marked a nasal, and this was lost over time leaving, in Late Latin, masculine unu and feminine una (thus Spanish and Italian uno, also feminine indefinite article una);
- duō is deceptive, as it was not as similar to Italian as it looks because the final (long) -o was stressed rather than the middle (short) u; the Italian emphasises the middle u and in fact derives from a “breaking” of that long -o into the diphthong ue (a process more common, in fact, in Spanish, even though it did not happen in this instance);
- trēs (also tria) lost the final -s in Italian (though not in most other Western Romance languages) ultimately probably for reasons of requiring a final vowel noted above (or perhaps those discussed last week);
- the initial qu- of quattuor was not held as long as it is in modern Italian – qu- marked a fairly short /k/ followed by just a hint of a ‘w’ glide…
- … and this was relevant to quīnque also, whose first qu- softened before the high vowel (as if written ci-, as it came to be) in Late Latin, but the loss of the final vowel or conversion to -o in other languages meant the second -qu- was retained as a hard c or q (i.e. /k/ or similar) in other languages (the final vowel was likely lost in Italian but then reinstated for the aforementioned reasons of requiring a final vowel);
- sex lost its final consonant in Italian (again, because it likes final vowels) and usually had it simplified to -s in other languages (even ultimately in spoken French, despite it retaining -x in writing);
- septem lost its final -m (as ūnum above) and also had a consonantal cluster in the middle which was simplified in most daughter languages to t, though Italian retains (and pronounces) the double in sette as evidence that there were once two separate consonants there;
- octō had a long final vowel in ancient times which is why it was widely retained; again, Italian simplified the cluster (as with septem) to tt, thus otto (cf. Latin noctem ‘night’, Italian notte) – Portuguese did this slightly differently to oito (also noite ‘night’) and Spanish to ocho (also noche ‘night’); French and Catalan did lose the final vowel and diphthongised the initial (perhaps to compensate), leaving huit which, in the standard language of Catalonia itself (though not in Valencia and not in some inland dialects) has in fact been re-strengthened initially to vuit;
- novem (again note also novum ‘new’) simply lost its final nasal, and also changed the pronunciation of the medial vowel from something close to English /w/ to equivalent to /v/, giving Italian nove (this one did diphthongise in Spanish, giving nueve, and also once upon a time in French, though it has since shifted again and the final consonant has become devoiced, thus neuf);
- decem shifted to dece as expected in Late Latin but the stressed vowel often diphthongised, thus dieci in Italian and diez in Spanish (actually also deu in Catalan; it was also heightened to dix in French).
We can see fairly clearly here how the languages interrelate through time, typically fairly regularly.
Latin/German
German does not derive from Latin, so why would we compare them? Of course, although they do not derive from each other, they do each derive from a common ancestor. We can see this in the numbers. Let us look at how:
- ūnus / eins; 2. duō / zwei; 3. trēs / drei; 4. quattuor / vier; 5. quīnque / fünf; 6. sex / sechs; 7. septem / sieben; 8. octō / acht; 9. novem / neun; 10. decem / zehn.
Look at these and see that yes, shifting the Romance languages back 2100 years to the time of Golden Age Latin does make them look a little more alike than, say, modern Spanish and German do.
Then consider a few common sound changes which occurred in Germanic languages:
- typically voiced d- shifted to unvoiced t- (note that perhaps th- had shifted to t- already in Latin);
- qu- (actually “k” followed by a slight “w”) shifted (perhaps via unvoiced “p”) to unvoiced “v” usually written f;
- complex clusters which shifted later to tt in Italian shifted in Germanic to v;
- medial consonantal sounds approximating <w> were often lost in Germanic.
Let us have another look. And then consider that in the dialects that became German there was a further shift:
- an ongoing chain shift saw th- (represented in Latin by t-) shift to d- and pre-existing t- shift to ts- (written z-), essentially meaning that versus Latin t- shifts to d- and d- shifts to z-;
- medial -v- shifted in some environments to -b-; and
- k had shifted to h which was then lost after vowels with compensatory lengthening.
Let us now apply all of that from the Latin:
- ūnus now varies only from earlier Germanic ainaz in the exact quality of its vowels and the voicing of the final consonant, but it is not hard to see that they are the same word and German eins descends from the latter;
- duō shifts to “tuo” or actually ‘two’ in Germanic (you can see now why it is important to emphasise that the final -o was stressed in Latin), which then via a further shift becomes zwo or zwei in German (zwo is the form often used in counting to distinguish from drei);
- trēs shifts the other way to dr- in German (drei; and thr- in English, hence ‘three’);
- quattuor becomes something like fedwor, then loses the d of the cluster to become fewwer, then loses that cluster altogether leaving vier (first letter pronounced as f- in all of German, Dutch and Afrikaans), similarly also English ‘four’ (but the cluster was retained in Shakespeare’s time, thus fouwer, and in Traditional Scots);
- quīnque is actually quite simple – as the qu shifted to f it became simply finfe, represented in modern German after loss of the final vowel (though this is sometimes retained in counting) as fünf (more northerly dialects lost post-vocalic nasals with compensatory vowel lengthening, hence Dutch vijf, English ‘five’);
- sex is still much the same in German (and similar in English, with just a minor vowel change to ‘six’);
- septem has the cluster simplified to –v- in Germanic (Dutch zeven) and then further to -b- in German sieben;
- octō lost its final vowel in Germanic but is otherwise much the same with just a shift in the initial vowel (German/Dutch acht; Scots echt);
- novem is the ‘new’ number, as explained here (Dutch is an outlier here as negen and subsequent Afrikaans nege is a reformation by analogy – the -g- really should not be there!);
- decem was something like “tai-hun” in the Germanic of 2100 years ago, with the initial devoicing and final nasal intact and the vowels not dissimilar but the medial consonant on the way to being lost – in German (which also exhibits the further t- to z- shift) it is retained in writing, but that now just marks the compensatory vowel lengthening in zehn.
As a final aside, the final element of de-cem and tai-hun looks awfully like the first element in Latin centum and English ‘hundred’. That is because it probably was: the original word from which centum derived likely initially meant merely a large number, or perhaps even specifically a square of the base number, or perhaps what became decem was the large number and centum was an even larger one. Ten squared would be a hundred, of course, although in fact the base number at the outset may well have been eight (hence the ‘new’ number).
After all that, however, it becomes quite evident that we can see patterns in the numbers not just in Spanish and Italian, or in Dutch and German, or for that matter in languages which are alike or the same, but even in Spanish and Latin compared to German. There is a simple reason for that – they are all ultimately the same word in the same language.
So what is a “language” and what is a “dialect” again?!