Characteristics of Languages
In the early 1990's, I was an English translator at the government-designated agency to translate United Nations documents in China. When we received a document in a foreign language other than English, we passed it on to those translators that know the language. The challenge was to identify the language so we would know who to assign the document to. We had plenty of books and dictionaries, but no computer database or software for this purpose, and of course no Internet. It didn't take me long to figure out some patterns in various languages and create the following table, which was seen on other translators' desks soon later. With this table, you can easily identify the language given part of a document. Nowadays though, translation software such as Google Translate obviates the need of consulting this table if language identification is all you want.
Language | Common Words | Characteristics | |||
English | the | of | a(n) | and | |
Spanish (español) |
el*,la(s),lo(s) | de(l) | un,uno,una | y,e |
|
French (français) |
le(s),la,l'*(also see Italian) | de,d',du,des | un(e) | et* (also see Latin) |
|
German (Deutsch) |
der,den,des,das,die,dem | von (also for "from") | ein(e) | und* |
|
Portuguese (português) |
o* | de,da | e |
|
|
Italian (italiano) |
il,i*,la,le,lo,l' | de,di,dei,del | un,uno,una | e |
|
Norwegian (norsk) |
til | en,ei,et | og* |
|
|
Swedish (svensk) |
de(n),det,dem,dom,dens,deras | en | och* |
|
|
Dutch (Nederlands) Flemish (Vlaams) |
de,het* | van | een | en |
|
Danish (dansk) |
de(n),det,des,desto | en,et,én,ét | og* |
|
|
Polish (polski) |
i |
|
|||
Hungarian (Magyar) |
a | és,meg |
|
||
Vietnamese (tiẽng Vìệt) |
va |
|
|||
Turkish (Türkçe) |
bir | ve* |
|
||
Latin | et |
|
|||
Swahili (Kiswahili) |
na |
|
Note
1. * indicates a very important characteristic.
2. If the characteristic letter is composed of a regular letter plus a symbol above or
under it, then the regular letter also exists. E.g., Spanish has "ñ" then there is
also "n".
3. Brackets after typical words contain their English translation.
4. If a hyphen precedes a typical letter combination, this combination is a word suffix;
if it follows it, a prefix. A parenthesis indicates the hyphen is optional.
5. (Outdated by today's software) If you still can't identify the language by using this table, please try Doug Beeferman's Stochastic Language
Identifier. Failing that, read Kenneth Katzner's excellent book, The Languages of the World, Routledge; 3rd ed., 2002.
To my Miscellaneous Page