Monday, July 10, 2017

Tian Ji's horse racing and the electoral vote system

[The following was written on November 11, 2016.]

The author of the famous military strategy book The Art of War, Sun Wu, commonly known as Sun Tzu, had a descendent, Sun Bin, who also wrote a book with the same title. In ca. 340 BC, Sun Bin advised his patron Tian Ji at a horse racing event and won the race. The following is the excerpt from Sima Qian's Records of the Grand Historian about this interesting story:

齐使者如梁,孙膑以刑徒阴见,说齐使。齐使以为奇,窃载与之齐。齐将田忌善而客待之。忌数与齐诸公子驰逐重射。孙子见其马足不甚相远,马有上、中、下、辈。于是孙子谓田忌曰:“君弟重射,臣能令君胜。”田忌信然之,与王及诸公子逐射千金。及临质,孙子曰:“今以君之下驷与彼上驷,取君上驷与彼中驷,取君中驷与彼下驷。”既驰三辈毕,而田忌一不胜而再胜,卒得王千金。于是忌进孙子于威王。威王问兵法,遂以为师。
(The ambassador of the Qi state went to the Liang state. Sun Bin as a convicted criminal went to visit and talk to him secretly. The Qi ambassador regarded Sun as valuable and carried him back to Qi. Tian Ji, the Qi general, gave him a warm reception. Ji and some princes often betted heavily on horse racing. Mr. Sun saw that all the horses were about equally capable, rated superior, average, and inferior. So Sun advised Tian Ji, "Sir, you just bet heavily. I'll make you win." Tian Ji trusted him and betted a thounsand units of gold with the king and the princes. Right before the race, Mr. Sun said, "Use your inferior horse to race with his best horse, use your average horse to race with his inferior horse, and use your best horse to race with his average horse." After three rounds, Tian Ji lost one and won two of the three rounds, and carried away one thousand units of gold. Then Ji recommended Mr. Sun to the King Wei, who interviewed Sun on military tactics and assigned him as the Chief of Staff.)

Fast forward to 2016. We see that the electoral vote in the US presidential race matters while the popular vote does not and that the two votes mathematically represent two different winners in this 2016 presidential race. Although neither Hillary Clinton nor Donald Trump can move her or his supporters from one state to another, there is similarity between the electoral vote system and Tian Ji's winning strategy. If democracy is the name given to the principle of the minority obeying the majority, the popular vote is the only true democracy. (As of this writing, Clinton has won 60,274,974 popular votes, while Trump has won 59,937,338.)

The reasons for some people to decide to not vote are (A) equal dislike of the candidates; (B) lack of interest in politics; (C) living in a non-swing state, one person's vote matters little. Group C may be small. But it's the only one out of the three that would make a difference if the American electoral vote system were abolished or even mitigated (by adjusting the weights i.e. the electors assigned to different states, e.g.). If that happened, swing states would have lower voter turnout and non-swing states would have higher. But since there're fewer swing states than non-swing states, the total popular vote count would be higher.

Sunday, April 16, 2017

自由: "freedom" or "liberty"?

A Chinese reader asked me about the difference between "freedom" and "liberty" when translating Chinese "自由" into English. We can find many answers with a Google search for "difference between freedom and liberty". One article maintains that "Freedom is a state of being capable of making decisions without external control", while liberty "is freedom which has been granted to a people by an external control". And some like this laboriously attempt to make a clear distinction between these two words.

Having read a handful of such answers but not satisfied with any of these, I told the person asking me the question: 1. the etymology of the two words differs; 2. in general usage, "liberty" is more abstract and philosophical than "freedom". Other than these two points, there is no difference, but in different contexts, only one of the two words is more common. For example, nowadays we say "freedom of speech", not "liberty of speech". (But see the ngram figure in Appendix 1.) We say "Liberty, Equality, Fraternity", not "Freedom, Equality, Fraternity". These set phrases are by convention, just as in Chinese idiom "破釜沉舟" ("cut off all means of retreat", "decide to fight to death"), not "破釜沉船", even though "舟" and "船" are completely synonymous.

Making distinctions between words is so intriguing that someone has even built a Web site www.differencebetween.net dedicated to this task. Language professionals and general public alike are fond of writing articles on these topics. While many such articles are valuable contributions to the correct usage in English, there is one common deficiency not fully recognized: the judges are the native speakers of the language, not linguists or scholars. An age-old debate among lexicographers is relevant here: Should a dictionary be prescriptive, directing people toward correct or supposedly correct usage, or be descriptive, faithfully documenting the actual usage in the native speaker community? Nowadays there may be more dictionaries in the latter category, presumably consistent with the increased level of public education. In the case of "freedom" vs. "liberty", if enough people, not English-as-a-foreign-language learners but native speakers, ask the question about their difference, the very fact that they ask this is a sign that the distinction, if there is a theoretical one, hardly exists in practice. Instead of making a great effort to separate them, it would be better to acknowledge, in modesty, the lack of difference between them.

________________________

Appendix 1

This figure is the Google ngram showing the historical usage of "freedom of speech" and "liberty of speech". We can see that from the mid-19th century on, "freedom of speech" has significantly gained in usage over "liberty of speech". But before that time, it only had slightly higher usage frequency.

Appendix 2

Some Weibo users gave me a few helpful pointers on this topic. One user informed me that political theorist and philosopher Isaiah Berlin's Four Essays on Liberty used "freedom" and "liberty" interchangeably. Two other users directed me to political scientist Hanna Pitkin's Are Freedom and Liberty Twins? According to Pitkin, most people don't make a distinction between these two terms, but Hannah Arendt is an exception. However, the author questioned Arendt's distinction from the point of view of political science as well as etymology (see the bottom of p.6 and p.9 of the article).

Appendix 3

The prescriptive-descriptive dichotomy, however, only applies to everyday language usage. In academic fields, especially of science and technology, but to some extent, of social sciences and humanities as well, the "prescriptive" approach should be supported. Take osteoarthritis as an example. An educated English speaker would think this meant inflammation (-itis) of bone (osteo-) joint (-arthr-). But it is not. Then, should the distinction between "freedom" and "liberty", if non-existing in practice, be made in the academic circle as two different terms in social sciences or humanities, followed by educative admonition to the public about the research outcome? Scholars have the freedom of research and can make any distinction between any pair of words in their research. In fact, social scientists and particularly philosophers habitually do that. As to whether the distinction should be imposed to the public, No!

Monday, January 9, 2017

Comparison of Chinese and Western Etymology

In my last post, I said "Most languages in the world take the alphabetic writing system. Studying the internal history of its vocabulary primarily means analyzing phonological and morphological changes through time." In this post,[note1] I'll expand on that point and contrast that with the Chinese tradition.

Take the word language as an example. In English, we read

late 13c., langage "words, what is said, conversation, talk," from Old French langage "speech, words, oratory; a tribe, people, nation" (12c.), from Vulgar Latin *linguaticum, from Latin lingua "tongue," also "speech, language," from PIE *dnghu- "tongue" (see tongue (n.)).
The -u- is an Anglo-French insertion (see gu-); it was not originally pronounced. Meaning "manner of expression" (vulgar language, etc.) is from c. 1300. ...

Source: Online Etymology Dictionary

In Spanish, we have

idioma m. language. [LL. idiōma: id. <Gk. idiōma: peculiarity (as lang.) <idiousthai: to make one's own <idios. See idio-.]; idiomático,ca a. idiomatic. [Gr. idiōmatikos: particular.]
Source: A Comprehensive Etymological Dictionary of the Spanish Language with Families of Words based on Indo-European Roots by Edward A. Roberts, 2014.

And most importantly, in French, we have

LANGUE, sf. a tongue; formerly lengue, from L. lingua. For in=en=an see § 71, and Hist. Gram. p. 48. — Der. langage, languette.
Source: An Etymological Dictionary of the French Language by Auguste Brachet, 1882.

The reason for my praise "most importantly" is that Auguste Brachet, the "romanistischer Autodidakt"-turned-professor according to (German) Wikipedia, created a monumental masterpiece in not just French etymology but etymology in general. In addition to what a regular etymologist would do, such as tracing the word form to its etymons in the same or other languages, Mr. Brachet systematically summarized the rules of the morphological and phonological changes and applied them to individual words in his dictionary. In the said example, he noted that for the derivation of in < en < an in the development of Latin lingua to French langue, the reader can consult his rule 71 in the book, where he says

I in Latin position [i.e. "when followed in the Latin word by two consonants" according to him, a convention not exactly the same as adopted today; my note] is changed to e in Merovingian Latin: thus fermum, ..., for firmum, ...' and this e, pronounced ei (see § 66), has produced two distinct French forms, according as it has preferred the open è sound, or the i sound.

You can choose to follow up to rule 66 in this book and p.48 of his A Historical Grammar of the French Tongue for more information about these sound (phonological) as well as spelling (orthographic) changes.

Western etymological publications may be divided into two groups: (1) dictionaries that give etymons or source words; (2) scholarly books and research articles on phonological and orthographic changes over time. Mr. Brachet's dictionary is unique in that it merges the two into one, so that the reader is conveniently offered the explanation of sound changes right in the headword entry, obviating the need to research as to why, e.g., the first i in *linguaticum would change to a in the history of the English word language.

However, a word contains more than its sound and spelling, but its meaning as well, which etymology cannot avoid tracing. But as linguist Calvert Watkins warned us, it is "more hazardous to attempt to reconstruct meaning than to reconstruct linguistic form". Sense development is much less researched and also less described in dictionaries. Unlike phonology, semantics or the study of meanings of words is not easily subject to formal (as in "formal logic") structural analysis. And yet tracing the sense development is the primary task of Chinese etymology. Chinese phonological development is a separate field of study; it is not incorporated in etymology, because the meaning of Chinese characters (or words, whose meanings are almost always based on the component characters) is largely dissociated from the sound. Take the character 文 ("text") as an example.


Source: 谢光辉《汉语字源字典》, 北京大学出版社, 2000年, 29页
Translation of the embedded text: "文" is a pictographic character. "文" in oracle bone script (甲骨文) and bronze inscription script (金文) resembles a standing person facing forward. His chest bears tattoo of decorative patterns. This is in fact a vivid description of the ancient "文身" (tattoo) custom. Thus the original meaning of "文" was a person with tattoo on his body, as well as pattern, texture. Later, the meaning was extended to character, article, culture, civilization etc.

That was a typical entry of Chinese character etymology. For simple characters especially pictographic ones, it is simply pure 依类象形 or description of the object according to what it looks like. The focus is on the meaning, not the reading or sound. Some more complicated characters may be decomposed into elements each of which is analyzed the same way, as in the case of "秦" (see my last post).

Needless to say, the majority of the characters (at least 80%) are of the type 形声字 or characters of form and sound, such as "指" (finger; to point), where the form radical "扌" suggests the meaning, i.e. something related to hand, and the sound component "旨" suggests its reading , i.e. zhǐ. The classical Shuowen Jiezi (说文解字) dictionary, unsurprisingly, points out that this character "从手旨聲" (the meaning is based on "手" and the sound on "旨").

Similarities and differences between Chinese and Western etymology can also be revealed from the definition of the word etymology itself. The Webster dictionary defines it as "the history of a linguistic form (as a word) shown (1) by tracing its development since its earliest recorded occurrence in the language where it is found, (2) by tracing its transmission from one language to another, (3) by analyzing it into its component parts, (4) by identifying its cognates in other languages, or (5) by tracing it and its cognates to a common ancestral form in an ancestral language" (I added the parenthesized numbers). Thus we see that most western dictionaries with etymological information meet the requirements (1) and (2), sometimes (3). Wiktionary and Friedrich Kluge's An Etymological Dictionary of the German Language also meet (4) and (5) most of the time. What if we apply these requirements to Chinese character etymology? (1) is often met if we interpret it as finding the first occurrence in history, which nowadays is made drastically easy with the aid of a computer-based search. But tracing its development in the course of long history, either inside Chinese or (2) across different languages, is rarely done. (3) is done, though with significant differences from that in western languages. (4) and (5) are rare because they're mostly irrelevant to Chinese characters.

How is analyzing a Chinese character into its components special compared to the western tradition? While a character e.g. "指" can be analyzed into "扌" (for meaning) and "旨" (for sound), there is no systematic change of a component from one form to another. Take rule 126, one of the many summarized by Mr. Brachet for French, as an example, "Before a, initial c ... passes through the successive aspirated sounds k'h, tk'h, kch, ch." He supports this rule of ca- > ch- with about 80 words as evidence, champ < campus, chien < canis, etc. Can we construct an analogy of this rule and find supporting examples in Chinese etymology? Since Chinese does not use an alphabetic writing system, there's hardly any need in dealing with the sound change of a character in etymology. Instead, we may substitute the change in form of a character. For example, after studying the 金文, 小篆, and 楷体 forms of "指" and other characters with "扌" on the left side, we may conclude that all (or most) such characters have gone through the predictable change of this radical in these forms, just as the French ca- changed to ch-. Similarly, all or most characters with "旨" on the right side probably went through the same change as shown here (see the row for 字源演变). Thus, we find in etymological studies a parallel between Chinese and western languages in identifying common component change in characters or words.

However, Chinese etymological dictionaries are also interested in finding the "root cause" of the most basic characters. Because the characters are ultimately from pictographs in origin, this "root cause" finding is mostly "依类象形" (describing the object according to what it looks like). If we must find a parallel for this practice in western etymology, it is equivalent to answering the question why e.g. the Proto-Indo-European stem from which Modern English word word is ultimately derived is *were-, that is, why that sound. Obviously, except for some onomatopoeias, there is no answer, or no such research. While Chinese etymologists have forged ahead in that direction, so far this "research" is, I'm afraid, very much based on guess work, simply because there is no record left in history about why a specific character was invented to be of that form. "文" may indeed be a symbol for a person with tattoo, with no hard proof anyway. But this is too error-prone. In my last post, I quoted the article 许慎为何将象释成母猴——“为”字趣释 (Why did Xu Shen interpret an elephant as a female monkey: interesting interpretation of character "为"). In a recent weibo blog post, a scholar interpreted, purely based on its resemblance, "夷" in its original oracle bone script as a person squatting, while in 《汉语字源字典》 (Dictionary of Chinese Character Etymology) by another scholar in this field, it was thought to represent a man bound by ropes, to be served as a slave or for sacrifice. On this stretch of imagination, I have but one comment: "汉字字源,看图识字,见仁见智" (Chinese character etymology / Look at pictures and learn to be literate / Trust your opinions and beliefs).

________________________
[note1] Due to the unique nature of the Chinese language, etymology can be of characters as well as words. This post is about character etymology.

Sunday, September 4, 2016

Why is it rare to see Chinese etymology?

People speaking English as the native language are used to dictionaries in which each headword contains not only the definition of the word and example phrases or sentences, but also brief etymology, as in this example in the Merriam-Webster dictionary for the word word.

Middle English, from Old English; akin to Old High German wort word, Latin verbum, Greek eirein to say, speak, Hittite weriya- to call, name
First Known Use: before 12th century

A Chinese dictionary, on the other hand, almost never gives the etymology. In this blog posting, I'll try to explain why.

For the sake of discussion, we need to make a distinction between two types of Chinese dictionaries. Due to the nature of the Chinese language, the English word dictionary (or its equivalent in most other languages) can mean either "字典" (literally "character-dictionary") or "词典" also written as "辞典" (literally "word-dictionary") in Chinese. I have not seen a dictionary for general Chinese words published by anyone that contains etymological information for the headwords.[note1] Thereinafter, a Chinese etymological dictionary only refers to a character-dictionary.

The disappointment at lack of an etymological dictionary of Chinese words does not extend to that for a dictionary of Chinese characters or 字典. Back in the Eastern Han dynasty (25–220 AD), the scholar Xu Shen (c. 58 – c. 147 CE) wrote the monumental dictionary Shuowen Jiezi (literally "Explaining Graphs and Analyzing Characters" according to Wikipedia). Since Xu lived in a period only one thousand or less years after a large number of Chinese characters were invented, the etymology he gave in the book for each of the 9000 plus characters is mostly trustworthy. Take the character "秦" (qín) as an example.[note2]

伯益之後所封國。地宜禾。从禾,舂省。一曰秦,禾名。𥠼,籒文秦从秝。匠鄰切
(The fief given to the descendant of Boyi. The land is suitable for crops. The character has a meaning based on "禾" ("crop") and contains an abbreviation or syncope of the character "舂". Another theory claims that this character is the name of a crop. This character in Zhouwen script [a script used just before the time of the First Emperor], "𥠼", is based on "秝". Pronounced with the initial consonant of 匠 combined with the final of 鄰.)

This is an excellent example of Chinese character etymology; it not only describes the source of the character but also analyzes the morphology or form of the character, as evidenced by the construction of "秦" through "禾" and part of "舂". The significance of Xu's book in the history of the Chinese language is such that almost two millennia later, scholars are still using his book in research. The only major revision came after the 1899 discovery of oracle bones, which the Shang dynasty (c. 1600 BC–c. 1046 BC) people used for divination. The oracle bone script predates Xiaozhuan script, the primary source for Xu Shen's character etymology because the latter is the earliest script known to Xu. Owing to this gap of knowledge, Xu inevitably made numerous mistakes in his otherwise near-perfect dictionary. One good example can illustrate the point. In the article 许慎为何将象释成母猴——“为”字趣释 (Why did Xu Shen interpret an elephant as a female monkey: interesting interpretation of character "为"), the author explained how the simple character "为", meaning "for" or "to do" nowadays, evolved from the oracle-bone pictograph depicting a man holding an elephant leash but mistaken for a female monkey by Xu Shen. (By the way, elephants indeed roamed around middle and northern China three thousand years ago, but the species was not the same as in southern China or India today.)

With all the background information, now we may answer the question why it is rare to see Chinese etymology. By that I don't mean you can't find character etymology at all. Books such as 《汉语字源字典》 ("Dictionary of Chinese Character Etymology") and the Web site Chinese Etymology by Richard Sears are available. But this is almost never incorporated into a Chinese dictionary other than a specialized etymological dictionary. If a general English reader is not more academically inclined than a Chinese reader, why does a common English dictionary such as the Webster, American Heritage, or OED (Oxford English Dictionary) include etymology without hesitation? The reason may be that Chinese (character) etymology almost never helps a reader in studying the Chinese language due to the long history and evolution of the character. (Can you stretch your imagination far enough to associate the scene of a man and an elephant with the sense of "for" or its slightly older sense of "to do"? See above.) In addition to the long history, I believe there's another, more subtle, element in clouding the Chinese etymology. Most languages in the world take the alphabetic writing system. Studying the internal history of its vocabulary primarily means analyzing phonological and morphological changes through time; e.g., there was a systematic change of f to h in Spanish for a large number of words. Secondly, less conducted is the semantic evolution of words; it's less done because it is "more hazardous to attempt to reconstruct meaning than to reconstruct linguistic form" as linguist Calvert Watkins said. And yet, the Chinese characters rarely went through systematic morphological changes that apply to a large number of characters and, since Chinese is not based on an alphabetic writing system, phonological changes are not conducive to the study of etymology per se. This leaves a large part of Chinese etymology to the study of semantic evolution, which is, as stated, more error-prone in scholarly reconstruction.

There is another reason for not incorporating etymology in Chinese dictionaries. Many characters originate from pictographs or pictograph-like glyphs such as Xiaozhuan script. Publication has to render them as images instead of text, which is an editorial inconvenience. The images with their explanatory texts take a significant amount of space relative to the definitions and examples in usage, which a regular user cares more about. This is in contrast with the etymology in an English dictionary, which can be made brief and still makes sense to the minority of interested readers. And yet a third reason may be that it's just the custom of Chinese lexicography, i.e. no etymology except in specialized dictionaries. This is probably also the reason why dictionaries of other languages than English lack etymology. (Try to find etymology in any dictionary of Spanish, French, German or Italian in a bookstore or library!) But nobody knows the original cause or reason for this custom.

Therefore, unlike a language where a student may make use of etymology in vocabulary study optionally combined with some mnemonics (as demonstrated in my book for Spanish), the Chinese characters have to be studied in a different way. Etymology comes in handy only for the very first few characters, such as "火" ("fire"), "山" ("mountain"), which are frequently used to impress complete beginners. After 10 or 20 such "pictographs", rote memory is commonly adopted, but books such as Tuttle Learning Chinese Characters that laboriously make up mnemonics are helpful. Fortunately, a large portion of the character repertoire consists of characters combining two parts, one more or less representing its meaning and the other representing the sound. However, in none of these cases would etymology play any role.

___________________________________
[note1] By emphasizing "general", I'd like to point out that a special group of Chinese words, 成语 (idioms), are an exception, in that dictionaries of Chinese idioms almost always give the first occurrence of the idioms and sometimes even briefly describe the sense development as well.
With regard to dictionaries of words in general, one may think of the book 《辭源》, literally "word origin". First published in 1925, it takes a misleading title because it's no more than a dictionary (albeit of high-quality) of Chinese words with no etymology. In fact, even if we take an alternative interpretation of "辭源" as "first occurrence of word", this book fails as well; e.g., the entry for "中国" does not list its first occurrence in the Book of Documents, or the bronze inscription which the Book records. Another book we can even more readily dismiss is the 《詞源》 by Zhang Yan in the Song dynasty because the book is on the subject of the literary genre , not "words".
[note2] Incidentally, the character "秦" is significant in that traditionally many scholars including Paul Pelliot believed that it is the ultimate source for the word China in many languages in the world, although more recent research attributed the origin to "晋". Two other sources of the word referring to China are Khitan as in the case of Russian, and silk.)