Thursday, February 28, 2019

Mutual intelligibility in writing only

Mutual intelligibility is one important way to differentiate a language from a dialect, in spite of some complications. One such complication is the separation between writing and verbal intelligibility in the language varieties spoken in China; people in many regions of China may pronounce the same characters so differently that they can communicate with each other only by writing and not verbally. How do we define or measure mutual intelligibility in writing only (MIW hereinafter) in general? Here I describe an experiment that may serve as a starting point. Two people with high school or more education who natively speak language varieties A and B, respectively, but not both (if A and B are different), are subject to a test. Each person reads 100 sentences in normal speed randomly selected from the entire corpus of Modern A or B. (As an approximation to the entire corpus, take the Internet and book content indexed by Google as an example.) Each sentence is followed by 10 interpretations given in the language variety the other person understands, and he (she) chooses the correct one (10 choices instead of 4 or 5 just to reduce the random guess correctness). Then repeat the test switching the two people along with their respective language variety. If >=50 sentences are correctly understood, A and B are excluded from MIW. If it's <50, they are further subject to a test in which 100 sentences selected from the entire corpus are shown in writing. If >=90 sentences are correctly understood, we consider varieties A-B a case of MIW.

Thus, Sichuanese-Mandarin will be disqualified because they can be verbally communicated (and of course with written script). But Shanghainese-Mandarin, Hunanese-Mandarin, Shanghainese-Hunanese are good examples of MIW. Cantonese warrants more discussions. It's obvious that the Cantonese-Mandarin (or -Shanghainese etc.) pair has no verbal MI. There are grammar particles, pronouns and some common words unique to Cantonese. When a literate person who natively speaks Cantonese but has not learned the written Chinese in the way Chinese is taught in mainland China writes in Cantonese, can the writing be understood with >=90 correctness by one speaking Mandarin only? Suppose the content is absolutely randomly selected from the entire Cantonese corpus, and is not purely colloquial and definitely not contrived to contain a disproportionately high ratio of Cantonese-specific markers or characters. I don't know the answer, and an actual experiment is needed. One example of such a written script in Cantonese is a Wikipedia page. I personally don't know Cantonese and I may or may not be able to correctly answer 90 out of 100 questions in a reading comprehension test. Note that Cantonese is special in that many native Cantonese speakers do read Chinese text proficiently, although mandarin or other Chinese dialect speakers don't read Cantonese text (such as that Wikipedia page), creating asymmetric intelligibility, which is quite common in the world. Thus, when these two people try to communicate by writing, the preferred script they choose will be Chinese, not Cantonese. In discussing MIW, we should define two levels, one only allowing the written script to be the textual representation of the spoken language (e.g. Cantonese text for Cantonese speech), the other allowing the two people to choose whatever their preferred script is. Technically, we should limit MIW to the first case.

According to Wikipedia, Icelandic-Faroese and German-Dutch are MIW pairs. (The article also lists French-some Romance languages but does not give a good reference to support it. To my knowledge of a few Romance languages, this pair is invalid.) Based on a posting in Facebook Linguistics group, the following are additional language varieties that are candidates for MIW:

* Scots-English
* Many languages in South Asia with Sanskrit roots
* Swiss German-Standard German
* Hanoi Vietnamese-Southern Vietnamese (but highly disputed in the Sinosphere group whose members are mostly Vietnamese)
* Danish-some other Scandinavian languages

Note that I'm dealing with MIW between language varieties, a concept encompassing dialects within a language as well as languages, styles, registers, etc. While MIW within a language may not be limited to Chinese, it's probably safe to say the Chinese language has the most MIW pairs among its dialects, due to the dissociation between the pronunciation and the written form.

Saturday, August 4, 2018

"below" is not an adjective

In a technical discussion forum about databases, someone posted an off-topic message: "to all Oracle staff, this phrase is not English: 'follow the below steps' How does this slip into Oracle Support tech note documents". Indeed, the English word "below" should not be used as if it was an adjective (see e.g. Wiktionary). But I've seen this incorrect usage for 20+ years especially in the IT industry. In the beginning, it mostly occurred in messages written by people with Indian-like names. Nowadays, Chinese or other ethnicities as well.

In any case, instead of saying "the below steps", we should say "the following steps", or "the steps below" (implying "located" before "below"). I'm guessing the adjectival usage of "below" is probably due to influence from the antonym "above", which *can* be used as an adjective as in "the above steps".

In light of the descriptivism vs. prescriptivism debate in which the latter has slowly lost ground in the past century, some people may argue that as more and more people start to use "below" as an adjective, this usage may eventually become accepted; after all, language evolves with the way it's spoken by the people. In fact, Merriam-Webster has already acknowledged this usage, after adverb, preposition, and noun. But for now, the majority of the native speakers and no other English dictionary consider this usage acceptable. It's wise to be standard-compliant and stop saying "the below steps".

(A good discussion is found on Daily Writing Tips.)

Friday, July 6, 2018

Basic Chinese Characters

I finally finished my little book Basic Chinese Characters. It contains 2500 commonly used Chinese characters selected by the Ministry of Education of China, with pinyin and definitions manually added by me. The book sorts the characters by frequency usage according to Google's estimate of occurrences of each character on the Internet (a method only I used and probably I invented). Some more descriptions of the book, plus sample pages, are at yong321.freeshell.org/bcc/. The book is available on Amazon as an e-book.

The book is in the format of character - pinyin (tones marked with numbers) - definition. For example,

1-99
二 er4 two
三 san1 three
四 si4 four
六 liu6 six
七 qi1 seven
零 ling2 zero
本 ben3 notebook; (measure word for books etc.); 本来(lai2) originally
日 ri4 sun
所 suo3 (function word, roughly “that which”); 所以 therefore; bureau
下 xia4 down, below; to go down
...
1200-1299
止 zhi3 to stop
脆 cui4 brittle, crispy
诞 dan4 birth
碍 ai4 blocking, hindrance
散 san4 to scatter, to disperse; scattered, loose (w.p. san3)
兽 shou4 beast
逝 shi4 to drift away; 逝世(shi4) to pass away, to die
猪 zhu1 pig
暂 zan4 temporary
腊 la4 preserved meat

Free offer If you as a reader of this blog are interested in this book, for a limited time, I can selectively offer this book for free on one condition and one wish. You must not share my book with anyone else. If your friend would like a copy, please have him or her contact me directly. But since there is no technical way to enforce this requirement, I can only trust you as on a verbal agreement. In addition to this requirement, I sincerely hope you can write an honest review and post it to the Amazon.com website, or if not feasible, on Goodreads. You can request a free copy by sending me an email at yong321@yahoo.com. It would be nice if you could tell me one or a few book reviews you previously wrote on Amazon.com.

Irrespective of any interest in the book, if you have any comments, suggestions, or corrections, please let me know. They are highly appreciated.

Monday, April 30, 2018

Ludwig Feuerbach and the End/Outcome of Classical German Philosophy

The 200th anniversary of the birth of Karl Marx (May 5, 1818 - March 14, 1883) is coming soon. This great thinker is one of the very few that have had profound influence over human history. His numerous works, along with those of his close friend and also great thinker, Friedrich Engels, have been translated into dozens of languages and meticulously studied around the world. This short posting is about one single word in the title of Engels' book, Ludwig Feuerbach and the End of Classical German Philosophy.

In the 1980's, I read about disagreement with the Chinese rendering of the word, “终结” (literally "end", "termination"), in a Chinese article. If my memory serves me right, the author of the note was 朱光潛, a renowned scholar and philosopher in China. He argues that, as the original German title "Ludwig Feuerbach und der Ausgang der klassischen deutschen Philosophie" uses the word "Ausgang", literally "exit" or "outcome", there's no reason to change it to "end" in English or “终结” in Chinese, which is obviously different in meaning. Since both Wikipedia and Marxists.org use the word "end" in English and only a small number of websites on the Internet use the word "outcome", I had some email exchanges with a knowledgeable volunteer on Marxists.org, Ben, partly duplicated as follows:

Ben:
it always struck me as strange that this has always been translated as "end" - maybe it was a result of a certain "Stalino-Hegelian" teleology, which infected the movement in the 20th century? 'Ausgang' would probably be better translated as 'denouement' (as in a novel or play) or, as you suggest, "outcome".
Communist greetings

Me:
If it was the result of "Stalino-Hegelian" teleology, why would scholars in the English world be affected, as would the Russian and Chinese translators, which is understandable? British or American translators don't need to go through Russian and Chinese sources to do the German-to-English translation.

Ben:
I think it is worth bearing in mind that the project of translating Marx and Engels into English was also overseen by mainly Soviet funds and Soviet-type scholars. I am not suggesting that they have not done an outstanding job (of course they have!) but am merely pointing out that ideology and outlook cannot *but* find reflection in translation work and rendering somebody's thoughts into another language.
Communist greetings

Me:
I wanted to confirm that Russian translators were responsible for the popular English translation "end" but couldn't find definitive evidence. According to the translation by Foreign Language Press in Beijing, this 1976 English translation in China is based on the 1951 edition by Foreign Languages Publishing House in Moscow. Then I found an earlier one, published in 1946, Ludwig Feuerbach and the End of Classical German Philosophy, by Progress Publishers, which according to Wikipedia "was a Moscow-based Soviet publisher founded in 1931. It was noted for its English-language editions of books on Marxism-Leninism".

As we can see, in the English translation as early as 1946, the Moscow edition already used "end" for "Ausgang", as if Engels was announcing the death of classical German philosophy. A good description is in fact given by the last link, i.e. "Engels considered this something of a summation or closure of the post-Hegelian criticism Marx and he had initiated in The German Ideology 43 years before." Note that the words "summation", "closure", although not literally matching "Ausgang", are a good paraphrase of it.

The Wikipedia page also gives the title translation in other languages. French uses "fin", Portuguese "fim", Japanese "終結", and Russian "конец", all meaning "end". It's a small surprise that all these semi-official translations in various languages somewhat deviate from the German original.