I recently wrote about being amazed by how many characters my daughter learned in a year of Chinese elementary school. I’ve got a lot of thoughts on that, and it’s a great way to highlight the difference between “first language acquisition” and “second language acquisition,” as well as the difference in respective study materials. But first, I just want to share just the lists of characters and words covered in the textbooks of the two semesters of first grade in China. (Otherwise, I’ll never get this stuff done!)
The following word lists come from this 语文 (Chinese language) textbook series, the standard set approved for all Chinese children by the Chinese government in 2018 (and published by 人民教育出版社):
The book on the left is for semester 1 (上册), and the book on the right is for semester 2 (下册).
In the images to follow, the characters in the 写字表 (“Character Writing List”) are all words the kids need to learn to write, even if some of them initially appear in a 识字 (“Character Recognition”) section of the textbook, and some of them first appear in other sections.
Grade 1: Semester 1 (Character List)
Grade 1: Semester 2 (Character List)
Grade 1: Semester 2 (Word List)
This isn’t a comprehensive list of all the words that could be made (or even were covered) by the characters learned in the second semester of first grade. It’s more a list of words that can be formed with the new characters learned and were covered in class. Single-character words are not included in this list. (Note: just perusing this list, you will notice that even in first grade, certain words appear that you would never teach a non-native beginner learner.)
Apologies for the iffy quality… the scanner was acting up. All the characters should be clearly legible, though.
I’ll follow up in a future post with some of my thoughts on all this. I also plan to convert these lists to nice electronic text formats (or maybe just find a place to download them), but if someone else does it first, please share!
In the meantime, beginners, do not despair! You’re not a child, and you won’t learn like one, but you can still learn Chinese. Just differently.
My daughter has just finished first grade in a Chinese elementary school. I’ve been absolutely blown away by how many characters she has learned in her first year (that’s a topic of an upcoming post).
Just the other day, we were having a conversation (mostly in English) about what characters we thought were “hard.” It was interesting getting her perspective, because it was totally different from mine. We didn’t agree at all on which ones were “hard.”
That’s when I brought up the (non-standard) Chinese character “biáng,” a ridiculously complex character used only to write “biángbiáng 面” (a kind of noodle). Anyway, she loved it, and after writing it a few times, can now write it from memory, and it actually looks pretty good.
Please excuse the “proud dad” nature of this post… I’m actually more blown away than straight-up proud. No one even encouraged her to learn to write this character. But here’s her writing the character from memory (bad video quality… sorry):
And here’s the finished product, after she added a bit of extra text to the top and bottom:
Note: computers cannot display this Chinese character. It’s often written in pinyin, and even when it appears on menus in China, it’s either handwritten or some weird mismatched pasted-on character.
And yes, my 7-year-old’s Chinese handwriting is already better than mine. It only took one year.
This ad is hanging in Shanghai’s “Cloud 9” (龙之梦) shopping mall:
First of all the repeating character is 鹅, which means “goose.” In the circular logo, you can see a little characterplay going on with the goose head.
Above that, you have “鹅，鹅，鹅” which, of course, reads “goose, goose, goose.” This is a famous first line of a classical Chinese poem. It’s famous because it’s so simple, so a lot of kids memorize it as one of their first (if not the first) classical poems committed to memory.
The word is 扣子, meaning “button” (the kind you sew onto clothing). In Chinese, the kind of button you press is a totally different word, and even has the verb for “to press” as the first character: 按钮. (When you think about it, it seems kind of dumb that we use “button” for both of those things in English. Sure, you can say “push-button” in English, but it still feels to me like whoever decided to use the word “button” for the new kind that you press wasn’t super bright…)
Way back in 2015 I recommended the Outlier Dictionary of Chinese Characters for Pleco. It’s been a while, but the team has been busy. They’ve been continuously adding to their character dictionary, and they’ve also created a video course for self-learners that want to learn Chinese characters by the Outlier method.
I don’t routinely plug other products, but this is one I really believe in. These guys know what they’re doing, and they are utterly dedicated to their cause. You may notice in the video that they’re not exactly “entertainers,” but they don’t beat around the bush and they do know what they’re talking about!
The name of the service is 盟盟 (and apparently all the good domain names have been taken for that one). You can see how the “盟” character blends nicely into the drawing of the ship.
But no, the brand has nothing to do with ships or cruises or whatever… So while the characterplay looks like it kind of works, the picture really has nothing to do with what 盟盟 is all about: franchising (加盟) other brands.
This ad (spotted in the Shanghai Metro) is interesting for a number of reasons:
What caught my attention was the font. “Blocky” (sometimes pixely) fonts are quite common, but I’ve never seen one so “spaced out” like this before. Yet the word 流行 (“popular”) is clearly visible.
And I didn’t even realize it at first, but the word 流行 is also written backwards! This is not something I have seen before, and I’m not sure what the intended effect is. (Maybe if I were a fan I’d get it?)
Nice of the poster to include the pinyin for 流行 (liúxíng), though!
Chris Lee is the English name for Li Yuchun (李宇春), which some of the older “China watcher” crowd might remember for her rise to prominence on the popular “Super Girl” singing competition in 2005.
Over Christmas in Florida, my dad gave my son (then not quite three years old) a set of wooden blocks. What’s interesting is that these were Chinese character blocks, featuring both Chinese characters and their stroke orders. And these were made in the USA! (Must have been one of the few toys my kids got for Christmas that wasn’t made in China, and it was a set of Chinese character blocks.)
We couldn’t fit such a heavy toy in our luggage on the trip back, so my dad shipped them over for us: Chinese blocks, made in the USA, shipped from the USA to China. Irony level up!
These are apparently “Uncle Goose” brand wooden blocks, but the ones we have look different from what’s on Amazon. I like ours better… they look like these, and have a map of China on the back, as well as a cool dragon and stuff.
In Shanghai’s Changning Raffles City (长宁来福士) mall, there’s a new bookstore called Yan Ji You (言几又). Does that name strike you as weird? It should. It’s a weird name. 言 is used as a word by itself mostly just in classical Chinese, but then the use of 几 (grammar points here) and 又 (grammar points here) also don’t make sense. What’s going on?
It’s not obvious, but the name as a deconstruction of the character 设, as in 设计, the word for “design.” When you break down the character 设, you can break it into “left-right” structure (⿰) first, giving you 讠 and 殳. Then 殳 can be further broken down as a top-bottom structure (⿱), giving you 讠, 几, 又. But 讠 (the “speech radical”) is just a simplified stand-in for 言, and 言 looks way better as a stand-alone character anyway, so the end result is 言几又.
“I see” is a children’s dance studio in Shanghai. Here’s the logo:
Can you read the Chinese name? Hint: the first character is not just 火. (It’ll be much easier if you’re already familiar with the Chinese names of some popular fairytales. Or even if you’re familiar with the Chinese names of Disney animated classics.)
In this case, we’re dealing with the name 灰姑娘, literally, “ashes girl,” which is the Chinese name for “Cinderella.”
Confession time! I think it wasn’t until I learned the Chinese name for Cinderella that I even realized that the “Cinder” part of “Cinderella” was a reference to ashes rather than being kind of like a cross between “Cindy” and “Stella” with a random “er” thrown in for style.
According to Wikipedia, the “cinder” part “has to do with the fact that servants… were usually soiled with ash at that time, because of their cleaning work and also because they had to live in cold basements so they usually tried to get warm by sitting close to the fireplace.”
Anyway, it might be easy to miss that the dancer in the logo next to the 火 character is part of the larger character 灰.
I’ve been dealing a lot with clients’ Chinese character issues, and happened to stumble upon this Quora answer of Brendan O’Kane’s to a question about the origin of the character 奶:
Chinese speakers believe a lot of things about their own writing system, many of them untrue. One of the deepest-rooted and most pernicious of these false beliefs is the notion that characters have meaning. They don’t. The Chinese language [simplifying here; feel free to replace with “Chinese languages,” if you prefer] was spoken long before it was ever written, and has been spoken fluently throughout its history by far more people than have been able to write it fluently. The modern components of a character are not a reliable guide to either the meaning of the character or the early forms of a character, and the characters that make up a word are not necessarily a reliable guide to the meaning of the word. A lot of the stuff referred to as “etymology” in Chinese would more accurately be described as “stories about pictures” — cute, and occasionally helpful for memorization, and sometimes even sort of accurate, but mostly no more truthful than the old story about the English word “sincere” coming from Latin “sine cera,” “without wax,” or about “history” being “his story.”
Lots of interesting ideas here, and Brendan is spot on. And although “Chinese speakers believe a lot of things about their own writing system, many of them untrue,” that doesn’t mean you shouldn’t learn much of what Chinese speakers believe about their language (and writing system). In fact, you kind of have to. That’s culture. It’s like learning about all the ways that “America” is “the land of the free,” even if you don’t believe that the U.S. is that great bastion of liberty. What a people believes about its country is important.
Still, you don’t take everything at face value. Brendan’s point might be a “there is no spoon” moment for you, though, if you’re ready for it.
The key point here is that no bit of language, either spoken or written, has a meaning that people haven’t given it. (For more information on where meaning comes from, read up on semiotics and semantics.) Furthermore, spoken language is primary. Written language is a technology employed by a society. Sure, it’s a special technology with special properties and all kinds of cultural power, but it’s not the language itself, nor is it inherently meaningful in itself. Chinese characters do not hold any meaning that people do not give them.
If all this sounds obvious, that’s great, but if you pay attention, you may notice that Chinese characters do sometimes seem to take on mystical qualities in Chinese culture.
I’m not trying to get overly philosophical or quibble over irrelevant details. The question for me is: what does this mean for the learner of Chinese? Here are a few points:
You don’t have to know the full origins of every character you learn. Sure, they are sometimes helpful for memorization, and if that’s the case, great.
It’s worth noting how many non-language-oriented native speakers, fully fluent and literate, have no interest in character origins, and have forgotten most of what they once knew about that stuff. And yet they are still fully fluent and literate in Chinese.
Since character meanings are neither inherent nor absolute, it’s not bad to sometimes make up your own little stories to help you remember characters. The key is consistency (so as not to confuse yourself), not factual accuracy.
Still, because characters are such an important part of Chinese culture, it’s not a good idea to make up your own stories that run counter to the standard ones that virtually every Chinese person knows, like the meanings of the most basic pictographic (人, 日, 木, etc.) or the simple or compound ideographic (上, 明, 好, etc.) ones. For the more complicated ones that most native speakers couldn’t explain, your own story mnemonics are safe to use.
This is a complicated issue with tons of cultural baggage, I realize. I’m happy to discuss in the comments!
It’s hard to succinctly explain what I mean by this title, because “character structure” and “character composition” are pretty much always used to mean “the character components that make up a character” (or, to use the more outdated term, “radicals”). But the character components would be the content. The limited number of spatial configurations in which those components routinely combine are the “character structure patterns” I’m talking about in this post.
Take a look at this:
If that’s not clear enough, let me break it down for you.
First of all, these “structural patterns” of Chinese characters are referred to as “Ideographic Description Characters” in the IT world, and each one actually has its own Unicode character! So you can copy and paste them just like other text (provided you have Unicode support), and even Google them. (Pro tip: Baidu them. Baidu Baike (Baidu’s Wikipedia) has lots of examples of each type.)
Here are those 12 Unicode characters:
⿰, ⿱, ⿲, ⿳, ⿴, ⿵, ⿶, ⿷, ⿸, ⿹, ⿺, ⿻
The patterns ⿰ and ⿱ (and sometimes a combination of those two, one embedded in the other) make up the most characters. Here are some simple examples of characters that use the more common structural patterns:
My advice is:
If you’re learning characters, learn these patterns. There aren’t that many, and they’re useful. It’s also good to dispel the notion that character components can be combined in an infinite number of ways. It’s a lot to absorb, for sure, but it’s not an infinite number of options you’re dealing with.
If you’re teaching characters, teach these patterns (or at least point them out) as you teach the character components. Everyone teaches components, but it’s nice to add a little structure to the teaching of structure. Confirm the growing, amorphous familiarity your students are acquiring, and give it a definite form.
If you’re building a website or app, include these patterns. It’s not going to be useful to look up characters in this way, but if done right, it could be a great way to explore a character set, and self-directed exploration is one of the best ways to learn.
It’s a lot of work to create a new font in Chinese. Instead of English’s 26 capital letters, 26 lower-case letters, 10 numbers and a smattering of symbols, you have literally thousands of Chinese characters you need for even a basic font. But if you just need a special font for a logo or a book cover, it makes sense to put the design work into just the Chinese characters you need. And if you look at enough Chinese book covers, you discover some cool custom fonts!
Here are some covers with custom Chinese fonts which I discovered on a recent trip to the book store (Chinese title in text below each photo):
I suppose it’s possible that not all of these are custom-designed characters (they might just be fonts I’m not aware of), but they’re still pretty cool!
I’m always on the lookout for interesting, creative use of Chinese characters, and that includes cool and weird Chinese fonts. Well, the characters at the bottom of this poster really caught me by surprise, because I didn’t even realize they were characters at first:
If you’re struggling to make anything out, note that English at the bottom right: “Crazy Circus City.” Big clue.
OK, here’s what it reads:
Literally, “crazy (疯狂) circus (马戏) city (城).”
(Don’t worry if you still find it hard to read even after you know what it says, and even if you know the characters. It’s really hard to read!)
For some reason the traditional form of 马 is used, though: 馬. My Chinese teacher back in college always told me that mixing simplified and traditional characters was a big no-no. It’s just too… crazy.
If the whole thing had been in traditional characters, it would have read:
A while back I wrote about What 80% Comprehension Feels Like, and I quoted the English examples used in Marcos Benevides’ excellent presentation which simulate 80% comprehension in English by including made-up English-like vocabulary words.
I’ve been thinking about that presentation a lot, both about the impact of such a demonstration, as well as about how it could be accomplished in Chinese. I ended up creating my own examples in Chinese. I’ll go ahead and share that first, and follow up with some discussion of the considerations involved.
(Before you attempt to read the following, please note that if your Chinese is not at least at an intermediate level, the following exercise is not going to work. Like its English-language counterpart, these examples are most effective with native speakers.)
Here is 98% comprehension:
Here is 95% comprehension:
Here is 80% comprehension:
The tricky thing about reading Chinese is that it’s not just a matter of vocabulary and grammar; there’s an issue not present in English: the issue of Chinese characters. When a learner reads a difficult Chinese text, all three of these components tend to play a part in the difficulty: vocabulary, grammar, and characters.
But for the example to work for both learners and native speakers alike, there needs to be a way to guarantee that parts of the text were incomprehensible, as accomplished with made-up words in English. How can one do this in Chinese?
How I did it
First of all, to maximize the chances that the “intelligible” parts of the Chinese sample text are also readable by learners, I used as simple a text as I could: a Level 1 Mandarin Companion graded reader. For these examples, it was The Secret Garden.
Then, I had to be sure I chose the more difficult content words to swap out, and that I got all instances of them in each sample. Obviously, I had to count the words to make sure I got the desired percentage right. But equally important, to make my samples representative of real-life 98%, 95%, and 80% comprehension experiences, the words chosen should “cloud” reading comprehension to the appropriate degree, no more, no less.
But here’s the tricky part: how to represent characters the reader doesn’t know. The obvious way would be to create my own characters that don’t really exist. I enjoy doing this, but it’s time consuming, and to make it look truly credible it would have to not stand out at all when mixed in with the other characters. Too much work.
So I turned to the Unihan database of Chinese characters. Over the years, more and more obscure characters have been added to this set of characters, and I found a list of the most recent additions. (Most recently added should mean most obscure, but I chose Extension D from this page because it was both recent and a small download.)
A quick check confirmed that these characters were indeed obscure, but many of them didn’t look like simplified Chinese characters, or were just too weird, so I had to choose carefully. After making my choices, I also had to check to make sure that educated Chinese adults didn’t recognize the characters (guessing doesn’t count).
After that, I selectively swapped out characters in the samples. (My 80% comprehension text sample is the shortest, because I was running out of “good” obscure characters, and I didn’t want to have to find more!)
One interesting side effect of using such obscure characters in my texts was that most software couldn’t render them. Whatever fonts they used just didn’t include those bizarre characters. Only Wenlin, with its custom font designed to render all kinds of obscure characters, could display them all. So I had to do screenshots of Wenlin’s interface.
How to use this
I used these passages as part of a presentation on extensive reading at LanguageCon in September. I got the effect I wanted: Chinese members of the audience giggled (embarrassedly?) at the characters they didn’t know, especially when they got to the 80% comprehension example.
Chinese learners smiled wryly: there wasn’t much amusing about a fake recreation of the challenge they face on a daily basis, trying to read Chinese.
More than anything, I hoped that the Chinese audience could empathize with the learners of Chinese. Most Chinese people never know what it feels like to have to learn so many foreign characters as a part of a foreign language learning experience. Through these examples, though, they can get an inkling.
Actually, maybe they were chuckling in relief… at least they’ve got that challenge behind them.
The AllSet Learning blog also has a similar Chinese language article on this topic: 80%没有你想的那么多.
This image has been on posters all around Shanghai’s Jing’an District this past summer:
Each character is related to traffic:
Actually, each character is merely “decorated” in the traffic theme. It’s not true “characterplay” in the sense that characters are constructed or represented in some non-standard way for artistic effect. I’ll take it, though!