Difference between revisions of "Language/Multiple-languages/Culture/Introduction-to-Han-script"
| Line 173: | Line 173: | ||
In practice, there is also a rare category: phonetic compounds [會音字]. | In practice, there is also a rare category: phonetic compounds [會音字]. | ||
There may be multiple variants for a character. | There may be multiple variants for a character. A form is chosen as the standard one in an orthography. An example is 龜 `tortoise`: | ||
https://upload.wikimedia.org/wikipedia/commons/thumb/3/33/Variants_of_Kangxi_radical_213_%27turtle%27.svg/960px-Variants_of_Kangxi_radical_213_%27turtle%27.svg.png | https://upload.wikimedia.org/wikipedia/commons/thumb/3/33/Variants_of_Kangxi_radical_213_%27turtle%27.svg/960px-Variants_of_Kangxi_radical_213_%27turtle%27.svg.png | ||
Revision as of 21:14, 24 April 2026
| introduction | variants | parts | Restructure |
| Hanisation | texts | idioms | place names |
Hi, polyglots!
Every time you see Han characters, you may see them as random strokes and wonder how other people memorise them. Once you have payed attention for a while and learned about the basics of Han script, they will become crystal clear to you.
in progress
Evolution of Han script
There is an evolution: Egyptian → Proto-Sinaitic → Phoenician → … → most scripts used today. Han script has its own: oracle bone script [甲骨文], bronze script [金文] and other scripts → seal script [篆書] → clerical script [隸書] → regular script [楷書].
There are artistic scripts derived from the regular script:
- running script [行書] (a.k.a. semi-cursive script)
- grass script [草書] (a.k.a. cursive script)
There are also typefaces for printing:
- Song [宋體] / Ming [明體] (a.k.a. sans-serif)
- Imitation Song [仿宋體]
- Gothic [黑體] (a.k.a. sans)
Glyph forms of early scripts are highly diverse. After the unification of China by Qin dynasty, glyph forms other than the ones used in the State of Qin are abolished. In the modern time, character glyph forms are standarised differently in different sovereign states and dependent territories.
Simplifications of Han characters take place in different times. Systematic simplifications include the creation of the clerical script in the State of Qin and the Chinese Character Simplification Scheme in the People's Republic of China. In Japan, there is also a reform that simplified the characters. Some character forms have gone through multiple changes, for example, 云 originally means “cloud”, then it was used for “say” and 雲 is used in substitute; then in the PRC's simplification, its meaning of “cloud” is restored. If one puts more value on its cultural heritage, then the Traditional variants are more preferred; if one puts more value on its nature of tools, then the Simplified variants are preferred.
There are glyph forms considered more orthodoxy for preserving the original characteristics of the characters. Characters of these forms are called Inherited Characters [傳承字]. Open-source fonts include the following:
There are website for searching characters:
| script | image |
|---|---|
| oracle bone script [甲骨文] | https://commons.wikimedia.org/wiki/Category:Oracle_bone_script |
| bronze script [金文] | |
| seal script [篆書] | |
| clerical script [隸書] | |
| regular script [楷書] | |
| running script [行書] | https://commons.wikimedia.org/wiki/Category:Semi-cursive_script |
| grass script [草書] | https://commons.wikimedia.org/wiki/Category:Chinese_cursive_script |
| typeface | image |
|---|---|
| Song [宋體] / Ming [明體] | https://commons.wikimedia.org/wiki/Category:Ming_(typefaces) |
| Imitation Song [仿宋體] |
|
| Gothic [黑體] |
Creation and use of Han characters
It is mentioned that there are six categories of characters [六書] in the Rites of Zhōu [《周禮》] without explanation; later explanations are disputed. There are undisputed four categories of method to form characters and two disputed categories of method to use characters. The four undisputed categories:
| category of characters | explanation of the category | example | meaning of the example | explanation of the example |
|---|---|---|---|---|
| pictographs [象形字] | a character where concrete objects are depicted in pictorial form |
|
|
|
| simple ideographs [指事字] | a character where abstract ideas are depicted in pictorial form |
|
|
|
| compound ideographs [會意字] | a character consisting of two or more semantic components |
|
|
|
| phono-semantic compounds [形聲字] | a character consisting of a phonetic and a semantic components |
|
|
|
They can be found at Wiktionary Category:Han characters by formation type.
In practice, there is also a rare category: phonetic compounds [會音字].
There may be multiple variants for a character. A form is chosen as the standard one in an orthography. An example is 龜 `tortoise`:
A character may be used differently than its original use. In some cases, the meaning of a character is diverted and a new character is created to bear that original meaning; in some other cases, a character is used to represent a homophone intentionally. There are disputes over those cases on how to classify them.
Strokes of a character
The elementary components of characters are strokes [筆畫]. Basic strokes [基本筆畫] are
| stroke | modern name of stroke | writing direction |
|---|---|---|
| ㇐ | horizontal [橫] | |
| ㇑ | vertical [豎] | |
| ㇒ | throw [撇] | |
| ㇏ | press [捺] | |
| ㇔ | dot [點] | |
| ㇀ | raise [提] | |
| ㇚ | hook [鉤] | |
| ㇁ | curve [彎] |
| stroke | modern name of stroke | writing direction |
|---|---|---|
| ㇄ | slant [曲] | |
| ㇣ | circle [圈] |
A compound stroke [複合筆畫] is a stroke that is combined by multiple basic strokes. They can be defined differently by different approaches. For example:
The character 永 `eternal` has all eight traditional basic strokes, so it is often used to practice calligraphy.
Components of a character
A character is made of one or more components. Those components are made of one or more strokes.
Those components are written one by one from left to right, from top to bottom, in order to make the character look good.
The components used for indexing in a dictionary are called radicals [部首]. In some cases, a character may be assigned multiple radicals. Explaining Graphs and Analyzing Characters [《說文解字》] and Kāngxī Dictionary [《康熙字典》] published in c. 100 and 1716 are two of the most authoritative dictionaries, where 540 and 214 radicals are listed.
The compositions of a characters are the followings:
| composition | common radicals |
|---|---|
| ⿰ | (many) |
| ⿱ | (many) |
| ⿲ | (many) |
| ⿳ | (many) |
| ⿴ | 囗 |
| ⿵ | 冂、几、門、鬥 |
| ⿶ | 凵 |
| ⿷ | 匚、匸 |
| | |
| ⿸ | 厂、尸、手、广、戶、疒、羊、老 |
| ⿹ | 乙、勹、弋、气 |
| ⿺ | 大、尢、廴、⽑、⽖、⾛、瓜、辵、鬼、麥、鼠 |
| | 水 |
| ⿻ | 木 |
| | |
| | |
| 〾 | |
| |
There are also ways to pick up a type in printing quickly. After the popularisation of computers, shape-based input methods emerged. Those components of such input methods are called roots [字根].
| shape-based input method | time | layout |
|---|---|---|
| 四角 | 1925 | |
| 三角 | 1971 | |
| 倉頡 | 1976 | |
| 大易 | 1988 | |
| 嘸蝦米 | 1990 | |
| 行列 | 1992 | |
| 縱橫 | 1993 |
Readings of a character
In pre-modern times, readings of characters by the officials are called the elegant pronunciations [雅音]. They were de facto standard pronunciations.
For Chinese language, each character has phonetically and semantically identical readings; for non-Chinese languages, each character has two types of reading: one being phonetically similar to the Chinese reading, which is called the sound reading [音讀], the other being the semantically corresponding native word's reading, which is called the instructed reading [訓讀].
The ancient method to note readings is called reverse cutting [反切], where each character's reading is denoted by a character's initial consonant and the rest of another character's syllable. For example, the reading of 東 is denoted by the initial consonant of 德 and the rest of 紅:
It is similar to a dictionary's approach: each word is defined by other words.
Before the widespread of Romanisations, other writing systems emerged to record readings: Kana, Nüshu, Hangul. For Mandarin Chinese, Bopomofo was developed in 1910s.
To address a single character while differentiating it from its homophones, one may say its components first and then say the character. For example, “bow–long stretch” (弓長張) for 張.
Calligraphy
There are tens of thousands of characters in Han script, and even more glyphs, in Han script. The calligraphy of it is also developed.
There is a website for searching characters in calligraphy 集古作书法字典.
Set phrases
Set phrases [成語] are a kind of idiomatic expressions in Han script. They are mostly as short as four characters. There are
- Chinese set phrases on Let's Travel To China
- Japanese set phrases on IMABI
- Korean set phrases on KoreanLII.
For languages other than Chinese, it is also possible to build set phrases in the same manner.
Example of popularised English idioms in Chinese set phrases:
- tip of the iceberg 冰山一角 : iceberg + a + corner
- kill two birds with one stone 一石二鳥 : one + stone + two + birds
| introduction | variants | parts | Restructure |
| Hanisation | texts | idioms | place names |