Opentopia Directory Encyclopedia Tools

Cangjie method

Encyclopedia : C : CA : CAN : Cangjie method


The Cangjie method (Traditional Chinese: ; Simplified Chinese: }}}), also spelt Changjei method, presumably a spelling mistake of some early Chinese system—is a system by which Chinese characters may be entered into the computer. Invented in 1976 by Chu Bong-Foo (Traditional Chinese: ; pinyin: ), the method is named after Cangjie, the man historically attributed with the invention of the first writing system of China, and the name is given by then Defence Minister Chiang Wei-kuo of the Republic of China. Although the input method was initially based upon Traditional Chinese characters, it has since been revamped such that interoperability between Cangjie and the Simplified Chinese character set was made possible.

Sometimes, for example in filenames, the name Cangjie is abbreviated as cj.

Unlike pinyin, Cangjie is based on the graphological aspect of the characters wherein each basic, graphical unit is represented by a basic character component, of which there are 24 in all, each mapped to a particular letter key on a standard QWERTY keyboard. An additional "difficult character" function is mapped to the X key. Within the keystroke-to-character representations, there also exist four subsections of characters: the Philosophical Set (corresponding to the letters 'A' to 'G' and representing the elements), the Strokes Set (corresponding to the letters 'H' to 'N' and representing the brief and subtle strokes), the Body-Related Set (corresponding to the letters 'O' to 'R' and representing various parts of the human anatomy), and the Shapes Set (corresponding to the letters 'S' to 'Y' and representing complex and encompassing character forms).

The basic character components in Cangjie are usually called "radicals"; nevertheless, Cangjie decomposition is not based on traditional Kangxi radicals, nor is it based on standard stroke order; it is in fact a simple geometric decomposition.

Cangjie is one of the very few input methods that can be found on most modern personal computers without the user having to download or install any additional software. Cangjie's widespread availability must be credited to Mr. Chu, who has allowed the public to use his invention freely and free of charge.

Overview of the input method

The keys and \"radicals\"

The basic character components in Cangjie are called "radicals" 字根 or "letters" 字母. There are 24 radicals but 26 keys; each of the 24 radicals (the basic shape 基本字形) are associated with one or more auxiliary shapes 輔助字形. The names of the 24 radicals, as well as their associated auxiliary shapes, are mnemonic:
GroupKeyNamePrimary meaning
Philosophical group A日 sun
B月 moon
C金 Venus (gold)
D木 Jupiter (wood)
E水 Mercury (water)
F火 Mars (fire)
G土 Saturn (earth)
Stroke group H竹 bamboothe slant and short slant, the Kangxi radical
I戈 weaponthe dot
J十 tenthe cross shape
K大 bigthe X shape
L中 centrethe vertical stroke
M一 onethe horizontal stroke
N弓 bowthe crossbow and the hook
Body parts group O人 personthe dismemberment, the Kangxi radical 人
P心 heartthe Kangxi radical 心
Q手 handthe Kangxi radical 手
R口 mouththe Kangxi radical 口
Character shapes group S尸 corpsethree-sided enclosure with an opening on the side
T廿 twentytwo vertical strokes connected by a horizontal stroke; the Kangxi radical 艸 when written as 艹 (whether the horizontal stroke is connected or broken)
U山 mountainthree-sided enclosure with an opening on the top
V女 womana hook to the right, a V shape
W田 fieldfour-sided enclosure
Y卜 fortune tellingthe 卜 shape and rotated forms
Collision/Difficult key*X重/難 collision/difficult(1) disambiguation of Cangjie code decomposition collisions, (2) code for a "difficult-to-decompose" part
Special character key*Z(See note)auxiliary code used for entering special characters (no meaning of its own)

*Note: Some variants use Z as a collision key instead of X, in those systems Z has the name 重 and X has the name 難; but the use of Z as a collision key is neither in the original Cangjie nor used in the current mainstream implementations; in some other variants, Z may have the name 造 (user-defined) or some other names

The auxiliary shapes of each Cangjie radical have changed slightly between different versions of the Cangjie method; this is one reason why different versions of the Cangjie method are not completely compatible.

Keyboard layout

A typical keyboard layout for Cangjie method, which is based on United States keyboard layout. Note the non-standard use of Z as the collision key.
A typical keyboard layout for Cangjie method, which is based on United States keyboard layout. Note the non-standard use of Z as the collision key.

The basic rules

The typist must be familiar with several decomposition rules 拆字規則 that defines how to analyse a character to arrive at a Cangjie code. The rules are subject to various principles:

The short list of exceptions

Some forms are always decomposed in the same way, whether the rules say they should be decomposed this way or not. Fortunately, the number of exceptions are few:
Form Fixed decomposition
Version 2 Version 3 Version 5
門 (door) 日 弓 (AN) 日 弓 (AN) 日 弓 (AN)
目 (eye) 月 山 (BU) 月 山 (BU)
鬼 (ghost) 竹 戈 (HI) 竹 戈 (HI)
几 (small table) 竹 山 (HU) 竹 弓 (HN)
贏 minus 月 卜 弓 (YN)
卜 心 (YP) 卜 心 (YP) 卜 心 (YP)
亡 on top of 口 卜 口 (YR) 卜 口 (YR)
隹 (fowl) 人 土 (OG) 人 土 (OG) 人 土 (OG)
人 山 (OU) 人 弓 (ON)
畿 minus the 田 女 戈 (VI) 女 戈 (VI) 女 戈 (VI)
鬥 (compete) 中 弓 (LN) 中 弓 (LN) 中 弓 (LN)
 (city) 弓 中 (NL) 弓 中 (NL) 弓 中 (NL)

Examples

Early Cangjie system

In the beginning, the Cangjie input method was not a way to produce a character in any character set. It was, instead, an integrated system consisting of the Cangjie input rules and a Cangjie controller board. The controller board contains character generator firmware, which dynamically generates Chinese characters from Cangjie codes when characters are output, using the hi-res graphics mode of an Apple II computer. In the preface of the Cangjie user's manual, Mr. Chu wrote in 1982
[Translation]
In terms of output: The output and input, in fact, [form] an integrated whole; there is no reason that [they be] dogmatically separated into two different facilities.… This is in fact necessary.…
In this early system, when the user types "yk " (for example) to get the Chinese character 文, the Cangjie codes does not get converted to any character encoding; the actual string "yk " is stored. In a very real sense, the Cangjie code of each character (string of 1 to 5 lowercase letters plus a space) was the encoding of that particular character.

A particular interesting "feature" of this early system is that if you send random lowercase words to the character generator, it will attempt to construct Chinese characters according to the Cangjie decomposition rules, sometimes causing strange, unknown characters to appear. This unusual feature, "automatic generation of characters", is actually described in the manual and is responsible for producing more than 10,000 of the about 15,000 characters that the system can handle. The name Cangjie, evocative of creation of new characters, was actually very apt for this early version of Cangjie.

The presence of the integrated character generator also explains the historical necessity of existence of the "X" key as used for disambiguation of decomposition collisions: because characters are "chosen" when the codes are output, every character that can be displayed must in fact have one and only one Cangjie decomposition. It would not make sense—nor would it be practical—for the system to provide a choice of candidate characters when some random text file is displayed; the user would not know which of the candidates are correct.

Issues

Cangjie was designed to be an easy-to-use system to help promote the use of Chinese computing; nevertheless, many users find Cangjie to be a difficult method. Many of the perceived difficulties arise from poor instruction

Perceived difficulties

Enough practice, however, can overcome the above problems. A typist with sufficient practice in Cangjie touch types, much like a typist that works on the English language; it is entirely possible for a touch typist to type at 25 words (Chinese characters) per minute or better in Cangjie, yet have difficulty remembering the list of auxiliary shapes or even the decomposition rules. Experienced Cangjie typists can reportedly attain a typing speed between 60 wpm and over 200 wpm.

Cangjie, however, also have some "real" problems:

Actual difficulties

In some situations it cannot be used at all. Cangjie uses all 26 keys in an English (United States) keyboard, it cannot be used to input Chinese on cell phones. For cell phones, the Q9 method is the current norm because it is designed specifically for use on numeric keypads.

Versions of Cangjie

The Cangjie input method is commonly said to have gone through 5 generations (commonly referred to as “versions” in English), each of which slightly incompatible with the other. Currently, version 3 (第三代倉頡) is the most common, being the version of Cangjie supported natively by Microsoft Windows. Version 5 (第五代倉頡), supported by the Free Cangjie IME and previously the only Cangjie supported by SCIM, is a significant minority.

The early Cangjie system supported by the Zero One card on the Apple II was Version 2; Version 1 had never been released.

The Cangjie input method supported on the Mac OS is somewhat like Version 3 and somewhat like Version 5.

Besides the original Cangjie input method, Version 5 was also created directly by Mr Chu, the inventor. Originally slated for release as Version 6, Mr Chu had hoped that the release of Version 5 would bring an end to the “more than ten versions of Cangjie input method” (slightly incompatible versions created by different vendors).

Variants of Cangjie

Most modern implementations of Cangjie IME's provide various convenience features: Besides the wildcard key, many of the above features are very convenient for casual users but unsuitable for touch typists because they make the Cangjie IME unpredictable.

There are also various attempts to "simplify" Cangjie one way or another:

See also

References

External links

 


From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.

Search Titles
0123456789
ABCDEFGHIJ
KLMNOPQRST
UVWXYZ?

E-mail this article to:

Personal Message: