[go: up one dir, main page]

CN1331441A - Chinese-character input system 'three-digit code' - Google Patents

Chinese-character input system 'three-digit code' Download PDF

Info

Publication number
CN1331441A
CN1331441A CN 00104420 CN00104420A CN1331441A CN 1331441 A CN1331441 A CN 1331441A CN 00104420 CN00104420 CN 00104420 CN 00104420 A CN00104420 A CN 00104420A CN 1331441 A CN1331441 A CN 1331441A
Authority
CN
China
Prior art keywords
radical
word
key
code
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 00104420
Other languages
Chinese (zh)
Inventor
王以成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 00104420 priority Critical patent/CN1331441A/en
Publication of CN1331441A publication Critical patent/CN1331441A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A Chinese-charactor "three-digit code" input system for computer technically features that the high-purity shape code is used, the less normalized roots are used, all roots are scientifically assigned to 26 letter keys on keyboard, the length of code string is 3, the single word is used as the basis of encoding, and a open system function is set up.

Description

Chinese-character input system ' three-digit code '
" three bit codes " Chinese character input system is to utilize keyboard that the into input code coding techniques of computing machine imported in Chinese character.Purpose is to be easier to grasp and use for the computer user provides, and input speed is Chinese character input method more efficiently.
One, the basic stroke of Chinese character
A Chinese character, no matter how complicated its structure is, all is made up of stroke one by one.The stroke of right understanding Chinese characters for the key bit pattern regularity of distribution of grasping radical, accurately uses radical to give Chinese characters disassembled coding, and significance is all arranged.
The basic stroke of Chinese character has five kinds:
(1) horizontal comprising " one " and distortion thereof
Figure A0010442000031
(carry or choose).As " lonely,
Figure A0010442000032
, tricky " in
Figure A0010442000033
But do not comprise " carrying a little " of combining with " point ".In " Bing, Rui "
Figure A0010442000034
(2) heavily fortified point comprises " Shu " and distortion “ 亅 thereof ".As De “ 亅 in " Dao, Rolling, " ".
(3) cast aside comprise " Pie ",
Figure A0010442000035
(the flat left-falling stroke), " Pie " (perpendicular left-falling stroke).As " Pie " in " matter, weight, the moon ".
(4) right-falling stroke comprises " ".
Figure A0010442000036
(the flat right-falling stroke) and distortion---the point of pressing down." point " has " left point ", " long point " etc. again.As: the point in " do, only ".
(5) folding comprises that all stroke lines continuously and the various strokes (" 亅 " exception) of obvious turnover are arranged.Concrete form is more.
What should point out is, these five kinds of basic strokes have regulation enumerating on the order, promptly " horizontal stroke, two perpendicular, three cast aside, four press down, five foldings ".
Two, radical
Radical be during Chinese character pattern decomposes the one-level word-building unit, be stroke combination body with relative independentability.Briefly, Chinese character is combined by radical, therefore can split into one or several radical to a Chinese character.This is the basic premise to encode Chinese characters for computer.
Radical is similar to " radical ", but is different from " radical ".Adopt the radical notion, avoided " radical ", " radicals by which characters are arranged in traditional Chinese dictionaries " limitation on using.Because split Chinese character, must not consider that it is watch sound, table shape or table justice, must not consider its primary and secondary effect and locus in the structure word yet, and only consider its sequential write and syntagmatic in the structure word with radical.
But radical is not arbitrarily confirmable.Determine that radical should have necessary science, and can be accepted by popular.166 radicals that native system is selected, the overwhelming majority is the radical and the Chinese character of national regulation, is familiar with by popular.
Three, the key bit pattern regularity of distribution of radical
(1) 26 letter keys of computer keyboard " English alphabet input keypad " are divided into five districts, each key in each district is located respectively again, and makes the position division of keyboard meet keyboard fingering standard.Sole exception be that the 5th " B " key in first district operated by left index finger.(seeing " always scheme radical key position ")
" position " is the foundation of the radical key position regularity of distribution.
(2) all radicals are divided into five classes by first stroke of a Chinese character stroke, by " horizontal stroke, two perpendicular, three cast aside, four press down, five foldings " order be arranged in one to five district respectively.And make and respectively distinguish the arrangement that radical first stroke of a Chinese character stroke moves towards Tong Ge district " position " and move towards consistent.Its exception has: " ninth of the ten Heavenly Stems,
Figure A0010442000041
Hand, an ancient type of spoon, Xin, river, pig, Yin " nine radicals.They are arranged on the present position, are because their other radicals with the place key on body are more approaching, thereby easier associative memory.
(3) do root by the radical of basic stroke simple combination and most of non-individual character and be arranged in first to five of each district by the complex situations of stroke number or structure respectively.Be arranged to five of the 4th district respectively as " Dian, Bing, Rui, Http, Chuo ".
(4) every key has three major word roots (being the leftmost radical of every capable radical in every key), and they all are to have very strong representativeness on physique structure, and the higher radical of usage frequency.The key position distributed combination of every district major word root also all has certain rules.Come after the major word root with other radicals of major word root architecture feature similarity, be easy to produce associative memory.We can say when memory " word root keyboard ", distribute as long as remember the key position of major word root, even if be crowned with success.The concrete combination distribution situation of all radicals is seen " Figure of description "---" always scheme radical key position ".Among the figure " position " of the every key of numeral of each key face lower left: " district " at this key place of numeral on " ten " wherein, " position " of this key of numeral on " individual position "; In the three composing roots, the radical of every row left side beginning is the major word root on each key; The bottom-right word of every key is the one-level brevity code word on this key.
(5) way of employing radical merger has been done clear and definite explanation to the body differentiation relation of known Chinese character of people and radical; Simultaneously, very approaching with some basic element of character a small amount of physique structure, the radical that usage frequency is low is again also used the way of radical merger, returns as in corresponding basic element of character.So that memory and accurate utilization radical.(seeing " radical merger table ")
At last, what specify is, so-called " major word root ", " basic element of character ", " incorporating radical into ", and the classification of just doing processing in order to be memonic, they are the same as the practical function of radical with the status.
Radical merger table
Four, the disassembly principle of individual character
(1) normative stroke order is exactly Chinese character to be split coding according to the standard of Chinese-character writing order.As:
Husband---two people night---Tou Ren Fan Dian
Justice---Dian Qe does---power
The Chinese character that several investing mechanisms are arranged, the stroke writing intersection of " encirclement radical " and " besieged radical " is carried out.In order to look after the integrality of radical, have only the sacrifice normative stroke order, and do suitable accommodation, stipulate which radical which radical elder generation first stroke of a Chinese character just splits earlier.As:
Can---the fourth mouth is solid---Kou Shikou
Minister---Contraband Shu
Figure A0010442000062
Shu prisoner---mouthful people
(2) getting big preferentially is exactly when splitting Chinese character, all selects the maximum radical of stroke to come it is split, with the radical minimum number that guarantees to split out at every turn.As:
Generation---twenty
Figure A0010442000063
Decline---one
Figure A0010442000065
Ten thousand — — Myeon Card---one or four is little
Come---one meter beans---a bite
First--- Pie order group---Rui Pie
Figure A0010442000068
(3) getting earlier preferentially is exactly that some stroke of working as in the Chinese character can constitute a radical with the stroke of front, in the time of constituting a radical with the stroke of back again, should select to constitute with the stroke of front the method for splitting of radical.As:
Hundred — — Myeon days are tight---an industry Pie
Zhang---Na is of a specified duration---Network
Figure A0010442000069
With---Jiong two Shu narrow-necked earthen jars--- ten Qian
(4) taking into account intuitively is exactly when splitting Chinese character, the visual sense during for the integrality of looking after radical and fractionation, and three principles in front are done suitable accommodation.
1, to the accommodation of " normative stroke order ".As:
The heart Pie of group's---mouthful ten Pie must---
Witch---workman people is refreshing---is big
Figure A00104420000610
Cao's---Lv says day and takes advantage of---standing grain Zhuang an ancient type of spoon
News---Yan second ten is ugly---
Figure A00104420000611
Soil
△ about the suitable explanation of " dagger-axe ", " shooting a retrievable arrow " in order to guarantee radical consistance in use and take into account intuitive that when " dagger-axe " or " shooting a retrievable arrow " appearred in regulation Chinese character the right, no matter order of writing strokes how, is all regarded it as an independently radical, and the back split.As:
Become---Pie Dagger-axe force---one only shoots a retrievable arrow
Or---a mouthful dagger-axe is contained---Pie The dagger-axe ware
2, to the accommodation of " get earlier preferential ".As:
Ask---a water Dian does not have---In-particular
---Chi one fourth subtracts---Bing Pie a bite dagger-axe OK
△ works as the several sections that a word both may be split into " linking to each other ", may be split into the several sections of " intersecting " again, should feel that " linking to each other " is more directly perceived than " intersecting ", and therefore regulation splits out it by the method for splitting of " linking to each other ", and this cries " can connect and not hand over ".As:
Do---10 open---European-allies
When " one " among " " (horizontal people) intersected with other strokes, on directly perceived, it no longer was " " to △ about the suitable explanation of " ", and " " wherein should become radical with the stroke combination of its back.As:
Pie two is little for system---Pie two Jiong Shu Zhu Dao---
Giving birth to-one Pie loses---Pie two people
△, should guarantee the integrality of back radical, and not gather forward when " one " in " " can constitute a radical with the stroke of back about the suitable explanation of " ".As:
Blue — — Ha three honor — — Ha are very little the tenth of the twelve Earthly Branches
Flat---Yi Ha 10 — — Ha two Shu
△ is identical with the fractionation at " end " for fear of " not " about the fractionation at " not ", " end " explanation, and the while also meets, the coinage original idea at " end ", and the split result of this two word is:
---two little ends---wood not
Five, the coding rule of individual character
(1) the all-key string length of all individual characters is " three ".That is to say,, just can import a Chinese character as long as hit key three times.
(2) coding of radical is got this radical place key first for its coding; By the described disassembly principle of last joint, split then, get two radicals of its head and the tail second and third position for its coding with other only little radicals than this radical.Not enough trigram, add " identification code ".As:
Stone---Shi Myeon mouth---HJD gas---gas second-WWV
Just---just Dian-LLY goes into---going into Pie ---WTY
(3) after the coding of non-radical Chinese character splits by the described disassembly principle of last joint, get its first, second and the end radical be the coding of this word.Not enough trigram, add " identification code ".As:
Compile---Si family Lv---XYN is large---Shi Myeon shellfish---HJA
Respectful---Ji Shu eight---CGQ unloads--- ten Jie---WLZ
Five---Shu---HGH resembles--- Mouthful ---QDW
△ about " debate, pigtail, distinguish, lobe " and these two groups of words of coding key of " device, clamor " as pressing the uniform rules coding, it is identical then to encode.For avoiding repeated code, the spy stipulates to get for second yard of these several words can be with the 3rd root coding of its differentiation.As:
Lobe---upright Ten---PRL device-mouth dog mouth---DMD
Identification code is formed with " font " is information combined by " end pen " information.Determine " district " of identification code exactly by the end stroke of this word.Determine " position " of identification code again by the font of this word.
Native system is divided into three types with Hanzi structure.
First type: after left and right sides structure package code had been beaten, it was just passable only to add the end stroke radical key of beating this word.As:
Old---Shu day---GFH hundred million---Ren second---RVV
Second type: after the up-down structure package code had been beaten, the two-position key that adds the end stroke location of beating this word was just passable.As:
Skill---Lv second---NVC dawn---day one---FHJ
Many---sunset at sunset---WWU has--- Month---JEJ
Logical sequence---people's an ancient type of spoon---WBC sky---cave worker---OMJ
The 3rd type: after various investing mechanisms and independent body structure package code had been beaten, the 3rd key that adds the end stroke location of beating this word was just passable.As:
Bag---ㄅ---EXX mistake---very little Chuo---BPI in the sixth of the twelve Earthly Branches
Can---fourth mouth---JDK be stranded---mouthful wood---DLI
Here be noted that what is called " end pen ", be meant the end pen of second radical.The minority word is arranged, when splitting, " normative stroke order " done accommodation, the end pen of second radical is not the end pen of the correct sequential write of whole word.In order to guarantee the continuity of disassembled coding.Special this regulation of doing.
Independent body structure (single character) comprises following several situation:
1. single is made one's cross and radical, as: " one " etc.
2. the words that intersect of two radical strokes, as:
Really---day wood hits---two mountain husbands---two people
3. most of by the single word that the same many strokes radical of root constitutes of making one's cross, obviously separately be not left and right sides structure and up-down structure as two radicals.All be considered as the independent body structure.As:
In---10 times---fore-telling is individual---people Shu
By " Pie, , Dian " three singles word that root constitutes with another radical of making one's cross, no matter two radicals are any structural relation, all are considered as the independent body structure.As:
Main---the upright Pie of Dian king's art---wooden Dian produces---
Rise---Pie European-allies---Pie Si chi---corpse
Too---big Dian lacks---township of Pie--- Pie
For the word that is made of stroke radical more than two on a small quantity, two radical strokes link to each other and do not have and intersect, and are not considered as three type-words, and are included into two type-words.As: " chastity, and, Bian " etc.
Seven, the repeated code of individual character
Two or more word codes are identical, are repeated code.When importing the outer sign indicating number of a repeated code word.With the identical word of this word code, can all be presented in " presenting bank " by the frequency of utilization order.Word is on first position as required, and after " space bar ", this word will be shown on the cursor position; Word can be selected by the numerical key on the keyboard is corresponding on first later position as required.
Eight, brevity code word
In order to improve input speed and to reduce repeated code, some everyday characters except that being composed of all-key, also are composed of brevity code.The brevity code input method is as follows:
(1) 26 very high Chinese characters of one-level brevity code frequency of utilization according to its font style characteristic, correspondingly are arranged on 26 key positions, (seeing " always scheme radical key position ").Import these Chinese characters, this key need only be beaten, add dozen space bar again and just can finish input process.
(2) its coding of secondary brevity code is formed by preceding two yards in the all-key of this word.When importing these words, be totally lost to add after preceding two yards and play space bar and just finished input process.
In order to improve input speed, should remember and use brevity code as possible.
When △ mixed input at words, the all-key word still will add behind the trigram of being totally lost played space bar.
Nine, word coding rule
(1) the sign indicating number string length of all words is " four ".That is to say, need only hit the input that four times key just can be finished a word.
The coding of (two) two words is got the word coding of synthetic this speech of preceding two code characters of each word.
The coding of (three) three words is got the coding of synthetic this speech of first code character of preceding two yards and second and third two words of first word.
The word coding of (four) four and above number of words is got the coding of synthetic this speech of first code character of first, second and third and end word.
(5) the coding regulation of one-level brevity code word in word.One-level brevity code word constitutes speech with other words, when encoding, still must not split and uses brevity code.If will from this word, get two yards in accordance with regulations, then behind second yard usefulness " L " key "; " the key replacement.As: study---O; CV grows up---TVT;
(6) as the repeated code speech, its situation is the same with the repeated code word with disposal route when input for the repeated code speech.The options button of repeated code speech is " H, J, K, L " by one to four key that frequency order is followed successively by " space bar " and first district.
Ten, about dictionary
One of advantage of native system is input as the master with word exactly.In order to realize this goal, set up a dictionary that 80,000 left and right sides words are arranged, and the repetition rate of coding is very low.Its dictionary content comprises:
1. almost whole words of taking in " modern Chinese dictionary ".
2. almost whole Chinese idioms of income in " Chinese and set phrase include file " (Commercial Press's publication).
3. related whole China and foreign countries place name in " junior middle school's geographical map volume ".
4. China's administrative realm name at county level and above.
5. do not take in the normal words and phrases language in the dictionary.
The word of this dictionary income is based on two words, three words.In order to reduce repeated code, those can generally not taken in phrase, phrase even the specific term of two words, the decomposition of three words.Should be principle with " short speech is preferential " during input, unless the coding that you have confirmed that this is arranged " long word language ".
11, the open function of dictionary
Chinese vocabulary is extremely abundant, because the restriction of space encoder and our carelessness, all income is advanced our dictionary.Word that those are professional, region are stronger and emerging word, frequency of utilization is bigger in certain customers.For the comprehensive needs that satisfy the user, the native system spy is provided with open function.The user can revise original word coding method according to the needs of oneself, increases neologisms and coding to dictionary, adjusts the frequency order and the deletion repeated code speech of repeated code speech.
Compare with several input methods relatively more commonly used now, this Chinese character input method has the following advantages:
(1) the unisonance sign indicating number is compared, and the advantage of font code is that the Chinese character pattern whole nation is unified, sees the word disassembled coding, easy operating; And the sound code plan can not had the user of the fine grasp Chinese phonetic alphabet and mandarin to cause very big obstacle.And it is lower than sound code weight code check.
(2) compare with other several font codes relatively more commonly used, the advantage of this programme has:
1, selects for use radical to strictly regulate, be easy to be user's approval and acceptance.
2, the bigger radical of code requirement as far as possible add that each single character code only gets three radicals, so disassembled coding is simple and clear more quick.
3, select for use radical less and standard adds that the key position distributed combination of radical is scientific and reasonable, be easier to memory.
4, be encoded to the basis with single-character splitting, be input as main body with the word coding, input speed is more quick.
H J K L B N M G F D S A T R E W Q Y U I O P V C X ZH J K L B N M G F D S A T R E W Q Y U I O P V C X Z

Claims (2)

1, " three bit codes " Chinese character input system is characterised in that: adopt pure font code scheme; The key position distributed combination of radical is scientific and reasonable; The all-key sign indicating number string of each individual character has only three; Be encoded to the basis with single-character splitting, be input as main body with the word coding; The all-key sign indicating number string of each word has only four.
2, according to claim 1, it is characterized in that: be provided with the open system function.Make the user change original word coding method according to the needs of oneself; Adjust the frequency order of repeated code word, speech, deletion repeated code speech; In dictionary, increase new words and coding thereof.
CN 00104420 2000-06-24 2000-06-24 Chinese-character input system 'three-digit code' Pending CN1331441A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 00104420 CN1331441A (en) 2000-06-24 2000-06-24 Chinese-character input system 'three-digit code'

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 00104420 CN1331441A (en) 2000-06-24 2000-06-24 Chinese-character input system 'three-digit code'

Publications (1)

Publication Number Publication Date
CN1331441A true CN1331441A (en) 2002-01-16

Family

ID=4577307

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 00104420 Pending CN1331441A (en) 2000-06-24 2000-06-24 Chinese-character input system 'three-digit code'

Country Status (1)

Country Link
CN (1) CN1331441A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100407114C (en) * 2006-04-13 2008-07-30 杨洪旭 Chinese characters information processing method
CN109495486A (en) * 2018-11-30 2019-03-19 成都知道创宇信息技术有限公司 A method of the single page Web application integration CAS based on JWT

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100407114C (en) * 2006-04-13 2008-07-30 杨洪旭 Chinese characters information processing method
CN109495486A (en) * 2018-11-30 2019-03-19 成都知道创宇信息技术有限公司 A method of the single page Web application integration CAS based on JWT
CN109495486B (en) * 2018-11-30 2020-12-22 成都知道创宇信息技术有限公司 A method of integrating CAS for single-page web application based on JWT

Similar Documents

Publication Publication Date Title
CN1023916C (en) Simplified and Traditional Wubi Radical Chinese Character Input System
CN101038508A (en) GB phoneticize input method
CN1141633C (en) 24-radical sorting encode method for Chinese characters and its keyboard
CN1119739C (en) Chinese-character 5-stroke digital input method with keyboard of computer and its keyboard
CN1331441A (en) Chinese-character input system 'three-digit code'
CN102511021A (en) Number-order-code-element keyboard and information input method thereof
CN104123011B (en) Chinese character and Chinese phonetic alphabet coding input method
CN1081004A (en) Chinese-character digital encoding method based on structural strokes order
CN1570817A (en) Combined type pronunciation-form-meaning Chinese character coding input method
CN1034245C (en) Burmese characters four-code intelligent coding method and keyboard thereof
CN1150441C (en) Chinese-character 'shape-writing order code' input system and its keyboard
CN1256644C (en) Chinese-character radical input method
CN1052314C (en) Computer keyboard and input method of Chinese character two-dimensional numerals
CN1103181A (en) Multi-key pressing high-speed Chinese character input method and keyboard
CN1020386C (en) Structure strokes four-figure number coding method and keyboard
CN1553305A (en) Phonetical and shape four-digit code Chinese inputting method
CN1203388C (en) Double-stroke six-code Chinese character input method
CN100339808C (en) U Code Chinese character inputting method
CN1031812C (en) Coding method of Chinese characters and phrases and keyboard thereof
CN1201220C (en) Efficient key code input method in computer
CN1058342C (en) Chinese character byte codes and its keyboard of using the same
CN1095502A (en) Character spectrum Chinese character coding method (Yan Di and Huang Di, two legendary rulers of remote antiquity's sign indicating number) and keyboard thereof
CN1347023A (en) Intelligent two-stroke handwriting input system
CN1077303C (en) Chinese Eight Diagrams classification keyboard and coding
CN1337616A (en) Fast-easy code Chinese character input method and keyboard

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
PB01 Publication
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication