CN1536768A - Compression method for 2-byte character data - Google Patents
Compression method for 2-byte character data Download PDFInfo
- Publication number
- CN1536768A CN1536768A CNA2003101242211A CN200310124221A CN1536768A CN 1536768 A CN1536768 A CN 1536768A CN A2003101242211 A CNA2003101242211 A CN A2003101242211A CN 200310124221 A CN200310124221 A CN 200310124221A CN 1536768 A CN1536768 A CN 1536768A
- Authority
- CN
- China
- Prior art keywords
- byte
- character
- data
- code word
- code
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/38—Transceivers, i.e. devices in which transmitter and receiver form a structural unit and in which at least one part is used for functions of transmitting and receiving
- H04B1/40—Circuits
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
本发明提供了一种在终端机的信息处理模块中,以2字节字符(朝鲜字符、汉语)为单位对信息进行压缩后再存储,从而可以减少存储空间的2字节字符数据的压缩方法。本发明的2字节字符数据的压缩方法的特征在于包括:根据频率数生成多个可压缩代码字,存储在基本词典表中,将登记的表示下一个代码字的变量初始化的步骤;识别输入的信息数据是否是2字节字符,并接收的输入步骤;比较输入的数据是否包含在该可压缩代码字中,当包含在该可压缩的代码字中时,从该词典表中经过映射过程搜索符合代码并输出,当词典中没有该符合代码时,将其登记在词典中的步骤;判断是否是数据的尾数,当数据没有输入完时,返回依次输入信息数据的输入步骤;以及当是数据的尾数时,进行清除过程的步骤,当编码该可压缩代码字得到的符合代码的位数比该可压缩代码字可以降低位的临界值小时,以log2(C1+1)-1位输出,当符合代码字比临界值大时,以log2(C1+1)位输出,该C1是当前被赋值的代码字数。
The present invention provides a method for compressing information in units of 2-byte characters (Korean characters, Chinese) before storing in an information processing module of a terminal, thereby reducing storage space for 2-byte character data. . The compression method of 2-byte character data of the present invention is characterized in that comprising: generate a plurality of compressible codewords according to the frequency number, store in the basic dictionary table, the step of the variable initialisation of the representation next codeword of registering; Identify input Whether the information data is a 2-byte character, and receive the input step; compare whether the input data is contained in the compressible code word, and when contained in the compressible code word, go through the mapping process from the dictionary table Searching for the matching code and outputting it, when there is no such matching code in the dictionary, registering it in the dictionary; judging whether it is the mantissa of the data, when the data has not been input, returning to the input step of inputting the information data in sequence; and when it is When the mantissa of the data, carry out the step of clearing process, when the number of digits of the conforming code obtained by encoding the compressible codeword is smaller than the critical value that the compressible codeword can reduce the bit, take log 2 (C1+1)-1 bit Output, when the matching code word is larger than the critical value, it is output with log 2 (C1+1) bits, where C1 is the number of code words currently assigned.
Description
| ????0~255 | ASCII (ASCII) |
| ????256~725 | Korea's character code (470 words) |
| ????726~1023 | 10 codings |
| ????1024~2047 | 11 codings |
| ????2048~4095 | 12 codings |
| Compressible code word | The code that is encoded | 10 systems |
| ????0 | ????000000000 | ????0 |
| ????1 | ????000000001 | ????1 |
| ????2 | ????000000010 | ????2 |
| ????. | ??????. | ????. |
| ????. | ??????. | ????. |
| ????273 | ????100010001 | ????273 |
| ????274 | ????1000100100 | ????548(274+274) |
| ????275 | ????1000100101 | ????549(274+275) |
| ????. | ??????. | ????. |
| ????. | ??????. | ????. |
| ????749 | ????1111111111 | ????1023(274+749) |
Claims (9)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020030021924 | 2003-04-08 | ||
| KR10-2003-0021924A KR100494876B1 (en) | 2003-04-08 | 2003-04-08 | Data compression method for multi-byte character language |
| KR10-2003-0021924 | 2003-04-08 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1536768A true CN1536768A (en) | 2004-10-13 |
| CN100474781C CN100474781C (en) | 2009-04-01 |
Family
ID=34374057
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB2003101242211A Expired - Fee Related CN100474781C (en) | 2003-04-08 | 2003-12-31 | Compression method of two-byte character data |
Country Status (2)
| Country | Link |
|---|---|
| KR (1) | KR100494876B1 (en) |
| CN (1) | CN100474781C (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101751451B (en) * | 2008-12-11 | 2012-04-25 | 高德软件有限公司 | A Chinese data compression and decompression method and related equipment |
| CN106354699A (en) * | 2015-07-13 | 2017-01-25 | 富士通株式会社 | Encoding computer program, encoding method, encoding apparatus, decoding computer program, decoding method, and decoding apparatus |
| CN106471743A (en) * | 2014-06-20 | 2017-03-01 | 甲骨文国际公司 | Encoding of plain ASCII data streams |
| CN104054316B (en) * | 2011-11-15 | 2017-04-12 | 思杰系统有限公司 | Systems and methods for load balancing SMS centers and establishing virtual private networks |
| CN112416315A (en) * | 2020-06-16 | 2021-02-26 | 上海哔哩哔哩科技有限公司 | CSS code compression method, electronic device and storage medium |
| CN114880523A (en) * | 2022-04-27 | 2022-08-09 | 深圳市优必选科技股份有限公司 | Character string processing method and device, electronic equipment and storage medium |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100755533B1 (en) * | 2005-07-25 | 2007-09-06 | 주식회사 팬택 | Character set generation method and apparatus |
| KR101386169B1 (en) * | 2007-08-09 | 2014-04-17 | 삼성전자주식회사 | Apparatus and method for compression and restoration SMS |
| KR102633001B1 (en) * | 2023-03-27 | 2024-02-05 | 주식회사 무브먼츠 | Method for implementing underground facilities as ar in an offline environment using combined data precessing of qr code and nfc |
-
2003
- 2003-04-08 KR KR10-2003-0021924A patent/KR100494876B1/en not_active Expired - Lifetime
- 2003-12-31 CN CNB2003101242211A patent/CN100474781C/en not_active Expired - Fee Related
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101751451B (en) * | 2008-12-11 | 2012-04-25 | 高德软件有限公司 | A Chinese data compression and decompression method and related equipment |
| CN104054316B (en) * | 2011-11-15 | 2017-04-12 | 思杰系统有限公司 | Systems and methods for load balancing SMS centers and establishing virtual private networks |
| CN106471743A (en) * | 2014-06-20 | 2017-03-01 | 甲骨文国际公司 | Encoding of plain ASCII data streams |
| CN106354699A (en) * | 2015-07-13 | 2017-01-25 | 富士通株式会社 | Encoding computer program, encoding method, encoding apparatus, decoding computer program, decoding method, and decoding apparatus |
| CN106354699B (en) * | 2015-07-13 | 2021-05-18 | 富士通株式会社 | Encoding method, encoding device, decoding method, and decoding device |
| CN112416315A (en) * | 2020-06-16 | 2021-02-26 | 上海哔哩哔哩科技有限公司 | CSS code compression method, electronic device and storage medium |
| CN112416315B (en) * | 2020-06-16 | 2024-05-14 | 上海哔哩哔哩科技有限公司 | Compression method of CSS code, electronic device and storage medium |
| CN114880523A (en) * | 2022-04-27 | 2022-08-09 | 深圳市优必选科技股份有限公司 | Character string processing method and device, electronic equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN100474781C (en) | 2009-04-01 |
| KR20040087503A (en) | 2004-10-14 |
| KR100494876B1 (en) | 2005-06-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6778103B2 (en) | Encoding and decoding apparatus using context | |
| US5870036A (en) | Adaptive multiple dictionary data compression | |
| US6747582B2 (en) | Data compressing apparatus, reconstructing apparatus, and its method | |
| US7973680B2 (en) | Method and system for creating an in-memory physical dictionary for data compression | |
| US20130307709A1 (en) | Efficient techniques for aligned fixed-length compression | |
| US8509554B2 (en) | Systems and methods for optimizing bit utilization in data encoding | |
| JPH0869370A (en) | Method and system for compression of data | |
| US7770091B2 (en) | Data compression for use in communication systems | |
| CN101751451B (en) | A Chinese data compression and decompression method and related equipment | |
| KR20120137235A (en) | Method and apparatus for compressing genetic data | |
| US9236881B2 (en) | Compression of bitmaps and values | |
| CN1536768A (en) | Compression method for 2-byte character data | |
| JPS6356726B2 (en) | ||
| US7864085B2 (en) | Data compression method and apparatus | |
| US6518895B1 (en) | Approximate prefix coding for data compression | |
| EP4398120A1 (en) | Compact probabilistic data structure for storing streamed log lines | |
| CN108880559B (en) | Data compression method, data decompression method, compression device and decompression device | |
| EP1891545B1 (en) | Compressing language models with golomb coding | |
| CN115189696A (en) | Hardware compression and decompression method based on Huffman decoding table | |
| JPH03204234A (en) | Restoration of compressed data | |
| Robert et al. | Simple lossless preprocessing algorithms for text compression | |
| Dath et al. | Enhancing adaptive huffman coding through word by word compression for textual data | |
| CN112506876B (en) | Lossless compression query method supporting SQL query | |
| HK1070189A (en) | A method for compressing 2 bits characters data | |
| Zavadskyi | A family of data compression codes with multiple delimiters |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1070189 Country of ref document: HK |
|
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1070189 Country of ref document: HK |
|
| C41 | Transfer of patent application or patent right or utility model | ||
| C56 | Change in the name or address of the patentee | ||
| CP01 | Change in the name or title of a patent holder |
Address after: Seoul, South Kerean Patentee after: Pantech property management Co. Address before: Seoul, South Kerean Patentee before: PANTECH Co.,Ltd. |
|
| TR01 | Transfer of patent right |
Effective date of registration: 20161026 Address after: Seoul, South Kerean Patentee after: PANTECH CO.,LTD. Address before: Seoul, South Kerean Patentee before: Pantech property management Co. |
|
| TR01 | Transfer of patent right | ||
| TR01 | Transfer of patent right |
Effective date of registration: 20200609 Address after: Seoul, South Kerean Patentee after: Pan Thai Co.,Ltd. Address before: Seoul, South Kerean Patentee before: Pantech Co.,Ltd. |
|
| CF01 | Termination of patent right due to non-payment of annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090401 |