CN104036344A - Method for standardizing enterprise names - Google Patents
Method for standardizing enterprise names Download PDFInfo
- Publication number
- CN104036344A CN104036344A CN201410206478.XA CN201410206478A CN104036344A CN 104036344 A CN104036344 A CN 104036344A CN 201410206478 A CN201410206478 A CN 201410206478A CN 104036344 A CN104036344 A CN 104036344A
- Authority
- CN
- China
- Prior art keywords
- enterprise name
- enterprise
- name
- sales data
- title
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method for standardizing enterprise names. The method comprises the following steps that: enterprise names in received sales data are completely matched with names in a preset enterprise information database; messy code processing is performed on unmatched enterprise names; attached information is deleted according to name standardability; symbolic texts are transformed, and reasonable conversion is performed on symbolic information contained in the enterprise names in the sales data; digital standardization processing is performed; name decomposition processing is performed to extract data for describing a plurality of enterprise names in the sales data one by one; semantic conversion is performed according to a word library; and standardized enterprise names are outputted. With the method of the invention adopted, the enterprise names in the sales data can be standardized, and non-name information such as information originally containing symbols, messy codes and attached information can be deleted, and statistics can be facilitated.
Description
Technical field
The present invention relates to enterprise name treatment technology in sales data, refer more particularly to a kind of method of standard enterprise name.
Background technology
Enterprise marketing data are being carried out in data handling procedure, and enterprise name whether standard is that degree of accuracy for final sales report form statistics plays very large correlation.If enterprise name is standard effectively, not only affect the progress of whole operation process, also affect the precision of report form statistics simultaneously, so enterprise name standard is very necessary.
In most cases, the building form of corporate specification title is: administrative area+font size+industry characteristic+organizational form or font size+administrative area+industry characteristic+organizational form.
Title or the place name of the administrative division Shi Ben enterprise location administrative division above county level in Business Name;
Font size in Business Name is the title that has investor jointly to confer according to the corporate culture of this enterprise and feature;
Industry characteristic in Business Name only refers to the film name (establishing according to industrial and commercial bureau's relevant regulations) that investor manages
In Business Name, organizational form is determined according to enterprise economic activity character and relevant laws and regulations of the state
For example: Shanghai Leiyun Pharmaceutical Industry Co., Ltd., the building form of title:
Administrative division: Shanghai;
Font size: Lei Yunshang;
Industry characteristic: medicine;
Organizational form: company limited;
And in true enterprise goods entry, stock and sales data, enterprise is usually because self conveniently can add different special markings in enterprise name, these enterprise names, when statistical study, are carried out subsequent operation after non-type name translation need to being become to the enterprise name of codes and standards.
Present stage is while carrying out standard for enterprise name, often only remove the mess code in title, and ignored the arrangement to data name standardization, thereby cause follow-up manual operation workload huge, the performance period of whole process can be very long, and enterprise need to spend considerable resource for this reason and process.
For the problem in correlation technique, effective solution is not yet proposed at present.
Summary of the invention
For the problem in correlation technique, the present invention proposes a kind of method of standard enterprise name, the enterprise name that can produce effect in authority data, convenient statistics.
Technical scheme of the present invention is achieved in that
According to an aspect of the present invention, provide a kind of method of standard enterprise name, the method comprises the following steps:
The enterprise name receiving in sales data is mated completely with the title in the company information database setting in advance;
For the enterprise name of not mating, enterprise name is carried out to mess code processing;
According to title standardization, carry out additional information removing;
Symbol textization is transformed, remove in sales data enterprise name and contain symbolic information and rationally transform;
Carry out digital standard processing;
Title resolution process, extracts in sales data and has the data of describing a plurality of enterprise names to extract one by one;
According to character library, carry out semanteme and transform,
Output standard enterprise name.
Preferably, described in, carrying out digital standardization processing is specially: by containing digital data in enterprise name in sales data, change, unification is converted to capitalization by small letter.
Preferably, describedly title in enterprise name processed to semantic conversion specifically comprise:
1), proprietary name is transformed;
2), wrongly written character is changed;
3), the complex form of Chinese characters is changed.
The present invention, by adopting said method enterprise name in sales data can be carried out to standardization processing, will originally contain the non-name informations such as symbol, mess code and additional information and dispose, and be convenient to statistics.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, to the accompanying drawing of required use in embodiment be briefly described below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skills, do not paying under the prerequisite of creative work, can also obtain according to these accompanying drawings other accompanying drawing.
Fig. 1 is according to the process flow diagram of the method for the standard enterprise name of the embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, rather than whole embodiment.Embodiment based in the present invention, the every other embodiment that those of ordinary skills obtain, belongs to the scope of protection of the invention.
The embodiment of the present invention provides a kind of method of standard enterprise name, and the method comprises the following steps:
The enterprise name receiving in sales data is mated completely with the title in the company information database setting in advance;
For the enterprise name of not mating, enterprise name is carried out to mess code processing;
According to title standardization, carry out additional information removing;
Symbol textization is transformed, remove in sales data enterprise name and contain symbolic information and rationally transform;
Carry out digital standard processing;
Title resolution process, extracts in sales data and has the data of describing a plurality of enterprise names to extract one by one;
According to character library, carry out semanteme and transform,
Output standard enterprise name.
Preferably, described in, carrying out digital standardization processing is specially: by containing digital data in enterprise name in sales data, change, unification is converted to capitalization by small letter.
Preferably, describedly title in enterprise name processed to semantic conversion specifically comprise:
1), proprietary name is transformed;
2), wrongly written character is changed;
3), the complex form of Chinese characters is changed.
The said method that the present embodiment provides can carry out standardization processing by enterprise name in sales data, will originally contain the non-name informations such as symbol, mess code and additional information and dispose, and is convenient to statistics.
Referring to the accompanying drawing specific embodiment that develops simultaneously, the present invention is described in detail.
As shown in Figure 1, the inventive method comprises following steps.
Step 101, accepts enterprise name data.
Step 102, enterprise name in the enterprise name data of acceptance is carried out to complete similar coupling with the enterprise name in the company information database setting in advance, data to complete coupling, if can mate execution step 109, for the enterprise name execution step 103 of not mating.
Step 103, carries out mess code processing to enterprise name;
Concrete, the mess code of the non-Chinese character in title and numeral is removed, illustrate:
Huairou, # Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine is (former: the 466 doctor Li TEL:302%s of institute of PLA)
Carry out being converted to after mess code processing: Huairou, Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine is (former: the 466 doctor Li TEL:302 of institute of PLA).
Step 104, processes additional information in enterprise name;
Concrete, additional information subsidiary in enterprise name is deleted, as name, phone etc., illustrate:
1, Huairou, Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine is (former: the 466 doctor Li TEL:302 of institute of PLA);
Be converted to: Huairou, Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine is (former: 466 institutes of PLA).
2,1003776_ People's Hospital No.3, Bengbu City, is converted to after processing: People's Hospital No.3, Bengbu City.
3,33563 (G) of Chang Ping (Y) Bai Futang pharmacy, are converted to after processing: normal Ping Baifutang pharmacy.
4, the dark bamboo of auspicious thatched cottage medicine company branch, (Z Y H) ■ Shenzhen Heng Gang (Y) 556336, are converted to: the dark bamboo of the auspicious thatched cottage of Shenzhen Heng Gang medicine company branch after processing.
Step 105, transforms symbol text in enterprise name;
Concrete, by symbol completion incomplete in title, illustrate: " Huairou, Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine (former:: 466 institutes of PLA) ", be converted to: " Huairou, Beijing affiliated hospital of Chinese People's Liberation Army Air Force Institute Of Aviation Medicine (former: 466 institute of PLA) ".
Step 106, processes digital standardization in enterprise name;
Concrete, numeral unification is revised as to Chinese character by arabic numeral, illustrate: 466 institutes of PLA are converted to: the 4th Liu Liu institute of PLA.
Step 107, to title resolution process in enterprise name;
Concrete, compound title is resolved into two titles, illustrate: Huairou, Beijing affiliated hospital of Air Force Aviation Medical Inst. of PLA (former: the 4th Liu Liu institute of PLA) to be decomposed into:
1, Huairou, Beijing affiliated hospital of Air Force Aviation Medical Inst. of PLA
2, former: the 4th Liu Liu institute of PLA
Step 108, processes semantic conversion to title in enterprise name; Here comprise:
1, proprietary name is transformed, illustrate: Huairou, Beijing affiliated hospital of Air Force Aviation Medical Inst. of PLA is (former: the 4th Liu Liu institute of PLA) to be converted into: Huairou, Beijing affiliated hospital of Air Force Aviation Medical Inst. is (former: the 4th Liu Liu institute of PLA).
2, wrongly written character is changed: the land-reclaimable prosperous pharmacy in brontosaurus Jiang Baoquan ridge is converted to: the land-reclaimable prosperous pharmacy of Heilungkiang Bao Quanling.
3, the complex form of Chinese characters is changed, illustrated: the large pharmacy of Bao and hall is converted to: the large pharmacy of precious and hall;
Step 109, output standard enterprise name.
In sum, by means of technique scheme of the present invention, the said method providing by the present embodiment can carry out standardization processing by enterprise name in sales data, will originally contain the non-name informations such as symbol, mess code and additional information and dispose, and is convenient to statistics.
The foregoing is only preferred embodiment of the present invention, in order to limit the present invention, within the spirit and principles in the present invention not all, any modification of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.
Claims (3)
1. a method for standard enterprise name, is characterized in that, the method comprises the following steps:
The enterprise name receiving in sales data is mated completely with the title in the company information database setting in advance;
For the enterprise name of not mating, enterprise name is carried out to mess code processing;
According to title standardization, carry out additional information removing;
Symbol textization is transformed, remove in sales data enterprise name and contain symbolic information and rationally transform;
Carry out digital standard processing;
Title resolution process, extracts in sales data and has the data of describing a plurality of enterprise names to extract one by one;
According to character library, carrying out semanteme transforms;
Output standard enterprise name.
2. the method for standard enterprise name according to claim 1, is characterized in that, described in carry out digital standardization and process and to be specially: by containing digital data in enterprise name in sales data, change, unified small letter be converted to capitalization.
3. the method for standard enterprise name according to claim 1, is characterized in that, describedly title in enterprise name is processed to semantic conversion specifically comprises:
1), proprietary name is transformed;
2), wrongly written character is changed;
3), the complex form of Chinese characters is changed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410206478.XA CN104036344A (en) | 2014-05-16 | 2014-05-16 | Method for standardizing enterprise names |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410206478.XA CN104036344A (en) | 2014-05-16 | 2014-05-16 | Method for standardizing enterprise names |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104036344A true CN104036344A (en) | 2014-09-10 |
Family
ID=51467108
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410206478.XA Pending CN104036344A (en) | 2014-05-16 | 2014-05-16 | Method for standardizing enterprise names |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104036344A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104765806A (en) * | 2015-04-01 | 2015-07-08 | 国家电网公司 | Automatic processing technology for nonstandard marketing client basic information |
CN107341144A (en) * | 2017-06-15 | 2017-11-10 | 云程科技股份有限公司 | A kind of method by segmenting formal Specification enterprise name |
CN108874769A (en) * | 2018-05-16 | 2018-11-23 | 深圳开思时代科技有限公司 | Accessory name standardized method and device, electronic equipment and medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101339638A (en) * | 2007-07-03 | 2009-01-07 | 周磊 | Method and system for automatic matching of commercial articles dispensing scope and goods receiving address for ordering platform |
US8140531B2 (en) * | 2008-05-02 | 2012-03-20 | International Business Machines Corporation | Process and method for classifying structured data |
CN102651013A (en) * | 2012-03-23 | 2012-08-29 | 上海安捷力信息系统有限公司 | Method and system for extracting area information from enterprise name data |
CN103020037A (en) * | 2012-12-05 | 2013-04-03 | 福建亿榕信息技术有限公司 | Official document standardized calibration system |
CN202916832U (en) * | 2012-08-27 | 2013-05-01 | 中国工商银行股份有限公司 | Data matching device |
-
2014
- 2014-05-16 CN CN201410206478.XA patent/CN104036344A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101339638A (en) * | 2007-07-03 | 2009-01-07 | 周磊 | Method and system for automatic matching of commercial articles dispensing scope and goods receiving address for ordering platform |
US8140531B2 (en) * | 2008-05-02 | 2012-03-20 | International Business Machines Corporation | Process and method for classifying structured data |
CN102651013A (en) * | 2012-03-23 | 2012-08-29 | 上海安捷力信息系统有限公司 | Method and system for extracting area information from enterprise name data |
CN202916832U (en) * | 2012-08-27 | 2013-05-01 | 中国工商银行股份有限公司 | Data matching device |
CN103020037A (en) * | 2012-12-05 | 2013-04-03 | 福建亿榕信息技术有限公司 | Official document standardized calibration system |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104765806A (en) * | 2015-04-01 | 2015-07-08 | 国家电网公司 | Automatic processing technology for nonstandard marketing client basic information |
CN104765806B (en) * | 2015-04-01 | 2018-09-18 | 国家电网公司 | The marketing nonstandard technology for automatically treating of customer basis information |
CN107341144A (en) * | 2017-06-15 | 2017-11-10 | 云程科技股份有限公司 | A kind of method by segmenting formal Specification enterprise name |
CN108874769A (en) * | 2018-05-16 | 2018-11-23 | 深圳开思时代科技有限公司 | Accessory name standardized method and device, electronic equipment and medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108595389B (en) | Method for converting Word document into txt plain text document | |
US11574129B2 (en) | Systems and methods for generalized structured data discovery utilizing contextual metadata disambiguation via machine learning techniques | |
US20120317078A1 (en) | Replication Support for Structured Data | |
CN110909123A (en) | Data extraction method and device, terminal equipment and storage medium | |
CN104298725A (en) | Method for one-time editing input and multi-version output of on-line courseware development system | |
CN104036344A (en) | Method for standardizing enterprise names | |
CN103049745A (en) | How to convert a picture into a memo | |
CN105045583A (en) | Visualized process based IETM (Interactive Electronic Technical Manual) fault class data module creation apparatus and creation method therefor | |
CN117176981A (en) | Mixed cut video generation method and device, computer equipment and medium | |
CN109684395B (en) | Visual data interface universal analysis method based on natural language processing | |
CN106354731A (en) | Document inspection method and device | |
CN105718447A (en) | Time information extracting method and apparatus, and smart question and answer system | |
CN101393481A (en) | Multifunction input method implementing method and apparatus thereof | |
EP2565798A1 (en) | Document processing device and program | |
CN119577205A (en) | Method, device and medium for constructing minority language graphic and text data set for Internet data | |
CN110909726B (en) | Written document interaction system and method based on image recognition | |
CN113539518A (en) | Drug data processing method, device and electronic device based on RPA and AI | |
CN113221506A (en) | Lecture typesetting method and device, electronic equipment and storage medium | |
CN112650754A (en) | Method for importing total data of relational database into Hive | |
CN112307269B (en) | Intelligent analysis system and method for human-object relationship in novel | |
CN110019667A (en) | It is a kind of that word method and device is looked into based on voice input information | |
CN102723067A (en) | Character display method and device | |
JP2010204799A (en) | System and method for managing process | |
Lam et al. | The'rtry'R package for preprocessing plant trait data | |
CN102521359A (en) | Interface data file comparison method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140910 |
|
RJ01 | Rejection of invention patent application after publication |