CN104036344A - Method for standardizing enterprise names - Google Patents

Publication number
CN104036344A
Authority
CN
China
Prior art keywords
enterprise name
enterprise
name
sales data
title
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410206478.XA
Other languages
Chinese (zh)
Inventor
黄旭江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Bei Tong Medical Sci-Tech Advisory Co Ltd
Original Assignee
Shanghai Bei Tong Medical Sci-Tech Advisory Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2014-05-16
Filing date
2014-05-16
Publication date
2014-09-10
Application filed by Shanghai Bei Tong Medical Sci-Tech Advisory Co Ltd
Priority to CN201410206478.XA
Publication of CN104036344A

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method for standardizing enterprise names. The method comprises the following steps: exactly matching the enterprise names in received sales data against the names in a preset enterprise information database; performing garbled-character processing on unmatched enterprise names; removing additional information according to name standardization requirements; converting symbols to text, so that symbolic information contained in the enterprise names in the sales data is reasonably converted; performing number standardization; decomposing names, extracting one by one the entries in the sales data that describe multiple enterprise names; performing semantic conversion according to a word library; and outputting the standardized enterprise names. The method standardizes the enterprise names in sales data and removes non-name information such as symbols, garbled characters, and additional information, which facilitates statistics.

Description

A method for standardizing enterprise names
Technical field
The present invention relates to techniques for processing enterprise names in sales data, and in particular to a method for standardizing enterprise names.
Background
When enterprise sales data are processed, whether the enterprise names are standardized has a strong bearing on the accuracy of the final sales report statistics. If enterprise names are not standardized effectively, both the progress of the overall workflow and the accuracy of the report statistics suffer, so standardizing enterprise names is necessary.
In most cases, a standard company name is composed as: administrative division + trade name + industry characteristic + organizational form, or trade name + administrative division + industry characteristic + organizational form.
The administrative division in a company name is the name or place name of the administrative division, at county level or above, where the enterprise is located;
The trade name in a company name is a name agreed jointly by the investors according to the corporate culture and characteristics of the enterprise;
The industry characteristic in a company name refers to the name of the industry in which the investors operate (determined according to the relevant regulations of the administration for industry and commerce);
The organizational form in a company name is determined by the nature of the enterprise's economic activity and the relevant national laws and regulations.
For example, the name Shanghai Lei Yun Shang Pharmaceutical Co., Ltd. is composed of:
Administrative division: Shanghai;
Trade name: Lei Yun Shang;
Industry characteristic: pharmaceutical;
Organizational form: limited company (Co., Ltd.);
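The four-part composition described above can be modeled as a small record type. The following is a minimal sketch, not part of the patent: the class name `EnterpriseName`, its field names, and the `compose` method are hypothetical, and the English component strings are an assumed rendering of the example above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EnterpriseName:
    """Hypothetical record for the four components of a standard name."""
    administrative_division: str
    trade_name: str
    industry: str
    organizational_form: str

    def compose(self, trade_name_first: bool = False, sep: str = " ") -> str:
        # The text allows two orders: division + trade name, or trade name + division.
        head = (
            [self.trade_name, self.administrative_division]
            if trade_name_first
            else [self.administrative_division, self.trade_name]
        )
        return sep.join(head + [self.industry, self.organizational_form])


example = EnterpriseName("Shanghai", "Lei Yun Shang", "Pharmaceutical", "Co., Ltd.")
print(example.compose())  # Shanghai Lei Yun Shang Pharmaceutical Co., Ltd.
```

For Chinese names, which are written without spaces, `sep=""` would be the natural choice.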
In real purchasing, inventory, and sales data, however, enterprises often add various special marks to enterprise names for their own convenience. For statistical analysis, these non-standard names must first be converted into standardized enterprise names before subsequent operations can be performed.
At present, standardization of enterprise names often only removes garbled characters from the names and neglects normalizing the names themselves. This leaves a large amount of follow-up manual work, lengthens the whole process, and costs enterprises considerable resources.
No effective solution to this problem in the related art has yet been proposed.
Summary of the invention
To address the problem in the related art, the present invention proposes a method for standardizing enterprise names that can effectively standardize the enterprise names in the data and facilitate statistics.
The technical solution of the present invention is realized as follows:
According to one aspect of the present invention, a method for standardizing enterprise names is provided, comprising the following steps:
Exactly matching the enterprise names in received sales data against the names in a preset enterprise information database;
For enterprise names that are not matched, performing garbled-character processing on the enterprise name;
Removing additional information according to name standardization requirements;
Converting symbols to text, reasonably converting the symbolic information contained in the enterprise names in the sales data;
Performing number standardization;
Decomposing names, extracting one by one the entries in the sales data that describe multiple enterprise names;
Performing semantic conversion according to a word library;
Outputting the standardized enterprise name.
Preferably, the number standardization is specifically: converting the digits contained in enterprise names in the sales data uniformly from lowercase (Arabic) numerals to uppercase (Chinese-character) numerals.
Preferably, the semantic conversion of names in enterprise names specifically comprises:
1) converting proprietary names;
2) correcting wrongly written characters;
3) converting traditional Chinese characters.
By adopting the above method, the present invention can standardize enterprise names in sales data, removing non-name information such as symbols, garbled characters, and additional information, which facilitates statistics.
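The overall flow summarized above (exact match first, then a chain of cleaning steps for unmatched names) can be sketched as a simple function pipeline. This is an illustrative sketch under stated assumptions, not the patent's implementation: the function `standardize`, the `Step` type, and the stand-in steps in the usage lines are all hypothetical, and the decomposition step, which yields several names, would need a list-returning variant that is omitted here.

```python
from typing import Callable, Iterable, Set

# Hypothetical type: each cleaning step maps one name string to another.
Step = Callable[[str], str]


def standardize(name: str, known_names: Set[str], steps: Iterable[Step]) -> str:
    """Return the name unchanged on an exact database match,
    otherwise apply each cleaning step in order."""
    if name in known_names:
        return name  # already standard, skip straight to output
    for step in steps:
        name = step(name)
    return name


# Usage with trivial stand-in steps (assumptions for illustration only).
known = {"ACME Pharmacy"}
steps = [str.strip, lambda s: s.replace("#", "")]
print(standardize("ACME Pharmacy", known, steps))   # ACME Pharmacy
print(standardize(" #Other Pharmacy ", known, steps))  # Other Pharmacy
```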
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawing used in the embodiments is briefly described below. The drawing described below shows only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from it without creative effort.
Fig. 1 is a flowchart of the method for standardizing enterprise names according to an embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawing. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention fall within the scope of protection of the present invention.
An embodiment of the present invention provides a method for standardizing enterprise names, comprising the following steps:
Exactly matching the enterprise names in received sales data against the names in a preset enterprise information database;
For enterprise names that are not matched, performing garbled-character processing on the enterprise name;
Removing additional information according to name standardization requirements;
Converting symbols to text, reasonably converting the symbolic information contained in the enterprise names in the sales data;
Performing number standardization;
Decomposing names, extracting one by one the entries in the sales data that describe multiple enterprise names;
Performing semantic conversion according to a word library;
Outputting the standardized enterprise name.
Preferably, the number standardization is specifically: converting the digits contained in enterprise names in the sales data uniformly from lowercase (Arabic) numerals to uppercase (Chinese-character) numerals.
Preferably, the semantic conversion of names in enterprise names specifically comprises:
1) converting proprietary names;
2) correcting wrongly written characters;
3) converting traditional Chinese characters.
The method provided by this embodiment can standardize enterprise names in sales data, removing non-name information such as symbols, garbled characters, and additional information, which facilitates statistics.
The present invention is described in detail below with reference to the accompanying drawing and a specific embodiment.
As shown in Fig. 1, the method of the invention comprises the following steps.
Step 101: receive enterprise name data.
Step 102: exactly match the enterprise names in the received enterprise name data against the enterprise names in the preset enterprise information database. If a name matches, go to step 109; for a name that does not match, go to step 103.
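Steps 101 and 102 amount to partitioning the received names by membership in the preset database. A minimal sketch, assuming the database can be represented as an in-memory set of standard names; the function name `split_by_exact_match` and the sample names are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def split_by_exact_match(
    names: Iterable[str], database: Set[str]
) -> Tuple[List[str], List[str]]:
    """Partition names into (matched, unmatched) by exact set membership.

    Matched names would go straight to output (step 109);
    unmatched names would continue to cleaning (step 103).
    """
    matched: List[str] = []
    unmatched: List[str] = []
    for name in names:
        (matched if name in database else unmatched).append(name)
    return matched, unmatched


# Hypothetical sample data for illustration.
db = {"Bengbu Third People's Hospital"}
received = ["Bengbu Third People's Hospital", "1003776_Bengbu Third People's Hospital"]
matched, unmatched = split_by_exact_match(received, db)
```

A set gives constant-time lookups on average; a real deployment would more likely query an indexed database column, which is outside what the text specifies.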
Step 103: perform garbled-character processing on the enterprise name;
Specifically, garbled characters in the name that are neither Chinese characters nor digits are removed. For example:
#Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 466 Hospital, Dr. Li TEL:302%s)
After garbled-character processing, this becomes: Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 466 Hospital, Dr. Li TEL:302).
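One way to sketch step 103 is with a character whitelist. The patent does not specify which characters count as garbled; in the example above, `#` and `%s` are dropped while letters, digits, colons, and parentheses survive, so the whitelist and the placeholder pattern below are assumptions chosen to reproduce that behavior. The function name `remove_garbled` and the Chinese input string are hypothetical.

```python
import re

# Assumption: printf-style leftovers such as "%s" are treated as garbage.
_PLACEHOLDER = re.compile(r"%[A-Za-z]")
# Assumption: keep CJK ideographs, ASCII letters and digits, spaces,
# and half-width or full-width parentheses and colons; drop the rest.
_DISALLOWED = re.compile(r"[^\u4e00-\u9fffA-Za-z0-9()():: ]")


def remove_garbled(name: str) -> str:
    """Strip placeholder debris, then every character outside the whitelist."""
    name = _PLACEHOLDER.sub("", name)
    return _DISALLOWED.sub("", name)


raw = "#北京怀柔医院(原:解放军466医院李医生TEL:302%s)"
print(remove_garbled(raw))  # 北京怀柔医院(原:解放军466医院李医生TEL:302)
```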
Step 104: process additional information in the enterprise name;
Specifically, additional information attached to the enterprise name, such as a person's name or a telephone number, is deleted. For example:
1. Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 466 Hospital, Dr. Li TEL:302);
is converted to: Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 466 Hospital).
2. "1003776_Third People's Hospital of Bengbu City" is converted to: Third People's Hospital of Bengbu City.
3. "Changping (Y) Baifutang Pharmacy (G) 33563" is converted to: Changping Baifutang Pharmacy.
4. "(Z Y H) ■ Shenzhen Henggang (Y) Ruicaotang Pharmaceutical Shenzhu Branch 556336" is converted to: Shenzhen Henggang Ruicaotang Pharmaceutical Shenzhu Branch.
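The examples above suggest three kinds of additional information: bracketed letter markers such as (Y) or (Z Y H), telephone fragments, and numeric codes at the start or end of the name. A minimal regex sketch follows. The patterns, the four-digit threshold, and the function name `strip_additional_info` are assumptions, the Chinese strings are illustrative stand-ins for the examples, and removing a person's name (such as "Dr. Li") would need a name dictionary that is not attempted here.

```python
import re

# Bracketed runs of Latin letters, e.g. (Y), (G), (Z Y H); full-width brackets too.
_MARKER = re.compile(r"[((]\s*[A-Za-z](?:\s*[A-Za-z])*\s*[))]")
# Telephone fragments such as "TEL:302".
_PHONE = re.compile(r"TEL\s*[::]?\s*\d+", re.IGNORECASE)
# Assumption: a run of 4+ digits at either edge is an internal code, not part of the name.
_ID_EDGE = re.compile(r"^\s*\d{4,}[_\-\s]*|[_\-\s]*\d{4,}\s*$")


def strip_additional_info(name: str) -> str:
    """Remove letter markers, phone fragments, and leading/trailing numeric codes."""
    name = _MARKER.sub("", name)
    name = _PHONE.sub("", name)
    name = _ID_EDGE.sub("", name)
    return name.strip()


print(strip_additional_info("1003776_蚌埠市第三人民医院"))  # 蚌埠市第三人民医院
print(strip_additional_info("常平(Y)百福堂药店(G)33563"))   # 常平百福堂药店
```

Short digit runs inside a name, such as the 466 in a hospital name, are left alone because the edge pattern requires at least four digits at the very start or end.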
Step 105: convert symbols in the enterprise name to text;
Specifically, incomplete or malformed symbols in the name are completed or corrected. For example: "Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former:: PLA 466 Hospital)" is converted to: "Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 466 Hospital)".
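The text does not enumerate which symbol repairs step 105 performs, so the following sketch makes three assumptions: full-width parentheses and colons are unified to half-width, repeated colons are collapsed to one, and missing closing parentheses are appended. The function name `normalize_symbols` and the sample input are hypothetical.

```python
import re

# Assumption: unify full-width brackets and colon to their half-width forms.
_FULL_TO_HALF = str.maketrans({"(": "(", ")": ")", ":": ":"})


def normalize_symbols(name: str) -> str:
    """Unify bracket/colon widths, collapse repeated colons, close open brackets."""
    name = name.translate(_FULL_TO_HALF)
    name = re.sub(r":{2,}", ":", name)
    missing = name.count("(") - name.count(")")
    if missing > 0:
        name += ")" * missing  # complete an unclosed parenthesis
    return name


print(normalize_symbols("某医院(原::解放军466医院"))  # 某医院(原:解放军466医院)
```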
Step 106: standardize numbers in the enterprise name;
Specifically, numbers are uniformly changed from Arabic numerals to Chinese characters. For example, "PLA 466 Hospital" is converted to "PLA 四六六 Hospital" (the digits 4, 6, 6 written as the Chinese characters 四, 六, 六).
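The digit-by-digit conversion in step 106 can be sketched with a translation table. The mapping of 1 to 9 is standard; using 〇 rather than 零 for zero is an assumption, as are the function name `digits_to_chinese` and the Chinese rendering of the example name.

```python
# Digit-by-digit mapping; the choice of 〇 for zero is an assumption.
_DIGITS = str.maketrans("0123456789", "〇一二三四五六七八九")


def digits_to_chinese(name: str) -> str:
    """Replace each Arabic digit with the corresponding Chinese numeral character."""
    return name.translate(_DIGITS)


print(digits_to_chinese("解放军466医院"))  # 解放军四六六医院
```

This converts each digit independently, matching the example (466 becomes 四六六), and does not produce positional forms such as 四百六十六.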
Step 107: decompose names in the enterprise name;
Specifically, a compound name is decomposed into two names. For example, "Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 四六六 Hospital)" is decomposed into:
1. Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing
2. former: PLA 四六六 Hospital
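The decomposition in step 107 can be sketched by splitting off parenthesized segments. The patent does not say how compound names are detected; treating each parenthesized segment as a separate name is an assumption drawn from the single example, and the function name `decompose` and the Chinese input are hypothetical.

```python
import re
from typing import List

# A parenthesized segment without nested brackets; half-width or full-width.
_PAREN = re.compile(r"[((]([^()()]*)[))]")


def decompose(name: str) -> List[str]:
    """Split a compound name into the outer name plus each bracketed name."""
    inner = [part.strip() for part in _PAREN.findall(name) if part.strip()]
    outer = _PAREN.sub("", name).strip()
    return ([outer] if outer else []) + inner


parts = decompose("空军航空医学研究所附属医院(原:解放军四六六医院)")
print(parts)  # ['空军航空医学研究所附属医院', '原:解放军四六六医院']
```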
Step 108: perform semantic conversion on names in the enterprise name. This comprises:
1. Converting proprietary names. For example, "Affiliated Hospital of the PLA Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 四六六 Hospital)" is converted to: "Affiliated Hospital of the Air Force Institute of Aviation Medicine, Huairou, Beijing (former: PLA 四六六 Hospital)".
2. Correcting wrongly written characters. For example, "Leilongjiang Baoquanling Nongken Xingwang Pharmacy", in which the province name Heilongjiang is miswritten, is converted to: Heilongjiang Baoquanling Nongken Xingwang Pharmacy.
3. Converting traditional Chinese characters. For example, "寶和堂 Pharmacy" (Baohetang, written with the traditional character 寶) is converted to: "宝和堂 Pharmacy" (with the simplified character 宝).
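The three semantic conversions in step 108 can each be sketched as a lookup table applied by substring replacement. The patent only says the conversion is performed "according to a word library"; the table contents, the table names, and the function `semantic_convert` below are hypothetical illustrations, with entries modeled on the examples above.

```python
from typing import Dict

# Hypothetical word-library entries, for illustration only.
PROPER_NAMES: Dict[str, str] = {"解放军空军": "空军"}  # shorten a proprietary prefix
TYPOS: Dict[str, str] = {"雷龙江": "黑龙江"}            # miswritten province name
TRADITIONAL: Dict[str, str] = {"寶": "宝"}              # traditional to simplified


def semantic_convert(name: str, *tables: Dict[str, str]) -> str:
    """Apply each replacement table in order, entry by entry."""
    for table in tables:
        for source, target in table.items():
            name = name.replace(source, target)
    return name


print(semantic_convert("寶和堂大药房", PROPER_NAMES, TYPOS, TRADITIONAL))  # 宝和堂大药房
```

Plain substring replacement is order-sensitive when entries overlap; a fuller word library would probably match longest entries first or work on segmented words, which the text does not specify.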
Step 109: output the standardized enterprise name.
In summary, with the technical solution of the present invention, the method provided by this embodiment can standardize enterprise names in sales data, removing non-name information such as symbols, garbled characters, and additional information, which facilitates statistics.
The foregoing are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (3)

1. A method for standardizing enterprise names, characterized in that the method comprises the following steps:
Exactly matching the enterprise names in received sales data against the names in a preset enterprise information database;
For enterprise names that are not matched, performing garbled-character processing on the enterprise name;
Removing additional information according to name standardization requirements;
Converting symbols to text, reasonably converting the symbolic information contained in the enterprise names in the sales data;
Performing number standardization;
Decomposing names, extracting one by one the entries in the sales data that describe multiple enterprise names;
Performing semantic conversion according to a word library;
Outputting the standardized enterprise name.
2. The method for standardizing enterprise names according to claim 1, characterized in that the number standardization is specifically: converting the digits contained in enterprise names in the sales data uniformly from lowercase (Arabic) numerals to uppercase (Chinese-character) numerals.
3. The method for standardizing enterprise names according to claim 1, characterized in that the semantic conversion of names in enterprise names specifically comprises:
1) converting proprietary names;
2) correcting wrongly written characters;
3) converting traditional Chinese characters.
CN201410206478.XA 2014-05-16 2014-05-16 Method for standardizing enterprise names Pending CN104036344A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410206478.XA CN104036344A (en) 2014-05-16 2014-05-16 Method for standardizing enterprise names

Publications (1)

Publication Number Publication Date
CN104036344A true CN104036344A (en) 2014-09-10

Family

ID=51467108

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410206478.XA Pending CN104036344A (en) 2014-05-16 2014-05-16 Method for standardizing enterprise names

Country Status (1)

Country Link
CN (1) CN104036344A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101339638A (en) * 2007-07-03 2009-01-07 周磊 Method and system for automatic matching of commercial articles dispensing scope and goods receiving address for ordering platform
US8140531B2 (en) * 2008-05-02 2012-03-20 International Business Machines Corporation Process and method for classifying structured data
CN102651013A (en) * 2012-03-23 2012-08-29 上海安捷力信息系统有限公司 Method and system for extracting area information from enterprise name data
CN103020037A (en) * 2012-12-05 2013-04-03 福建亿榕信息技术有限公司 Official document standardized calibration system
CN202916832U (en) * 2012-08-27 2013-05-01 中国工商银行股份有限公司 Data matching device

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765806A (en) * 2015-04-01 2015-07-08 国家电网公司 Automatic processing technology for nonstandard marketing client basic information
CN104765806B (en) * 2015-04-01 2018-09-18 国家电网公司 The marketing nonstandard technology for automatically treating of customer basis information
CN107341144A (en) * 2017-06-15 2017-11-10 云程科技股份有限公司 A kind of method by segmenting formal Specification enterprise name
CN108874769A (en) * 2018-05-16 2018-11-23 深圳开思时代科技有限公司 Accessory name standardized method and device, electronic equipment and medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140910
