[go: up one dir, main page]

WO2022262114A1 - Rpa and ai combined customs declaration information processing method and processing device - Google Patents

Rpa and ai combined customs declaration information processing method and processing device Download PDF

Info

Publication number
WO2022262114A1
WO2022262114A1 PCT/CN2021/114323 CN2021114323W WO2022262114A1 WO 2022262114 A1 WO2022262114 A1 WO 2022262114A1 CN 2021114323 W CN2021114323 W CN 2021114323W WO 2022262114 A1 WO2022262114 A1 WO 2022262114A1
Authority
WO
WIPO (PCT)
Prior art keywords
customs declaration
commodity
customs
entry
item
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2021/114323
Other languages
French (fr)
Chinese (zh)
Inventor
潘庚生
汪冠春
胡一川
褚瑞
李玮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Laiye Network Technology Co Ltd
Laiye Technology Beijing Co Ltd
Original Assignee
Beijing Laiye Network Technology Co Ltd
Laiye Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Laiye Network Technology Co Ltd, Laiye Technology Beijing Co Ltd filed Critical Beijing Laiye Network Technology Co Ltd
Publication of WO2022262114A1 publication Critical patent/WO2022262114A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/383Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Definitions

  • the present disclosure relates to the technical fields of RPA and AI, and in particular to a processing method and processing device combining RPA and AI for customs declaration information.
  • Robotic Process Automation uses specific "robot software” to simulate human operations on computers and automatically execute process tasks according to rules.
  • AI Artificial Intelligence
  • the customs declaration personnel of Sinotrans need to declare hundreds of thousands of freight manifests every day, and it is required to input the customs declaration forms with different formats from each consignment company into the unified standard customs declaration form, and the business personnel need to spend a lot of money More human and material resources are invested in this work, resulting in poor timeliness, low efficiency, and error-prone.
  • the present disclosure aims to solve one of the technical problems in the related art at least to a certain extent.
  • the first purpose of this disclosure is to propose a processing method combining RPA and AI customs declaration information, which can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.
  • the second purpose of the present disclosure is to propose a processing device combining RPA and AI customs declaration information.
  • the third object of the present disclosure is to provide an electronic device.
  • a fourth object of the present disclosure is to propose a non-transitory computer-readable storage medium.
  • a fifth object of the present disclosure is to provide a computer program product.
  • the embodiment of the first aspect of the present disclosure proposes a processing method combining RPA and AI customs declaration information, including the following steps: based on optical character recognition OCR (Optical Character Recognition, optical character recognition) to carry out the content of the target customs declaration document Identification, to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document; the public items in the multiple customs declaration items, according to the preset first robot process automation RPA operation process, the customs declaration of the public items Entering the data into the corresponding first standard entry in the customs declaration interface; querying the commodity identification from the multiple customs declaration entries; according to the commodity identification, determining the commodity entry associated with the commodity identification from the multiple customs declaration entries, And enter the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.
  • OCR Optical Character Recognition, optical character recognition
  • content recognition is performed on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document;
  • OCR optical character recognition
  • the customs declaration data of public entries is entered into the corresponding first standard entry in the customs declaration interface;
  • the commodity identification is queried from multiple customs declaration entries;
  • the commodity identification from multiple Determine the commodity entry associated with the commodity identifier in the first customs declaration entry, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface. Therefore, the method can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.
  • processing method combining RPA and AI customs declaration information proposed in the first aspect of the present disclosure may also have the following additional technical features:
  • the commodity item associated with the commodity identifier is determined from the plurality of customs declaration entries, and the customs declaration data of the commodity entry is entered into the corresponding customs declaration interface.
  • the second standard entry includes: querying the relationship between the commodity identifier and the commodity entry according to the commodity identifier; if the commodity entry associated with the commodity identifier is found, according to the preset second RPA operation process, Enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface; if no commodity entry associated with the commodity identifier is found, according to the second standard entry in the customs declaration interface, from the Searching for a customs declaration item with similar semantics among the plurality of customs declaration items is used as the commodity item, and entering the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.
  • searching for customs declaration items with similar semantics from the plurality of customs declaration items as the commodity item includes: searching the customs declaration interface In the second standard entry, determine the semantic similarity with each of the customs declaration entries based on natural language processing NLP; according to the customs declaration entry with the highest semantic similarity, determine the semantic similarity with the second standard entry Product entry.
  • the commodity identifiers from the multiple customs declaration entries after querying the commodity identifiers from the multiple customs declaration entries, it further includes: inputting the commodity identifiers into the corresponding standard entries in the customs declaration interface, so as to display all items in the customs declaration interface. Describe the second standard item.
  • the target customs declaration document before performing content recognition on the target customs declaration document based on optical character recognition (OCR) to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, it further includes: according to the target customs declaration document The document type identifies each of the said customs declaration items to be identified.
  • OCR optical character recognition
  • the OCR-based optical character recognition is used to perform content recognition on the target customs declaration document, so as to obtain customs declarations of multiple customs declaration items in the target customs declaration document
  • the data includes: performing content identification on the customs declaration form based on OCR, so as to obtain customs declaration data of multiple customs declaration items contained in the customs declaration form.
  • the commodity item associated with the commodity identifier is determined from the plurality of customs declaration entries, and the customs declaration data of the commodity entry is entered into the corresponding customs declaration interface.
  • the second standard item it also includes: if there is the first standard item and/or the second standard item not entered in the customs declaration interface, based on OCR, the rest of the customs declaration form Target customs declaration documents for content identification.
  • the embodiment of the second aspect of the present disclosure proposes a processing device combining RPA and AI customs declaration information, including: a first recognition module, used to perform content recognition on the target customs declaration document based on optical character recognition (OCR), to obtain The customs declaration data of multiple customs declaration entries in the target customs declaration document; the first entry module is used to use the public entries in the multiple customs declaration entries according to the preset first robotic process automation RPA operation process, and transfer the public entries The customs declaration data of the entry is entered into the corresponding first standard entry in the customs declaration interface; the query module is used to query the commodity identification from the multiple customs declaration entries; the second input module is used to select from the multiple Determine the commodity entry associated with the commodity identifier in the first customs declaration entry, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface.
  • OCR optical character recognition
  • the first recognition module performs content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, and through the second An input module inputs the public items among the multiple customs declaration items into the corresponding first standard item in the customs declaration interface according to the pre-set first robotic process automation RPA operation process, and uses the query module to select from multiple items
  • OCR optical character recognition
  • the commodity identification is queried in the customs declaration entry, and the commodity entry associated with the commodity identification is determined from the multiple customs declaration entries through the second input module according to the commodity logo, and the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface. Therefore, the device can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.
  • processing device combining RPA and AI customs declaration information proposed in the second aspect of the present disclosure may also have the following additional technical features:
  • the second entry module includes: a query unit, configured to query the relationship between the commodity identifier and the commodity entry according to the commodity identifier; In the case of a commodity item associated with the commodity identifier, according to the preset second RPA operation process, the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface; the second input unit is used to If no commodity item associated with the commodity identifier is found, according to the second standard entry in the customs declaration interface, query a customs declaration entry with similar semantics from the plurality of customs declaration entries as the commodity entry, and Entering the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.
  • the second entry unit includes: a first determining subunit, configured to determine, based on natural language processing (NLP), the items related to each of the customs declarations for the second standard item in the customs declaration interface. Semantic similarity between items; a second determining subunit, configured to determine commodity items semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.
  • NLP natural language processing
  • the above-mentioned processing device further includes: a display module, configured to enter the commodity identifier into a corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface .
  • the above processing device further includes: a determining module configured to determine each of the customs declaration items to be identified according to the document type of the target customs declaration document.
  • the first identification module includes: an identification unit, configured to perform content identification on the customs declaration form based on OCR, so as to obtain the The customs declaration data of multiple customs declaration items included in the customs declaration form.
  • the above-mentioned processing device further includes: a second identification module, configured to have the first standard item and/or the second standard item not entered in the customs declaration interface Next, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form.
  • a second identification module configured to have the first standard item and/or the second standard item not entered in the customs declaration interface Next, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form.
  • the embodiment of the third aspect of the present disclosure provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the Instructions executed by at least one processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned processing method combining RPA and AI customs declaration information.
  • the embodiment of the fourth aspect of the present disclosure proposes a non-transitory computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, the above-mentioned processing of combining RPA and AI customs declaration information is realized method.
  • the embodiment of the fifth aspect of the present disclosure proposes a computer program product.
  • the instructions in the computer program product are executed by the processor, the above-mentioned processing method combining RPA and AI customs declaration information is executed.
  • FIG. 1 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure
  • Fig. 3 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure
  • FIG. 4 is an overall business flow chart of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure
  • FIG. 5 is a schematic block diagram of a processing device combining RPA and AI customs declaration information provided by an embodiment of the present disclosure
  • FIG. 6 shows a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure.
  • the customs declaration personnel of Sinotrans need to declare hundreds of thousands of freight manifests every day, requiring the customs declaration data of various shipping companies to be input into a unified standard customs declaration form, and the business personnel need to spend a lot of money Human and material resources are invested in this work, and the specific process is as follows: manually check the customs declaration list data, and reply to the supplier's customs declaration data after the verification is correct; manually log in to the customs declaration system according to the data of the customs declaration documents, and the corresponding fields The data needs to be entered into the customs declaration system one by one; the customs declaration system will perform verification according to the entered data, and complete the submission after the verification is passed.
  • Customs declaration personnel need to collect customs declaration forms every day, and then input them into the customs declaration system according to the customs declaration forms.
  • customs declaration documents there are many styles of customs declaration documents, and the fields of each commodity code are in different positions. It is very troublesome to find, and the font is also small ,
  • the commodity code is very long, and 3,000 customs declaration forms need to be entered every day.
  • the work tasks are heavy, and the work efficiency and job satisfaction are not high.
  • this disclosure proposes a processing method combining RPA and AI customs declaration information, which can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.
  • FIG. 1 is a schematic flowchart of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure.
  • the processing method of combining RPA and AI customs declaration information in the embodiment of the present disclosure includes the following steps:
  • S11 performing content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document.
  • OCR optical character recognition
  • target customs declaration documents such as including at least one of customs declaration form, packing list, waybill, various licenses, export collection verification and write-off form, power of attorney, contract, certificate of origin, etc. .
  • the target customs declaration document needs to be of the same quality as the sample, such as a clear scan.
  • What OCR can do is to recognize the frame line of the target customs declaration document, extract the content of each cell, and return the position of each item in the cell.
  • what needs to be done for extraction is to structure the frameless table, such as aligning rows and columns to extract structured information. It should be noted that during the extraction process, if the unit of measurement of an object in the target customs declaration document is different, normalization processing is required, that is, the unit of measurement is unified.
  • Multiple customs declaration entries include public entries and commodity entries, where the public entry is a public field extracted through OCR, including at least one of the domestic consignor, overseas consignee, exit customs, export date, declaration date, etc. ;
  • Commodity entries are commodity identifiers and element fields extracted through OCR, wherein the element fields include at least one of brand type, export preference, use, material, brand, specification or model.
  • the pre-set first robot process automation RPA operation process is the order in which the first robot fills in the customs declaration data of public items, for example, public items such as domestic consignor, overseas consignee, exit customs, export date 1.
  • the customs declaration data of the declaration date is entered into the corresponding first standard entry (public column) in the customs declaration interface.
  • the first standard entry can be shown in Table 1 below.
  • the domestic consignor and overseas consignee After obtaining the domestic consignor, overseas consignee, exit customs, export date, and declaration date, according to the pre-set first robotic process automation RPA operation process, the domestic consignor and overseas consignee
  • the customs declaration data of , exit customs, export date, and declaration date are entered in the corresponding first standard entry in the customs declaration interface in sequence.
  • Commodity identification refers to the collective term of various expressions and instructions used to identify a commodity or its characteristics and performance, for example, it may be a commodity number.
  • the product identification select the product item associated with the product identification from multiple customs declaration items, such as at least one of brand type, export preference, use, material, brand, specification or model, and submit the specific customs declaration data Enter the second standard entry corresponding to the customs declaration interface, the second standard entry can be shown in Table 2 below.
  • Form 2 After completing Form 1, Form 2 and Form 3, the customs declaration form in standard format can be exported.
  • the processing method of customs declaration information combined with RPA and AI in the embodiment of the present disclosure based on optical character recognition (OCR), performs content recognition on the target customs declaration document to obtain customs declaration data of multiple customs declaration items in the target customs declaration document, and multiple customs declaration items
  • OCR optical character recognition
  • the public entry in the public entry will enter the customs declaration data of the public entry into the corresponding first standard entry in the customs declaration interface, and query the commodity identification from multiple customs declaration entries.
  • a commodity entry associated with the commodity identifier is determined from the multiple customs declaration entries, and the customs declaration data of the commodity entry is entered into a second standard entry corresponding to the customs declaration interface.
  • this method can save manual operation steps, save labor costs, greatly improve efficiency, and greatly reduce the work pressure of personnel; because before the need to see with eyes, human errors are prone to occur during the data entry process, and In terms of data review, the system and paper-based document data are often inconsistent, which increases the risk of customs declaration.
  • This disclosure optimizes the process, which reduces the risk to a certain extent and makes the entry error rate controllable.
  • step S14 may include the following steps:
  • search for customs declaration entries with similar semantics as commodity entries from multiple customs declaration entries including: for the second standard entry in the customs declaration interface, based on natural language processing NLP Determine the semantic similarity with each customs declaration item; determine the commodity item semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.
  • the text needs to be segmented first.
  • the original text may consist of hundreds of thousands of Chinese entries, and the latitude is very high.
  • the words with little significance for decision-making are generally eliminated first, which is the purpose of feature word extraction.
  • the relationship between the commodity identifier and the commodity entry is queried from multiple customs declaration entries according to the commodity identifier.
  • the specific customs declaration data is entered into the second standard entry corresponding to the interface;
  • the semantic similarity between the customs declaration entry and each customs declaration entry is determined based on natural language processing NLP, and the customs declaration with the highest semantic similarity is selected entry, as a commodity entry semantically similar to the second standard entry.
  • the second standard item after querying the commodity identification from the multiple customs declaration items, it also includes: entering the commodity identification into the corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface.
  • OCR optical character recognition
  • customs declaration There are many types of documents that need to be reviewed for customs declaration, such as customs declaration, packing list, waybill, various licenses, export collection verification and write-off form, power of attorney, contract, certificate of origin, etc.
  • Each type of document requires different data to be extracted.
  • the format of each type of document may be different, and the packaging and unit price of the same product may be different.
  • target customs declaration documents For different types of target customs declaration documents, you can first determine the customs declaration items to be identified. For example, when the target customs declaration document is a customs declaration form, determine the customs declaration items to be identified as: domestic consignor, overseas consignee, exit customs Type, export date, declaration date, brand type, export preference, use, material, brand, specification or model; when the target customs declaration document is a contract or invoice, determine the customs declaration items to be identified as: brand, specification or model . In this way, customs declaration items can be identified more quickly based on the target customs declaration documents, further improving efficiency.
  • Fig. 3 is a schematic flowchart of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure. As shown in Figure 3, when the target customs declaration document includes a customs declaration form, the processing method of combining RPA and AI customs declaration information according to the embodiment of the present disclosure includes the following steps:
  • the packing list, waybill, various licenses, and export collection checks other than the customs declaration form can be checked.
  • Content identification of target customs declaration documents such as sales order, power of attorney, contract, certificate of origin, etc., to obtain the first standard item and/or the second standard item that needs to be entered.
  • customers such as customer 1, customer 2, customer 3...etc. send their own company's customs declaration to Sinotrans company.
  • the format of the customs declaration form of each customer may be different, for example, customer 1 uses the customs declaration form 1, customer 2 uses the customs declaration form 2, customer 3 uses the customs declaration form 3, ....
  • the first robot recognizes the content of the customs declaration form based on optical character recognition (OCR) to extract the public fields and commodity numbers.
  • OCR optical character recognition
  • the public fields include domestic consignor, overseas consignee, exit customs, export date, declaration date, ...
  • the element fields are extracted from the customs declaration form according to the commodity number, where the element fields include brand type, export preference, use, material, ....
  • the processing method of customs declaration information combined with RPA and AI in the embodiment of the present disclosure based on optical character recognition (OCR), performs content recognition on the customs declaration form, so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document.
  • OCR optical character recognition
  • the customs declaration data of the public entry is entered into the corresponding first standard entry in the customs declaration interface, and the commodity identification is queried from multiple customs declaration entries.
  • the commodity identification from Determine the commodity entry associated with the commodity identifier among the multiple customs declaration entries, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface.
  • the method can save manual operation steps and labor costs, greatly improve efficiency, greatly reduce the work pressure of personnel, and control the input error rate.
  • the present disclosure also proposes a processing device combining RPA and AI customs declaration information.
  • Fig. 5 is a schematic block diagram of a processing device combining RPA and AI customs declaration information provided by an embodiment of the present disclosure.
  • the processing device combining RPA and AI customs declaration information in the embodiment of the present disclosure includes: a first identification module 51 , a first entry module 52 , a query module 53 and a second entry module 54 .
  • the first recognition module 51 is configured to perform content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document.
  • the first input module 52 is used to input the public items among the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process.
  • the inquiry module 53 is used for inquiring commodity identifiers from multiple customs declaration entries.
  • the second input module 54 is used to determine the commodity item associated with the commodity identifier from multiple customs declaration entries according to the commodity identifier, and enter the customs declaration data of the commodity item into the second standard entry corresponding to the customs declaration interface.
  • the second input module 54 includes: a query unit, a first input unit and a second input unit.
  • the query unit is used to query the relationship between the commodity identifier and the commodity entry according to the commodity identifier;
  • the first entry unit is used to query the commodity entry associated with the commodity identifier according to the preset second RPA
  • the operation process is to enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface;
  • the second entry unit is used to, according to the second standard entry in the customs declaration interface, if no commodity entry associated with the commodity identifier is found, Query customs declaration entries with similar semantics from multiple customs declaration entries as commodity entries, and enter the customs declaration data of the commodity entries into the second standard entry corresponding to the customs declaration interface.
  • the second entry unit includes: a first determination subunit and a second determination subunit.
  • the first determining subunit is configured to determine the semantic similarity between the second standard item in the customs declaration interface and each customs declaration item based on natural language processing (NLP).
  • the second determination subunit is configured to determine commodity items semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.
  • the above-mentioned processing device combining RPA and AI customs declaration information further includes: a display module, configured to enter the commodity identifier into the corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface.
  • the above-mentioned processing device combining RPA and AI customs declaration information further includes: a determination module configured to determine each customs declaration item to be identified according to the document type of the target customs declaration document.
  • the first identification module 51 includes: an identification unit for performing content identification on the customs declaration form based on OCR, so as to obtain multiple items contained in the customs declaration form; The customs declaration data of the customs declaration item.
  • the above-mentioned processing device combining RPA and AI customs declaration information further includes: a second identification module, used for unrecorded first standard items and/or second standard items in the customs declaration interface In this case, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form.
  • the first recognition module performs content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, and through the second An input module inputs the public items among the multiple customs declaration items into the corresponding first standard item in the customs declaration interface according to the pre-set first robotic process automation RPA operation process, and uses the query module to select from multiple items
  • OCR optical character recognition
  • the commodity identification is queried in the customs declaration entry, and the commodity entry associated with the commodity identification is determined from the multiple customs declaration entries through the second input module according to the commodity logo, and the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface. Therefore, the device can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.
  • the present disclosure also proposes an electronic device.
  • An electronic device includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein, the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to Enabling at least one processor to execute the above-mentioned processing method combining RPA and AI customs declaration information.
  • the present disclosure also proposes a non-transitory computer-readable storage medium.
  • the non-transitory computer-readable storage medium of the embodiment of the present disclosure stores a computer program thereon, and when the program is executed by a processor, the above-mentioned processing method combining RPA and AI customs declaration information is realized.
  • the present disclosure also proposes a computer program product.
  • the computer program product of the embodiment of the present disclosure when the instructions in the computer program product are executed by the processor, executes the above-mentioned processing method combining RPA and AI customs declaration information.
  • FIG. 6 shows a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure.
  • the electronic device 12 shown in FIG. 6 is only an example, and should not limit the functions and scope of use of the embodiments of the present disclosure.
  • electronic device 12 takes the form of a general-purpose computing device.
  • Components of electronic device 12 may include, but are not limited to, one or more processors or processing units 16, system memory 28, bus 18 connecting various system components including system memory 28 and processing unit 16.
  • Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus structures.
  • these architectures include but are not limited to Industry Standard Architecture (Industry Standard Architecture; hereinafter referred to as: ISA) bus, Micro Channel Architecture (Micro Channel Architecture; hereinafter referred to as: MAC) bus, enhanced ISA bus, video electronics Standards Association (Video Electronics Standards Association; hereinafter referred to as: VESA) local bus and Peripheral Component Interconnection (hereinafter referred to as: PCI) bus.
  • Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12 and include both volatile and nonvolatile media, removable and non-removable media.
  • the memory 28 may include a computer system readable medium in the form of a volatile memory, such as a random access memory (Random Access Memory; hereinafter referred to as: RAM) 30 and/or a cache memory 32 .
  • RAM Random Access Memory
  • the electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media.
  • storage system 34 may be used to read from and write to non-removable, non-volatile magnetic media (not shown in Figure 7, commonly referred to as "hard drives").
  • a disk drive for reading and writing to a removable nonvolatile disk may be provided, as well as a disk drive for removable nonvolatile disks (such as a CD-ROM (Compact Disc Read Only Memory; hereinafter referred to as: CD-ROM), Digital Video Disc Read Only Memory (hereinafter referred to as: DVD-ROM) or other optical media).
  • each drive may be connected to bus 18 via one or more data media interfaces.
  • Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of various embodiments of the present disclosure.
  • a program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in memory 28, such program modules 42 including but not limited to an operating system, one or more application programs, other program modules, and program data , each or some combination of these examples may include implementations of network environments.
  • the program modules 42 generally perform the functions and/or methods of the embodiments described in this disclosure.
  • Electronic device 12 may also communicate with one or more external devices 14 (e.g., a keyboard, pointing device, display 24, etc.), and with one or more devices that enable a user to interact with the computer system/server 12, and/or Or communicate with any device (eg, network card, modem, etc.) that enables the computer system/server 12 to communicate with one or more other computing devices. Such communication may occur through input/output (I/O) interface 22 .
  • external devices 14 e.g., a keyboard, pointing device, display 24, etc.
  • any device eg, network card, modem, etc.
  • I/O input/output
  • the electronic device 12 can also communicate with one or more networks (such as a local area network (Local Area Network; hereinafter referred to as: LAN), a wide area network (Wide Area Network; hereinafter referred to as: WAN) and/or a public network, such as the Internet, through the network adapter 20. ) communication.
  • network adapter 20 communicates with other modules of electronic device 12 via bus 18 .
  • other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID (Redundant Arrays of Independent Disks, disk array) systems, tape drives, and data backup storage systems.
  • the processing unit 16 executes various functional applications and data processing by running the programs stored in the system memory 28 , such as implementing the methods mentioned in the foregoing embodiments.
  • first and second are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, features defined as “first” and “second” may explicitly or implicitly include at least one of these features. In the description of the present disclosure, “plurality” means at least two, such as two, three, etc., unless otherwise specifically defined.
  • a "computer-readable medium” may be any device that can contain, store, communicate, propagate or transmit a program for use in or in conjunction with an instruction execution system, device or device.
  • computer-readable media include the following: electrical connection with one or more wires (electronic device), portable computer disk case (magnetic device), random access memory (RAM), Read Only Memory (ROM), Erasable and Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM).
  • the computer-readable medium may even be paper or other suitable medium on which the program can be printed, since the program can be read, for example, by optically scanning the paper or other medium, followed by editing, interpretation or other suitable processing if necessary.
  • the program is processed electronically and stored in computer memory.
  • various parts of the present disclosure may be implemented in hardware, software, firmware or a combination thereof.
  • various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system.
  • a suitable instruction execution system For example, if implemented in hardware as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: a discrete Logic circuits, ASICs with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
  • each functional unit in each embodiment of the present disclosure may be integrated into one processing module, each unit may exist separately physically, or two or more units may be integrated into one module.
  • the above-mentioned integrated modules can be implemented in the form of hardware or in the form of software function modules. If the integrated modules are realized in the form of software function modules and sold or used as independent products, they can also be stored in a computer-readable storage medium.
  • the storage medium mentioned above may be a read-only memory, a magnetic disk or an optical disk, and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Character Discrimination (AREA)

Abstract

The present disclosure provides a robotic process automation (RPA) and AI combined customs declaration information processing method and processing device. The method comprises: performing content recognition on a target customs declaration document on the basis of optical character recognition (OCR) to obtain customs declaration data of a plurality of customs declaration items in the target customs declaration document; for a public item in the plurality of customs declaration items, according to a preset first RPA operation process, inputting customs declaration data of the public item into a corresponding first standard item in a customs declaration interface; querying a commodity identifier from the plurality of customs declaration items; and according to the commodity identifier, determining, from the plurality of customs declaration items, a commodity item associated with the commodity identifier, and inputting customs declaration data of the commodity item into a corresponding second standard item in the customs declaration interface. In this way, the method can improve timeliness, save labor costs, reduce the risk of entry errors, and improve efficiency.

Description

结合RPA和AI报关信息的处理方法和处理装置Processing method and processing device combining RPA and AI customs declaration information

相关申请的交叉引用Cross References to Related Applications

本公开基于申请号为202110667458.2,申请日为2021年06月16日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本公开作为参考。This disclosure is based on a Chinese patent application with application number 202110667458.2 and a filing date of June 16, 2021, and claims the priority of this Chinese patent application. The entire content of this Chinese patent application is hereby incorporated by reference into this disclosure.

技术领域technical field

本公开涉及RPA、AI技术领域,尤其涉及一种结合RPA和AI报关信息的处理方法和处理装置。The present disclosure relates to the technical fields of RPA and AI, and in particular to a processing method and processing device combining RPA and AI for customs declaration information.

背景技术Background technique

机器人流程自动化(Robotic Process Automation,RPA)是通过特定的“机器人软件”,模拟人在计算机上的操作,按规则自动执行流程任务。Robotic Process Automation (RPA) uses specific "robot software" to simulate human operations on computers and automatically execute process tasks according to rules.

人工智能(Artificial Intelligence,AI)是研究、开发用于模拟、延伸和扩展人的智能的理论、方法、技术及应用系统的一门技术科学。Artificial Intelligence (AI) is a technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence.

相关技术中,外运报关业务人员每天需要对成百上千的货运清单进行报关,要求将各托运公司的格式不一的报关单输入至统一的标准海关报关单中,业务人员需花费相当大的人力物力投入在这项工作中,导致及时性较差、效率低、容易出错。In related technologies, the customs declaration personnel of Sinotrans need to declare hundreds of thousands of freight manifests every day, and it is required to input the customs declaration forms with different formats from each consignment company into the unified standard customs declaration form, and the business personnel need to spend a lot of money More human and material resources are invested in this work, resulting in poor timeliness, low efficiency, and error-prone.

发明内容Contents of the invention

本公开旨在至少从一定程度上解决相关技术中的技术问题之一。The present disclosure aims to solve one of the technical problems in the related art at least to a certain extent.

为此,本公开的第一个目的在于提出一种结合RPA和AI报关信息的处理方法,该方法能够提高及时性,节省人工成本,降低录入错误的风险,提高效率。Therefore, the first purpose of this disclosure is to propose a processing method combining RPA and AI customs declaration information, which can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.

本公开的第二个目的在于提出一种结合RPA和AI报关信息的处理装置。The second purpose of the present disclosure is to propose a processing device combining RPA and AI customs declaration information.

本公开的第三个目的在于提出一种电子设备。The third object of the present disclosure is to provide an electronic device.

本公开的第四个目的在于提出一种非临时性计算机可读存储介质。A fourth object of the present disclosure is to propose a non-transitory computer-readable storage medium.

本公开的第五个目的在于提出一种计算机程序产品。A fifth object of the present disclosure is to provide a computer program product.

为达上述目的,本公开第一方面实施例提出了一种结合RPA和AI报关信息的处理方法,包括以下步骤:基于光学字符识别OCR(Optical Character Recognition,光学字符识别)对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据;将所述多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程, 将所述公共条目的报关数据录入报关界面中对应的第一标准条目中;从所述多条报关条目中查询商品标识;根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。In order to achieve the above-mentioned purpose, the embodiment of the first aspect of the present disclosure proposes a processing method combining RPA and AI customs declaration information, including the following steps: based on optical character recognition OCR (Optical Character Recognition, optical character recognition) to carry out the content of the target customs declaration document Identification, to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document; the public items in the multiple customs declaration items, according to the preset first robot process automation RPA operation process, the customs declaration of the public items Entering the data into the corresponding first standard entry in the customs declaration interface; querying the commodity identification from the multiple customs declaration entries; according to the commodity identification, determining the commodity entry associated with the commodity identification from the multiple customs declaration entries, And enter the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.

根据本公开实施例的结合RPA和AI报关信息的处理方法,基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据;将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中;从多条报关条目中查询商品标识;根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。由此,该方法能够提高及时性,节省人工成本,降低录入错误的风险,提高效率。According to the processing method combining RPA and AI customs declaration information according to the embodiment of the present disclosure, content recognition is performed on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document; For public entries, according to the pre-set first robotic process automation RPA operation process, the customs declaration data of public entries is entered into the corresponding first standard entry in the customs declaration interface; the commodity identification is queried from multiple customs declaration entries; according to the commodity identification, from multiple Determine the commodity entry associated with the commodity identifier in the first customs declaration entry, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface. Therefore, the method can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.

另外,本公开第一方面提出的结合RPA和AI报关信息的处理方法还可以具有如下附加的技术特征:In addition, the processing method combining RPA and AI customs declaration information proposed in the first aspect of the present disclosure may also have the following additional technical features:

根据本公开的一个实施例,所述根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中,包括:根据所述商品标识,查询商品标识与商品条目之间的关联关系;若查询到所述商品标识关联的商品条目,则根据预先设置的第二RPA操作流程,将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中;若未查询到所述商品标识关联的商品条目,则根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。According to an embodiment of the present disclosure, according to the commodity identifier, the commodity item associated with the commodity identifier is determined from the plurality of customs declaration entries, and the customs declaration data of the commodity entry is entered into the corresponding customs declaration interface. The second standard entry includes: querying the relationship between the commodity identifier and the commodity entry according to the commodity identifier; if the commodity entry associated with the commodity identifier is found, according to the preset second RPA operation process, Enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface; if no commodity entry associated with the commodity identifier is found, according to the second standard entry in the customs declaration interface, from the Searching for a customs declaration item with similar semantics among the plurality of customs declaration items is used as the commodity item, and entering the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.

根据本公开的一个实施例,所述根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,包括:对所述报关界面中的所述第二标准条目,基于自然语言处理NLP确定与各所述报关条目之间的语义相似度;根据所述语义相似度最高的报关条目,确定与所述第二标准条目语义相似的商品条目。According to an embodiment of the present disclosure, according to the second standard item in the customs declaration interface, searching for customs declaration items with similar semantics from the plurality of customs declaration items as the commodity item includes: searching the customs declaration interface In the second standard entry, determine the semantic similarity with each of the customs declaration entries based on natural language processing NLP; according to the customs declaration entry with the highest semantic similarity, determine the semantic similarity with the second standard entry Product entry.

根据本公开的一个实施例,所述从所述多条报关条目中查询商品标识之后,还包括:将所述商品标识录入所述报关界面中对应的标准条目,以在所述报关界面展示所述第二标准条目。According to an embodiment of the present disclosure, after querying the commodity identifiers from the multiple customs declaration entries, it further includes: inputting the commodity identifiers into the corresponding standard entries in the customs declaration interface, so as to display all items in the customs declaration interface. Describe the second standard item.

根据本公开的一个实施例,所述基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据之前,还包括:根据所述目标报关文档的文档类型,确定待识别的各所述报关条目。According to an embodiment of the present disclosure, before performing content recognition on the target customs declaration document based on optical character recognition (OCR) to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, it further includes: according to the target customs declaration document The document type identifies each of the said customs declaration items to be identified.

根据本公开的一个实施例,所述目标报关文档为多个,包括报关单;所述基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报 关数据,包括:基于OCR对所述报关单进行内容识别,以得到所述报关单中包含的多条报关条目的报关数据。According to an embodiment of the present disclosure, there are multiple target customs declaration documents, including a customs declaration form; the OCR-based optical character recognition is used to perform content recognition on the target customs declaration document, so as to obtain customs declarations of multiple customs declaration items in the target customs declaration document The data includes: performing content identification on the customs declaration form based on OCR, so as to obtain customs declaration data of multiple customs declaration items contained in the customs declaration form.

根据本公开的一个实施例,所述根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中之后,还包括:若所述报关界面中存在未录入的所述第一标准条目和/或所述第二标准条目,则基于OCR对除所述报关单之外的其余目标报关文档进行内容识别。According to an embodiment of the present disclosure, according to the commodity identifier, the commodity item associated with the commodity identifier is determined from the plurality of customs declaration entries, and the customs declaration data of the commodity entry is entered into the corresponding customs declaration interface. After the second standard item, it also includes: if there is the first standard item and/or the second standard item not entered in the customs declaration interface, based on OCR, the rest of the customs declaration form Target customs declaration documents for content identification.

为达上述目的,本公开第二方面实施例提出了一种结合RPA和AI报关信息的处理装置,包括:第一识别模块,用于基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据;第一录入模块,用于将所述多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将所述公共条目的报关数据录入报关界面中对应的第一标准条目中;查询模块,用于从所述多条报关条目中查询商品标识;第二录入模块,用于根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。In order to achieve the above purpose, the embodiment of the second aspect of the present disclosure proposes a processing device combining RPA and AI customs declaration information, including: a first recognition module, used to perform content recognition on the target customs declaration document based on optical character recognition (OCR), to obtain The customs declaration data of multiple customs declaration entries in the target customs declaration document; the first entry module is used to use the public entries in the multiple customs declaration entries according to the preset first robotic process automation RPA operation process, and transfer the public entries The customs declaration data of the entry is entered into the corresponding first standard entry in the customs declaration interface; the query module is used to query the commodity identification from the multiple customs declaration entries; the second input module is used to select from the multiple Determine the commodity entry associated with the commodity identifier in the first customs declaration entry, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface.

根据本公开实施例的结合RPA和AI报关信息的处理装置,通过第一识别模块基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据,通过第一录入模块将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中,通过查询模块从多条报关条目中查询商品标识,通过第二录入模块根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。由此,该装置能够提高及时性,节省人工成本,降低录入错误的风险,提高效率。According to the processing device combining RPA and AI customs declaration information according to an embodiment of the present disclosure, the first recognition module performs content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, and through the second An input module inputs the public items among the multiple customs declaration items into the corresponding first standard item in the customs declaration interface according to the pre-set first robotic process automation RPA operation process, and uses the query module to select from multiple items The commodity identification is queried in the customs declaration entry, and the commodity entry associated with the commodity identification is determined from the multiple customs declaration entries through the second input module according to the commodity logo, and the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface. Therefore, the device can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.

另外,本公开第二方面提出的结合RPA和AI报关信息的处理装置还可以具有如下附加的技术特征:In addition, the processing device combining RPA and AI customs declaration information proposed in the second aspect of the present disclosure may also have the following additional technical features:

根据本公开的一个实施例,所述第二录入模块,包括:查询单元,用于根据所述商品标识,查询商品标识与商品条目之间的关联关系;第一录入单元,用于在查询到所述商品标识关联的商品条目的情况下,根据预先设置的第二RPA操作流程,将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中;第二录入单元,用于在未查询到所述商品标识关联的商品条目的情况下,根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。According to an embodiment of the present disclosure, the second entry module includes: a query unit, configured to query the relationship between the commodity identifier and the commodity entry according to the commodity identifier; In the case of a commodity item associated with the commodity identifier, according to the preset second RPA operation process, the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface; the second input unit is used to If no commodity item associated with the commodity identifier is found, according to the second standard entry in the customs declaration interface, query a customs declaration entry with similar semantics from the plurality of customs declaration entries as the commodity entry, and Entering the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface.

根据本公开的一个实施例,所述第二录入单元,包括:第一确定子单元,用于对所述 报关界面中的所述第二标准条目,基于自然语言处理NLP确定与各所述报关条目之间的语义相似度;第二确定子单元,用于根据所述语义相似度最高的报关条目,确定与所述第二标准条目语义相似的商品条目。According to an embodiment of the present disclosure, the second entry unit includes: a first determining subunit, configured to determine, based on natural language processing (NLP), the items related to each of the customs declarations for the second standard item in the customs declaration interface. Semantic similarity between items; a second determining subunit, configured to determine commodity items semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.

根据本公开的一个实施例,上述的处理装置,还包括:展示模块,用于将所述商品标识录入所述报关界面中对应的标准条目,以在所述报关界面展示所述第二标准条目。According to an embodiment of the present disclosure, the above-mentioned processing device further includes: a display module, configured to enter the commodity identifier into a corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface .

根据本公开的一个实施例,上述的处理装置,还包括:确定模块,用于根据所述目标报关文档的文档类型,确定待识别的各所述报关条目。According to an embodiment of the present disclosure, the above processing device further includes: a determining module configured to determine each of the customs declaration items to be identified according to the document type of the target customs declaration document.

根据本公开的一个实施例,所述目标报关文档为多个,包括报关单;所述第一识别模块,包括:识别单元,用于基于OCR对所述报关单进行内容识别,以得到所述报关单中包含的多条报关条目的报关数据。According to an embodiment of the present disclosure, there are multiple target customs declaration documents, including a customs declaration form; the first identification module includes: an identification unit, configured to perform content identification on the customs declaration form based on OCR, so as to obtain the The customs declaration data of multiple customs declaration items included in the customs declaration form.

根据本公开的一个实施例,上述的处理装置,还包括:第二识别模块,用于在所述报关界面中存在未录入的所述第一标准条目和/或所述第二标准条目的情况下,基于OCR对除所述报关单之外的其余目标报关文档进行内容识别。According to an embodiment of the present disclosure, the above-mentioned processing device further includes: a second identification module, configured to have the first standard item and/or the second standard item not entered in the customs declaration interface Next, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form.

为达上述目的,本公开第三方面实施例提出了一种电子设备,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行上述的结合RPA和AI报关信息的处理方法。To achieve the above purpose, the embodiment of the third aspect of the present disclosure provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the Instructions executed by at least one processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned processing method combining RPA and AI customs declaration information.

为达上述目的,本公开第四方面实施例提出了一种非临时性计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述的结合RPA和AI报关信息的处理方法。In order to achieve the above purpose, the embodiment of the fourth aspect of the present disclosure proposes a non-transitory computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, the above-mentioned processing of combining RPA and AI customs declaration information is realized method.

为达上述目的,本公开第五方面实施例提出了一种计算机程序产品,当所述计算机程序产品中的指令由处理器执行时,执行上述的结合RPA和AI报关信息的处理方法。To achieve the above purpose, the embodiment of the fifth aspect of the present disclosure proposes a computer program product. When the instructions in the computer program product are executed by the processor, the above-mentioned processing method combining RPA and AI customs declaration information is executed.

本公开附加的方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本公开的实践了解到。Additional aspects and advantages of the disclosure will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the disclosure.

附图说明Description of drawings

本公开上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present disclosure will become apparent and understandable from the following description of the embodiments in conjunction with the accompanying drawings, wherein:

图1为本公开实施例所提供的一种结合RPA和AI报关信息的处理方法的流程示意图;FIG. 1 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure;

图2为本公开一个实施例所提供的一种结合RPA和AI报关信息的处理方法的流程示意图;FIG. 2 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure;

图3为本公开一个具体实施例所提供的一种结合RPA和AI报关信息的处理方法的流 程示意图;Fig. 3 is a schematic flow diagram of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure;

图4为本公开一个具体实施例所提供的一种结合RPA和AI报关信息的处理方法的整体业务流程图;FIG. 4 is an overall business flow chart of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure;

图5为本公开实施例所提供的一种结合RPA和AI报关信息的处理装置的方框示意图;FIG. 5 is a schematic block diagram of a processing device combining RPA and AI customs declaration information provided by an embodiment of the present disclosure;

图6示出了适于用来实现本公开实施方式的示例性电子设备的框图。FIG. 6 shows a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure.

具体实施方式detailed description

下面详细描述本公开的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本公开,而不能理解为对本公开的限制。Embodiments of the present disclosure are described in detail below, examples of which are illustrated in the drawings, in which the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present disclosure and should not be construed as limiting the present disclosure.

下面参考附图描述本公开实施例的结合RPA和AI报关信息的处理方法、处理装置、电子设备、非临时性计算机可读存储介质和计算机程序产品。The following describes the processing method, processing device, electronic equipment, non-transitory computer-readable storage medium and computer program product combining RPA and AI customs declaration information according to the embodiments of the present disclosure with reference to the accompanying drawings.

相关技术,中外运公司的报关业务人员每天需要对成百上千的货运清单进行报关,要求将各托运公司的格式不一的报关数据输入至统一的标准海关报关单中,业务人员需花费相当大的人力物力投入在这项工作中,具体处理的流程方式如下:人工进行核对报关清单数据,在核对无误之后,回复供应商报关数据;人工根据报关单据的数据,登录到报关系统中,对应字段数据,需要一个一个的录入到报关系统中;报关系统根据录入的数据,进行核验,在核对通过之后,完成提交。In related technologies, the customs declaration personnel of Sinotrans need to declare hundreds of thousands of freight manifests every day, requiring the customs declaration data of various shipping companies to be input into a unified standard customs declaration form, and the business personnel need to spend a lot of money Human and material resources are invested in this work, and the specific process is as follows: manually check the customs declaration list data, and reply to the supplier's customs declaration data after the verification is correct; manually log in to the customs declaration system according to the data of the customs declaration documents, and the corresponding fields The data needs to be entered into the customs declaration system one by one; the customs declaration system will perform verification according to the entered data, and complete the submission after the verification is passed.

但是存在以下缺点:But there are following disadvantages:

①时性差:报关人员每天要催供应商把报关数据提交到中外运公司,不然容易影响报关时效性,一旦报关没有通过,影响货物的正常流程,大大折减了货物的运输效率,也对公司的运输成本巨大的占用和浪费;① Poor timeliness: Customs declaration personnel have to urge suppliers to submit customs declaration data to Sinotrans every day, otherwise it will easily affect the timeliness of customs declaration. Once the customs declaration is not passed, it will affect the normal process of goods, greatly reduce the transportation efficiency of goods, and also affect the company's Huge occupation and waste of transportation costs;

②效率低:报关人员需要每天收取报关单,然后根据报关单录入报关系统,一方面是报关单据样式很多,每个商品编码的字段都在不同的位置,需要寻找起来很麻烦,字体也很小,商品编码很长,每天需要录入3千份报关单,工作任务繁重,工作效率和工作满意度不高。②Low efficiency: Customs declaration personnel need to collect customs declaration forms every day, and then input them into the customs declaration system according to the customs declaration forms. On the one hand, there are many styles of customs declaration documents, and the fields of each commodity code are in different positions. It is very troublesome to find, and the font is also small , The commodity code is very long, and 3,000 customs declaration forms need to be entered every day. The work tasks are heavy, and the work efficiency and job satisfaction are not high.

③容易出错:工作人员要在很多的报关单中找到对应的商品编码,然后再寻找对应编码字段记录,再录入到报关系统中,容易出现,一旦录入错误了,要重新录入,很浪费时间,导致效率很差,实际上报关人员每天只能录入100份一个人一天。③ Error-prone: The staff has to find the corresponding commodity code in many customs declaration forms, and then search for the corresponding code field record, and then enter it into the customs declaration system. As a result, the efficiency is very poor. In fact, the customs declaration personnel can only enter 100 copies per person per day.

为了解决上述问题,本公开提出了一种结合RPA和AI报关信息的处理方法,该方法能够提高及时性,节省人工成本,降低录入错误的风险,提高效率。In order to solve the above problems, this disclosure proposes a processing method combining RPA and AI customs declaration information, which can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.

图1为本公开实施例所提供的一种结合RPA和AI报关信息的处理方法的流程示意图。FIG. 1 is a schematic flowchart of a processing method combining RPA and AI customs declaration information provided by an embodiment of the present disclosure.

如图1所述,本公开实施例的结合RPA和AI报关信息的处理方法,包括以下步骤:As shown in Figure 1, the processing method of combining RPA and AI customs declaration information in the embodiment of the present disclosure includes the following steps:

S11,基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据。S11, performing content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document.

在该实施例中,目标报关文档可以为多个,如包括报关单、装箱单、运单、各类许可证、出口收汇核销单、授权委托书、合同、产地证等中的至少一个。目标报关文档需要跟样例质量一样如可以是清晰的扫描件。In this embodiment, there may be multiple target customs declaration documents, such as including at least one of customs declaration form, packing list, waybill, various licenses, export collection verification and write-off form, power of attorney, contract, certificate of origin, etc. . The target customs declaration document needs to be of the same quality as the sample, such as a clear scan.

光学字符识别OCR能够做到的是把目标报关文档的框线识别出来,把每个单元格的内容抽取出来,并返回单元格里每个条目的位置。其中,抽取需要做的是把无框线表格结构化,如将行列进行对其,提取结构化的信息。需要说明的是,在抽取的过程中,如果目标报关文档中一个物体的计量单位不一样,则需要进行归一化处理,即进行计量单位的统一。基于抽取的结果,需要做到单单(如报关单、装箱单、运单、出口收汇核销单)一致、单证(报关单、装箱单、运单、出口收汇核销单和各类许可证、产地证)一致、单同(报关单、装箱单、运单、出口收汇核销单和合同)一致。What OCR can do is to recognize the frame line of the target customs declaration document, extract the content of each cell, and return the position of each item in the cell. Among them, what needs to be done for extraction is to structure the frameless table, such as aligning rows and columns to extract structured information. It should be noted that during the extraction process, if the unit of measurement of an object in the target customs declaration document is different, normalization processing is required, that is, the unit of measurement is unified. Based on the extracted results, it is necessary to ensure that the documents (such as customs declaration, packing list, waybill, export foreign exchange verification and write-off form) are consistent, and the documents (customs declaration, packing list, waybill, export foreign exchange collection verification and write-off form and various license, certificate of origin), and the documents (customs declaration, packing list, waybill, export collection verification and contract).

多条报关条目包括公共条目和商品条目,其中,公共条目为通过OCR抽取出的公共字段,包括如境内发货人、境外收货人、出境关别、出口日期、申报日期等中的至少一个;商品条目为通过OCR抽取出的商品标识和要素字段,其中,要素字段包括如品牌类型、出口惠享情况、用途、材质、品牌、规格或型号中的至少一种。Multiple customs declaration entries include public entries and commodity entries, where the public entry is a public field extracted through OCR, including at least one of the domestic consignor, overseas consignee, exit customs, export date, declaration date, etc. ; Commodity entries are commodity identifiers and element fields extracted through OCR, wherein the element fields include at least one of brand type, export preference, use, material, brand, specification or model.

S12,将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中。S12. Enter the common items among the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process.

其中,预先设置的第一机器人流程自动化RPA操作流程为第一机器人填写公共条目的报关数据的顺序,例如,可依次将公共条目如境内发货人、境外收货人、出境关别、出口日期、申报日期的报关数据录入报关界面中对应的第一标准条目(公共栏)中,第一标准条目可如下表1所示。Among them, the pre-set first robot process automation RPA operation process is the order in which the first robot fills in the customs declaration data of public items, for example, public items such as domestic consignor, overseas consignee, exit customs, export date 1. The customs declaration data of the declaration date is entered into the corresponding first standard entry (public column) in the customs declaration interface. The first standard entry can be shown in Table 1 below.

表1Table 1

Figure PCTCN2021114323-appb-000001
Figure PCTCN2021114323-appb-000001

举例说明,在获取到境内发货人、境外收货人、出境关别、出口日期、申报日期之后, 根据预先设置的第一机器人流程自动化RPA操作流程,将境内发货人、境外收货人、出境关别、出口日期、申报日期的报关数据依次录入报关界面中对应的第一标准条目中。For example, after obtaining the domestic consignor, overseas consignee, exit customs, export date, and declaration date, according to the pre-set first robotic process automation RPA operation process, the domestic consignor and overseas consignee The customs declaration data of , exit customs, export date, and declaration date are entered in the corresponding first standard entry in the customs declaration interface in sequence.

S13,从多条报关条目中查询商品标识。S13. Query commodity identifiers from multiple customs declaration entries.

商品标识是指用于识别商品或其特征、性能的各种表述和指示的统称,例如,可以为商品编号。Commodity identification refers to the collective term of various expressions and instructions used to identify a commodity or its characteristics and performance, for example, it may be a commodity number.

S14,根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。S14. According to the commodity identifier, determine the commodity entry associated with the commodity identifier from the plurality of customs declaration entries, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface.

根据商品标识,从多条报关条目中选出与商品标识关联的商品条目,如品牌类型、出口惠享情况、用途、材质、品牌、规格或型号中的至少一种,并将具体的报关数据录入报关界面对应的第二标准条目中,第二标准条目可如下表2所示。According to the product identification, select the product item associated with the product identification from multiple customs declaration items, such as at least one of brand type, export preference, use, material, brand, specification or model, and submit the specific customs declaration data Enter the second standard entry corresponding to the customs declaration interface, the second standard entry can be shown in Table 2 below.

表2Table 2

Figure PCTCN2021114323-appb-000002
Figure PCTCN2021114323-appb-000002

在填完第一标准条目和第二标准条目之后,单击生成表体信息,并填写货物信息栏,如下表3所示。After filling in the first standard entry and the second standard entry, click Generate Table Body Information and fill in the cargo information column, as shown in Table 3 below.

表3table 3

Figure PCTCN2021114323-appb-000003
Figure PCTCN2021114323-appb-000003

Figure PCTCN2021114323-appb-000004
Figure PCTCN2021114323-appb-000004

在填完表1、表2和表3之后,便可导出标准格式的报关单。After completing Form 1, Form 2 and Form 3, the customs declaration form in standard format can be exported.

由此,本公开实施例的结合RPA和AI报关信息的处理方法,基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据,将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中,从多条报关条目中查询商品标识,根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。由此,该方法能够节约人工操作的步骤,节省人工成本,很大程度上提高了效率,大大减轻人员的工作压力;因为之前需要眼睛看,在录入数据过程中,容易出现人为的错误,而且数据审核方面系统和纸质的单据数据往往不一致,加大了报关的风险,本公开进行流程优化,这在一定程度上降低了风险,使得录入错误率得以控制。Therefore, the processing method of customs declaration information combined with RPA and AI in the embodiment of the present disclosure, based on optical character recognition (OCR), performs content recognition on the target customs declaration document to obtain customs declaration data of multiple customs declaration items in the target customs declaration document, and multiple customs declaration items According to the pre-set first robotic process automation RPA operation process, the public entry in the public entry will enter the customs declaration data of the public entry into the corresponding first standard entry in the customs declaration interface, and query the commodity identification from multiple customs declaration entries. According to the commodity identification, A commodity entry associated with the commodity identifier is determined from the multiple customs declaration entries, and the customs declaration data of the commodity entry is entered into a second standard entry corresponding to the customs declaration interface. Therefore, this method can save manual operation steps, save labor costs, greatly improve efficiency, and greatly reduce the work pressure of personnel; because before the need to see with eyes, human errors are prone to occur during the data entry process, and In terms of data review, the system and paper-based document data are often inconsistent, which increases the risk of customs declaration. This disclosure optimizes the process, which reduces the risk to a certain extent and makes the entry error rate controllable.

作为一种可实现的方式,如图2所示,上述步骤S14可包括以下步骤:As an achievable manner, as shown in FIG. 2, the above step S14 may include the following steps:

S21,根据商品标识,查询商品标识与商品条目之间的关联关系。S21. Query the association relationship between the commodity identifier and the commodity entry according to the commodity identifier.

S22,若查询到商品标识关联的商品条目,则根据预先设置的第二RPA操作流程,将商品条目的报关数据录入报关界面对应的第二标准条目中。S22. If the commodity item associated with the commodity identifier is found, according to the preset second RPA operation process, enter the customs declaration data of the commodity item into the second standard entry corresponding to the customs declaration interface.

S23,若未查询到商品标识关联的商品条目,则根据报关界面中第二标准条目,从多条报关条目中查询语义相似的报关条目作为商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。S23. If the commodity item associated with the commodity identifier is not found, then according to the second standard entry in the customs declaration interface, search for customs declaration entries with similar semantics from multiple customs declaration entries as commodity entries, and enter the customs declaration data of the commodity entries into the corresponding customs declaration interface. in the second standard item.

作为一种可实现的方式,根据报关界面中第二标准条目,从多条报关条目中查询语义相似的报关条目作为商品条目,包括:对报关界面中的第二标准条目,基于自然语言处理NLP确定与各报关条目之间的语义相似度;根据语义相似度最高的报关条目,确定与第二标准条目语义相似的商品条目。As an achievable way, according to the second standard entry in the customs declaration interface, search for customs declaration entries with similar semantics as commodity entries from multiple customs declaration entries, including: for the second standard entry in the customs declaration interface, based on natural language processing NLP Determine the semantic similarity with each customs declaration item; determine the commodity item semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.

需要说明的是,自然语言处理NLP在文本分类中,需要先对文本分词,原始的文本中可能由几十万个中文词条组成,纬度非常高。为了提高文本分类的准确性和效率,一般先剔除决策意义不大的词语,这就是特征词提取的目的。It should be noted that in the text classification of natural language processing NLP, the text needs to be segmented first. The original text may consist of hundreds of thousands of Chinese entries, and the latitude is very high. In order to improve the accuracy and efficiency of text classification, the words with little significance for decision-making are generally eliminated first, which is the purpose of feature word extraction.

具体地,在得到商品标识之后,根据商品标识从多条报关条目中查询商品标识与商品条目之间的关联关系。其中,在从多条报关条目中能够查询到与商品标识关联的商品条目时,根据预先设置的第二RPA操作流程,将具体的报关数据录入界面对应的第二标准条目中;在从多条报关条目中不能够查询到与商品标识关联的商品条目时,对报关界面中的第二标准条目,基于自然语言处理NLP确定与各报关条目之间的语义相似度,选取语义相似度最高的报关条目,作为与第二标准条目语义相似的商品条目。Specifically, after obtaining the commodity identifier, the relationship between the commodity identifier and the commodity entry is queried from multiple customs declaration entries according to the commodity identifier. Among them, when the commodity entry associated with the commodity identifier can be queried from multiple customs declaration entries, according to the preset second RPA operation process, the specific customs declaration data is entered into the second standard entry corresponding to the interface; When the commodity entry associated with the commodity identifier cannot be found in the customs declaration entry, for the second standard entry in the customs declaration interface, the semantic similarity between the customs declaration entry and each customs declaration entry is determined based on natural language processing NLP, and the customs declaration with the highest semantic similarity is selected entry, as a commodity entry semantically similar to the second standard entry.

为了使得业务人员清楚地了解第二标准条目,在从多条报关条目中查询商品标识之后,还包括:将商品标识录入报关界面中对应的标准条目,以在报关界面展示第二标准条目。In order for the business personnel to clearly understand the second standard item, after querying the commodity identification from the multiple customs declaration items, it also includes: entering the commodity identification into the corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface.

为了更快地根据目标报关文档识别出报关条目,进一步提高效率,基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据之前,还包括:根据目标报关文档的文档类型,确定待识别的各报关条目。In order to quickly identify customs declaration items based on the target customs declaration document and further improve efficiency, content recognition is performed on the target customs declaration document based on optical character recognition (OCR) to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, including: According to the target The document type of the customs declaration document determines each customs declaration item to be identified.

由于报关需要审核的文档类型较多,如包括报关单、装箱单、运单、各类许可证、出口收汇核销单、授权委托书、合同、产地证等。每种类型的文档需要提取的数据也不同。其中,每类文档的格式可能不一样,相同的商品,包装、单价都有可能不一样。There are many types of documents that need to be reviewed for customs declaration, such as customs declaration, packing list, waybill, various licenses, export collection verification and write-off form, power of attorney, contract, certificate of origin, etc. Each type of document requires different data to be extracted. Among them, the format of each type of document may be different, and the packaging and unit price of the same product may be different.

针对不同类型的目标报关文档,可先确定待识别的各报关条目,例如,当目标报关文档为报关单时,确定待识别的各报关条目为:境内发货人、境外收货人、出境关别、出口日期、申报日期、品牌类型、出口惠享情况、用途、材质、品牌、规格或型号;当目标报关文档为合同或发票时,确定待识别的各报关条目为:品牌、规格或型号。这样便可以更快地根据目标报关文档识别出报关条目,进一步提高效率。For different types of target customs declaration documents, you can first determine the customs declaration items to be identified. For example, when the target customs declaration document is a customs declaration form, determine the customs declaration items to be identified as: domestic consignor, overseas consignee, exit customs Type, export date, declaration date, brand type, export preference, use, material, brand, specification or model; when the target customs declaration document is a contract or invoice, determine the customs declaration items to be identified as: brand, specification or model . In this way, customs declaration items can be identified more quickly based on the target customs declaration documents, further improving efficiency.

图3为本公开一个具体实施例所提供的一种结合RPA和AI报关信息的处理方法的流程示意图。如图3所示,当目标报关文档包括报关单时,本公开实施例的结合RPA和AI报关信息的处理方法,包括以下步骤:Fig. 3 is a schematic flowchart of a processing method combining RPA and AI customs declaration information provided by a specific embodiment of the present disclosure. As shown in Figure 3, when the target customs declaration document includes a customs declaration form, the processing method of combining RPA and AI customs declaration information according to the embodiment of the present disclosure includes the following steps:

S31,基于光学字符识别OCR对报关单进行内容识别,以得到目标报关文档中多条报关条目的报关数据。S31, performing content recognition on the customs declaration form based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document.

S32,将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中。S32. Enter the common items among the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process.

S33,从多条报关条目中查询商品标识。S33. Query commodity identifiers from multiple customs declaration entries.

S34,根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。S34. According to the commodity identifier, determine the commodity entry associated with the commodity identifier from the plurality of customs declaration entries, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface.

需要说明的是,上述步骤S31-S34的过程可参见上述步骤S11-S14,具体这里不再详述。It should be noted that, the process of the above-mentioned steps S31-S34 can refer to the above-mentioned steps S11-S14, which will not be detailed here.

S35,若报关界面中存在未录入的第一标准条目和/或第二标准条目,则基于OCR对除报关单之外的其余目标报关文档进行内容识别。S35. If there are unrecorded first standard items and/or second standard items in the customs declaration interface, perform content identification on other target customs declaration documents except the customs declaration form based on OCR.

当在报关单中查找不到报关界面中需要录入的第一标准条目和/或第二标准条目时,可以对除报关单之外的装箱单、运单、各类许可证、出口收汇核销单、授权委托书、合同、产地证等目标报关文档进行内容识别,以获取需要录入的第一标准条目和/或第二标准条目。When the first standard item and/or the second standard item that needs to be entered in the customs declaration interface cannot be found in the customs declaration form, the packing list, waybill, various licenses, and export collection checks other than the customs declaration form can be checked. Content identification of target customs declaration documents such as sales order, power of attorney, contract, certificate of origin, etc., to obtain the first standard item and/or the second standard item that needs to be entered.

为使本领域技术人员更清楚的了解本公开的结合RPA和AI报关信息的处理方法,结合图4所示,客户1、客户2、客户3…等客户分别将自己公司的报关单发送给中外运公司。需要说明的是,各个客户的报关单的板式可能不一样,例如,客户1使用报关单板式1,客户2使用报关单板式2,客户3使用报关单板式3,…。In order to make those skilled in the art more clearly understand the processing method of the present disclosure combining RPA and AI customs declaration information, as shown in Figure 4, customers such as customer 1, customer 2, customer 3...etc. send their own company's customs declaration to Sinotrans company. It should be noted that the format of the customs declaration form of each customer may be different, for example, customer 1 uses the customs declaration form 1, customer 2 uses the customs declaration form 2, customer 3 uses the customs declaration form 3, ....

由第一机器人基于光学字符识别OCR对报关单进行内容识别,以抽取出公共字段和商品编号,其中,公共字段包括境内发货人、境外收货人、出境关别、出口日期、申报日期、…。根据商品编号从报关单中抽取要素字段,其中,要素字段包括品牌类型、出口惠享情况、用途、材质、…。然后,进行汇总,并检查是否还有报关条目没有录入报关数据,如果还有报关条目未录入报关数据,则从除报关单之外的目标报关文档中查找未录入报关数据的报关条目,如果报关条目都录入了报关数据,则导出标准格式报关单。The first robot recognizes the content of the customs declaration form based on optical character recognition (OCR) to extract the public fields and commodity numbers. Among them, the public fields include domestic consignor, overseas consignee, exit customs, export date, declaration date, … The element fields are extracted from the customs declaration form according to the commodity number, where the element fields include brand type, export preference, use, material, .... Then, make a summary and check whether there are customs declaration items that have not been entered into the customs declaration data. If there are still customs declaration items that have not been entered into the customs declaration data, then search for the customs declaration items that have not entered the customs declaration data from the target customs declaration documents other than the customs declaration form. If the customs declaration If the customs declaration data has been entered in the entries, the standard format customs declaration form will be exported.

由此,本公开实施例的结合RPA和AI报关信息的处理方法,基于光学字符识别OCR对报关单进行内容识别,以得到目标报关文档中多条报关条目的报关数据,将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中,从多条报关条目中查询商品标识,根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中,若报关界面中存在未录入的第一标准条目和/或第二标准条目,则基于OCR对除报关单之外的其余目标报关文档进行内容识别。由此,该方法能够节约人工操作的步骤,节省人工成本,很大程度上提高了效率,大大减轻人员的工作压力,使得录入错误率得以控制。Therefore, the processing method of customs declaration information combined with RPA and AI in the embodiment of the present disclosure, based on optical character recognition (OCR), performs content recognition on the customs declaration form, so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document. According to the pre-set first robotic process automation RPA operation process, the customs declaration data of the public entry is entered into the corresponding first standard entry in the customs declaration interface, and the commodity identification is queried from multiple customs declaration entries. According to the commodity identification, from Determine the commodity entry associated with the commodity identifier among the multiple customs declaration entries, and enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface. If there are unregistered first standard entries and/or second standard entries in the customs declaration interface Items, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form. Therefore, the method can save manual operation steps and labor costs, greatly improve efficiency, greatly reduce the work pressure of personnel, and control the input error rate.

为了实现上述实施例,本公开还提出一种结合RPA和AI报关信息的处理装置。In order to realize the above embodiments, the present disclosure also proposes a processing device combining RPA and AI customs declaration information.

图5为本公开实施例所提供的一种结合RPA和AI报关信息的处理装置的方框示意图。Fig. 5 is a schematic block diagram of a processing device combining RPA and AI customs declaration information provided by an embodiment of the present disclosure.

如图5所示,本公开实施例的结合RPA和AI报关信息的处理装置,包括:第一识别模块51、第一录入模块52、查询模块53和第二录入模块54。As shown in FIG. 5 , the processing device combining RPA and AI customs declaration information in the embodiment of the present disclosure includes: a first identification module 51 , a first entry module 52 , a query module 53 and a second entry module 54 .

其中,第一识别模块51用于基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据。第一录入模块52用于将多条报关条目中 的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中。查询模块53用于从多条报关条目中查询商品标识。第二录入模块54用于根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。Wherein, the first recognition module 51 is configured to perform content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document. The first input module 52 is used to input the public items among the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process. The inquiry module 53 is used for inquiring commodity identifiers from multiple customs declaration entries. The second input module 54 is used to determine the commodity item associated with the commodity identifier from multiple customs declaration entries according to the commodity identifier, and enter the customs declaration data of the commodity item into the second standard entry corresponding to the customs declaration interface.

根据本公开的一个实施例,第二录入模块54,包括:查询单元、第一录入单元和第二录入单元。其中,查询单元,用于根据商品标识,查询商品标识与商品条目之间的关联关系;第一录入单元,用于在查询到商品标识关联的商品条目的情况下,根据预先设置的第二RPA操作流程,将商品条目的报关数据录入报关界面对应的第二标准条目中;第二录入单元,用于在未查询到商品标识关联的商品条目的情况下,根据报关界面中第二标准条目,从多条报关条目中查询语义相似的报关条目作为商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。According to an embodiment of the present disclosure, the second input module 54 includes: a query unit, a first input unit and a second input unit. Among them, the query unit is used to query the relationship between the commodity identifier and the commodity entry according to the commodity identifier; the first entry unit is used to query the commodity entry associated with the commodity identifier according to the preset second RPA The operation process is to enter the customs declaration data of the commodity entry into the second standard entry corresponding to the customs declaration interface; the second entry unit is used to, according to the second standard entry in the customs declaration interface, if no commodity entry associated with the commodity identifier is found, Query customs declaration entries with similar semantics from multiple customs declaration entries as commodity entries, and enter the customs declaration data of the commodity entries into the second standard entry corresponding to the customs declaration interface.

根据本公开的一个实施例,第二录入单元,包括:第一确定子单元和第二确定子单元。其中,第一确定子单元用于对报关界面中的第二标准条目,基于自然语言处理NLP确定与各报关条目之间的语义相似度。第二确定子单元用于根据语义相似度最高的报关条目,确定与第二标准条目语义相似的商品条目。According to an embodiment of the present disclosure, the second entry unit includes: a first determination subunit and a second determination subunit. Wherein, the first determining subunit is configured to determine the semantic similarity between the second standard item in the customs declaration interface and each customs declaration item based on natural language processing (NLP). The second determination subunit is configured to determine commodity items semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity.

根据本公开的一个实施例,上述的结合RPA和AI报关信息的处理装置,还包括:展示模块,用于将商品标识录入报关界面中对应的标准条目,以在报关界面展示第二标准条目。According to an embodiment of the present disclosure, the above-mentioned processing device combining RPA and AI customs declaration information further includes: a display module, configured to enter the commodity identifier into the corresponding standard item in the customs declaration interface, so as to display the second standard item on the customs declaration interface.

根据本公开的一个实施例,上述的结合RPA和AI报关信息的处理装置,还包括:确定模块,用于根据目标报关文档的文档类型,确定待识别的各报关条目。According to an embodiment of the present disclosure, the above-mentioned processing device combining RPA and AI customs declaration information further includes: a determination module configured to determine each customs declaration item to be identified according to the document type of the target customs declaration document.

根据本公开的一个实施例,目标报关文档为多个,包括报关单;第一识别模块51,包括:识别单元,用于基于OCR对报关单进行内容识别,以得到报关单中包含的多条报关条目的报关数据。According to an embodiment of the present disclosure, there are multiple target customs declaration documents, including a customs declaration form; the first identification module 51 includes: an identification unit for performing content identification on the customs declaration form based on OCR, so as to obtain multiple items contained in the customs declaration form; The customs declaration data of the customs declaration item.

根据本公开的一个实施例,上述的结合RPA和AI报关信息的处理装置,还包括:第二识别模块,用于在报关界面中存在未录入的第一标准条目和/或第二标准条目的情况下,基于OCR对除报关单之外的其余目标报关文档进行内容识别。According to an embodiment of the present disclosure, the above-mentioned processing device combining RPA and AI customs declaration information further includes: a second identification module, used for unrecorded first standard items and/or second standard items in the customs declaration interface In this case, based on OCR, perform content identification on the rest of the target customs declaration documents except the customs declaration form.

需要说明的是,前述对结合RPA和AI报关信息的处理方法实施例的解释说明也适用于该实施例的结合RPA和AI报关信息的处理装置,此处不再赘述。It should be noted that the foregoing explanations of the embodiment of the processing method combining RPA and AI customs declaration information are also applicable to the processing device combining RPA and AI customs declaration information of this embodiment, and details are not repeated here.

根据本公开实施例的结合RPA和AI报关信息的处理装置,通过第一识别模块基于光学字符识别OCR对目标报关文档进行内容识别,以得到目标报关文档中多条报关条目的报关数据,通过第一录入模块将多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将公共条目的报关数据录入报关界面中对应的第一标准条目中, 通过查询模块从多条报关条目中查询商品标识,通过第二录入模块根据商品标识,从多条报关条目中确定与商品标识关联的商品条目,并将商品条目的报关数据录入报关界面对应的第二标准条目中。由此,该装置能够提高及时性,节省人工成本,降低录入错误的风险,提高效率。According to the processing device combining RPA and AI customs declaration information according to an embodiment of the present disclosure, the first recognition module performs content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document, and through the second An input module inputs the public items among the multiple customs declaration items into the corresponding first standard item in the customs declaration interface according to the pre-set first robotic process automation RPA operation process, and uses the query module to select from multiple items The commodity identification is queried in the customs declaration entry, and the commodity entry associated with the commodity identification is determined from the multiple customs declaration entries through the second input module according to the commodity logo, and the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface. Therefore, the device can improve timeliness, save labor costs, reduce the risk of input errors, and improve efficiency.

为了实现上述实施例,本公开还提出了一种电子设备。In order to realize the above-mentioned embodiments, the present disclosure also proposes an electronic device.

本公开实施例的电子设备,包括:至少一个处理器;以及与至少一个处理器通信连接的存储器;其中,存储器存储有可被至少一个处理器执行的指令,指令被至少一个处理器执行,以使至少一个处理器能够执行上述的结合RPA和AI报关信息的处理方法。An electronic device according to an embodiment of the present disclosure includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein, the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to Enabling at least one processor to execute the above-mentioned processing method combining RPA and AI customs declaration information.

为了实现上述实施例,本公开还提出了一种非临时性计算机可读存储介质。In order to realize the above-mentioned embodiments, the present disclosure also proposes a non-transitory computer-readable storage medium.

本公开实施例的非临时性计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述的结合RPA和AI报关信息的处理方法。The non-transitory computer-readable storage medium of the embodiment of the present disclosure stores a computer program thereon, and when the program is executed by a processor, the above-mentioned processing method combining RPA and AI customs declaration information is realized.

为了实现上述实施例,本公开还提出了一种计算机程序产品。In order to realize the above embodiments, the present disclosure also proposes a computer program product.

本公开实施例的计算机程序产品,当所述计算机程序产品中的指令由处理器执行时,执行上述的结合RPA和AI报关信息的处理方法。The computer program product of the embodiment of the present disclosure, when the instructions in the computer program product are executed by the processor, executes the above-mentioned processing method combining RPA and AI customs declaration information.

图6示出了适于用来实现本公开实施方式的示例性电子设备的框图。图6显示的电子设备12仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。FIG. 6 shows a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present disclosure. The electronic device 12 shown in FIG. 6 is only an example, and should not limit the functions and scope of use of the embodiments of the present disclosure.

如图6所示,电子设备12以通用计算设备的形式表现。电子设备12的组件可以包括但不限于:一个或者多个处理器或者处理单元16,系统存储器28,连接不同系统组件(包括系统存储器28和处理单元16)的总线18。As shown in FIG. 6, electronic device 12 takes the form of a general-purpose computing device. Components of electronic device 12 may include, but are not limited to, one or more processors or processing units 16, system memory 28, bus 18 connecting various system components including system memory 28 and processing unit 16.

总线18表示几类总线结构中的一种或多种,包括存储器总线或者存储器控制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(Industry Standard Architecture;以下简称:ISA)总线,微通道体系结构(Micro Channel Architecture;以下简称:MAC)总线,增强型ISA总线、视频电子标准协会(Video Electronics Standards Association;以下简称:VESA)局域总线以及外围组件互连(Peripheral Component Interconnection;以下简称:PCI)总线。Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus structures. For example, these architectures include but are not limited to Industry Standard Architecture (Industry Standard Architecture; hereinafter referred to as: ISA) bus, Micro Channel Architecture (Micro Channel Architecture; hereinafter referred to as: MAC) bus, enhanced ISA bus, video electronics Standards Association (Video Electronics Standards Association; hereinafter referred to as: VESA) local bus and Peripheral Component Interconnection (hereinafter referred to as: PCI) bus.

电子设备12典型地包括多种计算机系统可读介质。这些介质可以是任何能够被电子设备12访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12 and include both volatile and nonvolatile media, removable and non-removable media.

存储器28可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(Random Access Memory;以下简称:RAM)30和/或高速缓存存储器32。电子设备12可以进一步包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统34可以用于读写不可移动的、非易失性磁介质(图7未显示,通常称为“硬 盘驱动器”)。尽管图7中未示出,可以提供用于对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性光盘(例如:光盘只读存储器(Compact Disc Read Only Memory;以下简称:CD-ROM)、数字多功能只读光盘(Digital Video Disc Read Only Memory;以下简称:DVD-ROM)或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过一个或者多个数据介质接口与总线18相连。存储器28可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本公开各实施例的功能。The memory 28 may include a computer system readable medium in the form of a volatile memory, such as a random access memory (Random Access Memory; hereinafter referred to as: RAM) 30 and/or a cache memory 32 . The electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from and write to non-removable, non-volatile magnetic media (not shown in Figure 7, commonly referred to as "hard drives"). Although not shown in FIG. 7, a disk drive for reading and writing to a removable nonvolatile disk (such as a "floppy disk") may be provided, as well as a disk drive for removable nonvolatile disks (such as a CD-ROM (Compact Disc Read Only Memory; hereinafter referred to as: CD-ROM), Digital Video Disc Read Only Memory (hereinafter referred to as: DVD-ROM) or other optical media). In these cases, each drive may be connected to bus 18 via one or more data media interfaces. Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of various embodiments of the present disclosure.

具有一组(至少一个)程序模块42的程序/实用工具40,可以存储在例如存储器28中,这样的程序模块42包括但不限于操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。程序模块42通常执行本公开所描述的实施例中的功能和/或方法。A program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in memory 28, such program modules 42 including but not limited to an operating system, one or more application programs, other program modules, and program data , each or some combination of these examples may include implementations of network environments. The program modules 42 generally perform the functions and/or methods of the embodiments described in this disclosure.

电子设备12也可以与一个或多个外部设备14(例如键盘、指向设备、显示器24等)通信,还可与一个或者多个使得用户能与该计算机系统/服务器12交互的设备通信,和/或与使得该计算机系统/服务器12能与一个或多个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口22进行。并且,电子设备12还可以通过网络适配器20与一个或者多个网络(例如局域网(Local Area Network;以下简称:LAN),广域网(Wide Area Network;以下简称:WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器20通过总线18与电子设备12的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备12使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID(Redundant Arrays of Independent Disks,磁盘阵列)系统、磁带驱动器以及数据备份存储系统等。Electronic device 12 may also communicate with one or more external devices 14 (e.g., a keyboard, pointing device, display 24, etc.), and with one or more devices that enable a user to interact with the computer system/server 12, and/or Or communicate with any device (eg, network card, modem, etc.) that enables the computer system/server 12 to communicate with one or more other computing devices. Such communication may occur through input/output (I/O) interface 22 . Moreover, the electronic device 12 can also communicate with one or more networks (such as a local area network (Local Area Network; hereinafter referred to as: LAN), a wide area network (Wide Area Network; hereinafter referred to as: WAN) and/or a public network, such as the Internet, through the network adapter 20. ) communication. As shown, network adapter 20 communicates with other modules of electronic device 12 via bus 18 . It should be understood that although not shown, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID (Redundant Arrays of Independent Disks, disk array) systems, tape drives, and data backup storage systems.

处理单元16通过运行存储在系统存储器28中的程序,从而执行各种功能应用以及数据处理,例如实现前述实施例中提及的方法。The processing unit 16 executes various functional applications and data processing by running the programs stored in the system memory 28 , such as implementing the methods mentioned in the foregoing embodiments.

在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本公开的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present disclosure. In this specification, the schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the described specific features, structures, materials or characteristics may be combined in any suitable manner in any one or more embodiments or examples. In addition, those skilled in the art can combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without conflicting with each other.

此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示 或者隐含地包括至少一个该特征。在本公开的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present disclosure, "plurality" means at least two, such as two, three, etc., unless otherwise specifically defined.

流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本公开的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本公开的实施例所属技术领域的技术人员所理解。Any process or method descriptions in flowcharts or otherwise described herein may be understood to represent a module, segment or portion of code comprising one or more executable instructions for implementing custom logical functions or steps of a process , and the scope of preferred embodiments of the present disclosure includes additional implementations in which functions may be performed out of the order shown or discussed, including substantially concurrently or in reverse order depending on the functions involved, which shall It is understood by those skilled in the art to which the embodiments of the present disclosure pertain.

在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and/or steps represented in the flowcharts or otherwise described herein, for example, can be considered as a sequenced listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium, For use with instruction execution systems, devices, or devices (such as computer-based systems, systems including processors, or other systems that can fetch instructions from instruction execution systems, devices, or devices and execute instructions), or in conjunction with these instruction execution systems, devices or equipment used. For the purposes of this specification, a "computer-readable medium" may be any device that can contain, store, communicate, propagate or transmit a program for use in or in conjunction with an instruction execution system, device or device. More specific examples (non-exhaustive list) of computer-readable media include the following: electrical connection with one or more wires (electronic device), portable computer disk case (magnetic device), random access memory (RAM), Read Only Memory (ROM), Erasable and Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program can be printed, since the program can be read, for example, by optically scanning the paper or other medium, followed by editing, interpretation or other suitable processing if necessary. The program is processed electronically and stored in computer memory.

应当理解,本公开的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。如,如果用硬件来实现和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of the present disclosure may be implemented in hardware, software, firmware or a combination thereof. In the embodiments described above, various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: a discrete Logic circuits, ASICs with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。Those of ordinary skill in the art can understand that all or part of the steps carried by the methods of the above embodiments can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable storage medium. During execution, one or a combination of the steps of the method embodiments is included.

此外,在本公开各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以 软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present disclosure may be integrated into one processing module, each unit may exist separately physically, or two or more units may be integrated into one module. The above-mentioned integrated modules can be implemented in the form of hardware or in the form of software function modules. If the integrated modules are realized in the form of software function modules and sold or used as independent products, they can also be stored in a computer-readable storage medium.

上述提到的存储介质可以是只读存储器,磁盘或光盘等。尽管上面已经示出和描述了本公开的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本公开的限制,本领域的普通技术人员在本公开的范围内可以对上述实施例进行变化、修改、替换和变型。The storage medium mentioned above may be a read-only memory, a magnetic disk or an optical disk, and the like. Although the embodiments of the present disclosure have been shown and described above, it can be understood that the above embodiments are exemplary and should not be construed as limitations on the present disclosure, and those skilled in the art can understand the above-mentioned embodiments within the scope of the present disclosure. The embodiments are subject to changes, modifications, substitutions and variations.

Claims (17)

一种结合RPA和AI报关信息的处理方法,其特征在于,包括以下步骤:A processing method combining RPA and AI customs declaration information, characterized in that it comprises the following steps: 基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据;performing content recognition on the target customs declaration document based on optical character recognition (OCR), so as to obtain customs declaration data of multiple customs declaration items in the target customs declaration document; 将所述多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将所述公共条目的报关数据录入报关界面中对应的第一标准条目中;Enter the public items in the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process; 从所述多条报关条目中查询商品标识;Querying commodity identifiers from the multiple customs declaration entries; 根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。According to the commodity identifier, the commodity item associated with the commodity identifier is determined from the plurality of customs declaration entries, and the customs declaration data of the commodity entry is entered into the second standard entry corresponding to the customs declaration interface. 根据权利要求1所述的处理方法,其特征在于,所述根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中,包括:The processing method according to claim 1, characterized in that, according to the commodity identifier, the commodity entry associated with the commodity identifier is determined from the multiple customs declaration entries, and the customs declaration data of the commodity entry is Enter the second standard entry corresponding to the customs declaration interface, including: 根据所述商品标识,查询商品标识与商品条目之间的关联关系;According to the commodity identifier, query the relationship between the commodity identifier and the commodity entry; 若查询到所述商品标识关联的商品条目,则根据预先设置的第二RPA操作流程,将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中;If the commodity item associated with the commodity identifier is found, according to the preset second RPA operation process, the customs declaration data of the commodity item is entered into the second standard entry corresponding to the customs declaration interface; 若未查询到所述商品标识关联的商品条目,则根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。If the commodity item associated with the commodity identifier is not found, according to the second standard entry in the customs declaration interface, query the customs declaration entry with similar semantics from the multiple customs declaration entries as the commodity entry, and use all The customs declaration data of the commodity item is entered into the second standard item corresponding to the customs declaration interface. 根据权利要求2所述的处理方法,其特征在于,所述根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,包括:The processing method according to claim 2, characterized in that, according to the second standard entry in the customs declaration interface, searching for customs declaration entries with similar semantics from the plurality of customs declaration entries as the commodity entry includes : 对所述报关界面中的所述第二标准条目,基于自然语言处理NLP确定与各所述报关条目之间的语义相似度;For the second standard item in the customs declaration interface, determine the semantic similarity with each of the customs declaration items based on natural language processing NLP; 根据所述语义相似度最高的报关条目,确定与所述第二标准条目语义相似的商品条目。According to the customs declaration item with the highest semantic similarity, commodity items semantically similar to the second standard item are determined. 根据权利要求2所述的处理方法,其特征在于,所述从所述多条报关条目中查询商品标识之后,还包括:The processing method according to claim 2, characterized in that, after querying the commodity identifiers from the multiple customs declaration entries, further comprising: 将所述商品标识录入所述报关界面中对应的标准条目,以在所述报关界面展示所述第二标准条目。Entering the commodity identifier into the corresponding standard item in the customs declaration interface, so as to display the second standard item in the customs declaration interface. 根据权利要求1-4任一项所述的处理方法,其特征在于,所述基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据之前,还包括:The processing method according to any one of claims 1-4, wherein the OCR-based optical character recognition is used to perform content recognition on the target customs declaration document, so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document ,Also includes: 根据所述目标报关文档的文档类型,确定待识别的各所述报关条目。According to the document type of the target customs declaration document, each of the customs declaration items to be identified is determined. 根据权利要求1-4任一项所述的处理方法,其特征在于,所述目标报关文档为多个, 包括报关单;所述基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据,包括:According to the processing method described in any one of claims 1-4, it is characterized in that there are multiple target customs declaration documents, including a customs declaration form; the content recognition of the target customs declaration document based on optical character recognition OCR is carried out to obtain the Describe the customs declaration data of multiple customs declaration items in the target customs declaration document, including: 基于OCR对所述报关单进行内容识别,以得到所述报关单中包含的多条报关条目的报关数据。Content recognition is performed on the customs declaration form based on OCR to obtain customs declaration data of multiple customs declaration items included in the customs declaration form. 根据权利要求6所述的处理方法,其特征在于,所述根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中之后,还包括:The processing method according to claim 6, characterized in that, according to the commodity identifier, the commodity entry associated with the commodity identifier is determined from the multiple customs declaration entries, and the customs declaration data of the commodity entry is After entering the second standard entry corresponding to the customs declaration interface, it also includes: 若所述报关界面中存在未录入的所述第一标准条目和/或所述第二标准条目,则基于OCR对除所述报关单之外的其余目标报关文档进行内容识别。If there is the first standard item and/or the second standard item not entered in the customs declaration interface, content identification is performed on other target customs declaration documents except the customs declaration form based on OCR. 一种结合RPA和AI报关信息的处理装置,其特征在于,包括:A processing device combining RPA and AI customs declaration information, characterized in that it includes: 第一识别模块,用于基于光学字符识别OCR对目标报关文档进行内容识别,以得到所述目标报关文档中多条报关条目的报关数据;The first identification module is used to identify the content of the target customs declaration document based on optical character recognition (OCR), so as to obtain the customs declaration data of multiple customs declaration items in the target customs declaration document; 第一录入模块,用于将所述多条报关条目中的公共条目,根据预先设置的第一机器人流程自动化RPA操作流程,将所述公共条目的报关数据录入报关界面中对应的第一标准条目中;The first input module is used to input the public items among the multiple customs declaration items into the corresponding first standard items in the customs declaration interface according to the preset first robotic process automation RPA operation process. middle; 查询模块,用于从所述多条报关条目中查询商品标识;A query module, configured to query commodity identifiers from the multiple customs declaration entries; 第二录入模块,用于根据所述商品标识,从所述多条报关条目中确定与所述商品标识关联的商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。The second entry module is used to determine the commodity entry associated with the commodity identifier from the plurality of customs declaration entries according to the commodity identifier, and enter the customs declaration data of the commodity entry into the second corresponding to the customs declaration interface. standard entry. 根据权利要求8所述的处理装置,其特征在于,所述第二录入模块,包括:The processing device according to claim 8, wherein the second input module comprises: 查询单元,用于根据所述商品标识,查询商品标识与商品条目之间的关联关系;A query unit, configured to query the relationship between the commodity identifier and the commodity entry according to the commodity identifier; 第一录入单元,用于在查询到所述商品标识关联的商品条目的情况下,根据预先设置的第二RPA操作流程,将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中;The first input unit is configured to input the customs declaration data of the commodity item into the second standard corresponding to the customs declaration interface according to the preset second RPA operation process when the commodity item associated with the commodity identifier is found. entry; 第二录入单元,用于在未查询到所述商品标识关联的商品条目的情况下,根据所述报关界面中所述第二标准条目,从所述多条报关条目中查询语义相似的报关条目作为所述商品条目,并将所述商品条目的报关数据录入所述报关界面对应的第二标准条目中。The second entry unit is configured to search for customs declaration items with similar semantics from the plurality of customs declaration entries according to the second standard entry in the customs declaration interface if no product item associated with the product identifier is found. as the commodity item, and enter the customs declaration data of the commodity item into the second standard item corresponding to the customs declaration interface. 根据权利要求9所述的处理装置,其特征在于,所述第二录入单元,包括:The processing device according to claim 9, wherein the second input unit comprises: 第一确定子单元,用于对所述报关界面中的所述第二标准条目,基于自然语言处理NLP确定与各所述报关条目之间的语义相似度;The first determining subunit is configured to determine the semantic similarity between the second standard item in the customs declaration interface and each of the customs declaration items based on natural language processing NLP; 第二确定子单元,用于根据所述语义相似度最高的报关条目,确定与所述第二标准条目语义相似的商品条目。The second determination subunit is configured to determine commodity items semantically similar to the second standard item according to the customs declaration item with the highest semantic similarity. 根据权利要求9所述的处理装置,其特征在于,还包括:The processing device according to claim 9, further comprising: 展示模块,用于将所述商品标识录入所述报关界面中对应的标准条目,以在所述报关界面展示所述第二标准条目。A display module, configured to enter the commodity identifier into a corresponding standard item in the customs declaration interface, so as to display the second standard item in the customs declaration interface. 根据权利要求8-11任一项所述的处理装置,其特征在于,还包括:The processing device according to any one of claims 8-11, further comprising: 确定模块,用于根据所述目标报关文档的文档类型,确定待识别的各所述报关条目。The determining module is configured to determine each of the customs declaration items to be identified according to the document type of the target customs declaration document. 根据权利要求8-11任一项所述的处理装置,其特征在于,所述目标报关文档为多个,包括报关单;所述第一识别模块,包括:The processing device according to any one of claims 8-11, wherein the target customs declaration document is multiple, including a customs declaration form; the first identification module includes: 识别单元,用于基于OCR对所述报关单进行内容识别,以得到所述报关单中包含的多条报关条目的报关数据。The identification unit is configured to perform content identification on the customs declaration form based on OCR, so as to obtain customs declaration data of multiple customs declaration items included in the customs declaration form. 根据权利要求13所述的处理装置,其特征在于,还包括:The processing device according to claim 13, further comprising: 第二识别模块,用于在所述报关界面中存在未录入的所述第一标准条目和/或所述第二标准条目的情况下,基于OCR对除所述报关单之外的其余目标报关文档进行内容识别。The second identification module is used to declare other targets except the customs declaration form based on OCR when the first standard item and/or the second standard item are not entered in the customs declaration interface. Documents are content-aware. 一种电子设备,其特征在于,包括:An electronic device, characterized in that it comprises: 至少一个处理器;以及at least one processor; and 与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein, 所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-7任一项所述的结合RPA和AI报关信息的处理方法。The memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can perform the method described in any one of claims 1-7. Combining RPA and AI customs declaration information processing method. 一种非临时性计算机可读存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行时实现如权利要求1-7任一项所述的结合RPA和AI报关信息的处理方法。A non-transitory computer-readable storage medium on which a computer program is stored, characterized in that, when the program is executed by a processor, it realizes the processing of combining RPA and AI customs declaration information as described in any one of claims 1-7 method. 一种计算机程序产品,其特征在于,当所述计算机程序产品中的指令由处理器执行时,执行如权利要求1-7任一项所述的结合RPA和AI报关信息的处理方法。A computer program product, characterized in that when the instructions in the computer program product are executed by a processor, the method for processing customs declaration information combining RPA and AI according to any one of claims 1-7 is executed.
PCT/CN2021/114323 2021-06-16 2021-08-24 Rpa and ai combined customs declaration information processing method and processing device Ceased WO2022262114A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110667458.2A CN113553393B (en) 2021-06-16 2021-06-16 Processing method and processing device combining RPA and AI customs clearance information
CN202110667458.2 2021-06-16

Publications (1)

Publication Number Publication Date
WO2022262114A1 true WO2022262114A1 (en) 2022-12-22

Family

ID=78102145

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/114323 Ceased WO2022262114A1 (en) 2021-06-16 2021-08-24 Rpa and ai combined customs declaration information processing method and processing device

Country Status (2)

Country Link
CN (1) CN113553393B (en)
WO (1) WO2022262114A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117010779A (en) * 2023-06-30 2023-11-07 南京希音电子商务有限公司 Customs import and export declaration method, system, equipment and medium based on intelligent coding
CN118692096A (en) * 2024-02-05 2024-09-24 宁波火箭数据科技有限公司 Method for filling in customs clearance form

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114185935A (en) * 2021-11-02 2022-03-15 北京来也网络科技有限公司 Social security data processing method and device combining RPA and AI and storage medium
CN114442883A (en) * 2022-01-19 2022-05-06 来也科技(北京)有限公司 Service processing method and device based on AI and RPA
CN117236310B (en) * 2023-10-26 2024-08-02 湖南中拓信息科技有限公司 Bill recognition method, system and readable storage medium based on OCR technology

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009115855A1 (en) * 2008-03-18 2009-09-24 Council Of Scientific & Industrial Research Hand-held device and step-scanning technique for reading by the blind
US20100057507A1 (en) * 2008-08-29 2010-03-04 International Business Machines Corporation Real-Time Manufacturing Routing Optimization Using Content and Quality Level
CN107437085A (en) * 2017-08-09 2017-12-05 厦门商集企业咨询有限责任公司 A kind of method, apparatus and readable storage medium storing program for executing of lifting OCR discriminations
CN111191435A (en) * 2019-12-25 2020-05-22 上海三稻智能科技有限公司 Method and device for generating report form by using dynamic template of customs report form
CN111881265A (en) * 2020-09-28 2020-11-03 成都数联铭品科技有限公司 Method and system for processing customs declaration and commodity information, storage medium and electronic equipment
CN112558954A (en) * 2020-12-29 2021-03-26 北京来也网络科技有限公司 Information extraction method, device, medium and electronic equipment combining RPA and AI

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109146248A (en) * 2018-07-19 2019-01-04 深圳市有信达供应链集团股份有限公司 Customs declaration risk management and control method, calculates equipment and storage medium at device
CN112560423A (en) * 2020-12-22 2021-03-26 北京来也网络科技有限公司 Document processing method, device, equipment and medium combining RPA and AI

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009115855A1 (en) * 2008-03-18 2009-09-24 Council Of Scientific & Industrial Research Hand-held device and step-scanning technique for reading by the blind
US20100057507A1 (en) * 2008-08-29 2010-03-04 International Business Machines Corporation Real-Time Manufacturing Routing Optimization Using Content and Quality Level
CN107437085A (en) * 2017-08-09 2017-12-05 厦门商集企业咨询有限责任公司 A kind of method, apparatus and readable storage medium storing program for executing of lifting OCR discriminations
CN111191435A (en) * 2019-12-25 2020-05-22 上海三稻智能科技有限公司 Method and device for generating report form by using dynamic template of customs report form
CN111881265A (en) * 2020-09-28 2020-11-03 成都数联铭品科技有限公司 Method and system for processing customs declaration and commodity information, storage medium and electronic equipment
CN112558954A (en) * 2020-12-29 2021-03-26 北京来也网络科技有限公司 Information extraction method, device, medium and electronic equipment combining RPA and AI

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117010779A (en) * 2023-06-30 2023-11-07 南京希音电子商务有限公司 Customs import and export declaration method, system, equipment and medium based on intelligent coding
CN118692096A (en) * 2024-02-05 2024-09-24 宁波火箭数据科技有限公司 Method for filling in customs clearance form

Also Published As

Publication number Publication date
CN113553393B (en) 2024-08-09
CN113553393A (en) 2021-10-26

Similar Documents

Publication Publication Date Title
WO2022262114A1 (en) Rpa and ai combined customs declaration information processing method and processing device
US11714831B2 (en) Data processing and classification
US12159105B2 (en) Automated categorization and summarization of documents using machine learning
US20210103964A1 (en) Account manager virtual assistant using machine learning techniques
US20250190940A1 (en) Data network
US7921367B2 (en) Application generator for data transformation applications
US20170103441A1 (en) Comparing Business Documents to Recommend Organizations
US11720649B2 (en) System and method for classification of data in a machine learning system
CN112560418B (en) Create line item information from free-form tabular data
US12106257B2 (en) Data processing system and method for operating an enterprise application
CN108596179A (en) A kind of VAT invoice amount of money method of inspection
WO2023159771A1 (en) Rpa and ai-based invoice processing method and apparatus, device, and medium
US11416904B1 (en) Account manager virtual assistant staging using machine learning techniques
CN114331316A (en) AI and RPA-based order information entry method, device, equipment and medium
US20240062235A1 (en) Systems and methods for automated processing and analysis of deduction backup data
US12373406B2 (en) Efficient data migration in enterprise resource planning systems
US11893817B2 (en) Method and system for generating document field predictions
US20250014066A1 (en) Systems and methods for automated processing and analysis of deduction backup data
US12530537B2 (en) Self-attentive key-value extraction
Kerutis et al. Intelligent Invoice Documents Processing Employing RPA Technologies
TW202232414A (en) Electronic invoice intelligent automatic accounting platform
TWM616385U (en) Electronic invoice intelligent automatic accounting platform
Kerutis et al. Employing RPA Technologies
US20220067807A1 (en) System and method for facilitating one or more freight transactions
CN120562388A (en) A method and system for generating procurement technical specifications

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21945695

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 180424)

122 Ep: pct application non-entry in european phase

Ref document number: 21945695

Country of ref document: EP

Kind code of ref document: A1