[go: up one dir, main page]

CN103455471B - A kind of method and apparatus that text resolution is become key-value pair - Google Patents

A kind of method and apparatus that text resolution is become key-value pair Download PDF

Info

Publication number
CN103455471B
CN103455471B CN201210180496.6A CN201210180496A CN103455471B CN 103455471 B CN103455471 B CN 103455471B CN 201210180496 A CN201210180496 A CN 201210180496A CN 103455471 B CN103455471 B CN 103455471B
Authority
CN
China
Prior art keywords
text
variable
source variable
secondary data
grammatical structure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210180496.6A
Other languages
Chinese (zh)
Other versions
CN103455471A (en
Inventor
王尹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201210180496.6A priority Critical patent/CN103455471B/en
Publication of CN103455471A publication Critical patent/CN103455471A/en
Application granted granted Critical
Publication of CN103455471B publication Critical patent/CN103455471B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Stored Programmes (AREA)
  • Devices For Executing Special Programs (AREA)

Abstract

This application provides a kind of method and apparatus that text resolution becomes key-value pair, method therein specifically includes: the text resolution template of preset text formatting;Described text resolution template is for describing the form of pending text and the form of target key value pair;The text resolution template of described text formatting is compiled, obtain computer program discernible compiling result, described compiling result includes: source variable corresponding with the form of pending text and separator, and target variable corresponding with the form of target key value pair;Mate in pending text according to described source variable and separator, obtain the value of described source variable;Retain the source variable identical with described target variable and corresponding be worth, as with the described corresponding key-value pair of pending text.The application can improve the efficiency that text resolution becomes key-value pair.

Description

A kind of method and apparatus that text resolution is become key-value pair
Technical field
The application relates to technical field of data processing, particularly relates to a kind of method that text resolution becomes key-value pair and dress Put.
Background technology
Key assignments (Key-value) is to being a kind of data store organisation of computer program, and it can by a major key (key) Quickly uniquely to navigate to a key assignments (value), quickly search at needs and obtain with preserving in the scenes such as data one to one Extensively application.
In actual applications, it is frequently present of such demand: valuable information in one section of text extracted, and Stored by the form of key-value pair.Such as, pending text is " icbc001,123456 ", wherein conceals channel numbering And serial number (no) such information (api), need by contents extraction therein out, ultimately form following key-value pair api: icbc001,no:123456}。
The form of pending text is analyzed, is appreciated that text is the character separated by a set symbol String, comprises irregular separator in text.Therefore with reference to existing string operation method, text resolution is become key-value pair needs Using the mode obtaining substring, wherein obtaining substring needs to know in advance the substring of extraction original position in the text and termination Position;It addition, also need to know the position of separators such as ", ";At separator, intercept two substrings, and be assigned to become accordingly Amount, can obtain key-value pair.
According to above-mentioned parsing thought, the existing method that text resolution becomes key-value pair mainly uses one character of a character The mode processed, detailed process is as follows:
First, for pending text, start to search backward from first character, it is judged that be ", " either with or without a character, The position at this symbol place is obtained after finding this symbol;
Secondly, according to the position of ", " obtained, know the original position in whole character string of the substring before ", " and Final position, and, the original position in whole character string of the substring below and final position;
Again, the two substring is extracted respectively by the function extracting substring of programming language;
Finally, two substrings are assigned to api and no respectively.
To sum up, the existing lookup that text resolution becomes the method for key-value pair need to carry out text one character of one character To determine text formatting, and after lookup, need to determine variable form according to complicated text formatting, then determining variable After form, need by text string extracting corresponding for these variablees out, then by its assignment, form key-value pair, namely existing general Text resolution becomes the method for key-value pair to need full dose traversal many times.But, (such as have many text formatting is extremely complex Plant separator), and in the case of text size the longest (such as substring length is the longest), repeatedly full dose traversal can cause resolving effect Declining to a great extent of rate.
In a word, the technical problem that those skilled in the art urgently solve is needed exactly: how can improve text Resolve to the efficiency of key-value pair.
Summary of the invention
Technical problems to be solved in this application are to provide a kind of method and apparatus that text resolution becomes key-value pair, it is possible to Improve the efficiency that text resolution is become key-value pair.
In order to solve the problems referred to above, this application discloses a kind of method that text resolution is become key-value pair, including:
The text resolution template of preset text formatting;Described text resolution template for describe pending text form and The form of target key value pair;
The text resolution template of described text formatting is compiled, obtains computer program discernible compiling result, Described compiling result includes: source variable corresponding with the form of pending text and separator, and with the lattice of target key value pair The corresponding target variable of formula;
Mate in pending text according to described source variable and separator, obtain the value of described source variable;
Retain the source variable identical with described target variable and corresponding be worth, as with the described corresponding key of pending text It is right to be worth.
Preferably, described text resolution template includes several rows text;The described text resolution mould to described text formatting Plate is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure includes defining grammer knot Structure and reservation grammatical structure;
The form phase with pending text is identified from the described style of writing meeting described definition grammatical structure content originally The source variable answered and separator;
The form phase with target key value pair is identified from the described style of writing meeting described reservation grammatical structure content originally The target variable answered.
Preferably, described mate in pending text according to described source variable and separator, obtain described source and become The step of the value of amount, including:
From the beginning of the first character of pending text, according to the order of described source variable and separator at pending text In mate, in the matching result obtained, the text between first character to first separator is first source variable Value, the text between two separators is the value of the source variable of corresponding order.
Preferably, the data structure of described compiling result is class object.
Preferably, described text resolution template is additionally operable to describe the data check rule resolved;Described compiling result is also wrapped Include corresponding source variable regular with described data check and verify function accordingly;
Described method also includes:
Utilize verification function that the value of described source variable is verified, if verification is passed through, then perform described reservation with described Source variable that target variable is identical and being worth accordingly, as the step with the described corresponding key-value pair of pending text, if verification Do not pass through, then abandon described pending text.
Preferably, described text resolution template is additionally operable to describe the secondary data resolved and processes demand;Described compiling result Also include that processing the corresponding secondary data of demand with described secondary data processes source variable and corresponding new variables and secondary data Processing function, described secondary data processes function and is assigned to described for described secondary data processes the value of major key in source variable New variables;
The most described method also includes:
Process the corresponding secondary data of demand according to described and described secondary data and process source variable and corresponding new variables Process function with secondary data, described source variable and corresponding value are carried out secondary data process, obtains corresponding new variables and phase The value answered;
Retain the source variable identical with described target variable and new variables and corresponding be worth, as with described pending text Corresponding key-value pair.
Preferably, described text resolution template includes several rows text;The described text resolution mould to described text formatting Plate is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure includes defining grammer knot Structure, reservation grammatical structure and verification grammatical structure;
The form phase with pending text is identified from the described style of writing meeting described definition grammatical structure content originally The source variable answered and separator;
The form phase with target key value pair is identified from the described style of writing meeting described reservation grammatical structure content originally The target variable answered;
Identify and described data check rule phase from the described style of writing meeting described verification grammatical structure content originally The source variable answered and verify function accordingly.
Preferably, described text resolution template includes several rows text;The described text resolution mould to described text formatting Plate is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure includes defining grammer knot Structure, reservation grammatical structure and secondary data process grammatical structure;
The form phase with pending text is identified from the described style of writing meeting described definition grammatical structure content originally The source variable answered and separator;
The form phase with target key value pair is identified from the described style of writing meeting described reservation grammatical structure content originally The target variable answered;
Identify and described two numbers from the described style of writing meeting the process grammatical structure of described secondary data content originally Source variable and corresponding new variables and secondary data process function is processed according to processing the corresponding secondary data of demand.
Preferably, described verification function correspondence regular expression.
On the other hand, disclosed herein as well is a kind of device that text resolution is become key-value pair, including:
Template preset module, for the text resolution template of preset text formatting;Described text resolution template is used for describing The form of pending text and the form of target key value pair;
Template collector, for being compiled the text resolution template of described text formatting, obtains computer program Discernible compiling result, described compiling result includes: source variable corresponding with the form of pending text and separator, and Target variable corresponding with the form of target key value pair;
Text matches module, for mating in pending text according to described source variable and separator, obtains institute State the value of source variable;And
Variable reservation module, for retaining the source variable identical with described target variable and corresponding being worth, as with described The corresponding key-value pair of pending text.
Preferably, described text resolution template includes several rows text;Described template collector includes:
Reading submodule, for reading the content originally of composing a piece of writing of described text resolution template;
Grammatical structure determines submodule, for determining the set grammatical structure that described style of writing content originally meets;Described both Determine grammatical structure to include defining grammatical structure and retaining grammatical structure;
First identifies submodule, for identify from described this content of style of writing meeting described definition grammatical structure with The corresponding source variable of form of pending text and separator;
Second identifies submodule, for identify from described this content of style of writing meeting described reservation grammatical structure with The corresponding target variable of form of target key value pair.
Preferably, described text matches module, specifically for from the beginning of the first character of pending text, according to described The order of source variable and separator is mated in pending text, in the matching result obtained, and first character to first Text between individual separator is the value of first source variable, and the text between two separators is the source variable of corresponding order Value.
Preferably, the data structure of described compiling result is class object.
Preferably, described text resolution template is additionally operable to describe the data check rule resolved;Described compiling result is also wrapped Include corresponding source variable regular with described data check and verify function accordingly;
Described device also includes:
Correction verification module, for utilizing verification function to verify the value of described source variable, if verification is passed through, then performs institute State the reservation source variable identical with described target variable and corresponding be worth, as with the described corresponding key-value pair of pending text Step, if verification is not passed through, then abandons described pending text.
Preferably, described text resolution template is additionally operable to describe the secondary data resolved and processes demand;Described compiling result Also include that processing the corresponding secondary data of demand with described secondary data processes source variable and corresponding new variables and secondary data Processing function, described secondary data processes function and is assigned to described for described secondary data processes the value of major key in source variable New variables;
The most described device also includes:
After-treatment module, processes source change for processing the corresponding secondary data of demand according to described and described secondary data Amount and corresponding new variables and secondary data process function, and described source variable and corresponding value are carried out secondary data process, To corresponding new variables and it is worth accordingly;
Second variable reservation module, for retaining the source variable identical with described target variable and new variables and corresponding Value, as with the described corresponding key-value pair of pending text.
Compared with prior art, the application has the advantage that
The application uses the mode of preset text resolution template and pending text matches, extracts from pending text Valuable information is to realize text resolution;Relative in prior art repeatedly full dose traversal, the application Chinese version resolve template Only need a full dose traversal with mating of pending text, therefore analyzing efficiency can be effectively improved, especially at text formatting very In the case of complicated or text size is the longest, the advantage that the application improves analyzing efficiency is the most notable.
If it addition, it is continuous to there are text formatting change, the process of data secondary data, data check etc. in resolving The parsing demand of change, needs in prior art frequently, revises code in large quantities, namely prior art has lacking of very flexible Point;And the application by the way of being pre-configured with by text resolution template and computer program code (resolving Chinese version solution Analysis template operates corresponding computer program code with the coupling of pending text etc.) separate, in the parsing that there is constantly change During demand, computer program code, without variation, only need to change text resolution template, compared with prior art, and this Shen Please be greatly improved the motility that text resolution is become key-value pair.
Accompanying drawing explanation
Fig. 1 is the flow chart of a kind of embodiment of the method 1 that text resolution becomes key-value pair of the application;
Fig. 2 is the flow chart of a kind of embodiment of the method 2 that text resolution becomes key-value pair of the application;
Fig. 3 is the flow chart of a kind of embodiment of the method 3 that text resolution becomes key-value pair of the application;
Fig. 4 is the structure chart of a kind of device embodiment that text resolution becomes key-value pair of the application.
Detailed description of the invention
Understandable for enabling the above-mentioned purpose of the application, feature and advantage to become apparent from, real with concrete below in conjunction with the accompanying drawings The application is described in further detail by mode of executing.
The embodiment of the present application uses text resolution template to describe the form of pending text and the form of target key value pair, and The mode using text resolution template and pending text matches therefrom extracts valuable information to realize text resolution.Relatively In prior art, repeatedly full dose travels through, and the application Chinese version resolves template and only needs a full dose time with mating of pending text Go through, therefore analyzing efficiency can be effectively improved, in the case of especially extremely complex at text formatting or text size is the longest, the application The advantage improving analyzing efficiency is the most notable.
With reference to Fig. 1, it is shown that the flow chart of a kind of embodiment of the method 1 that text resolution is become key-value pair of the application, specifically May include that
Step 101, the text resolution template of preset text formatting;Described text resolution template is used for describing pending text Form and the form of target key value pair;
In the embodiment of the present application, text resolution template itself is also a kind of self-defining programming language, this language It is used to the form of form and the target key value pair describing pending text.
In a kind of application example of the application, described text resolution template specifically can include definition statement and retain defeated Going out statement two parts, wherein, definition statement can be used for describing the form of pending text, retains output statement and can be used for describing mesh The form of mark key-value pair.
In actual applications, can be by the programmer of the art according to the form situation of pending text and target key value To form situation, preset described text resolution template.
With reference to table 1, it is shown that statement defined in the application a kind of text resolution template and the example of reservation output statement.
Table 1
Step 102, text resolution template to described text formatting are compiled, and obtain the discernible volume of computer program Translating result, described compiling result specifically may include that source variable corresponding with the form of pending text and separator, Yi Jiyu The corresponding target variable of form of target key value pair;
Text resolution template is made up of several rows text, computer program can not Direct Recognition these style of writing this, computer Program is discernible is the data structure (such as class object etc.) in internal memory, therefore the embodiment of the present application is first by described text resolution mould Plate is compiled into computer program discernible compiling result.Described compiling result specifically can include following content: with pending The corresponding source variable of form of text and separator, and target variable corresponding with the form of target key value pair, specifically, Having which source variable in pending text, what the separator between source variable is, the position at separator place, final target Key-value pair need retain target variable have which etc..
In a preferred embodiment of the present application, the data structure of described compiling result is class object.Wherein, class object An attribute can include that Map data structure, Map data structure here are a kind of to call the turn key-value pair number at program design language According to the synonym of structure, the character string expression of this data structure is { the such form of key=value}, such as " { 123= abc,456=def}”.In this manner it is possible to the content of compiling result is saved in Map data structure.
In order to realize compiling content described above, needing the compilation operations carried out is each of parsing text resolution template OK, that reads that every a line meets in set grammatical structure is any.Find meet any after, according in this grammer body Deposit structure and preserve the content of this row in text resolution template.Such as if the such definition statement of #define (), then this is fixed Comprising which identification variables out in justice statement, separator is which type of identifies, and the information that will identify that is saved in In Map data structure.
According to the principle of above-mentioned compilation operations, in a preferred embodiment of the present application, described text resolution template has Body can include several rows text;The described text resolution template to described text formatting is compiled, and obtains computer program The step of discernible compiling result, may further include:
Step A1, read this content of style of writing of described text resolution template;
Step A2, determine the set grammatical structure that described this content of style of writing meets;Described set grammatical structure specifically may be used To include defining grammatical structure and retaining grammatical structure;
Step A3, identify and pending text from described this content of style of writing meeting described definition grammatical structure The corresponding source variable of form and separator;
Step A4, identify and target key value pair from described this content of style of writing meeting described reservation grammatical structure The corresponding target variable of form.
As a example by text resolution template shown in table 1, the process of corresponding compilation operations may include that reading text resolution mould The text of every a line in plate, is defined as the text formatting of every a line defining grammatical structure and retaining grammatical structure four type One of which, wherein
Definition grammatical structure #define (), determines that the source variable specified in this section of text definition specifically may include that Api, action and state, the separator of text definition is ", ", only this one;
Retain grammatical structure #keep (), determine that the target variable needing finally to be retained specifically may include that@api@and@ action@。
Described above is exactly the process of compilation operations, and after performing above-mentioned compilation operations, the application can accordingly Compiling result preserves to memory data structure, example of one memory data structure of offer at this:
{“vars”:[“api”,“action”,“state”],“keepVars”:[“api”,“action”], “replaceStrArray”:[“,”,“,”,]}
Wherein, " vars " is used for storing source variable orderly, and " keepVars " is used for storing target variable orderly, " replaceStrArray " is used for storing separator orderly;In the case of not specified otherwise, in the embodiment of the present application Order and order are for representing the character such as source variable, target variable or character string sequence of positions in the text.
Step 103, mate in pending text according to described source variable and separator, obtain described source variable Value;
One application scenarios that text resolution is become key-value pair of offer at this: the business flowing water that system for settling account will generate Information is sent to governor's system, needs governor's system to be monitored for these business flowing water information, and wherein send over treats Processing text is " icbc001, CREATED, S ".This section of pending text resolution can be become key-value pair by governor's system, then preserves In pending message queue.System for settling account will take out the message body of these key-value pairs from pending message queue, then Perform.Owing to governor's core processing engine needs to use the message of key-value pair form, so all pending flowing water texts are all Key-value pair to be extracted into.
In a preferred embodiment of the present application, described enter in pending text according to described source variable and separator Row coupling, obtains the step of the value of described source variable, may further include:
From the beginning of the first character of pending text, according to the order of described source variable and separator at pending text In mate, in the matching result obtained, the text between first character to first separator is first source variable Value, the text between two separators is the value of the source variable of corresponding order.
According to upper example, " vars " stores source variable orderly, and " replaceStrArray " stores and divide orderly Every symbol, it is assumed that pending text is " icbc001, CREATED, S " in above-mentioned application scenarios, then, can be at pending literary composition Mating in Ben, the text " icbc001 " between first character and first separator is exactly first source variable " api " Value, the text " CREATED " between first separator and second separator is exactly the value of second source variable " action ", Text " S " between second separator and the 3rd separator is exactly the value of the 3rd source variable " state ".
Step 104, retain the source variable identical with described target variable and corresponding be worth, as with described pending text Corresponding key-value pair.
Owing to there being the span of target key value centering major key described in text resolution template, and according to text resolution mould Plate resolves and has obtained corresponding target variable, and finally we need " api " and " action " these two variablees.Therefore step 104 can Screen with the value by the source variable got at present, remove unwanted source variable, finally retain be exactly api: Icbc001, action:CREATED} such key-value pair result.
With reference to Fig. 2, it is shown that the flow chart of a kind of embodiment of the method 2 that text resolution is become key-value pair of the application, specifically May include that
Step 201, the text resolution template of preset text formatting;Described text resolution template is used for describing pending text Form, the form of target key value pair and parsing data check rule;
Step 202, text resolution template to described text formatting are compiled, and obtain the discernible volume of computer program Translating result, described compiling result specifically may include that source variable corresponding with the form of pending text and separator, with target The corresponding target variable of form of key-value pair, and with described data check rule corresponding source variable and verify letter accordingly Number;
Step 203, mate in pending text according to described source variable and separator, obtain described source variable Value;
The value of described source variable is verified by step 204, utilization verification function, if verification is passed through, then performs step 205, if verification is not passed through, then perform step 206;
Step 205, retain the source variable identical with described target variable and corresponding be worth, as with described pending text Corresponding key-value pair.
Step 206, abandon described pending text.
In some cases, the extension that there may be data check in resolving resolves demand, and these data checks lead to The most corresponding certain data check rule, such as, a data check rule specifies that the value of certain source variable can not be for character "-" etc. Deng.
Relative to embodiment of the method 1, the present embodiment uses text resolution template to describe the data check rule resolved, compiling Obtain corresponding source variable regular with described data check and verify function accordingly, and utilizing verification function to described source variable Value verify.
In a kind of application example of the application, described text resolution template specifically can include definition statement, retain defeated Going out statement and verification statement three part, wherein, definition statement can be used for describing the form of pending text, and retaining output statement can For describing the form of target key value pair, verification statement can be used for describing the data check rule resolved.
In a preferred embodiment of the present application, verification statement can corresponding regular expression.Regular expression is to use In carrying out the instrument of text matches, generally it is made up of some general character and some metacharacters (metacharacters).Commonly Character includes the letter and number of capital and small letter, and metacharacter then has special implication.The coupling of regular expression is appreciated that For, in given character string, find the part matched with given regular expression.Can be the most square by regular expression Just judge whether pending text meets the form that text resolution template specifies.Such as, an example of verification statement is ,@ State@==regex (S | F), its implication needs to meet regular expression " S | F ", if be unsatisfactory for, then school for " state " variable Testing and do not pass through, pending text does not meets form;That is, the value that only " state " variable is corresponding meets regular expression " S | F " Time, the corresponding pending text resolution value out of " state " variable just can be retained.
In a preferred embodiment of the present application, described text resolution template specifically can include several rows text;Institute State the text resolution template to described text formatting to be compiled, obtain the step of computer program discernible compiling result, May further include:
Step B1, read this content of style of writing of described text resolution template;
Step B2, determine the set grammatical structure that described this content of style of writing meets;Described set grammatical structure includes fixed Justice grammatical structure, reservation grammatical structure and verification grammatical structure;
Step B3, identify and pending text from described this content of style of writing meeting described definition grammatical structure The corresponding source variable of form and separator;
Step B4, identify and target key value pair from described this content of style of writing meeting described reservation grammatical structure The corresponding target variable of form;
Step B5, identify and described data check from described this content of style of writing meeting described verification grammatical structure The corresponding source variable of rule and verify function accordingly.
Assume that step B5 is determined for compliance with described this content of style of writing of described verification grammatical structure :@state@==regex (S | F), then can therefrom identify source variable to be verified is "==" left side " state ", needs to use the mode of regular expression Verifying, the verification function corresponding with regular expression is regex (S | F).In actual applications, recognition result can be deposited Storage to corresponding memory data structure, the example of a memory data structure be " equal ": " state ": [" regex (S | F)”]}}。
Step 204 uses the verification statement inside equal to process, and variable to be verified is " state ", according to verification Statement regex () represents to be needed to be verified by regular expression.Then after the value that " state " is corresponding being verified, if symbol Closing the then text to extract successfully, if do not met, the text being abandoned, do not do other and process.Assume that step 203 is mated Value to " state " variable is S, meets the content that regular expression " S | F " is corresponding, then may determine that currently pending text solution The result separated out is needs, is retained.
With reference to Fig. 3, it is shown that the flow chart of a kind of embodiment of the method 3 that text resolution is become key-value pair of the application, specifically May include that
Step 301, the text resolution template of preset text formatting;Described text resolution template is used for describing pending text Form, the form of target key value pair and parsing secondary data process demand;
Step 302, text resolution template to described text formatting are compiled, and obtain the discernible volume of computer program Translating result, described compiling result specifically may include that source variable corresponding with the form of pending text and separator, with target The corresponding target variable of form of key-value pair, and demand corresponding secondary data process source variable is processed with described secondary data And new variables and secondary data process function accordingly, described secondary data processes function for described secondary data is processed source In variable, the value of major key is assigned to described new variables;
Step 303, mate in pending text according to described source variable and separator, obtain described source variable Value;
Step 304, process the corresponding secondary data of demand according to described and described secondary data and process source variable and corresponding New variables and secondary data process function, described source variable and corresponding value are carried out secondary data process, obtain newly Variable and being worth accordingly;
Step 305, retain the source variable identical with described target variable and new variables and corresponding be worth, treating as with described Process the corresponding key-value pair of text.
In some cases, resolving there may be the extension parsing demand that secondary data processes.Such as, if with When the corresponding source variable of form of pending text has Map data structure, need { key=value}'s by after-treatment Map data structure is converted into new variables and the key-value pair of corresponding value.
Relative to embodiment of the method 1, the present embodiment uses text resolution template to describe the secondary data resolved and processes demand, Compiling obtains processing the corresponding secondary data of demand with described secondary data and processes source variable and corresponding new variables and two numbers According to processing statement, and process demand corresponding secondary data process source variable and the newest according to described and described secondary data Variable and secondary data process statement, described source variable and corresponding value are carried out secondary data process, obtains corresponding new variables And be worth accordingly.
In a kind of application example of the application, described text resolution template specifically can include definition statement, retain defeated Going out statement and secondary data processes statement three part, wherein, definition statement can be used for describing the form of pending text, retains defeated Going out statement and can be used for describing the form of target key value pair, secondary data processes statement and can be used for describing the secondary data process resolved Demand.
With reference to table 2, it is shown that statement, reservation output statement and two numbers defined in the application another kind text resolution template According to the example processing statement.
Table 2
In a preferred embodiment of the present application, described text resolution template specifically can include several rows text;Institute State the text resolution template to described text formatting to be compiled, obtain the step of computer program discernible compiling result, May further include:
Step C1, read this content of style of writing of described text resolution template;
Step C2, determine the set grammatical structure that described this content of style of writing meets;Described set grammatical structure includes fixed Justice grammatical structure, reservation grammatical structure and secondary data process grammatical structure;
Step C3, identify and pending text from described this content of style of writing meeting described definition grammatical structure The corresponding source variable of form and separator;
Step C4, identify and target key value pair from described this content of style of writing meeting described reservation grammatical structure The corresponding target variable of form;
Step C5, identify with described from described this content of style of writing meeting described secondary data and processing grammatical structure Secondary data processes the corresponding secondary data of demand and processes source variable and corresponding new variables and secondary data process function.
Assume that step B5 is determined for compliance with verifying described this content of style of writing of grammatical structure :@no@=getMap (Map, no), Then can therefrom identify secondary data and process source variable " Map ", and corresponding new variables " no ", and secondary data processes function getMap().Assume pending text for " icbc001, { no=123456}, CREATED, S ", in actual applications, the most permissible All of compiling result is stored to corresponding memory data structure, one with table 2 and the described corresponding internal memory of pending text The example of data structure be " vars ": [" api ", " Map ", " action ", " state "], " keepVars ": [" api ", “no”,“action”],“subStrings”:[“Map”],“mines”:{“no”:[“getMap(Map,no)”]}, " equal ": " state ": [" regex (S | F) "], " replaceStrArray ": [", ", ", ", ", "] }, wherein, " subStrings " is used for storing secondary data and processes source variable, and it is corresponding that " mines " is used for storing secondary data process source variable New variables and secondary data process function.
All source variables that step 304 uses the value preserved in " mines " to obtain coupling carry out secondary data process, The content preserved in " mines " is { " no ": [" getMap (Map, no) "] }.According to the syntactic definition of getMap (), two numbers It is " Map " according to processing source variable, and " Map " is the character string expression format of Map data type of a kind of java: { no= , the major key that variable is " no " in this data type, can be there is in 123456}, therefore after-treatment function getMap (Map, no) Value corresponding for this " no " major key can be extracted, be assigned to a new variable " no ".
After the after-treatment of step 304, variable and the respective value got specifically may include that api: icbc001,Map:{no=123456},no:123456,action:CREATED,state:S}.Further, through step 305 The value of the variable got is screened, removes unwanted variable " Map ", finally retain be exactly api:icbc001, No:123456, action:CREATED, state:S} such key-value pair result.
It should be noted that in addition to Map data structure, the data structure that described secondary data processes source variable is all right Including properties data structure, that is, the text of properties data structure is also required to carry out after-treatment.With Map Data structure is similar, and properties data structure is also a kind of code name calling the turn key-value pair data structure at program design language Word, but unlike Map data structure, in properties data structure, the separation between each key-value pair is newline, And not having outmost " { } ", the example of a kind of properties data structure is as follows:
123=abc
456=def
Embodiment of the method 2 and embodiment of the method 3 are respectively to there is the process of data secondary data, data check in resolving It is described Deng the parsing scheme extending parsing demand;In actual applications, in addition to extension parsing demand, in resolving It is also possible to the parsing demand that text formatting change etc. is constantly changed occurs, prior art needs frequently, revise generation in large quantities Code, namely the shortcoming that prior art has very flexible.
And the application by the way of being pre-configured with by text resolution template and computer program code (resolving Chinese This parsing template operates corresponding computer program code with the coupling of pending text etc.) separate, there is constantly change During parsing demand, it is only necessary to the content of amendment template file, completely without amendment program code, the most flexibly configuration mode Can accomplish that the operating system residing for program code is not restarted, and the effect of dynamically configuration can be played;Therefore, compared with existing Having technology, the application can be greatly improved the motility that text resolution becomes key-value pair.
In said method embodiment, the description to each embodiment of the method all emphasizes particularly on different fields, in certain embodiment of the method There is no the part described in detail, may refer to the associated description of additive method embodiment.
Further, those skilled in the art will be readily apparent: said method embodiment 1-embodiment of the method 3 combination in any Application is all feasible, therefore the combination in any between said method embodiment 1-embodiment of the method 3 is all the embodiment party of the present invention Case, but this specification details the most one by one at this as space is limited,.
Corresponding to preceding method embodiment, present invention also provides a kind of device that text resolution is become key-value pair, reference Structure chart shown in Fig. 4, specifically may include that
Template preset module 401, for the text resolution template of preset text formatting;Described text resolution template is used for retouching State the form of pending text and the form of target key value pair;
Template collector 402, for being compiled the text resolution template of described text formatting, obtains computer journey Sequence discernible compiling result, described compiling result includes: source variable corresponding with the form of pending text and separator, with And target variable corresponding with the form of target key value pair;
Text matches module 403, for mating in pending text according to described source variable and separator, obtains The value of described source variable;And
Variable reservation module 404, for retaining the source variable identical with described target variable and corresponding being worth, as with institute State the corresponding key-value pair of pending text.
In a preferred embodiment of the present application, described text resolution template specifically can include several rows text;Phase Ying Di, described template collector 402 may further include:
Reading submodule, for reading the content originally of composing a piece of writing of described text resolution template;
Grammatical structure determines submodule, for determining the set grammatical structure that described style of writing content originally meets;Described both Determine grammatical structure to include defining grammatical structure and retaining grammatical structure;
First identifies submodule, for identify from described this content of style of writing meeting described definition grammatical structure with The corresponding source variable of form of pending text and separator;
Second identifies submodule, for identify from described this content of style of writing meeting described reservation grammatical structure with The corresponding target variable of form of target key value pair.
In another preferred embodiment of the present application, described text matches module 403, can be specifically for from pending literary composition This first character starts, and mates according to the order of described source variable and separator, obtain in pending text In matching result, the value that text is first source variable between first character to first separator, two separators it Between the value of the source variable that text is corresponding order.
In the embodiment of the present application, it is preferred that the data structure of described compiling result is class object.
In the still another preferable embodiment of the application, described text resolution template is additionally operable to describe the data check resolved Rule;Described compiling result can also include corresponding source variable regular with described data check and verify function accordingly;
The most described device can also include:
Correction verification module, for utilizing verification function to verify the value of described source variable, if verification is passed through, then performs institute State the reservation source variable identical with described target variable and corresponding be worth, as with the described corresponding key-value pair of pending text Step, if verification is not passed through, then abandons described pending text.
In a preferred embodiment of the present application, described text resolution template is additionally operable to describe at the secondary data resolved Reason demand;Described compiling result can also include with described secondary data process the corresponding secondary data of demand process source variable and Corresponding new variables and secondary data process function, and described secondary data processes function and becomes for described secondary data is processed source In amount, the value of major key is assigned to described new variables;
The most described device can also include:
After-treatment module, processes source change for processing the corresponding secondary data of demand according to described and described secondary data Amount and corresponding new variables and secondary data process function, and described source variable and corresponding value are carried out secondary data process, To corresponding new variables and it is worth accordingly;
Second variable reservation module, for retaining the source variable identical with described target variable and new variables and corresponding Value, as with the described corresponding key-value pair of pending text.
In a preferred embodiment of the present application, described text resolution template specifically can include several rows text;Then The described text resolution template to described text formatting is compiled, and obtains the step of computer program discernible compiling result Suddenly, may further include:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure includes defining grammer knot Structure, reservation grammatical structure and verification grammatical structure;
The form phase with pending text is identified from the described style of writing meeting described definition grammatical structure content originally The source variable answered and separator;
The form phase with target key value pair is identified from the described style of writing meeting described reservation grammatical structure content originally The target variable answered;
Identify and described data check rule phase from the described style of writing meeting described verification grammatical structure content originally The source variable answered and verify function accordingly.
In the embodiment of the present application, it is preferred that described verification function can corresponding regular expression.
In another preferred embodiment of the present application, described text resolution template specifically can include several rows text; The most described text resolution template to described text formatting is compiled, and obtains the step of computer program discernible compiling result Suddenly, may further include:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure includes defining grammer knot Structure, reservation grammatical structure and secondary data process grammatical structure;
The form phase with pending text is identified from the described style of writing meeting described definition grammatical structure content originally The source variable answered and separator;
The form phase with target key value pair is identified from the described style of writing meeting described reservation grammatical structure content originally The target variable answered;
Identify and described two numbers from the described style of writing meeting the process grammatical structure of described secondary data content originally Source variable and corresponding new variables and secondary data process function is processed according to processing the corresponding secondary data of demand.
Each embodiment in this specification all uses the mode gone forward one by one to describe, what each embodiment stressed is with The difference of other embodiments, between each embodiment, identical similar part sees mutually.For device embodiment For, due to itself and embodiment of the method basic simlarity, so describe is fairly simple, relevant part sees the portion of embodiment of the method Defend oneself bright.
Those skilled in the art are it should be appreciated that embodiments herein can be provided as method, system or computer program Product.Therefore, the reality in terms of the application can use complete hardware embodiment, complete software implementation or combine software and hardware Execute the form of example.And, the application can use at one or more computers wherein including computer usable program code The upper computer program product implemented of usable storage medium (including but not limited to disk memory, CD-ROM, optical memory etc.) The form of product.
The application is with reference to method, equipment (system) and the flow process of computer program according to the embodiment of the present application Figure and/or block diagram describe.It should be understood that can the most first-class by computer program instructions flowchart and/or block diagram Flow process in journey and/or square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided Instruction arrives the processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device to produce A raw machine so that the instruction performed by the processor of computer or other programmable data processing device is produced for real The device of the function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame now.
These computer program instructions may be alternatively stored in and computer or other programmable data processing device can be guided with spy Determine in the computer-readable memory that mode works so that the instruction being stored in this computer-readable memory produces and includes referring to Make the manufacture of device, this command device realize at one flow process of flow chart or multiple flow process and/or one square frame of block diagram or The function specified in multiple square frames.
These computer program instructions also can be loaded in computer or other programmable data processing device so that at meter Perform sequence of operations step on calculation machine or other programmable devices to produce computer implemented process, thus at computer or The instruction performed on other programmable devices provides for realizing at one flow process of flow chart or multiple flow process and/or block diagram one The step of the function specified in individual square frame or multiple square frame.
Although having been described for the preferred embodiment of the application, but those skilled in the art once know basic creation Property concept, then can make other change and amendment to these embodiments.So, claims are intended to be construed to include excellent Select embodiment and fall into all changes and the amendment of the application scope.
Above to a kind of method and apparatus that text resolution is become key-value pair provided herein, carry out detailed Jie Continuing, principle and the embodiment of the application are set forth by specific case used herein, and the explanation of above example is only It is to be adapted to assist in understand the present processes and core concept thereof;Simultaneously for one of ordinary skill in the art, according to this Shen Thought please, the most all will change, and in sum, this specification content should not be managed Solve as the restriction to the application.

Claims (15)

1. the method that text resolution is become key-value pair, it is characterised in that including:
The text resolution template of preset text formatting;Described text resolution template is for describing form and the target of pending text The form of key-value pair;Described text resolution template includes definition statement and retains output statement, and wherein, definition statement is used for describing The form of pending text, retains output statement for describing the form of target key value pair;
The text resolution template of described text formatting is compiled, obtains computer program discernible compiling result, described Compiling result includes: source variable corresponding with the form of pending text and separator, and with the form phase of target key value pair The target variable answered;
Mate in pending text according to described source variable and separator, obtain the value of described source variable;
Retain the source variable identical with described target variable and corresponding be worth, as with the described corresponding key assignments of pending text Right.
2. the method for claim 1, it is characterised in that described text resolution template includes several rows text;Described right The text resolution template of described text formatting is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure include define grammatical structure and Retain grammatical structure;
Identify corresponding with the form of pending text from the described style of writing meeting described definition grammatical structure content originally Source variable and separator;
Identify corresponding with the form of target key value pair from the described style of writing meeting described reservation grammatical structure content originally Target variable.
3. the method for claim 1, it is characterised in that described according to described source variable and separator at pending text In mate, obtain the step of the value of described source variable, including:
From the beginning of the first character of pending text, enter in pending text according to the order of described source variable and separator Row coupling, in the matching result obtained, the value that text is first source variable between first character to first separator, Text between two separators is the value of the source variable of corresponding order.
4. the method as described in claim 1 or 2 or 3, it is characterised in that the data structure of described compiling result is class object.
5. the method as described in claim 1 or 3, it is characterised in that described text resolution template is additionally operable to describe the number resolved According to verification rule;Described compiling result also includes corresponding source variable regular with described data check and verifies function accordingly;
Described method also includes:
Utilize verification function that the value of described source variable is verified, if verification is passed through, then perform described reservation and described target Source variable that variable is identical and being worth accordingly, as the step with the described corresponding key-value pair of pending text, if verifying obstructed Cross, then abandon described pending text.
6. the method as described in claim 1 or 3, it is characterised in that described text resolution template is additionally operable to describe two resolved Secondary data processes demand;Described compiling result also includes that processing the corresponding secondary data of demand with described secondary data processes source change Amount and corresponding new variables and secondary data process function, and described secondary data processes function for described secondary data being processed In source variable, the value of major key is assigned to described new variables;
Described method also includes:
Process the corresponding secondary data of demand according to described and described secondary data and process source variable and corresponding new variables and two Secondary data processes function, and described source variable and corresponding value are carried out secondary data process, obtains corresponding new variables and accordingly Value;
Retain the source variable identical with described target variable and new variables and corresponding be worth, as corresponding to described pending text Key-value pair.
7. method as claimed in claim 5, it is characterised in that described text resolution template includes several rows text;Described right The text resolution template of described text formatting is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure include define grammatical structure, Retain grammatical structure and verification grammatical structure;
Identify corresponding with the form of pending text from the described style of writing meeting described definition grammatical structure content originally Source variable and separator;
Identify corresponding with the form of target key value pair from the described style of writing meeting described reservation grammatical structure content originally Target variable;
Identify corresponding with described data check rule from the described style of writing meeting described verification grammatical structure content originally Source variable and verify function accordingly.
8. method as claimed in claim 6, it is characterised in that described text resolution template includes several rows text;Described right The text resolution template of described text formatting is compiled, and obtains the step of computer program discernible compiling result, including:
Read the content originally of composing a piece of writing of described text resolution template;
Determine the set grammatical structure that described style of writing content originally meets;Described set grammatical structure include define grammatical structure, Retain grammatical structure and secondary data processes grammatical structure;
Identify corresponding with the form of pending text from the described style of writing meeting described definition grammatical structure content originally Source variable and separator;
Identify corresponding with the form of target key value pair from the described style of writing meeting described reservation grammatical structure content originally Target variable;
Identify from described this content of style of writing meeting described secondary data and processing grammatical structure with described secondary data at The corresponding secondary data of reason demand processes source variable and corresponding new variables and secondary data processes function.
9. method as claimed in claim 5, it is characterised in that described verification function correspondence regular expression.
10. the device that text resolution is become key-value pair, it is characterised in that including:
Template preset module, for the text resolution template of preset text formatting;Described text resolution template is used for describing to be waited to locate The form of reason text and the form of target key value pair;Described text resolution template includes definition statement and retains output statement, its In, definition statement, for describing the form of pending text, retains output statement for describing the form of target key value pair;
Template collector, for being compiled the text resolution template of described text formatting, obtaining computer program can know Other compiling result, described compiling result includes: source variable corresponding with the form of pending text and separator, and and mesh The corresponding target variable of form of mark key-value pair;
Text matches module, for mating in pending text according to described source variable and separator, obtains described source The value of variable;And
Variable reservation module, for retaining the source variable identical with described target variable and corresponding being worth, locates as with described waiting The reason corresponding key-value pair of text.
11. devices as claimed in claim 10, it is characterised in that described text resolution template includes several rows text;Described Template collector includes:
Reading submodule, for reading the content originally of composing a piece of writing of described text resolution template;
Grammatical structure determines submodule, for determining the set grammatical structure that described style of writing content originally meets;Described both attributes Method structure includes defining grammatical structure and retaining grammatical structure;
First identifies submodule, from identifying and treat from the described content originally of composing a piece of writing meeting described definition grammatical structure The corresponding source variable of form of reason text and separator;
Second identifies submodule, for identifying and target from the described content originally of composing a piece of writing meeting described reservation grammatical structure The corresponding target variable of form of key-value pair.
12. devices as claimed in claim 10, it is characterised in that described text matches module, specifically for from pending literary composition This first character starts, and mates according to the order of described source variable and separator, obtain in pending text In matching result, the value that text is first source variable between first character to first separator, two separators it Between the value of the source variable that text is corresponding order.
13. devices as described in claim 10 or 12, it is characterised in that the data structure of described compiling result is class object.
14. devices as described in claim 10 or 12, it is characterised in that described text resolution template is additionally operable to describe parsing Data check rule;Described compiling result also includes corresponding source variable regular with described data check and verifies letter accordingly Number;
Described device also includes:
Correction verification module, for utilizing verification function to verify the value of described source variable, if verification is passed through, then performs described guarantor Stay the source variable identical with described target variable and corresponding be worth, as the step with the described corresponding key-value pair of pending text Suddenly, if verification is not passed through, then described pending text is abandoned.
15. devices as described in claim 10 or 11 or 12, it is characterised in that described text resolution template is additionally operable to describe and solves The secondary data of analysis processes demand;Described compiling result also includes processing at the corresponding secondary data of demand with described secondary data Reason source variable and corresponding new variables and secondary data process function, and described secondary data processes function for by described two numbers It is assigned to described new variables according to processing the value of major key in source variable;
Described device also includes:
After-treatment module, for according to described and described secondary data process demand corresponding secondary data process source variable and Corresponding new variables and secondary data process function, described source variable and corresponding value are carried out secondary data process, obtains phase Answer new variables and be worth accordingly;
Second variable reservation module, for retaining the source variable identical with described target variable and new variables and corresponding being worth, work For with the described corresponding key-value pair of pending text.
CN201210180496.6A 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair Active CN103455471B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210180496.6A CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210180496.6A CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Publications (2)

Publication Number Publication Date
CN103455471A CN103455471A (en) 2013-12-18
CN103455471B true CN103455471B (en) 2016-12-14

Family

ID=49737858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210180496.6A Active CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Country Status (1)

Country Link
CN (1) CN103455471B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106919542B (en) * 2015-12-24 2020-04-21 北京国双科技有限公司 Rule matching method and device
CN108345511B (en) * 2017-01-24 2022-02-08 阿里巴巴集团控股有限公司 Application data checking method and device and electronic equipment
CN107977440B (en) * 2017-12-07 2020-11-27 网宿科技股份有限公司 A method, device and system for parsing data files
CN108170432B (en) * 2017-12-12 2020-10-16 上海天旦网络科技发展有限公司 Configurable data packet decoding system and method
CN108511077A (en) * 2018-01-25 2018-09-07 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing document creation method, electronic equipment, storage medium
CN108491364A (en) * 2018-01-25 2018-09-04 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing paperwork management system
CN108989902B (en) * 2018-07-03 2020-10-16 武汉斗鱼网络科技有限公司 Barrage message processing method and device, terminal and storage medium
CN109299887B (en) * 2018-11-05 2022-04-19 创新先进技术有限公司 Data processing method and device and electronic equipment
CN109726284B (en) * 2018-12-07 2021-02-02 成都品果科技有限公司 Data analysis method with strong universality
CN111950257A (en) * 2020-07-31 2020-11-17 新华三半导体技术有限公司 File analysis method and device
CN113553297B (en) * 2021-06-08 2023-01-06 优刻得科技股份有限公司 Management method and system for switch configuration information
CN117692202B (en) * 2023-12-11 2024-08-09 北京安胜华信科技有限公司 Method, system, electronic equipment and storage medium for grabbing authentication parameters

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101164039A (en) * 2005-03-02 2008-04-16 谷歌公司 Generating structured information
CN101799808A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Data processing method and system thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7440888B2 (en) * 2004-09-02 2008-10-21 International Business Machines Corporation Methods, systems and computer program products for national language support using a multi-language property file

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101164039A (en) * 2005-03-02 2008-04-16 谷歌公司 Generating structured information
CN101799808A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Data processing method and system thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MapReduce: Simplified Data Processing on Large Clusters;Jeffrey Dean等;《COMMUNICATIONS OF THE ACM》;20080131;第51卷(第1期);第107-113页 *

Also Published As

Publication number Publication date
CN103455471A (en) 2013-12-18

Similar Documents

Publication Publication Date Title
CN103455471B (en) A kind of method and apparatus that text resolution is become key-value pair
CN109661663B (en) Context parsing device and computer-readable recording medium
Guerrouj et al. Tidier: an identifier splitting approach using speech recognition techniques
US20170103061A1 (en) Interaction apparatus and method
CN110716749B (en) Code searching method based on functional similarity matching
CN104462085A (en) Method and device for correcting search keywords
CN110059455A (en) Code encryption method, apparatus, electronic equipment and computer readable storage medium
CN113961768B (en) Sensitive word detection method and device, computer equipment and storage medium
CN105094941B (en) It is a kind of to realize multilingual method and device
JP2016157407A (en) Prior construction method of vocabulary semantic pattern for text analysis and response system
CN110096264A (en) A kind of code operation method and device
CN106547765B (en) SQ L-based database management method and device
CN112287071A (en) Text relation extraction method and device and electronic equipment
CN107515739A (en) Improve the method and device of code execution performance
Thomas et al. Not all links are equal: Exploiting dependency types for the extraction of protein-protein interactions from text
CN104298766B (en) A kind of method of data in removing SQLite databases
CN110688151A (en) A secure translation and parsing method for Ethereum Solidity smart contracts
CN106598828A (en) Method and device for determining invalid class in source code
CN105843661B (en) A kind of code method for relocating and its system towards host system
WO2019148797A1 (en) Natural language processing method, device, computer apparatus, and storage medium
CN119066189A (en) Text content extraction method, device, electronic device and storage medium
CN116205233A (en) A method, device, device and storage medium for identifying key fields in a document
Guest Parsing for role and reference grammar
JP6116983B2 (en) Entry point extraction device
Ridge Simple, efficient, sound and complete combinator parsing for all context-free grammars, using an oracle

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20191210

Address after: P.O. Box 31119, grand exhibition hall, hibiscus street, 802 West Bay Road, Grand Cayman, Cayman Islands

Patentee after: Innovative advanced technology Co., Ltd

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Co., Ltd.

TR01 Transfer of patent right