[go: up one dir, main page]

CN114691265B - A batch capture method for structural elements of software interface - Google Patents

A batch capture method for structural elements of software interface Download PDF

Info

Publication number
CN114691265B
CN114691265B CN202210224918.9A CN202210224918A CN114691265B CN 114691265 B CN114691265 B CN 114691265B CN 202210224918 A CN202210224918 A CN 202210224918A CN 114691265 B CN114691265 B CN 114691265B
Authority
CN
China
Prior art keywords
node
selector
node group
rpa robot
structured data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210224918.9A
Other languages
Chinese (zh)
Other versions
CN114691265A (en
Inventor
张涛
何双钻
刘迪康
金克
邵万骏
纪达麒
陈运文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Daguan Data Co ltd
Original Assignee
Daguan Data Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Daguan Data Co ltd filed Critical Daguan Data Co ltd
Priority to CN202210224918.9A priority Critical patent/CN114691265B/en
Publication of CN114691265A publication Critical patent/CN114691265A/en
Application granted granted Critical
Publication of CN114691265B publication Critical patent/CN114691265B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/38Creation or generation of source code for implementing user interfaces

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Stored Programmes (AREA)

Abstract

本发明涉及一种软件界面结构化元素的批量抓取方法,包括开发流程阶段和运行流程阶段,在开发流程阶段获取三个选择器,三个选择器分别对应祖先节点、孩子节点组和孩子节点组中的目标节点;运行流程阶段RPA机器人通过三个选择器提取结构化数据,获得由目标节点组成的节点组,RPA机器人获取节点组内目标节点的属性并按照节点组和孩子节点组生成结构化数据的二维数组。本发明解决现有技术中RPA机器人获取结构化数据时遇到的系统软件不提供API、数据分页、数据结构不明显的问题,简化了RPA机器人获取结果化数据的过程以及提高RPA机器人获取结果化数据的效率。

The present invention relates to a method for batch crawling of structured elements of a software interface, including a development process stage and an operation process stage, in which three selectors are obtained, and the three selectors correspond to an ancestor node, a child node group, and a target node in the child node group respectively; in the operation process stage, an RPA robot extracts structured data through three selectors, obtains a node group composed of target nodes, and the RPA robot obtains the attributes of the target node in the node group and generates a two-dimensional array of structured data according to the node group and the child node group. The present invention solves the problems encountered in the prior art when the RPA robot obtains structured data, that is, the system software does not provide an API, data paging, and the data structure is not obvious, simplifies the process of the RPA robot obtaining result data, and improves the efficiency of the RPA robot in obtaining result data.

Description

Batch grabbing method for structured elements of software interface
Technical Field
The invention relates to the field of RPA robot process automation, in particular to a batch grabbing method for software interface structural elements.
Background
Robot process automation (Robotic Process Automation, hereinafter referred to as RPA) is software running on a personal computer or a server, and has the main functions of simulating a person to control various system software and automatically execute a business process, and is suitable for a large-batch and high-repetition business scenario. One of the main functions of RPA robots is to acquire data on the system software interface, which includes structured data (structured data refers to data that exists in a fixed format within a record file, such as a table, a directory, etc.).
The main method for obtaining structured data at present is to call an application programming interface (Application Programming Interface, hereinafter abbreviated as API) provided by system software, and the method has the following problems:
1. invalidating system software that does not provide an API;
2. Invalidating structured data for the paged display;
3. Data that is not obvious to the structure is invalidated.
In order to solve the above technical problems, the solution proposed in the prior art is to acquire each data in the structuring separately and then splice the data according to the rule of discovery, however, the solution has the following problems:
1. Finding the regularity of structured data is difficult;
2. when the data volume is large, it takes time to acquire each data individually;
3. the obtained data is low in combination efficiency according to a specific rule and is easy to make mistakes.
Disclosure of Invention
The invention provides a batch grabbing method for structured elements of a software interface, which aims to solve the problems that system software does not provide an API (application program interface), data paging and an unobvious data structure when an RPA (remote procedure for processing) robot acquires structured data in the prior art, simplify the process of acquiring the resultant data by the RPA robot and improve the efficiency of acquiring the resultant data by the RPA robot.
The technical purpose of the invention is realized by the following technical scheme:
The batch grabbing method for the structured elements of the software interface comprises a development flow stage and an operation flow stage, wherein three selectors are obtained in the development flow stage and correspond to ancestor nodes, child node groups and target nodes in the child node groups respectively; and in the operation flow stage, the RPA robot extracts the structured data through three selectors to obtain a node group consisting of target nodes, acquires the attribute of the target nodes in the node group, and generates a two-dimensional array of the structured data according to the node group and the child node group.
Further, in the operation flow stage, when the structural data are in different pages, the RPA robot performs page turning operation, acquires the structural data of each page, and splices the acquired structural data together.
Further, the selector maintains element attributes and hierarchical relationships among the target node, child node group, and ancestor nodes.
Further, the development flow stage includes the following steps:
Manually selecting interface elements with the same type of two controls in the structured data;
Analyzing the hierarchy and the position relation of two interface elements by means of an analysis module, finding a common ancestor node of the two interface elements, and taking a selector of the ancestor node as a first selector;
Step three, obtaining all child node groups of ancestor nodes, and taking a selector for searching the child node groups from the ancestor nodes as a second selector;
step four, taking a selector of nodes with the same type of the searching control in the child node group as a third selector; if two interface elements in step 1 are at the child node group level, then the third selector is empty.
Further, in the operation flow stage, the RPA robot extracts the structured data through three selectors, and includes the following steps:
step 1, the RPA robot finds an ancestor node according to a first selector;
Step 2, the RPA robot finds a child node group according to the second selector;
Step 3, the RPA robot searches corresponding target nodes under the child nodes from each child node of the child node group according to the second selector and forms a target node group;
and 4, the RPA robot acquires the appointed attribute of the target node and forms a two-dimensional array of the structured data according to the target node group and the child node group.
Further, the analysis module is a syntax analysis algorithm.
Compared with the prior art, the invention has the beneficial effects that:
1. The batch grabbing method of the software interface structural elements supports system software which does not provide an API for extracting the structural data;
2. the method comprises the steps of automatically turning pages through an RPA robot to support acquisition of structured data of paging reality;
3. The method of the invention supports the extraction of data with unobvious structure;
4. The method is simple to operate, and the structural rule among the data is not required to be understood;
5. The generated result is three element selectors, the result relation of the data is intuitively displayed, and the understanding and the modification are convenient;
6. the method of the invention greatly improves the efficiency of obtaining the structured data.
Drawings
FIG. 1 is a schematic diagram of a system application interface element node in the present invention.
FIG. 2 is a diagram showing the structure of example 1.
FIG. 3 is a diagram showing the structure of example 2.
FIG. 4 is a diagram showing the structure of example 3.
Detailed Description
The technical scheme of the invention is further described below with reference to the specific embodiments:
1-4, a method for capturing structured elements of a software interface in batches, which includes a development process stage and an operation process stage, wherein the selector is a mode for selecting elements, and is stored and transmitted in text form, such as a CSS selector, and the selector is used for positioning the interface elements, some attributes of target interface elements and hierarchical relations among the elements are stored in the selector, and an RPA robot can position one or more interface elements meeting the conditions of the selector according to the attributes in the selector.
In the development flow stage, two interface elements in the structured data are captured, and three selectors are obtained through analysis:
Step one, manually selecting two interface elements (node A and node B for short) in the structured data, and prompting to re-select if the control types of the two interface elements (the control types are called as control type C) are different.
And step two, analyzing the hierarchy and the position relation of the two data by an analysis module, wherein the analysis module adopts a general grammar analysis algorithm to find a common ancestor node (called node G for short) of the two elements, and takes a selector of the node G as a first selector. The function of the grammar analysis algorithm is to analyze two input selector texts, generate three output selector texts according to specific rules, and indirectly realize the analysis processing of the selection conditions through the analysis processing of the texts.
Step three, obtaining all child node groups (called node group F for short) of the node G, and taking a selector for searching the node group F from the node G as a second selector;
Step four, a selector for searching a node of the control type C from a certain node in the node group F is used as a third selector; the third selector is empty if node a and node B are at the level of node group F, i.e. node a and node B belong directly to node G.
In the run-time phase, the RPA robot extracts structured data according to three selectors:
Step 1, the RPA robot finds a node G according to a first selector;
Step 2, the RPA robot finds a node group F according to the second selector;
Step 3, the RPA robot searches all target nodes from each node of the node group F according to a second selector and forms a target node group (node group C);
Step4, the RPA robot acquires the appointed attribute of the target node and forms a two-dimensional array of the structured data according to the node group C and the node group F;
And 5, if the page needs to be turned, the RPA robot automatically clicks a page turning button to turn the page, a new page acquires the structured data from the first step, and the data acquired before are spliced together.
The following description is made in connection with specific embodiments:
Taking a company as an example, the employee is regarded as an element, the name and the sex of the employee are regarded as element attributes, and the organization structure of the company corresponds to the hierarchical relationship of the elements.
Example 1
A men are development group members of the research and development department, B men are test group members of the research and development department;
analyzing the common points of the A men and the B men to generate three selectors which respectively correspond to the research and development departments, the group owners under the research and development departments and the male panelists managed by each group owner;
selector 1-select research and development department;
selector 2-select all group lengths;
Selector 3-select all male panelists;
the RPA robot extracts structured data through three selectors, and a table can be obtained, wherein the rows and columns of the table respectively correspond to the research and development department group length and male panelists managed by the corresponding research and development department group length.
Example 2
Male C is a development group leader of a research and development department, and female D is a test group leader of the research and development department; analyzing the common points of the male C and the female D to generate three selectors which respectively correspond to the research and development departments, group companies under the research and development departments and group members under the management of each group company;
selector 1-select research and development department;
selector 2-select all group lengths;
Selector 3-select to group leader itself, selector 3 is empty;
The RPA robot extracts structured data through three selectors, e.g., a single list may be obtained, containing all the group owners of the development department.
Example 3
Man E is a development group leader of a research and development department, and man F is a development group member of the research and development department; analysis of the commonality of men E and men F reveals that the positions of men E and men F relative to the common node development department are different, at different levels, and that the generation of the selector fails and needs to be re-selected.
The present embodiment is further illustrative of the present invention and is not to be construed as limiting the invention, and those skilled in the art can make no inventive modifications to the present embodiment as required after reading the present specification, but only as long as they are within the scope of the claims of the present invention.

Claims (3)

1.一种软件界面结构化元素的批量抓取方法,其特征在于,包括开发流程阶段和运行流程阶段,在开发流程阶段获取三个选择器,三个选择器分别对应祖先节点、孩子节点组和孩子节点组中的目标节点,所述选择器保存了目标节点、孩子节点组以及祖先节点中的元素属性及层级关系;运行流程阶段RPA机器人通过三个选择器提取结构化数据,获得由目标节点组成的节点组,RPA机器人获取节点组内目标节点的属性并按照节点组和孩子节点组生成结构化数据的二维数组;1. A method for batch crawling of structured elements of a software interface, characterized in that it includes a development process stage and an operation process stage. In the development process stage, three selectors are obtained, and the three selectors correspond to an ancestor node, a child node group, and a target node in the child node group, respectively. The selectors store the element attributes and hierarchical relationships in the target node, the child node group, and the ancestor node; in the operation process stage, an RPA robot extracts structured data through the three selectors to obtain a node group composed of target nodes, and the RPA robot obtains the attributes of the target node in the node group and generates a two-dimensional array of structured data according to the node group and the child node group; 开发流程阶段包括以下步骤:The development process phase includes the following steps: 步骤一、人工选取结构化数据中的两个控件类型相同的界面元素;Step 1: Manually select two interface elements of the same control type in the structured data; 步骤二、借助分析模块分析两个界面元素的层级和位置关系,找到两个所述界面元素的共同祖先节点,将祖先节点的选择器作为第一个选择器;Step 2: Analyze the hierarchy and position relationship of the two interface elements with the help of the analysis module, find the common ancestor node of the two interface elements, and use the selector of the ancestor node as the first selector; 步骤三、获取祖先节点的所有孩子节点组,将从祖先节点查找孩子节点组的选择器作为第二个选择器;Step 3: Get all child node groups of the ancestor node, and use the selector for searching child node groups from the ancestor node as the second selector; 步骤四、将孩子节点组中的查找控件类型相同的节点的选择器作为第三个选择器;若步骤1中的两个界面元素在孩子节点组层级,则第三个选择器为空;Step 4: Use the selector of the node with the same control type in the child node group as the third selector; if the two interface elements in step 1 are at the child node group level, the third selector is empty; 在运行流程阶段,RPA机器人通过三个选择器提取结构化数据时包括以下步骤:During the process run phase, the RPA robot extracts structured data through three selectors, including the following steps: 步骤1、RPA机器人根据第一个选择器找到祖先节点;Step 1: The RPA robot finds the ancestor node based on the first selector; 步骤2、RPA机器人根据第二个选择器找到孩子节点组;Step 2: The RPA robot finds the child node group based on the second selector; 步骤3、RPA机器人根据第三个选择器,分别从孩子节点组的每个孩子节点出发查找孩子节点下对应的目标节点并形成目标节点组;Step 3: Based on the third selector, the RPA robot searches for the corresponding target node under each child node of the child node group and forms a target node group. 步骤4、RPA机器人获取目标节点的指定属性,并按照目标节点组和孩子节点组形成结构化数据的二维数组。Step 4: The RPA robot obtains the specified attributes of the target node and forms a two-dimensional array of structured data according to the target node group and the child node group. 2.根据权利要求1所述的一种软件界面结构化元素的批量抓取方法,其特征在于,运行流程阶段,当结构数据处于不同页面时,RPA机器人进行翻页操作,获取每页的结构化数据并将获取的结构化数据拼接在一起。2. According to claim 1, a method for batch crawling of structured elements of a software interface is characterized in that, during the process operation stage, when the structured data is on different pages, the RPA robot performs a page turning operation, obtains the structured data of each page and splices the obtained structured data together. 3.根据权利要求1所述的一种软件界面结构化元素的批量抓取方法,其特征在于,所述分析模块为语法分析算法。3. A method for batch capturing structural elements of a software interface according to claim 1, characterized in that the analysis module is a grammatical analysis algorithm.
CN202210224918.9A 2022-03-07 2022-03-07 A batch capture method for structural elements of software interface Active CN114691265B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210224918.9A CN114691265B (en) 2022-03-07 2022-03-07 A batch capture method for structural elements of software interface

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210224918.9A CN114691265B (en) 2022-03-07 2022-03-07 A batch capture method for structural elements of software interface

Publications (2)

Publication Number Publication Date
CN114691265A CN114691265A (en) 2022-07-01
CN114691265B true CN114691265B (en) 2024-11-15

Family

ID=82137428

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210224918.9A Active CN114691265B (en) 2022-03-07 2022-03-07 A batch capture method for structural elements of software interface

Country Status (1)

Country Link
CN (1) CN114691265B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117573006B (en) * 2024-01-17 2024-05-10 杭州实在智能科技有限公司 Method and system for batch picking of RPA screen interface elements

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111460259A (en) * 2020-03-31 2020-07-28 腾讯科技(深圳)有限公司 Method and device for determining similar elements, computer equipment and storage medium
CN113015956A (en) * 2019-10-14 2021-06-22 尤帕斯公司 System and method for active target selection for robotic process automation

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8788525B2 (en) * 2012-09-07 2014-07-22 Splunk Inc. Data model for machine data for semantic search
CN112579852B (en) * 2019-09-30 2023-01-10 厦门邑通智能科技集团有限公司 Interactive webpage data accurate acquisition method
JP2023528092A (en) * 2020-05-11 2023-07-04 ユーアイパス,インコーポレイテッド Selection of Graphical Element Search Techniques, Fuzzy Logic Selection of Anchors and Targets, and/or Hierarchical Graphical Element Identification for Robotic Process Automation
CN113176922B (en) * 2021-04-30 2024-05-24 上海云扩信息科技有限公司 Universal interface element automatic detection system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113015956A (en) * 2019-10-14 2021-06-22 尤帕斯公司 System and method for active target selection for robotic process automation
CN111460259A (en) * 2020-03-31 2020-07-28 腾讯科技(深圳)有限公司 Method and device for determining similar elements, computer equipment and storage medium

Also Published As

Publication number Publication date
CN114691265A (en) 2022-07-01

Similar Documents

Publication Publication Date Title
US12164568B2 (en) Tool to build and store a data model and queries for a graph database
CN110941612B (en) System and method for constructing autonomous data lake based on linked data
US11023500B2 (en) Systems and methods for code parsing and lineage detection
CN101971165B (en) Graphical representation of data relationships
CN105122243B (en) Expansible analysis platform for semi-structured data
CN111984745B (en) Database field dynamic expansion method, device, equipment and storage medium
CN111858608A (en) A data management method, device, server and storage medium
CN114385620A (en) Data processing method, device, equipment and readable storage medium
CN110109981B (en) Information display method and device for work queue, computer equipment and storage medium
CN117216042B (en) A method and device for constructing a data standardization platform
US20220374337A1 (en) Techniques for visual software test management
CN114356964A (en) Data lineage construction method, device, storage medium and electronic device
CN114594927A (en) Low-code development method, device, system, server and storage medium
CN114691265B (en) A batch capture method for structural elements of software interface
KR102275135B1 (en) Data collection and analyzing system and construction method of the same for project
CN113779117A (en) Data monitoring method and device, storage medium and electronic equipment
Wu et al. On the use of virtual knowledge graphs to improve environmental sensor data accessibility
US10223086B2 (en) Systems and methods for code parsing and lineage detection
CN103809915B (en) The reading/writing method of a kind of disk file and device
CN118035204A (en) Data blood edge display method, distributed task scheduling system and storage medium
CN116127047B (en) Method and device for establishing enterprise information database
CN111881660A (en) Report generation method and device, computer equipment and storage medium
CN116841900A (en) Interface document, test case generation method, test case generation device, computer equipment and storage medium
CN115033634B (en) Data collection method, device, electronic device and medium
CN116594628A (en) Data tracing method and device and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Country or region after: China

Address after: Room 501, 502, and 503, No. 66 Boxia Road, Pudong New Area, Shanghai, March 2012

Applicant after: Daguan Data Co.,Ltd.

Address before: 201203 rooms 301, 303 and 304, block B, 112 liangxiu Road, Pudong New Area, Shanghai

Applicant before: DATAGRAND INFORMATION TECHNOLOGY (SHANGHAI) Co.,Ltd.

Country or region before: China

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant