[go: up one dir, main page]

CN116524525A - Table processing method, device, system and storage medium - Google Patents

Table processing method, device, system and storage medium Download PDF

Info

Publication number
CN116524525A
CN116524525A CN202310389640.5A CN202310389640A CN116524525A CN 116524525 A CN116524525 A CN 116524525A CN 202310389640 A CN202310389640 A CN 202310389640A CN 116524525 A CN116524525 A CN 116524525A
Authority
CN
China
Prior art keywords
area
image
target
table area
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310389640.5A
Other languages
Chinese (zh)
Inventor
李超
陈永录
李变
刘斐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202310389640.5A priority Critical patent/CN116524525A/en
Publication of CN116524525A publication Critical patent/CN116524525A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/34Smoothing or thinning of the pattern; Morphological operations; Skeletonisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/19007Matching; Proximity measures
    • G06V30/19073Comparing statistics of pixel or of feature values, e.g. histogram matching
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)

Abstract

本申请提供一种表格处理方法、装置、系统及存储介质,涉及人工智能领域。该方法包括:获取源表格图像;采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域;检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域;对目标表格区域中的文字部分进行定位,得到文字定位的表格图像。本申请的方法,实现了对表格的准确识别。

The present application provides a form processing method, device, system and storage medium, which relate to the field of artificial intelligence. The method comprises: obtaining a source table image; extracting a table area in the source table image by using a histogram statistical method to obtain a pre-extracted table area; detecting whether there is a target attribute in the pre-extracted table area, and extracting from the pre-extracted table area according to the detection result A target table area is screened out from the table area; a text part in the target table area is positioned to obtain a table image for text positioning. The method of the present application realizes accurate identification of forms.

Description

表格处理方法、装置、系统及存储介质Table processing method, device, system and storage medium

技术领域technical field

本申请涉及人工智能领域,尤其涉及一种表格处理方法、装置、系统及存储介质。The present application relates to the field of artificial intelligence, and in particular to a form processing method, device, system and storage medium.

背景技术Background technique

表格数据数量庞大,对表格数据的数字化保存显得尤为重要,对表格图像的处理是对表格数据的数字化保存的重要前提。The amount of tabular data is huge, and the digital preservation of tabular data is particularly important. The processing of tabular images is an important prerequisite for the digital preservation of tabular data.

目前现有技术中,对表格图像的处理主要针对表格框线完整、连续,横纵向框线又交界点的规则表格,采用分割法对表格进行分割,使用传统相加表格或表内文字定位的方式对表格中的文字进行定位处理。In the current existing technology, the processing of form images is mainly aimed at regular forms with complete and continuous form frames, horizontal and vertical frame lines and intersection points, and the segmentation method is used to segment the form, using traditional addition forms or text positioning in the form method to position the text in the table.

然而,发明人发现至少存在以下技术问题:对于框线残缺等非规则表格的图像,现有技术在处理过程中鲁棒性欠佳,准确率不高。However, the inventors have found at least the following technical problems: For images of irregular forms such as incomplete frame lines, the prior art has poor robustness and low accuracy during processing.

发明内容Contents of the invention

本申请提供一种表格处理方法、装置、系统及存储介质,用以解决无法对非规则表格进行准确处理的问题。The present application provides a form processing method, device, system and storage medium to solve the problem that irregular forms cannot be accurately processed.

第一方面,本申请提供一种表格处理方法,包括:In a first aspect, the present application provides a form processing method, including:

获取源表格图像;Get the source table image;

采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域;Using the histogram statistical method to extract the form area in the source form image, to obtain the pre-extracted form area;

检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域;Detect whether there is a target attribute in the pre-extracted table area, and filter out the target table area from the pre-extracted table area according to the detection result;

对目标表格区域中的文字部分进行定位,得到文字定位的表格图像Locate the text part in the target table area to get the table image where the text is positioned

在一种可能的设计中,采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域,包括:对源表格图像进行灰度二值化处理,得到灰度二值化处理后的图像;对灰度二值化处理后的图像进行纵向直方图统计处理,得到纵向直方图统计处理后的图像;对纵向直方图统计处理后的图像进行最小外界矩形检测,确定表格区域的横向边界;对灰度二值化处理后的图像进行横向直方图统计处理,得到横向直方图统计处理后的图像;对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界;根据横向边界和纵向边界,确定表格区域的矩形;根据预定义的第一筛选规则对表格区域的矩形进行筛选,得到筛选后的表格区域的矩形;基于筛选后的表格区域的矩形从源表格图像中,提取得到预提取的表格区域。In a possible design, the histogram statistical method is used to extract the table area in the source table image, and the pre-extracted table area is obtained, including: performing grayscale binarization processing on the source table image to obtain grayscale binarization processing The image after the grayscale binarization processing is carried out to obtain the image after the statistical processing of the vertical histogram; the image after the statistical processing of the vertical histogram is subjected to the minimum external rectangle detection to determine the size of the table area Horizontal boundary; carry out horizontal histogram statistical processing on the image after grayscale binarization processing, and obtain the image after horizontal histogram statistical processing; perform minimum circumscribed rectangle detection on the image after horizontal histogram statistical processing, and determine the vertical direction of the table area Boundary; determine the rectangle of the table area according to the horizontal boundary and the vertical boundary; filter the rectangle of the table area according to the predefined first filtering rule to obtain the rectangle of the filtered table area; obtain the rectangle of the filtered table area based on the rectangle of the filtered table area from the source In the table image, extract the pre-extracted table area.

在一种可能的设计中,在对灰度二值化处理后的图像进行横向直方图统计处理,得到横向直方图统计处理后的图像之后,还包括:通过预定义的第一膨胀核对横向直方图统计处理后的图像进行横向膨胀处理,得到横向膨胀处理后的图像;相应地,对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界,包括:对横向膨胀处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界。In a possible design, after performing horizontal histogram statistical processing on the image after grayscale binarization processing to obtain the image after horizontal histogram statistical processing, it also includes: checking the horizontal histogram through the predefined first expansion kernel The image after the statistical processing of the graph is subjected to horizontal expansion processing to obtain the image after the horizontal expansion processing; correspondingly, the minimum circumscribed rectangle detection is performed on the image after the horizontal histogram statistical processing to determine the vertical boundary of the table area, including: horizontal expansion processing The final image is subjected to minimum enclosing rectangle detection to determine the vertical boundary of the table area.

在一种可能的设计中,其中目标属性为横向直线;相应的,检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域,包括:对预提取的表格区域进行灰度二值化,得到灰度二值化的预提取的表格区域;采用形态学方法,提取灰度二值化的预提取的表格区域中的横向线段;采用直线提取算法,将横向线段重构成为横向直线;检测每个预提取的表格区域中是否存在横向直线,得到检测结果;将检测结果为是的所有预提取的表格区域,确定为目标表格区域。In a possible design, the target attribute is a horizontal straight line; correspondingly, detecting whether the target attribute exists in the pre-extracted table area, and filtering out the target table area from the pre-extracted table area according to the detection result, including: Perform grayscale binarization on the pre-extracted table area to obtain the gray-scale binarized pre-extracted table area; use the morphological method to extract the horizontal line segment in the gray-scale binarized pre-extracted table area; use straight line extraction The algorithm reconstructs the horizontal line segment into a horizontal straight line; detects whether there is a horizontal line in each pre-extracted table area, and obtains a detection result; determines all pre-extracted table areas whose detection results are yes, as the target table area.

在一种可能的设计中,对目标表格区域中的文字部分进行定位,得到文字定位的表格图像,包括:对目标表格区域进行灰度化处理,得到灰度化处理后的目标表格区域;对灰度化处理后的目标表格区域进行边缘检测,得到边缘检测后的目标表格区域;对边缘检测后的目标表格区域进行二值化,得到二值化的目标表格区域;对二值化的目标表格区域进行腐蚀处理,去除二值化的目标表格区域中的竖直线得到剔除竖直线后的目标表格区域;根据预定义的第二膨胀核和腐蚀核对剔除竖直线后的目标表格区域进行处理,得到腐蚀处理后的目标表格区域;根据预定义的第三膨胀核对腐蚀处理后的目标表格区域再次进行膨胀处理,得到膨胀处理后的目标表格区域;查找膨胀处理后的目标表格区域中的矩形轮廓,对文字进行定位;根据预定义的第二筛选规则对矩形轮廓进行筛选;从目标表格区域中提取含有筛选后矩形轮廓的区域,得到文字定位的表格图像。In a possible design, the text part in the target table area is positioned to obtain the table image of the text positioning, including: performing grayscale processing on the target table area to obtain the grayscale processed target table area; Edge detection is performed on the target table area after grayscale processing to obtain the target table area after edge detection; binarization is performed on the target table area after edge detection to obtain a binarized target table area; the binarized target table area is obtained The table area is corroded, and the vertical line in the binarized target table area is removed to obtain the target table area after the vertical line is removed; the target table area after the vertical line is removed according to the predefined second expansion kernel and corrosion check Perform processing to obtain the target table area after corrosion processing; perform expansion processing on the target table area after corrosion processing according to the predefined third expansion kernel to obtain the target table area after expansion processing; find the target table area after expansion processing The rectangular outline of the text is positioned; the rectangular outline is screened according to a predefined second screening rule; the area containing the filtered rectangular outline is extracted from the target table area to obtain a table image of the text positioning.

在一种可能的设计中,在对目标表格区域中的文字部分进行定位,得到文字定位的表格图像之后,还包括:识别表格图像中的文字,并根据文字生成电子表格。In a possible design, after locating the text part in the target table area to obtain the text-located table image, the method further includes: identifying the text in the table image, and generating an electronic form according to the text.

第二方面,本申请提供一种表格处理装置,包括:In a second aspect, the present application provides a form processing device, including:

获取模块,用于获取源表格图像;The acquisition module is used to acquire the source table image;

预提取模块,用于采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域;The pre-extraction module is used to extract the form area in the source form image by histogram statistics to obtain the pre-extracted form area;

筛选模块,用于检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域;A screening module, configured to detect whether there is a target attribute in the pre-extracted form area, and filter out the target form area from the pre-extracted form area according to the detection result;

定位模块,用于对目标表格区域中的文字部分进行定位,得到文字定位的表格图像。The positioning module is used to locate the text part in the target form area, and obtain the form image of the text positioning.

在一种可能的设计中,预提取模块具体用于:对源表格图像进行灰度二值化处理,得到灰度二值化处理后的图像;对灰度二值化处理后的图像进行纵向直方图统计处理,得到纵向直方图统计处理后的图像;对纵向直方图统计处理后的图像进行最小外界矩形检测,确定表格区域的横向边界;对灰度二值化处理后的图像进行横向直方图统计处理,得到横向直方图统计处理后的图像;对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界;根据横向边界和纵向边界,确定表格区域的矩形;根据预定义的第一筛选规则对表格区域的矩形进行筛选,得到筛选后的表格区域的矩形;基于筛选后的表格区域的矩形从源表格图像中,提取得到预提取的表格区域。In a possible design, the pre-extraction module is specifically used to: perform grayscale binarization processing on the source table image to obtain the image after grayscale binarization processing; Statistical processing of the histogram to obtain the image after the statistical processing of the vertical histogram; the minimum external rectangle detection is performed on the image after the statistical processing of the vertical histogram to determine the horizontal boundary of the table area; horizontal histogram is performed on the image after the grayscale binarization processing Graph statistical processing, obtain the image after the horizontal histogram statistical processing; Carry out minimum circumscribed rectangle detection to the image after the horizontal histogram statistical processing, determine the vertical boundary of the table area; According to the horizontal boundary and the vertical boundary, determine the rectangle of the table area; According to The predefined first filtering rule filters the rectangle of the table area to obtain the rectangle of the filtered table area; based on the rectangle of the filtered table area, the pre-extracted table area is obtained by extracting from the source table image.

在一种可能的设计中,定位模块具体用于:对目标表格区域进行灰度化处理,得到灰度化处理后的目标表格区域;对灰度化处理后的目标表格区域进行边缘检测,得到边缘检测后的目标表格区域;对边缘检测后的目标表格区域进行二值化,得到二值化的目标表格区域;对二值化的目标表格区域进行腐蚀处理,去除二值化的目标表格区域中的竖直线得到剔除竖直线后的目标表格区域;根据预定义的第二膨胀核和腐蚀核对剔除竖直线后的目标表格区域进行处理,得到腐蚀处理后的目标表格区域;根据预定义的第三膨胀核对腐蚀处理后的目标表格区域再次进行膨胀处理,得到膨胀处理后的目标表格区域;查找膨胀处理后的目标表格区域中的矩形轮廓,对文字进行定位;根据预定义的第二筛选规则对矩形轮廓进行筛选;从目标表格区域中提取含有筛选后矩形轮廓的区域,得到文字定位的表格图像。In a possible design, the positioning module is specifically used to: perform grayscale processing on the target table area to obtain the grayscale processed target table area; perform edge detection on the grayscale processed target table area to obtain The target table area after edge detection; binarize the target table area after edge detection to obtain a binarized target table area; corrode the binarized target table area to remove the binarized target table area The vertical line in the vertical line obtains the target table area after removing the vertical line; According to the predefined second expansion kernel and corrosion check, the target table area after removing the vertical line is processed, and the target table area after corrosion processing is obtained; according to the pre-defined The defined third expansion kernel performs expansion processing on the target table area after corrosion processing again to obtain the target table area after expansion processing; find the rectangular outline in the target table area after expansion processing, and position the text; according to the predefined first The second filtering rule is to filter the rectangular outline; extract the area containing the filtered rectangular outline from the target table area, and obtain the form image of the text positioning.

第三方面,本申请提供表格处理系统,包括:In a third aspect, the application provides a form processing system, including:

摄像机,用于采集源表格图像;A camera for capturing source form images;

服务器包括:至少一个处理器和存储器;存储器存储计算机执行指令;处理器执行存储器存储的计算机执行指令,使得至少一个处理器执行如上第一方面以及第一方面各种可能的设计的表格处理方法。The server includes: at least one processor and a memory; the memory stores computer-executable instructions; the processor executes the computer-executable instructions stored in the memory, so that at least one processor executes the table processing method of the above first aspect and various possible designs of the first aspect.

第四方面,本申请提供一种计算机存储介质,计算机存储介质中存储有计算机执行指令,当处理器执行计算机执行指令时,实现如上第一方面以及第一方面各种可能的设计的表格处理方法。In the fourth aspect, the present application provides a computer storage medium, in which computer-executable instructions are stored, and when the processor executes the computer-executable instructions, the above-mentioned first aspect and the table processing method of various possible designs of the first aspect are realized .

第五方面,本申请实施例提供一种计算机程序产品,包括计算机程序,计算机程序被处理器执行时,实现如上第一方面以及第一方面各种可能的设计的表格处理方法。In the fifth aspect, the embodiment of the present application provides a computer program product, including a computer program. When the computer program is executed by a processor, the above first aspect and the table processing method of various possible designs of the first aspect can be realized.

本申请提供的表格处理方法、装置、系统及存储介质,通过采用直方图统计法对源表格图像进行表格提取,并对提取到的目标表格区域进行文字部分的定位,得到文字定位的表格图像,识别方法不依赖表格框线,对框线残缺的表格也可以做出准确识别。The form processing method, device, system and storage medium provided by the present application extract the form from the source form image by using the histogram statistics method, and locate the text part of the extracted target form area, so as to obtain the form image of the text positioning, The identification method does not depend on the frame line of the form, and can also accurately identify the form with incomplete frame lines.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例,并与说明书一起用于解释本申请的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description serve to explain the principles of the application.

图1为本申请实施例提供的表格处理方法的应用场景示意图;FIG. 1 is a schematic diagram of an application scenario of a table processing method provided in an embodiment of the present application;

图2为本申请一个实施例提供的表格处理方法流程示意图;FIG. 2 is a schematic flow chart of a form processing method provided by an embodiment of the present application;

图3为本申请实施例提供的纵向直方图统计处理后的图像;Fig. 3 is the image after statistical processing of the longitudinal histogram provided by the embodiment of the present application;

图4为本申请实施例提供的最小外界矩形检测后的图像;Fig. 4 is the image after the detection of the minimum outer rectangle provided by the embodiment of the present application;

图5为本申请实施例提供的横向直方图统计处理后的图像;Fig. 5 is the image after statistical processing of the horizontal histogram provided by the embodiment of the present application;

图6为本申请实施例提供的预提取的表格区域;Fig. 6 is the pre-extracted table area provided by the embodiment of the present application;

图7为本申请实施例提供的文字定位输出结果图像;Fig. 7 is the output result image of text positioning provided by the embodiment of the present application;

图8为本申请实施例提供的横向膨胀处理后的图像;FIG. 8 is an image after lateral expansion processing provided by the embodiment of the present application;

图9为本申请实施例提供的最小外接矩形检测的图像;FIG. 9 is an image of the minimum circumscribed rectangle detection provided by the embodiment of the present application;

图10为本申请实施例提供的表格处理装置的结构示意图;FIG. 10 is a schematic structural diagram of a form processing device provided in an embodiment of the present application;

图11为本申请实施例提供的服务器的硬件结构示意图。FIG. 11 is a schematic diagram of a hardware structure of a server provided by an embodiment of the present application.

通过上述附图,已示出本申请明确的实施例,后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本申请构思的范围,而是通过参考特定实施例为本领域技术人员说明本申请的概念。By means of the above drawings, specific embodiments of the present application have been shown, which will be described in more detail hereinafter. These drawings and text descriptions are not intended to limit the scope of the concept of the application in any way, but to illustrate the concept of the application for those skilled in the art by referring to specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

需要说明的是,本申请所涉及的用户信息(包括但不限于用户设备信息、用户个人信息等)和数据(包括但不限于用于分析的数据、存储的数据、展示的数据等),均为经用户授权或者经过各方充分授权的信息和数据,并且相关数据的收集、使用和处理需要遵守相关法律法规和标准,并提供有相应的操作入口,供用户选择授权或者拒绝。It should be noted that the user information (including but not limited to user equipment information, user personal information, etc.) and data (including but not limited to data used for analysis, stored data, displayed data, etc.) involved in this application are all It is information and data authorized by the user or fully authorized by all parties, and the collection, use and processing of relevant data must comply with relevant laws, regulations and standards, and provide corresponding operation portals for users to choose to authorize or refuse.

需要说明的是,本申请表格处理的方法和装置可用于人工智能领域,也可用于除人工智能领域之外的任意领域,本申请表格处理的方法和装置的应用领域不作限定。It should be noted that the form processing method and device of the present application can be used in the field of artificial intelligence, and can also be used in any field other than the field of artificial intelligence, and the application field of the form processing method and device of the present application is not limited.

针对表格处理过程中对非规则表格识别准确度不高的的问题,本申请实施例提出以下技术方案:采用直方图统计法提取源表格图像中的表格区域,并对表格区域中的文字部分进行定位,实现了对表格的准确识别。下面采用详细的实施例进行详细说明。Aiming at the problem that the recognition accuracy of irregular forms is not high in the form processing process, the embodiment of the present application proposes the following technical solutions: use the histogram statistical method to extract the form area in the source form image, and perform text analysis on the text part in the form area. Positioning, to achieve accurate identification of the form. The following uses detailed embodiments to describe in detail.

图1为本申请实施例提供的表格识别方法的应用场景示意图。如图1所示,包括:摄像机101和服务器102。摄像机101拍摄纸质表格以获取表格图像。服务器102,用于摄像机101获取的表格图像,并对表格图像进行表格文字的识别与录入。FIG. 1 is a schematic diagram of an application scenario of a table recognition method provided by an embodiment of the present application. As shown in FIG. 1 , it includes: a camera 101 and a server 102 . The camera 101 shoots the paper form to obtain an image of the form. The server 102 is used for the form image captured by the camera 101 , and recognizes and enters the form text on the form image.

下面以具体地实施例对本申请的技术方案以及本申请的技术方案如何解决上述技术问题进行详细说明。下面这几个具体的实施例可以相互结合,对于相同或相似的概念或过程可能在某些实施例中不再赘述。下面将结合附图,对本申请的实施例进行描述。The technical solution of the present application and how the technical solution of the present application solves the above technical problems will be described in detail below with specific embodiments. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present application will be described below in conjunction with the accompanying drawings.

图2为本申请一个实施例提供的表格处理方法流程示意图,本实施例的执行主体可以为图1所示实施例中的服务器,本实施例此处不做特别限制。如图2所示,该方法包括:FIG. 2 is a schematic flow chart of a form processing method provided by an embodiment of the present application. The execution subject of this embodiment may be the server in the embodiment shown in FIG. 1 , and this embodiment does not make special limitations here. As shown in Figure 2, the method includes:

S201:获取源表格图像。S201: Acquire a source table image.

具体地,获取由摄像机上传的源表格图像。Specifically, the source table image uploaded by the camera is acquired.

S202:采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域。S202: Using a histogram statistics method to extract form areas in the source form image to obtain pre-extracted form areas.

具体地,包括S2021~S2027:Specifically, including S2021~S2027:

S2021:对源表格图像进行灰度二值化处理,得到灰度二值化处理后的图像。S2021: Perform grayscale binarization processing on the source table image to obtain an image after grayscale binarization processing.

其中,灰度二值化为将图像在RGB模型中的每个像素点的三原色的变量值调整至相同值,这个值为灰度值,再将图像中每个像素点的灰度值调整为0(黑色)或255(白色)。Among them, the grayscale binarization is to adjust the variable values of the three primary colors of each pixel in the RGB model to the same value, which is the grayscale value, and then adjust the grayscale value of each pixel in the image to 0 (black) or 255 (white).

具体地,通过灰度二值化处理将源表格图像中每个像素点的三原色的变量值调整至相同灰度值,再将每个像素点的灰度值调整为0(黑色)或255(白色),得到灰度二值化处理后的图像。Specifically, the variable values of the three primary colors of each pixel in the source table image are adjusted to the same gray value through grayscale binarization, and then the grayscale value of each pixel is adjusted to 0 (black) or 255 ( White), to obtain the grayscale binarized image.

S2022:对灰度二值化处理后的图像进行纵向直方图统计处理,得到纵向直方图统计处理后的图像。S2022: Perform longitudinal histogram statistical processing on the grayscale binarized image to obtain an image after longitudinal histogram statistical processing.

其中,直方图统计通过统计图像区域的的亮度分布来改变图像中个像素的灰度,主要用于增强动态范围偏小的图像的对比度。Among them, the histogram statistics change the gray level of each pixel in the image by counting the brightness distribution of the image area, which is mainly used to enhance the contrast of the image with a small dynamic range.

具体地,通过统计灰度二值化处理后的图像中每个像素点的亮度分布来改变图像中每个像素的灰度,得到纵向直方图统计处理中的图像。Specifically, the gray level of each pixel in the image is changed by counting the brightness distribution of each pixel in the gray level binarized image to obtain the image in the longitudinal histogram statistical process.

示例性地,图3中(a)、(b)、(c)分别为三张表格图像经过纵向直方图统计处理后的图像,得到纵向的表格线。Exemplarily, (a), (b) and (c) in FIG. 3 are images of three table images subjected to statistical processing of longitudinal histograms, and vertical table lines are obtained.

S2023:对纵向直方图统计处理后的图像进行最小外界矩形检测,确定表格区域的横向边界。S2023: Perform minimum outer rectangle detection on the image after statistical processing of the vertical histogram, and determine the horizontal boundary of the table area.

具体地,对纵向直方图处理后的图像进行轮廓检测,找到外接矩形,调整矩形的角度,找到最小的外接矩形。Specifically, contour detection is performed on the image processed by the vertical histogram, a circumscribing rectangle is found, an angle of the rectangle is adjusted, and the smallest circumscribing rectangle is found.

示例性地,图4中(a)、(b)、(c)分别为三张表格图像经过最小外界矩形检测后的图像,确定出表格区域的最左、最右侧框线,即表格区域的横向边界。Exemplarily, (a), (b), and (c) in Fig. 4 are the images of the three table images after the minimum outer rectangle detection, and the leftmost and rightmost frame lines of the table area are determined, that is, the table area horizontal border.

S2024:对灰度二值化处理后的图像进行横向直方图统计处理,得到横向直方图统计处理后的图像。S2024: Perform horizontal histogram statistical processing on the grayscale binarized image to obtain an image after horizontal histogram statistical processing.

具体地,通过统计灰度二值化处理后的图像中每个像素点的亮度分布来改变图像中每个像素的灰度,得到横向直方图统计处理中的图像。Specifically, the gray level of each pixel in the image is changed by counting the brightness distribution of each pixel in the gray level binarized image to obtain the image in the horizontal histogram statistical process.

示例性地,图5中(a)、(b)、(c)分别为三张表格图像经过横向直方图统计处理后的图像,得到横向的表格线。Exemplarily, (a), (b), and (c) in FIG. 5 are images of three table images subjected to statistical processing of horizontal histograms, and horizontal table lines are obtained.

S2025:对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界。S2025: Perform minimum circumscribed rectangle detection on the image after the statistical processing of the horizontal histogram, and determine the vertical boundary of the table area.

S2026:根据横向边界和纵向边界,确定表格区域的矩形。S2026: Determine the rectangle of the table area according to the horizontal boundary and the vertical boundary.

具体地,将横向边界和纵向边界包围的区域,确定为表格区域的矩形。Specifically, the area surrounded by the horizontal boundary and the vertical boundary is determined as a rectangle of the table area.

S2027:根据预定义的第一筛选规则对表格区域的矩形进行筛选,得到筛选后的表格区域的矩形。S2027: Filter the rectangle of the table area according to the predefined first screening rule to obtain the screened rectangle of the table area.

其中,第一筛选规则为面积、高度和宽度等满足预设的尺寸标准的表格区域的矩形。Wherein, the first filtering rule is a rectangle of a table area satisfying preset size standards such as area, height and width.

示例性地,第一筛选规则为满足面积大于20cm2、高度大于10cm、宽度大于20cm的表格区域的矩形。Exemplarily, the first screening rule is a rectangle that satisfies a table area with an area greater than 20 cm 2 , a height greater than 10 cm, and a width greater than 20 cm.

S2028:基于筛选后的表格区域的矩形从源表格图像中,提取得到预提取的表格区域。S2028: Extract the pre-extracted table area from the source table image based on the rectangle of the screened table area.

具体地,根据筛选后的表格区域的矩形的位置,对源表格图像相同的位置进行表格提取,得到预提取的表格区域。Specifically, according to the position of the rectangle of the filtered table area, the table is extracted from the same position of the source table image to obtain a pre-extracted table area.

示例性地,图6中(a)、(b)、(c)分别为三张表格图像经过预提取的表格区域。Exemplarily, (a), (b) and (c) in FIG. 6 are the pre-extracted form areas of the three form images respectively.

S203:检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域。S203: Detect whether there is a target attribute in the pre-extracted table area, and filter out the target table area from the pre-extracted table area according to the detection result.

在本实施例中,该目标属性可以为横向直线,也可以是超过两条的纵向直线。In this embodiment, the target attribute may be a horizontal straight line, or more than two vertical straight lines.

在一种实现方式中,该目标属性为横向直线,S203具体包括S2031~S2035:In an implementation manner, the target attribute is a horizontal straight line, and S203 specifically includes S2031-S2035:

S2031:对预提取的表格区域进行灰度二值化,得到灰度二值化的预提取的表格区域。S2031: Perform gray scale binarization on the pre-extracted table area to obtain a gray scale binarized pre-extracted table area.

S2032:采用形态学方法,提取灰度二值化的预提取的表格区域中的横向线段。S2032: Using a morphological method, extracting horizontal line segments in the pre-extracted table area after grayscale binarization.

其中,形态学是一种非线性滤波方法,通过目标图像的形状创建对应的结构元素,对目标图像进行提取。Among them, morphology is a nonlinear filtering method, which creates corresponding structural elements through the shape of the target image to extract the target image.

具体地,通过预先创建横向线段对应的结构元素,对灰度二值化的预提取的表格区域中的横向线段进行提取。Specifically, by pre-creating structural elements corresponding to the horizontal line segments, the horizontal line segments in the pre-extracted table area of the gray scale binarization are extracted.

S2033:采用直线提取算法,将横向线段重构成为横向直线。S2033: Using a straight line extraction algorithm to reconstruct the horizontal line segment into a horizontal straight line.

其中,直线提取算法是根据图像中像素点的梯度对直线进行提取的算法。Wherein, the straight line extraction algorithm is an algorithm for extracting the straight line according to the gradient of the pixel points in the image.

在本实施例中,可以使用最小显著性差异法对直线进行提取。In this embodiment, the straight line can be extracted using the least significant difference method.

具体地,采用高斯核对输入图像进行采样,计算每一个采样点的梯度值以及梯度方向,根据梯度值对所有采样点进行伪排序,根据梯度值对采样点的状态进行排序,最后通过直线纵向坐标的排序处理,极值聚类均值处理,得到直线坐标。Specifically, the Gaussian kernel is used to sample the input image, the gradient value and gradient direction of each sampling point are calculated, all sampling points are pseudo-sorted according to the gradient value, the states of the sampling points are sorted according to the gradient value, and finally the linear longitudinal coordinate Sorting processing, extreme value clustering mean value processing, to obtain linear coordinates.

S2034:检测每个预提取的表格区域中是否存在横向直线,得到检测结果。S2034: Detect whether there is a horizontal straight line in each pre-extracted table area, and obtain a detection result.

S2035:将检测结果为是的所有预提取的表格区域,确定为目标表格区域。S2035: Determine all pre-extracted table areas whose detection result is yes as target table areas.

S204:对目标表格区域中的文字部分进行定位,得到文字定位的表格图像。S204: Position the text part in the target table area to obtain a text-positioned table image.

其中,文字包括文本和数据。Among them, literal includes text and data.

具体地,包括S2041~S2049,图7为本申请实施例提供的文字定位输出结果图像。Specifically, including S2041 to S2049, FIG. 7 is an image of the text positioning output result provided by the embodiment of the present application.

S2041:对目标表格区域进行灰度化处理,得到灰度化处理后的目标表格区域。S2041: Perform grayscale processing on the target table area to obtain the grayscale processed target table area.

具体地,通过灰度化处理将目标表格区域中每个像素点的灰度值调整为0(黑色)或255(白色),得到灰度化处理后的目标表格区域。Specifically, the grayscale value of each pixel in the target table area is adjusted to 0 (black) or 255 (white) through grayscale processing to obtain the grayscale processed target table area.

示例性地,图7中(a)所示为目标表格区域。Exemplarily, (a) in FIG. 7 shows the target table area.

S2042:对灰度化处理后的目标表格区域进行边缘检测,得到边缘检测后的目标表格区域。S2042: Perform edge detection on the gray-scaled target table area to obtain the edge-detected target table area.

在本实施例中,可以使用Sobel算子对灰度化处理后的目标表格区域进行边缘检测。In this embodiment, a Sobel operator may be used to perform edge detection on the grayscaled target table area.

具体地,Sobel算子求x方向梯度,函数调用采用以下参数xorder=1,yorder=0,aperture_size=3来计算一阶x-或y-方向的图像差分,即Sobel(gray,sobel,CV_8U,1,0,3)。Specifically, the Sobel operator seeks the gradient in the x direction, and the function call uses the following parameters xorder=1, yorder=0, aperture_size=3 to calculate the image difference in the first-order x- or y-direction, that is, Sobel(gray, sobel, CV_8U, 1,0,3).

S2043:对边缘检测后的目标表格区域进行最大类间方差法二值化,得到二值化的目标表格区域。S2043: Binarize the target table area after the edge detection by the maximum inter-class variance method to obtain a binarized target table area.

具体地,通过阈值进行前后背景分割,使类间方差最大,将边缘检测后的目标表格区域分成背景和前景两部分,得到二值化的目标表格区域,二值化的目标表格区域如图7中(b)所示。Specifically, the front and rear backgrounds are segmented by the threshold to maximize the variance between classes, and the target table area after edge detection is divided into two parts, the background and the foreground, and the binarized target table area is obtained. The binarized target table area is shown in Figure 7 Shown in (b).

S2044:对二值化的目标表格区域进行腐蚀处理,去除二值化的目标表格区域中的竖直线得到剔除竖直线后的目标表格区域。S2044: Erosion processing is performed on the binarized target table area, and vertical lines in the binarized target table area are removed to obtain a target table area from which the vertical lines have been removed.

具体地,采用形态学方法,创建用于提取垂直线的结构元素,剔除二值化的目标表格区域中的竖直线,得到剔除竖直线后的目标表格区域,图7中(c)为二值化的目标表格区域中的竖直线,图7中(d)为剔除竖直线后的目标表格区域。Specifically, the morphological method is used to create structural elements for extracting vertical lines, and the vertical lines in the binarized target table area are removed to obtain the target table area after the vertical lines are removed, as shown in (c) in Figure 7 The vertical lines in the binarized target table area, (d) in FIG. 7 is the target table area after the vertical lines are removed.

S2045:根据预定义的第二膨胀核和腐蚀核对剔除竖直线后的目标表格区域进行处理,得到腐蚀处理后的目标表格区域。S2045: Process the target table area after removing the vertical lines according to the predefined second expansion kernel and corrosion kernel, to obtain the target table area after corrosion processing.

具体地,应用形态学计算,对剔除竖直线后的目标表格区域先腐蚀再膨胀,并设置高度以控制上下向的膨胀程度,得到腐蚀处理后的目标表格区域,图7中(e)为经过第二膨胀核膨胀后的目标表格区域,突出了文字轮廓,图7中(f)为经过腐蚀后的目标表格区域,再次去除掉一些纵向表格线。Specifically, by applying morphological calculations, the target form area after removing the vertical lines is corroded and then expanded, and the height is set to control the degree of expansion in the up and down directions to obtain the target form area after corrosion treatment. Figure 7(e) is The target table area expanded by the second expansion kernel highlights the outline of the text. (f) in FIG. 7 is the target table area after corrosion, and some vertical table lines are removed again.

示例性地,预定义的第二膨胀核和腐蚀核的尺寸为(20,10)。Exemplarily, the predefined dimensions of the second expansion core and corrosion core are (20, 10).

S2046:根据预定义的第三膨胀核对腐蚀处理后的目标表格区域再次进行膨胀处理,得到膨胀处理后的目标表格区域。S2046: Perform dilation processing on the corroded target table area again according to the predefined third expansion kernel, to obtain the dilated target table area.

具体地,根据预定义的第二膨胀核对腐蚀处理后的目标表格区域再次进行横向膨胀处理,得到膨胀处理后的目标表格区域,膨胀处理后的目标表格区域文字轮廓更加明显,膨胀处理后的目标表格区域如图7中(g)所示。Specifically, according to the predefined second expansion kernel, the corroded target table area is horizontally expanded again to obtain the expanded target table area. The text outline of the expanded target table area is more obvious, and the expanded target table area The table area is shown in (g) in Figure 7.

示例性地,第二膨胀核尺寸为(10,1)。Exemplarily, the size of the second expansion kernel is (10, 1).

S2047:查找膨胀处理后的目标表格区域中的矩形轮廓,对文字进行定位。S2047: Find the rectangle outline in the expanded target table area, and position the text.

具体地,根据膨胀处理后的目标表格区域中的矩形轮廓的位置信息,找到文字的位置信息。Specifically, the position information of the text is found according to the position information of the rectangular outline in the target table area after the dilation process.

S2048:根据预定义的第二筛选规则对矩形轮廓进行筛选。S2048: Filter the rectangular contours according to a predefined second screening rule.

其中,第二筛选规则为面积、高度和宽度等满足预设的尺寸标准的矩形轮廓。Wherein, the second screening rule is a rectangular outline satisfying preset size standards such as area, height and width.

示例性地,第二筛选规则满足面积小于10cm2的矩形轮廓,高度大于2倍宽度的矩形轮廓、高度小于40cm的矩形轮廓、宽度小于20cm的矩形轮廓。Exemplarily, the second screening rule satisfies a rectangular outline with an area less than 10 cm2, a rectangular outline with a height greater than twice the width, a rectangular outline with a height less than 40 cm, and a rectangular outline with a width less than 20 cm.

S2049:从目标表格区域中提取含有筛选后矩形轮廓的区域,得到文字定位的表格图像。S2049: Extract an area containing the filtered rectangular outline from the target table area to obtain a text-positioned table image.

具体地,从目标表格区域中提取含有满足第二筛选规则的矩形轮廓的区域,得到文字定位的表格图像,如图7中(h)所示。Specifically, an area containing a rectangular outline satisfying the second screening rule is extracted from the target table area to obtain a text-located table image, as shown in (h) in FIG. 7 .

综上,本实施例提供的表格处理方法,通过采用直方图统计法对源表格图像进行表格提取,并对提取到的目标表格区域进行文字部分的定位,得到文字定位的表格图像,识别方法不依赖表格框线,对框线残缺的表格也可以做出准确识别。To sum up, the table processing method provided in this embodiment extracts the table from the source table image by using the histogram statistics method, and locates the text part of the extracted target table area to obtain the table image with the text positioning. The recognition method does not Relying on the frame line of the table, it can also accurately identify the table with incomplete frame line.

另外,筛选出含有目标属性的表格区域,减少了不必要表格的识别,进一步提高了表格识别的效率和准确度。In addition, the table area containing the target attribute is screened out, which reduces the recognition of unnecessary tables and further improves the efficiency and accuracy of table recognition.

本申请实施例在图2提供的实施例的基础上,对S2024之后对横向直方图统计处理后的图像进行横向膨胀处理的具体实现方法进行了详细说明。该方法包括:通过预定义的第一膨胀核对横向直方图统计处理后的图像进行横向膨胀处理,得到横向膨胀处理后的图像。In this embodiment of the present application, on the basis of the embodiment provided in FIG. 2 , the specific implementation method of performing horizontal dilation processing on the image after the statistical processing of the horizontal histogram after S2024 is described in detail. The method includes: performing horizontal dilation processing on the image after the statistical processing of the horizontal histogram through a predefined first dilation kernel to obtain the image after the horizontal dilation processing.

具体地,通过预定义的第一膨胀核对横向直方图统计处理后的图像高亮部分进行扩张,减小图像高亮部分之间的间隙,得到横向膨胀处理后的图像,图8为横向膨胀处理后的图像。Specifically, the image highlight part after the horizontal histogram statistical processing is expanded by the predefined first expansion kernel, the gap between the image highlight parts is reduced, and the image after the horizontal expansion process is obtained. Figure 8 shows the horizontal expansion process after the image.

示例性地,第一膨胀核的尺寸设置为(20,10)。Exemplarily, the size of the first expansion kernel is set to (20, 10).

相应地,对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界,包括:Correspondingly, the minimum circumscribed rectangle detection is performed on the image after the statistical processing of the horizontal histogram to determine the vertical boundary of the table area, including:

对横向膨胀处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界。The minimum circumscribed rectangle detection is performed on the image processed by horizontal dilation to determine the vertical boundary of the table area.

示例性地,图9为最小外接矩形检测的图像,确定出表格区域的最上,最右侧框线,即表格区域的纵向边界。Exemplarily, FIG. 9 is an image of the minimum circumscribed rectangle detection, and the uppermost and rightmost frame lines of the table area are determined, that is, the vertical boundary of the table area.

综上,本实施例提供的表格处理方法,通过对图像高亮部分进行扩张,减少图像高亮部分之间的间隙,增强了最小矩形检测的准确度,进一步增强了表格识别的精确度。To sum up, the table processing method provided by this embodiment reduces the gap between the highlighted parts of the image by expanding the highlighted parts of the image, thereby enhancing the accuracy of the minimum rectangle detection and further enhancing the accuracy of table recognition.

本申请实施例在图2提供的实施例的基础上,对S204之后生成电子表格的具体实现方法进行了详细说明。该方法包括:In this embodiment of the present application, based on the embodiment provided in FIG. 2 , a specific implementation method for generating an electronic form after S204 is described in detail. The method includes:

识别表格图像中的文字,并根据文字生成电子表格。Recognize text in table images and generate spreadsheets from the text.

具体地,采取智能算法,识别表格图像中的文字,并根据文字定位的表格图像将文字输入电子表格中。Specifically, an intelligent algorithm is adopted to recognize the text in the form image, and the text is entered into the electronic form according to the form image located by the text.

在本实施例中,智能算法可以是模板匹配算法,也可以是深度学习算法。In this embodiment, the intelligent algorithm may be a template matching algorithm, or a deep learning algorithm.

综上,本申请实施例提供的表格处理方法,通过识别表格图像中的文字,对表格进行识别,解决了人工录入电子表格过程中存在的工作量大、操作繁琐,准确度低的问题。In summary, the form processing method provided by the embodiment of the present application recognizes the form by identifying the text in the form image, and solves the problems of heavy workload, cumbersome operation and low accuracy in the process of manually entering the electronic form.

图10为本申请实施例提供的表格处理装置的结构示意图。如图10所示,该表格处理装置包括:获取模块1001、预提取模块1002、筛选模块1003以及定位模块1004。FIG. 10 is a schematic structural diagram of a form processing device provided by an embodiment of the present application. As shown in FIG. 10 , the form processing apparatus includes: an acquisition module 1001 , a pre-extraction module 1002 , a screening module 1003 and a positioning module 1004 .

获取模块1001,用于获取源表格图像;An acquisition module 1001, configured to acquire a source table image;

预提取模块1002,用于采用直方图统计法提取源表格图像中的表格区域,得到预提取的表格区域;The pre-extraction module 1002 is used to extract the form area in the source form image by histogram statistics to obtain the pre-extracted form area;

筛选模块1003,用于检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域;A screening module 1003, configured to detect whether there is a target attribute in the pre-extracted form area, and filter out the target form area from the pre-extracted form area according to the detection result;

定位模块1004,用于对目标表格区域中的文字部分进行定位,得到文字定位的表格图像。The positioning module 1004 is configured to locate the text part in the target form area, and obtain the form image of the text positioning.

在一种可能的实现方式中,预提取模块1002具体用于:对源表格图像进行灰度二值化处理,得到灰度二值化处理后的图像;对灰度二值化处理后的图像进行纵向直方图统计处理,得到纵向直方图统计处理后的图像;对纵向直方图统计处理后的图像进行最小外界矩形检测,确定表格区域的横向边界;对灰度二值化处理后的图像进行横向直方图统计处理,得到横向直方图统计处理后的图像;对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界;根据横向边界和纵向边界,确定表格区域的矩形;根据预定义的第一筛选规则对表格区域的矩形进行筛选,得到筛选后的表格区域的矩形;基于筛选后的表格区域的矩形从源表格图像中,提取得到预提取的表格区域。In a possible implementation, the pre-extraction module 1002 is specifically configured to: perform grayscale binarization processing on the source table image to obtain an image after grayscale binarization processing; Perform statistical processing of the vertical histogram to obtain the image after the statistical processing of the vertical histogram; perform minimum external rectangle detection on the image after the statistical processing of the vertical histogram to determine the horizontal boundary of the table area; Horizontal histogram statistical processing to obtain the image after horizontal histogram statistical processing; perform minimum circumscribed rectangle detection on the image after horizontal histogram statistical processing to determine the vertical boundary of the table area; determine the rectangle of the table area according to the horizontal boundary and vertical boundary ; Filter the rectangle of the table area according to the predefined first screening rule to obtain the rectangle of the filtered table area; extract the rectangle of the table area based on the filtered table area from the source table image to obtain the pre-extracted table area.

在一种可能的实现方式中,预提取模块1002还用于:通过预定义的第一膨胀核对横向直方图统计处理后的图像进行横向膨胀处理,得到横向膨胀处理后的图像;相应地,对横向直方图统计处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界,包括:对横向膨胀处理后的图像进行最小外接矩形检测,确定表格区域的纵向边界。In a possible implementation manner, the pre-extraction module 1002 is further configured to: perform horizontal dilation processing on the image after the statistical processing of the horizontal histogram through a predefined first dilation kernel to obtain the image after the horizontal dilation processing; correspondingly, the The minimum circumscribed rectangle detection is performed on the image after the horizontal histogram statistical processing to determine the vertical boundary of the table area, including: the minimum circumscribed rectangle detection is performed on the image after the horizontal expansion processing to determine the vertical boundary of the table area.

在一种可能的实现方式中,筛选模块1003具体用于:目标属性为横向直线;相应的,检测预提取的表格区域中是否存在目标属性,并根据检测结果从预提取的表格区域中筛选出目标表格区域,包括:对预提取的表格区域进行灰度二值化,得到灰度二值化的预提取的表格区域;采用形态学方法,提取灰度二值化的预提取的表格区域中的横向线段;采用直线提取算法,将横向线段重构成为横向直线;检测每个预提取的表格区域中是否存在横向直线,得到检测结果;将检测结果为是的所有预提取的表格区域,确定为目标表格区域。In a possible implementation manner, the screening module 1003 is specifically configured to: the target attribute is a horizontal straight line; correspondingly, detect whether the target attribute exists in the pre-extracted table area, and filter out from the pre-extracted table area according to the detection result The target table area includes: performing grayscale binarization on the pre-extracted table area to obtain the gray-scale binarized pre-extracted table area; using a morphological method to extract the gray-scale binarized pre-extracted table area the horizontal line segment; use the straight line extraction algorithm to reconstruct the horizontal line segment into a horizontal straight line; detect whether there is a horizontal line in each pre-extracted table area, and obtain the detection result; determine all the pre-extracted table areas where the detection result is yes is the target table area.

在一种可能的实现方式中,定位模块1004具体用于:对目标表格区域进行灰度化处理,得到灰度化处理后的目标表格区域;对灰度化处理后的目标表格区域进行边缘检测,得到边缘检测后的目标表格区域;对边缘检测后的目标表格区域进行二值化,得到二值化的目标表格区域;对二值化的目标表格区域进行腐蚀处理,去除二值化的目标表格区域中的竖直线得到剔除竖直线后的目标表格区域;根据预定义的第二膨胀核和腐蚀核对剔除竖直线后的目标表格区域进行处理,得到腐蚀处理后的目标表格区域;根据预定义的第三膨胀核对腐蚀处理后的目标表格区域再次进行膨胀处理,得到膨胀处理后的目标表格区域;查找膨胀处理后的目标表格区域中的矩形轮廓,对文字进行定位;根据预定义的第二筛选规则对矩形轮廓进行筛选;从目标表格区域中提取含有筛选后矩形轮廓的区域,得到文字定位的表格图像。In a possible implementation manner, the positioning module 1004 is specifically configured to: grayscale the target table area to obtain the grayscaled target table area; perform edge detection on the grayscaled target table area , to obtain the target table area after edge detection; binarize the target table area after edge detection to obtain a binarized target table area; corrode the binarized target table area to remove the binarized target The vertical line in the table area obtains the target table area after the vertical line is removed; according to the predefined second expansion kernel and corrosion check, the target table area after the vertical line is removed is processed to obtain the target table area after the corrosion process; According to the predefined third expansion kernel, the corroded target table area is expanded again to obtain the expanded target table area; the rectangular outline in the expanded target table area is searched for, and the text is positioned; according to the predefined The second screening rule filters the rectangular outline; extracts the area containing the filtered rectangular outline from the target table area, and obtains the text-positioned table image.

在一种可能的实现方式中,继续参考图10,装置还包括生成模块1005,具体用于:识别表格图像中的文字,并根据文字生成电子表格。In a possible implementation manner, with continued reference to FIG. 10 , the device further includes a generation module 1005 , specifically configured to: recognize text in the form image, and generate an electronic form according to the text.

本实施例提供的装置,可用于执行上述方法实施例的技术方案,其实现原理和技术效果类似,本实施例此处不再赘述。The device provided in this embodiment can be used to implement the technical solutions of the above method embodiments, and its implementation principle and technical effect are similar, so this embodiment will not repeat them here.

本申请实施例提供的表格处理系统,包括:The form processing system provided by the embodiment of this application includes:

摄像机101,用于采集源表格图像;A camera 101, configured to collect source form images;

服务器102,用于执行如上的表格处理方法。The server 102 is configured to execute the above form processing method.

图11为本申请实施例提供的服务器的硬件结构示意图。如图11所示,本实施例的服务器包括:处理器1101以及存储器1102;其中FIG. 11 is a schematic diagram of a hardware structure of a server provided by an embodiment of the present application. As shown in Figure 11, the server in this embodiment includes: a processor 1101 and a memory 1102;

存储器1102,用于存储计算机执行指令;memory 1102, for storing computer-executable instructions;

处理器1101,用于执行存储器存储的计算机执行指令,以实现上述实施例中服务器所执行的各个步骤。具体可以参见前述方法实施例中的相关描述。The processor 1101 is configured to execute the computer-executed instructions stored in the memory, so as to implement the various steps executed by the server in the foregoing embodiments. For details, refer to the related descriptions in the foregoing method embodiments.

可选地,存储器1102既可以是独立的,也可以跟处理器1101集成在一起。Optionally, the memory 1102 can be independent or integrated with the processor 1101 .

当存储器1102独立设置时,该服务器还包括总线1103,用于连接存储器1102和处理器1101。When the memory 1102 is set independently, the server further includes a bus 1103 for connecting the memory 1102 and the processor 1101 .

本申请实施例还提供一种计算机存储介质,计算机存储介质中存储有计算机执行指令,当处理器执行计算机执行指令时,实现如上的表格处理方法。The embodiment of the present application also provides a computer storage medium, in which computer-executable instructions are stored, and when the processor executes the computer-executable instructions, the above table processing method is implemented.

本申请实施例还提供一种计算机程序产品,包括计算机程序,计算机程序被处理器执行时,实现如上的表格处理方法。本申请实施例还提供一种计算机程序产品,包括计算机程序,计算机程序被处理器执行时,实现如上的表格处理方法。An embodiment of the present application further provides a computer program product, including a computer program, and when the computer program is executed by a processor, the above table processing method is realized. An embodiment of the present application further provides a computer program product, including a computer program, and when the computer program is executed by a processor, the above table processing method is realized.

在本申请所提供的几个实施例中,应该理解到,所揭露的设备和方法,可以通过其它的方式实现。例如,以上所描述的设备实施例仅仅是示意性的,例如,模块的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个模块可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或模块的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of modules is only a logical function division. In actual implementation, there may be other division methods. For example, multiple modules can be combined or integrated into another A system, or some feature, can be ignored, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or modules may be in electrical, mechanical or other forms.

作为分离部件说明的模块可以是或者也可以不是物理上分开的,作为模块显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案。A module described as a separate component may or may not be physically separated, and a component shown as a module may or may not be a physical unit, that is, it may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to implement the solution of this embodiment.

另外,在本申请各个实施例中的各功能模块可以集成在一个处理单元中,也可以是各个模块单独物理存在,也可以两个或两个以上模块集成在一个单元中。上述模块成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional module in each embodiment of the present application may be integrated into one processing unit, each module may exist separately physically, or two or more modules may be integrated into one unit. The units formed by the above modules can be implemented in the form of hardware, or in the form of hardware plus software functional units.

上述以软件功能模块的形式实现的集成的模块,可以存储在一个计算机可读取存储介质中。上述软件功能模块存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器执行本申请各个实施例方法的部分步骤。The above-mentioned integrated modules implemented in the form of software function modules can be stored in a computer-readable storage medium. The above-mentioned software function modules are stored in a storage medium, and include several instructions to make a computer device (which may be a personal computer, server, or network device, etc.) or a processor execute some steps of the methods in various embodiments of the present application.

应理解,上述处理器可以是中央处理单元(Central Processing Unit,简称CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,简称DSP)、专用集成电路(Application Specific Integrated Circuit,简称ASIC)等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合发明所公开的方法的步骤可以直接体现为硬件处理器执行完成,或者用处理器中的硬件及软件模块组合执行完成。It should be understood that the above-mentioned processor may be a central processing unit (Central Processing Unit, referred to as CPU), and may also be other general-purpose processors, a digital signal processor (Digital Signal Processor, referred to as DSP), an application specific integrated circuit (Application Specific Integrated Circuit, referred to as ASIC) and so on. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like. The steps of the method disclosed in conjunction with the invention can be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor.

存储器可能包含高速RAM存储器,也可能还包括非易失性存储NVM,例如至少一个磁盘存储器,还可以为U盘、移动硬盘、只读存储器、磁盘或光盘等。The storage may include a high-speed RAM memory, and may also include a non-volatile storage NVM, such as at least one disk storage, and may also be a U disk, a mobile hard disk, a read-only memory, a magnetic disk, or an optical disk.

总线可以是工业标准体系结构(Industry Standard Architecture,简称ISA)总线、外部设备互连(Peripheral Component Interconnect,简称PCI)总线或扩展工业标准体系结构(Extended Industry Standard Architecture,简称EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示,本申请附图中的总线并不限定仅有一根总线或一种类型的总线。The bus may be an Industry Standard Architecture (Industry Standard Architecture, ISA for short) bus, a Peripheral Component Interconnect (PCI for short) bus, or an Extended Industry Standard Architecture (EISA for short) bus. The bus can be divided into address bus, data bus, control bus and so on. For ease of representation, the buses in the drawings of the present application are not limited to only one bus or one type of bus.

上述存储介质可以是由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。存储介质可以是通用或专用计算机能够存取的任何可用介质。The above-mentioned storage medium can be realized by any type of volatile or non-volatile storage device or their combination, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable In addition to programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.

一种示例性的存储介质耦合至处理器,从而使处理器能够从该存储介质读取信息,且可向该存储介质写入信息。当然,存储介质也可以是处理器的组成部分。处理器和存储介质可以位于专用集成电路(Application Specific Integrated Circuits,简称ASIC)中。当然,处理器和存储介质也可以作为分立组件存在于电子设备或主控设备中。An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be a component of the processor. The processor and the storage medium may be located in application specific integrated circuits (Application Specific Integrated Circuits, ASIC for short). Of course, the processor and the storage medium can also exist in the electronic device or the main control device as discrete components.

本领域普通技术人员可以理解:实现上述各方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成。前述的程序可以存储于一计算机可读取存储介质中。该程序在执行时,执行包括上述各方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps for implementing the above method embodiments can be completed by program instructions and related hardware. The aforementioned program can be stored in a computer-readable storage medium. When the program is executed, it executes the steps including the above-mentioned method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other various media that can store program codes.

本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本申请的真正范围和精神由下面的权利要求书指出。Other embodiments of the present application will be readily apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any modification, use or adaptation of the application, these modifications, uses or adaptations follow the general principles of the application and include common knowledge or conventional technical means in the technical field not disclosed in the application . The specification and examples are to be considered exemplary only, with a true scope and spirit of the application indicated by the following claims.

应当理解的是,本申请并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求书来限制。It should be understood that the present application is not limited to the precise constructions which have been described above and shown in the accompanying drawings, and various modifications and changes may be made without departing from the scope thereof. The scope of the application is limited only by the appended claims.

Claims (12)

1. A form processing method, comprising:
acquiring a source form image;
extracting a table region in the source table image by adopting a histogram statistical method to obtain a pre-extracted table region;
detecting whether the target attribute exists in the pre-extracted form area or not, and screening the target form area from the pre-extracted form area according to a detection result;
and positioning the text part in the target table area to obtain a table image with text positioning.
2. The method of claim 1, wherein extracting the table regions in the source table image using histogram statistics results in pre-extracted table regions, comprising:
carrying out gray level binarization processing on the source table image to obtain an image after gray level binarization processing;
performing vertical histogram statistical processing on the image subjected to gray level binarization processing to obtain an image subjected to vertical histogram statistical processing;
Performing minimum external rectangle detection on the image subjected to the vertical histogram statistical processing, and determining the transverse boundary of a table area;
performing horizontal histogram statistical processing on the image subjected to gray level binarization processing to obtain an image subjected to horizontal histogram statistical processing;
performing minimum circumscribed rectangle detection on the image subjected to the transverse histogram statistical processing to determine a longitudinal boundary of a table area;
determining a rectangle of the form area according to the transverse boundary and the longitudinal boundary;
screening the rectangles of the table areas according to a predefined first screening rule to obtain the rectangles of the screened table areas;
and extracting the pre-extracted form area from the source form image based on the rectangle of the screened form area.
3. The method according to claim 2, further comprising, after said subjecting said gray-level binarized image to a lateral histogram statistical process, the step of:
performing transverse expansion processing on the image subjected to the transverse histogram statistical processing through a predefined first expansion check to obtain an image subjected to the transverse expansion processing;
Correspondingly, the detecting the minimum circumscribed rectangle of the image after the statistics processing of the transverse histogram to determine the longitudinal boundary of the table area includes:
and detecting the minimum circumscribed rectangle of the image after the transverse expansion processing, and determining the longitudinal boundary of the table area.
4. The method of claim 1, wherein the target property is a transverse straight line;
correspondingly, the detecting whether the target attribute exists in the pre-extracted form area, and screening the target form area from the pre-extracted form area according to the detection result includes:
performing gray level binarization on the pre-extracted form area to obtain a pre-extracted form area with gray level binarization;
extracting a transverse line segment in the pre-extracted form area of the gray level binarization by adopting a morphological method;
reconstructing the transverse line segment into a transverse straight line by adopting a straight line extraction algorithm;
detecting whether a transverse straight line exists in each pre-extracted table area to obtain a detection result;
and determining all the pre-extracted table areas with the detection result of yes as target table areas.
5. The method of claim 1, wherein locating the text portion in the target form area results in a text located form image, comprising:
Carrying out graying treatment on the target table area to obtain a target table area after graying treatment;
performing edge detection on the target table area subjected to the graying treatment to obtain a target table area subjected to edge detection;
binarizing the target table area after edge detection to obtain a binarized target table area;
performing corrosion treatment on the binarized target table area, and removing vertical lines in the binarized target table area to obtain a target table area with the vertical lines removed;
processing the target form area after the vertical line is removed according to a predefined second expansion core and a corrosion core to obtain a target form area after corrosion processing;
performing expansion treatment again on the target table area after the corrosion treatment according to a predefined third expansion check to obtain the target table area after the expansion treatment;
searching a rectangular outline in the expanded target table area, and positioning the characters;
screening the rectangular outline according to a predefined second screening rule;
and extracting the region containing the screened rectangular outline from the target table region to obtain a table image with text positioning.
6. The method according to any one of claims 1 to 5, further comprising, after said locating the text portion in the target form area to obtain a text located form image:
and recognizing characters in the form image, and generating a spreadsheet according to the characters.
7. A form processing apparatus, comprising:
the acquisition module is used for acquiring the source form image;
the pre-extraction module is used for extracting the table areas in the source table image by adopting a histogram statistical method to obtain pre-extracted table areas;
the screening module is used for detecting whether the target attribute exists in the pre-extracted form area or not, and screening the target form area from the pre-extracted form area according to a detection result;
and the positioning module is used for positioning the text part in the target table area to obtain a table image of text positioning.
8. The apparatus of claim 7, wherein the pre-extraction module is specifically configured to: carrying out gray level binarization processing on the source table image to obtain an image after gray level binarization processing; performing vertical histogram statistical processing on the image subjected to gray level binarization processing to obtain an image subjected to vertical histogram statistical processing; performing minimum external rectangle detection on the image subjected to the vertical histogram statistical processing, and determining the transverse boundary of a table area; performing horizontal histogram statistical processing on the image subjected to gray level binarization processing to obtain an image subjected to horizontal histogram statistical processing; performing minimum circumscribed rectangle detection on the image subjected to the transverse histogram statistical processing to determine a longitudinal boundary of a table area; determining a rectangle of the form area according to the transverse boundary and the longitudinal boundary; screening the rectangles of the table areas according to a predefined first screening rule to obtain the rectangles of the screened table areas; and extracting the pre-extracted form area from the source form image based on the rectangle of the screened form area.
9. The apparatus of claim 8, wherein the positioning module is specifically configured to: carrying out graying treatment on the target table area to obtain a target table area after graying treatment; performing edge detection on the target table area subjected to the graying treatment to obtain a target table area subjected to edge detection; binarizing the target table area after edge detection to obtain a binarized target table area; performing corrosion treatment on the binarized target table area, and removing vertical lines in the binarized target table area to obtain a target table area with the vertical lines removed; processing the target form area after the vertical line is removed according to a predefined second expansion core and a corrosion core to obtain a target form area after corrosion processing; performing expansion treatment again on the target table area after the corrosion treatment according to a predefined third expansion check to obtain the target table area after the expansion treatment; searching a rectangular outline in the expanded target table area, and positioning the characters; screening the rectangular outline according to a predefined second screening rule; and extracting the region containing the screened rectangular outline from the target table region to obtain a table image with text positioning.
10. A form processing system, comprising:
the camera is used for acquiring a source form image;
the server comprises: at least one processor and memory;
the memory stores computer-executable instructions;
the processor executes computer-executable instructions stored in the memory to implement the method of any one of claims 1 to 6.
11. A computer storage medium having stored therein computer executable instructions which when executed by a processor implement the method of any of claims 1 to 6.
12. A computer program product comprising a computer program, characterized in that the computer program, when executed by a processor, implements the form processing method of any one of claims 1 to 6.
CN202310389640.5A 2023-04-12 2023-04-12 Table processing method, device, system and storage medium Pending CN116524525A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310389640.5A CN116524525A (en) 2023-04-12 2023-04-12 Table processing method, device, system and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310389640.5A CN116524525A (en) 2023-04-12 2023-04-12 Table processing method, device, system and storage medium

Publications (1)

Publication Number Publication Date
CN116524525A true CN116524525A (en) 2023-08-01

Family

ID=87402205

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310389640.5A Pending CN116524525A (en) 2023-04-12 2023-04-12 Table processing method, device, system and storage medium

Country Status (1)

Country Link
CN (1) CN116524525A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101366020A (en) * 2005-12-21 2009-02-11 微软公司 Table detection in ink notes
CN110363095A (en) * 2019-06-20 2019-10-22 华南农业大学 A kind of recognition methods for table font
CN111079756A (en) * 2018-10-19 2020-04-28 杭州萤石软件有限公司 Method and equipment for extracting and reconstructing table in document image
WO2022134771A1 (en) * 2020-12-23 2022-06-30 深圳壹账通智能科技有限公司 Table processing method and apparatus, and electronic device and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101366020A (en) * 2005-12-21 2009-02-11 微软公司 Table detection in ink notes
CN111079756A (en) * 2018-10-19 2020-04-28 杭州萤石软件有限公司 Method and equipment for extracting and reconstructing table in document image
CN110363095A (en) * 2019-06-20 2019-10-22 华南农业大学 A kind of recognition methods for table font
WO2022134771A1 (en) * 2020-12-23 2022-06-30 深圳壹账通智能科技有限公司 Table processing method and apparatus, and electronic device and storage medium

Similar Documents

Publication Publication Date Title
CN107085726A (en) Single character location method in oracle bone rubbings based on multi-method denoising and connected region analysis
CN111986183B (en) Chromosome scattered image automatic segmentation and identification system and device
TW201740316A (en) Image text recognition method and device
CN104361336A (en) Character recognition method for underwater video images
CN114049499A (en) Target object detection method, device and storage medium for continuous contour
CN113112567B (en) Method, device, electronic device and storage medium for generating editable flow chart
CN111507344A (en) Method and device for recognizing text from images
CN103824091A (en) Vehicle license plate recognition method for intelligent transportation system
Tan et al. License plate localization based on edge-geometrical features using morphological approach
Azad et al. New method for optimization of license plate recognition system with use of edge detection and connected component
CN118314590A (en) Table identification and reconstruction method, system, electronic equipment and storage medium
CN112200789B (en) Image recognition method and device, electronic equipment and storage medium
CN116580397A (en) Pathological image recognition method, device, equipment and storage medium
CN111199240A (en) Training method of bank card identification model, and bank card identification method and device
Feild et al. Scene text recognition with bilateral regression
CN112101323B (en) Method, system, electronic device and storage medium for identifying title list
CN105869147B (en) A kind of amblyopia visual aid stray light recognition methods
CN113221778A (en) Method and device for detecting and identifying handwritten form
CN114627456A (en) Bill text information detection method, device and system
Liang et al. Robust table recognition for printed document images
CN116524525A (en) Table processing method, device, system and storage medium
CN112200053A (en) Table identification method fusing local features
Rajan et al. Text detection and character extraction in natural scene images using fractional Poisson model
CN111932515B (en) Short circuit detection method and system for product residual defects and defect classification system
CN117409433A (en) Regional contour extraction algorithm based on text features

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination