[go: up one dir, main page]

TWI765422B - Data capturing method, template generating method and non-transitory computer readable storage medium - Google Patents

Data capturing method, template generating method and non-transitory computer readable storage medium Download PDF

Info

Publication number
TWI765422B
TWI765422B TW109140874A TW109140874A TWI765422B TW I765422 B TWI765422 B TW I765422B TW 109140874 A TW109140874 A TW 109140874A TW 109140874 A TW109140874 A TW 109140874A TW I765422 B TWI765422 B TW I765422B
Authority
TW
Taiwan
Prior art keywords
data
capture
information
anchor point
content
Prior art date
Application number
TW109140874A
Other languages
Chinese (zh)
Other versions
TW202221489A (en
Inventor
田金山
莊竣評
Original Assignee
全友電腦股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 全友電腦股份有限公司 filed Critical 全友電腦股份有限公司
Priority to TW109140874A priority Critical patent/TWI765422B/en
Application granted granted Critical
Publication of TWI765422B publication Critical patent/TWI765422B/en
Publication of TW202221489A publication Critical patent/TW202221489A/en

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A data capturing method, a template generating method and non-transitory computer readable storage medium are provided. The data capturing method reads a to-be-processed data and a capturing template, the to-be-processed data including a to-be-processed data content, the capturing template including a capturing information and a verifying information, the capturing information including an anchor point information and a capturing range relative to the anchor point information. The data capturing method retrieves the reference point from the to-be-processed data content with the anchor point information. The data capturing method captures a capturing data relative to the reference point on the to-be-processed data content. The data capturing method screens the capturing data with the verifying information and generates a result data.

Description

資料擷取方法、樣板生成方法及非暫態電腦可讀取記錄媒體Data acquisition method, template generation method, and non-transitory computer-readable recording medium

本案係有關一種資料擷取方法、一種用於資料擷取之樣板之生成方法以及一種非暫態電腦可讀取記錄媒體。This case relates to a data acquisition method, a method for generating a template for data acquisition, and a non-transitory computer-readable recording medium.

習知技術在摘錄書面或數位資料之摘要內容的過程,受限於資料之間部分內容的歧異,僅能透過人工方式逐一審閱每份文件再擷取資料。然而,如此方式不僅效率低落且易產生人為錯誤。In the process of extracting the abstract content of written or digital data, the conventional technology is limited by the difference of some content between the data, and can only manually review each document one by one and then extract the data. However, this approach is not only inefficient but also prone to human error.

舉例而言,同一醫院在不同科別所使用的初診資料表可能不完全相同。對於皮膚科與心臟科的初診資料表,兩者可能皆包含患者基本資料、緊急聯絡人、過敏藥物等欄位資料。但對於心臟科的初診資料表,可能還包含是否有植入物、是否對顯影劑過敏、是否有中風病史等問題欄位。該些內容差異導致排板格式的差異。因此,當醫院管理人員欲將各科病患的初診資料表之基本資料欄位內容擷取歸檔時,僅能以人工方式逐一處理。For example, the initial consultation data sheets used by different departments in the same hospital may not be exactly the same. For the initial diagnosis data sheet of dermatology and cardiology, both may include the basic information of the patient, emergency contact person, allergy medicine and other fields. However, for the initial diagnosis data sheet of the Department of Cardiology, it may also include questions such as whether there are implants, whether there is an allergy to contrast agents, and whether there is a history of stroke. These content differences lead to differences in layout formats. Therefore, when hospital administrators want to retrieve and archive the contents of the basic data fields of the initial diagnosis data table of patients in various departments, they can only process them one by one manually.

有鑑於此,申請人提出一種資料擷取方法。該資料擷取方法讀取一待處理資料及一擷取樣板,該待處理資料包含一待處理資料內容,該擷取樣板包含一擷取資訊與一驗證資訊,該擷取資訊包含一錨點資訊及相對於該錨點資訊之一擷取範圍。該資料擷取方法根據該錨點資訊檢索該待處理資料內容以獲得符合該錨點資訊之一參考點。該資料擷取方法依據該擷取範圍於該待處理資料擷取相對於該參考點之一擷取資料。該資料擷取方法更根據該驗證資訊篩選該擷取資料,以產生一結果資料。In view of this, the applicant proposes a data acquisition method. The data capture method reads a data to be processed and a capture sample board, the data to be processed includes a content of the data to be processed, the capture sample board includes a capture information and a verification information, the capture information includes an anchor point Information and a retrieval range relative to one of the anchor information. The data retrieval method retrieves the to-be-processed data content according to the anchor point information to obtain a reference point corresponding to the anchor point information. The data retrieval method retrieves data relative to one of the reference points in the to-be-processed data retrieval according to the retrieval range. The data retrieval method further filters the captured data according to the verification information to generate a result data.

申請人亦提出一種非暫態電腦可讀取記錄媒體用以儲存一應用程式。該應用程式用以讀取一待處理資料及一擷取樣板,該待處理資料包含一待處理資料內容,該擷取樣板包含一擷取資訊與一驗證資訊,該擷取資訊包含一錨點資訊及相對於該錨點資訊之一擷取範圍。該應用程式根據該錨點資訊檢索該待處理資料內容以獲得符合該錨點資訊之一參考點。該應用程式依據該擷取範圍於該待處理資料擷取相對於該參考點之一擷取資料。該應用程式更根據該驗證資訊篩選該擷取資料,以產生一結果資料。The applicant also proposes a non-transitory computer-readable recording medium for storing an application program. The application is used for reading a pending data and a capture sample board, the pending data includes a content of the pending data, the capture sample board includes a capture information and a verification information, the capture information includes an anchor point Information and a retrieval range relative to one of the anchor information. The application retrieves the content of the pending data according to the anchor information to obtain a reference point that matches the anchor information. The application retrieves data relative to one of the reference points in the pending data retrieval according to the retrieval range. The application program further filters the captured data according to the authentication information to generate a result data.

申請人還提出一種樣板生成方法。該樣板生成方法用以產生一擷取樣板,該擷取樣板用以擷取一待處理資料,該待處理資料包含一待處理資料內容。該樣板生成方法讀取一原始資料,該原始資料包含一原始資料內容,該原始資料內容包含一特徵資料及相對於該特徵資料之一資料區塊。該樣板生成方法擷取該原始資料內容之特徵資料而定義一錨點資訊,該錨點資訊用以檢索該待處理資料內容中符合該錨點資訊之一參考點。該樣板生成方法擷取該原始資料內容之資料區塊而定義一擷取範圍,該擷取範圍用以擷取該待處理資料內容中符合該擷取範圍之一擷取資料。該樣板生成方法產生該錨點資訊與該擷取範圍之一相對位置關係。該樣板生成方法定義一擷取資訊,該擷取資訊包含該錨點資訊、該擷取範圍及該相對位置關係。該樣板生成方法定義一驗證資訊,該驗證資訊用以篩選該擷取資料。該樣板生成方法更輸出該擷取樣板,該擷取樣板包含該擷取資訊及該驗證資訊。The applicant also proposes a template generation method. The template generating method is used for generating a capture sample board, and the capture sample board is used for capturing a data to be processed, and the data to be processed includes a content of the data to be processed. The template generating method reads an original data, the original data includes an original data content, and the original data content includes a characteristic data and a data block corresponding to the characteristic data. The template generation method retrieves characteristic data of the original data content to define an anchor point information, and the anchor point information is used for retrieving a reference point in the to-be-processed data content that matches the anchor point information. The template generating method retrieves a data block of the original data content to define a capture range, and the capture range is used to capture a capture data in the to-be-processed data content that conforms to the capture range. The template generation method generates a relative positional relationship between the anchor point information and the capture range. The template generation method defines a capture information, the capture information includes the anchor point information, the capture range and the relative position relationship. The template generation method defines a verification information for filtering the captured data. The template generating method further outputs the capture sample plate, the capture sample plate including the capture information and the verification information.

圖1係依據一些實施例之資料擷取方法之流程圖,請參照圖1。資料擷取方法讀取待處理資料1(步驟S01),待處理資料1包含待處理資料內容11。資料擷取方法亦讀取擷取樣板3(步驟S02),擷取樣板3包含擷取資訊與驗證資訊33,擷取資訊包含錨點資訊32及相對於錨點資訊32之擷取範圍31。其後,資料擷取方法根據錨點資訊32檢索待處理資料內容11以獲得符合錨點資訊32之參考點111(步驟S03)。資料擷取方法依據擷取範圍31於待處理資料1擷取相對於參考點111之擷取資料112(步驟S04)。最後,資料擷取方法根據驗證資訊33篩選擷取資料112(步驟S05),以產生結果資料(步驟S06)。前述步驟並非必須採順序方式執行。舉例而言,將步驟S01及步驟S02順序對調。FIG. 1 is a flowchart of a data acquisition method according to some embodiments, please refer to FIG. 1 . The data retrieval method reads the data 1 to be processed (step S01 ). The data 1 to be processed includes the content 11 of the data to be processed. The data capture method also reads the capture sample board 3 (step S02 ). The capture sample board 3 includes capture information and verification information 33 . The capture information includes anchor point information 32 and a capture range 31 relative to the anchor point information 32 . Thereafter, the data retrieval method retrieves the data content 11 to be processed according to the anchor point information 32 to obtain the reference point 111 that matches the anchor point information 32 (step S03 ). The data capture method captures the captured data 112 relative to the reference point 111 from the data to be processed 1 according to the capture range 31 (step S04 ). Finally, the data retrieval method filters the captured data 112 according to the verification information 33 (step S05 ) to generate result data (step S06 ). The foregoing steps do not necessarily have to be performed in a sequential manner. For example, the order of step S01 and step S02 is reversed.

資料擷取方法可以但不限於運作於伺服器、手機、個人電腦或平板電腦等裝置。依據一些實施例,資料擷取方法為運作於伺服器6之背景程式,並在偵測到待處理資料1時自動執行資料擷取程序。依據一些實施例,資料擷取方法為儲存在非暫態電腦可讀取記錄媒體之應用程式。非暫態電腦可讀取記錄媒體可以是但不限於光碟、磁碟、隨身碟、硬碟等儲存媒體。The data capture method can be, but is not limited to, operating on devices such as servers, mobile phones, personal computers or tablet computers. According to some embodiments, the data capture method is a background program that operates on the server 6 and automatically executes the data capture process when the pending data 1 is detected. According to some embodiments, the data retrieval method is an application program stored on a non-transitory computer-readable recording medium. The non-transitory computer-readable recording medium may be, but is not limited to, storage media such as optical disks, magnetic disks, pen drives, and hard disks.

圖2A係依據一些實施例之待處理資料之示意圖,請參照圖2A。待處理資料1可以是但不限於可攜式文件格式(Portable Document Format,PDF)、文字格式、影像格式或前述格式之組合。待處理資料1包含待處理資料內容11。待處理資料內容11可以是但不限於文字、圖片、符號、圖案、註記、孔洞及前述資料之結合。FIG. 2A is a schematic diagram of data to be processed according to some embodiments, please refer to FIG. 2A . The data to be processed 1 may be, but not limited to, a Portable Document Format (PDF), a text format, an image format, or a combination of the foregoing formats. The data to be processed 1 contains the content 11 of the data to be processed. The data content 11 to be processed may be, but not limited to, text, pictures, symbols, patterns, annotations, holes and combinations of the foregoing data.

資料擷取方法根據擷取樣板3而擷取待處理資料內容11,以獲得擷取資料112。依據一些實施例,擷取樣板3為預設的參考資訊。擷取樣板3包含擷取資訊與驗證資訊33,該擷取資訊包含錨點資訊32及相對於該錨點資訊32之擷取範圍31。錨點資訊32可以是但不限於文字、圖片、符號、圖案、註記、孔洞及前述資料之結合。舉例而言,錨點資訊32為活頁夾書面文件上的打孔孔洞;舉例而言,錨點資訊32為手寫書面文件上的打勾註記;舉例而言,錨點資訊32為文件上的螢光標示註記。圖2B係依據一些實施例之文字內容之擷取資料之示意圖,請參照圖2B。舉例而言,錨點資訊32為文字「Heart Rate」,資料擷取方法於待處理資料內容11中檢索文字「Heart Rate」而獲得參考點111。參考點111為文字「Heart Rate」在待處理資料內容11上的座標位置。The data capture method captures the data content 11 to be processed according to the capture sample board 3 to obtain the captured data 112 . According to some embodiments, the sampling plate 3 is the default reference information. The capture sample board 3 includes capture information and verification information 33 . The capture information includes anchor point information 32 and a capture range 31 relative to the anchor point information 32 . The anchor point information 32 can be, but is not limited to, text, pictures, symbols, patterns, annotations, holes and combinations of the foregoing. For example, the anchor point information 32 is a punch hole on a written document in a binder; for example, the anchor point information 32 is a check mark on a handwritten written document; for example, the anchor point information 32 is a firefly on the document Light mark annotation. FIG. 2B is a schematic diagram of capturing data of text content according to some embodiments, please refer to FIG. 2B . For example, the anchor point information 32 is the text "Heart Rate", and the data retrieval method searches the text "Heart Rate" in the data content 11 to be processed to obtain the reference point 111 . The reference point 111 is the coordinate position of the word "Heart Rate" on the data content 11 to be processed.

擷取範圍31之座標位置相對於參考點111之座標位置,其相對位置關係為擷取樣板3內預設的參考資訊;擷取範圍31之形狀及大小亦為擷取樣板3內預設的參考資訊。對於單一參考點111,擷取範圍31可以為一個或多個。例如圖2B之擷取資料112即為相對於文字「Heart Rate」的一個矩形區塊。依據一些實施例,擷取資料112可以不包含參考點111。The coordinate position of the capture area 31 is relative to the coordinate position of the reference point 111 , and the relative positional relationship is the reference information preset in the capture sample panel 3 ; the shape and size of the capture area 31 are also preset in the capture sample panel 3 . Reference information. For a single reference point 111, the capture range 31 may be one or more. For example, the captured data 112 in FIG. 2B is a rectangular block corresponding to the text "Heart Rate". According to some embodiments, the retrieved data 112 may not include the reference point 111 .

驗證資訊33可以是但不限於文字、圖片、符號、圖案、註記、孔洞及前述資料之結合。驗證資訊33用於篩選擷取資料112,依據一些實施例,擷取資料112包含所有驗證資訊33。舉例而言,驗證資訊33為文字「Total Beats」、「Min HR」、「Max HR」及「Beat analyzed %」,當擷取範圍31內之待處理資料內容11包含前述驗證資訊33時,則擷取資料112為所欲存取之資料。相反而言,請一併參照圖2A及圖2C,圖2A之待處理資料內容11共包含兩個文字「Heart Rate」,其中之一位於圖2B所例示之參考點111,另一個位於圖2C所例示之參考點111(即文字「Heart Rate Variability」中的文字「Heart Rate」)。同前述圖2B之例示,資料擷取方法亦擷取相對於文字「Heart Rate」之相同大小之矩形區塊。然而,圖2C擷取範圍31內之待處理資料內容11並不包含驗證資訊33「Total Beats」、「Min HR」、「Max HR」及「Beat analyzed %」,因此,擷取資料112為所欲篩除之資料。依據一些實施例,驗證資訊33包含驗證公式,資料擷取方法根據驗證公式篩選擷取資料112。以圖2B為例,當驗證公式為驗證資訊33之文字交集(「Total Beats」AND「Min HR」AND「Max HR」AND「Beat analyzed %」)時,圖2B之擷取資料112為所欲存取之資料;當驗證公式為驗證資訊33之文字聯集(「Total Beats」OR「Min HR」OR「Max HR」OR「Beat analyzed %」)時,圖2B之擷取資料112為所欲存取之資料;當驗證公式為(「Total Beats」AND「Min HR」AND「Max HR」NOT「Beat analyzed %」)時,圖2B之擷取資料112為所欲篩除之資料。The verification information 33 can be, but is not limited to, text, pictures, symbols, patterns, notes, holes and combinations of the foregoing. The verification information 33 is used to filter the captured data 112 , which, according to some embodiments, includes all of the verification information 33 . For example, the verification information 33 is the text "Total Beats", "Min HR", "Max HR" and "Beat analyzed %". When the content 11 of the data to be processed in the extraction range 31 includes the aforementioned verification information 33, then The retrieved data 112 is the data to be accessed. On the contrary, please refer to FIG. 2A and FIG. 2C together. The data content 11 to be processed in FIG. 2A includes two words “Heart Rate”, one of which is located at the reference point 111 illustrated in FIG. 2B and the other is located at the reference point 111 shown in FIG. 2C The illustrated reference point 111 (ie the text "Heart Rate" in the text "Heart Rate Variability"). As with the aforementioned example of FIG. 2B , the data capture method also captures a rectangular block of the same size relative to the text “Heart Rate”. However, the content 11 of the data to be processed within the capture range 31 in FIG. 2C does not include the verification information 33 "Total Beats", "Min HR", "Max HR" and "Beat analyzed %", therefore, the captured data 112 is Data to be filtered out. According to some embodiments, the verification information 33 includes a verification formula, and the data retrieval method filters the captured data 112 according to the verification formula. Taking FIG. 2B as an example, when the verification formula is the text intersection of verification information 33 (“Total Beats” AND “Min HR” AND “Max HR” AND “Beat analyzed %”), the captured data 112 in FIG. 2B can be whatever you want Data to be accessed; when the verification formula is the text union of verification information 33 (“Total Beats” OR “Min HR” OR “Max HR” OR “Beat analyzed %”), the extracted data 112 in FIG. 2B can do whatever you want Data to be accessed; when the verification formula is (“Total Beats” AND “Min HR” AND “Max HR” NOT “Beat analyzed %”), the captured data 112 in FIG. 2B is the data to be filtered.

依據一些實施例,擷取資料112可以為待處理資料內容11上的特定文字,亦可以為特定圖片。圖3係依據一些實施例之圖片內容之擷取資料之示意圖,請參照圖3,擷取資料112為待處理資料內容11上的圖表,而符合驗證資訊33之待處理資料內容11為圖表上的座標軸數字。另外,於本實施例中,符合驗證資訊33之待處理資料內容11亦可以為圖表上的座標欄位線所形成之數個直線紋圖案,抑或是包含直線紋圖案及特定座標軸數字之交集。According to some embodiments, the captured data 112 may be a specific text on the data content 11 to be processed, or a specific image. 3 is a schematic diagram of captured data of image content according to some embodiments, please refer to FIG. 3 , the captured data 112 is a graph on the data content 11 to be processed, and the data content 11 to be processed that conforms to the verification information 33 is on the graph axis numbers. In addition, in this embodiment, the to-be-processed data content 11 conforming to the verification information 33 may also be several linear patterns formed by the coordinate field lines on the chart, or may include the intersection of the linear patterns and specific coordinate axis numbers.

資料擷取方法完成擷取資料112之篩選後,產生結果資料。結果資料可以是但不限於可攜式文件格式(Portable Document Format,PDF)、文字格式、影像格式或前述格式之組合。單筆結果資料可以包含一個或多個擷取資料112。After the data retrieval method completes the screening of the retrieved data 112, the result data is generated. The resulting data may be, but not limited to, Portable Document Format (PDF), text format, image format, or a combination of the foregoing formats. A single piece of result data may include one or more retrieved data 112 .

圖4係依據一些實施例之來源及目標資料庫之方塊圖,請參照圖4。依據一些實施例,資料擷取方法運行於伺服器6,並存取至少一來源資料庫4。舉例而言,來源資料庫4可以為超音波檢查室之資料庫、電腦斷層檢查室之資料庫、血液檢查室之資料庫。各個來源資料庫4儲存不同種類之患者檢查報告,即待處理資料1。依據一些實施例,資料擷取方法偵測各個來源資料庫4之儲存內容,當來源資料庫4被置入新的待處理資料1時,即處理前述新的待處理資料1而產生相對應之結果資料。依據一些實施例,資料擷取方法輸出結果資料到至少一目標資料庫5。舉例而言,目標資料庫5可以為心臟科資料庫、骨科資料庫、內分泌科資料庫。對於心臟科所需之結果資料,可能應包含超音波檢查報告中心臟部分之圖片或血液檢查報告中總膽固醇量之數據;對於骨科所需之結果資料,可能應包含電腦斷層檢查報告中股骨部分之圖片及血液檢查報告中血鈣濃度之數據;對於內分泌科所需之結果資料,可能應包含血液檢查報告中雄性激素濃度之數據。因此,就血液檢查室之檢查報告而言,各科室所需要的數據有所不同。於本實施例中,資料擷取方法依據各科室預先設定之擷取樣板3,達到客製化地提供各科室所需要的資料。依據一些實施例,擷取樣板3包含有目標資料庫位址資訊,例如資料夾位址,資料擷取方法根據各個擷取樣板3產生相對應之結果資料,再將前述結果資料根據目標資料庫位址資訊輸出到相對應之目標資料庫5。FIG. 4 is a block diagram of source and target databases according to some embodiments, please refer to FIG. 4 . According to some embodiments, the data retrieval method runs on the server 6 and accesses at least one source database 4 . For example, the source database 4 may be a database of an ultrasound examination room, a database of a computed tomography room, or a database of a blood examination room. Each source database 4 stores different types of patient examination reports, ie, pending data 1 . According to some embodiments, the data retrieval method detects the storage content of each source database 4, and when the source database 4 is placed with new data 1 to be processed, it processes the aforementioned new data 1 to be processed to generate corresponding data. result data. According to some embodiments, the data retrieval method outputs result data to at least one target database 5 . For example, the target database 5 can be a cardiology database, an orthopedic database, and an endocrinology database. For the results required by the cardiology department, it may include the picture of the heart part in the ultrasound examination report or the data of the total cholesterol level in the blood test report; for the result data required by the orthopedics department, it may include the femur part in the computer tomography examination report. The picture and the blood calcium concentration data in the blood test report; for the result data required by the endocrinology department, the data of the male hormone concentration in the blood test report may be included. Therefore, the data required by each department is different for the examination report of the blood laboratory. In this embodiment, the data acquisition method is based on the pre-set acquisition sample board 3 of each department, so as to provide customized data required by each department. According to some embodiments, the capture sample board 3 includes target database address information, such as a folder address, the data capture method generates corresponding result data according to each capture sample board 3, and then uses the aforementioned result data according to the target database. The address information is output to the corresponding target database 5 .

本案更提供一種擷取樣板3之生成方法。樣板生成方法用於設定並產生擷取樣板3,擷取樣板3被資料擷取方法讀取而被用於擷取待處理資料1。圖5係依據一些實施例之樣板生成方法之流程圖,請參照圖5。樣板生成方法讀取原始資料2(步驟S11),原始資料2包含原始資料內容21。樣板生成方法擷取原始資料內容21之特徵資料而定義錨點資訊32(步驟S12)。樣板生成方法擷取原始資料內容21之資料區塊211而定義擷取範圍31(步驟S13)。樣板生成方法產生錨點資訊32與擷取範圍31之相對位置關係(步驟S14)。樣板生成方法定義擷取資訊,擷取資訊包含錨點資訊32、擷取範圍31及相對位置關係。樣板生成方法定義驗證資訊33(步驟S15)。樣板生成方法輸出擷取樣板3(步驟S16),擷取樣板3包含擷取資訊及驗證資訊33。前述步驟並非必須採順序方式執行。舉例而言,將步驟S12及步驟S13順序對調;舉例而言,將步驟S15提前至步驟S14之前。The present application further provides a method for generating the capture sample plate 3 . The template generation method is used to set and generate the capture sample plate 3 , and the capture sample plate 3 is read by the data capture method and used to capture the data to be processed 1 . FIG. 5 is a flowchart of a template generation method according to some embodiments, please refer to FIG. 5 . The template generation method reads the original data 2 (step S11 ). The original data 2 includes the original data content 21 . The template generation method extracts the feature data of the original data content 21 to define the anchor point information 32 (step S12). The template generation method captures the data block 211 of the original data content 21 to define the capture range 31 (step S13). The template generation method generates the relative positional relationship between the anchor point information 32 and the capture range 31 (step S14). The template generation method defines capture information, and the capture information includes anchor point information 32 , capture range 31 and relative positional relationship. The template generation method defines the verification information 33 (step S15). The template generation method outputs a capture sample board 3 (step S16 ), and the capture sample board 3 includes the capture information and the verification information 33 . The foregoing steps do not necessarily have to be performed in a sequential manner. For example, the order of step S12 and step S13 are reversed; for example, step S15 is advanced before step S14.

樣板生成方法可以但不限於運作於伺服器、手機、個人電腦或平板電腦等裝置。依據一些實施例,樣板生成方法為運作於手機之應用程式,並在完成樣板之設定後輸出樣板。The template generation method can, but is not limited to, operate on devices such as servers, mobile phones, personal computers, or tablet computers. According to some embodiments, the template generation method is an application program running on the mobile phone, and the template is output after the template setting is completed.

圖6A係依據一些實施例之原始資料之示意圖,請參照圖6A。原始資料2包含原始資料內容21,原始資料內容21包含資料區塊211。圖6B係依據一些實施例之資料區塊之示意圖,請參照圖6B。原始資料內容21包含有特徵資料,例如文字「Heart Rate」,樣板生成方法擷取前述特徵資料並將其定義為錨點資訊32。樣板生成方法擷取原始資料內容21之資料區塊211而定義擷取範圍31。依據一些實施例,樣板生成方法將原始資料內容21顯示於顯示器。使用者於該顯示器框選之原始資料內容21之資料區塊211後,樣板生成方法將前述被框選之區塊定義為擷取範圍31,並存取擷取範圍31於該原始資料內容21之相對位置。所述顯示器可以為觸控或非觸控顯示器。所述框選可以採用但不限於滑鼠、觸控筆、手指等方式定義框選區域,亦可以採用鍵盤輸入座標值之方式定義框選區域。依據一些實施例,樣板生成方法擷取原始資料內容21之兩個特徵資料而定義第一錨點資訊321及第二錨點資訊322,並擷取位處第一錨點資訊321及第二錨點資訊322之間之資料區塊211,而定義擷取範圍31。圖7係依據另一些實施例之原始資料之示意圖,請參照圖7。舉例而言,第一錨點資訊321為文字「病史:」,第二錨點資訊322為文字「備註:」,樣板生成方法擷取文字「病史:」與文字「備註:」之間的原始資料內容21。FIG. 6A is a schematic diagram of source data according to some embodiments, please refer to FIG. 6A . The original data 2 includes the original data content 21 , and the original data content 21 includes the data block 211 . FIG. 6B is a schematic diagram of a data block according to some embodiments, please refer to FIG. 6B . The original data content 21 includes feature data, such as the text “Heart Rate”, and the template generation method captures the aforementioned feature data and defines it as anchor point information 32 . The template generation method captures the data block 211 of the original data content 21 to define the capture range 31 . According to some embodiments, the template generation method displays the raw material content 21 on the display. After the user frames the data block 211 of the original data content 21 on the display, the template generation method defines the framed block as the capture range 31 , and accesses the capture range 31 to the original data content 21 the relative position. The display may be a touch or non-touch display. The frame selection can be defined by, but not limited to, a mouse, a stylus, a finger, etc. to define a frame selection area, or a method of inputting coordinate values with a keyboard to define the frame selection area. According to some embodiments, the template generation method extracts two characteristic data of the original data content 21 to define the first anchor point information 321 and the second anchor point information 322, and extracts the first anchor point information 321 and the second anchor point information at the location The data block 211 between the point information 322 defines the capture range 31 . FIG. 7 is a schematic diagram of source data according to other embodiments, please refer to FIG. 7 . For example, the first anchor point information 321 is the text "medical history:", the second anchor point information 322 is the text "remarks:", and the template generation method captures the original text between the text "medical history:" and the text "remarks:" Data content 21.

樣板生成方法產生錨點資訊32與擷取範圍31之相對位置關係。依據一些實施例,相對位置關係為位置座標的數值差。舉例而言,定義圖6B之錨點資訊32之座標位置為文字「Heart Rate」之中心點座標,圖6B之擷取範圍31之座標位置為矩形區域之中心點座標,兩者之相對位置關係為兩個中心點座標的數值差。依據一些實施例,相對位置關係為所處行次的行次差。舉例而言,定義圖7之第一錨點資訊321之行次為文字「病史:」所處之行次,第二錨點資訊322之行次為文字「備註:」所處之行次,圖7之擷取範圍31之行次為相對於文字「病史:」所處之行次及文字「備註:」所處之行次之間的所有行次。The template generation method generates the relative positional relationship between the anchor point information 32 and the capture range 31 . According to some embodiments, the relative positional relationship is the numerical difference of the positional coordinates. For example, define the coordinate position of the anchor point information 32 in FIG. 6B as the coordinate of the center point of the text “Heart Rate”, and the coordinate position of the capture range 31 in FIG. 6B as the center point coordinate of the rectangular area, and the relative positional relationship between the two is the numerical difference of the coordinates of the two center points. According to some embodiments, the relative positional relationship is the row order difference of the row order. For example, define the line order of the first anchor point information 321 in FIG. 7 as the line order of the text "medical history:", and the line order of the second anchor point information 322 to be the line order of the text "Remarks:", The line order of the extraction range 31 in FIG. 7 is relative to the line order of the text "medical history:" and the line order of the text "Remarks:".

樣板生成方法定義驗證資訊33,驗證資訊33用以提供資料擷取方法篩選擷取資料112。舉例而言,圖6B之擷取範圍31應包含驗證資訊33文字「Total Beats」、「Min HR」、「Max HR」及「Beat analyzed %」;舉例而言,圖7之擷取範圍31應包含文字「主訴:」。依據一些實施例,樣板生成方法產生驗證資訊33與擷取範圍31之相對位置關係,並判斷擷取範圍31是否包含驗證資訊33。依據一些實施例,當樣板生成方法判斷擷取範圍31不包含驗證資訊33,樣板生成方法發出警示,以提醒使用者重新設定驗證資訊33之條件。The template generation method defines the verification information 33 , and the verification information 33 is used to provide the data capture method to filter the captured data 112 . For example, the capture range 31 in FIG. 6B should include the verification information 33 text “Total Beats”, “Min HR”, “Max HR” and “Beat analyzed %”; for example, the capture range 31 in FIG. 7 should Contains the text "Plain Complaint:". According to some embodiments, the template generation method generates the relative positional relationship between the verification information 33 and the capture area 31 , and determines whether the capture area 31 includes the verification information 33 . According to some embodiments, when the template generation method determines that the capture range 31 does not contain the verification information 33 , the template generation method issues a warning to remind the user to reset the conditions of the verification information 33 .

1:待處理資料 11:待處理資料內容 111:參考點 112:擷取資料 2:原始資料 21:原始資料內容 211:資料區塊 3:擷取樣板 31:擷取範圍 32:錨點資訊 321:第一錨點資訊 322:第二錨點資訊 33:驗證資訊 4:來源資料庫 5:目標資料庫 6:伺服器 S01-S06:步驟 S11-S16:步驟 1: Data to be processed 11: Content of pending data 111: Reference point 112: Retrieve data 2: original data 21: Source material content 211:Data block 3: Capture sample board 31: Capture range 32: Anchor Information 321: First Anchor Information 322: Second Anchor Information 33: Verification Information 4: Source Repository 5: Target database 6: Server S01-S06: Steps S11-S16: Steps

[圖1]係依據一些實施例之資料擷取方法之流程圖; [圖2A]係依據一些實施例之待處理資料之示意圖; [圖2B]係依據一些實施例之文字內容之擷取資料之示意圖; [圖2C]係依據另一些實施例之文字內容之擷取資料之示意圖; [圖3]係依據一些實施例之圖片內容之擷取資料之示意圖; [圖4]係依據一些實施例之來源及目標資料庫之方塊圖; [圖5]係依據一些實施例之樣板生成方法之流程圖; [圖6A]係依據一些實施例之原始資料之示意圖; [圖6B]係依據一些實施例之資料區塊之示意圖;以及 [圖7]係依據另一些實施例之原始資料之示意圖。 [FIG. 1] is a flowchart of a data acquisition method according to some embodiments; [FIG. 2A] is a schematic diagram of data to be processed according to some embodiments; [FIG. 2B] is a schematic diagram of data extraction of text content according to some embodiments; [FIG. 2C] is a schematic diagram of capturing data of text content according to other embodiments; [FIG. 3] is a schematic diagram of capturing data of image content according to some embodiments; [FIG. 4] is a block diagram of source and target databases according to some embodiments; [FIG. 5] is a flowchart of a template generation method according to some embodiments; [FIG. 6A] is a schematic diagram of raw data according to some embodiments; [FIG. 6B] is a schematic diagram of a data block according to some embodiments; and [FIG. 7] is a schematic diagram of source data according to other embodiments.

S01~S06:步驟 S01~S06: Steps

Claims (10)

一種資料擷取方法,包含:讀取一待處理資料及一擷取樣板,該待處理資料包含一待處理資料內容,該擷取樣板包含一擷取資訊與一驗證資訊,該擷取資訊包含一錨點資訊及相對於該錨點資訊之一擷取範圍;根據該錨點資訊檢索該待處理資料內容以獲得符合該錨點資訊之一參考點;依據該擷取範圍於該待處理資料擷取相對於該參考點之一擷取資料;以及根據該驗證資訊篩選該擷取資料,以產生一結果資料。 A data capture method, comprising: reading a data to be processed and a capture sample board, the data to be processed includes a content of the data to be processed, the capture sample board includes a capture information and a verification information, the capture information includes an anchor point information and a retrieval range relative to the anchor point information; retrieve the content of the data to be processed according to the anchor point information to obtain a reference point corresponding to the anchor point information; retrieving a captured data relative to the reference point; and filtering the captured data according to the verification information to generate a result data. 如請求項1所述之資料擷取方法,其中,該錨點資訊係選自文字、圖片、符號、圖案、註記、孔洞及其組合所構成的群組。 The data retrieval method of claim 1, wherein the anchor point information is selected from the group consisting of text, pictures, symbols, patterns, annotations, holes and combinations thereof. 如請求項1所述之資料擷取方法,其中,該驗證資訊包含一驗證公式,該資料擷取方法根據該驗證公式篩選該擷取資料。 The data capture method of claim 1, wherein the verification information includes a verification formula, and the data capture method filters the captured data according to the verification formula. 如請求項1所述之資料擷取方法,更輸出該結果資料到至少一目標資料庫,該擷取樣板更包含一目標資料庫位址資訊,該資料擷取方法讀取多個擷取樣板,並根據各該擷取樣板之目標資料庫位址資訊決定輸出該結果資料之該目標資料庫。 The data capture method according to claim 1 further outputs the result data to at least one target database, the capture sample board further includes address information of a target database, and the data capture method reads a plurality of capture sample boards , and determine the target database for outputting the result data according to the target database address information of each acquisition sample board. 如請求項4所述之資料擷取方法,更存取至少一來源資料庫,該來源資料庫用以儲存該待處理資料,該資料擷取方法偵測該至少一來源資料庫包含該待處理資料時,該資料擷取方法處理該待處理資料以產生該結果資料。 The data retrieval method according to claim 4 further accesses at least one source database for storing the pending data, and the data retrieval method detects that the at least one source database contains the pending data data, the data retrieval method processes the pending data to generate the result data. 一種非暫態電腦可讀取記錄媒體,用以儲存一資料擷取應用程式,該資料擷取應用程式用以:讀取一待處理資料及一擷取樣板,該待處理資料包含一待處理資料內容,該擷取樣板包含一擷取資訊與一驗證資訊,該擷取資訊包含一錨點資訊及相對於該錨點資訊之一擷取範圍;根據該錨點資訊檢索該待處理資料內容以獲得符合該錨點資訊之一參考點;依據該擷取範圍於該待處理資料擷取相對於該參考點之一擷取資料;以及根據該驗證資訊篩選該擷取資料,以產生一結果資料。 A non-transitory computer readable recording medium for storing a data acquisition application program for: reading a pending data and a sampling board, the pending data includes a pending data data content, the capture sample board includes a capture information and a verification information, the capture information includes an anchor point information and a capture range relative to the anchor point information; retrieve the to-be-processed data content according to the anchor point information to obtain a reference point that matches the anchor point information; to extract the data to be processed relative to one of the reference points according to the retrieval range; and to filter the captured data according to the verification information to generate a result material. 一種樣板生成方法,用以產生一擷取樣板,該擷取樣板用以擷取一待處理資料,該待處理資料包含一待處理資料內容,該樣板生成方法包含:讀取一原始資料,該原始資料包含一原始資料內容,該原始資料內容包含一特徵資料及相對於該特徵資料之一資料區塊;擷取該原始資料內容之特徵資料而定義一錨點資訊,該錨點資訊用以檢索該待處理資料內容中符合該錨點資訊之一參考點;擷取該原始資料內容之資料區塊而定義一擷取範圍,該擷取範圍用以擷取該待處理資料內容中符合該擷取範圍之一擷取資料;產生該錨點資訊與該擷取範圍之一相對位置關係;定義一擷取資訊,該擷取資訊包含該錨點資訊、該擷取範圍及該相對位置關係; 定義一驗證資訊,該驗證資訊用以篩選該擷取資料;以及輸出該擷取樣板,該擷取樣板包含該擷取資訊及該驗證資訊。 A template generation method is used to generate a capture sample board, the capture sample board is used to capture a data to be processed, the data to be processed includes a content of the data to be processed, the template generation method includes: reading an original data, the The original data includes an original data content, the original data content includes a feature data and a data block relative to the feature data; the feature data of the original data content is extracted to define an anchor point information, the anchor point information is used for Retrieve a reference point in the data content to be processed that matches the anchor point information; retrieve a data block of the original data content to define a capture range, and the capture range is used for retrieving the data content to be processed that matches the anchor point capture data in one of the capture ranges; generate a relative positional relationship between the anchor point information and the capture range; define capture information, the capture information includes the anchor point information, the capture range and the relative positional relationship ; A verification information is defined, the verification information is used to filter the captured data; and the capture sample board is output, the capture sample board includes the captured information and the verification information. 如請求項7所述之樣板生成方法,更包含顯示該原始資料內容於一顯示器,該樣板生成方法擷取於該顯示器所被框選之該原始資料內容之資料區塊而定義該擷取範圍,並存取該擷取範圍於該原始資料內容之相對位置。 The template generation method according to claim 7, further comprising displaying the original data content on a display, and the template generation method captures the data block of the original data content framed by the display to define the capture range , and access the retrieval range relative to the content of the original data. 如請求項7所述之樣板生成方法,更包括擷取該原始資料內容之特徵資料而定義一第一錨點資訊及一第二錨點資訊,並擷取該第一錨點資訊及該第二錨點資訊之間之該原始資料內容之資料區塊而定義該擷取範圍。 The template generation method according to claim 7, further comprising retrieving feature data of the original data content to define a first anchor point information and a second anchor point information, and retrieving the first anchor point information and the first anchor point information The data block of the original data content between the two anchor points defines the retrieval range. 如請求項7所述之樣板生成方法,更包括產生該驗證資訊與該擷取範圍之相對位置關係,當判斷該擷取範圍不包含該驗證資訊,樣板生成方法發出一警示。 The template generation method according to claim 7 further includes generating a relative positional relationship between the verification information and the capture range, and when it is determined that the capture range does not contain the verification information, the template generation method issues a warning.
TW109140874A 2020-11-20 2020-11-20 Data capturing method, template generating method and non-transitory computer readable storage medium TWI765422B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109140874A TWI765422B (en) 2020-11-20 2020-11-20 Data capturing method, template generating method and non-transitory computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109140874A TWI765422B (en) 2020-11-20 2020-11-20 Data capturing method, template generating method and non-transitory computer readable storage medium

Publications (2)

Publication Number Publication Date
TWI765422B true TWI765422B (en) 2022-05-21
TW202221489A TW202221489A (en) 2022-06-01

Family

ID=82594487

Family Applications (1)

Application Number Title Priority Date Filing Date
TW109140874A TWI765422B (en) 2020-11-20 2020-11-20 Data capturing method, template generating method and non-transitory computer readable storage medium

Country Status (1)

Country Link
TW (1) TWI765422B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI567570B (en) * 2011-06-15 2017-01-21 微軟技術授權有限責任公司 Techniques for generating custom objects representing content files
CN106484663A (en) * 2016-10-12 2017-03-08 天闻数媒科技(湖南)有限公司 A kind of extracting method of document content and device
CN106682233A (en) * 2017-01-16 2017-05-17 华侨大学 Method for Hash image retrieval based on deep learning and local feature fusion
CN106777038A (en) * 2016-12-09 2017-05-31 厦门大学 A kind of ultralow complexity image search method for retaining Hash based on sequence
CN109145143A (en) * 2018-08-03 2019-01-04 厦门大学 Sequence constraints hash algorithm in image retrieval
CN109726398A (en) * 2018-12-27 2019-05-07 北京奇安信科技有限公司 An entity identification and attribute judgment method, system, device and medium
JP6509486B2 (en) * 2012-11-20 2019-05-08 三星電子株式会社Samsung Electronics Co.,Ltd. Wearable electronic device
US10474701B2 (en) * 2005-07-15 2019-11-12 Indxit Systems, Inc. Systems and methods for document sorting
US10515140B1 (en) * 2007-03-30 2019-12-24 Amazon Technologies, Inc. Method and system for displaying items

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10474701B2 (en) * 2005-07-15 2019-11-12 Indxit Systems, Inc. Systems and methods for document sorting
US10515140B1 (en) * 2007-03-30 2019-12-24 Amazon Technologies, Inc. Method and system for displaying items
TWI567570B (en) * 2011-06-15 2017-01-21 微軟技術授權有限責任公司 Techniques for generating custom objects representing content files
JP6509486B2 (en) * 2012-11-20 2019-05-08 三星電子株式会社Samsung Electronics Co.,Ltd. Wearable electronic device
CN106484663A (en) * 2016-10-12 2017-03-08 天闻数媒科技(湖南)有限公司 A kind of extracting method of document content and device
CN106777038A (en) * 2016-12-09 2017-05-31 厦门大学 A kind of ultralow complexity image search method for retaining Hash based on sequence
CN106682233A (en) * 2017-01-16 2017-05-17 华侨大学 Method for Hash image retrieval based on deep learning and local feature fusion
CN109145143A (en) * 2018-08-03 2019-01-04 厦门大学 Sequence constraints hash algorithm in image retrieval
CN109726398A (en) * 2018-12-27 2019-05-07 北京奇安信科技有限公司 An entity identification and attribute judgment method, system, device and medium

Also Published As

Publication number Publication date
TW202221489A (en) 2022-06-01

Similar Documents

Publication Publication Date Title
McDonald et al. The Regenstrief medical record system: a quarter century experience
US9189569B2 (en) Non-transitory computer readable medium, medical record search apparatus, and medical record search method
US20070046649A1 (en) Multi-functional navigational device and method
US20070250345A1 (en) Electronic medical record system, method, and computer process for the testing, diagnosis, and treatment of sleep disorders
US20230410955A1 (en) Electronic data document for use in clinical trial verification system and method
CN110797111A (en) Device and method for establishing electronic medical record
JPH04333973A (en) Input/output control method for electronic chart
WO2007121332A2 (en) Business transaction documentation system and method
CN112560424A (en) Form generation method, device and system suitable for hospital operation electronic medical record
CN104932850A (en) Image processing apparatus and image processing method
JP6751128B2 (en) Research information management system
JP2001084317A (en) Electronic medical record processing method, electronic medical record processing system, computer-readable recording medium storing electronic medical record processing program
US20150178449A1 (en) Methods and Systems For Automated Personal Health Information Retrieval and Release
CN106650249A (en) Medical data management system
US20090132279A1 (en) Method and apparatus for significant and key image navigation
Jelacic et al. The effects of an aviation‐style computerised pre‐induction anaesthesia checklist on pre‐anaesthetic set‐up and non‐routine events
TWI765422B (en) Data capturing method, template generating method and non-transitory computer readable storage medium
KR20140034468A (en) Apparatus and method for generating electron form in medical information system
JP6525523B2 (en) INFORMATION PROCESSING APPARATUS, CONTROL METHOD, AND PROGRAM
CN111554372A (en) Medication check method and medication check device
TWI687938B (en) Apparatus and method for establishing an electronic medical record
Waegemann et al. Healthcare documentation: A report on information capture and report generation
TWM611199U (en) Data acquisition system and template generation system
CN101313326A (en) Medical record information management system and medical record information management method
JP2001344346A (en) Electronic medical record processing device having audio input