
TWI888283B - Processing method of data quality dynamic information and data processing device - Google Patents


Info

Publication number
TWI888283B
Authority
TW
Taiwan
Prior art keywords
data
measurement value
value
update
data quality
Prior art date
Application number
TW113136151A
Other languages
Chinese (zh)
Inventor
陳維超
張明淇
楊舒惠
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司
Priority to TW113136151A
Application granted
Publication of TWI888283B

Landscapes

  • Indication And Recording Devices For Special Purposes And Tariff Metering Devices (AREA)
  • General Factory Administration (AREA)

Abstract

A data processing device is provided, which includes a storage device and a data quality assessment measurement calculation module. The storage device is configured to obtain and store change data of data tables from a data center. The data quality assessment measurement calculation module is configured to calculate an updated data quality assessment measurement value and an updated data quality characteristic data according to the change data and a data quality assessment measurement reference value, record the updated data quality assessment measurement value and the updated data quality characteristic data into a data quality assessment measurement value table, and perform a notification function according to the updated data quality assessment measurement value and a data quality threshold. The data quality assessment measurement calculation module is configured to compare the updated data quality assessment measurement value with the data quality threshold to generate a comparison result. The data quality assessment measurement calculation module is configured to generate and send a notification signal to perform a notification function when the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold.

Description

Data quality dynamic information processing method and data processing device

The present invention relates to a data quality dynamic information processing method and a data processing device, and in particular to such a method and device that facilitate digital transformation research and development.

Data quality is a measure of how good the data in a data table is. It is also an important basis for judging whether a data table justifies the cost of data analysis, data mining, or digital application development. There is, however, no uniform definition of good data quality; it usually depends on the intended use of the data. For example, a data table often does not yet hold enough data to train an artificial intelligence model. In general, before starting data analysis, data mining, or digital application development, data users must check from time to time whether the data quality is good enough for development, so that the R&D cost of digital transformation is not wasted. A data center, however, usually holds data tables containing tens of thousands of records or more, and these tables are updated continuously. To start development as early as possible, data users have to download the entire contents of these large data tables to their local machines every day in order to calculate data quality values. Doing so consumes substantial computing resources of both the data users and the data center, and can easily endanger system information security. The existing technology therefore needs improvement.

To solve the above problems, the present invention provides a data quality dynamic information processing method and a data processing device that facilitate digital transformation research and development.

The present invention provides a data processing device comprising: a storage device configured to obtain from a data center, and store, change data of data tables of the data center; and a data quality assessment measurement calculation module configured to calculate an updated data quality assessment measurement value and updated data quality characteristic data according to the change data and a data quality assessment measurement reference value, to record the updated data quality assessment measurement value and the updated data quality characteristic data into a data quality assessment measurement value table, and to perform a notification function according to the updated data quality assessment measurement value and a data quality threshold; wherein the data quality assessment measurement calculation module compares the updated data quality assessment measurement value with the data quality threshold to generate a comparison result and, when the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold, generates and sends a notification signal to perform the notification function.

The present invention further provides a data quality dynamic information processing method comprising: obtaining change data from a data center and storing it; calculating, according to the change data, a data quality assessment measurement value and data quality characteristic data of the change data; calculating an updated data quality assessment measurement value and updated data quality characteristic data according to the data quality assessment measurement value and the data quality characteristic data of the change data and a data quality assessment measurement reference value; recording the updated data quality assessment measurement value and the updated data quality characteristic data into a data quality assessment measurement value table; and performing a notification function according to the updated data quality assessment measurement value and a data quality threshold, which includes comparing the updated data quality assessment measurement value with the data quality threshold to generate a comparison result and, when the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold, generating and sending a notification signal to perform the notification function.

1: Data processing system

10: Data center

2: Process

20: Data processing device

200: Data quality assessment measurement calculation module

202: Storage device

204: Data quality assessment measurement value table

206: User information table

208: User interface

S200, S202, S204, S206, S208, S210, S212: Steps

Figure 1 is a schematic diagram of a data processing system according to an embodiment of the present invention.

Figure 2 is a schematic diagram of a process according to an embodiment of the present invention.

Figure 3 is a schematic diagram of a user interface visually displaying data quality information according to an embodiment of the present invention.

Figure 4 is a schematic diagram of automatic notification of data quality information according to an embodiment of the present invention.

Please refer to Figure 1, which is a schematic diagram of a data processing system 1 according to an embodiment of the present invention. The data processing system 1 includes a data center 10 and a data processing device 20. The data center 10 stores a plurality of data tables and may include a database for storing them. The data center 10 may include multiple networked computer devices that cooperate to process, store, and share data. The data processing device 20 of this embodiment lets data users of the data center 10 automatically track and evaluate the data quality of data tables of any size without occupying the computing resources of the data center 10, which greatly benefits digital transformation research and development.

The data processing device 20 includes a data quality assessment measurement calculation module 200, a storage device 202, a data quality assessment measurement value table 204, and a user information table 206. The storage device 202 stores data. For example, the storage device 202 may store change data related to the data center 10; the change data may include addition data to be added to the data tables of the data center 10 or deletion data to be deleted from the data tables of the data center 10. The data quality assessment measurement calculation module 200 can access the data stored in the storage device 202 and can also receive and process data from external devices. The data quality assessment measurement calculation module 200 may be a main controller or processing device such as a central processing unit (CPU), a microprocessor, or a microcontroller unit (MCU), but is not limited thereto. The data quality assessment measurement calculation module 200 can read the contents of the data quality assessment measurement value table 204 and the user information table 206, and can record relevant data into both tables. The data quality assessment measurement calculation module 200 can calculate the updated data quality assessment measurement value and the updated data quality characteristic data according to the change data and the data quality assessment measurement reference value, and record them into the data quality assessment measurement value table 204. The data quality assessment measurement value table 204 holds all data quality information calculated for the data tables of the data center 10, including the data table name, the data table field name, the data change action, the data quality assessment measurement value, the data quality characteristic data, and the update time. The user information table 206 holds user settings, including the user account, the data table name, the data field name, the data quality measure name, the data quality threshold, and the terminal device (user device) to be notified.

Through the data processing device 20, this embodiment automatically and dynamically tracks and evaluates changes in the data quality of the data tables of the data center 10, without having to fetch the full data of the data center again each time in order to evaluate overall data quality. Please refer to Figure 2, which is a schematic diagram of a process 2 according to an embodiment of the present invention. Process 2 includes the following steps:

Step S200: Start.

Step S202: The storage device stores the change data.

Step S204: Calculate a data quality assessment measurement value and data quality characteristic data of the change data according to the change data.

Step S206: Calculate an updated data quality assessment measurement value and updated data quality characteristic data according to the change data and a data quality assessment measurement reference value.

Step S208: Record the updated data quality assessment measurement value and the updated data quality characteristic data into a data quality assessment measurement value table.

Step S210: Perform a notification function according to the updated data quality assessment measurement value and a data quality threshold.

Step S212: End.
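The steps above can be sketched in Python as a single update routine. This is only an illustrative outline: the function names `run_update`, `mean`, and `merge_mean`, the tuple layout of the table, and the key `"orders.amount"` are assumptions made for the example and do not come from the patent text. The mean is used as the measure only because it is the simplest case to verify by hand.

```python
def run_update(table, key, change_values, measure_fn, merge_fn, threshold, notify):
    """Apply steps S204 to S210 to one batch of change data (illustrative names)."""
    # S204: measure value and characteristic data of the change batch alone.
    m_change = measure_fn(change_values)
    c_change = len(change_values)
    # S206: combine with the reference value stored before the update.
    m_ori, c_ori = table[key]
    m_new, c_new = merge_fn(m_ori, m_change, c_ori, c_change)
    # S208: record the updated value and characteristic data.
    table[key] = (m_new, c_new)
    # S210: notify when the updated value reaches the threshold.
    if m_new >= threshold:
        notify(key, m_new)
    return m_new, c_new


def mean(values):
    return sum(values) / len(values)


def merge_mean(m_ori, m_change, c_ori, c_change):
    # Count-weighted average of the old mean and the change-batch mean.
    c_new = c_ori + c_change
    return (c_ori * m_ori + c_change * m_change) / c_new, c_new


# Hypothetical table 204 entry: mean 2.0 computed over 3 records.
table = {"orders.amount": (2.0, 3)}
sent = []
result = run_update(table, "orders.amount", [6.0], mean, merge_mean,
                    threshold=3.0, notify=lambda k, m: sent.append((k, m)))
```

Only the change batch and the stored reference value are touched; the full data table is never read again, which is the point of the process.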

According to process 2, in step S202 the data center 10 may notify the data processing device 20 of data change information (additions, deletions, and updates) for all of its data tables. After the data processing device 20 receives the data change information from the data center 10, the storage device 202 may store the change data together with the data change information. The change data includes addition data to be added to a plurality of data tables of the data center 10 or deletion data to be deleted from a plurality of data tables of the data center 10. In one example, the data center 10 transmits the data change information, together with the change data containing the addition data or deletion data, to the storage device 202 for storage. In another example, the storage device 202 accesses the data center 10 according to the received data change information, obtains from the data center 10 the change data containing the addition data or deletion data, and stores it.

In step S204, the data quality assessment measurement calculation module 200 reads the change data stored in the storage device 202 and calculates, from the change data, a data quality assessment measurement value and data quality characteristic data of the change data. For example, when the data quality assessment measurement value includes a mean measurement value, the module 200 calculates the mean of the change data as its data quality assessment measurement value. Likewise, when the data quality assessment measurement value includes a population variance measurement value, the module 200 calculates the population variance of the change data as its data quality assessment measurement value. In addition, after the module 200 has read the change data stored in the storage device 202, the storage device 202 may release the space occupied by that data to save storage space.
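As a minimal sketch of step S204, the mean and population variance of a change batch can be computed with Python's standard `statistics` module. The function name `change_batch_measures` and the dictionary keys are illustrative assumptions; the record count is kept as the characteristic data because the later update formulas need it.

```python
from statistics import fmean, pvariance


def change_batch_measures(values):
    """Return measure values and characteristic data (count) of a change batch."""
    return {
        "count": len(values),           # characteristic data of the batch
        "mean": fmean(values),          # mean measure value
        "variance": pvariance(values),  # population variance measure value
    }


batch = change_batch_measures([2.0, 4.0, 6.0])
```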

In step S206, the data quality assessment measurement calculation module 200 calculates an updated data quality assessment measurement value and updated data quality characteristic data according to the change data and a data quality assessment measurement reference value. The reference value includes a pre-update data quality assessment measurement value and pre-update data quality characteristic data. The data quality assessment measurement value table 204 stores the latest reference value, that is, the measurement value and characteristic data recorded before the change data is applied to the data center 10. The module 200 reads the table 204 to obtain the pre-update data quality assessment measurement value and the pre-update data quality characteristic data. The module 200 then calculates the updated data quality assessment measurement value and the updated data quality characteristic data using functions of the pre-update measurement value, the pre-update characteristic data, the measurement value of the change data, and the characteristic data of the change data. The updated measurement value and the updated characteristic data can be expressed as:

$$M_{new} = F_M(M_{Ori}, M_{change}, C_{Ori}, C_{change}), \qquad C_{new} = F_C(C_{Ori}, C_{change}) \tag{1}$$

where $M_{new}$ is the updated data quality assessment measurement value, $M_{Ori}$ is the pre-update data quality assessment measurement value, $M_{change}$ is the data quality assessment measurement value of the change data, $F_M(\cdot)$ and $F_C(\cdot)$ are functions, $C_{new}$ is the updated data quality characteristic data, $C_{Ori}$ is the pre-update data quality characteristic data, and $C_{change}$ is the data quality characteristic data of the change data.

In one embodiment, the pre-update data quality assessment measurement value includes a pre-update missing value ratio (MR) measurement value, the data quality assessment measurement value of the change data includes a missing value ratio measurement value of the change data, and the updated data quality assessment measurement value includes an updated missing value ratio measurement value. The data quality assessment measurement calculation module 200 calculates the updated missing value ratio measurement value and the updated data quality characteristic data according to the pre-update missing value ratio measurement value, the pre-update data quality characteristic data, the missing value ratio measurement value of the change data, and the data quality characteristic data of the change data. When the change data is addition data to be added to the plurality of data tables, the module 200 calculates the updated missing value ratio measurement value and the updated data quality characteristic data according to the following formula:

[Equation (2): Figure 113136151-A0305-12-0007-1]

where $MR_{new}$ is the updated missing value ratio measurement value, $MR_{Ori}$ is the pre-update missing value ratio measurement value, $MR_{change}$ is the missing value ratio measurement value of the change data, $C_{new}$ is the updated data quality characteristic data, $C_{Ori}$ is the pre-update data quality characteristic data, and $C_{change}$ is the data quality characteristic data of the change data. $N$ is the amount of data used to calculate the pre-update data quality assessment measurement value, that is, the amount of data in the data tables of the data center 10 before the change data is applied, with $C_{Ori}=N$. $k$ is the amount of data used to calculate the data quality assessment measurement value of the change data, with $C_{change}=k$.
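The formula image is not available in this text, so the following Python sketch is a reconstruction under an assumption: that the missing value ratio is the fraction of missing records, in which case the updated ratio after an insertion is the count-weighted average of the two ratios and the updated characteristic data is $N+k$. The function name is illustrative.

```python
def missing_ratio_after_insert(mr_ori, n, mr_change, k):
    """Combine a stored missing value ratio with that of k newly added records.

    Assumes the ratio is (missing records) / (all records), so the numbers of
    missing records simply add up.
    """
    c_new = n + k  # updated characteristic data (record count)
    mr_new = (n * mr_ori + k * mr_change) / c_new
    return mr_new, c_new


# 100 records with 10% missing, then 20 new records with 50% missing.
mr_new, c_new = missing_ratio_after_insert(0.1, 100, 0.5, 20)
```

In this example 10 of the original records and 10 of the added records are missing, so the updated ratio is 20/120.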

When the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module 200 calculates the updated missing value ratio measurement value and the updated data quality characteristic data according to the following formula:

[Equation (3): Figure 113136151-A0305-12-0007-2]

where $C_{Ori}=N$ and $C_{change}=k$.
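For the deletion case, a corresponding sketch subtracts the contribution of the removed records. As before, this is a reconstruction that assumes the ratio is the fraction of missing records; the function name and the guard against deleting every record are additions for the example.

```python
def missing_ratio_after_delete(mr_ori, n, mr_change, k):
    """Remove k records (with missing ratio mr_change) from n stored records."""
    if k >= n:
        raise ValueError("cannot delete all records: the ratio would be undefined")
    c_new = n - k  # updated characteristic data (record count)
    mr_new = (n * mr_ori - k * mr_change) / c_new
    return mr_new, c_new


# 120 records with 20 missing; delete 20 records of which 10 are missing.
mr_new, c_new = missing_ratio_after_delete(20.0 / 120.0, 120, 0.5, 20)
```

This reverses the insertion example: 10 missing records remain among 100.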

In one embodiment, the pre-update data quality assessment measurement value includes a pre-update mean measurement value and a pre-update population variance measurement value, the data quality assessment measurement value of the change data includes a mean measurement value and a population variance measurement value of the change data, and the updated data quality assessment measurement value includes an updated mean measurement value and an updated population variance measurement value. The data quality assessment measurement calculation module 200 calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to the pre-update mean measurement value, the pre-update population variance measurement value, the pre-update data quality characteristic data, the mean measurement value of the change data, the population variance measurement value of the change data, and the data quality characteristic data of the change data. When the change data is addition data to be added to the plurality of data tables, the module 200 calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to the following formula:

[Equation (4): Figure 113136151-A0305-12-0008-3]

where $\mu_{new}$ is the updated mean measurement value, $\mu_{Ori}$ is the pre-update mean measurement value, $\mu_{change}$ is the mean measurement value of the change data, $\sigma^2_{new}$ is the updated population variance measurement value, $\sigma^2_{Ori}$ is the pre-update population variance measurement value, $\sigma^2_{change}$ is the population variance measurement value of the change data, $C_{new}$ is the updated data quality characteristic data, $C_{Ori}$ is the pre-update data quality characteristic data, and $C_{change}$ is the data quality characteristic data of the change data. $N$ is the amount of data used to calculate the pre-update mean and population variance measurement values, with $C_{Ori}=N$, and $k$ is the amount of data used to calculate the mean and population variance measurement values of the change data, with $C_{change}=k$.
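Since the formula image is not reproduced here, the sketch below uses the standard identity for combining the mean and population variance of two groups; whether the patent writes the formula in exactly this form is an assumption. The function name `merge_insert` is illustrative, and the result is checked against a direct computation over the concatenated data.

```python
from statistics import fmean, pvariance


def merge_insert(mu_ori, var_ori, n, mu_change, var_change, k):
    """Combine mean and population variance of n old and k added records."""
    c_new = n + k
    mu_new = (n * mu_ori + k * mu_change) / c_new
    # Each group contributes its own variance plus the squared shift of its
    # mean relative to the combined mean.
    var_new = (n * (var_ori + (mu_ori - mu_new) ** 2)
               + k * (var_change + (mu_change - mu_new) ** 2)) / c_new
    return mu_new, var_new, c_new


old, added = [1.0, 2.0, 3.0, 4.0], [10.0, 20.0]
mu, var, c = merge_insert(fmean(old), pvariance(old), len(old),
                          fmean(added), pvariance(added), len(added))
```

The inputs are only summary values (mean, variance, count), so the old records themselves are not needed.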

When the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module 200 calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to the following formula:

[Equation (5): Figure 113136151-A0305-12-0009-4]

where $C_{change}=k$.
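A deletion can be handled by subtracting the removed group's sums instead of adding them. This sketch works through the first and second moments, which is one common way to do it and is assumed rather than taken from the missing formula image; `merge_delete` is an illustrative name.

```python
from statistics import fmean, pvariance


def merge_delete(mu_ori, var_ori, n, mu_change, var_change, k):
    """Remove k records from n stored records using only summary values."""
    if k >= n:
        raise ValueError("cannot delete all records")
    c_new = n - k
    mu_new = (n * mu_ori - k * mu_change) / c_new
    # Second moment E[x^2] = variance + mean^2, so the sums of squares subtract.
    second_moment = (n * (var_ori + mu_ori ** 2)
                     - k * (var_change + mu_change ** 2)) / c_new
    var_new = max(second_moment - mu_new ** 2, 0.0)  # clamp rounding noise
    return mu_new, var_new, c_new


full, removed = [1.0, 2.0, 3.0, 4.0, 10.0, 20.0], [10.0, 20.0]
mu, var, c = merge_delete(fmean(full), pvariance(full), len(full),
                          fmean(removed), pvariance(removed), len(removed))
```

Subtracting moments can lose precision when the remaining variance is very small compared with the means, which is why the result is clamped at zero.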

In one embodiment, the pre-update data quality assessment measurement value includes a pre-update population correlation coefficient measurement value, the data quality assessment measurement value of the change data includes a population correlation coefficient measurement value of the change data, and the updated data quality assessment measurement value includes an updated population correlation coefficient measurement value. The data quality assessment measurement calculation module 200 calculates the updated population correlation coefficient measurement value and the updated data quality characteristic data according to the pre-update population correlation coefficient measurement value, the pre-update data quality characteristic data, the population correlation coefficient measurement value of the change data, and the data quality characteristic data of the change data. When the change data is addition data to be added to the plurality of data tables, the module 200 calculates the updated population correlation coefficient measurement value and the updated data quality characteristic data according to the following formula:

[Equation (6): Figure 113136151-A0305-12-0010-5]

where $\rho_{new}$ is the updated population correlation coefficient measurement value, $\rho_{Ori}$ is the pre-update population correlation coefficient measurement value, $\rho_{change}$ is the population correlation coefficient measurement value of the change data, $C_{new}$ is the updated data quality characteristic data, $x$ and $y$ are variables (representing different data groups), $\sigma_{x\_new}$ and $\sigma_{y\_new}$ are the updated standard deviations of the variables, $\sigma_{x\_Ori}$ and $\sigma_{y\_Ori}$ are the pre-update standard deviations of the variables, $\sigma_{x\_change}$ and $\sigma_{y\_change}$ are the standard deviations of the variables in the change data, $\mu_{y\_new}$ is the updated mean measurement value of the variable, $\mu_{x\_Ori}$ and $\mu_{y\_Ori}$ are the pre-update mean measurement values of the variables, and $\mu_{x\_change}$ and $\mu_{y\_change}$ are the mean measurement values of the variables in the change data, with $C_{new}=(N+k,\ \mu_{new},\ \sigma^2_{new})$.

When the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module 200 calculates the updated population correlation coefficient and the updated data quality characteristic data according to the following formula:

[Equation (7): Figure 113136151-A0305-12-0010-6]

where $C_{new}=(N+k,\ \mu_{new},\ \sigma^2_{new})$.
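The correlation update can be sketched by converting each group's summary values back into sums, adding them for an insertion or subtracting them for a deletion, and then recomputing the coefficient. This is a general technique offered as an assumption about how the formulas behave, since the formula images are not available; the class `PairStats` and the helper names are hypothetical. The sketch also assumes non-zero variances and uses a record count of $N-k$ after a deletion.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class PairStats:
    n: int
    mean_x: float
    mean_y: float
    var_x: float
    var_y: float
    rho: float

    def sums(self):
        """Recover n, sum x, sum y, sum x^2, sum y^2, sum xy from the summary."""
        cov = self.rho * sqrt(self.var_x * self.var_y)
        return (self.n, self.n * self.mean_x, self.n * self.mean_y,
                self.n * (self.var_x + self.mean_x ** 2),
                self.n * (self.var_y + self.mean_y ** 2),
                self.n * (cov + self.mean_x * self.mean_y))


def from_sums(n, sx, sy, sxx, syy, sxy):
    mx, my = sx / n, sy / n
    vx, vy = sxx / n - mx * mx, syy / n - my * my
    rho = (sxy / n - mx * my) / sqrt(vx * vy)  # not guarded against zero variance
    return PairStats(n, mx, my, vx, vy, rho)


def from_pairs(xs, ys):
    return from_sums(len(xs), sum(xs), sum(ys), sum(x * x for x in xs),
                     sum(y * y for y in ys), sum(x * y for x, y in zip(xs, ys)))


def combine(ori, change, delete=False):
    """Add (or, for deletions, subtract) the change group's sums."""
    sign = -1 if delete else 1
    return from_sums(*[a + sign * b for a, b in zip(ori.sums(), change.sums())])


old = from_pairs([1.0, 2.0, 3.0, 4.0], [2.0, 1.0, 4.0, 3.0])
change = from_pairs([5.0, 6.0], [7.0, 5.0])
merged = combine(old, change)
restored = combine(merged, change, delete=True)
direct = from_pairs([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], [2.0, 1.0, 4.0, 3.0, 7.0, 5.0])
```

Deleting the same group that was just added returns the original coefficient, which gives a quick consistency check for both directions.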

In one embodiment, the pre-update data quality assessment measurement value includes a pre-update unique value ratio (UR) measurement value, the data quality assessment measurement value of the change data includes a unique value ratio measurement value of the change data, and the updated data quality assessment measurement value includes an updated unique value ratio measurement value. The data quality assessment measurement calculation module 200 calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to the pre-update unique value ratio measurement value, the pre-update data quality characteristic data, the unique value ratio measurement value of the change data, and the data quality characteristic data of the change data. When the change data is addition data to be added to the plurality of data tables, the module 200 calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to the following formula:

[Equation (8): Figure 113136151-A0305-12-0011-7]

Here, UR_new denotes the updated unique value ratio measurement value, ∪ denotes the union operation, key(.) denotes the key values, # denotes counting, MR_Ori denotes the pre-update unique value ratio measurement value, MR_change denotes the unique value ratio measurement value of the change data, C_new denotes the updated data quality characteristic data, C_Ori denotes the pre-update data quality characteristic data and the pre-update unique value ratio measurement value, C_change denotes the data quality characteristic data and the unique value ratio measurement value of the change data, and N denotes the amount of data used to calculate the pre-update data quality assessment measurement value. When the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module 200 calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to the following formula:

Figure 113136151-A0305-12-0011-8

For example, suppose the pre-update unique value ratio measurement value is C_Ori = {"Ivy":2, "Ben":3}, meaning that the data contains two unique values, Ivy and Ben, with Ivy occurring 2 times and Ben occurring 3 times. Suppose further that the unique value ratio measurement value of the change data is C_change = {"Ivy":2, "Ben":1}, meaning that the change data contains the two unique values Ivy and Ben, with Ivy occurring 2 times and Ben occurring 1 time. In this case, C_Ori ∪ C_change = {"Ivy":4, "Ben":4} in formula (8), and C_Ori − C_change = {"Ben":2} in formula (9).
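The Ivy/Ben example can be reproduced with a multiset. The following is a minimal sketch, assuming that the characteristic data is a per-value occurrence count and that the unique value ratio is the number of distinct keys divided by the total row count. That ratio definition is an assumption, since formulas (8) and (9) appear here only as figures. The function names are hypothetical.

```python
from collections import Counter


def update_unique_counts(c_ori, c_change, deleting=False):
    """Return the updated per-value counts.

    Addition merges the counts (the "union" of the example);
    deletion subtracts them and drops values whose count reaches zero.
    """
    return c_ori - c_change if deleting else c_ori + c_change


def unique_value_ratio(counts):
    """Assumed definition: distinct values divided by total rows."""
    total = sum(counts.values())
    distinct = sum(1 for v in counts.values() if v > 0)
    return distinct / total if total else 0.0


c_ori = Counter({"Ivy": 2, "Ben": 3})
c_change = Counter({"Ivy": 2, "Ben": 1})

added = update_unique_counts(c_ori, c_change)                   # Ivy: 4, Ben: 4
removed = update_unique_counts(c_ori, c_change, deleting=True)  # Ben: 2
```

`Counter` subtraction discards non-positive counts, which matches the example: Ivy disappears from the result once both of its occurrences are deleted.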

In other words, the embodiment of the present invention only needs to obtain the change data and combine it with the pre-update data quality assessment measurement reference value to calculate the updated data quality assessment measurement value and characteristic data. The current data quality can therefore be known in real time without downloading the entire system data again. Furthermore, the data center only needs to provide the data processing device 20 with the change data, that is, the new data added to the data tables or the deletion data removed from the data tables, which effectively saves the computing resources of the data center.
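As a small illustration of this incremental idea, the sketch below updates a null value ratio from the stored ratio and row count plus the ratio and row count of the change data. It uses plain weighted-average arithmetic and is not taken from the patent's figures. The function name and parameters are hypothetical.

```python
def update_null_ratio(ratio_ori, n_ori, ratio_change, k, deleting=False):
    """Return (updated_ratio, updated_row_count).

    ratio_ori, n_ori: stored null ratio and row count before the change.
    ratio_change, k:  null ratio and row count of the change data.
    """
    sign = -1 if deleting else 1
    n_new = n_ori + sign * k
    if n_new <= 0:
        raise ValueError("no rows left after the update")
    # Null counts are additive, so the ratio can be rebuilt from them.
    nulls_new = ratio_ori * n_ori + sign * ratio_change * k
    return nulls_new / n_new, n_new


# 100 rows with 10% nulls, then 20 new rows of which half are null.
ratio_after_add, rows_after_add = update_null_ratio(0.1, 100, 0.5, 20)
```

Only two stored numbers and two numbers from the change data are involved, which is why the full tables need not be fetched again.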

In step S208, the data quality assessment measurement calculation module 200 may record the updated data quality assessment measurement value and the updated data quality characteristic data in the data quality assessment measurement value table 204, so that they can serve as the pre-change data quality assessment measurement information when the data tables of the data center 10 later undergo further data changes. In addition, the data processing system 1 further includes a user interface, which provides a visualization service for querying the data quality assessment measurement value table 204. For example, refer to FIG. 3, which is a schematic diagram of the user interface visually displaying data quality information according to an embodiment of the present invention. As shown in FIG. 3, the user interface graphically displays the row count, consistency, outliers, missing values, and redundant values of the data tables related to the data center 10. The user interface may also provide a settings interface for related functions (such as the automatic notification function), allowing the user to enter the required settings.

In step S210, the data quality assessment measurement calculation module 200 may determine whether to perform a notification function according to the updated data quality assessment measurement value and a data quality threshold. The data quality assessment measurement calculation module 200 may compare the updated data quality assessment measurement value with the data quality threshold to generate a comparison result. When the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold, the data quality assessment measurement calculation module 200 may generate and send a notification signal to a user device (the target device can be set in advance) to inform the user that the data quality has reached the expected level. For example, refer to FIG. 4, which is a schematic diagram of automatic notification of data quality information according to an embodiment of the present invention. As shown in FIG. 4, the data processing system 1 provides a user interface 208 through which the user enters the desired data quality threshold. After the user enters the data quality threshold into the user information table 206 through the user interface 208, the data quality assessment measurement calculation module 200 may access the user information table 206 to obtain the data quality threshold and the information of the user device to be notified. The data quality assessment measurement calculation module 200 may access the data quality assessment measurement value table 204 to obtain the updated data quality assessment measurement value. The data quality assessment measurement calculation module 200 then compares the updated data quality assessment measurement value with the data quality threshold to generate a comparison result. When the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold, the data quality assessment measurement calculation module 200 may generate and send a notification signal to a user device UE, thereby performing the notification function, to inform the administrator, engineer, or user of the data center that the data quality of the data center has reached the expected level and that the related research and development work can start immediately. For example, the data quality assessment measurement calculation module 200 may use communication software, such as e-mail or Teams, to notify the data user of the user device UE that the current data quality assessment measurement value has reached or exceeded the data quality threshold. In this way, the data user can start data analysis, data mining, or the development of digital applications as early as possible.
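The comparison in step S210 can be sketched as follows. This is a minimal illustration, not the module's actual implementation. The function name `check_and_notify` and the `send` callback, which stands in for whatever e-mail or chat integration is used, are hypothetical.

```python
from typing import Callable


def check_and_notify(updated_value: float,
                     threshold: float,
                     send: Callable[[str], None]) -> bool:
    """Compare the updated measurement with the threshold and notify if reached.

    Returns the comparison result: True when updated_value >= threshold.
    """
    reached = updated_value >= threshold
    if reached:
        send(f"Data quality measurement {updated_value} has reached "
             f"the threshold {threshold}.")
    return reached


# Usage: collect notifications in a list instead of sending real messages.
outbox = []
check_and_notify(0.95, 0.90, outbox.append)
```

Passing the sender as a callback keeps the threshold check independent of the notification channel, so the target device or channel can be configured separately.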

Therefore, after the user enters the data quality threshold into the user information table 206, the data processing device 20 of the embodiment of the present invention calculates the latest data quality assessment measurement value whenever the data in the data center 10 changes, and reads the data quality threshold from the user information table 206 to evaluate whether the updated data quality assessment measurement value has reached the threshold. Once the data quality reaches the level set by the user, the notification function is performed to notify the preset user device. This realizes a customized automatic notification function, so the user does not need to monitor changes in data quality continuously.

A person of ordinary skill in the art may combine, modify, or vary the embodiments described above in accordance with the spirit of the present invention, which is not limited thereto. All of the above statements, steps, and/or processes (including suggested steps) may be implemented by hardware, software, firmware (that is, a combination of a hardware device and computer instructions, where the data in the hardware device is read-only software data), an electronic system, or a combination of the above. The device may be the data processing system 1. The hardware may include analog, digital, and mixed circuits (that is, microcircuits, microchips, or silicon chips). For example, the hardware may be an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA), a programmable logic device, coupled hardware components, or a combination of the above. In other embodiments, the hardware may include a general-purpose processor, a microprocessor, a controller, a digital signal processor (DSP), or a combination of the above. The software may be a combination of program code, a combination of instructions, and/or a combination of functions stored in a storage device, such as a computer-readable recording medium or a non-transitory computer-readable medium. For example, the computer-readable recording medium may include read-only memory (ROM), flash memory, random-access memory (RAM), a subscriber identity module (SIM), a hard disk, a floppy disk, or an optical disc (CD-ROM/DVD-ROM/BD-ROM), but is not limited thereto. The data processing system 1 of the embodiment of the present invention may include a processing circuit and the storage device 202. The process steps and embodiments of the present invention may be compiled into program code or instructions and stored in the storage device 202. The processing circuit may read and execute the program code or instructions stored in the storage device 202 to implement all of the aforementioned steps and functions.

In summary, the embodiments of the present invention provide a data processing device that enables data users of a data center to automatically track and assess data quality for data tables of any size. The embodiments only need to obtain the change data to calculate the current data quality assessment measurement value, without downloading the entire system data again. Once the data quality reaches the level set by the user, the notification function is performed to notify the preset user device, realizing a customized automatic notification function so that the user does not need to monitor changes in data quality continuously. Meanwhile, the data center only needs to provide the data processing device with the change data, that is, the new data added to the data tables or the deletion data removed from the data tables, which effectively saves the computing resources of the data center and greatly benefits research and development work for digital transformation.

The above describes only the preferred embodiments of the present invention. All equivalent changes and modifications made within the scope of the claims of the present invention shall fall within the scope of the present invention.

2: Process

S200, S202, S204, S206, S208, S210, S212: Steps

Claims (8)

A data processing device, comprising: a storage device configured to obtain, from a data center, and store change data of data tables of the data center; and a data quality assessment measurement calculation module configured to calculate an updated data quality assessment measurement value and updated data quality characteristic data according to a data quality assessment measurement value of the change data, data quality characteristic data of the change data, a pre-update data quality assessment measurement value of a data quality assessment measurement reference value, and pre-update data quality characteristic data of the data quality assessment measurement reference value, to record the updated data quality assessment measurement value and the updated data quality characteristic data in a data quality assessment measurement value table, and to perform a notification function according to the updated data quality assessment measurement value and a data quality threshold; wherein the data quality assessment measurement calculation module compares the updated data quality assessment measurement value with the data quality threshold to generate a comparison result, and generates and sends a notification signal to perform the notification function when the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold; wherein the change data comprises new data to be added to a plurality of data tables or deletion data to be deleted from the plurality of data tables, the pre-update data quality assessment measurement value comprises a pre-update null value ratio measurement value, the data quality assessment measurement value of the change data comprises a null value ratio measurement value of the change data, the updated data quality assessment measurement value comprises an updated null value ratio measurement value, and the data quality assessment measurement calculation module calculates the updated null value ratio measurement value and the updated data quality characteristic data according to the pre-update null value ratio measurement value, the pre-update data quality characteristic data, the null value ratio measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the data quality assessment measurement calculation module calculates the updated null value ratio measurement value and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols respectively denote the updated null value ratio measurement value, the pre-update null value ratio measurement value, the null value ratio measurement value of the change data, the updated data quality characteristic data, the pre-update data quality characteristic data, and the data quality characteristic data of the change data, N denotes the amount of data used to calculate the pre-update data quality assessment measurement value, and k denotes the amount of data used to calculate the data quality assessment measurement value of the change data; and wherein, when the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module calculates the updated null value ratio measurement value and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols respectively denote the updated null value ratio measurement value, the pre-update null value ratio measurement value, the null value ratio measurement value of the change data, the updated data quality characteristic data, the pre-update data quality characteristic data, and the data quality characteristic data of the change data, N denotes the amount of data used to calculate the pre-update data quality assessment measurement value, and K denotes the amount of data used to calculate the data quality assessment measurement value of the change data.
The data processing device of claim 1, wherein the change data comprises new data to be added to the plurality of data tables or deletion data to be deleted from the plurality of data tables, the pre-update data quality assessment measurement value comprises a pre-update mean measurement value and a pre-update population variance measurement value, the data quality assessment measurement value of the change data comprises a mean measurement value and a population variance measurement value of the change data, the updated data quality assessment measurement value comprises an updated mean measurement value and an updated population variance measurement value, and the data quality assessment measurement calculation module calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to the pre-update mean measurement value, the pre-update population variance measurement value, the pre-update data quality characteristic data, the mean measurement value of the change data, the population variance measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the data quality assessment measurement calculation module calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols respectively denote the updated mean measurement value, the pre-update mean measurement value, the mean measurement value of the change data, the updated population variance measurement value, the pre-update population variance measurement value, the population variance measurement value of the change data, the updated data quality characteristic data, the pre-update data quality characteristic data, and the data quality characteristic data of the change data, N denotes the amount of data used to calculate the pre-update mean measurement value and the pre-update population variance measurement value, and k denotes the amount of data used to calculate the mean measurement value and the population variance measurement value of the change data; and wherein, when the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module calculates the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols respectively denote the updated mean measurement value, the pre-update mean measurement value, the mean measurement value of the change data, the updated population variance measurement value, the pre-update population variance measurement value, the population variance measurement value of the change data, the updated data quality characteristic data, the pre-update data quality characteristic data, and the data quality characteristic data of the change data, N denotes the amount of data used to calculate the pre-update mean measurement value and the pre-update population variance measurement value, and k denotes the amount of data used to calculate the mean measurement value and the population variance measurement value of the change data.
The data processing device of claim 1, wherein the change data comprises new data to be added to the plurality of data tables or deletion data to be deleted from the plurality of data tables, the pre-update data quality assessment measurement value comprises a pre-update population correlation coefficient measurement value, the data quality assessment measurement value of the change data comprises a population correlation coefficient measurement value of the change data, the updated data quality assessment measurement value comprises an updated population correlation coefficient measurement value, and the data quality assessment measurement calculation module calculates the updated population correlation coefficient measurement value and the updated data quality characteristic data according to the pre-update population correlation coefficient measurement value, the pre-update data quality characteristic data, the population correlation coefficient measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the data quality assessment measurement calculation module calculates the updated population correlation coefficient measurement value and the updated data quality characteristic data according to a formula [formula image not reproduced], in which ρ_new denotes the updated population correlation coefficient measurement value, ρ_Ori denotes the pre-update population correlation coefficient measurement value, ρ_change denotes the population correlation coefficient measurement value of the change data, C_new denotes the updated data quality characteristic data, x and y denote variables, σ_x_new and σ_y_new denote the updated standard deviations of the variables, σ_x_Ori and σ_y_Ori denote the pre-update standard deviations of the variables, σ_x_change and σ_y_change denote the standard deviations of the variables in the change data, μ_y_new denotes the updated mean measurement value of a variable, μ_x_Ori and μ_y_Ori denote the pre-update mean measurement values of the variables, and μ_x_change and μ_y_change denote the mean measurement values of the variables in the change data, where C_new = (N+k, μ_new, σ²_new); and wherein, when the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module calculates the updated population correlation coefficient and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols have the same meanings as above, where C_new = (N+k, μ_new, σ²_new).
The data processing device of claim 1, wherein the change data comprises new data to be added to the plurality of data tables or deletion data to be deleted from the plurality of data tables, the pre-update data quality assessment measurement value comprises a pre-update unique value ratio measurement value, the data quality assessment measurement value of the change data comprises a unique value ratio measurement value of the change data, the updated data quality assessment measurement value comprises an updated unique value ratio measurement value, and the data quality assessment measurement calculation module calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to the pre-update unique value ratio measurement value, the pre-update data quality characteristic data, the unique value ratio measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the data quality assessment measurement calculation module calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to a formula [formula image not reproduced], in which UR_new denotes the updated unique value ratio measurement value, ∪ denotes the union operation, key(.) denotes the key values, # denotes counting, MR_Ori denotes the pre-update unique value ratio measurement value, MR_change denotes the unique value ratio measurement value of the change data, C_new denotes the updated data quality characteristic data, C_Ori denotes the pre-update data quality characteristic data and the pre-update unique value ratio measurement value, C_change denotes the data quality characteristic data and the unique value ratio measurement value of the change data, and N denotes the amount of data used to calculate the pre-update data quality assessment measurement value; and wherein, when the change data is deletion data to be deleted from the plurality of data tables, the data quality assessment measurement calculation module calculates the updated unique value ratio measurement value and the updated data quality characteristic data according to a formula [formula image not reproduced], in which the symbols have the same meanings as above.
A data quality dynamic information processing method, comprising: obtaining and storing change data from a data center; calculating, from the change data, a data quality assessment measurement value and data quality characteristic data of the change data; calculating an updated data quality assessment measurement value and updated data quality characteristic data from the data quality assessment measurement value of the change data, the data quality characteristic data of the change data, a pre-update data quality assessment measurement value of a data quality assessment measurement reference value, and pre-update data quality characteristic data of the data quality assessment measurement reference value; recording the updated data quality assessment measurement value and the updated data quality characteristic data into a data quality assessment measurement value table; and performing a notification function according to the updated data quality assessment measurement value and a data quality threshold, including comparing the updated data quality assessment measurement value with the data quality threshold to generate a comparison result, and generating and sending a notification signal to perform the notification function when the comparison result indicates that the updated data quality assessment measurement value is greater than or equal to the data quality threshold; wherein the change data includes new data to be added to a plurality of data tables or deleted data to be deleted from the plurality of data tables; wherein the step of calculating the updated data quality assessment measurement value and the updated data quality characteristic data includes: calculating an updated null value ratio measurement value of the updated data quality assessment measurement value, and the updated data quality characteristic data, from a pre-update null value ratio measurement value of the pre-update data quality assessment measurement value, the pre-update data quality characteristic data, a null value ratio measurement value of the data quality assessment measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the updated null value ratio measurement value and the updated data quality characteristic data are calculated according to a formula whose terms denote: the updated null value ratio measurement value; the pre-update null value ratio measurement value; the null value ratio measurement value of the change data; the updated data quality characteristic data; the pre-update data quality characteristic data; the data quality characteristic data of the change data; N, the amount of data used to calculate the pre-update data quality assessment measurement value; and k, the amount of data used to calculate the data quality assessment measurement value of the change data; and wherein, when the change data is deleted data to be deleted from the plurality of data tables, the updated null value ratio measurement value and the updated data quality characteristic data are calculated according to a corresponding formula whose terms have the same meanings, with N denoting the amount of data used to calculate the pre-update data quality assessment measurement value and K denoting the amount of data used to calculate the data quality assessment measurement value of the change data.
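The incremental null value ratio update and the threshold notification described in this claim can be illustrated with a short sketch. The patent's own formula is not reproduced in this text, so the weighted-count form below is an assumption that follows from the definitions of N and k (or K); the function names `update_null_ratio` and `should_notify` are hypothetical.

```python
def update_null_ratio(r_old, n_old, r_chg, k, deleted=False):
    """Combine a stored null ratio with the null ratio of a change batch.

    r_old, n_old: null ratio and row count before the update (N in the claim).
    r_chg, k:     null ratio and row count of the change data (k or K).
    deleted:      True when the change data is being removed from the tables.
    Assumed form: nulls are counted as ratio * rows, then added or subtracted.
    """
    sign = -1 if deleted else 1
    n_new = n_old + sign * k
    if n_new < 0:
        raise ValueError("cannot delete more rows than exist")
    if n_new == 0:
        return 0.0, 0  # no rows left; ratio defined as 0 for this sketch
    nulls_new = r_old * n_old + sign * r_chg * k
    return nulls_new / n_new, n_new


def should_notify(updated_value, threshold):
    """The claim sends a notification when the value is >= the threshold."""
    return updated_value >= threshold


# Adding 20 rows (half of them null) to 100 rows with a 10% null ratio.
ratio, rows = update_null_ratio(0.1, 100, 0.5, 20)
alert = should_notify(ratio, threshold=0.15)
```

Only the stored ratio and the row count are needed here, so the full tables never have to be rescanned when a change batch arrives.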
The data quality dynamic information processing method of claim 5, wherein the change data includes new data to be added to the plurality of data tables or deleted data to be deleted from the plurality of data tables; the pre-update data quality assessment measurement value includes a pre-update mean measurement value and a pre-update population variance measurement value; the data quality assessment measurement value of the change data includes a mean measurement value and a population variance measurement value of the change data; the updated data quality assessment measurement value includes an updated mean measurement value and an updated population variance measurement value; and the method further comprises: calculating the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data from the pre-update mean measurement value, the pre-update population variance measurement value, the pre-update data quality characteristic data, the mean measurement value of the change data, the population variance measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data are calculated according to a formula whose terms denote: the updated mean measurement value; the pre-update mean measurement value; the mean measurement value of the change data; the updated population variance measurement value; the pre-update population variance measurement value; the population variance measurement value of the change data; the updated data quality characteristic data; the pre-update data quality characteristic data; the data quality characteristic data of the change data; N, the amount of data used to calculate the pre-update mean measurement value and the pre-update population variance measurement value; and k, the amount of data used to calculate the mean measurement value and the population variance measurement value of the change data; and wherein, when the change data is deleted data to be deleted from the plurality of data tables, the updated mean measurement value, the updated population variance measurement value, and the updated data quality characteristic data are calculated according to a corresponding formula whose terms have the same meanings as above.
The data quality dynamic information processing method of claim 5, wherein the change data includes new data to be added to the plurality of data tables or deleted data to be deleted from the plurality of data tables; the pre-update data quality assessment measurement value includes a pre-update population correlation coefficient measurement value; the data quality assessment measurement value of the change data includes a population correlation coefficient measurement value of the change data; the updated data quality assessment measurement value includes an updated population correlation coefficient measurement value; and the method further comprises: calculating the updated population correlation coefficient measurement value and the updated data quality characteristic data from the pre-update population correlation coefficient measurement value, the pre-update data quality characteristic data, the population correlation coefficient measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the updated population correlation coefficient measurement value and the updated data quality characteristic data are calculated according to a formula whose terms denote: the updated population correlation coefficient measurement value; the pre-update population correlation coefficient measurement value; the population correlation coefficient measurement value of the change data; the updated data quality characteristic data; the variables x and y; the updated standard deviation of each variable; the pre-update standard deviation of each variable; the standard deviation of each variable in the change data; the updated mean measurement value of each variable; the pre-update mean measurement value of each variable; and the mean measurement value of each variable in the change data; and wherein, when the change data is deleted data to be deleted from the plurality of data tables, the updated population correlation coefficient measurement value and the updated data quality characteristic data are calculated according to a corresponding formula whose terms have the same meanings as above.
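The mean, population variance, and population correlation coefficient updates in the two claims above can be sketched with the standard moment-combination identities. This is an illustration under stated assumptions, not the patent's own formula, which is not reproduced in this text: the `Stats` container and the `from_rows` and `combine` functions are hypothetical names, and the per-variable statistics are assumed to be stored alongside the row count.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Stats:
    n: int          # number of rows summarized (N or k in the claims)
    mean_x: float
    mean_y: float
    var_x: float    # population variance of x
    var_y: float    # population variance of y
    rho: float      # population correlation coefficient of x and y


def from_rows(xs, ys):
    """Compute Stats directly from raw rows (used here only for checking)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    vx = sum((x - mx) ** 2 for x in xs) / n
    vy = sum((y - my) ** 2 for y in ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    rho = cov / sqrt(vx * vy) if vx > 0 and vy > 0 else 0.0
    return Stats(n, mx, my, vx, vy, rho)


def _raw_moments(s):
    """Recover E[x^2], E[y^2], E[xy] from the stored summary."""
    exx = s.var_x + s.mean_x ** 2
    eyy = s.var_y + s.mean_y ** 2
    exy = s.rho * sqrt(s.var_x * s.var_y) + s.mean_x * s.mean_y
    return exx, eyy, exy


def combine(old, chg, deleted=False):
    """Merge change-data statistics into (or remove them from) old ones."""
    sign = -1 if deleted else 1
    n = old.n + sign * chg.n
    if n <= 0:
        raise ValueError("no rows would remain after the update")

    def mix(a, b):
        # Row-count-weighted combination of an old and a change-data moment.
        return (old.n * a + sign * chg.n * b) / n

    mx, my = mix(old.mean_x, chg.mean_x), mix(old.mean_y, chg.mean_y)
    oxx, oyy, oxy = _raw_moments(old)
    cxx, cyy, cxy = _raw_moments(chg)
    vx = mix(oxx, cxx) - mx ** 2
    vy = mix(oyy, cyy) - my ** 2
    cov = mix(oxy, cxy) - mx * my
    rho = cov / sqrt(vx * vy) if vx > 0 and vy > 0 else 0.0
    return Stats(n, mx, my, vx, vy, rho)
```

For example, `combine(from_rows([1, 2, 3, 4], [2, 1, 4, 3]), from_rows([5, 6], [6, 5]))` should agree, up to floating-point rounding, with `from_rows` applied to all six rows, and passing `deleted=True` reverses the merge. Subtracting squared means can lose precision when the variance is small relative to the mean, so a production version might prefer a numerically stabler formulation.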
The data quality dynamic information processing method of claim 5, wherein the change data includes new data to be added to the plurality of data tables or deleted data to be deleted from the plurality of data tables; the pre-update data quality assessment measurement value includes a pre-update unique value ratio measurement value; the data quality assessment measurement value of the change data includes a unique value ratio measurement value of the change data; the updated data quality assessment measurement value includes an updated unique value ratio measurement value; and the method further comprises: calculating the updated unique value ratio measurement value and the updated data quality characteristic data from the pre-update unique value ratio measurement value, the pre-update data quality characteristic data, the unique value ratio measurement value of the change data, and the data quality characteristic data of the change data; wherein, when the change data is new data to be added to the plurality of data tables, the updated unique value ratio measurement value and the updated data quality characteristic data are calculated according to a formula whose terms denote: the updated unique value ratio measurement value; a union operation; key(.), a key value; #, a count; the pre-update unique value ratio measurement value; the unique value ratio measurement value of the change data; the updated data quality characteristic data; the pre-update data quality characteristic data together with the pre-update unique value ratio measurement value; the data quality characteristic data of the change data together with its unique value ratio measurement value; and N, the amount of data used to calculate the pre-update data quality assessment measurement value; and wherein, when the change data is deleted data to be deleted from the plurality of data tables, the updated unique value ratio measurement value and the updated data quality characteristic data are calculated according to a corresponding formula whose terms have the same meanings as above.
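The unique value ratio update, which the claim describes in terms of a union of key sets and a count, can be sketched as follows. The patent's formula is not reproduced in this text, so several points are assumptions: the ratio is taken to be the number of distinct keys divided by the number of rows, the data quality characteristic data is modeled as a per-key occurrence count so that deletions can be handled, and `update_unique_ratio` is a hypothetical name.

```python
from collections import Counter


def update_unique_ratio(key_counts, n_old, changed_keys, deleted=False):
    """Update a distinct-key ratio after adding or deleting rows.

    key_counts:   mapping of key -> occurrences before the update (assumed
                  to play the role of the data quality characteristic data).
    n_old:        number of rows before the update (N in the claim).
    changed_keys: list of keys of the rows being added or deleted.
    Returns (ratio, updated_counts, n_new).
    """
    counts = Counter(key_counts)
    if deleted:
        counts.subtract(changed_keys)
        if any(c < 0 for c in counts.values()):
            raise ValueError("deleting a key that is not present")
        counts = +counts  # unary plus drops keys whose count fell to zero
        n_new = n_old - len(changed_keys)
    else:
        # Adding rows: the distinct-key set becomes the union of both sets.
        counts.update(changed_keys)
        n_new = n_old + len(changed_keys)
    ratio = len(counts) / n_new if n_new else 0.0
    return ratio, counts, n_new


# Three rows with keys a, a, b; then two new rows with keys b and c.
ratio, counts, rows = update_unique_ratio({"a": 2, "b": 1}, 3, ["b", "c"])
```

Keeping counts rather than a plain set is a design choice for this sketch: a set is enough for additions, but a deletion only removes a key from the distinct set when its last occurrence disappears.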
TW113136151A 2024-09-24 2024-09-24 Processing method of data quality dynamic information and data processing device TWI888283B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW113136151A TWI888283B (en) 2024-09-24 2024-09-24 Processing method of data quality dynamic information and data processing device

Publications (1)

Publication Number Publication Date
TWI888283B true TWI888283B (en) 2025-06-21

Family

ID=97227773

Family Applications (1)

Application Number Title Priority Date Filing Date
TW113136151A TWI888283B (en) 2024-09-24 2024-09-24 Processing method of data quality dynamic information and data processing device

Country Status (1)

Country Link
TW (1) TWI888283B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200905497A (en) * 2007-07-30 2009-02-01 Chunghwa Telecom Co Ltd Quality inspection system of data warehouse
CN102855170A (en) * 2011-07-01 2013-01-02 国际商业机器公司 System and method for data quality monitoring
CN116860740A (en) * 2023-07-28 2023-10-10 北京沃东天骏信息技术有限公司 Method, device, electronic equipment and medium for monitoring data quality
US20240281419A1 (en) * 2023-02-22 2024-08-22 Confie Holding II Co. Data Visibility and Quality Management Platform
