[go: up one dir, main page]

TWM648156U - System for predicting extubation using machine learning models - Google Patents

System for predicting extubation using machine learning models Download PDF

Info

Publication number
TWM648156U
TWM648156U TW112205118U TW112205118U TWM648156U TW M648156 U TWM648156 U TW M648156U TW 112205118 U TW112205118 U TW 112205118U TW 112205118 U TW112205118 U TW 112205118U TW M648156 U TWM648156 U TW M648156U
Authority
TW
Taiwan
Prior art keywords
extubation
data
prediction
machine learning
learning model
Prior art date
Application number
TW112205118U
Other languages
Chinese (zh)
Inventor
趙文震
白鎧誌
詹明澄
吳杰亮
王敏嫻
廖建倫
洪大鈞
林彥男
楊惠喬
許瑞愷
陳倫奇
Original Assignee
臺中榮民總醫院
東海大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 臺中榮民總醫院, 東海大學 filed Critical 臺中榮民總醫院
Priority to TW112205118U priority Critical patent/TWM648156U/en
Publication of TWM648156U publication Critical patent/TWM648156U/en

Links

Landscapes

  • Testing And Monitoring For Control Systems (AREA)
  • Numerical Control (AREA)
  • Jib Cranes (AREA)

Abstract

本創作係提供一種以機器學習模型建立拔管預測之系統,其係能夠透過一機器學習模型之訓練及/或驗證得到一拔管預測模型及其所使用之關鍵特徵,而透過該拔管預測模型即時分析一病人之關鍵特徵資料,係能得到該病人之拔管可能性及其相關說明。據此,本創作所揭以機器學習模型建立拔管預測之系統係做為臨床照護者評估拔管之工具,以降低拔管後無法自主呼吸致使重新插管之可能性。This invention provides a system for establishing extubation prediction using a machine learning model, which can obtain an extubation prediction model and the key features used through the training and/or verification of a machine learning model, and through the extubation prediction The model analyzes a patient's key characteristic data in real time to obtain the patient's extubation possibility and related explanations. Accordingly, the system disclosed in this work using a machine learning model to establish extubation prediction is used as a tool for clinical caregivers to evaluate extubation, so as to reduce the possibility of reintubation due to inability to breathe spontaneously after extubation.

Description

以機器學習模型建立拔管預測之系統Establishing a system for predicting extubation using machine learning models

本創作係有關於一種拔管預測系統,特別係指一種以機器學習模型建立拔管預測之系統。 This work relates to an extubation prediction system, specifically a system that uses a machine learning model to establish extubation prediction.

按,根據統計,有約80%以上重症加護病房中病人需要靠著機械式呼吸輔助(mechanical ventilation)維持生命,而研究顯示,當病人依賴機械式呼吸輔助之時間越長,要使病人脫離機械式呼吸輔助則會難度增加,甚至演變成會長期依賴機械式呼吸輔助才能維生,或是短時間脫離呼吸器後,又很快地必須重新插管仰賴機械式呼吸輔助才能維持存活。 According to statistics, more than 80% of patients in intensive care units require mechanical ventilation to maintain their lives. Research shows that the longer a patient relies on mechanical ventilation, the harder it is to wean the patient off the mechanical ventilation. The difficulty of mechanical breathing assistance will increase, and it may even evolve into long-term dependence on mechanical breathing assistance to survive. Or, after being separated from the respirator for a short period of time, the patient must quickly be reintubated and rely on mechanical breathing assistance to survive.

為能提高病人脫離機械式呼吸輔助之成功率增加,目前許多研究是透過各類研究方式來預測病人脫離機械式呼吸輔助之時間點及成功率,例如有研究係透過整理加護病房及慢性呼吸照顧病房中病人及其臨床特徵、使用呼吸器之原因、慢性併發症、呼吸器脫離困難原因等之資訊進行統計分析,以歸納出難以脫離機械式呼吸輔助之病人特徵,惟,各個病人病情不相當且病況瞬息萬變,透過回溯性統計分析方法所得之結果將無法準確預測適合拔管之時間點。 In order to increase the success rate of patients weaning off mechanical respiratory assistance, many studies are currently using various research methods to predict the time point and success rate of patients weaning off mechanical respiratory assistance. For example, some studies are through reorganizing intensive care units and chronic respiratory care Statistical analysis was conducted on the patients in the ward and their clinical characteristics, reasons for using respirators, chronic complications, reasons for difficulty in weaning off respirators, etc., in order to summarize the characteristics of patients who are difficult to wean off mechanical respiratory assistance. However, the conditions of each patient are not the same. Moreover, the disease condition changes rapidly, and the results obtained through retrospective statistical analysis methods will not be able to accurately predict the time point suitable for extubation.

本創作之主要目的係在於提供一種以機器學習模型建立拔管預測之系統,其係能夠自動化方式即時地預測住院病人之拔管可能性,以做為臨床醫生評估病人拔管成功率之工具。 The main purpose of this creation is to provide a system for establishing extubation prediction using a machine learning model, which can automatically predict the possibility of extubation of inpatients in real time and serve as a tool for clinicians to evaluate the success rate of patient extubation.

本創作之另一目的係在於提供一種以機器學習模型建立拔管預測之系統,其係提供得到預測拔管可能性之理由或是相關說明,以能達到提升拔管可能性之可信度及可解釋性。 Another purpose of this creation is to provide a system for establishing extubation prediction using a machine learning model, which provides reasons or related explanations for predicting the possibility of extubation, so as to improve the credibility of the possibility of extubation and Interpretability.

緣是,為能達成上述目的,本創作係提供一種以機器學習模型建立拔管預測之系統,係包含有一處理裝置,用以透過一拔管預測模型分析一待測病人之一關鍵特徵資料,以產出該待測病人之拔管可能性,其中,該關鍵特徵資料係包含有該生理參數資料,以及於進行拔管預測之日的前1日或/及前2日所獲得之該意識資料、該輸入/輸出液體資料、該呼吸功能資料。 Therefore, in order to achieve the above purpose, this invention provides a system for establishing extubation prediction using a machine learning model, which includes a processing device for analyzing key characteristic data of a patient to be tested through an extubation prediction model. To generate the possibility of extubation of the patient to be tested, where the key characteristic data includes the physiological parameter data and the awareness obtained one day or/and two days before the date of extubation prediction. data, the input/output fluid data, the respiratory function data.

其中,該拔管預測模型係選自由極限梯度提升(Extreme Gradient Boosting,XGBoost)、類別梯度提升(Categorical Boosting,CatBoost)、輕量梯度提升機器(Light Gradient Boosting Machine,LightGBM)、隨機森林演算法(Random Forest,RF)及邏輯回歸(Logistic Regression,LR)所組成之群。 Among them, the extubation prediction model is selected from Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), Light Gradient Boosting Machine (LightGBM), Random Forest Algorithm ( Random Forest (RF) and logistic regression (Logistic Regression, LR).

其中,該拔管可能性係為該待測病人於該於進行拔管預測日後24小時內脫離呼吸輔助設備之可能性。 Among them, the possibility of extubation is the possibility that the patient to be tested will be separated from the respiratory assistance device within 24 hours after the predicted date of extubation.

其中,該處理裝置係具有一資料處理模組及一拔管預測模組,而該資料處理模組係用以自該待測病人之特徵資料中擷取出該關鍵特徵資料;該拔管預測模組係以該拔管預測模型分析該關鍵特徵資料。 Among them, the processing device has a data processing module and an extubation prediction module, and the data processing module is used to extract the key characteristic data from the characteristic data of the patient to be tested; the extubation prediction module The group uses the extubation prediction model to analyze the key characteristic data.

於本創作之一實施例中,以機器學習模型建立拔管預測之系統係更包含有一資料庫,其係用以收集複數病人之特徵資料,其中,該特徵資料係包含有一意識資料、一輸入/輸出液體資料、一呼吸功能資料及一生理參數資料; 該處理裝置係包含有一資料處理模組;並各該病人於住院期間係有使用一呼吸輔助設備之記錄。 In one embodiment of this invention, a system for establishing extubation prediction using a machine learning model further includes a database for collecting characteristic data of a plurality of patients, where the characteristic data includes a consciousness data, an input /Output liquid data, respiratory function data and physiological parameter data; The processing device includes a data processing module; and each patient has a record of using a respiratory assistance device during hospitalization.

於本創作之另一實施例中,該處理裝置係與該資料庫連接,而該資料處理模組係能用以對該資料庫中該些病人之該特徵資料進行一資料預處理程序,以刪除該特徵資料中超出一預設合理範圍之資料,及/或補足該特徵資料中缺失之資料。 In another embodiment of the present invention, the processing device is connected to the database, and the data processing module can be used to perform a data preprocessing procedure on the characteristic data of the patients in the database to Delete the data in the characteristic data that exceeds a preset reasonable range, and/or fill in the missing data in the characteristic data.

於本創作之另一實施例中,該拔管預測模組以該拔管預測模型分析該關鍵特徵資料後,產出該關鍵特徵資料與拔管可能率間之關連性資訊。 In another embodiment of the invention, the extubation prediction module uses the extubation prediction model to analyze the key characteristic data and generates correlation information between the key characteristic data and the extubation probability.

於本創作之次一實施例中,該處理裝置係更包含有一可視化模組,其係將該關鍵特徵資料與拔管可能率間之關連性資訊轉化為一可視化介面,例如直條圖、折線圖、SHAP力值圖、部分依賴圖(PDP圖)等。 In a second embodiment of this invention, the processing device further includes a visualization module, which converts the correlation information between the key characteristic data and the probability of extubation into a visual interface, such as a bar chart or a broken line. graph, SHAP force value graph, partial dependency graph (PDP graph), etc.

本創作之又一實施例中,該處理裝置係更包含有一模型訓練模組,其係以該些病人之該些特徵資料之至少一部訓練一機器學習模型,並得進行驗證,以產生該拔管預測模型及一關鍵特徵。 In another embodiment of the present invention, the processing device further includes a model training module, which trains a machine learning model using at least part of the characteristic data of the patients, and can perform verification to generate the machine learning model. Extubation prediction model and one key feature.

其中,該關鍵特徵係包含年紀、使用呼吸器之天數,以及進行於拔管預測之日前1天及前2天的GCS分數、尿量、注射量、營養量、RASS分數、PIP、MAP、呼吸頻率、心率。 Among them, the key features include age, number of days using a respirator, as well as GCS score, urine output, injection volume, nutrition amount, RASS score, PIP, MAP, respiratory rate 1 day and 2 days before the predicted date of extubation. Frequency, heart rate.

本創作所以機器學習模型建立拔管預測之系統的實際執行方法,包含有下列步驟:(a)取得訓練用特徵資料;(b)以機器學習模型進行訓練並得到拔管訓練模型;(c)取得待測病人之關鍵特徵資料;(d)以拔管訓練模型分析該關鍵特徵資料而得到待測病人之拔管可能性。 The actual implementation method of this creation to build a system for extubation prediction using a machine learning model includes the following steps: (a) Obtaining characteristic data for training; (b) Training with a machine learning model and obtaining an extubation training model; (c) Obtain key characteristic data of the patient to be tested; (d) analyze the key characteristic data using an extubation training model to obtain the extubation possibility of the patient to be tested.

其中,該步驟a中係更包含有一數據預處理步驟,用以除去該些訓練用特徵資料中不符合一標準者,及/或以插補方式補足該些訓練用特徵資料數量不足之部分。 Among them, step a further includes a data preprocessing step to remove those that do not meet a standard from the training feature data, and/or make up for the insufficient amount of the training feature data by interpolation.

其中,該步驟d係更包含得到各該關鍵特徵與該拔管可能率間之關連性,並將之轉化為一可視化介面。 Among them, step d further includes obtaining the correlation between each key feature and the extubation probability, and converting it into a visual interface.

10:以機器學習模型建立拔管預測之系統 10: Establish an extubation prediction system using machine learning models

20:資料庫 20:Database

30:處理裝置 30: Processing device

31:資料處理模組 31:Data processing module

32:模型訓練模組 32: Model training module

33:拔管預測模組 33: Extubation prediction module

34:可視化模組 34:Visualization module

圖1係為本創作之一實施例所揭以機器學習模型建立拔管預測之系統的示意圖。 Figure 1 is a schematic diagram of a system for establishing extubation prediction using a machine learning model according to an embodiment of the present invention.

圖2A係為本創作所揭數據分析之流程圖(一)。 Figure 2A is a flow chart (1) of the data analysis disclosed in this creation.

圖2B係為本發明所揭數據分析之流程圖(二)。 Figure 2B is a flow chart (2) of the data analysis disclosed in the present invention.

圖3A係以校正曲線分析顯示不同機器學習模型預測拔管之表現。 Figure 3A shows the performance of different machine learning models in predicting extubation using calibration curve analysis.

圖3B係以決策曲線分析顯示不同機器學習模型預測拔管之表現。 Figure 3B shows the performance of different machine learning models in predicting extubation using decision curve analysis.

圖3C係以曲線下面積顯示不同機器學習模型預測拔管之表現。 Figure 3C shows the performance of different machine learning models in predicting extubation using the area under the curve.

圖4係為顯示各主要臨床區塊對於重症監護之重要度。 Figure 4 shows the importance of each major clinical area to critical care.

圖5係以SHAP值說明於各關鍵特徵於拔管預測模型之關連性。 Figure 5 illustrates the correlation between each key feature and the extubation prediction model using SHAP values.

圖6A係為GCS影響XGBoost預測拔管概率之部分依賴圖。 Figure 6A is a partial dependence diagram of the influence of GCS on the predicted extubation probability by XGBoost.

圖6B係為RASS影響XGBoost預測拔管概率之部分依賴圖。 Figure 6B is a partial dependence diagram of RASS affecting XGBoost’s prediction of extubation probability.

圖6C係為尿量影響XGBoost預測拔管概率之部分依賴圖。 Figure 6C is a partial dependence diagram of the effect of urine volume on the predicted extubation probability by XGBoost.

圖6D係為注射量影響XGBoost預測拔管概率之部分依賴圖。 Figure 6D is a partial dependence diagram of the injection volume affecting the predicted extubation probability by XGBoost.

圖6E係為PIP影響XGBoost預測拔管概率之部分依賴圖。 Figure 6E is a partial dependence diagram of PIP affecting XGBoost’s prediction of extubation probability.

圖6F係為MAP影響XGBoost預測拔管概率之部分依賴圖。 Figure 6F is a partial dependence diagram of MAP affecting XGBoost’s prediction of extubation probability.

圖7A係為案例1藉由關鍵特徵的LIME及SHAP力值顯示關鍵特徵對於兩個不同個體之拔管預測的總體影響之結果。 Figure 7A is the result of Case 1 showing the overall impact of key features on the prediction of extubation for two different individuals through the LIME and SHAP force values of key features.

圖7B係為案例2藉由關鍵特徵的LIME及SHAP力值顯示關鍵特徵對於兩個不同個體之拔管預測的總體影響之結果。 Figure 7B is the result of Case 2 showing the overall impact of key features on the prediction of extubation for two different individuals through the LIME and SHAP force values of key features.

本創作係揭露一種以機器學習模型建立拔管預測之系統,其係能夠透過一機器學習模型之訓練及/或驗證得到一拔管預測模型及其所使用之關鍵特徵,而透過該拔管預測模型即時分析一病人之關鍵特徵資料,係能得到該病人之拔管可能性及其相關說明。據此,本創作所揭以機器學習模型建立拔管預測之系統係做為臨床照護者評估拔管之工具,以降低拔管後無法自主呼吸或是需重新插管之可能性。 This invention discloses a system that uses a machine learning model to establish extubation prediction. It can obtain an extubation prediction model and its key features through the training and/or verification of a machine learning model, and through the extubation prediction The model analyzes a patient's key characteristic data in real time to obtain the patient's extubation possibility and related explanations. Accordingly, the system disclosed in this work using a machine learning model to predict extubation is used as a tool for clinical caregivers to evaluate extubation, so as to reduce the possibility of being unable to breathe spontaneously or needing to be reintubated after extubation.

必須要加以強調者,本創作所揭露之以機器學習模型建立拔管預測之系統係納入病人意識資料及其輸入/輸出液體資料,以達到使所得到之拔管預測結果能夠更為貼近醫生於臨床上判斷之結果,意即本創作所揭以機器學習模型建立拔管預測之系統及其方法係能夠達到更為精準之拔管預測率。 It must be emphasized that the system for establishing extubation prediction using machine learning models disclosed in this work incorporates patient consciousness data and input/output fluid data, so that the obtained extubation prediction results can be closer to what the doctor expects. The result of clinical judgment means that the system and method for establishing extubation prediction based on machine learning model disclosed in this work can achieve a more accurate extubation prediction rate.

以下將說明本創作中所使用之術語,若未列於以下說明者,將依據本創作所屬技術領域且具通常知識者所認可參考資料,如辭典、字典、文獻或周知常識進行解釋。 The terminology used in this work will be explained below. If it is not listed in the description below, it will be explained based on the reference materials recognized by those with ordinary knowledge in the technical field of this work, such as dictionaries, dictionaries, literature or common knowledge.

術語「動態參數」,係指病人於住院期間定期或不定期進行檢測所得到之數據或結果。 The term "dynamic parameters" refers to the data or results obtained from regular or irregular testing of patients during their hospitalization.

術語「意識/認知區塊」,係指依據評估病人意識狀態之系統、量表或方法所得之數據,例如格拉斯哥昏迷量表(Glasgow Coma Scale,GCS)、RASS鎮靜程度評估量表(Richmond Agitation Sedation Scale,RASS)等。 The term "consciousness/cognition block" refers to data obtained based on systems, scales or methods for assessing a patient's state of consciousness, such as the Glasgow Coma Scale (GCS), RASS Sedation Assessment Scale (Richmond Agitation Sedation) Scale, RASS), etc.

術語「體液平衡區塊」,係指與進入病人體內之液體或是病人所排除之液體相關之數據,例如尿量、注射量、輸液量、營養總量。 The term "fluid balance block" refers to data related to the fluid entering the patient's body or the fluid eliminated by the patient, such as urine output, injection volume, infusion volume, and total nutrition.

術語「呼吸功能區塊,」,係指與病人呼吸功能或心肺功能相關之數據,如氣道壓峰值(PIP)、平均氣道壓(MAP)、呼吸器天數、呼吸頻率。 The term "respiratory function block," refers to data related to the patient's respiratory function or cardiopulmonary function, such as peak airway pressure (PIP), mean airway pressure (MAP), number of ventilator days, and respiratory rate.

術語「生理參數區塊」,係指病人之生理數據,如體重、身體質量指數、身高等。 The term "physiological parameter block" refers to the patient's physiological data, such as weight, body mass index, height, etc.

術語「呼吸輔助設備」或「呼吸器」,係指設置於病人身上,用以協助病人進行換氣/吸氣之外部裝置。 The term "respiratory assistance equipment" or "respirator" refers to an external device installed on a patient to assist the patient in ventilating/inhaling.

術語「模組」,係指由數個基礎功能元件所組成具有特定功能之組件,其係能夠用以組成一具完整功能的系統、裝置或程式,舉例來說,模組係得為一電子電路。 The term "module" refers to a component with specific functions composed of several basic functional components, which can be used to form a complete functional system, device or program. For example, a module can be an electronic circuit.

術語「ICU」,全名為Intensive Care Unit,為醫院中之重症加護病房,會依據科別而有隸屬於不同科室。 The term "ICU", the full name is Intensive Care Unit, is an intensive care unit in a hospital, which is affiliated to different departments according to departments.

術語「機器學習」,係會以一機器學習模型於資料中進行學習以及改善,尋找到模式與關連,並根據其學習及分析結果制訂出決策與預測。以本創作中所列舉之實例來說,該機器學習模型係包含有XGBoost(Extreme Gradient Boosting)、CatBoost(Categorical Boosting)、LightGBM(Light Gradient Boosting Machine)、隨機森林演算法(Random Forest,RF)及邏輯回歸(Logistic Regression,LR)。 The term "machine learning" refers to using a machine learning model to learn and improve in data, find patterns and relationships, and formulate decisions and predictions based on its learning and analysis results. Taking the examples listed in this creation as an example, the machine learning model includes XGBoost (Extreme Gradient Boosting), CatBoost (Categorical Boosting), LightGBM (Light Gradient Boosting Machine), Random Forest algorithm (Random Forest, RF) and Logistic Regression (LR).

術語「拔管」中之「管」,係指用以通氣或提供氣體之管路,因此,「拔管」代表脫離呼吸器或是其他呼吸輔助設備。 The "tube" in the term "extubation" refers to the tube used for ventilation or gas supply. Therefore, "extubation" means disconnecting from the respirator or other respiratory assistance equipment.

術語「APACHE II」,全名為acute physiology and chronic health evaluation II,係為一種疾病嚴重度評分系統,介於0-71分,該評分系統係藉由12項生理數據、病人年齡與健康狀態作為評分的輸入,各項數據為病人入院24小時後的最差值。 The term "APACHE II", full name acute physiology and chronic health evaluation II, is a disease severity scoring system, ranging from 0-71 points. The scoring system is based on 12 physiological data, patient age and health status. When entering the score, each data is the worst value of the patient 24 hours after admission.

術語「SOFA」,全名Sequential Organ Failure Assessment,係一種評估疾病嚴重度之系統,其係將器官分為肺、凝血、肝、心臟、神經、腎,並分別進行評估,每個器官最高分為4分,最低分為0分,於進入ICU 24小時後,每48小時會計算一次總分。 The term "SOFA", the full name is Sequential Organ Failure Assessment, is a system for assessing the severity of a disease. It divides organs into lungs, coagulation, liver, heart, nerves, and kidneys, and evaluates them separately. The highest score for each organ is 4 points, with the lowest score being 0 points. The total score will be calculated every 48 hours after 24 hours after entering the ICU.

術語「GCS(Glasgow Coma Scale)」,中文為格拉斯哥昏迷量表,其係為一昏迷指標,是昏迷指數中最被廣泛使用之一種,其評估包含有三個面向:睜眼反應、說話反應、運動反應,每個面向最高可得5分,最低為1分,當總得分月低代表昏迷程序越嚴重。 The term "GCS (Glasgow Coma Scale)" in Chinese is the Glasgow Coma Scale. It is a coma index and is one of the most widely used coma indexes. Its evaluation includes three aspects: eye opening reaction, speaking reaction, and movement. For each aspect, the maximum score is 5 points and the minimum score is 1 point. When the total score is lower, it means the coma procedure is more serious.

術語「RASS鎮靜程度評估表(Richmond Agitation-Sedation Scale)」,係用於測量病人之躁動或鎮靜程度之量表,分數由-4-+4,分別代表不同之行為狀態。 The term "RASS Sedation Scale (Richmond Agitation-Sedation Scale)" is a scale used to measure a patient's agitation or sedation level. The scores range from -4 to +4, representing different behavioral states.

術語「SHAP值」、「SHAP力值(SHAP force plot)」或「SHAP數值」,係指當一預定機器學習模型對一數據或是一組數據產生一個預測值,於數據或是該組數據中每個特徵所被分配到之數值。 The term "SHAP value", "SHAP force plot" or "SHAP value" means that when a predetermined machine learning model generates a predicted value for a data or set of data, the data or set of data The value assigned to each feature in .

術語「LIME(Local Interpretable Model-Agnostic Explanations)」,係為局部可解釋模型,其目的係在於瞭解複雜難解釋之模型,而所使用之原理係根據欲解釋之個體提供一個局部可解釋或是可理解的模型。 The term "LIME (Local Interpretable Model-Agnostic Explanations)" refers to a locally interpretable model. Its purpose is to understand complex and difficult-to-interpret models, and the principle used is to provide a locally interpretable or interpretable model based on the individual to be explained. Models of understanding.

術語「可視化界面」,係指藉由圖形化將數據或訊息以圖形之方式表現出來,以利使用者能夠更好理解數據與訊息,而所謂之圖形係包含有圖、表、顏色、符號、記號、圖標或其他視覺元素。 The term "visual interface" refers to the use of graphics to represent data or information in a graphical manner so that users can better understand the data and information. The so-called graphics include diagrams, tables, colors, symbols, Symbols, icons or other visual elements.

具體來說,請參圖1,於本創作之一實施例係提供一種以機器學習模型建立拔管預測之系統10,其包含有一資料庫20及一處理裝置30,其中:該資料庫20係為一結構化之資訊或資料集合,會以一預定方式儲存,如以電子方式存於電腦系統、硬碟或是雲端空間內等。該資料庫20收集且存 放複數病人之一特徵資料,其中,各該病人於重症加護病房住院期間係有使用一呼吸輔助設備之記錄,並該特徵資料係包含有一意識資料、一輸入/輸出液體資料、一呼吸功能資料及一生理參數資料。而該資料庫20之資料來源係得為直接輸入或是與其他外部資料庫串接後所得者。 Specifically, please refer to Figure 1. One embodiment of the present invention provides a system 10 for establishing extubation prediction using a machine learning model, which includes a database 20 and a processing device 30, wherein: the database 20 is It is a structured information or data collection that will be stored in a predetermined way, such as electronically stored in a computer system, hard drive or cloud space. The database 20 collects and stores Store characteristic data of a plurality of patients, wherein each patient has a record of using a respiratory assist device during hospitalization in the intensive care unit, and the characteristic data includes a consciousness data, an input/output fluid data, and a respiratory function data. and a physiological parameter data. The data source of the database 20 can be directly input or obtained after being connected with other external databases.

於本實施例中,該意識資料係與病人意識或認知程度評估之結果,如格拉斯哥昏迷量表所得之分數、RASS鎮靜程度評估量表所得之分數;該輸入/輸出液體資料係為一預定時間內投入病人體內之液體量,或是病人排出之液體量,例如輸液量、注射量、飲食中所攝取之液體量、尿量等;該呼吸功能資料係為與病人呼吸或/及心肺功能狀態相關之數據,例如配戴呼吸輔助設備之天數、氣道壓峰值(PIP)、平均氣道壓(MAP)、呼吸頻率;該生理參數資料係為病人生理數據,例如年紀、體重、身體質量指數等。 In this embodiment, the consciousness data is the result of assessing the patient's consciousness or cognitive level, such as the score obtained by the Glasgow Coma Scale and the score obtained by the RASS Sedation Assessment Scale; the input/output liquid data is a predetermined time The amount of fluid injected into the patient's body, or the amount of fluid excreted by the patient, such as infusion volume, injection volume, liquid intake in the diet, urine output, etc.; the respiratory function data is related to the patient's respiratory or/and cardiopulmonary function status Relevant data, such as the number of days of wearing respiratory assistance equipment, peak airway pressure (PIP), mean airway pressure (MAP), respiratory frequency; the physiological parameter data is the patient's physiological data, such as age, weight, body mass index, etc.

該處理裝置30係能夠用以一系列執行程式、指令或是對資料進行操作之硬體,如電腦、計算機處理器等。該處理裝置30係與該資料庫20連接,並具有一資料處理模組31、一模型訓練模組32、一拔管預測模組33及一可視化模組34;其中:該資料處理模組31係用以處理來自病人之特徵資料,包含有進行一資料預處理程序及一擷取特徵資料程序。具體來說,該資料預處理程序係指該資料處理模組31讀取該資料庫20中該些病人之該特徵資料後,會對該些特徵資料進行刪除或/及差補之步驟,意即該資料處理模組係接收各特徵之一預設合理範圍,並以之做為判斷標準,當有一特徵資料未落於該預設合理範圍時,則刪除該特徵資料;並檢核該特徵資料之數量是否符合一預定值,倘若不符合該預定值時,則判定該特徵資料缺失,應透過取該特徵資料之平均值補足該資料缺失的部分。而該擷取關鍵特徵程序係指該資料處理模組31係讀取一待測病人之特徵資料後,並從中擷取出與一關鍵特徵相關之數據做為一關鍵特徵資料。 The processing device 30 can be used as a series of hardware to execute programs, instructions or operate on data, such as computers, computer processors, etc. The processing device 30 is connected to the database 20 and has a data processing module 31, a model training module 32, an extubation prediction module 33 and a visualization module 34; wherein: the data processing module 31 It is used to process characteristic data from patients, including performing a data preprocessing process and a characteristic data retrieval process. Specifically, the data preprocessing procedure refers to the steps in which the data processing module 31 reads the characteristic data of the patients in the database 20 and then deletes or/and supplements the characteristic data. That is, the data processing module receives a preset reasonable range of each feature and uses it as a judgment standard. When a feature data does not fall within the preset reasonable range, the feature data is deleted; and the feature is checked. Whether the amount of data meets a predetermined value. If it does not meet the predetermined value, it is determined that the characteristic data is missing, and the missing part of the data should be filled in by taking the average of the characteristic data. The process of retrieving key features means that the data processing module 31 reads the characteristic data of a patient to be tested and extracts data related to a key feature as a key feature data.

該模型訓練模組32係接收來自該資料處理模組31處理後之該些特徵資料,並以該些特徵資料之至少一部訓練一機器學習模型,且以該些特徵資料之至少一部驗證訓練後之機器學習模型及其所得結果,以產出一拔管預測模型及用於該拔管預測訓練模型之該關鍵特徵。 The model training module 32 receives the processed feature data from the data processing module 31, trains a machine learning model with at least a part of the feature data, and verifies it with at least a part of the feature data. The trained machine learning model and its obtained results are used to produce an extubation prediction model and the key features used in the extubation prediction training model.

於本實施例中,該機器學習模型係為本創作所屬技術領域所周知之演算邏輯或演算法,例如XGBoost(Extreme Gradient Boosting)、CatBoost(Categorical Boosting)、LightGBM(Light Gradient Boosting Machine)、隨機森林演算法(Random Forest,RF)及邏輯回歸(Logistic Regression,LR)等。 In this embodiment, the machine learning model is a well-known calculation logic or algorithm in the technical field of this creation, such as XGBoost (Extreme Gradient Boosting), CatBoost (Categorical Boosting), LightGBM (Light Gradient Boosting Machine), Random Forest Algorithm (Random Forest, RF) and logistic regression (Logistic Regression, LR), etc.

於本實施例中,該關鍵特徵係有20個,包含有年齡、使用呼吸輔助設備之天數、並以進行於拔管預測之當日的前1天及前2天所測得的GCS分數、尿量、注射量、營養量、RASS分數、PIP、MAP、呼吸頻率、心率。 In this embodiment, there are 20 key features, including age, number of days of using respiratory assistance equipment, GCS scores, urine and urine measured one day and two days before the day of extubation prediction. Volume, injection volume, nutritional amount, RASS score, PIP, MAP, respiratory rate, heart rate.

該拔管預測模組33係自該模型訓練模組32取得該拔管預測模型,並以該拔管預測模型分析來自該待測病人之該關鍵特徵資料,以產出該待測病人於該於進行拔管預測之日後的24小時內之拔管可能率、以及各關鍵特徵與該拔管可能率間之關連性資訊。 The extubation prediction module 33 obtains the extubation prediction model from the model training module 32, and uses the extubation prediction model to analyze the key characteristic data from the patient to be tested to generate the patient to be tested in the The probability of extubation within 24 hours after the date of extubation prediction, and the correlation information between each key feature and the probability of extubation.

該可視化模組34係將各關鍵特徵與該拔管可能率間之關連性資訊轉化為一可視化介面,用以呈現於一顯示器或是具有顯示器之設備上,如電腦、螢幕、平版電腦、手機等。 The visualization module 34 converts the correlation information between each key feature and the extubation probability into a visual interface for presentation on a display or a device with a display, such as a computer, screen, tablet, or mobile phone. wait.

藉由上述構件之組成,本創作係揭露一種以機器學習模型建立拔管預測之系統係能以自動化方式即時地分析住院於重症照護病房之病人是否處於適合脫離呼吸輔助設備之狀態,除能以量化方式將拔管可能率供予臨床醫生做為評估拔管之參考,且更能進一步提供各關鍵特徵與拔管可能率間之關連性資訊與臨床醫生,以透過增加拔管可能率之可解釋性,達到能增加該拔管可能率之可信度的功效。 Through the composition of the above components, this invention discloses a system that uses a machine learning model to establish extubation prediction, which can automatically analyze in real time whether patients hospitalized in the intensive care unit are in a state suitable for weaning from respiratory assistance equipment. In addition to using The quantitative method provides clinicians with the possibility of extubation as a reference for evaluating extubation, and can further provide clinicians with information on the correlation between each key feature and the possibility of extubation, thereby increasing the possibility of extubation. Explanatory, achieving the effect of increasing the credibility of the possibility of extubation.

更進一步來說,本創作所揭以機器學習模型建立拔管預測之系統實際執行步驟係如下: Furthermore, the actual execution steps of the system that uses machine learning models to establish extubation prediction are as follows:

步驟101:輸入複數訓練用特徵資料,其中,該些特徵資料係來自於重症照護病房住院期間有使用呼吸輔助設備之複數病人,並該特徵資料係包含有一意識資料、一輸入/輸出液體資料、一呼吸功能資料及一生理參數資料,而該意識資料、該輸入/輸出液體資料、該呼吸功能資料及該生理參數資料之說明係與前述實施例中所述者相同,故於此不加以贅述。 Step 101: Input a plurality of characteristic data for training, wherein the characteristic data comes from a plurality of patients who used respiratory assistance equipment during hospitalization in the intensive care unit, and the characteristic data includes a consciousness data, an input/output liquid data, A respiratory function data and a physiological parameter data, and the description of the consciousness data, the input/output liquid data, the respiratory function data and the physiological parameter data are the same as those described in the previous embodiment, so they will not be described again here. .

步驟102:對該些訓練用特徵資料進行一資料預處理程序,以刪除其內不合理之特徵資料及/或補足缺失之特徵資料;以完成該資料預處理程序之該些訓練用特徵資料的至少一部訓練一機器學習模型,並得以完成該資料預處理程序之該些訓練用特徵資料的至少一部對於完成訓練之該機器學習模型進行驗證程序,以產出一拔管預測模型及一關鍵特徵。 Step 102: Perform a data preprocessing procedure on the training feature data to delete unreasonable feature data and/or fill in the missing feature data; to complete the data preprocessing procedure of the training feature data. At least one part trains a machine learning model, and at least one part of the training feature data is able to complete the data preprocessing process and performs a verification process on the machine learning model that has completed the training to produce an extubation prediction model and an extubation prediction model. Key Features.

其中,該拔管預測模型係選自由XGBoost(Extreme Gradient Boosting)、CatBoost(Categorical Boosting)、LightGBM(Light Gradient Boosting Machine)、隨機森林演算法(Random Forest,RF)及邏輯回歸(Logistic Regression,LR)所組成之群。 Among them, the extubation prediction model is selected from XGBoost (Extreme Gradient Boosting), CatBoost (Categorical Boosting), LightGBM (Light Gradient Boosting Machine), Random Forest algorithm (Random Forest, RF) and Logistic Regression (LR) The group formed.

其中,該關鍵特徵係包含年紀、使用呼吸器之天數,以及進行於拔管預測之日前1天及前2天的GCS分數、尿量、注射量、營養量、RASS分數、PIP、MAP、呼吸頻率、心率。 Among them, the key features include age, number of days using a respirator, as well as GCS score, urine output, injection volume, nutrition amount, RASS score, PIP, MAP, respiratory rate 1 day and 2 days before the predicted date of extubation. Frequency, heart rate.

步驟103:輸入一待測病人之一關鍵特徵資料,其中,該待測病人係處於施用一呼吸輔助設備之狀態,而該關鍵特徵資料係為自該待測病人所測得之該關鍵特徵的數據或數值。 Step 103: Enter key characteristic data of a patient to be tested, wherein the patient to be tested is in a state of using a respiratory assistance device, and the key characteristic data is the key characteristic measured from the patient to be tested. data or numerical value.

步驟104:以該拔管預測模型分析關鍵特徵資料,產出該待測病人於拔管預測之日後24小時內之拔管可能率,以及該關鍵特徵資料與該拔管可能 率間之關連性資訊,並得之轉化為一可視化介面,包含如顏色、圖樣、圖形、形狀、線條或上述至少任二之組合。 Step 104: Use the extubation prediction model to analyze the key characteristic data, and generate the extubation probability rate of the patient to be tested within 24 hours after the extubation prediction date, as well as the relationship between the key characteristic data and the extubation possibility. The correlation information between the rates is converted into a visual interface, including colors, patterns, graphics, shapes, lines or a combination of at least any two of the above.

以下,為能說明本創作之技術特徵所能達成之功效,將茲舉若干實例做詳細說明如後。 In the following, in order to illustrate the effects that the technical features of this invention can achieve, several examples will be given for detailed explanation.

以下實例中所有試驗皆經過台中榮民總醫院研究倫理審查委員會審查且批准,並且遵循知情後同意及匿名化之準則。 All trials in the following examples were reviewed and approved by the Research Ethics Review Board of Taichung Veterans General Hospital, and followed the guidelines of informed consent and anonymity.

實例一:受試者之資料分析 Example 1: Subject data analysis

收集2015年7月至2019年7月間入住臺中榮民總醫院重症加護病房之病人的資料,並排除未使用呼吸器之病人資料、使用呼吸器天數少於72小時之病人資料,最後篩選出使用呼吸器超過48小時的受試者,共5940位,而受試者的特徵資料之統計分析結果如表1所示,其中,表1中之數據係以平均值±標準差或數量(百分比)之方式呈現;CCI代表查爾森合併症指數(Charlson Comorbidity Index)。由表1之內容可知,於ICU期間脫離呼吸器之受試者有3657位,於ICU期間未脫離呼吸器之受試者有2283位;所使用之特徵資料共有65個;受試者之平均年齡為66.2±16.2歲,並64.0%為男性;受試者之疾病嚴重度皆明顯偏高,APACHE II分數及SOFA分數分別為25.7±6.6及8.5±3.6,並且,有61.5%之受試者於ICU住院期間拔管,不過拔管受試者與未拔管受試者於年齡、性別、CCI指數間分布相似,但未拔管者之APACHE II分數及SOFA分數係高於拔管者。 Collect the data of patients admitted to the Intensive Care Unit of Taichung Veterans General Hospital from July 2015 to July 2019, and exclude the data of patients who are not using respirators, and the data of patients who have used respirators for less than 72 hours, and finally filter out the data of patients who have used respirators for less than 72 hours. A total of 5940 subjects were exposed to respirators for more than 48 hours. The statistical analysis results of the subject's characteristic data are shown in Table 1. The data in Table 1 are expressed as mean ± standard deviation or number (percentage). Presented in a way; CCI stands for Charlson Comorbidity Index. It can be seen from the contents of Table 1 that there were 3657 subjects who were weaned from the respirator during the ICU period, and 2283 subjects who were not weaned from the respirator during the ICU period; a total of 65 characteristic data were used; the average of the subjects The age was 66.2±16.2 years old, and 64.0% were male; the disease severity of the subjects was significantly higher, the APACHE II score and SOFA score were 25.7±6.6 and 8.5±3.6 respectively, and 61.5% of the subjects The patients were extubated during ICU hospitalization. However, the distribution of age, gender, and CCI index between extubated and non-extubated subjects was similar. However, the APACHE II scores and SOFA scores of non-extubated subjects were higher than those of extubated subjects.

再根據臨床工作流程中之四個主要臨床區塊(main clinical domain),將之受試者於ICU住院期間之動態參數進行分類處理及其統計分析,結果如表2所示,其中,該四個主要臨床區塊分別為意識/認知區塊(consciousness/awareness domain)、體液平衡區塊(fluid balance domain)、呼吸功能區塊(ventilatory function domain)、生理參數區塊(physiological parameter domain),其中,意識/認知區塊包含格拉斯哥昏迷量表(Glasgow coma scale,以下簡稱GCS)及RASS鎮靜程度評 估表(Richmond Agitation Sedation Scale,以下簡稱RASS);體液平衡區塊包含給予病人之液體、尿量、營養總量;呼吸功能區塊包含有氣道壓峰值((PIP)、平均氣道壓(MAP)、呼吸器天數、呼吸頻率;生理參數區塊則包含有心率。由表2之結果可知,於ICU住院期間拔管之病人的意識持續改善、鎮靜狀態下降、心率及輸液量逐漸下降,且尿量與營養量係呈現穩定增加。 Then, according to the four main clinical domains in the clinical workflow, the dynamic parameters of the subjects during their hospitalization in the ICU were classified and analyzed statistically. The results are shown in Table 2. Among them, the four The main clinical areas are consciousness/awareness domain, fluid balance domain, ventilatory function domain, and physiological parameter domain, among which , the consciousness/cognition block includes the Glasgow coma scale (GCS) and RASS sedation assessment. Richmond Agitation Sedation Scale (hereinafter referred to as RASS); the body fluid balance block includes the fluid, urine output, and total nutrition given to the patient; the respiratory function block includes peak airway pressure (PIP) and mean airway pressure (MAP) , ventilator days, respiratory rate; the physiological parameter block includes heart rate. From the results in Table 2, it can be seen that the consciousness of extubated patients during ICU hospitalization continued to improve, the sedation state decreased, the heart rate and infusion volume gradually decreased, and the urine The amount and nutrient content showed a steady increase.

Figure 112205118-A0305-02-0013-1
Figure 112205118-A0305-02-0013-1

Figure 112205118-A0305-02-0013-2
Figure 112205118-A0305-02-0013-2
Figure 112205118-A0305-02-0014-3
Figure 112205118-A0305-02-0014-3

實例二:進行機器學習模型演算之結果 Example 2: Results of machine learning model calculation

請參圖2A,自實例一中所收集之受試者特徵資料中取關連性最高之20個特徵(以下稱為20個關鍵特徵)做為建模數據,並以不同之機器學習模型進行分析,其中,所使用之機器學習模型係包含有XGBoost(Extreme Gradient Boosting)、CatBoost(Categorical Boosting)、LightGBM(Light Gradient Boosting Machine)、隨機森林(Random Forest,簡稱RF)及邏輯回歸(Logistic Regression,簡稱LR),並且訓練/測試比例為80/20。而20個關鍵特徵係透過遞歸特徵剔除分析法(recursive feature elimination analysis)進行原始特徵數據之分析得到之結果。 Please refer to Figure 2A. From the subject characteristic data collected in Example 1, the 20 most relevant features (hereinafter referred to as the 20 key features) are taken as modeling data and analyzed with different machine learning models. , among which, the machine learning models used include XGBoost (Extreme Gradient Boosting), CatBoost (Categorical Boosting), LightGBM (Light Gradient Boosting) Machine), Random Forest (RF for short) and Logistic Regression (LR for short), and the training/testing ratio is 80/20. The 20 key features are the result of analyzing the original feature data through recursive feature elimination analysis.

請參圖2B,為能達到能夠於拔管前1天進行預測之目的,係以受試者特徵資料取得時點進行分類,意即將拔管前1天之數據做為預測視窗(prediction window),並以脫離呼吸管前兩天(及拔管前第2天及第3天)之特徵資料做為特徵視窗(feature window),因此,建模資料係包含有年紀、使用呼吸器之天數,以及分別於拔管前第2天及第3天的GCS、尿量、注射量、營養量、RASS、PIP、MAP、呼吸頻率、心率。 Please refer to Figure 2B. In order to achieve the purpose of prediction one day before extubation, classification is based on the time point when the subject characteristic data is obtained, which means that the data one day before extubation is used as the prediction window. The characteristic data two days before being separated from the breathing tube (and the second and third days before extubation) are used as the feature window (feature window). Therefore, the modeling data includes age, number of days using a respirator, and GCS, urine volume, injection volume, nutritional content, RASS, PIP, MAP, respiratory rate, and heart rate on the 2nd and 3rd days before extubation respectively.

於進行分析前,得對於實例一中所收集之受試者特徵資料進行數據預處理程序。所謂數據預處理係包含有去除異常數據及輸入缺失數據,其中,異常數據係指超出變數合理範圍之數值者,於本實例中,變數合理範圍係為醫生所設定者,舉例來說,各變數之合理範圍如下:年齡為1-100歲、配戴呼吸器的天數為1-60日、GCS為3-15、尿量為0-5000ml、注射量為0-10000ml、營養量為0-3000ml、RASS為-5-+4、PIP為0-50、MAP為10-40、呼吸頻率為0-40、心率為0-300;並於本實例中係輸入每個變數之平均值而填補缺失數據。 Before analysis, the subject characteristic data collected in Example 1 can be subjected to data preprocessing procedures. The so-called data preprocessing includes removing abnormal data and inputting missing data. Among them, abnormal data refers to values that exceed the reasonable range of variables. In this example, the reasonable range of variables is set by the doctor. For example, each variable The reasonable range is as follows: age is 1-100 years old, number of days wearing a respirator is 1-60 days, GCS is 3-15, urine output is 0-5000ml, injection volume is 0-10000ml, and nutritional amount is 0-3000ml , RASS is -5-+4, PIP is 0-50, MAP is 10-40, respiratory rate is 0-40, heart rate is 0-300; and in this example, the average value of each variable is entered to fill in the gaps. data.

又,於進行機器學習模型進行數據分析前,所有數據都進行+1~-1的標準化處理;並,為避免抽樣偏差之產生,於拔管受試者中使用兩組數據,其中一組為拔管前1天的數據,另一組為隨機數據,並從未拔管受試者中隨機挑選五組數據;拔管受試者之特徵數據與未拔管受試者之特徵數據的比例為1:3.4。 In addition, before conducting data analysis with the machine learning model, all data were standardized from +1 to -1; and, in order to avoid the occurrence of sampling bias, two sets of data were used in extubated subjects, one of which was The data of the day before extubation, the other group is random data, and five groups of data are randomly selected from the subjects who have not been extubated; the ratio of the characteristic data of the extubated subjects to the characteristic data of the subjects who have not been extubated is 1:3.4.

各機器學習模型之分析結果如圖3及表3所示,其中,表3中之準確度係以下列公式算出:(TP+TN)/(TP+FN+TN+FP)。 The analysis results of each machine learning model are shown in Figure 3 and Table 3. The accuracy in Table 3 is calculated according to the following formula: (TP+TN)/(TP+FN+TN+FP).

表3:各機器學習模型進行拔管預測之表現指標

Figure 112205118-A0305-02-0016-4
Table 3: Performance indicators of each machine learning model for extubation prediction
Figure 112205118-A0305-02-0016-4

由圖3及表3之結果可知,相對於LR之較低準確率,其他四個機器學習模型之準確率皆為高,具體來說,XGBoost、LightGBM、CatBoost及RF之AUC分別為0.921、0.921、0.920及0.918;並且由圖3B可知各機器學習模型之預測值與實際觀察值間具有良好一致性,又以XGBoost之一致性最佳:又由圖3C之結果可知,五個機器學習模型皆有一定之臨床有效性,其中又以XGBoost及LightGBM之表現最佳。 It can be seen from the results in Figure 3 and Table 3 that compared to the lower accuracy of LR, the accuracy of the other four machine learning models is high. Specifically, the AUCs of XGBoost, LightGBM, CatBoost and RF are 0.921 and 0.921 respectively. , 0.920 and 0.918; and from Figure 3B, it can be seen that there is good consistency between the predicted values of each machine learning model and the actual observed values, and XGBoost has the best consistency: and from the results of Figure 3C, it can be seen that all five machine learning models have It has certain clinical effectiveness, among which XGBoost and LightGBM perform best.

實例三:與拔管預測相關之特徵關連性分析 Example 3: Feature correlation analysis related to extubation prediction

於本實例中,所使用之機器學習模型係為XGBoost。 In this example, the machine learning model used is XGBoost.

按照實例一中所揭示的四個主要臨床區塊,將實例二所得之20個關鍵特徵中分別分類所對應的主要臨床區塊中,分析各關鍵特徵之重要度,並藉由累加各主要臨床區塊中之關鍵特徵的重要度數值,得到各主要臨床區塊與拔管 預測間之重要度,結果如圖4所示。由圖4之結果可知,意識/認知區塊、體液平衡區塊、呼吸功能區塊、生理參數區塊於重症監護之重要性,其數值分別為0.284、0.425、0.232、0.045。 According to the four main clinical blocks revealed in Example 1, the 20 key features obtained in Example 2 were classified into the corresponding main clinical blocks, and the importance of each key feature was analyzed, and by accumulating the main clinical The importance value of the key features in the block is obtained to obtain the main clinical blocks and extubation The importance between predictions, the results are shown in Figure 4. From the results in Figure 4, it can be seen that the importance of consciousness/cognition block, body fluid balance block, respiratory function block, and physiological parameter block in intensive care are 0.284, 0.425, 0.232, and 0.045 respectively.

再藉由SHAP值得到20個關鍵特徵分別如何影響拔管可能性,結果如圖5所示。由圖5之結果可知,GCS之改善及尿量增加與一天後有較高拔管概率呈正相關;而注射液體量高則於拔管概率呈現負相關。 Then, we used SHAP values to determine how the 20 key features affect the possibility of extubation. The results are shown in Figure 5. It can be seen from the results in Figure 5 that improvement of GCS and increase in urine output are positively correlated with a higher probability of extubation one day later; while a high volume of injected fluid is negatively correlated with the probability of extubation.

更進一步,如圖6A至圖6F所示,藉由部分依賴圖(Partial dependence plot,PDP),得到GCS、RASS、尿量、注射量、PIP、MAP等關鍵特徵係分別如何影響機器學習模型評估拔管概率。由圖6A至圖6F之結果可顯示透過將各主要臨床區塊或/及各特徵進行可視化處理後,係可以作為解釋拔管成功率之基礎,意即透過各主要臨床區塊或/及各特徵之SHAP值或部分依賴圖,可使臨床醫生得知透過本創作所揭以機器學習模型建立拔管預測之方法得到之拔管預測結果的理由或是立論基礎。 Furthermore, as shown in Figure 6A to Figure 6F, through the partial dependence plot (PDP), we can obtain how key features such as GCS, RASS, urine volume, injection volume, PIP, and MAP affect the machine learning model evaluation respectively. Probability of extubation. The results from Figure 6A to Figure 6F can show that by visualizing each main clinical block or/and each characteristic, it can be used as a basis for explaining the success rate of extubation, that is, through each main clinical block or/and each characteristic The SHAP value or partial dependence diagram of the features can enable clinicians to know the reasons or the basis for the extubation prediction results obtained through the method of establishing extubation prediction using the machine learning model disclosed in this work.

實例四:分析關鍵特徵對於拔管預測之影響 Example 4: Analyze the impact of key features on prediction of extubation

圖7A及圖7B分別藉由關鍵特徵的LIME及SHAP力值說明關鍵特徵對於兩個不同個體之拔管預測的總體影響,其中,於圖7A及圖7B中,紅色代表對於拔管總預測概率有增量影響(incremental effect)的變量,而藍色代表對於拔管總預測概率有減量影響(decremental effect)的變量。具體來說,由圖7A之結果可知,雖然案例1於拔管前第2天的注射量偏高(2521ml),但是仍具有許多有利於拔管的變數存在,包含意識清楚(GCS為14及RASS為0)、高尿量(於拔管前第2天的尿量為2450ml)且低呼吸頻率(於拔管前第2天的呼吸頻率為14.5),因此,於案例1的拔管預測可能性為0.81。反之,由圖7B之結果可知,案例2存在許多不利拔管之變數,包含有高注射量(拔管前1天為2811ml)、高PIP(29.50cmH2O) 及MAP(15.5mg/dL),因此即便案例2具有相對清楚的意識(GCS為15及RASS為-1),案例2之拔管預測可能性仍僅有0.19。 Figures 7A and 7B illustrate the overall impact of key features on the prediction of extubation for two different individuals through the LIME and SHAP force values of key features respectively. In Figures 7A and 7B , red represents the total predicted probability of extubation. Variables that have an incremental effect (incremental effect), while blue represents variables that have a decremental effect (decremental effect) on the overall predicted probability of extubation. Specifically, it can be seen from the results in Figure 7A that although the injection volume of Case 1 on the second day before extubation was relatively high (2521ml), there are still many variables that are conducive to extubation, including clear consciousness (GCS of 14 and RASS is 0), high urine output (urine output on the 2nd day before extubation is 2450ml) and low respiratory rate (respiratory rate on the 2nd day before extubation is 14.5), therefore, the extubation prediction in Case 1 The probability is 0.81. On the contrary, it can be seen from the results in Figure 7B that Case 2 has many variables that are unfavorable to extubation, including high injection volume (2811ml one day before extubation) and high PIP (29.50cmH2O) and MAP (15.5mg/dL), so even if Case 2 has relatively clear consciousness (GCS is 15 and RASS is -1), the predicted probability of extubation in Case 2 is still only 0.19.

實例五:缺少關鍵特徵進行機器學習模型演算之結果 Example 5: The result of machine learning model calculation without key features

參照實例二所揭內容,以不同機器學習模型分析受試者資料特徵,惟,不同者在於,所用以進行分析受試者資料特徵中不包含有意識區塊之特徵:GCS及RASS,而後,檢核每個機器學習模型所得結果之精準度、專一性、靈敏性、準確度及AUROC,結果如下表4所示。 Referring to the content disclosed in Example 2, different machine learning models are used to analyze the characteristics of the subject's data. However, the difference is that the characteristics of the subject's data used to analyze the characteristics do not include the features of conscious blocks: GCS and RASS. Then, Check the precision, specificity, sensitivity, accuracy and AUROC of the results obtained by each machine learning model. The results are shown in Table 4 below.

Figure 112205118-A0305-02-0018-5
Figure 112205118-A0305-02-0018-5

將表3與表4之結果進行比較可知,當缺少本創作所揭關鍵特徵時,所得之拔管預測可能性的可信度將會降低。 Comparing the results in Table 3 and Table 4, it can be seen that when the key features disclosed in this creation are missing, the reliability of the predicted possibility of extubation will be reduced.

10:以機器學習模型建立拔管預測之系統 10: Establish an extubation prediction system using machine learning models

20:資料庫 20:Database

30:處理裝置 30: Processing device

31:資料處理模組 31:Data processing module

32:模型訓練模組 32: Model training module

33:拔管預測模組 33: Extubation prediction module

34:可視化模組 34:Visualization module

Claims (6)

一種以機器學習模型建立拔管預測之系統,其包含有:一資料庫,係收集複數病人之一特徵資料,其中,各該病人於住院期間係有使用一呼吸輔助設備之記錄,並該特徵資料係包含有一意識資料、一輸入/輸出液體資料、一呼吸功能資料及一生理參數資料;一處理裝置,係與該資料庫連接,並具有一資料處理模組,自一待測病人之特徵資料中擷取出一關鍵特徵資料,其中,該關鍵特徵資料係包含有該生理參數資料,以及於進行拔管預測之日的前1日或/及前2日所獲得之該意識資料、該輸入/輸出液體資料、該呼吸功能資料;一拔管預測模組,以一拔管預測模型分析該關鍵特徵資料,得到該待測病人於該於進行拔管預測之日後的24小時內之拔管可能率。 A system for establishing extubation prediction using a machine learning model, which includes: a database that collects characteristic data of a plurality of patients, in which each patient has a record of using a respiratory assist device during hospitalization, and the characteristics The data includes a consciousness data, an input/output fluid data, a respiratory function data and a physiological parameter data; a processing device is connected to the database and has a data processing module to obtain the characteristics of a patient to be tested. Extract a key feature data from the data, wherein the key feature data includes the physiological parameter data, as well as the consciousness data and input obtained one day or/and two days before the date of extubation prediction. /Output the fluid data and the respiratory function data; an extubation prediction module analyzes the key characteristic data with an extubation prediction model to obtain the extubation of the patient to be tested within 24 hours after the date of extubation prediction. probability. 如請求項1所述以機器學習模型建立拔管預測之系統,其中,該拔管預測模組係以該拔管預測模型分析該關鍵特徵資料後,產出該關鍵特徵資料與拔管可能率間之關連性資訊。 Establish an extubation prediction system using a machine learning model as described in claim 1, wherein the extubation prediction module analyzes the key feature data using the extubation prediction model to generate the key feature data and the extubation probability. information related to each other. 如請求項2所述以機器學習模型建立拔管預測之系統,其中,該處理裝置係更包含有一可視化模組,其係將該關鍵特徵資料與拔管可能率間之關連性資訊轉化為一可視化介面。 As described in claim 2, a system for predicting extubation is established using a machine learning model, wherein the processing device further includes a visualization module that converts the correlation information between the key characteristic data and the probability of extubation into a Visual interface. 如請求項1所述以機器學習模型建立拔管預測之系統,其中,該拔管預測模型係選自由XGBoost(Extreme Gradient Boosting)、CatBoost(Categorical Boosting)、LightGBM(Light Gradient Boosting Machine)、隨機森林演算法(Random Forest,RF)及邏輯回歸(Logistic Regression,LR)所組成之群。 Establish a system for extubation prediction using a machine learning model as described in request 1, wherein the extubation prediction model is selected from XGBoost (Extreme Gradient Boosting), CatBoost (Categorical Boosting), LightGBM (Light Gradient Boosting Machine), Random Forest A group composed of algorithm (Random Forest, RF) and logistic regression (Logistic Regression, LR). 如請求項1所述以機器學習模型建立拔管預測之系統,其中,該處理裝置係更包含有一模型訓練模組; 該模型訓練模組係藉由該些病人之該些特徵資料之至少一部訓練一機器學習模型,並得進行驗證,以產生一拔管預測模型及一關鍵特徵。 As claimed in claim 1, a system for predicting extubation is established using a machine learning model, wherein the processing device further includes a model training module; The model training module trains a machine learning model using at least part of the characteristic data of the patients, and can perform verification to generate an extubation prediction model and a key feature. 如請求項5所述以機器學習模型建立拔管預測之系統,其中,該資料處理模組係讀取該資料庫中該些病人之該特徵資料並進行一資料預處理程序,以刪除超出一預設合理範圍之資料或/及補足缺失部分之資料。 As described in request 5, a system for predicting extubation is established using a machine learning model, wherein the data processing module reads the characteristic data of the patients in the database and performs a data preprocessing procedure to delete data exceeding a certain threshold. Define a reasonable range of data or/and fill in missing parts of the data.
TW112205118U 2023-05-23 2023-05-23 System for predicting extubation using machine learning models TWM648156U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW112205118U TWM648156U (en) 2023-05-23 2023-05-23 System for predicting extubation using machine learning models

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW112205118U TWM648156U (en) 2023-05-23 2023-05-23 System for predicting extubation using machine learning models

Publications (1)

Publication Number Publication Date
TWM648156U true TWM648156U (en) 2023-11-11

Family

ID=89721150

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112205118U TWM648156U (en) 2023-05-23 2023-05-23 System for predicting extubation using machine learning models

Country Status (1)

Country Link
TW (1) TWM648156U (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI886994B (en) * 2024-06-07 2025-06-11 中國醫藥大學 Explainable artificial intelligence method applied to clinical medicine and system thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI886994B (en) * 2024-06-07 2025-06-11 中國醫藥大學 Explainable artificial intelligence method applied to clinical medicine and system thereof

Similar Documents

Publication Publication Date Title
TWI852579B (en) Establishing a system and method for extubation prediction using machine learning model
CN110051324B (en) Method and system for predicting death rate of acute respiratory distress syndrome
CN110827993A (en) Early death risk assessment model establishing method and device based on ensemble learning
CN111899866B (en) Surgical operation complication evaluation system based on deep learning
CN110136788A (en) It is a kind of based on the case history quality detecting method, device, equipment and the storage medium that detect automatically
CN113257371B (en) Clinical examination result analysis method and system based on medical knowledge map
JP2013513846A (en) Automatic annotation of medical data
CN112652398A (en) New coronary pneumonia severe prediction method and system based on machine learning algorithm
CN117219275A (en) Child drug-induced liver injury risk identification prediction method and system
CN107145715B (en) A clinical medical intelligent discrimination device based on recommendation algorithm
CN116844733B (en) Medical data integrity analysis method based on artificial intelligence
CN112967803A (en) Early mortality prediction method and system for emergency patients based on integrated model
WO2022178947A1 (en) Monitoring and early warning method and apparatus based on multiple dimensions, and device and storage medium
CN111063448A (en) Establishment method, storage system and active early warning system of blood transfusion adverse reaction database
CN116564521A (en) Chronic disease risk assessment model establishment method, medium and system
CN118312816B (en) Cluster weighted clustering integrated medical text processing method and system based on member selection
CN117672532A (en) Hospitalized patient nursing risk assessment early warning monitoring system and method
CN120600308A (en) Intelligent health examination data quality control analysis method and system
TWM648156U (en) System for predicting extubation using machine learning models
Junger et al. Automatic calculation of a modified APACHE II score using a patient data management system (PDMS)
CN118737438B (en) A perioperative risk grading model construction method and a perioperative risk grading method
CN114724701A (en) Noninvasive ventilation curative effect prediction system based on superposition integration algorithm and automatic encoder
CN120221055A (en) Clinical performance verification system based on infectious lung disease diagnostic model
CN118737467A (en) A machine learning model for monitoring the risk of in-hospital mortality in patients with heart failure in the intensive care unit
CN112205965B (en) Health risk key event detection method and system based on time window cutting