WO2015130929A2 - Apparatus and method for detecting and removing artifacts in optically acquired biological signals - Google Patents
Apparatus and method for detecting and removing artifacts in optically acquired biological signals Download PDFInfo
- Publication number
- WO2015130929A2 WO2015130929A2 PCT/US2015/017746 US2015017746W WO2015130929A2 WO 2015130929 A2 WO2015130929 A2 WO 2015130929A2 US 2015017746 W US2015017746 W US 2015017746W WO 2015130929 A2 WO2015130929 A2 WO 2015130929A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- segment
- eigenvalues
- data
- corrupted
- eigenvectors
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- A61B5/7267—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/02—Detecting, measuring or recording for evaluating the cardiovascular system, e.g. pulse, heart rate, blood pressure or blood flow
- A61B5/024—Measuring pulse rate or heart rate
- A61B5/02416—Measuring pulse rate or heart rate using photoplethysmograph signals, e.g. generated by infrared radiation
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7203—Signal processing specially adapted for physiological signals or for diagnostic purposes for noise prevention, reduction or removal
- A61B5/7207—Signal processing specially adapted for physiological signals or for diagnostic purposes for noise prevention, reduction or removal of noise induced by motion artifacts
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7221—Determining signal validity, reliability or quality
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
Definitions
- These teachings relate generally to an apparatus and a method for detecting and removing artifacts in optically acquired biological signals. More particularly, these teachings relate generally to an apparatus and a method for detecting and reconstructing motion and noise artifacts (MNA) in photoplethysmography (PPG) signals.
- MNA motion and noise artifacts
- PPG is a non-invasive and low cost device to continuously monitor blood volume changes in peripheral tissues.
- PPi ⁇ is a useful technique since it is widely used to monitor heart rate (BR), arterial oxygen saturation (Sp02), and can also he used to measure respiratory rates.
- BR heart rate
- Sp02 arterial oxygen saturation
- MNA can distort PPG recordings, causing erroneous estimation of HR and Sp02.
- MNA MNA artifacts that can distort PPG recordings: (1) environmental, physiological, and experimental artifacts, which cars be attributed to power interference surrounding the body; (2) correlated dynamics from other physiological signals; and (3) instrumental noise, respectively, MNA, which are comprised of all of the aforementioned noise sources, are difficult to filter since they do not have a prede termi ned frequency band and their spectrum often overlaps with that of the desired PPG signal
- MNA in PPG readings are caused by 1) the movement of venous blood as well as other non-pulsatile components along with pulsatile arterial blood and 2) variations in the optical coupling between the sensor and the skin.
- Various approaches to mitigate motion artifacts by improving sensor attachment have been proposed. However, these design improvements do not provide a significant reduction of motion artifacts.
- Algorithm- based MNA reduction methods are also proposed. These include time and frequency domain filtering, power spectrum analysis, and blind source separation techniques. However, these have high computational complexity and more importantly, they operate even on clean PPG portions where MNA reduction is not needed.
- MNA detection which identifies clean PPG recordings from corrupted portions, is essential for the subsequent MNA reduction algorithm so that it does not distort the non-corrupted data segments.
- MNA detection methods are mostly based on a signal quality index (SQI) which quantifies the severity of the artifacts.
- SQL signal quality index
- Some approaches quantify SQI using waveform morphology or filtered output, while others derive SQI with the help of additional hardware such as accelerometer and electrocardiogram sensing.
- Statistical measures such as skewness, kurtosis. Shannon entropy, and Renyi's entropy, have been shown to be helpful in
- arterial oxygen saturation reflects the relative amount of oxyhemoglobin in the blood.
- the most common method to measure it is based on pulse oximetry, whereby oxidized hemoglobin and reduced hemoglobin have significantly different optical spectra. Specifically, at a wavelength of about 660 nm, and a second wavelength between 805 and 960, there is a large difference in light absorbance between reduced and oxidized hemoglobin, A measurement of the percent oxygen saturati on of blood is defined as the ratio of oxyhemoglobin to the total concentration of hemoglobin present in the blood. Pulse oximetry assumes that the attenuation of light is due to both the blood and bloodless tissue.
- Fluctuations of the PPG signal are caused by changes in arterial blood volume associated with each heartbeat, where the magnitude of the fluctuations depends on the amount of blood rushing into the peripheral vascular bed, the optical absorption of the blood, skin, and tissue, and the wavelength used to illuminate the blood.
- the pulse oximeter signal contains not only the blood oxygen saturation and heart rate data, but also other vital physiological information.
- the fluctuations of PPG signals contain the influences of arterial, venous, autonomic and respiratory systems on the peripheral circulation.
- a single sensor that has multiple functions is very attractive from a financial perspective.
- utilizing a pulse oximeter as a multi-purpose vital sign monitor has clinical appeal, since it is familiar to the clinician and comfortable for the patient.
- Knowledge of respiratory rate and heart rate patterns can provide more useful clinical information in many situations in which pulse oximeter is the sole monitor available.
- MNA result in unreliable heart rate and Sp02 estimation.
- Clinicians have cited motion artifacts in pulse oximetry as the most common cause of false alarms, loss of signal, and inaccurate readings.
- MNA are difficult to remove because they do not have a predefined narrow frequency band and their spectrum often overlaps that of the desired signal.
- An adaptive filter is easy to implement and it also can be used in real-time applications, though the requirement of additional sensors to provide reference inputs is the major drawback of such methods.
- BSS blind source separation
- ICA the recorded signals are decomposed into their independent components or sources.
- CCA uses the second order statistics (SOS) to generate components derived from their uncorrelated nature.
- SOS second order statistics
- PCA is another nois reduction technique which aims to separate the clean signal dynamics from the MNA data.
- a multi-scale PCA has also heen proposed to account for time-varying dynamics of the signal and motion artifacts from PPG recordings,
- a promising approach that can be applied to signal reconstruction is the singular spectrum analysis (SSA).
- the SSA is a model-free BSS technique, which decomposes the data into a number of components which may include trends, oscillatory components, and noise (see, for example, B. S, Kim and S. K, Yoo, "Motion artifact reduction in
- SSA photoplethysmography using independent component analysis
- these teachings provide systems and methods that can distinguish clean from corrupted PPG signals under various types of motions and reconstruct the MNA contaminated data segments, such that biological parameters, e.g., heart rates and SpG2 values, can be accurately estimated.
- the system of these teachings includes one or more processors and one or more computer usable media having computer readable code embodied therein, the computer readable code causing the one or more processors to execute the method of these teachings.
- the method of these teachings includes a method for determining MNA are present, in a segment of PPG data by determining a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set, using the training set to train a SVM, training resulting in a trained SVM, determining the plurality of time domain features for the segment, and using the trained SVM to determine whether motion and noise artifacts are present in the segment,
- the method of these teachings includes a method for removal of MNA present in a. segment of PPG data, by the steps of: (a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al ) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and pins one; a stalling value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for the data
- the system of these teachings includes a system for determining whether MNA are present in a segment of PPG data, having one or more processors and non-transitory computer usable media having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: determine a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set; use the training set to train a SVM, training resulting in a trained SVM; determine the plurality of time domain features for the segment; and use the trained SVM to determine whether motion and noise artifacts are present in the segment.
- the system of these teachings includes a system for removal of MNA. present in a segment of PPG data, having one or more processors and non- transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: (a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having
- FIG. 1 A representative clean forehead- PPG signal recorded during voluntary motion artifact conducted in a laboratory setting (1 t row). The mixed (up-down and left- right) movement of the forehead to which the PPG probe is attached for predetermined time interval induced 10% to 50% noise (2nd - 6th row) within a 60s PPG segment.
- FIG. 3 Test phase of the disclosed SVM-based motion detection algorithm.
- the hidden layers correspond to kernel function of the SVM,
- the function between hidden layer and output layer is a linear operator.
- Neighbor segments are the segments surrounding a target segment within ⁇ 2 seconds, Decisions on the target segment are based on a majority vote from the decisions of neighbor segments as well as the one of the target segment (red).
- FIG. 5A ⁇ F A sample forehead recorded PPG signal (a) along with the (b) standard deviation of P-P intervals (c) standard deviation of P-P amplitudes (d) standard deviation of systolic artd diastolic time ratio, and (e) mean standard deviation of pulse shape, computed for each segment.
- the normalized sampled corrupt and clean PPGs for mean standard deviation of pulse shape is given in (f).
- FIG. 6A-B Trained SVM classification with a sample training finger recorded PPG signal is given with (a)-(b) pairs of two parameters.
- the SVM decision and margin boundaries are marked by black and green lines, respectively.
- FIG. 7A-B Validation: pairs of parameters for clean and corrupted PPG signals.
- Figure 8 A representative PPG signal with detected peaks (red) (a) along with the (b) standard deviation of P ⁇ P intervals (c) standard deviation of P-P amplitudes (d) mean standard deviation of pulse shape and (e) standard deviation of systolic and diastolic time ratio, computed for each segment
- FIG. 9 Detection Probability of Corruption by additive white Gaussian noise (AWGN) for varying SN from -20 to 0 dB. 50 AWGN realizations for each SNR level are separately added to a non-MNA corrupted PPG. Each realization is tested by the disclosed M A detection algorithm to compute the detection probability of corruption
- AWGN additive white Gaussian noise
- Figure lOA-C Classification performance comparison between our SVM algorithm, Hjorlh (HI, H2), Kurtorsis and Shanon Entropy ( , SE) parameters, (a) Accuracy; (b) Sensitivity; (c) Specificity.
- the central mark on each bo corresponds to the median; the edges of the box correspond to the 25th and 75th percentiles, the whiskers extend to the most extreme data points not considered outliers, and outliers are plotted individually.
- (*) indicate the mean is significantly different (p ⁇ 0.05 at 95% CI) between SVM and other methods used for comparison
- FIG 11 A ⁇ B Comparison of mean errors and detection error fraction between original signal (labeled "None") and artifact removed signal from five detection methods (SVM, HI , H2 S K, and SE).
- SVM artifact removed signal from five detection methods
- HR error HR error
- Sp02 error Sp02 error
- Figure 12A-C Mean error comparison between our SVM algorithm, Hjorth (HI , H2), Kurtorsis and Shanon Entropy (K. SE) parameters, (a) heart rate; (b) Sp02; (c) detection error.
- the central mark on each box corresponds to the median; the edges of the box correspond to the 25th and 75th percentiles, the whiskers extend to the most extreme data points not considered outliers, and outliers are plotted individually.
- (*) indicate the mean is significantly different (p ⁇ 0.05 at 95% CI) between SVM and other methods used for comparison.
- the x-axis labeled "None" in all panels refers to the mean errors when compared to the reference signals without removing the MNA detected segments as identified by any of the five computational methods
- Figure 13 Typical infrared PPG signal; (a) clean, (b) corrupted with motion artifacts, Figure 14A-B.
- Figure I 5A-C Iterative reconstruction of a corrupted eigenvector with frequency of 0.967 Hz.
- Black font signals (top panels) represent the clean component with frequency of 0.967 Hz; Blue font signals (2nd rows) indicate the corrupted component with the same frequency; Pink font signals are related to iterative evolution of corrupted component to a clean oscillatory signal, (a) Reconstruction of 4th corrupted eigenvector compared to the corresponding clean component. The final pattern after 4 iterations resembles the black font clean component in the top panel.
- This component is chosen among the components with the same frequency, since it shows the most similarity to the black font clean component, (b) Reconstruction of 9th corrupted eigenvector compared to the corresponding clean component, (c) Reconstruction of 22nd corrupted eigenvector compared to the corresponding clean component
- Figure 18A-D (a) HR estimated from IMAR-reeonstructed PPG compared to reference and corrupted PPG; (b) HR estimated from ICA-reconstructed PPG compared to reference and corrupted PPG; (c) Sp02 estimated from IM AR-reconstmcted PPG compared to reference and corrupted PPG; (d) Sp02 estimated from ICA-reeonstrucled PPG compared to reference and corrupted PPG.
- FIG. 19 is a schematic block diagram representation of one embodiment of the system of these teachings.
- an accurate and comprehensive MNA detection algorithm which detects MNA in PPG under various types of motion.
- time-domain parameters are introduced to quantify MNA in the recorded PPG signal.
- the statistical measures of the time-domain parameters are considered as input var bles for a machine learning-based MNA detection algorithm.
- the MNA detection algorithm may be self-trained by the SVM with clean and corrupted PPG data sets, and then the trained SVM can be used to test the unknown PPG data.
- the efficacy of the MNA detection algorithm is tested on PPG data sets recorded from the finger and forehead pulse oximeters in simulations, laboratory- controlled and walking/stair-elimbing experiments, respectively.
- PPG signals can be obtained from custom reflectance-mode prototype pulse oximeters.
- PPG data with laboratory-controlled head and finger movement, daily-activity movement, or simulated movement are collected respectively from healthy subjects recruited from the student community of Worcester Polytechnic Institute (WPI). This study is approved by WPFs I B and all subjects are given informed consent prior to data recording.
- WPI Worcester Polytechnic Institute
- motion artifacts are induced by head movements for specific time intervals in both horizontal and vertical directions.
- eleven healthy volunteers are asked to wear a forehead reflectance pulse oximeter along with a reference Masimo Radical (Masimo SET®) fmger type transmiitance pulse oximeter.
- subjects are instructed to introduce motion artifacts for specific time intervals varying from 10 to 50% within a 1 minute segment. For example, if a subject is instructed to perform left-right movements for 6 seconds, a 1 minute segment of data would contain 10% noise.
- the right middle fmger with the sensor attached to the Masimo pulse oximeter is kept stationary.
- HR and Sp02 signals are acquired by the Masimo pulse oximeter at 80Hz and 1 Hz, respectively, and are acquired synchronously with the PPG signals recorded from the forehead sensor.
- motion artifacts are induced by left- right movements of the index finger
- nine healthy volunteers are asked to sit and wear two reflection type PPG pulse oximeters (TSD200) on their index and middle fingers, respectively.
- TSD200 reflection type PPG pulse oximeters
- motion artifacts are induced by left-right movements of the index finger while the middle finger is kept stationary as a reference.
- motion is induced at specific time intervals corresponding to 10-50% duration in a 1 minute segment.
- Such controlled movement is repeated five times per subject.
- the pulse oximeters are connected to a biopotential amplifier (PPG100) having a gain of 100 and cut-off frequencies of 0.05-10 Hz,
- PPG100 biopotential amplifier
- the MPIOOO BIOPAC Systems Inc., CA, USA
- PPG data are recorded while subjects are walking straight or climbing stairs for 45 min.
- the nine subjects are asked to walk or climb stairs after wearing a forehead reflectance pulse oximeter along with a Holter
- ECG electrocardiogram
- Rozinn RZ153+ Rozinn RZ153+
- Masimo Rad-57 pulse oximeter at 0.5Hz
- the reference ECG is obtained from the Holier ECG monitor while HR and Sp02 readings are measured from the Masimo pulse oximeter connected to the subject ' s righ index finger, which is held against the chest to minimize motion artifacts.
- HR and Sp02 readings are measured from the Masimo pulse oximeter connected to the subject ' s righ index finger, which is held against the chest to minimize motion artifacts.
- the simulati on movement PPG data are generated by the addition of white noise to the clean P PG data.
- PPG data are preprocessed by a 6th order infinite impulse response (ilR.) band pass filter with cut-off frequencies of 0,5 Hz and 12Hz.
- infinite impulse response (ilR.) band pass filter with cut-off frequencies of 0,5 Hz and 12Hz.
- Zero-phase forward and reverse filtering is applied to account for the non-linear phase of the OR filter.
- the method of these teachings includes a method for determining whether MNA are present in segment of PPG data by determining a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments withou t motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set, using the training set to train a SVM, training resulting in a trained SVM, determining the plurality of time domain features for the segment, and using the trained SVM to determine whether motion and noise artifacts are present in the segment.
- the method also includes band pass before determining the plurality of time domain features, each segment from the plurality of test segments.
- the method still further includes determining whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments, neighboring segments being segments surrounding the segment within a predetermined time Interval .
- the method includes applying a majority vote algoritlim to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments,
- the time domain features include at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic, and diastolic ratio within a segment, and mean standard devi ation of pulse shape within an interval.
- the STD HR !I of the segment is defined by:
- a I is peak amplitude at the i* pulse of the «* segment and A is mean peak- to-peak interval of the n & segment.
- the A n J is defined by the difference between the i tk peak and the forthcoming ( +!)* trough amplitudes.
- R sa n is systolic and diastolic time interval ratio at the i* pulse of the A segment and R SP lf is the mean systolic and diastolic time interval ratio of the w" 1 segment.
- the R m n is calculated by
- N srap sample points of a pulse.
- the 5 2 . AV , tone of the segment is derived by taking average of the standard deviation at each sample point as follows:
- SVM can be applied to build a decision boimdaiy classifying motion corruption from clean PPG signals
- SVM is widely used in classification and regression due to its accuracy and robustness to noise (see, for example, C.-W. Hsu, C.-C. Chang, and C.-J. Lin, "A Practical Guide to Support Vector Classification,” Department of Computer Science, National Taiwan University 2003, a copy of which is incorporated by reference here in its entirety and for all purposes)
- the SVM includes training and test phases described further below.
- Training phase A flow chart of the training phase in the SV -based MN A detection algorithm is shown in FIG. 2.
- the SVM takes the parameter values of clean and corrupted PPG segments as a training data set, finds the support vectors among the training data set which maximize the margin (or the distance) between different classes, and finally builds a decision boundary. If the estimated decision is different from its known label, the decision is regarded as a training error.
- a soft-margin SVM is considered, which can set the boundary even when the data sets are mixed and cannot be separated.
- slack variables are introduced to minimize the training error with maximizing the margin.
- Soft-margin SVM uses the following equation to find the support vectors.
- the non-linear SVM can be transformed to a linear SVM,
- Eq. (7) is modified as
- FIG, 3 shows a flow chart of the test phase in the SVM-based MNA. detection algorithm.
- the PPG data can be partitioned into many 7-second segments.
- Parameters can be deri ved from each PPG portion to examine if it is corrupted by motion artifact or not.
- Neighbor segment is defined as a segment surrounding a target segment within iTneighbor seconds. Decision on a neighbor segment is highly likely to be the same as the decision on a target segment since PPG pulses in tfie neighbor segments are most likely to exhibit similar dynamics to the target segment.
- the algorithm gathers the decisions of neighbor segments as well as target segment (see, for example. FIG. 4) and makes a final decision on the target segment based on a majority vote concept (see, for example, Wim H. Hesselink, The Boyer-Moore Majority Vote
- the performance of the MNA detection algorithm can be evaluated for various types (simulated, laboratory controlled, and daily activities) of motion-corrupted PPGs so as to validate the performance in a wide range of scenarios.
- the PPG recordings are divided into 7-second segments since this is determined to be the optimal size among the data length tested from 3-1 1 seconds (see below PERFORMANCE
- Table 1 below describes the number of clean and corrupted PPG segments for each motion type used in the experiment as determined by the criteria defined above.
- FIG. 5 A and FIGs, 5B through 5E A sample forehead PPG signal and lis corresponding parameters calculated segment- by ⁇ segmexit are given in FIG, 5 A and FIGs, 5B through 5E, respectively.
- the normalized sampled corrupt and clean PPGs for mean standard deviation of pulse shape is given in FIG, 5F.
- FIGs, 6 A and 6B show (STD ⁇ STD ⁇ ) and (STD m) STD WAV ) of clean (circle) and corrupted (star) forehead signals, respectively, with corresponding SV boundaries (black line).
- a linear kernel is considered for the SVM in the experiment.
- FIG. 7 shows classification results by the SVM boundaries obtained from FIG. 6.
- FIG. 8 shows a representative PPG signal with detected peaks (red) along with the corresponding statistical parameter values. Note the corrupted PPG signal interval between 21 to 31 seconds. The discrepancy between corrupted and clean portions is reflected by parameters STD ⁇ , STD ⁇ v p , STD sa and STD WA . The parameter values from the corrupted PPG segments exhibit larger variability and consequently have higher standard deviation value compared to those from clean data segments.
- the STD m , >5 D AMP and STD WAV have large values between 21-35 seconds (see FIGs, 8B-8D), while STD S0 has large value only between 21-28 seconds (see FIG. ⁇ E).
- Table II below presents C for finger, forehead, and walking/siair-elimbing data.
- the disclosed algorithm is tested to different segment lengths varying from 3 to 11 seconds and calculated their mean classification accuracies, which are provided In below Table III.
- the 7 ⁇ second segment provided the highest classification accuracies for all data; finger, forehead and walking/stair-climbing PPG signals.
- Accuracy, specificity, and sensitivity for each dataset are presented in Table IV.
- the SVM performance using the 7-second segment showed a 93.9% accuracy, 92.4% specificity, and 94.3% sensitivity.
- Gaussian white noise of varying signal ⁇ to ⁇ noise (SNR) levels is added to a representaiive non ⁇ MNA corrupted PPG signal.
- SNR signal ⁇ to ⁇ noise
- 50 independent clean PPG signal 50 independent clean PPG signal.
- the PPG signals with a SNR below -10 dB are detected as corrupted data with our algorithm.
- SNR of -20 dB every segment is detected as corrupted.
- the disclosed algorithm is compared with other artifact detection methods based on HI, H2 , K and SE since these methods have been shown to provide good detection accuracies.
- the HI and H2 parameters represent the central frequency and half of bandwidth, respectively, and are defined as follows;
- FIGs. 1GA- I 0C compare the medians and 25th and 75th perceniiies of detection accuracy, sensitivity, and specificity for all five detection methods for the finger, head and walking/stair-cllmbing data sets.
- the disclosed SVM method consistently yields higher performance with a mean accuracy of 94%, sensitivity of 97%, and a specificity of 92%; whereas other methods show fluctuations depending on which datasets are used, in the finger recorded data, HI yields a slightly higher accuracy than ail other methods due to higher specificity, but the detection sensitivity is lower.
- FIG. 1 1 A shows a comparison of the mean HR error and detection error fraction from five MNA detection methods for walking/stair-climbing data.
- the HR errors are defined by the difference between the estimated HR derived from the PPG and the reference HR readings. Low error values reflect an effective artifact detection algorithm, The disclosed algorithm yields the lowest HR error and detection error fraction as compared with other MNA methods.
- FIG. 1 IB shows a comparison of mean Sp02 error and detection error fraction from five MNA detection methods.
- the SE based detection method shows a lower mean Sp02 error than the disclosed algorithm, but its detection error fraction is very high (>70%), indicating that the error is computed based on only 30% of clean data.
- FIG, 12 shows a comparison of live MNA detection methods in terms of paired-t test results of HR and Sp02 estimation and detection accuracy.
- the SVM algorithm outperformed the K, SE, HI and H2 methods with HR errors of 2,3 bpm, Sp02 errors of 2.7% and detection error fraction of 6,3%,
- the disclosed MNA detection algorithm has been designed based on four parameters: (a) standard deviation of peak-to-peak intervals (b) standard deviation of peak-to-peak amplitudes (e) standard deviation of systolic and diastolic time ratios, and (d) mean-standard deviation of pulse shapes.
- the disclosed MNA algorithm is compared to other well- established MNA detection methods, using the 7-second data segment as this length has been determined to provide the optimal classification accuracy.
- FIG. 10A indicates that the mean classification accuracy is significantly different (p ⁇ 0,05 at 95% CI) between the disclosed SVM method and other methods, except for HI.
- FIGs. 11 A and 1 IB summarizes paired ⁇ t test results for HR and Sp02 estimations as well as detection accuracy.
- SVM is significantly different from HL H2, K, and SE in terms of HR estimation and detection accuracy (see FIGs, ⁇ 2 ⁇ and 12C), while Sp02 derived from the S VM method is
- the disclosed MNA detection algorithm coded with Matlab (2012a) takes only 7 ms on an Intel Xeon 3.6 GHz computer for the 7-second data segment. Hence, the disclosed algorithm is real-time realizable especially when It is coded in either C or CA+.
- the disclosed computational MNA detection algorithm has provided high HR and Sp02 estimation accuracy as well as classification accuracy. Moreover, the disclosed algorithm shows significantly better performance than some well-cited methods with good detection accuracy, Another key advantage of the disclosed algorithm is that it is able to detail with a near pinpoint accuracy when MNA starts and ends.
- a PPG signal can be reconstructed from those portions of data that have been identified to be comipted using the algorithm detailed hereinabove.
- the fidelity of the reconstructed signal is determined by comparing the estimated Sp02 and heart rate (HR) to reference values,
- HR heart rate
- the reconstructed Sp02 and HR values ohtained via the ICA are compared to those obtained by the method disclosed herein.
- the ICA results are chosen as the point of comparison, because ICA has recently been shown to provide accurate reconstruction of corrupted PPG signals,
- Subjects are directed to Introduce the motions for specific time intervals that determined the percentage of noise within each 1 minute segment, varying from 10 to 50%, For example, if a subject is instructed to make left- right movements for 6 seconds, a 1 minute segment of data would contain 10% noise.
- the second dataset includes finger-PPG signals from the same 9 healthy volunteers in an upright sitting posture using an infrared reflection type PPG transducer (TSD20Q).
- An MP 1000 pulse oximeter (commercially available from BIOPAC Systems inc., CA, USA) is also used to acquire finger PPG signals at 100 Hz.
- One pulse oximeter of each model is placed on the same hand's index finger (one model) and middle finger (the other model) simultaneously. After baseline recording for 5 minutes without any movement (i.e.
- motion artifacts are induced in the PPG data by the left-right movements of the inde finger while the middle finger is kept stationary to provide a reference. Similar to the first dataset, motion is induced at specific time intervals corresponding to 10 to 50% corruption duration in 1 minute segments, i.e. the controlled movement is carried out five times per subject.
- the third dataset includes data measurements from 9 subjects with the PPG signal recorded from the subjects' forehead using a custom sensor simultaneously with the reference EGG, HR and Sp02 from a Holier Monitor at 180 Hz and Masimo (Rad-57) pulse oximeter at 0.5 Hz respectively.
- the reference pulse oximeter provided HR and Sp02 measured from the subject's right index linger, which is held steadily to their chest.
- the signals are recorded while the subjects are going through sets of walking and climbing up and down flights of stairs for approximately 45 min.
- PPG signals from all three experiments outlined above are preprocessed offline using, for example, Matlab (MathWorks, R2012a).
- the PPG signals are filtered using a zero-phase forward-reverse 4th order IIR band-pass filter with cutoff frequency 0,5-12Hz.
- a method of these teachings includes a method for removal of motion and noise artifacts (MN A) present in a segment of PPG data, by the steps of: (a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which modem and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for
- the predetermined convergence criterion is a difference between a discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment, the discarding metric being a sum of absolute values of signal components divided by a length metric for the signal components,
- the predetermined frequency range is a heart rate range of PPG data.
- the predetermined frequency range includes frequencies greater than 0,66 Hz and less than 31 lz.
- the top predetermined percentage is a top 5%. in this method, the presence of motion and noise artifacts had been previously detected rising the method previously described.
- the SSA is composed of two stages: A) singular decomposition and B) spectral reconstruction.
- the former is the spectral decomposition or eigen-decornposition of the data matrix whereas the latter is the reconstruction of the signal, based on using only the significant eigenvectors and associated eigenvalues.
- the assumption is that given a relatively high signal- to-noise ratio of data, significant eigenvectors and associated eigenvalues represent the signal dynamics and less significant values represent the MNA components.
- the calculation of the singular stage of the SSA includes two steps: i) embedding followed by ii) singular value decomposition (SVD). in essence, these procedures decompose the data into signal dynamics including trends, oscillatory components, and MNA.
- the spectral stage of the SSA algorithm also includes two steps: i) grouping and ii) diagonal averaging. These two procedures are used to reconstruct the signal dynamics but without the MNA components.
- window length f j] ⁇ L ⁇ N/2 is chosen to embed the initial time series, where f s is the sampling frequency and , is the lowest frequency in the signal.
- the time series X is mapped into the L lagged vectors, x ⁇ * / » * , ⁇ x i + L -i ) for
- trajectory matrix ⁇ ⁇ can be denoted as
- T x T ⁇ +T 2 ...+T d
- the reconstruction stage has two steps: i) grouping and ii) diagonal averaging. First, the subgroups of the decomposed trajectory matrices are grouped and then a diagonal averaging step is needed so that a new time series can be formed.
- the grouping step of the reconstruction stage decomposes the L x K matrix ⁇ ) in to subgroups according to the trend, oscillatory components, and MNA dynamics.
- T f corresponds to the group / ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ i ⁇ ,...,/ w ⁇ .
- T ⁇ is a sum of T j , where ./ « ⁇ /, ⁇ . So T x can be expanded as
- each resultant matrix , 3 ⁇ 4, in Eq. (13) is transformed into a time series of length N .
- FIGs. 14A and 14B show the first 12 eigen vectors of the clean and MNA corrupted data as shown in FIG. 13, respectively.
- the most important part of the SSA is to choose the proper eigenvector components for reconstruction of the signal. Under the assumption of high SNR, the normal practice is to select only the largest eigenvalues and associated eigenvectors for signal, reconstruction. However, most often it is difficult to determine the demarcation of the significant from non-significant eigenvalues. Further, the MNA dynamics can overlap with the signal dynamics, hence, choosing the largest eigenvalues does not necessarily result in an MNA-free signal.
- the SSA approach is modified.
- the first step of the modified SSA involves computing singular value decomposition on both a corrupted data segment and its most prior adjacent clean data segment.
- the second step is to retain only the top 5% of the eigenvalues and their associated eigenvectors.
- the third step is to replace the corrupted segment's top 5% eigenvalues with the clean segment's eigenvalues.
- the fourth step is to further limit the number of eigenvectors by choosing only those eigenvectors that have heart rates between for both the clean and noise corrupted data segments. The two extreme heart rates are chosen so that they account for possible scenarios that one may encounter with low and high heart rates.
- non-significant eigenvectors are further pruned by performing frequency matching of the noise corrupted eigenvectors to those of the clean data segment's eigenvectors, in the fifth step. Only those eigenvectors' frequencies that match to those of the clean eigenvectors are retained from the pool of eigenvectors remaining from step four.
- iterative SSA is performed to further reduce MN A and match the dynamics of the clean data segments ' eigenvectors for the final step. For each iteration, the standard SSA algorithm is performed. Experience shows that convergence is achieved within 4 iterations.
- FIGs. 15A-15C show examples of the iterative SSA procedure applied to candidate eigenvectors that have resulted from step four of the procedure for the modified SSA algorithm. Note that there may be several eigenvectors remaining after the fifth step, hence, these examples show an iterative SSA procedure performed on a particular set of candidate eigenvectors that may match most closely to an eigenvector of a clean data segment.
- the row of panels in FIG. 15A represents one of the eigenvectors of the clean signal.
- the row of panels in FIG. 15B represents the MNA corrupted signal's candidate eigenvectors which have the same frequency as that of the clean signal's eigenvector.
- the row of panels in FIG, 15C represents the candidate eigenvectors after they have gone through four successive iterations of the SSA algorithm, For this portion of the SSA algorithm.
- SVD is performed on the trajectory matrix of Eq. (11) created from the candidate eigenvector and then reconstruct the eigenvectors based on SSA using only the first 3 largest eigenvalues obtained from the SVD. This process repeats iteratively until the shape of the reconstructed eigenvector closely resembles one of the clean eigenvectors with the same frequency. It can be seen from FIGs. 15A-15C that after 4 iterations the result shown in the panel of FIG.
- the discarding metric (DM) is calculated at each iteration and the value is compared to the DM value of the corresponding clean component.
- the DM is calculated according to;
- Step 1 First, compute SVD on both corrupted data segments and their most prior adj cent clean data segments
- Step 2 keep the top 5% of the clean and corrupted components.
- Step 3 Replace the corrupted eigenvalues with corresponding clean eigenvalues.
- Step 4 Among the clean and corrupted components, only choose those with frequency within the heart rate frequency range of 0.66 ⁇ F s ⁇ 3Hz.
- Step 5 Apply frequency matching to discard those corrupted components (from Step 4) with different frequencies compared to clean components' frequencies.
- Step 6 Remove corruption from each component obtained from Step 5 by applying the basic SSA algorithm iteratively.
- Step 7 Finally, reconstruct the corrupted PPG segment based on the components achieved from Step 6.
- FIG. 16 shows the results of these simulations with additive GWM.
- the left panels (FIGs. 16A1 to 16A7) show pre- and post-reconstruction HR in comparison to the reference HR; the right panels (FIGs.
- Tables VI and VII show the mean and standard deviation values of the pre- (2nd column) and post-reconstruction (4th column), and the reference (3rd column) HR and SpG2 values, respectively for all SNR.
- the last columns of Tables II and III also show the estimated HR and Sp02 values obtained by the ICA method.
- the reconstructed HR and Sp02 values using our IMAR approach are found to be not statistically different when compared to the reference values for all SNR except for -20 and - 25 dB.
- the ICA method fails and significantly different values are obtained to those of the reference HR and Sp02 values when the SNR is lower than -10 dB,
- FIG. 17 and below Tables VIII and ⁇ show corresponding results to that of FIG. 16 and Tables VI and VII, but with additive colored noise. Similar to the GW case, the reconstructed HR and Sp02 values using the disclosed IMAR approach are found to be not significantly different than the reference values for all SNR except for -20 and -25 dB, Moreover, the ICA compares poorly compared to our MAR as the HR and Sp02 values from the former method are found to be significantly different to the reference values for all SNR,
- Red and IR PPG signals with clearly separable DC and AC components are required.
- the pulsatile components of the Red and IR P PG signals are denoted as AC Rsa . and DC Red , respectively, and the "ratio-of-ratio" is estimated as
- Sp02 is computed by substituting the R value in an empirical linear approximate relation given by
- the performance of the signal reconstruclion of the disclosed IMAR approach is compared to ICA for the PPG data with an index finger moving left-to-right patterns.
- the pulse oximeter on the middle finger of the right hand which is stationary, is used as the reference signal. Since the subjects are directed to produce the motions for 30 seconds within each 1 -minute segment, corresponding to 50% corruption by duration, the window length of both clean and corrupted segments are both set as half length of the signal.
- Table ⁇ compares the HR reconstruction results between the IMAR and ICA methods for all 10 subjects. As shown in Table XII, the IMAR reconstructed HR values are not significantly different from the reference HR in 7 out. of 10 subjects. However, the ICA's reconstructed HR is significantly different from the reference HR in 8 out of 10 subjects indicating poor reconstruction fidelity.
- the disclosed algorithm again significantly outperforms ICA, All but one subject are not significantly different than the Sp02 reference values for ICA.
- the disclosed IMAR algorithm only 4 out of 9 subjects do not show significant difference from the reference values, Note the zero standard deviation reference Sp02 values from Massimo's pulse oximeter in 7 out of 9 subjects. This is because Massimo uses a proprietary averaging scheme based on several past values. Hence, it is possible that the significant difference seen with our algorithm in some of the subjects would turn out to be not significant if the averaging scheme are not used. While some of the Sp02 values from our algorithm are significantly different from the reference, the actual deviations are minimal and they are far less than with CA.
- a novel IMAR method is introduced to reconstruct MN A contaminated segments of PPG data. Detection of MNA. using a support vector machine algorithm is introduced in the companion paper.
- One aim of this disclosure is to reconstruct the MNA corrupted segments as closely as possible to the non-corrupted data so that accurate heart rates and Sp02 values can be derived.
- the question is how to reconstruct the MNA data segments when there is no reference signal.
- the most adjacent prior clean data segment and its dynamics are used to derive the MNA contaminated segment's heart rates and oxygen saturation values.
- the key assumption with, the disclosed IMAR technique is that signal's dynamics do not change abruptly between the MNA contaminated segment and its most adjacent prior dean portion of data. Clearly, if this assumption is violated, the IMAR's ability to reconstruct the dynamics of the signal may be compromised.
- a time-varying IMAR algorithm can address this issue.
- the disclosed approach is compared to an ICA method using simulated data, laboratory controlled data as well as daily activity data involving both, walking and stair climbing movements. Comparison of the performance of the disclosed method to ICA is based on reconstruction of HR and Sp02 values since these measures are currently used by clinicians.
- SSA singular spectrum analysis
- the disclosed IMAR algorithm can accurately reconstruct HR and Sp02 values from MNA contaminated data segments.
- the system of these teachings includes one or more processors and one or more computer usable media having computer readable code embodied therein, the computer readable code causing the one or more processors to execute the method of these teachings, shown in Fig. 19.
- one or more processors 1 10 are operatively connected to computer usable media 120 that has computer readable code embodied therein, which, whe executed by the one or more processors 1 10, causes the one or more processors to perform the method of these teachings
- An input device 130 is operatively connected to the one or more processors 110 and to the computer usable media 120 and enables the inputs of the PPG data segments.
- the one or more processors 1 10, the computer readable media 120 and the input device 130 are operatively connected by means of a computer connection component 125 (such as a computer bus).
- the system of these teachings includes a system for determining whether MNA are present in a segment of PPG data, having one or more processors and non-transitory computer usable media having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: determine a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set; use the training set to train a SVM, training resulting in a trained.
- the computer readable code further causes the one or more processors to band pass filter, before determining the plurality of time domain features, each segment from the plurality of test segments, The computer readable code further causes the one or more processors to determine whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments, neighboring segments being segments surrounding the segment within a predetermined time interval, and apply a majority vote al gorithm to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments.
- the time domain features comprise at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic and diastolic ratio within a segment, and mean standard deviation of pulse shape within an interval
- the system of these teachings includes a system for removal of MNA present in a segment of PPG data, having one or more processors and non- transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: (a) for each one segment from a segment of PPG data in which presence of motion and noise anifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus th predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having
- the predetermined length is less than one half of a number of samples in the segment for which the data transition matrix is assembled and is larger than a ratio of a sampling frequency to a lowest frequency in said segment being considered.
- the predetermined convergence criterion comprises a difference between discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment; the discarding metric being a sum of absolute values of signal components divided by a length metric for the signal components.
- the predetermined frequency range is a heart rate range of PPG data.
- the predetermined frequency range includes frequencies greater than 0.66 Hz and less than 3Hz.
- the top predetermined percentage is a top 5%, in this system, the presence of motion and noise artifacts has been previously detected using the system described above.
- Elements and components described herein may be further divided into additional components or joined together to form fewer components for performing the same functions.
- the following is a d sclosure by way of example of a device configured to execute functions (hereinafter referred to as computing device) which may be used with the presently disclosed subject matter.
- computing device configured to execute functions
- the description of the various components of a computing device is not intended to represent any particular architecture or manner of interconnecting the components. Other systems that have fewer or more components may also be used with the disclosed subject matter.
- a communication device may constitute a form of a computing device and may at least include a computing device,
- the computing device may include an inter-connect (e.g,, bus and system core logic), which can interconnect such components of a computing device to a data processing device, such as a processor(s) or microprocessor(s), or other form of partly or completely programmable or pre-programmed device, e.g., hard wired and or application specific integrated circuit (“ASIC") customized logic circuitry, such as a controller or microcontroller, a digital signal processor, or any other form of device that can fetch instructions, operate on pre-loaded/pre-programmed instructions, and/or followed instructions found in hard-wired or customized circuitry to carry out logic operations that, together, perform steps of and whole processes and functionalities as described in the present disclosure.
- ASIC application specific integrated circuit
- Each computer program may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language.
- the programming language may be a. compiled or interpreted programming language.
- Each computer program may be implemented in a computer program product tangibly embodied in a computer-readable storage device for execution by a computer processor. Method steps of the invention may be performed by a computer processor executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output.
- the application specific integrated circuit (“ASIC") logic may b such as gate arrays or standard cells, or the like, implementing customized logic by metalization(s) interconnects of the base gate array ASIC architecture or selecting and providing inetalization(s) interconnects between standard cell functional blocks included in a manufacturer's library of functional blocks, etc.
- ASIC application specific integrated circuit
- Embodiments can thus be implemented using hardwired circuitry without program software code/instructions, or in combination with circuitry using programmed software
- the techniques are limited neither to any specific combination of hardware circuitry and software, nor to any particular tangible source for the instructions executed by the data processors) within the computing device. While some embodiments can be implemented in fully functioning computers and computer systems, various embodiments are capable of being distributed as a computing device including, e.g., a variety of forms and capable of being applied regardless of the particular type of machine or tangible computer- readable media used to actually effect the performance of the functions and operations and/or the distribution of the performance of the functions, functionalities and/or operations.
- the interconnect may connect the data processing device to define logic circuitry including memory.
- the interconnect may be internal to the data processing device, such as coupling a microprocessor to on-board cache memory or external (to the microprocessor) memor such as main memory, or a disk drive or external to the computing device, such as a remote memory, a disc farm or other mass storage device, etc.
- microprocessors one or more of which could be a computing device or part of a computing device, include a PA-RISC series microprocessor from Hewlett-Packard Company, an 80x86 or Pentium series microprocessor from Intel Corporation, a PowerPC microprocessor from IBM, a Sparc microprocessor from Sun Microsystems, Inc., or a 68xxx series microprocessor from Motorola Corporation as examples.
- PA-RISC series microprocessor from Hewlett-Packard Company
- 80x86 or Pentium series microprocessor from Intel Corporation
- PowerPC microprocessor from IBM
- Sparc microprocessor from Sun Microsystems, Inc.
- 68xxx series microprocessor from Motorola Corporation as examples.
- the inter-connect in addition to interconnecting such as microprocessors) and memory may also interconnect such elements to a display controller and display device, and/or to other peripheral devices such as input output (I O) devices, e.g., through an input/output controllers).
- I O input output
- Typical I/O devices can include a mouse, a keyboard(s), a modem(s), a network interface(s), printers, scanners, video cameras and other devices which are well known in the art.
- the inter-connect may include one or more buses connected to one another through various bridges, controllers and/or adapters.
- the I/O controller includes a USB (Universal Serial Bus) adapter for controlling USB peripherals, and/or an IEEE- 1394 bus adapter for controlling IEEE- 1394 peripherals.
- USB Universal Serial Bus
- the memory may include any tangible computer-readable media, which may include but are not limited to recordable and non-recordable type media such as volatile and nonvolatile memory devices, such as volatile RAM (Random Access Memory), typically implemented as dynamic RAM (DRAM) which requires power continually in order to refresh or maintain the data in the memory, and non-volatile ROM (Read Only Memory), and other types of non- volatile memory, such as a hard drive, flash memory, detachable memory stick, etc.
- Non- volatile memory typically may include a magnetic hard drive, a magnetic optical drive, or an optical drive (e.g., a DVD RAM, a CD ROM. a DVD or a CD), or other type of memory system which maintains data even after power is removed from the system.
- the term “substantially” is utilized herein to represent the inherent degree of uncertainty that may be attributed to any quantitative comparison, value, measurement, or other representation.
- the term “substantially” is also utilized herein to represent the degree by which a quantitative representation may vary from a stated reference without resulting in a change in the basic function of the subject matter at issue.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Physiology (AREA)
- General Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Animal Behavior & Ethology (AREA)
- Surgery (AREA)
- Pathology (AREA)
- Biomedical Technology (AREA)
- Heart & Thoracic Surgery (AREA)
- Medical Informatics (AREA)
- Molecular Biology (AREA)
- Psychiatry (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Cardiology (AREA)
- Evolutionary Computation (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
Abstract
Systems and methods that can distinguish clean from corrupted PPG signals under various types of motions and reconstruct the MNA contaminated data segments, such that biological parameters, e.g., heart rates and SpO2 values, can be accurately estimated, are disclosed.
Description
APPARATUS AND METHOD FOR DETECTING AND REMOVING ARTIFACTS IN OPTICALLY ACQUIRED BIOLOGICAL SIGNALS
BACKGROUND
These teachings relate generally to an apparatus and a method for detecting and removing artifacts in optically acquired biological signals. More particularly, these teachings relate generally to an apparatus and a method for detecting and reconstructing motion and noise artifacts (MNA) in photoplethysmography (PPG) signals.
PPG is a non-invasive and low cost device to continuously monitor blood volume changes in peripheral tissues. PPi} is a useful technique since it is widely used to monitor heart rate (BR), arterial oxygen saturation (Sp02), and can also he used to measure respiratory rates. However, MNA can distort PPG recordings, causing erroneous estimation of HR and Sp02. 'There are three distinct sources of MNA artifacts that can distort PPG recordings: (1) environmental, physiological, and experimental artifacts, which cars be attributed to power interference surrounding the body; (2) correlated dynamics from other physiological signals; and (3) instrumental noise, respectively, MNA, which are comprised of all of the aforementioned noise sources, are difficult to filter since they do not have a prede termi ned frequency band and their spectrum often overlaps with that of the desired PPG signal
MNA in PPG readings are caused by 1) the movement of venous blood as well as other non-pulsatile components along with pulsatile arterial blood and 2) variations in the optical coupling between the sensor and the skin. Various approaches to mitigate motion artifacts by improving sensor attachment have been proposed. However, these design improvements do not provide a significant reduction of motion artifacts. Algorithm- based MNA reduction methods are also proposed. These include time and frequency domain filtering, power spectrum analysis, and blind source separation techniques. However, these have high computational complexity and more importantly, they operate even on clean PPG portions where MNA reduction is not needed. Hence, accurate MNA detection, which identifies clean PPG recordings from corrupted portions, is essential for the subsequent MNA reduction algorithm so that it does not distort the non-corrupted data segments. Moreover, more computationally efficient MNA algorithms can be designed since they can be tailored only to the MNA contaminated data segments.
MNA detection methods are mostly based on a signal quality index (SQI) which quantifies the severity of the artifacts, Some approaches quantify SQI using waveform morphology or filtered output, while others derive SQI with the help of additional hardware such as accelerometer and electrocardiogram sensing. Statistical measures, such as skewness, kurtosis. Shannon entropy, and Renyi's entropy, have been shown to be helpful in
determining a SQI. However, these techniques require manual threshold settings for each parameter to classify if the PPG signal is clean or corrupted. Although a support vector machine (SVM)-based classification method addresses the need of threshold setting, this approach considers limited and controlled types of motions.
On the other hand, arterial oxygen saturation reflects the relative amount of oxyhemoglobin in the blood. The most common method to measure it is based on pulse oximetry, whereby oxidized hemoglobin and reduced hemoglobin have significantly different optical spectra. Specifically, at a wavelength of about 660 nm, and a second wavelength between 805 and 960, there is a large difference in light absorbance between reduced and oxidized hemoglobin, A measurement of the percent oxygen saturati on of blood is defined as the ratio of oxyhemoglobin to the total concentration of hemoglobin present in the blood. Pulse oximetry assumes that the attenuation of light is due to both the blood and bloodless tissue. Fluctuations of the PPG signal are caused by changes in arterial blood volume associated with each heartbeat, where the magnitude of the fluctuations depends on the amount of blood rushing into the peripheral vascular bed, the optical absorption of the blood, skin, and tissue, and the wavelength used to illuminate the blood.
The pulse oximeter signal contains not only the blood oxygen saturation and heart rate data, but also other vital physiological information, The fluctuations of PPG signals contain the influences of arterial, venous, autonomic and respiratory systems on the peripheral circulation. In the current environment where health care costs are ever increasing, a single sensor that has multiple functions is very attractive from a financial perspective. Moreover, utilizing a pulse oximeter as a multi-purpose vital sign monitor has clinical appeal, since it is familiar to the clinician and comfortable for the patient. Knowledge of respiratory rate and heart rate patterns can provide more useful clinical information in many situations in which pulse oximeter is the sole monitor available.
Although there are many promising and attractive features of using pulse oximeters for vital sign monitoring, currently they are used on stationary patients. This is mainly because MNA result in unreliable heart rate and Sp02 estimation. Clinicians have cited motion artifacts in pulse oximetry as the most common cause of false alarms, loss of signal, and inaccurate readings.
In practice, MNA are difficult to remove because they do not have a predefined narrow frequency band and their spectrum often overlaps that of the desired signal.
Consequently, development of algorithms capable of reconstructing the corrapted signal and removing artifacts is challenging,
There are a number of general techniques used for artifact detection and removal. One of the methods used to remove motion artifacts is adaptive filtering. An adaptive filter is easy to implement and it also can be used in real-time applications, though the requirement of additional sensors to provide reference inputs is the major drawback of such methods.
There are many MNA reduction techniques based on the concept of blind source separation (BSS). BSS is attractive and has garnered significant interest since this approach does not require a reference signal The aim of the BSS is to estimate a set of uncorrupted signals from a set of mixed signals which is assumed to contain both the clean and MNA sources. Some of the popular BSS techniques are independent component analysis (ICA), canonical correlation analysis (CCA), principle component analysis (PCA), and singular spectrum analysis (SSA),
In ICA, the recorded signals are decomposed into their independent components or sources. CCA uses the second order statistics (SOS) to generate components derived from their uncorrelated nature. PCA is another nois reduction technique which aims to separate the clean signal dynamics from the MNA data. A multi-scale PCA has also heen proposed to account for time-varying dynamics of the signal and motion artifacts from PPG recordings, A promising approach that can be applied to signal reconstruction is the singular spectrum analysis (SSA). The SSA is a model-free BSS technique, which decomposes the data into a number of components which may include trends, oscillatory components, and noise (see, for example, B. S, Kim and S. K, Yoo, "Motion artifact reduction in
photoplethysmography using independent component analysis," Biomedical Engineering, IEEE Transactions on, vol. 53, pp. 566-568, 2006, which is incorporated herein by reference
in lis entirety for all purposes,) The main advantage of SSA over ICA is that SSA does not require user input to choose the appropriate components for reconstruction and MNA removal. Comparing PCA to SSA, SSA can be applied in cases where the number of signal components is more than the rank of the PCA covarianee matrix. Applications of the SSA include extraction of the amplitude and low frequency artifacts from single channel EEG recordings, and removing heart sound dynamics from respiratory signals.
Accordingly, there is a need to develop a new apparatus and a new method to distinguish clean from corrupted PPG signals under various types of motions. There is also a need to develop a new apparatus and a new method to remove MNA from corrupted PPG signals and to reconstruct PPG signals from the corrupted PPG signals,
BRIEF SUMMARY
In view of the foregoing, these teachings provide systems and methods that can distinguish clean from corrupted PPG signals under various types of motions and reconstruct the MNA contaminated data segments, such that biological parameters, e.g., heart rates and SpG2 values, can be accurately estimated.
in one embodiment, the system of these teachings includes one or more processors and one or more computer usable media having computer readable code embodied therein, the computer readable code causing the one or more processors to execute the method of these teachings.
In another embodiment, the method of these teachings includes a method for determining MNA are present, in a segment of PPG data by determining a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set, using the training set to train a SVM, training resulting in a trained SVM, determining the plurality of time domain features for the segment, and using the trained SVM to determine whether motion and noise artifacts are present in the segment,
In yet another embodiment, the method of these teachings includes a method for removal of MNA present in a. segment of PPG data, by the steps of: (a) for each one segment
from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al ) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and pins one; a stalling value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrupted segment and eigenvectors and eigenvalues for the clean segment; (b) sorting the eigenvalues for the corrupted segment from largest to smallest; and sorting the eigenvalues for the clean segment from largest to smallest; (c) retaining only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment; (d) replacing the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained; (e) retaining only eigenvectors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range; (f) discarding eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment; (g) obtaining the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the corrupted segment and the data transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment; (h) repeating steps (a2) to (g) until a predetermined convergence criterion is satisfied; and (i) reconstructing, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors,
in still another embodiment, the system of these teachings includes a system for determining whether MNA are present in a segment of PPG data, having one or more processors and non-transitory computer usable media having computer readable code embodied therein, the computer readable code, when executed by the one or more processors,
causes the one or more processors to: determine a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set; use the training set to train a SVM, training resulting in a trained SVM; determine the plurality of time domain features for the segment; and use the trained SVM to determine whether motion and noise artifacts are present in the segment.
In yet another embodiment, the system of these teachings includes a system for removal of MNA. present in a segment of PPG data, having one or more processors and non- transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: (a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrupted segment and eigenvectors and eigenvalues for the clean segment; (b) sorting the eigenvalues for the corrupted segment from largest to smallest; and sorting the eigenvalues for the clean segment from largest to smallest; (c) retaining only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment; (d) replacing the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained; (e) retaining only eigenvectors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range; (f) discarding
eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment; (g) obtaining the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the corrupted segment and the data transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment; (h) repeating steps (a2) to (g) until a predetermined convergence criterion is satisfied; and (i)
reconstructing, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors, BRIEF DESCRIPTION OF THE DRAWINGS
For a better understanding of the present teachings, together with other and further objects thereof, reference is made to the accompanying drawings and detailed description and its scope will be pointed out in the appended claims.
Figure 1. A representative clean forehead- PPG signal recorded during voluntary motion artifact conducted in a laboratory setting (1 t row). The mixed (up-down and left- right) movement of the forehead to which the PPG probe is attached for predetermined time interval induced 10% to 50% noise (2nd - 6th row) within a 60s PPG segment.
Figure 2. Training phase of the disclosed SVM-based motion detection algorithm. Four time-domain features corresponding to (1) standard deviation of peak-to-peak intervals (2) standard deviation of peak-to-peak amplitudes (3) standard deviation of systolic and diastolic interval ratio, and (4) mean standard deviation of pulse shape, are candidate input variables to the SVM.
Figure 3, Test phase of the disclosed SVM-based motion detection algorithm. The hidden layers correspond to kernel function of the SVM, The function between hidden layer and output layer is a linear operator.
Figure 4, Enhancement of MNA detection by diversity. Neighbor segments are the segments surrounding a target segment within ± 2 seconds, Decisions on the target segment are based on a majority vote from the decisions of neighbor segments as well as the one of the target segment (red).
Figure 5A~F. A sample forehead recorded PPG signal (a) along with the (b) standard deviation of P-P intervals (c) standard deviation of P-P amplitudes (d) standard deviation of
systolic artd diastolic time ratio, and (e) mean standard deviation of pulse shape, computed for each segment. The normalized sampled corrupt and clean PPGs for mean standard deviation of pulse shape is given in (f).
Figure 6A-B, Trained SVM classification with a sample training finger recorded PPG signal is given with (a)-(b) pairs of two parameters. The SVM decision and margin boundaries are marked by black and green lines, respectively.
Figure 7A-B, Validation: pairs of parameters for clean and corrupted PPG signals.
Figure 8. A representative PPG signal with detected peaks (red) (a) along with the (b) standard deviation of P~P intervals (c) standard deviation of P-P amplitudes (d) mean standard deviation of pulse shape and (e) standard deviation of systolic and diastolic time ratio, computed for each segment
Figure 9. Detection Probability of Corruption by additive white Gaussian noise (AWGN) for varying SN from -20 to 0 dB. 50 AWGN realizations for each SNR level are separately added to a non-MNA corrupted PPG. Each realization is tested by the disclosed M A detection algorithm to compute the detection probability of corruption
Figure lOA-C. Classification performance comparison between our SVM algorithm, Hjorlh (HI, H2), Kurtorsis and Shanon Entropy ( , SE) parameters, (a) Accuracy; (b) Sensitivity; (c) Specificity. The central mark on each bo corresponds to the median; the edges of the box correspond to the 25th and 75th percentiles, the whiskers extend to the most extreme data points not considered outliers, and outliers are plotted individually. (*) indicate the mean is significantly different (p<0.05 at 95% CI) between SVM and other methods used for comparison
Figure 11 A~B. Comparison of mean errors and detection error fraction between original signal (labeled "None") and artifact removed signal from five detection methods (SVM, HI , H2S K, and SE). (a) HR error; (b) Sp02 error
Figure 12A-C. Mean error comparison between our SVM algorithm, Hjorth (HI , H2), Kurtorsis and Shanon Entropy (K. SE) parameters, (a) heart rate; (b) Sp02; (c) detection error. The central mark on each box corresponds to the median; the edges of the box correspond to the 25th and 75th percentiles, the whiskers extend to the most extreme data points not considered outliers, and outliers are plotted individually. (*) indicate the mean is significantly different (p<0.05 at 95% CI) between SVM and other methods used for
comparison. The x-axis labeled "None" in all panels refers to the mean errors when compared to the reference signals without removing the MNA detected segments as identified by any of the five computational methods
Figure 13. Typical infrared PPG signal; (a) clean, (b) corrupted with motion artifacts, Figure 14A-B. The first 12 eigenvector components of the PPG signal for; (a) Clean infrared PPG. (b) Corrupted infrared PPG.
Figure I 5A-C. Iterative reconstruction of a corrupted eigenvector with frequency of 0.967 Hz. Black font signals (top panels) represent the clean component with frequency of 0.967 Hz; Blue font signals (2nd rows) indicate the corrupted component with the same frequency; Pink font signals are related to iterative evolution of corrupted component to a clean oscillatory signal, (a) Reconstruction of 4th corrupted eigenvector compared to the corresponding clean component. The final pattern after 4 iterations resembles the black font clean component in the top panel. This component is chosen among the components with the same frequency, since it shows the most similarity to the black font clean component, (b) Reconstruction of 9th corrupted eigenvector compared to the corresponding clean component, (c) Reconstruction of 22nd corrupted eigenvector compared to the corresponding clean component
Figure 16A1-B7. (Left) HR estimated from reconstructed PPG for different additive white noise levels; (Right) Sp02 estimated from reconstructed PPG for different levels of additive white noise
Figure 17A1 ~B7, (Left) HR estimated from reconstructed PPG for different additive colored noise levels; (Right) Sp02 estimated from reconstructed PPG for different levels of additive colored noise.
Figure 18A-D. (a) HR estimated from IMAR-reeonstructed PPG compared to reference and corrupted PPG; (b) HR estimated from ICA-reconstructed PPG compared to reference and corrupted PPG; (c) Sp02 estimated from IM AR-reconstmcted PPG compared to reference and corrupted PPG; (d) Sp02 estimated from ICA-reeonstrucled PPG compared to reference and corrupted PPG.
Figure 19 is a schematic block diagram representation of one embodiment of the system of these teachings.
DETAILED DESCRIPTION
The following detailed description presents the currently contemplated modes of carrying out the invention. The description is not to be taken in a limiting sense, but is made merely for the purpose of illustrating the general principles of the invention, since the scope of the invention is best defined by the appended claims.
As used herein, the singular forms "a." "an," and "the" include the plural reference unless the context clearly dictates otherwise.
Excep where otherwise indicated, all numbers expressing quantities of ingredients, reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term "about,"
MOTION AND NOISE ARTIFACTS DETECTION
In these teachings, an accurate and comprehensive MNA detection algorithm is provided, which detects MNA in PPG under various types of motion. First, time-domain parameters are introduced to quantify MNA in the recorded PPG signal. Then, the statistical measures of the time-domain parameters are considered as input var bles for a machine learning-based MNA detection algorithm. The MNA detection algorithm may be self-trained by the SVM with clean and corrupted PPG data sets, and then the trained SVM can be used to test the unknown PPG data. The efficacy of the MNA detection algorithm is tested on PPG data sets recorded from the finger and forehead pulse oximeters in simulations, laboratory- controlled and walking/stair-elimbing experiments, respectively.
EXPERIMENTAL PROTOCOL AND PREPROCESSING
In order to further elucidate the teachings presented hereinbelow, data for exemplary embodiments was collected, PPG signals can be obtained from custom reflectance-mode prototype pulse oximeters. PPG data with laboratory-controlled head and finger movement, daily-activity movement, or simulated movement are collected respectively from healthy subjects recruited from the student community of Worcester Polytechnic Institute (WPI). This study is approved by WPFs I B and all subjects are given informed consent prior to data recording.
In laboratory-controlled head movement data, motion artifacts are induced by head movements for specific time intervals in both horizontal and vertical directions. In one example, eleven healthy volunteers are asked to wear a forehead reflectance pulse oximeter
along with a reference Masimo Radical (Masimo SET®) fmger type transmiitance pulse oximeter. After baseline recording for 5 minutes without any movement, subjects are instructed to introduce motion artifacts for specific time intervals varying from 10 to 50% within a 1 minute segment. For example, if a subject is instructed to perform left-right movements for 6 seconds, a 1 minute segment of data would contain 10% noise. The right middle fmger with the sensor attached to the Masimo pulse oximeter is kept stationary. HR and Sp02 signals are acquired by the Masimo pulse oximeter at 80Hz and 1 Hz, respectively, and are acquired synchronously with the PPG signals recorded from the forehead sensor.
In laboratory-controlled fmger movement data, motion artifacts are induced by left- right movements of the index finger, In one example, nine healthy volunteers are asked to sit and wear two reflection type PPG pulse oximeters (TSD200) on their index and middle fingers, respectively, After baseline recording for 5 minutes without any movement to acquire clean data, motion artifacts are induced by left-right movements of the index finger while the middle finger is kept stationary as a reference. Similar to the head movement data, motion is induced at specific time intervals corresponding to 10-50% duration in a 1 minute segment. Such controlled movement is repeated five times per subject. The pulse oximeters are connected to a biopotential amplifier (PPG100) having a gain of 100 and cut-off frequencies of 0.05-10 Hz, The MPIOOO (BIOPAC Systems Inc., CA, USA) is used to acquire fmger PPG signals at 100 Hz. The daily-activity movement. PPG data are recorded while subjects are walking straight or climbing stairs for 45 min. The nine subjects are asked to walk or climb stairs after wearing a forehead reflectance pulse oximeter along with a Holter
electrocardiogram (ECG) monitor (Rozinn RZ153+) at 180Hz and a Masimo Rad-57 pulse oximeter at 0.5Hz, The reference ECG is obtained from the Holier ECG monitor while HR and Sp02 readings are measured from the Masimo pulse oximeter connected to the subject's righ index finger, which is held against the chest to minimize motion artifacts. Finally, the simulati on movement PPG data are generated by the addition of white noise to the clean P PG data.
PPG data are preprocessed by a 6th order infinite impulse response (ilR.) band pass filter with cut-off frequencies of 0,5 Hz and 12Hz. Zero-phase forward and reverse filtering is applied to account for the non-linear phase of the OR filter. After these preprocessing, the following parameters for classifying clean and corruption are derived.
In one embodiment, the method of these teachings includes a method for determining whether MNA are present in segment of PPG data by determining a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments withou t motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set, using the training set to train a SVM, training resulting in a trained SVM, determining the plurality of time domain features for the segment, and using the trained SVM to determine whether motion and noise artifacts are present in the segment. The method also includes band pass before determining the plurality of time domain features, each segment from the plurality of test segments. The method still further includes determining whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments, neighboring segments being segments surrounding the segment within a predetermined time Interval . Final ly, the method includes applying a majority vote algoritlim to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments, The time domain features include at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic, and diastolic ratio within a segment, and mean standard devi ation of pulse shape within an interval.
PARAMETERS FROM PPG SIGNALS
The following four parameters are selected since they represent the variability present in corrupted PPG signals as shown in FIG, 1.
1) Standard deviation of peak-to-peak interval ( STDm ):
where ¾ , is peak-to-peak interval at the i'k pulse of the na segment and l¾ is mean peak-to-peak interval of the ? ' segment. The £>„ , is calculated by the difference
between two successive peak times.
2) Standard deviation of peak-to-peak amplitude ( STDMR ): The 8TDMF>„ of the «* segment is defined by:
where A I is peak amplitude at the i* pulse of the «* segment and A is mean peak- to-peak interval of the n& segment. The An J is defined by the difference between the itk peak and the forthcoming ( +!)* trough amplitudes.
3) Standard deviation of systolic and diastolic ratio ( STDS0 ): The STDSOTLT of the «*· segment is defined by:
^¾. =^∑( -¾ (3)
where Rsa n is systolic and diastolic time interval ratio at the i* pulse of the A segment and RSP lf is the mean systolic and diastolic time interval ratio of the w"1 segment. The Rm n , is calculated by
¾». «,! ~ ( trough, n-!,.' ~ "^peak ) / ( -^ww, ι.-,ί "" ^irough, .·:-!, ί ) i^)
where Γ^^, denotes the trough (or lowest point) at the j81 pulse of the segment. 4) Mean-standard deviation of pulse shape ( STL .A ): To derive pulse shape, we take
N!srap sample points of a pulse. The 5 2 .AV,„ of the segment is derived by taking average of the standard deviation at each sample point as follows:
where 5 DWAY , ¾ is calculated by: SrowAv,B>* (6)
where ¾ is the w* puise sample at the /* pulse of the n* segment and is the mean at the m* pulse sample of the segment.
CLASSIFICATION BY SUPPORT VECTOR MACHINE (SVM)
SVM can be applied to build a decision boimdaiy classifying motion corruption from clean PPG signals, SVM is widely used in classification and regression due to its accuracy and robustness to noise (see, for example, C.-W. Hsu, C.-C. Chang, and C.-J. Lin, "A
Practical Guide to Support Vector Classification," Department of Computer Science, National Taiwan University 2003, a copy of which is incorporated by reference here in its entirety and for all purposes), The SVM includes training and test phases described further below.
1) Training phase; A flow chart of the training phase in the SV -based MN A detection algorithm is shown in FIG. 2. The SVM takes the parameter values of clean and corrupted PPG segments as a training data set, finds the support vectors among the training data set which maximize the margin (or the distance) between different classes, and finally builds a decision boundary. If the estimated decision is different from its known label, the decision is regarded as a training error. A soft-margin SVM is considered, which can set the boundary even when the data sets are mixed and cannot be separated. In the soft-margin SVM algorithm, slack variables are introduced to minimize the training error with maximizing the margin. Soft-margin SVM uses the following equation to find the support vectors.
Minimize CY* ?L +—(w. , wi ) ,
Subject to r„((w„y„) + 6. )≥1 = S„ for sv = l,2,...,_V
=1,25...,N, and SSV≥Q (7) where C is regulation parameter, Λ' is the number of vectors, δ,ν is the slack variable, ws is weight vector and < y > is the inner product operation. The Tsv is the sv,h target variable, yiV is the svih input vector data, and hs is the bias. The SVM decision boundary Fsv is derived as
^ = «y) ÷ i» = 0 (8)
where * and 6* are weight factor and bias, respectively, obtained from Eq. (7) and y is the input point.
By transforming the ysv and y term to yiy→®(y.iV.) and y→ (y) , the non-linear SVM can be transformed to a linear SVM, For nonlinear SVM, Eq. (7) is modified as
To facilitate the operation in nonlinear SVM, a kernel function Ks (·,-) , which is a dot- product in the transformed feature space as follows, is used,
κ. (γ„>γ„· ) = {<ι>(γ„)>φ(ν«; )) 10} where sv' - l, 2,,..,N .
2) Test phase: FIG, 3 shows a flow chart of the test phase in the SVM-based MNA. detection algorithm. The PPG data can be partitioned into many 7-second segments.
Parameters can be deri ved from each PPG portion to examine if it is corrupted by motion artifact or not.
ENHANCEMENT OF MNA DETECTION BY MAJOR VOTES
To enhance MNA detection performance, the disclosed algorithm incorporates multiple decisions OK a set of neighbor segments in deciding whether a "target" segment is clean or corrupted. Neighbor segment is defined as a segment surrounding a target segment within iTneighbor seconds. Decision on a neighbor segment is highly likely to be the same as the decision on a target segment since PPG pulses in tfie neighbor segments are most likely to exhibit similar dynamics to the target segment.
The algorithm gathers the decisions of neighbor segments as well as target segment (see, for example. FIG. 4) and makes a final decision on the target segment based on a majority vote concept (see, for example, Wim H. Hesselink, The Boyer-Moore Majority Vote
Algorithm, 7th November 2005, which is incorporated by reference herein in its entirety and for all purposes),
RESULTS- I order to further elucidate these teachings, results of exemplary embodiments are presented hereinbelow.
The performance of the MNA detection algorithm can be evaluated for various types (simulated, laboratory controlled, and daily activities) of motion-corrupted PPGs so as to validate the performance in a wide range of scenarios. For all types of motions, the PPG recordings are divided into 7-second segments since this is determined to be the optimal size among the data length tested from 3-1 1 seconds (see below PERFORMANCE
COMPARISON). Results of the disclosed algorithm are compared with four recently published MNA detection algorithms based on kurtosis (K), Shannon entropy (SE), Hjorth 1 (HI), and Hjorth 2 (FI2) metrics, respectively. As performance metrics, classification accuracy, sensitivity, and specificity are considered, in addition, mean HR and Sp02 errors are also investigated as well as detection error ratio.
REFERENCE: CLEAN VS. CORRUPTED
The following are criteria which are adopted to reference PPG segments (clean or corrupted) for each experiment, A visual reference is excluded to avoid subjective decisions by visual inspectors; for subtle MNA, there are large disagreements among visual inspectors. Instead, objective decisions arc performed based on controlled corruption start (TcorriStart) a d end (Tcorr,end) time points, ECG-derived heart rate (HRECG), PPG-derived heart rate
(HRPPG), and Sp02 (SpG2PPG) from PPG signals,
Laboratory controlled data (forehead and finger):
-- If more than 85% of a segment is outside of [Tcorr.start, cWr,end]5 the segment is considered clean. Otherwise, the segment, is referenced to be corrupted.
- If Sp02(PPG) deviates by 10 % from the mean of Sp02(PPG) in a segment, then the segment is referenced to be corrupted.
- Successive difference, |diff(HRppG(i+l)- HRp?o(i))i, from PPG signals is larger than 20 bpm for at least one pulse during a segment, then the segment is referenced to be corrupted.
Daily activity data (Walking and stair-climbing):
- Successive difference, |diff(HRECG(i-H )- HRECGC )!, from ECG signals is larger than 20 hpm for at least one pulse during a segment, then die segment is excluded.
- if Sp02(PPG) deviates by 10 % from the mean of Sp02(PPG) in a segment, then the segment is referenced to be corrupted.
- if jdiff(HRppG +l)~ HRppo(i))| is larger than 20 bpm for at least one pulse during a segment, then the segment is referenced to be corrupted.
- If |HRECG - H ppoj < 5 bpm during more than 85 % of a segment, the segment is considered clean. Otherwise, the segment is referenced to he corrupted.
Table 1 below describes the number of clean and corrupted PPG segments for each motion type used in the experiment as determined by the criteria defined above.
TABLE L
Numbers of Subjects and Numbers of Clean and Corrupted Segment's per Each Motion Artifact
# of # of # of
Type Subtype
Subjects Clean ! < orru ted
Simulation Simulation N/A N/A j N/A
Laboratory Finger 13 195 105
Controlled Forehead 1 1 190 1 10
Daily- Walking
9 125 1 75
Activity Stair-
I climbing
CLASSIFICATION ACCURACY
A sample forehead PPG signal and lis corresponding parameters calculated segment- by~segmexit are given in FIG, 5 A and FIGs, 5B through 5E, respectively. The normalized sampled corrupt and clean PPGs for mean standard deviation of pulse shape is given in FIG, 5F. The sample signal is corrupted from t=56 to 1=85 seconds. Corrupted PPG segments between 56-85 seconds have larger parameter values compared to clean segments between 1- 56 seconds and 85-1 12 seconds,
FIGs, 6 A and 6B show (STD^STD^) and (STDm)STDWAV ) of clean (circle) and corrupted (star) forehead signals, respectively, with corresponding SV boundaries (black line). To lower computational complexity, a linear kernel is considered for the SVM in the experiment. Regularizaiion parameter value ( C) of the linear kernel SVM is optimized in terms of minimizing the training error rate, A 1 1-fold cross-validation and grid search ( C = {i(r3 ,i cr\ \ U 0! J G2 , i 03 } ) is adopted, which is widely used to determine C .
FIG, 7 shows classification results by the SVM boundaries obtained from FIG. 6. FIG. 8 shows a representative PPG signal with detected peaks (red) along with the corresponding statistical parameter values. Note the corrupted PPG signal interval between 21 to 31 seconds. The discrepancy between corrupted and clean portions is reflected by parameters STD^ , STD^vp , STDsa and STDWA . The parameter values from the corrupted PPG segments exhibit larger variability and consequently have higher standard deviation value compared to those from clean data segments. The STDm , >5 DAMP and STDWAV have large values between 21-35 seconds (see FIGs, 8B-8D), while STDS0 has large value only between 21-28 seconds (see FIG. §E). Using SVM with these parameter values, the disclosed algorithm correctly discriminated MNA corrupted segment between 21-35 seconds (see FIG. 8F), Table II below presents C for finger, forehead, and walking/siair-elimbing data. The disclosed algorithm is tested to different segment lengths varying from 3 to 11 seconds and calculated their mean classification accuracies, which are provided In below Table III. Among the different data segment lengths tested, the 7~second segment provided the highest classification accuracies for all data; finger, forehead and walking/stair-climbing PPG signals. Accuracy, specificity,
and sensitivity for each dataset are presented in Table IV. On average, the SVM performance using the 7-second segment showed a 93.9% accuracy, 92.4% specificity, and 94.3% sensitivity.
TABLE II.
c obtained by 9 fold cross-validation and gdd search method
i vr Subtype
Simulation Simulation 100
Laboratory [ Finger 1000
Controlled [ Forehead 1
Walking/
Dai!y- Stair- 0.01
Activiiy
climbing
TABLE HL
TABLE IV.
segment
To evaluate the sensitivity of our MNA detection algorithm to noise, Gaussian white noise (OWN) of varying signal~to~noise (SNR) levels is added to a representaiive non~MNA corrupted PPG signal. For each SNR, 50 independent clean PPG signal. As shown in FIG. 9, the PPG signals with a SNR below -10 dB are detected as corrupted data with our algorithm. For a SNR of -20 dB. every segment is detected as corrupted.
PERFORMANCE COMPARISON OF MNA DETECTION ALGORITHMS
The disclosed algorithm is compared with other artifact detection methods based on HI, H2 , K and SE since these methods have been shown to provide good detection accuracies. The HI and H2 parameters represent the central frequency and half of bandwidth, respectively, and are defined as follows;
H ... ¾ S arid H = (½L½
vo(«) * y vz(n) voiii)
where v,-(«) = Γ v?$ (e3 '")dv . Here, Sv (eJ y) is the power spectrum of signal 3 c 0 -
For a fair comparison, all detection methods used 7 second data segments. FIGs. 1GA- I 0C compare the medians and 25th and 75th perceniiies of detection accuracy, sensitivity, and specificity for all five detection methods for the finger, head and walking/stair-cllmbing data sets. In general, the disclosed SVM method consistently yields higher performance with a mean accuracy of 94%, sensitivity of 97%, and a specificity of 92%; whereas other methods show fluctuations depending on which datasets are used, in the finger recorded data, HI yields a slightly higher accuracy than ail other methods due to higher specificity, but the detection sensitivity is lower.
HR AND Sp02 ESTIMATION
FIG. 1 1 A shows a comparison of the mean HR error and detection error fraction from five MNA detection methods for walking/stair-climbing data. The HR errors are defined by the difference between the estimated HR derived from the PPG and the reference HR readings. Low error values reflect an effective artifact detection algorithm, The disclosed algorithm yields the lowest HR error and detection error fraction as compared with other MNA methods. FIG. 1 IB shows a comparison of mean Sp02 error and detection error fraction from five MNA detection methods. The SE based detection method shows a lower mean Sp02 error than the disclosed algorithm, but its detection error fraction is very high (>70%), indicating that the error is computed based on only 30% of clean data. On the other hand, the disclosed SVM algorithm resulted in a mean Sp02 error of 2.7 with a detection error of only 6.3%. FIG, 12 shows a comparison of live MNA detection methods in terms of paired-t test results of HR and Sp02 estimation and detection accuracy. On average, the SVM
algorithm outperformed the K, SE, HI and H2 methods with HR errors of 2,3 bpm, Sp02 errors of 2.7% and detection error fraction of 6,3%,
DISCUSSION
Robust real-time MNA detection algorithms for raw PPG signals have been elusive to date. The disclosed MNA detection algorithm has been designed based on four parameters: (a) standard deviation of peak-to-peak intervals (b) standard deviation of peak-to-peak amplitudes (e) standard deviation of systolic and diastolic time ratios, and (d) mean-standard deviation of pulse shapes. The disclosed MNA algorithm is compared to other well- established MNA detection methods, using the 7-second data segment as this length has been determined to provide the optimal classification accuracy.
The results demonstrate tha the disclosed SVM-based MNA detection algorithm has offered higher classification accuracy as well as lower HR and Sp02 errors compared to the conventional detection methods. The paired-t test is performed to determine whether there is a significant difference between classification errors obtained from the disclosed SVM approach compared with other known methods, For the .finger recorded PPG segments, FIG, 10A indicates that the mean classification accuracy is significantly different (p<0,05 at 95% CI) between the disclosed SVM method and other methods, except for HI, On the other hand, all other methods are significantly different from the disclosed SVM method for forehead and wafking/stair-climbing PPG data, FIGs. 11 A and 1 IB summarizes paired~t test results for HR and Sp02 estimations as well as detection accuracy. As shown in FIGs. 12A-12C, SVM is significantly different from HL H2, K, and SE in terms of HR estimation and detection accuracy (see FIGs, Γ2Α and 12C), while Sp02 derived from the S VM method is
significantly different from only HI (see FIG. 12B).
The disclosed MNA detection algorithm coded with Matlab (2012a) takes only 7 ms on an Intel Xeon 3.6 GHz computer for the 7-second data segment. Hence, the disclosed algorithm is real-time realizable especially when It is coded in either C or CA+. The disclosed computational MNA detection algorithm has provided high HR and Sp02 estimation accuracy as well as classification accuracy. Moreover, the disclosed algorithm shows significantly better performance than some well-cited methods with good detection accuracy, Another key advantage of the disclosed algorithm is that it is able to detail with a near pinpoint accuracy when MNA starts and ends. The other four methods fare poorly when
compared to the disclosed algorithm in detecting the start and end time of the MNA, The potential for the method disclosed in this work to have practical applications is high, and the integration of the algorithm described with a pulse oximeter device may have significant implications for real-time clinical applications and especially for ambulatory monitoring of vital signs.
PART II - MOTION AND NOISE ARTIFACTS REMOVAL
In these teachings, a PPG signal can be reconstructed from those portions of data that have been identified to be comipted using the algorithm detailed hereinabove. The fidelity of the reconstructed signal is determined by comparing the estimated Sp02 and heart rate (HR) to reference values, In addition, the reconstructed Sp02 and HR values ohtained via the ICA are compared to those obtained by the method disclosed herein. The ICA results are chosen as the point of comparison, because ICA has recently been shown to provide accurate reconstruction of corrupted PPG signals,
EXPERIMENTAL PROTOCOL AND PREPROCESSING
In order to further elucidate the teachings presented herembelow, data for exemplary embodiments was collected. Three sets of data are collected from healthy subjects recruited from the student community of Worcester Polytechnic institute (WPI). This study is approved by WPFs institutional review board and all the subjects give informed consent before data recording.
In the first experiment, eleven healthy volunteers are asked to wear a forehead reflectance pulse oximeter developed in the lab along with a reference Masimo Radical (Masimo SET®) finger transmittance pulse oximeter. PPG signals from the forehead sensor and reference (HR) derived from a finger pulse oximeter are acquired simultaneously. The HR and Sp02 signals are acquired at 80 Hz and 1 Hz, respectively. After baseline recording for 5 minutes without any movement (i.e. clean data), motion artifacts are induced in the PPG data by the spontaneous movements in both horizontal and vertical directions of the subject's head while the right middle finger is kept stationary. Subjects are directed to Introduce the motions for specific time intervals that determined the percentage of noise within each 1 minute segment, varying from 10 to 50%, For example, if a subject is instructed to make left- right movements for 6 seconds, a 1 minute segment of data would contain 10% noise.
The second dataset includes finger-PPG signals from the same 9 healthy volunteers in an upright sitting posture using an infrared reflection type PPG transducer (TSD20Q). An MP 1000 pulse oximeter (commercially available from BIOPAC Systems inc., CA, USA) is also used to acquire finger PPG signals at 100 Hz. One pulse oximeter of each model is placed on the same hand's index finger (one model) and middle finger (the other model) simultaneously. After baseline recording for 5 minutes without any movement (i.e. clean data), motion artifacts are induced in the PPG data by the left-right movements of the inde finger while the middle finger is kept stationary to provide a reference. Similar to the first dataset, motion is induced at specific time intervals corresponding to 10 to 50% corruption duration in 1 minute segments, i.e. the controlled movement is carried out five times per subject.
The third dataset includes data measurements from 9 subjects with the PPG signal recorded from the subjects' forehead using a custom sensor simultaneously with the reference EGG, HR and Sp02 from a Holier Monitor at 180 Hz and Masimo (Rad-57) pulse oximeter at 0.5 Hz respectively. The reference pulse oximeter provided HR and Sp02 measured from the subject's right index linger, which is held steadily to their chest. The signals are recorded while the subjects are going through sets of walking and climbing up and down flights of stairs for approximately 45 min.
Once data are acquired, PPG signals from all three experiments outlined above are preprocessed offline using, for example, Matlab (MathWorks, R2012a). The PPG signals are filtered using a zero-phase forward-reverse 4th order IIR band-pass filter with cutoff frequency 0,5-12Hz.
MOTION ARTIFACT REMOVAL
To reconstruct the artifact-corrupted portion of the PPG signal that has been detected using the support vector machine approach provided herein, a hybrid procedure is developed, using Iterative Singular Spectrum Analysis (I5SA) and a frequency matching algorithm. Henceforth, the combined procedures is referenced as the iterative motion artifact removal (IMAR) algorithm.
A method of these teachings includes a method for removal of motion and noise artifacts (MN A) present in a segment of PPG data, by the steps of: (a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been
previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which modem and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrupted segment and eigenvectors and eigenvalues for the clean segment; (b) sorting the eigenvalues for the corrupted segment from largest to smallest; and sorting the eigenvalues for the clean segment from largest to smallest; (c) retaining only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment; (d) replacing the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained; (e) retaining only eigenvectors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range; (f) discarding eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment; (g) obtaining the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the corrupted segment and the data transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment; (h) repeating steps (a.2) to (g) until a predetermined convergence criterion is satisfied; and (i) reconstructing, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors. The predetermined length is less than one half of a number of samples in the segment for which the dat transition matrix is assembled and is larger than a ratio of a sampling frequency to a lowest frequency in said segment being considered. The
predetermined convergence criterion is a difference between a discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment, the discarding metric
being a sum of absolute values of signal components divided by a length metric for the signal components, The predetermined frequency range is a heart rate range of PPG data. The predetermined frequency range includes frequencies greater than 0,66 Hz and less than 31 lz. The top predetermined percentage is a top 5%. in this method, the presence of motion and noise artifacts had been previously detected rising the method previously described.
SINGULAR SPECTRUM ANALYSIS (SSA)
The SSA is composed of two stages: A) singular decomposition and B) spectral reconstruction. The former is the spectral decomposition or eigen-decornposition of the data matrix whereas the latter is the reconstruction of the signal, based on using only the significant eigenvectors and associated eigenvalues. The assumption is that given a relatively high signal- to-noise ratio of data, significant eigenvectors and associated eigenvalues represent the signal dynamics and less significant values represent the MNA components.
The calculation of the singular stage of the SSA includes two steps: i) embedding followed by ii) singular value decomposition (SVD). in essence, these procedures decompose the data into signal dynamics including trends, oscillatory components, and MNA. The spectral stage of the SSA algorithm also includes two steps: i) grouping and ii) diagonal averaging. These two procedures are used to reconstruct the signal dynamics but without the MNA components. In the following section, we detail all four steps in the SSA algorithm,
SINGULAR DECOMPOSITION - EMBEDDING
Assume there is a nonzero real-value time series of length N samples, i.e.,
x =- {x x 2 ,...,x N ) , In the embedding step, window length f j] < L < N/2 is chosen to embed the initial time series, where fs is the sampling frequency and , is the lowest frequency in the signal. The time series X is mapped into the L lagged vectors, x Η*/ »* ,·^ x i +L -i ) for
/ ~ ί,,.,,κ , where κ = N -i +l . The result is the trajectory data matrix rx or vector x i that is each row of rr for = ί,.,.,κ .
From Eq. 1 L it is evident that the trajectory matrix, i is a Hankel matrix.
SINGULAR DECOMPOSITION - SINGULAR VALUE DECOMPOSITION
The next step is to apply the SVD to the trajectory matrix rx which results in eigenvalues and eigenvectors of the matrix TXTX T where r, for i = 3, .,., /, can he defined as T = USVT . u, for ] < i <L is a K xL orthonoraiai matrix. _¾ for 3 < < L is a diagonal matrix and v I for ! < / < .£, is an square orthonormai matrix, which is considered the principle component. In this step, τχ has L many singular values which are ^ > '¾ >,->%[ - Thus, the i'h eigentriple of τ, can be written as U.-
, in which
d
> 0) is the number of nonzero singular values of rx . Normally, every harmonic component with a different frequency produces two eigeniriples with similar singular values. So the trajectory matrix τχ can be denoted as
Tx =T} +T2 ...+Td
ux ? + ...+ud4¼vf (12)
Projecting the time series onto the direction of each eigenvector yields the
corresponding temporal principal component (PC),
S ECTRAL RECONSTRUCTION
The reconstruction stage has two steps: i) grouping and ii) diagonal averaging. First, the subgroups of the decomposed trajectory matrices are grouped and then a diagonal averaging step is needed so that a new time series can be formed.
SPECTRAL RECONSTRUCTION - GROUPING
The grouping step of the reconstruction stage decomposes the L x K matrix Ί) in to subgroups according to the trend, oscillatory components, and MNA dynamics. The grouping step divides the set of indices {1,2,...,d } into a collection of m disjoint subsets of / = {ix,... m } .
Thus, Tf corresponds to the group /■■■■■■ {i{,...,/w } . T{ is a sum of Tj , where ./ «≡/,· . So Tx can be expanded as
Sip Grouping
= η ÷..'.+?>; = ?>, +..'.+?} (i 3)
SPECTRAL RECONSTRUCTION - DIAGONAL AVERAGING
In the final step of analysis, each resultant matrix , ¾, in Eq. (13) is transformed into a time series of length N , We obtain the new Hankel matrices by averaging the diagonal
elements of the matrix Ts , Let H be denoted as the Hankel operator. So that we obtain the Hankel matrix X l ' = HTj, for i ~ \, ,.., m , Under the assumption of weak separability and applying the Hankel procedure to all matrix components of Eq. (13), we obtain the following expansion
= (1 +· 2> +..,+ ί'*> (14)
We can assert that X (V) is related to the trend of the signal; however, harmonic and noisy components do not necessarily follow the order of ^ > « ¾ > - > y¾T ·
ITERATIVE MOTION ARTIFACT REMOVAL BASED ON SSA
In order to reconstruct the MNA corrupted segment of the signal, an iterative motion artifact removal approach based on SSA is explained in the last section. The ultimate goodness of the reconstructed signal is determined by the accuracy of the estimated Sp02 and HR values. The top and bottom panels of FIG, 13 show clean and MNA corrupted signals, respectively,
FIGs. 14A and 14B show the first 12 eigen vectors of the clean and MNA corrupted data as shown in FIG. 13, respectively. The most important part of the SSA is to choose the proper eigenvector components for reconstruction of the signal. Under the assumption of high SNR, the normal practice is to select only the largest eigenvalues and associated eigenvectors for signal, reconstruction. However, most often it is difficult to determine the demarcation of the significant from non-significant eigenvalues. Further, the MNA dynamics can overlap with the signal dynamics, hence, choosing the largest eigenvalues does not necessarily result in an MNA-free signal.
To overcome the above limitations, the SSA approach is modified. The first step of the modified SSA involves computing singular value decomposition on both a corrupted data segment and its most prior adjacent clean data segment. Under the assumption of a high SNR of the data, the second step is to retain only the top 5% of the eigenvalues and their associated eigenvectors. The third step is to replace the corrupted segment's top 5% eigenvalues with the clean segment's eigenvalues. The fourth step is to further limit the number of eigenvectors by choosing only those eigenvectors that have heart rates between for both the clean and noise corrupted data segments. The two extreme heart rates are chosen so that they account for possible scenarios that one may encounter with low and high heart rates. With the remaining
candidate eigenvectors resulting from step four, non-significant eigenvectors are further pruned by performing frequency matching of the noise corrupted eigenvectors to those of the clean data segment's eigenvectors, in the fifth step. Only those eigenvectors' frequencies that match to those of the clean eigenvectors are retained from the pool of eigenvectors remaining from step four. For the remaining eigenvector candidates, iterative SSA is performed to further reduce MN A and match the dynamics of the clean data segments' eigenvectors for the final step. For each iteration, the standard SSA algorithm is performed. Experience shows that convergence is achieved within 4 iterations.
FIGs. 15A-15C show examples of the iterative SSA procedure applied to candidate eigenvectors that have resulted from step four of the procedure for the modified SSA algorithm. Note that there may be several eigenvectors remaining after the fifth step, hence, these examples show an iterative SSA procedure performed on a particular set of candidate eigenvectors that may match most closely to an eigenvector of a clean data segment. The row of panels in FIG. 15A represents one of the eigenvectors of the clean signal. The row of panels in FIG. 15B represents the MNA corrupted signal's candidate eigenvectors which have the same frequency as that of the clean signal's eigenvector. The row of panels in FIG, 15C represents the candidate eigenvectors after they have gone through four successive iterations of the SSA algorithm, For this portion of the SSA algorithm. SVD is performed on the trajectory matrix of Eq. (11) created from the candidate eigenvector and then reconstruct the eigenvectors based on SSA using only the first 3 largest eigenvalues obtained from the SVD. This process repeats iteratively until the shape of the reconstructed eigenvector closely resembles one of the clean eigenvectors with the same frequency. It can be seen from FIGs. 15A-15C that after 4 iterations the result shown in the panel of FIG. 15A corresponds most closely to the clea signal's eigenvector, hence, this eigenvector is selected rather than the eigenvectors shown in panels in FIGs. I5B and 15C, The discarding metric (DM) is calculated at each iteration and the value is compared to the DM value of the corresponding clean component. The DM is calculated according to;
DM = ^yr i , (15)
where u is the signal component, and | .| , L are absolute operator and component length, respectively. The entire procedure for the modified SSA algorithm is summarized in TABLE V. TABLE V
iterative Motion Artifact Removal (1MAR) Procedure
Assumption -Heart rate and Sp(¾ do not change abruptly and are stationary within the short data segment.
Application - Offline Motion Artifact Removal
Objective - Reconstruction of corrupted PPG segment for the purpose of estimating heart rates and S Qi-
^ Routine
Step 1. First, compute SVD on both corrupted data segments and their most prior adj cent clean data segments
Step 2. Next, keep the top 5% of the clean and corrupted components.
based on the eigenvalues being sorted from largest to smallest.
Step 3. Replace the corrupted eigenvalues with corresponding clean eigenvalues.
Step 4. Among the clean and corrupted components, only choose those with frequency within the heart rate frequency range of 0.66 <Fs<3Hz.
Step 5. Apply frequency matching to discard those corrupted components (from Step 4) with different frequencies compared to clean components' frequencies.
Step 6. Remove corruption from each component obtained from Step 5 by applying the basic SSA algorithm iteratively.
6, a. Calculate the discarding metric for components achieved from SSA iterations and their counterpart clean components from Eq, 15.
6. b. Select, those processed components with the closest. DM and frequency value to the corresponding clean component's DM and frequency value.
Step 7. Finally, reconstruct the corrupted PPG segment based on the components achieved from Step 6.
RESULTS - NOISE SENSITIVITY ANALYSIS
To validate the disclosed IMAR procedure, different SNR levels of Gaussian white noise (GWN) and colored noise are added to an experimentally collected clean segment of PPG signal. One purpose of the simulation is to quantitatively determine the level of noise that can be tolerated by the algorithm. Seven different. SNR levels ranging from 10 dB to -25 dB are considered. For each SNR level, 50 independent realizations of GWN and colored noise are added separately to a clean PPG signal. The Euler-Maruyama method is used to generate colored noise.
FIG. 16 shows the results of these simulations with additive GWM. The left panels (FIGs. 16A1 to 16A7) show pre- and post-reconstruction HR in comparison to the reference HR; the right panels (FIGs. ί 6B1 to 16B7) show the corresponding comparison for Sp02. Below Tables VI and VII show the mean and standard deviation values of the pre- (2nd column) and post-reconstruction (4th column), and the reference (3rd column) HR and SpG2 values, respectively for all SNR. The last columns of Tables II and III also show the estimated HR and Sp02 values obtained by the ICA method. As shown in FIG 16 and Tables VI and VII, the reconstructed HR and Sp02 values using our IMAR approach are found to be not statistically different when compared to the reference values for all SNR except for -20 and - 25 dB. However, the ICA method fails and significantly different values are obtained to those of the reference HR and Sp02 values when the SNR is lower than -10 dB,
TABLE VI
Comparison & Statistical Analysis of HR Estimations from IMAR-reconstructed PPG for Different Levels of
Additive White Noise. * represents p<0.05.
TABLE VII
Comparison & Statistical Analysis of Estimations from IMAR-reconstructed PPG for Different Levels of
FIG. 17 and below Tables VIII and ΪΧ show corresponding results to that of FIG. 16 and Tables VI and VII, but with additive colored noise. Similar to the GW case, the reconstructed HR and Sp02 values using the disclosed IMAR approach are found to be not significantly different than the reference values for all SNR except for -20 and -25 dB, Moreover, the ICA compares poorly compared to our MAR as the HR and Sp02 values from the former method are found to be significantly different to the reference values for all SNR,
TABLE VIII
Comparison &nStatistieai Analysis of HR Estimations from IMA -reconstructed PPG for Different Levels of
Additive Colored Noise. * represents p<0.05.
TABLE IX
Comparison & Statistical Analysis of Sp02 Estimations from IMAR-reconstructed PPG for Different Levels of Additive Colored Noise. * represents p<0.05.
Head Finger IMAR ICA
SNR Sp02 Sp02 Reconstructed Reconstracted
(dB) (mean (Reference) Sp02 Sp02
± ...std) (Tneari. std) (mean, std) (ηκηαη > std}
94.14,
10 94.23,0.80 94.85,0,41 90.95,0.18*
0.99
94.71 *
0 94,23*0.80 94.85,0.53 86.84,0.24*
1.20
96.19,
-5 94.23,0.80 93.92±0,83 82,86= 0.34*
1 .41
99.27 ;
-10 94,23 , 0.80 94.88,0.96 78.89,0.18*
1.46
103.00
-15 94.23,0.80 94.42, 1.71 : 74.87,0.25*
,0.88
107.63
-20 94.23 ,0.80 74.74.7.92" 70.89,0.17" t0.26
105.91
-25 94.23,0,80 70.75, 15.08* 66.89.0.26*
,0.49
RESULTS - HEART RATE AND ESTIMATION FROM FOREHEAD SENSOR As described above, PPG data are collected under three different experimental settings so that the disclosed approach could be more thoroughly tested and validated. For all three experimental settings, the efficacy of the disclosed IMAR approach for the reconstruction of the MNA-affected portion of the signal is compared with the reference HR and Sp02 values for ail experimental daiasets.
For the error-free Sp02 estimation. Red and IR PPG signals with clearly separable DC and AC components are required. The pulsatile components of the Red and IR P PG signals are denoted as ACRsa. and DCRed , respectively, and the "ratio-of-ratio" is estimated as
j¾ - ^C e:j ! DCReil (16)
AC1K/DCm
Accordingly, Sp02 is computed by substituting the R value in an empirical linear approximate relation given by
Sp02 (%) = (1 10 - 25/?)(%) (1 )
After applying the disclosed IMAR procedure to the identified MNA segment of the PPG signal, the Sp02 (using Eqs, 16-17) and HR are estimated and compared to the corresponding reference and MNA contaminated segment values. As is the case with the noise sensitivity analysis section, the performance of the IMAR algorithm is compared to the ICA method. The top panel (FIGs. ISA and 18B) and bottom panel (FIGs. 18C and 18D) of FIG. 18 represent a representative HR and Sp02 comparison result, respectively. These figures show that the estimated values for both HR. (left panels) and Sp02 (right panels) from the IMAR (black font) track closely to the reference values recorded by the Masimo transmittaiice type fmger pulse oximeter (red square line), while the estimated HR and Sp02 obtained from the ICA method (green font) deviate significantly from the reference signal. Below Tables X and XI show comparison of the IMAR and the ICA reconstructed HR and Sp02 values, respectively, for all 10 subjects. As shown in Table X, there is no significant difference between the finger reference HR and the IMAR reconstructed HR in 6 out of 10 subjects. However, there is significant difference between the finger reference HR and the ICA reconstructed HR in all 10 subjects. Similarly, the reconstructed Sp02 values from the IMAR are found to be not significantly different than the fmger reference values in 6 out of 10 subjects, but the ICA method is found to be significantly different for all 10 subjects.
TABLE X
Comparison & Statistical Analysis of HR Estimations from IMAR-reconstructed PPG for 10 Different
Subjects (Head Experiment), * represents p<0.05.
TABLE XI
Comparison & Statistical Analysis of Sp02 Estimations from iMAR-reconstrueled PPG for id Different
Subjects (Head Experiment). * represents p<0,05.
RESULTS - PPG SIGNAL RECONSTRUCTION PERFORMANCE IN FINGER EXPERIMENT
The performance of the signal reconstruclion of the disclosed IMAR approach is compared to ICA for the PPG data with an index finger moving left-to-right patterns. The
pulse oximeter on the middle finger of the right hand, which is stationary, is used as the reference signal. Since the subjects are directed to produce the motions for 30 seconds within each 1 -minute segment, corresponding to 50% corruption by duration, the window length of both clean and corrupted segments are both set as half length of the signal. Table ΧΪΙ compares the HR reconstruction results between the IMAR and ICA methods for all 10 subjects. As shown in Table XII, the IMAR reconstructed HR values are not significantly different from the reference HR in 7 out. of 10 subjects. However, the ICA's reconstructed HR is significantly different from the reference HR in 8 out of 10 subjects indicating poor reconstruction fidelity.
TABLE XK
Comparison & Statistical Analysis of HR Estimations from lMAR-reconstntcted PPG for 10 Different
Subjects (Finger Experiment). * represents p<Q.Q5,
RESULTS - PPG SIGNAL RECONSTRUCTION PERFORMANCE FOR THE WALKING AND STAIR CLIMBING EXPERIMENTAL DATA
The signal reconstruction of the MNA identified data segments of the walking and stair climbing experiments using our disclosed IMAR and its comparison to ICA are provided in this section. Detection of the MNA data segments is performed using the algorithm described in Part I of the this disclosure. The reconstructed HR and Sp02 values using our disclosed algorithm and ICA are provided in below Tables ΧΙΠ and XIV, respectively. For both HR and Sp02 reconstruction, the measurements are earned out using PPCJ data recorded from the head pulse oximeter. The right hand index finger's PPG data is used as HR and Sp02 references. As shown in Table XIII, 7 out of 9 subjects' reconstructed HR values are found to be not significantly different from the reference HR values using our algorithm. While 2 subjects' reconstructed HR values are found to be significantly different than the reference, the differences in the actual HR values are minimal. For ICA's reconstructed HR values, all values deviate significantly from the reference values.
TABLE XIII
Comparisot! & Statistical Analysis of HR Estimations from IMAR-rsconstructed PPG for 9 Different Subjects
(Walking & Stair Climbing Experiment). * represents p<0.05.
For the reconstructed Sp02 values, the disclosed algorithm again significantly outperforms ICA, All but one subject are not significantly different than the Sp02 reference values for ICA. For the disclosed IMAR algorithm, only 4 out of 9 subjects do not show significant difference from the reference values, Note the zero standard deviation reference Sp02 values from Massimo's pulse oximeter in 7 out of 9 subjects. This is because Massimo uses a proprietary averaging scheme based on several past values. Hence, it is possible that the significant difference seen with our algorithm in some of the subjects would turn out to be not significant if the averaging scheme are not used. While some of the Sp02 values from our algorithm are significantly different from the reference, the actual deviations are minimal and they are far less than with CA.
DISCUSSION
In this disclosure, a novel IMAR method is introduced to reconstruct MN A contaminated segments of PPG data. Detection of MNA. using a support vector machine algorithm is introduced in the companion paper. One aim of this disclosure is to reconstruct the MNA corrupted segments as closely as possible to the non-corrupted data so that accurate heart rates and Sp02 values can be derived. The question is how to reconstruct the MNA data segments when there is no reference signal. To address this question, the most adjacent prior clean data segment and its dynamics are used to derive the MNA contaminated segment's heart rates and oxygen saturation values. Hence, the key assumption with, the disclosed IMAR technique is that signal's dynamics do not change abruptly between the MNA contaminated segment and its most adjacent prior dean portion of data. Clearly, if this assumption is violated, the IMAR's ability to reconstruct the dynamics of the signal may be compromised. A time-varying IMAR algorithm can address this issue.
There are hosts of algorithms available for MN A e!imi nation and signal
reconstruction. Various adaptive filter approaches to remove MNA have been proposed with good results but the test data to fully evaluate the algorithms are either limited or confined to laboratory controlled MNA involving simple finger or arm movements. Moreover, these adaptive filter methods work best when a reference signal is available.
For those methods that do not. require a reference signal to remove MNA, there have been many algorithms developed based on variants of the ICA. Most of the IC A -based methods produced reasonably good signal reconstructions of the MNA contaminated data. However, most of these methods are validated on data that are collected using laboratory controlled MNA involving pre-defined simple side-to-side or up-and-down finger and arm movements.
Given that ICA-based methods produced good signal reconstructions of the MNA contaminated data, the disclosed approach is compared to an ICA method using simulated data, laboratory controlled data as well as daily activity data involving both, walking and stair climbing movements. Comparison of the performance of the disclosed method to ICA is based on reconstruction of HR and Sp02 values since these measures are currently used by clinicians.
Comparing HR and Sp02 estimations of the reconstructed signal to the reference measurements using both simulation and experimental data have shown that the proposed
IMAR method is a promising tool as the reconstructed values are found to be accurate. The simulation results from noise sensitivity analysis showed that SNR. level down to -20dB and - 15dB from additive white and colored noise, respectively, can be tolerated well by the application of the proposed IMAR procedure, compared to the SNR values of -1 OdB and - 15dB for the ICA method . Application of the proposed IMAR approach and the ICA to three different sets of experimental data have also shown significantly better signal reconstruction performance with our IMAR algorithm.
The use of singular spectrum analysis (SSA) to a single channel EEG recordings to extract high amplitude and low frequency MNA has been performed. The main aim of this work is to remove the artifacts in EEG signals, hence, an iterative approach to reconstruct the main dynamics of the signal is not implemented. The disclosed approach is based on the use of SSA combined with an iterative approach to reconstruct the portion of the MNA
contaminated data with the most likely true dynamics (i.e., non-MNA contaminated data) of the pulse oximeter signal. This disclosure applies SSA-based algorithms for MNA
reconstmction of pulse oximeter data, in conclusion, a scenario where a reference signal is not available to remove the MNA, the disclosed IMAR algorithm can accurately reconstruct HR and Sp02 values from MNA contaminated data segments.
In one embodiment, the system of these teachings includes one or more processors and one or more computer usable media having computer readable code embodied therein, the computer readable code causing the one or more processors to execute the method of these teachings, shown in Fig. 19. Referring to Fig. 19, in the embodiment shown there in, one or more processors 1 10 are operatively connected to computer usable media 120 that has computer readable code embodied therein, which, whe executed by the one or more processors 1 10, causes the one or more processors to perform the method of these teachings, An input device 130 is operatively connected to the one or more processors 110 and to the computer usable media 120 and enables the inputs of the PPG data segments. The one or more processors 1 10, the computer readable media 120 and the input device 130 are operatively connected by means of a computer connection component 125 (such as a computer bus).
In still another embodiment, the system of these teachings includes a system for determining whether MNA are present in a segment of PPG data, having one or more processors and non-transitory computer usable media having computer readable code
embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: determine a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts, the plurality of time domain features for said each segment from the plurality of test segments constituting a training set; use the training set to train a SVM, training resulting in a trained. SVM; determine the plurality of time domain features for the segment; and use the trained SVM to determine whether motion and noise artifacts are present in the segment. The computer readable code further causes the one or more processors to band pass filter, before determining the plurality of time domain features, each segment from the plurality of test segments, The computer readable code further causes the one or more processors to determine whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments, neighboring segments being segments surrounding the segment within a predetermined time interval, and apply a majority vote al gorithm to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments. The time domain features comprise at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic and diastolic ratio within a segment, and mean standard deviation of pulse shape within an interval
In yet another embodiment, the system of these teachings includes a system for removal of MNA present in a segment of PPG data, having one or more processors and non- transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to: (a) for each one segment from a segment of PPG data in which presence of motion and noise anifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following: (al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus th predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the
data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors; (a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrupted segment and eigenvectors and eigenvalues for the clean segment; (h) sorting the eigenvalues for the corrupted segment from largest to smallest; and sorting the eigenvalues for the clean segment from largest to smallest; (c) retaining only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment; (d) replacing the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained; (e) retaining only eigenvectors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range; (f) discarding eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment; (g) obtaining the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the corrupted segment and the data transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment; (h) repeating steps (a2) to (g) until a predetermined convergence criterion is satisfied; and (i)
reconstructing, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors. The predetermined length is less than one half of a number of samples in the segment for which the data transition matrix is assembled and is larger than a ratio of a sampling frequency to a lowest frequency in said segment being considered. The predetermined convergence criterion comprises a difference between discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment; the discarding metric being a sum of absolute values of signal components divided by a length metric for the signal components. The predetermined frequency range is a heart rate range of PPG data. The predetermined frequency range includes frequencies greater than 0.66 Hz and less than 3Hz. The top predetermined percentage is a top 5%, in this system, the presence of motion and noise artifacts has been previously detected using the system described above.
Elements and components described herein may be further divided into additional components or joined together to form fewer components for performing the same functions.
The following is a d sclosure by way of example of a device configured to execute functions (hereinafter referred to as computing device) which may be used with the presently disclosed subject matter. The description of the various components of a computing device is not intended to represent any particular architecture or manner of interconnecting the components. Other systems that have fewer or more components may also be used with the disclosed subject matter. A communication device may constitute a form of a computing device and may at least include a computing device, The computing device may include an inter-connect (e.g,, bus and system core logic), which can interconnect such components of a computing device to a data processing device, such as a processor(s) or microprocessor(s), or other form of partly or completely programmable or pre-programmed device, e.g., hard wired and or application specific integrated circuit ("ASIC") customized logic circuitry, such as a controller or microcontroller, a digital signal processor, or any other form of device that can fetch instructions, operate on pre-loaded/pre-programmed instructions, and/or followed instructions found in hard-wired or customized circuitry to carry out logic operations that, together, perform steps of and whole processes and functionalities as described in the present disclosure.
Each computer program may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language. The programming language may be a. compiled or interpreted programming language.
Each computer program may be implemented in a computer program product tangibly embodied in a computer-readable storage device for execution by a computer processor. Method steps of the invention may be performed by a computer processor executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output.
in this description, various functions, functionalities and/or operations may be described as being performed by or caused by software program code to simplify description. However, those skilled in the art will recognize what is meant by such expressions is that the functions result from execution of the program code/instructions by a computing device as described above, e.g., including a processor, such as a microprocessor, microcontroller, logic circuit or the like. Alternatively, or in combination, the functions and operations can be
impiemenied using special purpose circuitry, with or without software instructions, such as using Application- Specific Integrated Circuit (ASIC) or Field-Programmable Gate Array (FPGA), which may be programmable, partly programmable or hard wired. The application specific integrated circuit ("ASIC") logic may b such as gate arrays or standard cells, or the like, implementing customized logic by metalization(s) interconnects of the base gate array ASIC architecture or selecting and providing inetalization(s) interconnects between standard cell functional blocks included in a manufacturer's library of functional blocks, etc.
Embodiments can thus be implemented using hardwired circuitry without program software code/instructions, or in combination with circuitry using programmed software
code/instructions.
Thus, the techniques are limited neither to any specific combination of hardware circuitry and software, nor to any particular tangible source for the instructions executed by the data processors) within the computing device. While some embodiments can be implemented in fully functioning computers and computer systems, various embodiments are capable of being distributed as a computing device including, e.g., a variety of forms and capable of being applied regardless of the particular type of machine or tangible computer- readable media used to actually effect the performance of the functions and operations and/or the distribution of the performance of the functions, functionalities and/or operations.
The interconnect may connect the data processing device to define logic circuitry including memory. The interconnect may be internal to the data processing device, such as coupling a microprocessor to on-board cache memory or external (to the microprocessor) memor such as main memory, or a disk drive or external to the computing device, such as a remote memory, a disc farm or other mass storage device, etc. Commercially available microprocessors, one or more of which could be a computing device or part of a computing device, include a PA-RISC series microprocessor from Hewlett-Packard Company, an 80x86 or Pentium series microprocessor from Intel Corporation, a PowerPC microprocessor from IBM, a Sparc microprocessor from Sun Microsystems, Inc., or a 68xxx series microprocessor from Motorola Corporation as examples.
The inter-connect in addition to interconnecting such as microprocessors) and memory may also interconnect such elements to a display controller and display device, and/or to other peripheral devices such as input output (I O) devices, e.g., through an
input/output controllers). Typical I/O devices can include a mouse, a keyboard(s), a modem(s), a network interface(s), printers, scanners, video cameras and other devices which are well known in the art. The inter-connect may include one or more buses connected to one another through various bridges, controllers and/or adapters. In one embodiment the I/O controller includes a USB (Universal Serial Bus) adapter for controlling USB peripherals, and/or an IEEE- 1394 bus adapter for controlling IEEE- 1394 peripherals.
The memory may include any tangible computer-readable media, which may include but are not limited to recordable and non-recordable type media such as volatile and nonvolatile memory devices, such as volatile RAM (Random Access Memory), typically implemented as dynamic RAM (DRAM) which requires power continually in order to refresh or maintain the data in the memory, and non-volatile ROM (Read Only Memory), and other types of non- volatile memory, such as a hard drive, flash memory, detachable memory stick, etc. Non- volatile memory typically may include a magnetic hard drive, a magnetic optical drive, or an optical drive (e.g., a DVD RAM, a CD ROM. a DVD or a CD), or other type of memory system which maintains data even after power is removed from the system.
For the purposes of describing and defining the present teachings, it is noted that the term "substantially" is utilized herein to represent the inherent degree of uncertainty that may be attributed to any quantitative comparison, value, measurement, or other representation. The term "substantially" is also utilized herein to represent the degree by which a quantitative representation may vary from a stated reference without resulting in a change in the basic function of the subject matter at issue.
Although these teachings have been described with respect to various embodiments, it should be realized these teachings are also capable of a wide variety of further and other embodiments within the spirit and scope of the appended claims.
Claims
1. A method for determining whether motion and noise artifacts (MNA) are present in a segment of photoplethysmography (PPG) data, the method comprising:
determining a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segments including segments without motion and noise artifacts and other segments with motion and noise artifacts; the plurality of time domain features for said each segment from the plurality of test segments constituting a training set;
using the training set to train a support vector machine (SVM), training resulting in a trained SVM;
determining the plurality of time domain features for the segment; and
using the trained SVM to determine whether motion and noise artifacts are present in the segment.
2. The method of claim 1 further comprising:
band pass filtering, before determining the plurality of time domain features, each segment from the plurality of test segments.
3. The method of claim 1 further comprising:
determining whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments; neighboring segments being segments surrounding the segment within a predetermined time interval; and
applying a majority vote algorithm to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments.
4. The method of claim 1 wherein the time domain features comprise at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic and diastolic ratio within a segment, and mean standard deviation of pulse shape within an interval.
5. A method for removal of motion and noise artifacts (MNA) present in a segment of photoplethysmography (PPG) data, the method comprising:
(a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment. and a most pr or adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, performing the following:
(al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the predetermined length and a number of rows equal to the number of vectors;
(a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrapted segment and eigenvectors and eigenvalues for the clears segment;
(b) sorting the eigenvalues for the corrupted segment from largest to smallest; and sorting the eigenvalues for the clean segment, from largest to smallest;
(c) retaining only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment;
(d) replacing the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained;
(e) retaining only eigenvectors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range;
(ί) discarding eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment;
(g) obtaining the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the corrupted segment and the da ta transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment;
(h) repeating steps (a2) to (g) until a predetermined convergence criterion is satisfied; and
(i) reconstructing, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors.
6. The method of claim 5 wherein the predetermined length is less than one half of a number of samples in the segment for which the data transition matrix is assembled and is larger than a ratio of a sampling frequency to a lowest frequency in said segment being considered,
7. The method of claim 5 wherein the predetermined convergence criterion comprises a difference between a discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment; the discarding metric being a sum of absolute values of signal components divided by a length metric for the signal components.
8. The method of claim 5 wherein the predetermined frequency range is a heart rate range of PPG data.
9. The method of claim 8 wherein the predetermined frequency range includes frequencies greater than 0.66 Hz and less than 3Hz.
10. The method of claim 5 wherein the top predetermined percentage is a top 5%,
1 1. The method of claim 5 wherein the presence of motion and noise artifacts had been previously detected using the method of claim 1.
12. A system for determining whether motion and noise artifacts (MNA) are present In a segment of photoplethysmography (PPG) data, the system comprising:
one or more processors; arid
non-transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to:
determine a plurality of time domain features for each segment from a plurality of test segments of the PPG data, the plurality of test segmen ts including segments without motion and noise artifacts and other segments with motion and noise artifacts; the plurality of time domain features for said each segment from the plurality of test segments constituting a training set;
use the training set to train a support vector machine (SVM), training resulting in a trained SVM;
determine the plurality of time domain features for the segment; and use the trained SVM to determine whether motion and noise artifacts are present in the segment.
13. The system of claim 12 wherein the computer readable code further causes the one or more processors to:
band pass filter, before determining the plurality of time domain features, each segment from the plurality of test segments.
14. The system of claim 12 wherein the computer readable code further causes the one or more processors to:
determine whether motion and noise artifacts are present in segments neighboring the segment, referred to as neighboring segments; neighboring segments being segments surrounding the segment within a predetermined time interval; and
apply a majority vote algorithm to determinations of whether motion and noise artifacts are present in the segment and the neighboring segments,
15, The system of claim 12 wherein the time domain features comprise at least one of standard deviation of peak to peak interval within a segment, standard deviation of peak to peak amplitude within a segment, standard deviation of systolic and diastolic ratio within a segment, and mean standard deviation of pulse shape within an interval.
16. A system for removal of motion and noise artifacts (MNA) present in a segment of photoplethysmography (PPG) data, the system comprising;
one or more processors; and
non-transitory computer usable media, having computer readable code embodied therein, the computer readable code, when executed by the one or more processors, causes the one or more processors to:
(a) for each one segment from a segment of PPG data in which presence of motion and noise artifacts has been previously detected, referred to as a corrupted segment, and a most prior adjacent segment of PPG data in which motion and noise artifacts are not detected, referred to as a clean segment, perform the following:
(al) assemble a data transition matrix, each row of the data transition matrix being a vector of a predetermined length, a number of vectors being equal to a number of samples in a segment for which the data transition matrix is assembled minus the predetermined length and plus one; a starting value of each vector being displaced by one sample from a previous vector, resulting in the data transition matrix having a number of columns equal to the
predetermined length and a number of rows equal to the number of vectors:
(a2) obtain eigenvectors and eigenvalues for the data transition matrix, resulting in eigenvectors and eigenvalues for the corrupted segment and eigenvectors and eigenvalues for the clean segment;
(b) sort the eigenvalues for the corrupted segment from largest to smallest; and sort the eigenvalues for the clean segment from largest to smallest;
(c) retain only a top predetermined percentage of the eigenvalues for the corrupted segment and the eigenvalues for the clean segment;
(d) replace the eigenvalues for the corrupted segment with the eigenvalues for the clean segment, where only the top predetermined percentage of the eigenvalues and corresponding eigenvectors have been retained;
(e) retain only eigenveetors for the corrupted segment and eigenvectors for the clean segment that have data in a predetermined frequency range;
(f) discard eigenvectors for the corrupted segment that have different frequencies from the eigenvectors for the clean segment:
(g) obtain the data transition matrix for the corrupted segment from the eigenvalues and eigenvectors of the comipted segment and the data transition matrix for the clean segment from the eigenvalues and eigenvectors of the clean segment;
(h) repeat steps (a2) to (g) until a predetermined convergence criterion is satisfied; and
(1) reconstruct, after the predetermined convergence criterion is satisfied, the corrupted segment from the data transition matrix for the corrupted segment using replaced eigenvalues and retained eigenvectors.
17. The system of claim 16 wherein the predetermined length is less than one half of a number of samples in the segment for which the data transition matrix is assembled and is larger than a ratio of a sampling frequency to a lowest frequency in said segment being considered.
18. The system of claim 16 wherein the predetermined convergence criterion comprises a difference between a discarding metric for the corrupted segment reconstructed from the data transition matrix using replaced eigenvalues and retained eigenvectors and a discarding metric for the clean segment; the discarding metric being a sum of absolute values of signal components divided by a length metric for the signal components.
19. The system of claim 16 wherein the predetermined frequency range is a heart rate range of PPG data.
20. The system of claim 19 wherein the predetermined frequency range includes frequencies greater than 0.66 Hz and less than 3Hz,
21. The system of claim 16 wherein the top predetermined percentage is a top 5%, 22, The system of claim 16 wherein the presence of motion and noise artifacts had been previously detected using the system of claim 12,
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/121,277 US20160367198A1 (en) | 2014-02-26 | 2015-02-26 | Apparatus and method for detecting and removing artifacts in optically acquired biological signals |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461944726P | 2014-02-26 | 2014-02-26 | |
| US61/944,726 | 2014-02-26 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2015130929A2 true WO2015130929A2 (en) | 2015-09-03 |
| WO2015130929A3 WO2015130929A3 (en) | 2015-10-15 |
Family
ID=54009781
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2015/017746 Ceased WO2015130929A2 (en) | 2014-02-26 | 2015-02-26 | Apparatus and method for detecting and removing artifacts in optically acquired biological signals |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US20160367198A1 (en) |
| WO (1) | WO2015130929A2 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017140663A1 (en) * | 2016-02-15 | 2017-08-24 | Koninklijke Philips N.V. | Device and method for extracting heart rate information |
| EP3219254A1 (en) * | 2016-03-14 | 2017-09-20 | Tata Consultancy Services Limited | Method and system for removing corruption in photoplethysmogram signals for monitoring cardiac health of patients |
| EP3501381A1 (en) * | 2017-12-22 | 2019-06-26 | Stichting IMEC Nederland | A method and a system for time domain signal reconstruction for representing heart activity |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10561321B2 (en) * | 2013-12-12 | 2020-02-18 | Alivecor, Inc. | Continuous monitoring of a user's health with a mobile device |
| EP3148404B1 (en) * | 2014-05-28 | 2024-11-13 | Koninklijke Philips N.V. | Motion artifact reduction using multi-channel ppg signals |
| GB201608170D0 (en) * | 2016-05-10 | 2016-06-22 | Isis Innovation | A method of determining the frequency of a periodic physiological process of a subject, and a device and system for determining the frequency |
| KR102014597B1 (en) * | 2017-08-23 | 2019-08-26 | 원광대학교산학협력단 | Wearable multichannel photo plethysmography measuring device using singular value decomposition and method for removing noise from a signal using the same |
| EP3684463B1 (en) | 2017-09-19 | 2025-05-14 | Neuroenhancement Lab, LLC | Method and apparatus for neuroenhancement |
| EP3479763B1 (en) * | 2017-11-06 | 2023-03-01 | Tata Consultancy Services Limited | System and method for photoplethysmogram (ppg) signal quality assessment |
| US11717686B2 (en) | 2017-12-04 | 2023-08-08 | Neuroenhancement Lab, LLC | Method and apparatus for neuroenhancement to facilitate learning and performance |
| US11478603B2 (en) | 2017-12-31 | 2022-10-25 | Neuroenhancement Lab, LLC | Method and apparatus for neuroenhancement to enhance emotional response |
| US12280219B2 (en) | 2017-12-31 | 2025-04-22 | NeuroLight, Inc. | Method and apparatus for neuroenhancement to enhance emotional response |
| US11364361B2 (en) | 2018-04-20 | 2022-06-21 | Neuroenhancement Lab, LLC | System and method for inducing sleep by transplanting mental states |
| EP3849410A4 (en) | 2018-09-14 | 2022-11-02 | Neuroenhancement Lab, LLC | SLEEP ENHANCEMENT SYSTEM AND METHOD |
| TW202021528A (en) * | 2018-12-05 | 2020-06-16 | 宏碁股份有限公司 | Method for obtaining cardiac arrhythmia information and device for detecting cardiac arrhythmia based on photoplethysmogram signal |
| CN109657646B (en) * | 2019-01-07 | 2023-04-07 | 哈尔滨工业大学(深圳) | Method and device for representing and extracting features of physiological time series and storage medium |
| US11188617B2 (en) * | 2019-01-10 | 2021-11-30 | Nokia Technologies Oy | Method and network node for internet-of-things (IoT) feature selection for storage and computation |
| US11786694B2 (en) | 2019-05-24 | 2023-10-17 | NeuroLight, Inc. | Device, method, and app for facilitating sleep |
| CN110313902B (en) * | 2019-07-10 | 2021-03-12 | 四川大学 | Blood volume change pulse signal processing method and related device |
| US20220313098A1 (en) * | 2019-09-06 | 2022-10-06 | Valencell, Inc. | Wearable biometric waveform analysis systems and methods |
| EP3884863B1 (en) * | 2020-03-24 | 2022-07-27 | Tata Consultancy Services Limited | Method and system for tremor assessment using photoplethysmography (ppg) |
| CN114820832B (en) * | 2021-01-21 | 2025-06-20 | 西门子(深圳)磁共振有限公司 | Medical imaging method for detecting motion and magnetic resonance imaging system |
| EP4140392A1 (en) | 2021-08-23 | 2023-03-01 | Nokia Technologies Oy | Noise removal in physiological signals |
| WO2023121167A1 (en) * | 2021-12-23 | 2023-06-29 | 주식회사 씨젠 | Method for predicting performance of detection device |
| CN115005775B (en) * | 2022-05-26 | 2025-03-21 | 丹阳慧创医疗设备有限公司 | Artifact correction method, device and storage medium for near infrared signal data |
| US12026220B2 (en) * | 2022-07-08 | 2024-07-02 | Predict Hq Limited | Iterative singular spectrum analysis |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100462182B1 (en) * | 2002-04-15 | 2004-12-16 | 삼성전자주식회사 | Apparatus and method for detecting heart beat using ppg |
| KR20100065084A (en) * | 2008-12-05 | 2010-06-15 | 한국전자통신연구원 | Apparatus for measuring motion noise robust pulse wave and method thereof |
| KR101033472B1 (en) * | 2009-01-13 | 2011-05-12 | 강재민 | Form and Method of Sensor Module for Optical Pulse Wave Measurement for Dynamic Noise Reduction |
-
2015
- 2015-02-26 WO PCT/US2015/017746 patent/WO2015130929A2/en not_active Ceased
- 2015-02-26 US US15/121,277 patent/US20160367198A1/en not_active Abandoned
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017140663A1 (en) * | 2016-02-15 | 2017-08-24 | Koninklijke Philips N.V. | Device and method for extracting heart rate information |
| CN108697331A (en) * | 2016-02-15 | 2018-10-23 | 皇家飞利浦有限公司 | Device and method for extracting heart rate information |
| JP2019508123A (en) * | 2016-02-15 | 2019-03-28 | コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. | Device and method for extracting heart rate information |
| EP3219254A1 (en) * | 2016-03-14 | 2017-09-20 | Tata Consultancy Services Limited | Method and system for removing corruption in photoplethysmogram signals for monitoring cardiac health of patients |
| EP3501381A1 (en) * | 2017-12-22 | 2019-06-26 | Stichting IMEC Nederland | A method and a system for time domain signal reconstruction for representing heart activity |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2015130929A3 (en) | 2015-10-15 |
| US20160367198A1 (en) | 2016-12-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20160367198A1 (en) | Apparatus and method for detecting and removing artifacts in optically acquired biological signals | |
| Salehizadeh et al. | Photoplethysmograph signal reconstruction based on a novel motion artifact detection-reduction approach. Part II: Motion and noise artifact removal | |
| Roy et al. | Improving photoplethysmographic measurements under motion artifacts using artificial neural network for personal healthcare | |
| Chong et al. | Photoplethysmograph signal reconstruction based on a novel hybrid motion artifact detection–reduction approach. Part I: Motion and noise artifact detection | |
| Lee et al. | Bidirectional recurrent auto-encoder for photoplethysmogram denoising | |
| Lim et al. | Adaptive template matching of photoplethysmogram pulses to detect motion artefact | |
| Lin et al. | A physiological information extraction method based on wearable PPG sensors with motion artifact removal | |
| Huang et al. | Real-time motion artifact removal using a dual-stage median filter | |
| EP2303108A1 (en) | Signal processing mirroring technique | |
| Wu et al. | Camera-based blood pressure estimation via windkessel model and waveform features | |
| Roy et al. | On-device reliability assessment and prediction of missing photoplethysmographic data using deep neural networks | |
| TWI855635B (en) | Non-invasive blood glucose prediction by deduction learning system | |
| Hossain et al. | A deep convolutional autoencoder for automatic motion artifact removal in electrodermal activity | |
| Chowdhury et al. | Estimation of blood glucose level of type-2 diabetes patients using smartphone video through PCA-DA | |
| Davies et al. | Rapid extraction of respiratory waveforms from photoplethysmography: A deep corr-encoder approach | |
| Banerjee et al. | Estimation of ECG parameters using photoplethysmography | |
| Ahmed et al. | Multivariate multiscale entropy for brain consciousness analysis | |
| Hossain et al. | A preliminary study on automatic motion artifact detection in electrodermal activity data using machine learning | |
| Motin et al. | PPG derived respiratory rate estimation in daily living conditions | |
| WO2024061487A1 (en) | Analyzing method for and apparatus of intracranial dynamics | |
| Sawangjai et al. | Removal of motion artifacts from the PPG signal using attentive generative adversarial networks with dual discriminator | |
| Kraft et al. | Reliability factor for accurate remote PPG systems | |
| Motaman et al. | A Dilated CNN‐Based Model for Stress Detection Using Raw PPG Signals | |
| Haq et al. | Feature Selection of Photoplethysmograph Data in Machine Learning | |
| Roy et al. | Reconstruction of corrupted and lost segments from photoplethysmographic data using recurrent neural network |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15754881 Country of ref document: EP Kind code of ref document: A2 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 15121277 Country of ref document: US |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 15754881 Country of ref document: EP Kind code of ref document: A2 |