[go: up one dir, main page]

TWI860159B - Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof - Google Patents

Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof Download PDF

Info

Publication number
TWI860159B
TWI860159B TW112143258A TW112143258A TWI860159B TW I860159 B TWI860159 B TW I860159B TW 112143258 A TW112143258 A TW 112143258A TW 112143258 A TW112143258 A TW 112143258A TW I860159 B TWI860159 B TW I860159B
Authority
TW
Taiwan
Prior art keywords
factor
correlation
information
defect
layer
Prior art date
Application number
TW112143258A
Other languages
Chinese (zh)
Other versions
TW202520107A (en
Inventor
張保榮
吳易儒
Original Assignee
國立高雄大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 國立高雄大學 filed Critical 國立高雄大學
Priority to TW112143258A priority Critical patent/TWI860159B/en
Application granted granted Critical
Publication of TWI860159B publication Critical patent/TWI860159B/en
Publication of TW202520107A publication Critical patent/TW202520107A/en

Links

Images

Landscapes

  • Testing Or Measuring Of Semiconductors Or The Like (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An establishing method for a prediction model on probe mark defect is adapted for solving the problem of the lack of a method for predicting probe mark defect in the conventional techniques. The establishing method includes a data collecting step, an importance analysis step, a factor re-filtering step and a model training step. In the importance analysis step, an importance analysis method is applied to define an important association data. In the factor re-filtering step, the important association data is transformed into a corresponding correlation coefficient matrix, which includes an association among each factors and defect and a correlation between any two of the factors, so that a predicting factor data is determined. In the model training step, the predicting factor data is inputted into a prediction model to be trained to establish a trained prediction model. This invention provides the effect of predicting the probability of occurrence of the probe mark defect.

Description

探針記號缺陷之預測模型建立方法及其預測系統與預 防方法 Method for establishing a prediction model for probe mark defects and its prediction system and prevention method

本發明係關於一種晶圓製程改善,尤其是一種探針記號缺陷之預測與預防方法。 The present invention relates to a wafer process improvement, in particular to a method for predicting and preventing probe mark defects.

在半導體晶圓在製造的過程中,需要經過多次的蝕刻、沉積、濕製程、乾製程等步驟,每一步驟都會影響晶圓的品質和性能。因此,為了確保製程的穩定性和品質的一致性,需要對晶圓進行測試。在測試晶圓時,運用探針卡(Probe Card)的探針接觸晶圓表面上的特定檢測位置,以測試晶圓的電性參數,如電阻、電容、電流等。 During the manufacturing process of semiconductor wafers, they need to go through multiple steps such as etching, deposition, wet process, and dry process. Each step will affect the quality and performance of the wafer. Therefore, in order to ensure the stability of the process and the consistency of quality, the wafer needs to be tested. When testing the wafer, the probe of the probe card is used to contact the specific detection position on the surface of the wafer to test the electrical parameters of the wafer, such as resistance, capacitance, current, etc.

在探針接觸晶圓的表面時,測試用的探針會在晶圓表面留下一些痕跡,這些痕跡稱為探針記號(Probe Mark),可以用來記錄測試點的位置;其中,探針記號應形成在晶圓表面上的特定檢測位置,否則探針記號的偏位將造成晶圓的電性參數有非預期的變化。因此,當探針記號與其預定位置的距離偏差到一定程度時,對應的晶圓將被視為不良品,且這種偏位而使晶圓電性劣化的現象稱為探針記號缺陷(Probe Mark Damage)。惟,尚未有研究提出如何預測探針記號缺陷的產生。 When the probe contacts the surface of the wafer, the test probe will leave some marks on the wafer surface. These marks are called probe marks, which can be used to record the location of the test point. The probe mark should be formed at a specific detection position on the wafer surface, otherwise the deviation of the probe mark will cause unexpected changes in the electrical parameters of the wafer. Therefore, when the distance between the probe mark and its predetermined position deviates to a certain extent, the corresponding wafer will be considered defective, and this phenomenon of deteriorating the electrical properties of the wafer due to deviation is called probe mark damage. However, no research has yet proposed how to predict the occurrence of probe mark defects.

有鑑於此,習知技術在探針記號缺陷的預測確實仍有加以改善 之必要。 In view of this, there is still a need to improve the knowledge technology in the prediction of probe mark defects.

為解決上述問題,本發明的目的是提供一種探針記號缺陷之預測模型建立方法,係可以預測探針記號缺陷的缺陷機率者。 In order to solve the above problems, the purpose of the present invention is to provide a method for establishing a prediction model of probe mark defects, which can predict the defect probability of probe mark defects.

本發明的次一目的是提供一種探針記號缺陷之預測模型建立方法,係可以確保或提升對應預測模型在訓練過程中與建立後的應用過程中的預測準確率者。 The second purpose of the present invention is to provide a method for establishing a prediction model for probe mark defects, which can ensure or improve the prediction accuracy of the corresponding prediction model during the training process and the application process after establishment.

本發明的又一目的是提供一種用於預測探針記號缺陷的預測系統,係可以預測探針記號缺陷的缺陷機率者。 Another object of the present invention is to provide a prediction system for predicting probe mark defects, which can predict the defect probability of probe mark defects.

本發明的再一目的是提供一種用於減少探針記號缺陷的預防方法,係可以降低檢測過程中發生探針記號缺陷的風險者。 Another object of the present invention is to provide a preventive method for reducing probe mark defects, which can reduce the risk of probe mark defects occurring during the detection process.

本發明全文所記載的元件及構件使用「一」或「一個」之量詞,僅是為了方便使用且提供本發明範圍的通常意義;於本發明中應被解讀為包括一個或至少一個,且單一的概念也包括複數的情況,除非其明顯意指其他意思。 The quantifiers "one" or "a" used in the components and parts described throughout the present invention are only for the convenience of use and to provide a general meaning of the scope of the present invention; in the present invention, they should be interpreted as including one or at least one, and the single concept also includes the plural case, unless it is obvious that it means otherwise.

本發明全文所述有關「電腦」或「系統」相關用語,整體或各別可包含至少一「處理器(Processor)」,所述處理器係指具備特定功能且以硬體或硬體與軟體實現的各式資料處理裝置,以處理分析資訊及/或產生對應控制資訊,例如:電子控制器、伺服器、雲端平台、虛擬機器、桌上型電腦、筆記型電腦、平板電腦或智慧型手機等,係本發明所屬技術領域中具有通常知識者可以理解。另,可包含對應的資料接收或傳輸單元,以進行所需資料的接收或傳輸。另,可包含對應的資料庫/儲存單元(特別是非暫態記憶單元),以供對應資料的讀取與儲存。特別是,除非另外特別排除或矛盾,所述 處理器可以是基於分散式系統架構中的多個處理器的集合,用於包含/代表多個處理器間資訊串流處理的過程、機制及結果。 The terms "computer" or "system" described in the present invention may include at least one "processor" as a whole or individually. The processor refers to various data processing devices with specific functions and implemented by hardware or hardware and software to process and analyze information and/or generate corresponding control information, such as electronic controllers, servers, cloud platforms, virtual machines, desktop computers, laptops, tablet computers or smart phones, etc., which can be understood by those with ordinary knowledge in the technical field to which the present invention belongs. In addition, corresponding data receiving or transmitting units may be included to receive or transmit the required data. In addition, corresponding databases/storage units (especially non-transient memory units) may be included for reading and storing corresponding data. In particular, unless otherwise specifically excluded or contradictory, the processor may be a collection of multiple processors based on a distributed system architecture, used to contain/represent the process, mechanism and result of information stream processing between multiple processors.

本發明的探針記號缺陷之預測模型建立方法,係透過一電腦執行以下步驟,所述步驟包含:一資料收集步驟,收集每次探針記號產生過程中的數種不同因子,且由每次所收集的數種不同因子形成一初步因子組合,並將各該初步因子組合與是否發生缺陷的資訊關聯,以定義一初步關聯資訊;一重要性分析步驟,運用一重要性分析法,將所述初步關聯資訊進行篩選,以獲得與該缺陷發生相關的經初步篩選因子組合,並將所述經初步篩選因子組合與是否發生缺陷的資訊關聯,以定義一重要關聯資訊;一因子再篩選步驟,包含運用一因子間關聯性分析方法將該重要關聯資訊轉換為對應的一相關係數矩陣,該相關係數矩陣中包含各因子與缺陷間之一相關性且包含任二種因子間的一關聯度;選擇該相關性不小於一第一篩選閾值所對應的各因子,以定義為一高相關性因子組合;在任二種因子間所對應的關聯度不小於一第二篩選閾值時,將該二種因子中與該缺陷的該相關性為較低者定義為一剔除因子;及自該高相關性因子組合中將與該剔除因子相同的因子移除,以獲取對應的經再次篩選因子組合,將所述經再次篩選因子組合與是否發生缺陷的資訊關聯,以定義一預測因子資訊;及一模型訓練步驟,將所述預測因子資訊輸入至待訓練的一預測模型,以建立經訓練的一預測模型,該預測模型係以一基礎轉換器架構所建立;所述基礎轉換器架構包含一編碼器網路與一解碼器網路;該編碼器網路依序包含一多頭自注意力層、一加總與歸一化層、一前饋網路層及另一加總與歸一化層;該解碼器網路依序包含一遮罩多頭注意力層、一加總與歸一化層、一多頭跨注意力層、另一加總與歸一化層、一前饋網路層及再一加總與歸一化層。 The method for establishing a prediction model of a probe mark defect of the present invention is to execute the following steps through a computer, the steps comprising: a data collection step, collecting a plurality of different factors in each probe mark generation process, and forming a preliminary factor combination from the plurality of different factors collected each time, and associating each preliminary factor combination with information on whether a defect occurs to define a preliminary associated information; an importance analysis step, using an importance analysis method to screen the preliminary associated information to obtain information related to the occurrence of the defect; A combination of initially screened factors that are correlated with the defects is generated, and the combination of initially screened factors is correlated with information on whether a defect occurs to define an important correlation information; a factor re-screening step includes using a factor correlation analysis method to convert the important correlation information into a corresponding correlation coefficient matrix, wherein the correlation coefficient matrix includes a correlation between each factor and the defect and includes a correlation degree between any two factors; each factor corresponding to a correlation that is not less than a first screening threshold is selected to define a high correlation factor. sub-combination; when the corresponding correlation between any two factors is not less than a second screening threshold, the factor with the lower correlation with the defect is defined as a rejection factor; and the factor identical to the rejection factor is removed from the high correlation factor combination to obtain a corresponding re-screened factor combination, and the re-screened factor combination is associated with information on whether a defect occurs to define a prediction factor information; and a model training step, the prediction factor information is input into a prediction model to be trained , to establish a trained prediction model, which is established with a basic transformer architecture; the basic transformer architecture includes an encoder network and a decoder network; the encoder network sequentially includes a multi-head self-attention layer, a summing and normalization layer, a feedforward network layer and another summing and normalization layer; the decoder network sequentially includes a masked multi-head attention layer, a summing and normalization layer, a multi-head cross-attention layer, another summing and normalization layer, a feedforward network layer and another summing and normalization layer.

本發明的探針記號缺陷之預測模型建立方法,係透過一電腦執 行以下步驟,所述步驟包含:一資料收集步驟,收集每次探針記號產生過程中的數種不同因子,且由每次所收集的數種不同因子形成一初步因子組合,並將各該初步因子組合與是否發生缺陷的資訊關聯,以定義一初步關聯資訊;一重要性分析步驟,運用一重要性分析法,將所述初步關聯資訊進行篩選,以獲得與該缺陷發生相關的經初步篩選因子組合,並將所述經初步篩選因子組合與是否發生缺陷的資訊關聯,以定義一重要關聯資訊;一因子再篩選步驟,包含運用一因子間關聯性分析方法將該重要關聯資訊轉換為對應的一相關係數矩陣,該相關係數矩陣中包含各因子與缺陷間之一相關性且包含任二種因子間的一關聯度;選擇該相關性不小於一第一篩選閾值所對應的各因子,以定義為一高相關性因子組合;在任二種因子間所對應的關聯度不小於一第二篩選閾值時,將該二種因子中與該缺陷的該相關性為較低者定義為一剔除因子;及自該高相關性因子組合中將與該剔除因子相同的因子移除,以獲取對應的經再次篩選因子組合,將所述經再次篩選因子組合與是否發生缺陷的資訊關聯,以定義一預測因子資訊;及一模型訓練步驟,將所述預測因子資訊輸入至待訓練的一預測模型,以建立經訓練的一預測模型,該預測模型係以一稀疏注意力轉換器架構所建立;所述稀疏注意力轉換器架構包含一編碼器網路與一解碼器網路;該編碼器網路依序包含一多頭自注意力層、一加總與歸一化層、一前饋網路層及另一加總與歸一化層;該解碼器網路依序包含一稀疏多頭自注意力層、一加總與歸一化層、一多頭跨注意力層、另一加總與歸一化層、一前饋網路層、再一加總與歸一化層、一線性層及一歸一化指數層。 The method for establishing a prediction model of a probe mark defect of the present invention is to execute the following steps through a computer, the steps comprising: a data collection step, collecting a plurality of different factors in each probe mark generation process, and forming a preliminary factor combination from the plurality of different factors collected each time, and associating each preliminary factor combination with information on whether a defect occurs to define a preliminary associated information; an importance analysis step, using an importance analysis method to screen the preliminary associated information to obtain information related to the occurrence of the defect; A combination of factors is preliminarily screened, and the preliminarily screened combination of factors is associated with information on whether a defect occurs to define an important correlation information; a factor re-screening step includes using a factor correlation analysis method to convert the important correlation information into a corresponding correlation coefficient matrix, wherein the correlation coefficient matrix includes a correlation between each factor and the defect and a correlation between any two factors; each factor corresponding to the correlation being not less than a first screening threshold is selected to define a high correlation factor combination; between any two factors When the correlation between the two factors is not less than a second screening threshold, the factor with the lower correlation with the defect is defined as a rejection factor; and the factor identical to the rejection factor is removed from the high correlation factor combination to obtain a corresponding re-screened factor combination, and the re-screened factor combination is associated with information on whether a defect occurs to define a prediction factor information; and a model training step is performed, wherein the prediction factor information is input into a prediction model to be trained to establish a trained prediction model, and ... The prediction model is established with a sparse attention transformer architecture; the sparse attention transformer architecture includes an encoder network and a decoder network; the encoder network sequentially includes a multi-head self-attention layer, a summation and normalization layer, a feedforward network layer and another summation and normalization layer; the decoder network sequentially includes a sparse multi-head self-attention layer, a summation and normalization layer, a multi-head cross-attention layer, another summation and normalization layer, a feedforward network layer, another summation and normalization layer, a linear layer and a normalized exponential layer.

據此,本發明的探針記號缺陷之預測模型建立方法,藉由所述重要性分析步驟與所述因子再篩選步驟所獲得的因子組合,一方面,可以確保該因子組合與預測目標(即為是否發生探針記號缺陷的機率)間具有高度 相關性,而可以達成確保或提升對應預測模型在訓練過程中與建立後的應用過程中預測準確率的功效;另一方面,可以自高關聯度的因子中篩選出可剔除者,進而將用於訓練/輸入該預測模型的因子的種類數量降維,可以達成降低該預測模型運算量而提升運算效率的功效;藉由該轉換器架構中的注意力機制,能更聚焦須被關注的特徵,可以達成更準確辨識/預測功效,該轉換器架構中各層的配置,可以提升模型訓練與運算效率的功效;或,藉由稀疏多頭自注意力層中所具有的間隔選擇機制,一方面,對應模型能夠更好地捕捉因子/參數間的長期依賴性,能專注於計算具有高注意力分數的少數重要位置,而可以達成減少運算量而提高運算效率的功效;另一方面,能減少對應模型在訓練過程中過度擬合的風險,且對應模型可以更容易泛化到新數據,而可以達成提升模型的準確度。 Accordingly, the method for establishing a prediction model of a probe mark defect of the present invention can ensure, through the combination of factors obtained in the importance analysis step and the factor re-screening step, that the combination of factors is highly correlated with the prediction target (i.e., the probability of a probe mark defect), thereby ensuring or improving the prediction accuracy of the corresponding prediction model during the training process and the application process after establishment; on the other hand, it can screen out factors that can be eliminated from factors with high correlation, thereby reducing the number of factors used to train/input the prediction model, thereby reducing the amount of calculation of the prediction model and improving the calculation efficiency; through the converter The attention mechanism in the architecture can focus more on the features that need to be paid attention to, and can achieve more accurate recognition/prediction. The configuration of each layer in the transformer architecture can improve the model training and computational efficiency. Or, through the interval selection mechanism in the sparse multi-head self-attention layer, on the one hand, the corresponding model can better capture the long-term dependency between factors/parameters, and can focus on calculating a few important positions with high attention scores, thereby reducing the amount of computation and improving computational efficiency. On the other hand, it can reduce the risk of overfitting of the corresponding model during training, and the corresponding model can be more easily generalized to new data, thereby improving the accuracy of the model.

其中,該預測模型中的超參數係配置如下;在初始超參數中,批處理大小為64,訓練回合為50,學習率為0.0001;在該編碼器網路中:該多頭自注意力層的頭部數量為8個,注意力鍵的維度為64;該前饋網路層中的丟棄率為0.1,全連接層的輸出維度為64並使用ReLU激活函數;及在該解碼器網路中:該稀疏多頭自注意力層的頭部數量為8個,注意力鍵的維度64;該多頭跨注意力層的頭部數量為8個,注意力鍵的維度為64;該前饋網路層中的丟棄率為0.1,全連接層的輸出維度為64並使用ReLU激活函數;該線性層的輸出維度為2,使用ReLU激活函數。 Among them, the hyperparameters in the prediction model are configured as follows; in the initial hyperparameters, the batch size is 64, the training rounds are 50, and the learning rate is 0.0001; in the encoder network: the number of heads of the multi-head self-attention layer is 8, and the dimension of the attention key is 64; the dropout rate in the feedforward network layer is 0.1, and the output dimension of the fully connected layer is 64 and uses the ReLU activation function ; and in the decoder network: the number of heads of the sparse multi-head self-attention layer is 8, and the dimension of the attention key is 64; the number of heads of the multi-head cross-attention layer is 8, and the dimension of the attention key is 64; the dropout rate in the feedforward network layer is 0.1, the output dimension of the fully connected layer is 64 and uses the ReLU activation function; the output dimension of the linear layer is 2, and the ReLU activation function is used.

其中,該重要性分析法係為一隨機森林演算法。如此,可以達成有效率、高準確率、高穩定性地獲取高關聯性的因子的功效。 Among them, the importance analysis method is a random forest algorithm. In this way, it is possible to achieve the effect of obtaining highly correlated factors efficiently, accurately and stably.

其中,該預測因子資訊係至少由一針尖最小直徑的因子、一校正位置的因子及對應的缺陷發生資訊所組成。如此,藉由該等因子係經所述重要性分析步驟與所述因子再篩選步驟所獲得,該等因子係為高度相關性且 具代表性的預測因子,使該預測模型至少可由上述因子的資料進行訓練與預測,且可以達成提升運算效率與準確率的功率。 The prediction factor information is composed of at least a factor of the minimum diameter of the needle tip, a factor of the correction position and the corresponding defect occurrence information. Thus, these factors are obtained through the importance analysis step and the factor re-screening step, and these factors are highly relevant and representative prediction factors, so that the prediction model can at least be trained and predicted by the data of the above factors, and can achieve the power of improving the computational efficiency and accuracy.

本發明的用於預測探針記號缺陷的預測系統,包含由上述探針記號缺陷之預測模型建立方法所建立的一預測模型,該預測模型係用於接收一監控資料以輸出是否會產生探針記號缺陷的一缺陷機率;該監控資料中各因子的種類係與該預測因子資訊中各因子的種類相同。 The prediction system for predicting probe mark defects of the present invention includes a prediction model established by the above-mentioned method for establishing a prediction model for probe mark defects. The prediction model is used to receive monitoring data to output a defect probability of whether a probe mark defect will occur; the type of each factor in the monitoring data is the same as the type of each factor in the prediction factor information.

據此,本發明的用於預測探針記號缺陷的預測系統,藉由具有上述探針記號缺陷之預測模型建立方法所建立的該預測模型,可以達成提升運算效率與準確率的功率。 Accordingly, the prediction system for predicting probe mark defects of the present invention can achieve the power of improving computational efficiency and accuracy by using the prediction model established by the above-mentioned method for establishing a prediction model of probe mark defects.

本發明的用於預測探針記號缺陷的預測系統,包含上述預測系統,在該預測系統接收當前一工件所對應的一監控資料而產生對應的一缺陷機率,且在該缺陷機率超出一預設警示閾值的一情形中,於後續其他工件所對應的檢測程序執行前,調整該監控資料中至少一因子在後續的數值不同於該監控資料中該至少一因子在當前的數值,使後續的該監控資料中所具有的各因子的數值所對應的一缺陷機率不超出該預設警示閾值。 The prediction system for predicting probe mark defects of the present invention includes the above-mentioned prediction system. When the prediction system receives a monitoring data corresponding to a current workpiece and generates a corresponding defect probability, and in a situation where the defect probability exceeds a preset warning threshold, before executing the inspection procedure corresponding to other subsequent workpieces, the subsequent value of at least one factor in the monitoring data is adjusted to be different from the current value of the at least one factor in the monitoring data, so that the defect probability corresponding to the value of each factor in the subsequent monitoring data does not exceed the preset warning threshold.

據此,本發明的用於預測探針記號缺陷的預測系統,藉由運用上述預測模型,且搭配該預設警示值與缺陷機率的關係,進而調整後續監控資料中至少一因子使對應缺陷機率不超出該預設警示閾值的機制,可以達成減少探針記號缺陷發生機率的功效。 Accordingly, the prediction system for predicting probe mark defects of the present invention can achieve the effect of reducing the probability of probe mark defects by applying the above prediction model and matching the relationship between the preset warning value and the defect probability, and then adjusting at least one factor in the subsequent monitoring data so that the corresponding defect probability does not exceed the preset warning threshold value.

S1:資料收集步驟 S1: Data collection step

S1A:資料篩選步驟 S1A: Data screening steps

S1A-1:重要性分析步驟 S1A-1: Importance analysis steps

S1A-2:因子再篩選步驟 S1A-2: Factor re-screening step

S2:模型訓練步驟 S2: Model training steps

〔第1圖〕本發明探針記號缺陷之預測模型建立方法的流程示意圖。 [Figure 1] Schematic diagram of the process of establishing a prediction model for probe mark defects of the present invention.

〔第2圖〕一種基礎轉換器架構之配置示意圖。 [Figure 2] A schematic diagram of a basic converter architecture.

〔第3圖〕一種稀疏注意力轉換器架構之配置示意圖。 [Figure 3] A schematic diagram of the configuration of a sparse attention switch architecture.

為讓本發明之上述及其他目的、特徵及優點能更明顯易懂,下文特舉本發明之較佳實施例,並配合所附圖式作詳細說明。 In order to make the above and other purposes, features and advantages of the present invention more clearly understood, the following specifically cites a preferred embodiment of the present invention and provides a detailed description in conjunction with the attached drawings.

請參照第1圖所示,其係本發明探針記號缺陷之預測模型建立方法的一較佳實施例,係透過一電腦執行一資料收集步驟S1、可選的一資料篩選步驟S1A及一模型訓練步驟S2,以完成目標模型的訓練/建立。 Please refer to FIG. 1, which is a preferred embodiment of the method for establishing a prediction model of a probe mark defect of the present invention, which is to complete the training/establishment of the target model by executing a data collection step S1, an optional data screening step S1A and a model training step S2 through a computer.

在該資料收集步驟S1中,收集每次探針記號產生過程中相關的數種不同因子/參數,且由每次所收集的數種不同因子形成一初步因子組合,並將各該初步因子組合與是否發生缺陷的資訊關聯,以定義一初步關聯資訊。其中,所述初步關聯資訊中的內容例如是呈現如某個時間點的各種因子組合與是否發生缺陷;舉例而言,在第一時間點t1,所述初步關聯資訊可包含一第一因子A的對應的參數a1,一第二因子B的對應參數b1,一第三因子C的對應參數c1,及是否產生缺陷的結果r1;同樣地,在第二時間點t2,所述初步關聯資訊可包含該第一因子A的對應的參數a2,一第二因子B的對應參數b2,一第三因子C的對應參數c2,及是否產生缺陷的結果r2。 In the data collection step S1, several different factors/parameters related to each probe mark generation process are collected, and a preliminary factor combination is formed from the several different factors collected each time, and each preliminary factor combination is associated with information on whether a defect occurs to define preliminary associated information. The content of the preliminary correlation information, for example, presents various factor combinations and whether defects occur at a certain time point; for example, at the first time point t1, the preliminary correlation information may include a corresponding parameter a1 of the first factor A, a corresponding parameter b1 of the second factor B, a corresponding parameter c1 of the third factor C, and a result r1 of whether a defect occurs; similarly, at the second time point t2, the preliminary correlation information may include a corresponding parameter a2 of the first factor A, a corresponding parameter b2 of the second factor B, a corresponding parameter c2 of the third factor C, and a result r2 of whether a defect occurs.

詳言之,所述數種不同因子的資訊,係指一工件(例如是一晶圓)透過運用探針卡裝置進行檢測,而在該工件上產生探針記號的數種不同因子,且各該因子的具體數值或資訊,可透過所述探針卡裝置中內建的系統來取得而實現,及/或透過額外裝設對應感測元件來偵測而實現,及/或透過後續相關檢測/量測來獲取而實現。舉例而言,在本發明的一具體實施範例中,所關注是否發生的「缺陷」係指「探針記號缺陷」,特別是指在進行量測時,對應的探針記號與探針卡邊緣的一最小距離所定義的一偏移量不大於一距離 閾值時,即定義為發生所述探針記號缺陷;較佳地,所述距離閾值為5nm。 In detail, the information of the several different factors refers to the several different factors that generate probe marks on a workpiece (e.g., a wafer) when the workpiece is inspected using a probe card device, and the specific value or information of each factor can be obtained through a built-in system in the probe card device, and/or detected by additionally installing corresponding sensing elements, and/or obtained through subsequent related inspections/measurements. For example, in a specific embodiment of the present invention, the "defect" of concern refers to the "probe mark defect", and in particular, when the offset defined by the minimum distance between the corresponding probe mark and the edge of the probe card is not greater than a distance threshold during measurement, it is defined as the occurrence of the probe mark defect; preferably, the distance threshold is 5nm.

在本發明的一具體實施範例中,所述數種不同因子或所述初步因子組合係包含: In a specific embodiment of the present invention, the several different factors or the preliminary factor combination include:

(1)工件批次數量(Count by Lot):代表透過探針卡進行檢測的一工件批次數量。 (1) Count by Lot: represents the number of workpiece batches tested by the probe card.

(2)接觸點數量(Number of contacts):代表於檢測一工件時,探針卡之探針在工件上所產生的接觸點數量。 (2) Number of contacts: represents the number of contact points generated by the probe of the probe card on the workpiece when inspecting the workpiece.

(3)針尖最小直徑(Needle Tip Min):探針卡之探針在與工件接觸所產生的所有針尖記號之直徑的最小數值。 (3) Needle Tip Min: The minimum diameter of all needle tip marks produced when the probe card's probe contacts the workpiece.

(4)針尖最大直徑(Needle Tip Max):探針卡之探針在與工件接觸所產生的所有針尖記號之直徑的最大數值。 (4) Needle Tip Max: The maximum diameter of all needle tip marks produced when the probe card's probe contacts the workpiece.

(5)針尖平均直徑(Needle Tip Ave):探針卡之探針在與工件接觸所產生的所有針尖記號之直徑的平均數值。 (5) Needle Tip Ave: The average diameter of all needle tip marks produced when the probe card's probe contacts the workpiece.

(6)探針高度(Pb Height):在探針卡執行檢測程序前,探針卡與工件之間的距離。 (6) Probe height (Pb Height): The distance between the probe card and the workpiece before the probe card performs the inspection procedure.

(7)探針卡的污染程度(Probe Card Clean Cnt):用已表示探針卡上的汙染物髒污程度,數值越高表示汙染越嚴重。 (7) Probe Card Clean Cnt: This value indicates the degree of contamination on the probe card. The higher the value, the more serious the contamination.

(8)探針負載(Probing Overdrive):探針卡裝置在進行測量時的電壓值。 (8) Probing Overdrive: The voltage value of the probe card device when measuring.

(9)校正位置(Alignment End Position):探針卡在進行垂直移動至對應工件表面上形成探針記號前,探針卡的初始位置在水平方向上移動到一預設量測位置的距離。 (9) Alignment End Position: Before the probe card moves vertically to form a probe mark on the corresponding workpiece surface, the initial position of the probe card moves horizontally to a preset measurement position.

在可選的該資料篩選步驟S1A中,係包含一重要性分析步驟S1A-1與可選的一因子再篩選步驟S1A-2。在該重要性分析步驟S1A-1中,運用一重要性分析法,將所述初步關聯資訊進行篩選,以獲得與該缺陷發生 (可定義為一目標特性)為高度相關的經初步篩選因子組合,並將所述經初步篩選因子組合與是否發生缺陷的資訊關聯,以定義一重要關聯資訊。特別地,所述重要性分析法係可運用任何已知方法,且為本發明領域中具有通常知識者可理解;舉例而言,可應用隨機森林演算法作為所述重要性分析法。詳言之,在該重要性分析步驟S1A-1中,假定初始定義與預測結果相關的因子為i個,透過該資料篩選步驟S1A後,可自i個因子中得到較相關的j個因子;其中,i、j皆為正整數,且j不大於i。因此,透過該重要性分析步驟S1A-1,可篩選出與關注結果較高度關聯的因子,並運用較高度關聯的因子進行後續預測模型的學習。如此,一方面可減少整體訓練因子的維度,提升模型訓練與運算速度;另一方面,模型係以較高度關聯的因子訓練,有助於提升模型預測的準確性。 In the optional data screening step S1A, an importance analysis step S1A-1 and an optional factor re-screening step S1A-2 are included. In the importance analysis step S1A-1, an importance analysis method is used to screen the preliminary correlation information to obtain a combination of preliminary screened factors that are highly correlated with the occurrence of the defect (which can be defined as a target characteristic), and the combination of preliminary screened factors is associated with information on whether a defect occurs to define an important correlation information. In particular, the importance analysis method can use any known method and is understandable to a person with ordinary knowledge in the field of the present invention; for example, a random forest algorithm can be applied as the importance analysis method. In detail, in the importance analysis step S1A-1, it is assumed that the factors initially defined to be relevant to the prediction results are i. After the data screening step S1A, j factors with higher correlation can be obtained from the i factors; i and j are both positive integers, and j is not greater than i. Therefore, through the importance analysis step S1A-1, factors with higher correlation to the results of interest can be screened out, and the factors with higher correlation can be used to learn the subsequent prediction model. In this way, on the one hand, the dimension of the overall training factors can be reduced, and the model training and calculation speed can be improved; on the other hand, the model is trained with factors with higher correlation, which helps to improve the accuracy of model prediction.

舉例而言,在本發明的一具體實施範例中,承上述資料收集步驟S1所獲取的該九種不同因子及其對應缺陷資訊/目標特性所定義的初步關聯資訊,在該重要性分析步驟S1A-1中,將所述初步關聯資訊運用重要性分析法中的隨機森林演算法進行篩選,可獲得五種對於缺陷發生/目標特性具有顯著影響的因子,以定義上述的重要關聯資訊。詳言之,該重要關聯資訊中的五種因子分別為探針高度、針尖最小直徑、針尖平均直徑、針尖最大直徑及校正位置,且對應的重要性(總和為1)分別為0.3001、0.2440、0.2221、0.1675、0.0663。 For example, in a specific implementation example of the present invention, based on the nine different factors obtained in the above data collection step S1 and the preliminary correlation information defined by the corresponding defect information/target characteristics, in the importance analysis step S1A-1, the preliminary correlation information is screened using the random forest algorithm in the importance analysis method, and five factors with significant effects on defect occurrence/target characteristics can be obtained to define the above important correlation information. In detail, the five factors in the important correlation information are probe height, minimum needle tip diameter, average needle tip diameter, maximum needle tip diameter and correction position, and the corresponding importance (sum is 1) is 0.3001, 0.2440, 0.2221, 0.1675, 0.0663 respectively.

在可選的該因子再篩選步驟S1A-2中,可運用一因子間關聯性分析方法將所述初步關聯資訊或該重要關聯資訊轉換為對應的一相關係數矩陣(Correlation Coefficient Matrix),該相關係數矩陣中包含各因子與缺陷間之一相關性且包含任二種因子間的一關聯度;選擇該相關性不小於一第一篩選閾值所對應的各因子,以獲得與該缺陷發生/目標特性為高度相關的高 相關性因子組合;同時,在任二種因子間的該關聯度不小於一第二篩選閾值時,將該二種因子中與該缺陷的該相關性為較低者定義為一剔除因子,即該剔除因子係代表在考量缺陷是否發生的因子中,可被視為實質上為重複考量的因子;接著,自該高相關性因子組合中將與該剔除因子相同的因子移除,以獲取對應的經再次篩選因子組合,將所述經再次篩選因子組合與是否發生缺陷的資訊關聯,以定義一預測因子資訊。特別地,所述因子間關聯性分析方法係可運用任何已知方法,且為本發明領域中具有通常知識者可理解;舉例而言,所述因子間關聯性分析方法可以是皮爾遜相關係數(Pearson Correlation Coefficient)、斯皮爾曼相關係數(Spearman Rank Correlation Coefficient)、肯德爾相關係數(Kendall Rank Correlation Coefficient)、點二列相關係數(Point-biserial Correlation Coefficient)或其他雙變量回歸分析法,且並不以上述舉例之方法為限,以獲取對應的相關係數矩陣。 In the optional factor re-screening step S1A-2, a factor correlation analysis method can be used to convert the preliminary correlation information or the important correlation information into a corresponding correlation coefficient matrix, which includes a correlation between each factor and the defect and a correlation between any two factors; select each factor corresponding to the correlation not less than a first screening threshold to obtain a high correlation factor combination that is highly correlated with the defect occurrence/target characteristic; at the same time, when the correlation between any two factors is not less than a second screening threshold, the two factors that are highly correlated with the defect are selected. The one with a lower correlation is defined as a rejection factor, that is, the rejection factor represents a factor that can be considered as a substantially repeated consideration among the factors for considering whether a defect occurs; then, the factor that is the same as the rejection factor is removed from the high correlation factor combination to obtain a corresponding re-screened factor combination, and the re-screened factor combination is associated with the information of whether a defect occurs to define a prediction factor information. In particular, the inter-factor correlation analysis method may use any known method and is understandable to those with ordinary knowledge in the field of the present invention; for example, the inter-factor correlation analysis method may be Pearson Correlation Coefficient, Spearman Rank Correlation Coefficient, Kendall Rank Correlation Coefficient, Point-biserial Correlation Coefficient or other bivariate regression analysis methods, and is not limited to the above-mentioned methods, to obtain the corresponding correlation coefficient matrix.

在本發明的一具體實施範例中,承上述重要性分析步驟S1A-1所篩選的五種因子與對應缺陷所定義的重要關聯資訊,該因子再篩選步驟S1A-2中,將所述重要關聯資訊運用皮爾遜相關係數的關聯性分析方法,可獲得該重要關聯資訊的相關係數矩陣(Correlation Coefficient Matrix)如下列表1所示。 In a specific implementation example of the present invention, the five factors selected in the importance analysis step S1A-1 and the important correlation information defined by the corresponding defects are further screened in the factor screening step S1A-2, and the important correlation information is subjected to the correlation analysis method of the Pearson correlation coefficient, and the correlation coefficient matrix (Correlation Coefficient Matrix) of the important correlation information can be obtained as shown in the following table 1.

Figure 112143258-A0305-02-0012-1
Figure 112143258-A0305-02-0012-1
Figure 112143258-A0305-02-0013-3
Figure 112143258-A0305-02-0013-3

由表1可知,與探針缺陷記號相關的因子依相關性由大至小排列分別為:針尖最小直徑(對應數值為-0.04)、校正位置(對應數值為-0.034)、針尖平均直徑(對應數值為-0.033)、針尖最大直徑(對應數值為-0.021)及探針高度(-0.0015)。因此,可再由上述五種因子中,選擇相關性較大的k種因子進行後續相關預測模型的訓練,所述k為正整數,且不大於j。詳言之,由上述相關性,可將所有因子的相關性取絕對值相加得到一相關性總和後,在以各因子的相關性除以該相關性總和來觀察各因子的對結果的影響程度。 As shown in Table 1, the factors related to the probe defect mark are arranged from large to small according to the correlation: the minimum diameter of the needle tip (corresponding to the value of -0.04), the correction position (corresponding to the value of -0.034), the average diameter of the needle tip (corresponding to the value of -0.033), the maximum diameter of the needle tip (corresponding to the value of -0.021) and the height of the probe (-0.0015). Therefore, k factors with greater correlation can be selected from the above five factors for subsequent training of the correlation prediction model, where k is a positive integer and is not greater than j. In detail, based on the above correlation, the absolute values of the correlations of all factors can be added to obtain a sum of correlations, and then the influence of each factor on the result can be observed by dividing the correlation of each factor by the sum of the correlations.

詳言之,在選擇k種因子的過程中,對應前述有關高相關因子組合的選擇過程,可運用一第一篩選閾值(例如是可定義為不大於100%除以當前因子的數量,為並不以此為限),並可定義當一因子的相關性占比不小於該第一篩選閾值時,該因子被選擇為後續預測模型中訓練用的資訊。其中,可理解是,當因子所對應的相關性占比越高,該因子與該目標特性的相依程度越高。舉例而言,以表1為例,所述相關性總合為0.1295,針尖最小直徑的相關性占比為30.89%,校正位置的相關性占比為26.25%,針尖平均直徑的相關性占比為25.48%,針尖最大直徑的相關性占比為16.22%,探針高度的相關性占比為1.16%;此時,可使該第一篩選閾值根據實際需求而定義為任意合理的一數值,例如是5%,對應的高相關性因子組合即是由針尖最小直徑、校正位置、針尖平均直徑及針尖最大直徑等因子所組成。 In detail, in the process of selecting k factors, corresponding to the aforementioned selection process of the combination of highly correlated factors, a first screening threshold may be used (for example, it may be defined as not more than 100% divided by the number of current factors, but is not limited thereto), and it may be defined that when the correlation ratio of a factor is not less than the first screening threshold, the factor is selected as information for training in the subsequent prediction model. It can be understood that the higher the correlation ratio corresponding to the factor, the higher the degree of dependence of the factor on the target characteristic. For example, taking Table 1 as an example, the total correlation is 0.1295, the correlation of the minimum tip diameter accounts for 30.89%, the correlation of the correction position accounts for 26.25%, the correlation of the average tip diameter accounts for 25.48%, the correlation of the maximum tip diameter accounts for 16.22%, and the correlation of the probe height accounts for 1.16%; at this time, the first screening threshold can be defined as any reasonable value according to actual needs, such as 5%, and the corresponding high correlation factor combination is composed of factors such as the minimum tip diameter, the correction position, the average tip diameter and the maximum tip diameter.

接著,對應前述有關剃除因子的選擇,由表1亦可觀察到任二種因子間的關聯度,並可設定一第二篩選閾值,當任二種因子間的關聯度不小於該第二篩選閾值時,可選擇該二種因子間與該目標特性的相關性高者為代表因子,使其中相關性較低者為剔除因子,並於前述高相關性因子組合中 將與該剔除因子相同的因子移除,以簡化輸入預測模型的資訊量(因子種類的數量)。其中,可理解是,當二種因子間關聯度越高,該二種因子中的一者可反應另一者的程度越高。舉例而言,以表1為例,可觀察到針尖最小直徑、針尖平均直徑、針尖最大直徑間任二者的關聯度最少為0.77(此時對應的第二篩選閾值可例如被設定為不大於0.77),且針尖最小直徑的因子與該缺陷的相關性占比為該等因子中的最高者,使針尖最小直徑的因子係對應前述的代表因子,且使針尖平均直徑及針尖最大直徑的二種因子係對應前述的剃除因子。 Next, corresponding to the aforementioned selection of the shaving factors, the correlation between any two factors can also be observed from Table 1, and a second screening threshold can be set. When the correlation between any two factors is not less than the second screening threshold, the factor with the higher correlation between the two factors and the target characteristic can be selected as the representative factor, and the factor with the lower correlation can be selected as the elimination factor. In addition, the factors that are the same as the elimination factor in the aforementioned high correlation factor combination are removed to simplify the amount of information (the number of factor types) input into the prediction model. It can be understood that the higher the correlation between the two factors, the higher the degree to which one of the two factors can reflect the other. For example, taking Table 1 as an example, it can be observed that the correlation between any two of the minimum tip diameter, the average tip diameter, and the maximum tip diameter is at least 0.77 (the corresponding second screening threshold can be set to be no greater than 0.77, for example), and the correlation ratio between the factor of the minimum tip diameter and the defect is the highest among these factors, so that the factor of the minimum tip diameter corresponds to the aforementioned representative factor, and the two factors of the average tip diameter and the maximum tip diameter correspond to the aforementioned shaving factor.

最後,自該高相關性因子組合〔針尖最小直徑,校正位置,針尖平均直徑,針尖最大直徑〕將與所述剔除因子〔針尖平均直徑,針尖最大直徑〕相同的因子移除,以定義所述經再次篩選因子〔針尖最小直徑,校正位置〕。換言之,經上述相關性與關聯度的相關篩選規則,在訓練對應的預測模型時,可以考量使用針尖最小直徑的因子代表針尖平均直徑及針尖最大直徑所對應的兩種因子;亦即,該因子再篩選步驟S1A-2係同時考量該第一篩選閾值(影響缺陷發生的相關性占比)與該第二篩選閾值(因子間的關聯度)以獲得對應的預測因子資訊(包含針尖最小直徑與校正位置的二種因子與對應的缺陷發生資訊),並以該預測因子資訊進行對應預測模型的訓練。 Finally, the factors identical to the rejection factor [average needle tip diameter, maximum needle tip diameter] are removed from the high correlation factor combination [minimum needle tip diameter, correction position, average needle tip diameter, maximum needle tip diameter] to define the re-screened factor [minimum needle tip diameter, correction position]. In other words, through the above-mentioned relevant screening rules of correlation and correlation degree, when training the corresponding prediction model, it is possible to consider using the factor of the minimum diameter of the needle tip to represent the two factors corresponding to the average diameter of the needle tip and the maximum diameter of the needle tip; that is, the factor re-screening step S1A-2 simultaneously considers the first screening threshold (the correlation ratio affecting the occurrence of defects) and the second screening threshold (the correlation degree between factors) to obtain the corresponding prediction factor information (including the two factors of the minimum diameter of the needle tip and the correction position and the corresponding defect occurrence information), and use the prediction factor information to train the corresponding prediction model.

據由上述的該資料收集步驟S1、可選的該資料篩選步驟S1A中的該重要性分析步驟S1A-1與可選的該因子再篩選步驟S1A-2,可得到數種不同訓練資料的組合一~三,分別為: According to the above-mentioned data collection step S1, the optional importance analysis step S1A-1 in the data screening step S1A and the optional factor re-screening step S1A-2, several different training data combinations one to three can be obtained, which are:

(1)訓練資料組合一:用於訓練預測模型的訓練資料(對應前述的初步關聯資訊)僅為經過該資料收集步驟S1所得到的因子與對應的缺陷發生資訊,該因子的種類數量為i種。 (1) Training data combination 1: The training data used to train the prediction model (corresponding to the aforementioned preliminary correlation information) is only the factors obtained through the data collection step S1 and the corresponding defect occurrence information. The number of types of the factors is i.

(2)訓練資料組合二:用於訓練預測模型的訓練資料(對應前述的重要關 聯資訊)係經過資料收集步驟S1與該重要性分析步驟S1A-1所得到的因子與對應的缺陷發生資訊,該因子的種類數量為j種。 (2) Training data combination 2: The training data used to train the prediction model (corresponding to the aforementioned important correlation information) is the factors and corresponding defect occurrence information obtained through the data collection step S1 and the importance analysis step S1A-1. The number of types of the factors is j.

(3)訓練資料組合三:用於訓練預測模型的訓練資料(對應前述的預測因子資訊)係經過資料收集步驟S1、該重要性分析步驟S1A-1及該因子再篩選步驟S1A-2所得到的因子與對應的缺陷發生資訊,該因子的種類數量為k種。 (3) Training data combination 3: The training data used to train the prediction model (corresponding to the aforementioned prediction factor information) is the factors and corresponding defect occurrence information obtained through the data collection step S1, the importance analysis step S1A-1 and the factor re-screening step S1A-2. The number of types of the factors is k.

應注意的是,在該等訓練資料組合中,亦包含j種因子與i種因子完全相同情形,或k種因子與j種因子完全相同的情形。用於訓練後續預測模型的資料可以包含前述初步關聯資訊(對應該資料收集步驟S1),較佳是前述重要性關聯資訊(對應該重要性分析步驟S1A-1),更佳是前述預測因子資訊(對應該因子再篩選步驟S1A-2)。 It should be noted that the training data combinations also include situations where j factors are exactly the same as i factors, or situations where k factors are exactly the same as j factors. The data used to train the subsequent prediction model may include the aforementioned preliminary correlation information (corresponding to the data collection step S1), preferably the aforementioned importance correlation information (corresponding to the importance analysis step S1A-1), and more preferably the aforementioned prediction factor information (corresponding to the factor re-screening step S1A-2).

在該模型訓練步驟S2中,本發明係提出待訓練一預測模型,該預測模型係運用一機器學習網路架構所建立,特別是運用轉換器架構(Transformer Architecture),並將前述預測因子資訊輸入至待訓練的該預測模型,以建立經訓練的一預測模型。舉例而言,所述轉換器架構可以是一基礎轉換器架構或一稀疏注意力轉換器架構。 In the model training step S2, the present invention proposes a prediction model to be trained, which is established using a machine learning network architecture, especially a transformer architecture, and the aforementioned prediction factor information is input into the prediction model to be trained to establish a trained prediction model. For example, the transformer architecture can be a basic transformer architecture or a sparse attention transformer architecture.

詳言之,如第2圖所示,該基礎轉換器架構具有一編碼器網路(Encoder)與一解碼器網路(Decoder)。該編碼器網路依序包含一多頭自注意力層(Multi-head Self-attention Layer)、一加總與歸一化層(Add & Norm Layer)、一前饋網路層(Feedforward Network)及另一加總與歸一化層。該解碼器網路依序包含一遮罩多頭注意力層(Masked Multi-head Attention Layer)、一加總與歸一化層、一多頭跨注意力層(Multi-head Cross-attention Layer)、另一加總與歸一化層、一前饋網路層、再一加總與歸一化層、一線性層及一歸一化指數層。 In detail, as shown in Figure 2, the basic converter architecture has an encoder network (Encoder) and a decoder network (Decoder). The encoder network sequentially includes a multi-head self-attention layer (Multi-head Self-attention Layer), an addition and normalization layer (Add & Norm Layer), a feedforward network layer (Feedforward Network) and another addition and normalization layer. The decoder network sequentially includes a masked multi-head attention layer (Masked Multi-head Attention Layer), an addition and normalization layer, a multi-head cross-attention layer (Multi-head Cross-attention Layer), another addition and normalization layer, a feedforward network layer, another addition and normalization layer, a linear layer and a normalized exponential layer.

詳言之,如第3圖所示,該稀疏注意力轉換器架構係可視為由 該基礎轉換器架構所轉換;其中,該稀疏注意力轉換器架構係以一稀疏多頭自注意力層(Sparse Multi-head Self-attention)取代該基礎轉換器架構中的該遮罩多頭注意力層。 Specifically, as shown in FIG. 3, the sparse attention transformer architecture can be regarded as a transformation of the basic transformer architecture; wherein the sparse attention transformer architecture replaces the masked multi-head attention layer in the basic transformer architecture with a sparse multi-head self-attention layer.

應注意的是,所述基礎轉換器架構與稀疏注意力轉換器架構皆為本領域中具有通常知識依現有技術可理解且實現之技術特徵,故不再贅述對應架構中各層間的數據串流與處理結果的細節;此外,本發明的預測模型亦可運用其他各種不同機器學習網路架構所建立,並不以本發明中所舉例者為限。 It should be noted that the basic converter architecture and the sparse attention converter architecture are both technical features in the field that are commonly known and can be understood and implemented based on existing technologies, so the details of the data flow and processing results between the layers in the corresponding architectures will not be repeated; in addition, the prediction model of the present invention can also be established using various other machine learning network architectures, and is not limited to the examples given in the present invention.

在本發明的一具體實施範例中,在該模型訓練步驟S2中係以前述預測因子資訊進行該預測模型的訓練,且該預測模型係以所述基礎轉換器架構所建立;其中,所述預測因子資訊係可作為後續模型的訓練資料,並對應前述訓練資料的組合三。詳言之,該預測因子資訊(作為訓練資料)可拆分成一訓練資料集與一驗證資料集,以完成該預測模型的訓練。其中,根據重要關聯資訊(五種因子與缺陷發生資訊,僅對應該重要性分析步驟S1A-1的結果)與預測因子資訊(二種因子與缺陷發生資訊,包含該重要性分析步驟S1A-1與該因子再篩選步驟S1A-2的結果)所分別建立的預測模型的預測準確率進行比較,使用重要關聯資訊(對應該重要性分析步驟S1A-1)的預測準確率為92.67%,使用預測因子資訊(對應步驟S1A-1、S1A-2)的預測準確率為93.27%,而可證明初步所收集資料可以透過所述重要性分析步驟S1A-1與因子再篩選步驟S1A-2,來確認是否為與所要預測之目標特性(例如是缺陷發生),當最初所選擇因子種類與數量與最後篩選所得的因子種類與數量相同時,可以驗證/確保所選擇的因子與目標特性間為高關聯性,或當最後篩選所得的因子種類係為最初所選擇因子種類中所縮減的結果時,可以自所選擇的因子中移除與目標特性間較低關聯性者,進而使輸入資訊/訓 練資料降維(縮減因子種類),而提升對應預測模型在訓練過程中與建立後的應用過程中的預測準確率。其中,在所述重要關聯資訊中的五種因子係為前述之探針高度、針尖最小直徑、針尖平均直徑、針尖最大直徑及校正位置;在所述預測因子資訊中的二種因子係為前述之針尖最小直徑與校正位置。 In a specific implementation example of the present invention, in the model training step S2, the prediction model is trained with the prediction factor information, and the prediction model is established with the basic converter architecture; wherein the prediction factor information can be used as training data for subsequent models, and corresponds to the combination of the training data mentioned above three. In detail, the prediction factor information (as training data) can be split into a training data set and a validation data set to complete the training of the prediction model. Among them, the prediction accuracy of the prediction models established based on the important correlation information (five factors and defect occurrence information, corresponding only to the result of the importance analysis step S1A-1) and the prediction factor information (two factors and defect occurrence information, including the results of the importance analysis step S1A-1 and the factor re-screening step S1A-2) is compared. The prediction accuracy using the important correlation information (corresponding to the importance analysis step S1A-1) is 92.67%, and the prediction accuracy using the prediction factor information (corresponding to steps S1A-1 and S1A-2) is 93.27%, which can prove that the preliminary collected data can be used through the importance analysis Step S1A-1 and the factor re-screening step S1A-2 are used to confirm whether the target characteristic (e.g., defect occurrence) to be predicted is the same as the type and quantity of the factors initially selected and the type and quantity of the factors obtained by the final screening. It can be verified/ensured that the selected factors are highly correlated with the target characteristic. Alternatively, when the type of factors obtained by the final screening is the result of a reduction in the type of factors initially selected, the factors with a lower correlation with the target characteristic can be removed from the selected factors, thereby reducing the dimension of the input information/training data (reducing the type of factors) and improving the prediction accuracy of the corresponding prediction model during the training process and in the application process after establishment. Among them, the five factors in the important related information are the aforementioned probe height, minimum needle tip diameter, average needle tip diameter, maximum needle tip diameter and correction position; the two factors in the prediction factor information are the aforementioned minimum needle tip diameter and correction position.

特別地,針對所述預測因子資訊所建立的預測模型的預測準確率,基於相同的預測模型的架構,本發明進一步運用不同組合的因子來建立對應的預測模型,比較各種因子組合所建立之預測模型的預測準確率如下列表2所呈現,並由表2的結果可驗證,經過上述重要性分析步驟S1A-1與因子再篩選步驟S1A-2所篩選得到的因子,確實可以達成在建立對應的預測模型中具有最高的預測準確率的功效。 In particular, with respect to the prediction accuracy of the prediction model established by the prediction factor information, based on the same prediction model architecture, the present invention further uses different combinations of factors to establish corresponding prediction models, and compares the prediction accuracy of the prediction models established by various factor combinations as shown in Table 2 below. The results in Table 2 verify that the factors selected by the above-mentioned importance analysis step S1A-1 and factor re-screening step S1A-2 can indeed achieve the effect of having the highest prediction accuracy in establishing the corresponding prediction model.

Figure 112143258-A0305-02-0017-4
Figure 112143258-A0305-02-0017-4

特別地,基於運用前述針尖最小直徑與校正位置的兩種因子,進一步比較本發明的預測模型分別使用所述基礎轉換器架構、所述稀疏注意力轉換器架構的結果;其中,在運用所述基礎轉換器架構所建立的預測模型的預測準確率係為前述之93.27%,而在運用所述稀疏注意力轉換器架構所建立的預測模型的預測準確率係為93.98%。換言之,本發明的預測模型可運用該稀疏注意力轉換器架構取代該基礎轉換器架構,以提升所建立模型的預測 準確率。 In particular, based on the two factors of the minimum needle tip diameter and the correction position, the prediction model of the present invention is further compared using the basic converter architecture and the sparse attention converter architecture respectively; wherein the prediction accuracy of the prediction model established using the basic converter architecture is 93.27% as mentioned above, and the prediction accuracy of the prediction model established using the sparse attention converter architecture is 93.98%. In other words, the prediction model of the present invention can use the sparse attention converter architecture to replace the basic converter architecture to improve the prediction accuracy of the established model.

較佳地,在運用所述稀疏注意力轉換器架構所建立的訓練模型,係應用以下超參數組合。 Preferably, the following hyperparameter combination is applied in the training model established using the sparse attention transformer architecture.

1.在初始超參數中: 1. In the initial hyperparameters:

(1)批處理大小(batch_size)為64(預設為32)。 (1) The batch size (batch_size) is 64 (the default is 32).

(2)訓練回合(epoachs)為50。 (2) The number of training rounds (epochs) is 50.

(3)學習率(learning rate)為0.0001(預設為0.0001)。 (3) The learning rate is 0.0001 (the default setting is 0.0001).

2.在編碼器網路中的超參數配置: 2. Hyperparameter configuration in the encoder network:

(1)在多頭自注意力層中:頭部數量為8個,注意力鍵的維度(key_dim)為64。 (1) In the multi-head self-attention layer: the number of heads is 8, and the dimension of the attention key (key_dim) is 64.

(2)在前饋網路層中:丟棄率(dropout rate)設置為0.1;全連接層(Dense)的輸出維度為64,並使用ReLU激活函數。 (2) In the feedforward network layer: the dropout rate is set to 0.1; the output dimension of the fully connected layer (Dense) is 64, and the ReLU activation function is used.

3.在解碼器網路中的超參數配置: 3. Hyperparameter configuration in the decoder network:

(1)在稀疏注意力層中:頭部數量為8個,注意力鍵的維度64。 (1) In the sparse attention layer: the number of heads is 8 and the dimension of the attention key is 64.

(2)在多頭跨注意力層中:頭部數量為8個,注意力鍵的維度為64。 (2) In the multi-head cross-attention layer: the number of heads is 8 and the dimension of the attention key is 64.

(3)在前饋網路層中:丟棄率設置為0.1;全連接層的輸出維度為64,並使用ReLU激活函數。 (3) In the feedforward network layer: the dropout rate is set to 0.1; the output dimension of the fully connected layer is 64, and the ReLU activation function is used.

(4)在線性層中:輸出維度為2,使用ReLU激活函數。 (4) In the linear layer: the output dimension is 2, and the ReLU activation function is used.

據由前述內容,本發明提出一種探針記號缺陷之預測模型建立 方法,包含以該資料收集步驟S1(僅步驟S1),較佳額外以該重要性分析步驟S1A-1(包含步驟S1、S1A-1),更佳再額外以該因子再篩選步驟S1A-2(包含步驟S1、S1A-1、S1A-2),所獲取的各種因子與對應的探針記號缺陷是否發生的資訊作為一訓練資料(即以該預測因子資訊作為該訓練資料相同),將該訓練資料(該預測因子資訊)輸入待訓練的一預測模型以完成該預測模型的訓練/建立。 According to the above content, the present invention proposes a method for establishing a prediction model of a probe mark defect, including the data collection step S1 (only step S1), preferably the additional importance analysis step S1A-1 (including steps S1 and S1A-1), and more preferably the additional factor re-screening step S1A-2 (including steps S1, S1A-1, S1A-2), the various factors obtained and the corresponding information on whether the probe mark defect occurs are used as a training data (that is, the prediction factor information is the same as the training data), and the training data (the prediction factor information) is input into a prediction model to be trained to complete the training/establishment of the prediction model.

接者,透過上述方法所建立預測模型,本發明提出一種預測系統具有該預測模型,並透過由該預測模型持續接收/監控當前的一監控資料,即可獲得/預測是否會產生探針記號缺陷的一缺陷機率;其中,所述監控資料中各因子的種類係與該訓練資料(該預測因子資訊)中各因子的種類相同。 Next, through the prediction model established by the above method, the present invention proposes a prediction system having the prediction model, and by continuously receiving/monitoring the current monitoring data through the prediction model, a defect probability of whether a probe mark defect will be generated can be obtained/predicted; wherein the type of each factor in the monitoring data is the same as the type of each factor in the training data (the prediction factor information).

特別地,基於上述預測系統,本發明提出一種探針記號缺陷之預防方法,包含:在該預測系統接收/監控到當前一工件所對應的一監控資料而產生對應的一缺陷機率的一情形中,比對該缺陷機率是否超出一預設警示閾值;並在該缺陷機率超出該預設警示閾值的一情形中,於後續其他工件所對應的檢測程序執行前,調整該監控資料中至少一因子在後續的數值不同於該監控資料中該至少一因子在當前的數值,使後續的該監控資料中所具有的各因子的數值所對應的一缺陷機率不會超出該預設警示閾值,以避免持續使用具有高機率產生缺陷的數值,而降低產生缺陷的風險,達成預防缺陷發生的效果。其中,所述預設警示閾值係可依實際加工需求而設定;調整該監控資料中至少一因子的數值之方法係依各因子的特性而具有對應的調整方法。 In particular, based on the above-mentioned prediction system, the present invention proposes a method for preventing probe mark defects, comprising: in a situation where the prediction system receives/monitors a monitoring data corresponding to a current workpiece and generates a corresponding defect probability, comparing whether the defect probability exceeds a preset warning threshold; and in a situation where the defect probability exceeds the preset warning threshold, before the execution of the subsequent inspection procedures corresponding to other workpieces, adjusting the subsequent value of at least one factor in the monitoring data to be different from the current value of the at least one factor in the monitoring data, so that the defect probability corresponding to the value of each factor in the subsequent monitoring data will not exceed the preset warning threshold, so as to avoid the continuous use of values with a high probability of generating defects, thereby reducing the risk of generating defects and achieving the effect of preventing the occurrence of defects. Among them, the preset warning threshold can be set according to the actual processing requirements; the method of adjusting the value of at least one factor in the monitoring data has a corresponding adjustment method according to the characteristics of each factor.

可選地,可預先輸入對應該訓練資料/監控資料中各因子的不同數值所形成的數據組合,以預先建立一預測機率對照資料,該預測機率對照資料包含各種不同數據組合所對應的是否會產生探針記號缺陷的一缺陷機 率。如此,透過該預測機率對照資料,可作為調整該監控資料中各因子的數值之參考,使調整後的各因子具有低於該預設警示閾值的缺陷機率。 Optionally, a data combination formed by different values of each factor in the training data/monitoring data can be pre-entered to pre-establish a predicted probability comparison data, which includes a defect probability corresponding to various different data combinations for whether a probe mark defect will occur. In this way, the predicted probability comparison data can be used as a reference for adjusting the values of each factor in the monitoring data, so that the adjusted defect probability of each factor is lower than the preset warning threshold.

較佳地,在該缺陷機率超出該預設警示閾值的一情形中,調整該監控資料中至少一因子的數值的調整方式,係可由一電腦依預先建立的規則執行對應的調整,而可實現自動化的調整。 Preferably, in a situation where the defect probability exceeds the preset warning threshold, the adjustment method for adjusting the value of at least one factor in the monitoring data can be performed by a computer according to pre-established rules to perform corresponding adjustments, thereby achieving automated adjustment.

綜上所述,本發明的探針記號缺陷之預測模型建立方法,透過重要性分析步驟與因子再篩選步驟所獲得的因子組合,可以確保或提升對應預測模型在訓練過程中與建立後的應用過程中的預測準確率。另,透過使用稀疏注意力轉換器架構所建立的訓練模型及對應的超參數,可以提升所建立的預測模型的準確率。本發明的預測系統,透過具有上述方法所建立的預測模型,在接收對應的因子時可以產生對應的缺陷機率,以監控當前狀態下發生缺陷的風險。本發明的預防方法,透過比對缺陷機率是否超出預設警示閾值,而可以即時調整對應因子,以降低於探針卡檢測晶圓特性的過程中發生缺陷的風險。 In summary, the method for establishing a prediction model for probe mark defects of the present invention can ensure or improve the prediction accuracy of the corresponding prediction model during the training process and the application process after establishment through the combination of factors obtained by the importance analysis step and the factor re-screening step. In addition, the accuracy of the established prediction model can be improved by using the training model established by the sparse attention converter architecture and the corresponding hyperparameters. The prediction system of the present invention, through the prediction model established by the above method, can generate a corresponding defect probability when receiving the corresponding factor to monitor the risk of defects in the current state. The prevention method of the present invention can adjust the corresponding factor in real time by comparing whether the defect probability exceeds the preset warning threshold to reduce the risk of defects occurring during the process of the probe card detecting wafer characteristics.

雖然本發明已利用上述較佳實施例揭示,然其並非用以限定本發明,任何熟習此技藝者在不脫離本發明之精神和範圍之內,相對上述實施例進行各種更動與修改仍屬本發明所保護之技術範疇,因此本發明之保護範圍當包含後附之申請專利範圍所記載的文義及均等範圍內之所有變更。 Although the present invention has been disclosed using the above preferred embodiments, they are not intended to limit the present invention. Any person skilled in the art may make various changes and modifications to the above embodiments within the spirit and scope of the present invention, and the changes and modifications are still within the technical scope protected by the present invention. Therefore, the protection scope of the present invention shall include all changes within the meaning and equivalent scope recorded in the attached patent application scope.

S1:資料收集步驟 S1: Data collection step

S1A:資料篩選步驟 S1A: Data screening steps

S1A-1:重要性分析步驟 S1A-1: Importance analysis steps

S1A-2:因子再篩選步驟 S1A-2: Factor re-screening step

S2:模型訓練步驟 S2: Model training steps

Claims (7)

一種探針記號缺陷之預測模型建立方法,係透過一電腦執行以下步驟,所述步驟包含:一資料收集步驟,收集每次探針記號產生過程中的數種不同因子,且由每次所收集的數種不同因子形成一初步因子組合,並將各該初步因子組合與是否發生缺陷的資訊關聯,以定義一初步關聯資訊;一重要性分析步驟,運用一重要性分析法,將所述初步關聯資訊進行篩選,以獲得與該缺陷發生相關的經初步篩選因子組合,並將所述經初步篩選因子組合與是否發生缺陷的資訊關聯,以定義一重要關聯資訊;一因子再篩選步驟,包含運用一因子間關聯性分析方法將該重要關聯資訊轉換為對應的一相關係數矩陣,該相關係數矩陣中包含各因子與缺陷間之一相關性且包含任二種因子間的一關聯度;選擇該相關性不小於一第一篩選閾值所對應的各因子,以定義為一高相關性因子組合;在任二種因子間所對應的關聯度不小於一第二篩選閾值時,將該二種因子中與該缺陷的該相關性為較低者定義為一剔除因子;及自該高相關性因子組合中將與該剔除因子相同的因子移除,以獲取對應的經再次篩選因子組合,將所述經再次篩選因子組合與是否發生缺陷的資訊關聯,以定義一預測因子資訊;及一模型訓練步驟,將所述預測因子資訊輸入至待訓練的一預測模型,以建立經訓練的一預測模型,該預測模型係以一基礎轉換器架構所建立;所述基礎轉換器架構包含一編碼器網路與一解碼器網路;該編碼器網路依序包含一多頭自注意力層、一加總與歸一化層、一前饋網路層及另一加總與歸一化層; 該解碼器網路依序包含一遮罩多頭注意力層、一加總與歸一化層、一多頭跨注意力層、另一加總與歸一化層、一前饋網路層、一加總與歸一化層、一線性層及一歸一化指數層。 A method for establishing a prediction model of a probe mark defect is performed by a computer to execute the following steps, the steps comprising: a data collection step, collecting a plurality of different factors in each probe mark generation process, and forming a preliminary factor combination from the plurality of different factors collected each time, and associating each preliminary factor combination with information on whether a defect occurs to define a preliminary associated information; an importance analysis step, using an importance analysis method to screen the preliminary associated information to obtain experience related to the occurrence of the defect; A preliminary screening factor combination is performed, and the preliminary screening factor combination is associated with information on whether a defect occurs to define an important correlation information; a factor re-screening step includes using a factor correlation analysis method to convert the important correlation information into a corresponding correlation coefficient matrix, wherein the correlation coefficient matrix includes a correlation between each factor and the defect and a correlation between any two factors; each factor corresponding to the correlation being not less than a first screening threshold is selected to define a high correlation factor combination; between any two factors, When the correlation between the factors is not less than a second screening threshold, the factor with the lower correlation with the defect is defined as a rejection factor; and the factor identical to the rejection factor is removed from the high correlation factor combination to obtain a corresponding re-screened factor combination, and the re-screened factor combination is associated with information on whether a defect occurs to define a prediction factor information; and a model training step is performed to input the prediction factor information into a prediction model to be trained to establish a trained prediction factor. The prediction model is established by a basic transformer architecture; the basic transformer architecture includes an encoder network and a decoder network; the encoder network includes a multi-head self-attention layer, a summing and normalization layer, a feedforward network layer and another summing and normalization layer in sequence; the decoder network includes a masked multi-head attention layer, a summing and normalization layer, a multi-head cross-attention layer, another summing and normalization layer, a feedforward network layer, a summing and normalization layer, a linear layer and a normalized exponential layer in sequence. 一種探針記號缺陷之預測模型建立方法,係透過一電腦執行以下步驟,所述步驟包含:一資料收集步驟,收集每次探針記號產生過程中的數種不同因子,且由每次所收集的數種不同因子形成一初步因子組合,並將各該初步因子組合與是否發生缺陷的資訊關聯,以定義一初步關聯資訊;一重要性分析步驟,運用一重要性分析法,將所述初步關聯資訊進行篩選,以獲得與該缺陷發生相關的經初步篩選因子組合,並將所述經初步篩選因子組合與是否發生缺陷的資訊關聯,以定義一重要關聯資訊;一因子再篩選步驟,包含運用一因子間關聯性分析方法將該重要關聯資訊轉換為對應的一相關係數矩陣,該相關係數矩陣中包含各因子與缺陷間之一相關性且包含任二種因子間的一關聯度;選擇該相關性不小於一第一篩選閾值所對應的各因子,以定義為一高相關性因子組合;在任二種因子間所對應的關聯度不小於一第二篩選閾值時,將該二種因子中與該缺陷的該相關性為較低者定義為一剔除因子;及自該高相關性因子組合中將與該剔除因子相同的因子移除,以獲取對應的經再次篩選因子組合,將所述經再次篩選因子組合與是否發生缺陷的資訊關聯,以定義一預測因子資訊;及一模型訓練步驟,將所述預測因子資訊輸入至待訓練的一預測模型,以建立經訓練的一預測模型,該預測模型係以一稀疏注意力轉換器架構所建立; 所述稀疏注意力轉換器架構包含一編碼器網路與一解碼器網路;該編碼器網路依序包含一多頭自注意力層、一加總與歸一化層、一前饋網路層及另一加總與歸一化層;該解碼器網路依序包含一稀疏多頭自注意力層、一加總與歸一化層、一多頭跨注意力層、另一加總與歸一化層、一前饋網路層、一加總與歸一化層、一線性層及一歸一化指數層。 A method for establishing a prediction model of a probe mark defect is performed by a computer to execute the following steps, the steps comprising: a data collection step, collecting a plurality of different factors in each probe mark generation process, and forming a preliminary factor combination from the plurality of different factors collected each time, and associating each preliminary factor combination with information on whether a defect occurs to define preliminary associated information; an importance analysis step, using an importance analysis method to screen the preliminary associated information to obtain preliminary associated information related to the occurrence of the defect. A step of screening factor combinations, and correlating the preliminarily screened factor combinations with information on whether defects occur, to define important correlation information; a step of factor re-screening, including using a factor correlation analysis method to convert the important correlation information into a corresponding correlation coefficient matrix, wherein the correlation coefficient matrix includes a correlation between each factor and the defect and a correlation between any two factors; selecting each factor corresponding to a correlation not less than a first screening threshold to define a high correlation factor combination; between any two factors When the corresponding correlation is not less than a second screening threshold, the factor with the lower correlation with the defect is defined as a rejection factor; and the factor identical to the rejection factor is removed from the high correlation factor combination to obtain a corresponding re-screened factor combination, and the re-screened factor combination is associated with information on whether a defect occurs to define a prediction factor information; and a model training step is performed to input the prediction factor information into a prediction model to be trained to establish a trained prediction model, the prediction factor information The test model is established with a sparse attention transformer architecture; The sparse attention transformer architecture includes an encoder network and a decoder network; the encoder network sequentially includes a multi-head self-attention layer, a summation and normalization layer, a feedforward network layer and another summation and normalization layer; the decoder network sequentially includes a sparse multi-head self-attention layer, a summation and normalization layer, a multi-head cross-attention layer, another summation and normalization layer, a feedforward network layer, a summation and normalization layer, a linear layer and a normalized exponential layer. 如請求項2之探針記號缺陷之預測模型建立方法,其中,該預測模型中的超參數係配置如下;在初始超參數中:批處理大小為64,訓練回合為50,學習率為0.0001;在該編碼器網路中:該多頭自注意力層的頭部數量為8個,注意力鍵的維度為64;該前饋網路層中的丟棄率為0.1,全連接層的輸出維度為64並使用ReLU激活函數;及在該解碼器網路中:該稀疏多頭自注意力層的頭部數量為8個,注意力鍵的維度64;該前饋網路層中的丟棄率為0.1,全連接層的輸出維度為64並使用ReLU激活函數;該線性層的輸出維度為2,使用ReLU激活函數。 As in claim 2, a method for establishing a prediction model of a probe mark defect, wherein the hyperparameters in the prediction model are configured as follows; in the initial hyperparameters: the batch size is 64, the training rounds are 50, and the learning rate is 0.0001; in the encoder network: the number of heads of the multi-head self-attention layer is 8, and the dimension of the attention key is 64; the discard rate in the feedforward network layer is 0. 1, the output dimension of the fully connected layer is 64 and uses the ReLU activation function; and in the decoder network: the number of heads of the sparse multi-head self-attention layer is 8, and the dimension of the attention key is 64; the discard rate in the feedforward network layer is 0.1, the output dimension of the fully connected layer is 64 and uses the ReLU activation function; the output dimension of the linear layer is 2, using the ReLU activation function. 如請求項1至3中任一項之探針記號缺陷之預測模型建立方法,其中,該重要性分析法係為一隨機森林演算法。 A method for establishing a prediction model for probe mark defects as in any one of claims 1 to 3, wherein the importance analysis method is a random forest algorithm. 如請求項1至3中任一項之探針記號缺陷之預測模型建立方法,其中,該預測因子資訊係至少由一針尖最小直徑的因子、一校正位置的因子及對應的缺陷發生資訊所組成。 A method for establishing a prediction model for a probe mark defect as claimed in any one of claims 1 to 3, wherein the prediction factor information is composed of at least a factor of the minimum diameter of the needle tip, a factor of the correction position, and corresponding defect occurrence information. 一種用於預測探針記號缺陷的預測系統,包含由請求項1至5中任一項之探針記號缺陷之預測模型建立方法所建立的一預測模型,該預測模型係用於接收一監控資料以輸出是否會產生探針記號缺陷的一缺陷機率;該監控資料中各因子的種類係與該預測因子資訊中各因子的種類相同。 A prediction system for predicting probe mark defects, comprising a prediction model established by the method for establishing a prediction model for probe mark defects of any one of claim items 1 to 5, the prediction model is used to receive monitoring data to output a defect probability of whether a probe mark defect will occur; the type of each factor in the monitoring data is the same as the type of each factor in the prediction factor information. 一種用於減少探針記號缺陷的預防方法,包含如請求項6之 預測系統,在該預測系統接收當前一工件所對應的一監控資料而產生對應的一缺陷機率,且在該缺陷機率超出一預設警示閾值的一情形中,於後續其他工件所對應的檢測程序執行前,調整該監控資料中至少一因子在後續的數值不同於該監控資料中該至少一因子在當前的數值,使後續的該監控資料中所具有的各因子的數值所對應的一缺陷機率不超出該預設警示閾值。 A preventive method for reducing probe mark defects, comprising a prediction system as claimed in claim 6, wherein the prediction system receives monitoring data corresponding to a current workpiece and generates a corresponding defect probability, and in a case where the defect probability exceeds a preset warning threshold, before executing the inspection procedure corresponding to other subsequent workpieces, the subsequent value of at least one factor in the monitoring data is adjusted to be different from the current value of the at least one factor in the monitoring data, so that the defect probability corresponding to the value of each factor in the subsequent monitoring data does not exceed the preset warning threshold.
TW112143258A 2023-11-09 2023-11-09 Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof TWI860159B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW112143258A TWI860159B (en) 2023-11-09 2023-11-09 Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW112143258A TWI860159B (en) 2023-11-09 2023-11-09 Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof

Publications (2)

Publication Number Publication Date
TWI860159B true TWI860159B (en) 2024-10-21
TW202520107A TW202520107A (en) 2025-05-16

Family

ID=94084151

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112143258A TWI860159B (en) 2023-11-09 2023-11-09 Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof

Country Status (1)

Country Link
TW (1) TWI860159B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050282299A1 (en) * 2004-06-18 2005-12-22 Kwang-Soo Kim Wafer inspection system and method thereof
CN101261285A (en) * 2007-03-06 2008-09-10 台湾积体电路制造股份有限公司 System and method for automatically managing probe mark offset
CN116051498A (en) * 2023-01-09 2023-05-02 北京航空航天大学 A wafer surface defect detection method and detection model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050282299A1 (en) * 2004-06-18 2005-12-22 Kwang-Soo Kim Wafer inspection system and method thereof
CN101261285A (en) * 2007-03-06 2008-09-10 台湾积体电路制造股份有限公司 System and method for automatically managing probe mark offset
CN116051498A (en) * 2023-01-09 2023-05-02 北京航空航天大学 A wafer surface defect detection method and detection model

Also Published As

Publication number Publication date
TW202520107A (en) 2025-05-16

Similar Documents

Publication Publication Date Title
US12152869B2 (en) Monitoring system and method for verifying measurements in patterned structures
US10761128B2 (en) Methods and systems for inline parts average testing and latent reliability defect detection
KR102170473B1 (en) Systems and methods for wafer map analysis
TW202013314A (en) Method for determining wafer inspection parameters
CN110503288B (en) System and method for identifying yield loss reason considering machine interaction
CN105702595B (en) Wafer yield judgment method and multivariate inspection method for wafer qualification test
TWI623830B (en) System and method for identifying root causes of yield loss
CN116245256A (en) A capacitor quality prediction method, system and storage medium combining multiple factors
US12141230B2 (en) Process abnormality identification using measurement violation analysis
TW201931015A (en) Design criticality analysis augmented process window qualification sampling
TWI860159B (en) Establishing method for prediction model on probe mark defect, prediction system and prevention method thereof
CN113887990B (en) Optimization method for electrical equipment maintenance decision-making
TWI647770B (en) Yield rate determination method for wafer and method for multiple variable detection of wafer acceptance test
CN118916670A (en) Al-based integrated circuit test management system
JP7712473B2 (en) System and method for multi-dimensional dynamic part average testing - Patents.com
TW201135474A (en) Method for sampling workpiece for inspection and computer program product performing the same
TW202139114A (en) System and method for determining cause of abnormality in semiconductor manufacturing processes
CN119688004B (en) A method for detecting quality of electronic components
CN108459948B (en) Determination method of failure data distribution type in system reliability assessment
Aye et al. Data driven framework for degraded pogo pin detection in semiconductor manufacturing
Azlan et al. Automated Defect Detection in Package Physical Components Using Deep Learning Architecture-ResNeXt-101 with ATSS
CN116467557A (en) An accurate comparison method for user power outage event load data