[go: up one dir, main page]

CN116703155A - Risk identification method, risk identification equipment and computer readable storage medium - Google Patents

Risk identification method, risk identification equipment and computer readable storage medium Download PDF

Info

Publication number
CN116703155A
CN116703155A CN202310630270.XA CN202310630270A CN116703155A CN 116703155 A CN116703155 A CN 116703155A CN 202310630270 A CN202310630270 A CN 202310630270A CN 116703155 A CN116703155 A CN 116703155A
Authority
CN
China
Prior art keywords
risk
data
business data
sample
business
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310630270.XA
Other languages
Chinese (zh)
Inventor
王海祥
张伟忠
李思璇
韦耀浩
林轶欢
杨凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yuanguang Software Co Ltd
Original Assignee
Yuanguang Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yuanguang Software Co Ltd filed Critical Yuanguang Software Co Ltd
Priority to CN202310630270.XA priority Critical patent/CN116703155A/en
Publication of CN116703155A publication Critical patent/CN116703155A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/125Finance or payroll

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Operations Research (AREA)
  • Educational Administration (AREA)
  • Technology Law (AREA)
  • Probability & Statistics with Applications (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

本申请公开了一种风险识别方法、设备和计算机可读存储介质,该方法包括:获取目标业务数据;利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据;利用风险分类模型确定风险业务数据的风险类别;基于由风险业务数据所生成的样本业务数据,对风险识别模型进行增强训练,并基于由已确定风险类别的风险业务数据所生成的样本待分类数据,对风险分类模型进行增强训练,上述方式,能够提高风险识别的准确率。

The present application discloses a risk identification method, device and computer-readable storage medium. The method includes: obtaining target business data; using a risk identification model to divide the target business data to obtain risky business data and non-risk business data; using a risk classification model to determine The risk category of risky business data; based on the sample business data generated by the risky business data, the risk identification model is enhanced for training, and based on the sample data to be classified generated by the risky business data of the determined risk category, the risk classification model is Enhanced training, the above method, can improve the accuracy of risk identification.

Description

一种风险识别方法、设备和计算机可读存储介质A risk identification method, device and computer-readable storage medium

技术领域technical field

本申请涉及人工智能技术领域,特别是涉及一种风险识别方法、设备和计算机可读存储介质。The present application relates to the technical field of artificial intelligence, in particular to a risk identification method, device and computer-readable storage medium.

背景技术Background technique

当下,随着财务管理系统的广泛应用,与财务相关的业务可以通过财务管理系统而进行。为了强化资金风险管理,提升财务业务的安全性,可以基于财务管理系统产生的业务数据进行财务风险管控。At present, with the wide application of financial management systems, financial-related businesses can be carried out through financial management systems. In order to strengthen capital risk management and improve the security of financial business, financial risk control can be carried out based on the business data generated by the financial management system.

目前,存在利用风险识别模型进行风险管控的方式,但是现有的风险识别模型的准确率较低,对风险识别准确率低,因此,如何提高风险识别的准确率成为急需解决的问题。At present, there are ways to use risk identification models for risk management and control, but the accuracy of existing risk identification models is low, and the accuracy of risk identification is low. Therefore, how to improve the accuracy of risk identification has become an urgent problem to be solved.

发明内容Contents of the invention

本申请主要解决的技术问题是提供一种风险识别方法、设备和计算机可读存储介质,能够提高风险识别的准确率。The technical problem mainly solved by this application is to provide a risk identification method, device and computer-readable storage medium, which can improve the accuracy of risk identification.

为解决上述技术问题,本申请采用的一个技术方案是:提供一种风险识别方法,该方法包括:获取目标业务数据;利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据;利用风险分类模型确定风险业务数据的风险类别;基于由风险业务数据所生成的样本业务数据,对风险识别模型进行增强训练,并基于由已确定风险类别的风险业务数据所生成的样本待分类数据,对风险分类模型进行增强训练。In order to solve the above technical problems, a technical solution adopted by this application is to provide a risk identification method, which includes: obtaining target business data; using a risk identification model to divide target business data to obtain risky business data and non-risky business data; The risk classification model determines the risk category of the risk business data; based on the sample business data generated by the risk business data, the risk identification model is strengthened for training, and based on the sample data to be classified generated by the risk business data of which the risk category has been determined, Augmentation training of risk classification models.

其中,利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据包括:利用风险识别模型从目标业务数据中依次识别得到直接风险数据、间接风险数据和罕见风险数据;将直接风险数据、间接风险数据和罕见风险数据一并作为风险业务数据,以及将目标业务数据中除直接风险数据、间接风险数据和罕见风险数据之外的部分业务数据作为非风险业务数据。Among them, using the risk identification model to divide the target business data to obtain risk business data and non-risk business data includes: using the risk identification model to sequentially identify direct risk data, indirect risk data and rare risk data from the target business data; Indirect risk data and rare risk data are used together as risky business data, and some business data in the target business data except direct risk data, indirect risk data and rare risk data are regarded as non-risk business data.

其中,利用风险识别模型从目标业务数据中依次识别得到直接风险数据、间接风险数据和罕见风险数据包括:获取目标业务数据属于直接风险数据的第一概率;基于第一概率从目标业务数据中确定直接风险数据;获取目标业务数据中除直接风险数据之外的部分业务数据属于间接风险数据的第二概率;基于第二概率从目标业务数据中除直接风险数据之外的部分业务数据中确定间接风险数据;获取目标业务数据中除直接风险数据和间接风险数据之外的部分业务数据属于罕见风险数据的第三概率;基于第三概率从目标业务数据中除直接风险数据和间接风险数据之外的部分业务数据中确定罕见风险数据。Among them, using the risk identification model to sequentially identify the direct risk data, indirect risk data and rare risk data from the target business data includes: obtaining the first probability that the target business data belongs to the direct risk data; determining from the target business data based on the first probability Direct risk data; obtain the second probability that part of the business data except the direct risk data in the target business data belongs to the indirect risk data; determine the indirect risk data from the part of the target business data except the direct risk data based on the second probability Risk data; obtain the third probability that part of the business data in the target business data except direct risk data and indirect risk data belongs to rare risk data; based on the third probability, exclude direct risk data and indirect risk data from the target business data Identify rare risk data in some of the business data.

其中,利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据之后,利用风险分类模型确定风险业务数据的风险类别之前,该方法还包括:确定风险业务数据的明细类别,其中,明细类别数量大于风险类别的数量;利用风险分类模型确定风险业务数据的风险类别包括:利用风险分类模型将已确定明细类别的风险业务数据映射到分类函数维度,并确定各风险类别分别对应的分离超平面;在分类函数维度,基于分离超平面将已确定明细类别的风险业务数据划分为与各风险类别对应。Wherein, after using the risk identification model to divide the target business data to obtain the risk business data and non-risk business data, before using the risk classification model to determine the risk category of the risk business data, the method also includes: determining the detailed category of the risk business data, wherein the detailed The number of categories is greater than the number of risk categories; using the risk classification model to determine the risk category of risk business data includes: using the risk classification model to map the risk business data of the determined detailed category to the classification function dimension, and determine the corresponding separation of each risk category. plane; in the classification function dimension, based on the separation hyperplane, the risk business data of the determined detailed categories are divided into corresponding risk categories.

其中,该方法还包括:利用已确定明细类别的风险业务数据生成样本业务数据,其中,样本业务数据标注有真实风险标签,真实风险标签表征样本业务数据属于风险数据类型和非风险数据类型中的何种;基于由风险业务数据所生成的样本业务数据,对风险识别模型进行增强训练包括:利用风险识别模型划分样本业务数据,得到样本风险类型预测结果,基于样本风险类型预测结果确定样本风险数据和样本非风险数据,样本风险预测结果表征预测样本业务数据属于风险数据类型和非风险数据类型中的何种;基于风险预测结果和真实风险标签之间的第一差异调整风险识别模型的参数。Wherein, the method further includes: generating sample business data by using the risk business data of the determined detailed category, wherein the sample business data is marked with a real risk label, and the real risk label indicates that the sample business data belongs to the risk data type and the non-risk data type. What; Based on the sample business data generated by the risk business data, the enhanced training of the risk identification model includes: using the risk identification model to divide the sample business data, obtaining the prediction result of the sample risk type, and determining the sample risk data based on the prediction result of the sample risk type and sample non-risk data, the sample risk prediction result represents which of the risk data type and non-risk data type the predicted sample business data belongs to; adjust the parameters of the risk identification model based on the first difference between the risk prediction result and the real risk label.

其中,该方法还包括:利用已确定风险类别的风险业务数据生成样本待分类数据,其中,样本待分类数据标注有真实类别标签,真实类别标签表征样本待分类数据所属的风险类别;基于由已确定风险类别的风险业务数据所生成的样本待分类数据,对风险分类模型进行增强训练包括:利用风险分类模型确定样本待分类数据的风险类别,作为样本预测分类结果;基于样本预测分类结果和真实类别标签之间的第二差异调整风险分类模型的参数。Wherein, the method further includes: using the risk business data of the determined risk category to generate sample data to be classified, wherein the sample data to be classified is marked with a real category label, and the real category label represents the risk category to which the sample to be classified data belongs; The enhanced training of the risk classification model includes: using the risk classification model to determine the risk category of the sample data to be classified, as a sample to predict the classification result; predicting the classification result based on the sample and the real The second difference between the class labels adjusts the parameters of the risk classification model.

其中,风险类别包括收支监控、预算监控、事中监控和融资监控中的至少一者。Wherein, the risk category includes at least one of revenue and expenditure monitoring, budget monitoring, ongoing monitoring and financing monitoring.

其中,每间隔第一预设时长执行增强训练,每间隔第二预设时长获取目标业务数据。Wherein, the enhanced training is performed every first preset time interval, and the target service data is acquired every second preset time interval.

其中,该方法还包括:将基于目标业务数据获取的风险相关数据存储到数据库中,以响应于数据调用请求,发送与数据调用请求匹配的数据至目标对象,其中,风险相关数据包括风险业务数据和非风险业务数据。Wherein, the method further includes: storing the risk-related data obtained based on the target business data into the database, and sending the data matching the data call request to the target object in response to the data call request, wherein the risk-related data includes risk business data and non-risk business data.

为解决上述技术问题,本申请采用的另一个技术方案是:提供一种电子设备,包括相互耦接的存储器和处理器,处理器用于执行存储器中存储的程序指令,以实现上述任一风险识别方法。In order to solve the above technical problems, another technical solution adopted by this application is to provide an electronic device, including a memory and a processor coupled to each other, and the processor is used to execute the program instructions stored in the memory to realize any of the above risk identification method.

为解决上述技术问题,本申请采用的另一个技术方案是:提供一种计算机可读存储介质,其上存储有程序指令,程序指令被处理器执行时实现上述任一风险识别方法。In order to solve the above technical problems, another technical solution adopted by the present application is to provide a computer-readable storage medium on which program instructions are stored, and when the program instructions are executed by a processor, any of the above risk identification methods is implemented.

上述方案,利用风险识别模型从目标业务数据中识别得到风险业务数据,用于生成样本业务数据以对风险识别模型进行增强训练,利用风险分类模型确定风险业务数据的风险类别,用于生成样本待分类数据以对风险分类模型进行增强训练,一方面增加了模型训练的样本数量,增加了模型学习的信息量,提高模型的准确性,从而提升风险识别的准确性,另一方面,通过模型对目标业务数据的处理,挖掘了目标业务数据中关于财务风险的信息,并用于模型的训练中,增加了模型学习的信息量,提高模型的准确性,从而提升风险识别的准确性。In the above solution, the risk business data is identified from the target business data by using the risk identification model, which is used to generate sample business data for enhanced training of the risk identification model, and the risk classification model is used to determine the risk category of the risk business data, which is used to generate the sample to be Classification data is used to enhance the training of the risk classification model. On the one hand, it increases the number of samples for model training, increases the amount of information learned by the model, and improves the accuracy of the model, thereby improving the accuracy of risk identification. On the other hand, through the model to The processing of target business data excavates the information about financial risks in the target business data and uses it in model training, which increases the amount of information learned by the model and improves the accuracy of the model, thereby improving the accuracy of risk identification.

附图说明Description of drawings

图1是本申请风险识别方法一实施例的流程示意图;Fig. 1 is a schematic flow chart of an embodiment of the risk identification method of the present application;

图2是本申请步骤S120另一实施例的流程示意图;FIG. 2 is a schematic flow diagram of another embodiment of step S120 of the present application;

图3是本申请步骤S221另一实施例的流程示意图;FIG. 3 is a schematic flow diagram of another embodiment of step S221 of the present application;

图4是本申请风险识别方法另一实施例的流程示意图;Fig. 4 is a schematic flow chart of another embodiment of the risk identification method of the present application;

图5是本申请风险识别方法再一实施例的流程示意图;Fig. 5 is a schematic flow chart of another embodiment of the risk identification method of the present application;

图6是本申请电子设备一实施例的框架示意图;Fig. 6 is a schematic frame diagram of an embodiment of the electronic device of the present application;

图7是本申请计算机可读存储介质一实施例的框架示意图。Fig. 7 is a schematic diagram of an embodiment of a computer-readable storage medium of the present application.

具体实施方式Detailed ways

为使本申请的目的、技术方案及效果更加清楚、明确,以下参照附图并举实施例对本申请进一步详细说明。以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、接口、技术之类的具体细节,以便透彻理解本申请。In order to make the purpose, technical solution and effect of the present application more clear and definite, the present application will be further described in detail below with reference to the accompanying drawings and examples. In the following description, for purposes of illustration rather than limitation, specific details, such as specific system architectures, interfaces, and techniques, are set forth in order to provide a thorough understanding of the present application.

本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。此外,本文中的“多”表示两个或者多于两个。另外,本文中术语“至少一种”表示多种中的任意一种或多种中的至少两种的任意组合,例如,包括A、B、C中的至少一种,可以表示包括从A、B和C构成的集合中选择的任意一个或多个元素。The term "and/or" in this article is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and/or B can mean: A exists alone, A and B exist simultaneously, and there exists alone B these three situations. In addition, the character "/" in this article generally indicates that the contextual objects are an "or" relationship. In addition, "many" herein means two or more than two. In addition, the term "at least one" herein means any one of a variety or any combination of at least two of the more, for example, including at least one of A, B, and C, which may mean including from A, Any one or more elements selected from the set formed by B and C.

本申请中所述的风险识别方法可以由一电子设备执行,又可以称为执行设备,该电子设备可以为具有处理能力的任意设备,例如,手机、电脑等。The risk identification method described in this application can be executed by an electronic device, which can also be referred to as an execution device, and the electronic device can be any device with processing capabilities, such as a mobile phone, a computer, and the like.

请参阅图1,图1是本申请风险识别方法一实施例的流程示意图。Please refer to FIG. 1 . FIG. 1 is a schematic flowchart of an embodiment of a risk identification method of the present application.

具体而言,该方法可以包括:Specifically, the method can include:

步骤S110:获取目标业务数据。Step S110: Obtain target service data.

其中,目标业务数据可以为与财务、资金相关的业务数据,例如,关于差旅报销的业务数据等等。此外,目标业务数据可以包括若干条业务数据。Wherein, the target business data may be business data related to finance and funds, for example, business data about travel reimbursement and the like. In addition, the target business data may include several pieces of business data.

具体来说,目标业务数据可以为财务管理系统产生的业务数据,执行设备可以从财务管理系统获取目标业务数据。Specifically, the target business data may be business data generated by the financial management system, and the executing device may obtain the target business data from the financial management system.

步骤S120:利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据。Step S120: Use the risk identification model to divide the target business data to obtain risky business data and non-risk business data.

需要说明的是,风险识别模型可以为预先进行训练得到的模型,可以用于对输入模型的业务数据进行风险识别,确定业务数据为风险数据类型或者非风险数据类型,从而将业务数据划分为风险业务数据和非风险业务数据。其中,风险数据类型表示该业务数据被判定为存在资金风险,非风险数据类型表示该业务数据被判定为不存在资金风险。It should be noted that the risk identification model can be a pre-trained model, which can be used to identify the risks of the business data input into the model, and determine whether the business data is a risk data type or a non-risk data type, thereby classifying the business data into risk data types. Business data and non-risk business data. Wherein, the risk data type indicates that the business data is judged to have capital risk, and the non-risk data type represents that the business data is judged to have no capital risk.

具体来说,目标业务数据可以包括若干条业务数据,风险业务数据可以包括目标业务数据中被识别为风险数据类型的业务数据,非风险业务数据可以包括目标业务数据中被识别为非风险数据类型的业务数据。Specifically, the target business data can include several pieces of business data, the risky business data can include business data identified as risky data types in the target business data, and non-risk business data can include business data identified as non-risk data types in the target business data business data.

其中,风险识别模型可以为利用人工标注后的业务数据预先进行训练得到的。Wherein, the risk identification model may be obtained by pre-training using manually marked business data.

步骤S130:利用风险分类模型确定风险业务数据的风险类别。Step S130: Use the risk classification model to determine the risk category of the risk business data.

需要说明的是,风险分类模型可以为预先进行训练得到的模型,可以用于对输入的业务数据进行风险分类,确定业务数据的风险类别,从而将业务数据划分为与各个风险类别对应。其中,风险类别可以包括预先确定的若干种风险类别。一些实施例中,风险类别包括收支监控、预算监控、事中监控、融资监控四种。It should be noted that the risk classification model can be a pre-trained model, which can be used to classify the risks of the input business data, determine the risk categories of the business data, and divide the business data into corresponding risk categories. Wherein, the risk category may include several predetermined risk categories. In some embodiments, risk categories include revenue and expenditure monitoring, budget monitoring, ongoing monitoring, and financing monitoring.

其中,风险分类模型可以为利用人工标注后的业务数据预先进行训练得到的。Wherein, the risk classification model may be obtained through pre-training using manually labeled business data.

当然,风险分类模型能够划分的风险类别也可以不局限于上述实施例中的类别,可以根据实际应用需要而调整风险分类模型所能够划分的风险类别。Of course, the risk categories that can be classified by the risk classification model are not limited to the categories in the above embodiments, and the risk categories that can be classified by the risk classification model can be adjusted according to actual application requirements.

其中,风险业务数据可以为由风险识别模型输出的,由风险识别模型对目标业务数据进行风险识别得到的。Wherein, the risk business data may be output by the risk identification model and obtained by risk identification of the target business data by the risk identification model.

步骤S140:基于由风险业务数据所生成的样本业务数据,对风险识别模型进行增强训练,并基于由已确定风险类别的风险业务数据所生成的样本待分类数据,对风险分类模型进行增强训练。Step S140: Based on the sample business data generated from the risk business data, the risk identification model is enhanced for training, and the risk classification model is enhanced for training based on the sample data to be classified generated from the risk business data of the determined risk category.

其中,风险业务数据为风险识别模型对目标业务数据进行风险识别得到的,用于生成样本业务数据,样本业务数据用于对风险识别模型进行增强训练。已确定风险类别的风险业务数据为风险分类模型对风险业务数据进行风险分类得到的,用于生成样本待分类数据,样本待分类数据用于对风险分类模型进行增强训练。Wherein, the risk business data is obtained by risk identification of the target business data by the risk identification model, and is used to generate sample business data, and the sample business data is used for enhanced training of the risk identification model. The risk business data of the determined risk category is obtained by risk classification of the risk business data by the risk classification model, and is used to generate sample data to be classified, and the sample data to be classified is used for enhanced training of the risk classification model.

上述方案,利用风险识别模型从目标业务数据中识别得到风险业务数据,用于生成样本业务数据以对风险识别模型进行增强训练,利用风险分类模型确定风险业务数据的风险类别,用于生成样本待分类数据以对风险分类模型进行增强训练,一方面增加了模型训练的样本数量,增加了模型学习的信息量,提高模型的准确性,从而提升风险识别的准确性,另一方面,通过模型对目标业务数据的处理,从杂乱无章的目标业务数据中挖掘了目标业务数据中关于财务风险的信息,并用于模型的训练中,增加了模型学习的信息量,提高模型的准确性,从而提升风险识别的准确性。In the above solution, the risk business data is identified from the target business data by using the risk identification model, which is used to generate sample business data for enhanced training of the risk identification model, and the risk classification model is used to determine the risk category of the risk business data, which is used to generate the sample to be Classification data is used to enhance the training of the risk classification model. On the one hand, it increases the number of samples for model training, increases the amount of information learned by the model, and improves the accuracy of the model, thereby improving the accuracy of risk identification. On the other hand, through the model to The processing of target business data mines the information about financial risks in the target business data from the chaotic target business data, and uses it in the training of the model, which increases the amount of information learned by the model, improves the accuracy of the model, and thus improves risk identification accuracy.

请参阅图2,图2是本申请步骤S120另一实施例的流程示意图。具体而言,步骤S120包括:Please refer to FIG. 2 . FIG. 2 is a schematic flowchart of another embodiment of step S120 of the present application. Specifically, step S120 includes:

步骤S221:利用风险识别模型从目标业务数据中依次识别得到直接风险数据、间接风险数据和罕见风险数据。Step S221: Use the risk identification model to sequentially identify direct risk data, indirect risk data and rare risk data from the target business data.

其中,直接风险数据表示财务管理系统直接产生的、存在财务风险的业务数据,间接风险数据表示财务管理系统间接产生的、存在财务风险的业务数据,罕见风险数据表示出现频率较低的、存在财务风险的业务数据。Among them, the direct risk data refers to the business data directly generated by the financial management system and has financial risks; the indirect risk data refers to the business data indirectly generated by the financial management system and has financial risks; Risky business data.

在一具体的应用场景中,对于疑似重复支付监控,满足以下条件的业务数据可以被识别为直接风险数据:1、以当天监控最新交易流水记录为监控对象,往前查询历史近30天范围内(含30天)与其在收付款方名称、账号、金额、摘要均一致的支出记录,监控重复笔数>0(不含监控对象本笔)的付款记录;2、交易方向=支出;3、剔除支付摘要中包含“上交”、“批归”、“实归”、“归集”、“资金池成员”字样的交易记录。In a specific application scenario, for suspected repeated payment monitoring, business data that meets the following conditions can be identified as direct risk data: 1. Take the latest transaction records of the current monitoring day as the monitoring object, and query the history within the past 30 days (Including 30 days) Expenditure records consistent with the name, account number, amount, and summary of the payee and payee, monitor the payment records with the number of repeated transactions > 0 (excluding the monitoring object's current transaction); 2. Transaction direction = expenditure; 3. Exclude transaction records that contain the words "handed in", "approved", "actually returned", "collected", and "fund pool member" in the payment summary.

在一具体的应用场景中,对于应收票据预算执行业务,业务数据可以包括财务管理系统间接产生的应收票据执行偏差率(x)=|收取票据规模的(实际执行数-二次排程数)|/二次排程数*100%,(大于100%时取100%;分母为零,分子不为零,偏差率为100%;分母为零,分子也为零,偏差率为0%;),该业务数据在满足风险判断条件的情况可以被识别为间接风险数据。In a specific application scenario, for the bill receivable budget execution business, the business data may include the execution deviation rate (x) of the bill receivable indirectly generated by the financial management system = | the scale of bills collected (actual execution number - secondary schedule number)|/number of secondary schedules*100%, (take 100% when it is greater than 100%; if the denominator is zero, the numerator is not zero, and the deviation rate is 100%; if the denominator is zero, the numerator is also zero, and the deviation rate is 0 %;), the business data can be identified as indirect risk data if it satisfies the risk judgment conditions.

在一具体的应用场景中,对于账户状态异常监控,对满足以下条件的业务数据可以被识别为罕见风险数据:1、银行账户的“挂接状态=未挂接”且“是否办理无法挂接备案=否”;2、银行账户的“监控状态=未授权”且“是否办理无法监控备案=否”。In a specific application scenario, for the abnormal monitoring of account status, business data that meets the following conditions can be identified as rare risk data: 1. The bank account's "attachment status = unattached" and "whether the transaction cannot be attached" Filing = No"; 2. "Monitoring status = Unauthorized" of the bank account and "Whether the record cannot be monitored = No".

步骤S222:将直接风险数据、间接风险数据和罕见风险数据一并作为风险业务数据,以及将目标业务数据中除直接风险数据、间接风险数据和罕见风险数据之外的部分业务数据作为非风险业务数据。Step S222: Take direct risk data, indirect risk data and rare risk data together as risk business data, and use part of the business data in the target business data except direct risk data, indirect risk data and rare risk data as non-risk business data.

其中,直接风险数据、间接风险数据和罕见风险数据三者均为存在财务风险的业务数据,一并作为风险业务数据。目标业务数据中除了上述三者之外的部分业务数据,表示被判定为不属于上述三种风险数据,不存在资金风险,则可以作为非风险业务数据。Among them, direct risk data, indirect risk data and rare risk data are all business data with financial risks, and they are regarded as risk business data together. Part of the business data in the target business data other than the above three, which means that it is judged not to belong to the above three risk data, and there is no capital risk, can be regarded as non-risk business data.

一些实施例中,在风险识别模型进行风险识别过程中,还可以根据实际应用需要而调整风险业务数据所包含的风险数据种类。示例性地,利用风险识别模型从目标业务数据中依次识别得到直接风险数据、间接风险数据。或者,也可以增加风险业务数据所包含的风险数据种类。In some embodiments, during the risk identification process of the risk identification model, the types of risk data included in the risk business data may also be adjusted according to actual application needs. Exemplarily, direct risk data and indirect risk data are sequentially identified from target business data by using a risk identification model. Alternatively, the types of risk data included in the risk business data may also be increased.

请参阅图3,图3是本申请步骤S221另一实施例的流程示意图。具体而言,步骤S221包括:Please refer to FIG. 3 , which is a schematic flowchart of another embodiment of step S221 of the present application. Specifically, step S221 includes:

步骤S3211:获取目标业务数据属于直接风险数据的第一概率。Step S3211: Obtain the first probability that the target business data belongs to the direct risk data.

具体来说,不同种类的风险数据的识别方式有所差异,直接风险数据、间接风险数据和罕见风险数据采用不同的方式进行识别。首先从目标业务数据中识别出直接风险数据,而后在目标业务数据中除直接风险数据之外的部分识别出间接风险数据,最后在目标业务数据中除直接风险数据、间接风险数据之外的部分识别出罕见风险数据,以将目标业务数据区分为了风险业务数据和非风险业务数据。Specifically, different types of risk data are identified in different ways, and direct risk data, indirect risk data, and rare risk data are identified in different ways. First identify the direct risk data from the target business data, then identify the indirect risk data in the target business data except for the direct risk data, and finally identify the target business data except for the direct risk data and indirect risk data. Rare risk data is identified to distinguish target business data into risk business data and non-risk business data.

一些实施例中,可以通过如下公式计算各条业务数据属于直接风险数据的第一概率:In some embodiments, the first probability that each piece of business data belongs to direct risk data can be calculated by the following formula:

其中,Pr(S/W)表示该条业务数据属于直接风险数据的第一概率,Pr(S)表示任何业务数据是风险数据的总体概率,Pr(W/S)表示“业务数据中风险数据”出现在风险数据中的总体概率,Pr(H)表示任何业务数据不是风险数据的概率,Pr(W/H)表示风险数据出现在业务数据中的概率,Pr(S)、Pr(W/S)、Pr(H)、Pr(W/H)基于目标业务数据计算得到。Among them, Pr(S/W) indicates the first probability that this piece of business data belongs to direct risk data, Pr(S) indicates the overall probability that any business data is risk data, and Pr(W/S) indicates that "risk data in business data "The overall probability of appearing in risk data, Pr(H) represents the probability that any business data is not risk data, Pr(W/H) represents the probability of risk data appearing in business data, Pr(S), Pr(W/ S), Pr(H), and Pr(W/H) are calculated based on target business data.

步骤S3212:基于第一概率从目标业务数据中确定直接风险数据。Step S3212: Determine direct risk data from target business data based on the first probability.

具体来说,可以根据业务数据的第一概率是否满足第一预设要求而确定该条业务数据是否属于直接风险数据。Specifically, it may be determined whether the piece of business data belongs to direct risk data according to whether the first probability of the business data satisfies the first preset requirement.

在一具体的应用场景中,第一预设条件可以为达到预设第一概率阈值,从而将第一概率达到第一概率阈值的业务数据作为直接风险数据。当然,第一预设条件可以根据实际应用需要而调整。In a specific application scenario, the first preset condition may be reaching a preset first probability threshold, so that business data whose first probability reaches the first probability threshold is taken as direct risk data. Of course, the first preset condition can be adjusted according to actual application needs.

步骤S3213:获取目标业务数据中除直接风险数据之外的部分业务数据属于间接风险数据的第二概率。Step S3213: Obtain the second probability that part of the business data in the target business data except the direct risk data belongs to the indirect risk data.

其中,业务数据可以包括若干子数据,对于一条业务数据来说,其是否属于间接风险数据与可以其子数据的风险概率关联。Wherein, business data may include several sub-data, and for a piece of business data, whether it belongs to indirect risk data may be associated with the risk probability of its sub-data.

一些实施例中,可以通过如下公式计算目标业务数中除直接风险数据之外的各条业务数据属于间接风险数据的第二概率:In some embodiments, the second probability that each piece of business data in the target business data other than the direct risk data belongs to the indirect risk data can be calculated by the following formula:

其中,p表示该业务数据是间接风险数据的第二概率,p1表示该业务数据的第一条子数据对应的是风险数据的p(S/W1)概率,p2表示该业务数据第二条子数据对应的是风险数据的p(S/W2)概率,……,pn表示该业务数据第n条子数据对应的是风险数据的p(S/WN)概率。p1、p2、……pn为基于该条业务数据对应的各子数据得到。Among them, p represents the second probability that the business data is indirect risk data, p 1 represents the p(S/W 1 ) probability that the first sub-data of the business data corresponds to the risk data, and p 2 represents the second probability of the business data The two pieces of sub-data correspond to the p(S/W 2 ) probability of the risk data, ..., p n indicates that the nth piece of sub-data of the business data corresponds to the p(S/W N ) probability of the risk data. p 1 , p 2 , ... p n are obtained based on the sub-data corresponding to the piece of business data.

步骤S3214:基于第二概率,从目标业务数据中除直接风险数据之外的部分业务数据中确定间接风险数据。Step S3214: Based on the second probability, determine the indirect risk data from part of the target business data except the direct risk data.

具体来说,可以根据业务数据的第二概率是否满足第二预设要求而确定该条业务数据是否属于间接风险数据。Specifically, it may be determined whether the piece of business data belongs to indirect risk data according to whether the second probability of the business data meets the second preset requirement.

在一具体的应用场景中,第二预设条件可以为达到预设第二概率阈值,从而将第二概率达到第二概率阈值的业务数据作为间接风险数据。当然,第二预设条件可以根据实际应用需要而调整。In a specific application scenario, the second preset condition may be reaching a preset second probability threshold, so that business data whose second probability reaches the second probability threshold is taken as indirect risk data. Of course, the second preset condition can be adjusted according to actual application needs.

步骤S3215:获取目标业务数据中除直接风险数据、间接风险数据之外的部分业务数据属于罕见风险数据的第三概率。Step S3215: Obtain the third probability that part of the business data in the target business data except direct risk data and indirect risk data belongs to rare risk data.

需要说明的是,罕见风险数据为数据出现频率比较低的、存在财务风险的数据。可以通过如下公式计算目标业务数据中除直接风险数据和间接风险数据之外的各条业务数据的第三概率:It should be noted that rare risk data refers to data with a relatively low frequency of occurrence and financial risk. The third probability of each item of business data in the target business data except direct risk data and indirect risk data can be calculated by the following formula:

其中,上述公式可以扩展到其中n等于零(没有定义垃圾性的)的情况,并在这种情况下估值为PrS。Among other things, the above formula can be extended to the case where n is equal to zero (no junkness defined), and evaluates to PrS in this case.

其中,P'r(S/W)表示业务数据为罕见风险数据的第三概率,也可以理解为业务数据的更正概率(即在第一概率的基础上更新计算的概率),s表示有关输入风险数据的有关背景信息的强度,Pr(S)表示任何输入数据为风险数据的概率,n表示这个数据在学习阶段出现的次数,Pr(S/W)表示该条业务数据属于直接风险数据的第一概率。Among them, P'r(S/W) represents the third probability that business data is rare risk data, and can also be understood as the correction probability of business data (that is, the probability of updating calculations based on the first probability), and s represents the relevant input The strength of background information about risk data, Pr(S) indicates the probability that any input data is risk data, n indicates the number of times this data appears in the learning phase, Pr(S/W) indicates that the business data belongs to direct risk data first probability.

一些实施例中,PrS可以再次取等于0.5,以避免过于怀疑输入的业务数据。3是s的一个良好值,意味着学习的语料库必须包含超过该业务信息的3个信息,在风险性值中要比在默认值投入更多的信心。In some embodiments, PrS can be equal to 0.5 again, so as to avoid too much suspicion of the input service data. 3 is a good value for s, meaning that the learned corpus must contain 3 more information than the business information, put more confidence in the risky value than in the default value.

步骤S3216:基于第三概率,从目标业务数据中除直接风险数据、间接风险数据之外的部分业务数据中确定罕见风险数据。Step S3216: Based on the third probability, determine rare risk data from part of the target business data except direct risk data and indirect risk data.

具体来说,可以根据业务数据的第三概率是否满足第三预设要求而确定该条业务数据是否属于罕见风险数据。Specifically, it may be determined whether the piece of business data belongs to rare risk data according to whether the third probability of the business data satisfies the third preset requirement.

在一具体的应用场景中,第三预设条件可以为达到预设第三概率阈值,从而将第三概率达到第三概率阈值的业务数据作为罕见风险数据。当然,第三预设条件可以根据实际应用需要而调整。In a specific application scenario, the third preset condition may be reaching a preset third probability threshold, so that business data whose third probability reaches the third probability threshold is regarded as rare risk data. Of course, the third preset condition can be adjusted according to actual application needs.

请参阅图4,图4是本申请风险识别方法另一实施例的流程示意图。Please refer to FIG. 4 . FIG. 4 is a schematic flowchart of another embodiment of the risk identification method of the present application.

具体而言,该方法可以包括如下步骤:Specifically, the method may include the following steps:

步骤S410:获取目标业务数据。Step S410: Obtain target service data.

一些实施例中,执行设备可以每间隔第二预设时间获取目标业务数据,进一步地,每间隔第二预设时间从财务管理系统中获取目标业务数据。In some embodiments, the executing device may acquire target business data at intervals of a second preset time, and further, acquire target business data from the financial management system at intervals of a second preset time.

在一具体的应用场景中,第二预设时长可以为5分钟。当然,第二预设时长还可以根据实际业务数据量而评估调整,示例性地,第二预设时长也可以为预设时长。In a specific application scenario, the second preset duration may be 5 minutes. Of course, the second preset duration can also be evaluated and adjusted according to the actual service data volume, for example, the second preset duration can also be a preset duration.

一些实施例中,也可以根据财务管理系统中产生的、未处理的业务数据量而触发获取目标业务数据,示例性地,在未处理的业务数据量达到第一预设数量的情况下,执行设备获取未处理的业务数据作为目标业务数据。In some embodiments, acquisition of target business data may also be triggered according to the amount of unprocessed business data generated in the financial management system. For example, when the amount of unprocessed business data reaches a first preset amount, execute The device acquires unprocessed business data as target business data.

一些实施例中,在获取到目标业务数据之后,还可以对目标业务数据进行预处理,预处理可以包括但不限于脱敏、清洗等。In some embodiments, after the target business data is acquired, the target business data may also be preprocessed, and the preprocessing may include but not limited to desensitization, cleaning, and the like.

在一些实施例中,在获取到目标业务数据后,还可以将目标业务数据存储到执行设备中的数据库中,以供后续调用。In some embodiments, after the target business data is acquired, the target business data may also be stored in a database in the executing device for subsequent calling.

通过周期性地获取财务管理系统产生的业务数据进行风险识别和风险分类,每次获取到的目标业务数据中可以包括有正在进行的业务的业务数据,以及还可以包括有已经完成的业务的业务数据。示例性地,例如,A报销正在报销流程中,B报销已经完成。对于已经完成的业务的业务数据进行风险识别和风险分类,能够实现对已经发生的资金风险进行评估。对于正在进行的业务的业务数据进行风险识别和风险分类,能够实现对即将发生的资金风险进行预测。因此,通过上述方式能够对已经产生的资金风险和即将发生的资金风险进行识别,做到资金风险识别的全面覆盖,准确、全面地识别资金风险。By periodically acquiring business data generated by the financial management system for risk identification and risk classification, the target business data acquired each time may include business data of ongoing business and business that has been completed data. Exemplarily, for example, A's reimbursement is in the process of reimbursement, and B's reimbursement has been completed. Risk identification and risk classification are carried out on the business data of the completed business, which can realize the assessment of the capital risk that has occurred. The risk identification and risk classification of the business data of the ongoing business can realize the prediction of the upcoming capital risk. Therefore, through the above method, it is possible to identify the capital risk that has already occurred and the capital risk that is about to occur, so as to achieve full coverage of capital risk identification and accurately and comprehensively identify capital risk.

步骤S420:利用风险识别模型划分目标业务数据得到风险业务数据和非风险业务数据。Step S420: Use the risk identification model to divide the target business data to obtain risky business data and non-risk business data.

在一具体的应用场景中,将目标业务数据这一数据集输入风险识别模型中,对于模型中的各条业务数据进行风险识别,得到风险类型预测结果,风险类型预测结果表示业务数据属于风险数据类型和非风险数据类型中的何种。基于风险类型预测结果可以将风险业务数据划分得到风险业务数据和非风险业务数据两个数据集。In a specific application scenario, input the data set of target business data into the risk identification model, carry out risk identification on each piece of business data in the model, and obtain the risk type prediction result. The risk type prediction result indicates that the business data belongs to risk data Which of type and non-risk data type. Based on the risk type prediction results, the risky business data can be divided into two data sets: risky business data and non-risky business data.

一些实施例中,执行设备可以将风险识别模型处理得到的风险业务数据和非风险业务数据两个数据集,存储到数据库中,以供后续使用,示例性地,可以供后续模型调用。In some embodiments, the execution device can store the two data sets of risk business data and non-risk business data processed by the risk identification model into the database for subsequent use, for example, for subsequent model calls.

步骤S430:确定风险业务数据的明细类别。Step S430: Determine the detailed category of risk business data.

其中,风险业务数据包括若干业务数据,具体地,上述步骤可以为确定各条业务数据属于何种预设的明细类别。明细类别的数量大于风险类别的数量。Wherein, the risk business data includes several business data, specifically, the above step may be to determine which preset detailed category each piece of business data belongs to. The number of detail categories is greater than the number of risk categories.

一些实施例中,明细类别可以包括但不限于如下预设类别:低效账户监控、超标账户监控、未达账款监控、账外账监控、电费账户异常支出监控、账户状态异常监控、电费账户未对账数据监控、银行账户有效性监控、电子支付监控、疑似重复支付监控、备用金余额监控、对外借出资金监控、应付款项余额异常监控、大额调户监控、退票监控、支付流程回退监控、付款单据传递及时性监控、大额对私支付监控、“收支余”监控、大额现金收支监控、大额现金余额监控、内部封闭结算监控、资金双向交易异常监控、特殊支付监控、MAC地址重复拦截退回、供应商黑名单拦截退回、签名流程不匹配拦截退回、外部独立账户大额支付拦截、外部供应商大额支付拦截、对私大额支付拦截预警、异常时间支付指令拦截预警、超大额支付拦截预警、贷款偿还及时性监控、融资台账规范性监控、票据兑付及时性监控、新增应收票据单位监控、票据台账规范性监控、应收票据预算执行准确性、应付票据预算执行准确性、票据台账-出票日期规范性监控、承兑银行规范性监控、非银行承兑汇票监控、非电子票据监控。In some embodiments, detailed categories may include but not limited to the following preset categories: inefficient account monitoring, over-standard account monitoring, unpaid account monitoring, off-book account monitoring, electricity account abnormal expenditure monitoring, account status abnormal monitoring, electricity account Unreconciled data monitoring, bank account validity monitoring, electronic payment monitoring, suspected double payment monitoring, reserve fund balance monitoring, external lending fund monitoring, abnormal payable balance monitoring, large-value account transfer monitoring, check refund monitoring, payment process return Refund monitoring, payment document transmission timeliness monitoring, large-amount private payment monitoring, "revenue and expenditure balance" monitoring, large-amount cash receipts and payments monitoring, large-amount cash balance monitoring, internal closed settlement monitoring, abnormal two-way transaction of funds monitoring, special payment Monitoring, MAC address duplicate interception and return, supplier blacklist interception and return, signature process mismatch interception and return, external independent account large-value payment interception, external supplier large-value payment interception, private large-value payment interception warning, abnormal time payment instruction Interception early warning, super large payment interception early warning, loan repayment timeliness monitoring, financing ledger normative monitoring, bill redemption timeliness monitoring, newly added bill receivable unit monitoring, bill ledger normative monitoring, bill receivable budget execution accuracy , Accuracy of bills payable budget execution, bill ledger - normative monitoring of bill date, acceptance bank normative monitoring, non-bank acceptance bill monitoring, non-electronic bill monitoring.

其中,各个明细类别所提示的风险不同,具体如下:Among them, the risks indicated by each detailed category are different, as follows:

低效账户监控:监控不同账户分类下账户交易次数少于一定标准的账户;超标账户监控:监控各单位各账户分类下银行账户开立情况超出账户分级分类管控标准数的账户数;未达账款监控:监控各单位上月底未达账款存量情况;账外账监控:监控资金收支交易中,账户名称为本系统内单位,但账号未按规定纳入监控范围的交易记录;电费账户异常支出监控:监控电费账户发生除资金上缴、手续费以外的资金支出行为的交易记录;银行账户有效性监控:对前一日的所有账户通过收支余规则进行稽核,分析并判断银行账户状态(前一日期初余额+收入-支出与前一日期末余额进行比对,如果不一致,显示账户差异情况);电费账户三方对账实时监控:监控资金中心电费账户三方对账结果中,存在不明账款、未清分账款或未达账款中至少一项的记录;电子支付监控:监控各单位电子支付比率及非电子支付情况;疑似重复支付监控:监控近30天内收付款单位及账号、支付金额及支付摘要均相同的付款记录;备用金余额监控:监控各单位备用金存量明细;对外借出资金监控:监控对外借出资金的明细付款记录;应付款项余额异常监控:监控对外借出资金的明细付款记录;大额调户监控:监控各单位中电财账户向外部银行账户单笔转账金额>1000万元或月累计转账金额>2000万元的记录;退票监控:监控资金支付退票的记录;支付流程回退监控:监控集中付款业务的回退情况;付款单据传递及时性监控:监控集中支付传递中心时间与滚动排程日期大于3天的记录;大额对私支付监控:监控经费账户单笔交易金额大于5000元的对私支付记录;“收支余”监控:监控金融机构交易明细反映的银行账户收入、支出及余额,与会计账簿反映的收入、支出及余额存在差异的记录;大额现金收支监控:监控月累计库存现金科目借贷方发生额合计值大于5000元的单位;大额现金余额监控:监控月末存在大额现金余额的单位;内部封闭结算监控:监控各单位未执行内部封闭结算情况;资金双向交易异常监控:监控各单位虚假交易情况(临时借出、过桥资金);特殊支付监控:监控交易摘要中包含“招待”、“烟”、“酒”、“茶”、“礼品”、“接待”等字符的特殊支付记录;MAC地址重复拦截退回:监控MAC地址重复拦截退回情况;供应商黑名单拦截退回:监控供应商黑名单拦截退回情况;签名流程不匹配拦截退回:监控签名流程不匹配拦截退回情况;外部独立账户大额支付拦截:监控对外部独立账户大额支付拦截预警情况;外部供应商大额支付拦截:监控对外部供应商大额支付拦截预警情况;对私大额支付拦截预警:监控对私大额支付拦截预警情况;异常时间支付指令拦截预警:监控异常时间支付指令拦截预警情况;超大额支付拦截预警:监控对超大额支付实施落地处理情况;贷款偿还及时性监控:监控存在逾期未还贷款的记录;融资台账规范性监控:监控各单位融资台账期末余额与账面余额存在差异的情况;票据兑付及时性监控:应收票据未兑付(监控截止查询日期之前已到期,且应收票据未处理的票据记录),应付票据未兑付(监控截止查询日期之前已到期,且应付票据未处理的票据记录);新增应收票据单位监控:监控发生新增收取票据行为的单位;新增应收票据单位监控:监控发生新增收取票据行为的单位;票据台账规范性监控:监控各单位应收票据、应付票据累计台账票面金额与累计账面票据余额存在差异的情况;应收票据预算执行准确性:监控各单位上月底应收票据月度预算数与实际执行数偏差情况;应付票据预算执行准确性:监控各单位上月底应付票据月度预算数与实际执行数偏差情况;票据台账-出票日期规范性监控:监控票据台账-票据台账登记信息与出票日期不一致的票据记录;承兑银行规范性监控:监控票据台账-承兑人不包含“农业银行”、“工商银行”、“建设银行”、“中国银行”字段的票据记录;非银行承兑汇票监控:监控票据台账-监控票据类型不为“银行承兑汇票”的票据记录;非电子票据监控:监控票据台账-票据介质不为“电子票据”的票据记录。Low-efficiency account monitoring: monitor accounts with account transactions less than a certain standard under different account classifications; over-standard account monitoring: monitor the number of accounts with bank account openings exceeding the account classification and control standards under each account classification of each unit; unreached accounts Payment monitoring: monitoring the unpaid account balance of each unit at the end of last month; off-book account monitoring: monitoring the transaction records of fund income and expenditure transactions, the account name is the unit in the system, but the account number is not included in the monitoring scope according to the regulations; the electricity account is abnormal Expenditure monitoring: monitoring the transaction records of capital expenditure behaviors other than fund handover and handling fees in the electricity account; bank account validity monitoring: checking all accounts of the previous day through the balance of income and expenditure rules, analyzing and judging the status of bank accounts ( The balance at the beginning of the previous day + income - expenditure is compared with the balance at the end of the previous day. If they are inconsistent, the account difference will be displayed); the real-time monitoring of the three-party reconciliation of the electricity account: monitoring the result of the three-party reconciliation of the electricity account of the capital center, there are unclear accounts Records of at least one of the payment, unsettled accounts or outstanding accounts; electronic payment monitoring: monitor the electronic payment ratio and non-electronic payment status of each unit; suspected double payment monitoring: monitor the payment units and account numbers within the past 30 days, Payment records with the same payment amount and payment summary; reserve fund balance monitoring: monitor the details of the reserve fund stock of each unit; external lending funds monitoring: monitor the detailed payment records of external lending funds; abnormal balance of payables monitoring: monitor external lending Detailed payment records of funds; large-amount transfer account monitoring: monitor records of single transfer amount > 10 million yuan or monthly accumulative transfer amount > 20 million yuan from Zhongdiancai account of each unit to external bank account; check refund monitoring: monitor fund payment refund Payment process rollback monitoring: monitor the rollback of centralized payment business; payment document transmission timeliness monitoring: monitor the records that the centralized payment delivery center time and rolling schedule date are more than 3 days; large-amount private payment monitoring: monitor Private payment records with a single transaction amount of more than 5,000 yuan in the fund account; "receipt and expenditure balance" monitoring: monitor the income, expenditure and balance of the bank account reflected in the transaction details of the financial institution, and the income, expenditure and balance reflected in the accounting books. Records; monitoring of large cash receipts and expenditures: monitor units with a monthly accumulative inventory cash account with a total value of more than 5,000 yuan from debit and credit; monitoring of large cash balances: monitor units with large cash balances at the end of the month; internal closed settlement monitoring: monitor each The unit did not implement internal closed settlement; monitoring of abnormal two-way transaction of funds: monitoring of false transactions of each unit (temporary lending, bridge funds); monitoring of special payments: monitoring of transaction summaries containing "entertainment", "cigarette", and "wine" , "tea", "gift", "reception" and other characters of special payment records; MAC address duplicate interception and return: monitor MAC address duplicate interception and return; supplier blacklist interception and return: monitor supplier blacklist interception and return; signature Process mismatch interception and return: monitoring of signature process mismatch interception and return; external independent account large-value payment interception: monitoring of external independent account large-value payment interception and early warning; external supplier large-value payment interception: monitoring external supplier large-value payment interception Early warning of payment interception; early warning of interception of large-amount private payment: monitoring of early warning of interception of large-amount private payment; early warning of payment instruction interception at abnormal time: monitoring of early warning of payment instruction interception at abnormal time; early warning of interception of super-large payment: monitoring of super-large payment Implementation of the implementation of the processing situation; loan repayment timeliness monitoring: monitoring the existence of overdue loan repayment records; financing ledger normative monitoring: monitoring the differences between the ending balance and book balance of each unit's financing ledger; bill redemption timeliness monitoring: should Bills received unpaid (monitor records of bills that have expired before the cut-off date of inquiry and bills receivable have not been processed), bills payable have not been paid (monitored records of bills that have expired before the cut-off date of inquiry and bills payable have not been processed); new Add bill receivable unit monitoring: monitor the unit that has newly collected bills; add bill receivable unit monitoring: monitor the unit that has newly collected bills; bill ledger normative monitoring: monitor the bills receivable, payable There is a discrepancy between the face value of the accumulative ledger and the balance of the accumulative bills; the execution accuracy of the bills receivable budget: monitor the deviation between the monthly budget of the bills receivable by each unit at the end of last month and the actual execution amount; the execution accuracy of the bills payable budget: monitor The deviation between the monthly budget and actual execution of bills payable by each unit at the end of last month; standard monitoring of bill ledger-bill issuance date: monitor bill ledger-bill ledger registration information inconsistent with the bill issuance date; acceptance bank normative Monitoring: monitoring bill ledger - the acceptor does not include bill records in the fields of "Agricultural Bank of China", "Industrial and Commercial Bank of China", "Construction Bank" and "Bank of China"; monitoring of non-bank acceptance bills: monitoring bill ledger - monitoring bill type does not Bill records of "bank acceptance bills"; non-electronic bill monitoring: monitor bill ledger - bill records whose bill medium is not "electronic bills".

需要说明的是,不同明细类别所提示的风险不同,而相同明细类别所提示的风险有其共性,进行明细类别的分类是在风险识别的基础上进一步挖掘目标业务数据包含的关于风险识别的信息的方式。一些实施例中,也可以采用其他处理方式来替代明细类别的分类来挖掘目标业务数据包含的关于风险识别的信息。It should be noted that the risks suggested by different detailed categories are different, but the risks suggested by the same detailed category have their commonalities. The classification of detailed categories is to further mine the information about risk identification contained in the target business data on the basis of risk identification. The way. In some embodiments, other processing methods may be used to replace the classification of detailed categories to mine the information about risk identification contained in the target business data.

一些实施例中,执行设备可以将风险业务数据按照明细类别分类后作为风险明细数据而存储到数据库中,以供后续使用。In some embodiments, the execution device may classify the risk business data according to the detailed category and store it as risk detailed data in the database for subsequent use.

步骤S440:利用风险分类模型确定已确定明细类别的风险业务数据的风险类别。Step S440: Using the risk classification model to determine the risk category of the risk business data whose detailed category has been determined.

步骤S440的相关描述可以参考前述实施例中关于步骤S130的相关内容。For related descriptions of step S440, reference may be made to related content about step S130 in the foregoing embodiments.

进一步地,利用风险分类模型将已确定明细类别的风险业务数据映射到分类函数维度,并确定各风险类别分别对应的分离超平面;在分类函数维度,基于分离超平面将已确定明细类别的风险业务数据划分为与各风险类别对应。从而能够得到确定了明细类别的风险业务数据。Further, use the risk classification model to map the risk business data of the identified detailed categories to the classification function dimension, and determine the separation hyperplane corresponding to each risk category; in the classification function dimension, based on the separation hyperplane, the risks Business data is divided into corresponding risk categories. In this way, risk business data with detailed categories can be obtained.

在一具体的应用场景中,已确定明细类别的风险业务数据中各条业务数据已经确定好明细类别。对于已经确定好明细类别的风险业务数据可以进一步划分为几大类风险类别。示例性地,明细类别低效账户监控、超标账户监控、未达账款监控、账外账监控、电费账户异常支出监控、账户状态异常监控、电费账户未对账数据监控、银行账户有效性监控、电费账户三方对账实时监控可以被确定为账户监控这一风险类别。明细类别电子支付监控、疑似重复支付监控、备用金余额监控、对外借出资金监控、应付款项余额异常监控、大额调户监控、退票监控、支付流程回退监控、付款单据传递及时性监控、大额对私支付监控、“收支余”监控、大额现金收支监控、大额现金余额监控、内部封闭结算监控、资金双向交易异常监控、特殊支付监控可以被确定为收支监控这一风险类别。明细类别MAC地址重复拦截退回、供应商黑名单拦截退回、签名流程不匹配拦截退回、外部独立账户大额支付拦截、外部供应商大额支付拦截、对私大额支付拦截预警、异常时间支付指令拦截预警、超大额支付拦截预警可以被确定为事中监控这一风险类别。明细类别贷款偿还及时性监控、融资台账规范性监控、票据兑付及时性监控、新增应收票据单位监控、票据台账规范性监控、应收票据预算执行准确性、应付票据预算执行准确性、票据台账-出票日期规范性监控、承兑银行规范性监控、非银行承兑汇票监控、非电子票据监控可以被确定为融资监控这一风险类别。In a specific application scenario, each item of business data in the risk business data whose detailed category has been determined has already determined the detailed category. The risk business data whose detailed categories have been determined can be further divided into several major risk categories. Exemplarily, detailed category inefficient account monitoring, over-standard account monitoring, outstanding account monitoring, off-book account monitoring, electricity account abnormal expenditure monitoring, account status abnormal monitoring, electricity account unreconciled data monitoring, bank account validity monitoring Real-time monitoring of tripartite reconciliation of electricity account can be identified as the risk category of account monitoring. Detailed category electronic payment monitoring, suspected double payment monitoring, reserve fund balance monitoring, external loan fund monitoring, payable balance abnormal monitoring, large-amount transfer account monitoring, check refund monitoring, payment process rollback monitoring, payment document transfer timeliness monitoring, Large-amount private payment monitoring, "receipt and expenditure surplus" monitoring, large-amount cash receipt and expenditure monitoring, large-amount cash balance monitoring, internal closed settlement monitoring, abnormal two-way transaction of funds monitoring, and special payment monitoring can be determined as the monitoring of income and expenditure. risk category. Detailed category MAC address duplicate interception and return, supplier blacklist interception and return, signature process mismatch interception and return, external independent account large-value payment interception, external supplier large-value payment interception, private large-value payment interception warning, abnormal time payment instruction Interception early warning and super large payment interception early warning can be identified as the risk category of in-process monitoring. Detailed category loan repayment timeliness monitoring, financing ledger normative monitoring, bill redemption timeliness monitoring, newly added bill receivable unit monitoring, bill ledger normative monitoring, bill receivable budget execution accuracy, bill payable budget execution accuracy , Bill ledger-regular monitoring of bill date, normative monitoring of acceptance banks, monitoring of non-bank acceptance bills, and monitoring of non-electronic bills can be identified as the risk category of financing monitoring.

通过上述明细类别以及风险类别的设置,能够实现对资金运营全过程中可能的资金风险进行识别,提升风险预测的准确性和全面性。Through the setting of the above detailed categories and risk categories, it is possible to identify possible capital risks in the whole process of capital operation and improve the accuracy and comprehensiveness of risk prediction.

一些实施例中,对于不同明细类别的风险业务数据,可以采用不同的特征转换函数,即采用不同方式将风险业务数据映射到分类函数维度。In some embodiments, different feature conversion functions may be used for risky business data of different detailed categories, that is, different ways may be used to map risky business data to classification function dimensions.

步骤S450:基于由风险业务数据所生成的样本业务数据,对风险识别模型进行增强训练,并基于由已确定风险类别的风险业务数据所生成的样本待分类数据,对风险分类模型进行增强训练。Step S450: Based on the sample business data generated from the risky business data, perform enhanced training on the risk identification model, and based on the sample data to be classified generated from the risky business data with identified risk categories, perform enhanced training on the risk classification model.

需要说明的是,对风险识别模型的增强训练和对风险分类模型的增强训练可以是相互独立的。两个模型的增强训练均可以执行多次。It should be noted that the enhanced training of the risk identification model and the enhanced training of the risk classification model may be independent of each other. Augmentation training for both models can be performed multiple times.

一些实施例中,执行设备可以每间隔第一预设时长执行增强训练的相关步骤。In some embodiments, the executing device may execute the related steps of the enhanced training every first preset time interval.

一些实施例中,执行设备也可以采用不同的时间间隔分别重复执行对风险识别模型的增强训练和对风险分类模型的增强训练的相关步骤。In some embodiments, the execution device may also repeatedly execute the related steps of enhancing training of the risk identification model and enhancing training of the risk classification model at different time intervals.

通过不断获取目标业务数据,用于生成样本业务数据和样本待分类数据,以周期性地进行两模型的增强训练,实现了训练数据的滚动更新,一方面,使得模型训练的样本数量明显扩大,增加了模型学习的信息量,提高模型的准确性,从而提升风险识别的准确性,另一方面,通过数据的滚动更新,使得模型不断加强学习,提高模型的准确性,从而提升风险识别的准确性。By continuously obtaining the target business data, it is used to generate sample business data and sample data to be classified, so as to periodically carry out enhanced training of the two models, and realize the rolling update of the training data. On the one hand, the number of samples for model training is significantly expanded. Increase the amount of information learned by the model and improve the accuracy of the model, thereby improving the accuracy of risk identification. On the other hand, through the rolling update of data, the model is continuously strengthened to learn and improve the accuracy of the model, thereby improving the accuracy of risk identification sex.

具体地,执行设备还可以利用已确定明细类别的风险业务数据生成样本业务数据,其中样本业务数据标注有真实风险标签,真实风险标签表征样本业务数据属于风险数据类型和非风险数据类型中的何种,样本业务数据用于对风险识别模型进行增强训练。Specifically, the execution device can also generate sample business data by using the risk business data of the determined detailed category, wherein the sample business data is marked with a real risk label, and the real risk label indicates which of the risk data types and non-risk data types the sample business data belongs to. Type, the sample business data is used to enhance the training of the risk identification model.

在一具体的应用场景中,由于样本业务数据是根据已确定明细类别的风险业务数据生成的,样本业务数据中包含的业务数据来自于风险业务数据这一数据集,故来自于风险业务数据的业务数据对应的真实风险标签表征该业务数据属于风险数据类型。In a specific application scenario, since the sample business data is generated based on the risk business data of the specified category, the business data contained in the sample business data comes from the data set of risk business data, so the data from the risk business data The real risk label corresponding to the business data indicates that the business data belongs to the risk data type.

一些实施例中,也可以利用已确定明细类别的风险业务数据与其他业务数据一并生成样本业务数据。示例性地,其他业务数据可以是之前对风险识别模型训练过程中使用过的样本数据。In some embodiments, it is also possible to generate sample business data by using the risk business data of the determined detailed category together with other business data. Exemplarily, other business data may be sample data previously used in the training process of the risk identification model.

具体地,执行设备可以利用风险识别模型划分样本业务数据,得到样本风险类型预测结果,基于样本风险类型预测结果确定样本风险数据和样本非风险数据,其中,样本风险预测结果表征预测样本业务数据属于风险数据类型和非风险数据类型中的何者,而后基于风险预测结果和真实风险标签之间的第一差异调整风险识别模型的参数。Specifically, the execution device can use the risk identification model to divide the sample business data, obtain the sample risk type prediction result, and determine the sample risk data and sample non-risk data based on the sample risk type prediction result, wherein the sample risk prediction result indicates that the sample business data belongs to Which one of the risk data type and the non-risk data type, and then adjust the parameters of the risk identification model based on the first difference between the risk prediction result and the real risk label.

其中,对于样本风险类型预测结果为属于风险数据类型的样本业务数据作为样本风险数据,对于样本风险类型预测结果为属于非风险数据类型的样本业务数据作为样本非风险数据。样本风险类型预测结果为利用风险识别模型对于样本业务数据属于风险数据类型还是非风险数据类型进行预测的结果,而真实风险标签则是预先确定的、关于样本业务数据属于风险数据类型还是非风险数据类型的真实标签,将两者进行比较得到第一差异可以用于调整风险识别模型的参数,从而提高风险识别模型的准确性。Wherein, for the sample risk type, the predicted result is the sample business data belonging to the risk data type as the sample risk data, and for the sample risk type, the predicted result is the sample business data belonging to the non-risk data type as the sample non-risk data. The prediction result of the sample risk type is the result of using the risk identification model to predict whether the sample business data belongs to the risk data type or the non-risk data type, while the real risk label is predetermined, whether the sample business data belongs to the risk data type or the non-risk data type The real label of the type, comparing the two to get the first difference can be used to adjust the parameters of the risk identification model, thereby improving the accuracy of the risk identification model.

一些实施例中,由于样本业务数据是基于已确定明细类别的风险业务数据生成的,真实风险标签还可以包含关于明细类别的信息,风险识别模型还可以被设置为能够学习真实风险标签包含的明细类别的信息,以提升风险识别模型的准确性。In some embodiments, since the sample business data is generated based on the risk business data of the specified category, the real risk label can also contain information about the category, and the risk identification model can also be set to be able to learn the details contained in the real risk label. categories of information to improve the accuracy of risk identification models.

具体地,执行设备还可以利用已确定风险类别的风险业务数据生成样本待分类数据,其中,样本待分类数据标注有真实分类标签,真实分类标签用于表征样本风险数据所述的风险类别。Specifically, the execution device can also generate sample data to be classified by using the risk business data of the determined risk category, wherein the sample data to be classified is marked with a real classification label, and the real classification label is used to represent the risk category described in the sample risk data.

一些实施例中,也可以利用已确定风险类别的风险业务数据与其他业务数据一并生成样本待分类数据。示例性地,其他业务数据可以是之前对风险分类模型训练过程中使用过的样本数据。In some embodiments, the risk business data of the determined risk category may also be used together with other business data to generate sample data to be classified. Exemplarily, other business data may be sample data previously used in the training process of the risk classification model.

具体地,执行设备可以利用风险分类模型确定样本待分类数据的风险类别,作为样本预测分类结果,基于样本预测分类结果和真实类别标签之间的第二差异调整风险分类模型的参数。Specifically, the execution device may use the risk classification model to determine the risk category of the sample to-be-classified data as a predicted classification result of the sample, and adjust the parameters of the risk classification model based on the second difference between the predicted classification result of the sample and the real category label.

其中,样本预测分类结果是利用风险分类模型对样本待分类数据所述的风险类别进行预测的结果,真实类别标签则是样本待分类数据所属风险类别的真实标签,利用两者之间的第二差异可以调整风险分类模型的参数,以提高风险分类模型的准确性。Among them, the sample prediction classification result is the result of using the risk classification model to predict the risk category of the sample data to be classified, and the real category label is the real label of the risk category to which the sample data to be classified belongs. Differences can adjust the parameters of the risk classification model to improve the accuracy of the risk classification model.

在一具体的应用场景中,风险分类模型的训练过程包括:首先确定如下公式:In a specific application scenario, the training process of the risk classification model includes: first determine the following formula:

f(x)=sign(w*T·Φ(x)+b*)f(x)=sign(w *T Φ(x)+b * )

其中,Φ(x)表示空间的特征转换函数,x表示样本待分类数据,其中包括n条业务数据,y表示x对应的风险分类后的数据集,其中包括n条业务数据。Sign()表示sign函数。其次,基于下式求解αi *i的最优解):Among them, Φ(x) represents the feature transformation function of the space, x represents the sample data to be classified, including n pieces of business data, and y represents the data set after risk classification corresponding to x, including n pieces of business data. Sign() represents the sign function. Secondly, α i * (the optimal solution of α i ) is solved based on the following formula:

αi≥0,i=1,2,…,nα i ≥0,i=1,2,…,n

之后,基于下式求解w*、b*(w、b的最优解):After that, solve w*, b* (the optimal solution of w, b) based on the following formula:

最后,计算得到分离超平面如下式:Finally, the separation hyperplane is calculated as follows:

w*Φ(x)+b*=0w * Φ(x)+b * =0

而后利用分离超平面可以得到按照预测风险类别分类的样本待分类数据,将该结果与样本待分类数据标注的真实类别标签进行比较,利用两者之间的差异调整风险分类模型的参数。Then, the separation hyperplane can be used to obtain the sample data to be classified according to the predicted risk category, compare the result with the real category label labeled by the sample data to be classified, and use the difference between the two to adjust the parameters of the risk classification model.

一些实施例中,执行设备还可以将基于目标业务数据获取的风险相关数据存储到数据库中,以响应于数据调用请求,发送与数据调用请求匹配的数据到目标对象。其中,数据调用请求为目标对象发送的,风险相关数据包括风险业务数据、非风险业务数据。In some embodiments, the execution device may also store the risk-related data obtained based on the target business data in the database, so as to respond to the data call request and send the data matching the data call request to the target object. Wherein, the data call request is sent by the target object, and the risk-related data includes risk business data and non-risk business data.

一些实施例中,风险相关数据还可以包括按照风险类别分类的风险业务数据等。In some embodiments, the risk-related data may also include risk business data classified according to risk categories.

请参阅图5,图5是本申请风险识别方法再一实施例的流程示意图。Please refer to FIG. 5 . FIG. 5 is a schematic flowchart of another embodiment of the risk identification method of the present application.

本实施例中,执行设备可以运行一风险监控系统,以及执行设备中包含两个数据库,其中,第一数据库可以用于存储从生产环境(财务管理系统)获取的目标业务数据等。第二数据库可以用于存储风险相关数据,以及增强训练所需的样本业务数据、样本待分类数据等。In this embodiment, the execution device can run a risk monitoring system, and the execution device includes two databases, wherein the first database can be used to store target business data obtained from the production environment (financial management system). The second database can be used to store risk-related data, as well as sample business data required for enhanced training, sample data to be classified, and the like.

执行设备可以与生产环境的设备进行通信,获取生产环境产生的业务数据,并存储于第一数据库中,而后风险监控系统可以从第一数据库中获取到目标业务数据。风险监控系统通过风险识别模型和风险分类模型可以从识别目标业务数据中的风险业务数据,并确定其风险类别。风险识别模型和风险分类模型处理得到的数据可以作为风险相关数据存储到第二数据库中,用于供后续增强训练使用,风险监控系统可以从第二数据库获取经风险识别模型和风险分类模型处理后的DB数据(数据集),用于进行增强训练,从而完成模型的矫正。终端用户可以通过其终端设备运行的客户端与执行设备进行通信,通过客户端向执行设备发送数据调用请求,以使执行设备将第二数据库中存储的数据反馈至客户端,以使客户端可以展示相关数据,示例性地,显示各风险类别的业务数据。The execution device can communicate with the devices in the production environment, obtain business data generated in the production environment, and store them in the first database, and then the risk monitoring system can obtain target business data from the first database. The risk monitoring system can identify the risky business data in the target business data and determine its risk category through the risk identification model and the risk classification model. The data processed by the risk identification model and the risk classification model can be stored in the second database as risk-related data for subsequent enhanced training. The DB data (data set) is used for enhanced training to complete the correction of the model. The terminal user can communicate with the execution device through the client running on the terminal device, and send a data call request to the execution device through the client, so that the execution device will feed back the data stored in the second database to the client, so that the client can Relevant data is displayed, for example, business data of each risk category is displayed.

请参阅图6,图6是本申请电子设备一实施例的框架示意图。Please refer to FIG. 6 . FIG. 6 is a schematic frame diagram of an embodiment of the electronic device of the present application.

本实施例中,电子设备60包括存储器61、处理器62,其中存储器61耦接处理器62。具体地,电子设备60的各个组件可通过总线耦合在一起,或者电子设备60的处理器62分别与其他组件一一连接。该电子设备60可以为具有处理能力的任意设备,例如计算机、平板电脑、手机等。In this embodiment, the electronic device 60 includes a memory 61 and a processor 62 , wherein the memory 61 is coupled to the processor 62 . Specifically, various components of the electronic device 60 may be coupled together through a bus, or the processor 62 of the electronic device 60 may be connected to other components one by one. The electronic device 60 may be any device with processing capability, such as a computer, a tablet computer, a mobile phone, and the like.

存储器61用于存储处理器62执行的程序指令以及处理器62在处理过程中的数据等。例如,风险业务数据、样本业务数据等。其中,该存储器61包括非易失性存储部分,用于存储上述程序指令。The memory 61 is used to store program instructions executed by the processor 62 and data during processing by the processor 62 . For example, risk business data, sample business data, etc. Wherein, the memory 61 includes a non-volatile storage part for storing the above-mentioned program instructions.

处理器62控制电子设备60的操作,处理器62还可以称为CPU(Central ProcessingUnit,中央处理单元)。处理器62可能是一种集成电路芯片,具有信号的处理能力。处理器62还可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。另外,处理器62可以由多个成电路芯片共同实现。The processor 62 controls the operation of the electronic device 60, and the processor 62 may also be referred to as a CPU (Central Processing Unit, central processing unit). The processor 62 may be an integrated circuit chip with signal processing capability. The processor 62 can also be a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components . A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like. In addition, the processor 62 may be jointly implemented by multiple circuit chips.

处理器62通过调用存储器61存储的程序指令,用于执行指令以实现上述任一风险识别方法。The processor 62 invokes the program instructions stored in the memory 61 to execute the instructions to implement any of the above risk identification methods.

请参阅图7,图7是本申请计算机可读存储介质一实施例的框架示意图。Please refer to FIG. 7 . FIG. 7 is a schematic frame diagram of an embodiment of a computer-readable storage medium of the present application.

本实施例中,该计算机可读存储介质70存储有处理器可运行的程序指令71,该程序指令71能够被执行,用以实现上述任一风险识别方法。In this embodiment, the computer-readable storage medium 70 stores program instructions 71 executable by the processor, and the program instructions 71 can be executed to implement any of the risk identification methods described above.

该计算机可读存储介质70具体可以为U盘、移动硬盘、只读存储器(ROM,Read-OnlyMemory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等可以存储程序指令的介质,或者也可以为存储有该程序指令的服务器,该服务器可将存储的程序指令发送给其他设备运行,或者也可以自运行该存储的程序指令。The computer-readable storage medium 70 can specifically be a medium that can store program instructions, such as a U disk, a mobile hard disk, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk or an optical disk. , or it can also be a server storing the program instructions, and the server can send the stored program instructions to other devices to run, or can also run the stored program instructions by itself.

在一些实施例中,计算机可读存储介质70还可以为如图6所示的存储器。In some embodiments, the computer-readable storage medium 70 may also be a memory as shown in FIG. 6 .

以上所述仅为本申请的实施方式,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above is only the implementation of the application, and does not limit the patent scope of the application. Any equivalent structure or equivalent process conversion made by using the specification and drawings of the application, or directly or indirectly used in other related technologies fields, are all included in the scope of patent protection of this application in the same way.

Claims (10)

1.一种风险识别方法,其特征在于,所述方法包括:1. A risk identification method, characterized in that the method comprises: 获取目标业务数据;Obtain target business data; 利用风险识别模型划分所述目标业务数据得到风险业务数据和非风险业务数据;Using a risk identification model to divide the target business data to obtain risk business data and non-risk business data; 利用风险分类模型确定所述风险业务数据的风险类别;Using a risk classification model to determine the risk category of the risk business data; 基于由所述风险业务数据所生成的样本业务数据,对所述风险识别模型进行增强训练,并基于由已确定所述风险类别的风险业务数据所生成的样本待分类数据,对所述风险分类模型进行增强训练。Perform enhanced training on the risk identification model based on the sample business data generated from the risk business data, and classify the risk based on the sample data to be classified generated from the risk business data for which the risk category has been determined The model is enhanced for training. 2.根据权利要求1所述的方法,其特征在于,所述利用风险识别模型划分所述目标业务数据得到风险业务数据和非风险业务数据包括:2. The method according to claim 1, wherein said dividing said target business data using a risk identification model to obtain risk business data and non-risk business data comprises: 利用所述风险识别模型从所述目标业务数据中依次识别得到直接风险数据、间接风险数据和罕见风险数据;Using the risk identification model to sequentially identify direct risk data, indirect risk data and rare risk data from the target business data; 将所述直接风险数据、所述间接风险数据和所述罕见风险数据一并作为所述风险业务数据,以及将所述目标业务数据中除所述直接风险数据、所述间接风险数据和所述罕见风险数据之外的部分业务数据作为所述非风险业务数据。taking the direct risk data, the indirect risk data and the rare risk data together as the risk business data, and dividing the target business data from the direct risk data, the indirect risk data and the Part of the business data other than the rare risk data is used as the non-risk business data. 3.根据权利要求2所述的方法,其特征在于,所述利用所述风险识别模型从所述目标业务数据中依次识别得到直接风险数据、间接风险数据和罕见风险数据包括:3. The method according to claim 2, characterized in that, using the risk identification model to sequentially identify direct risk data, indirect risk data and rare risk data from the target business data comprises: 获取所述目标业务数据属于所述直接风险数据的第一概率;Obtaining a first probability that the target business data belongs to the direct risk data; 基于所述第一概率从所述目标业务数据中确定所述直接风险数据;determining said immediate risk data from said target business data based on said first probability; 获取所述目标业务数据中除所述直接风险数据之外的部分业务数据属于所述间接风险数据的第二概率;Obtaining a second probability that part of the business data in the target business data other than the direct risk data belongs to the indirect risk data; 基于所述第二概率从所述目标业务数据中除所述直接风险数据之外的部分业务数据中确定所述间接风险数据;determining the indirect risk data from part of the target business data except the direct risk data based on the second probability; 获取所述目标业务数据中除所述直接风险数据和所述间接风险数据之外的部分业务数据属于所述罕见风险数据的第三概率;Obtaining a third probability that part of the target business data other than the direct risk data and the indirect risk data belongs to the rare risk data; 基于所述第三概率从所述目标业务数据中除所述直接风险数据和所述间接风险数据之外的部分业务数据中确定所述罕见风险数据。The rare risk data is determined from part of the target service data except the direct risk data and the indirect risk data based on the third probability. 4.根据权利要求1所述的方法,其特征在于,所述利用风险识别模型划分所述目标业务数据得到风险业务数据和非风险业务数据之后,所述利用风险分类模型确定所述风险业务数据的风险类别之前,所述方法还包括:4. The method according to claim 1, characterized in that, after said using a risk identification model to divide said target business data to obtain risky business data and non-risk business data, said using a risk classification model to determine said risky business data Before the risk category of , the method also includes: 确定所述风险业务数据的明细类别,其中,所述明细类别数量大于所述风险类别的数量;determining the detailed categories of the risk business data, wherein the number of detailed categories is greater than the number of risk categories; 所述利用风险分类模型确定所述风险业务数据的风险类别包括:The determining the risk category of the risk business data by using the risk classification model includes: 利用所述风险分类模型将已确定所述明细类别的所述风险业务数据映射到分类函数维度,并确定各所述风险类别分别对应的分离超平面;Using the risk classification model to map the risk business data of which the detailed category has been determined to a classification function dimension, and determine the separation hyperplane corresponding to each of the risk categories; 在所述分类函数维度,基于所述分离超平面将已确定所述明细类别的所述风险业务数据划分为与各所述风险类别对应。In the classification function dimension, the risk business data of which the detailed categories have been determined are divided into corresponding risk categories based on the separating hyperplane. 5.根据权利要求4所述的方法,其特征在于,所述方法还包括:5. method according to claim 4, is characterized in that, described method also comprises: 利用已确定所述明细类别的所述风险业务数据生成所述样本业务数据,其中,所述样本业务数据标注有真实风险标签,所述真实风险标签表征所述样本业务数据属于风险数据类型和非风险数据类型中的何种;The sample business data is generated by using the risk business data of which the detailed category has been determined, wherein the sample business data is marked with a real risk label, and the real risk label indicates that the sample business data belongs to the risk data type and non-risk data. Which of the risk data types; 所述基于由所述风险业务数据所生成的样本业务数据,对所述风险识别模型进行增强训练包括:The performing enhanced training on the risk identification model based on the sample business data generated by the risk business data includes: 利用所述风险识别模型划分所述样本业务数据,得到样本风险类型预测结果,基于所述样本风险类型预测结果确定样本风险数据和样本非风险数据,所述样本风险预测结果表征预测所述样本业务数据属于风险数据类型和非风险数据类型中的何种;Using the risk identification model to divide the sample business data to obtain a sample risk type prediction result, determine sample risk data and sample non-risk data based on the sample risk type prediction result, and the sample risk prediction result represents and predicts the sample business Which of the risk data type and non-risk data type does the data belong to; 基于所述风险预测结果和所述真实风险标签之间的第一差异调整所述风险识别模型的参数。Adjusting parameters of the risk identification model based on a first difference between the risk prediction result and the true risk label. 6.根据权利要求4所述的方法,其特征在于,所述方法还包括:6. The method according to claim 4, characterized in that the method further comprises: 利用已确定所述风险类别的所述风险业务数据生成所述样本待分类数据,其中,所述样本待分类数据标注有真实类别标签,所述真实类别标签表征所述样本待分类数据所属的所述风险类别;The sample data to be classified is generated by using the risk business data of which the risk category has been determined, wherein the sample data to be classified is marked with a real category label, and the real category label represents all the samples to which the sample data to be classified belongs. the above risk categories; 所述基于由已确定所述风险类别的风险业务数据所生成的样本待分类数据,对所述风险分类模型进行增强训练包括:The enhanced training of the risk classification model based on the sample data to be classified generated from the risk business data of which the risk category has been determined includes: 利用所述风险分类模型确定所述样本待分类数据的风险类别,作为样本预测分类结果;Using the risk classification model to determine the risk category of the sample data to be classified as a sample prediction classification result; 基于所述样本预测分类结果和所述真实类别标签之间的第二差异调整所述风险分类模型的参数。Adjusting parameters of the risk classification model based on a second difference between the sample predicted classification result and the true class label. 7.根据权利要求1所述的方法,其特征在于,所述风险类别包括收支监控、预算监控、事中监控和融资监控中的至少一者;7. The method according to claim 1, wherein the risk category includes at least one of revenue and expenditure monitoring, budget monitoring, ongoing monitoring and financing monitoring; 和/或,每间隔第一预设时长执行所述增强训练,每间隔第二预设时长获取所述目标业务数据。And/or, the enhanced training is performed every first preset time period, and the target service data is acquired every second preset time period. 8.根据权利要求1所述的方法,其特征在于,所述方法还包括:8. The method according to claim 1, further comprising: 将基于所述目标业务数据获取的风险相关数据存储到数据库中,以响应于数据调用请求,发送与所述数据调用请求匹配的数据至目标对象,其中,所述风险相关数据包括所述风险业务数据和所述非风险业务数据。storing the risk-related data obtained based on the target business data in a database, and sending data matching the data call request to the target object in response to the data call request, wherein the risk-related data includes the risk business data and the non-risk business data. 9.一种电子设备,其特征在于,包括相互耦接的存储器和处理器,所述处理器用于执行所述存储器中存储的程序指令,以实现权利要求1至8中任一项所述的风险识别方法。9. An electronic device, characterized in that it comprises a memory and a processor coupled to each other, and the processor is configured to execute the program instructions stored in the memory, so as to realize the method described in any one of claims 1 to 8. risk identification method. 10.一种计算机可读存储介质,其上存储有程序指令,其特征在于,所述程序指令被处理器执行时实现权利要求1至8中任一项所述的风险识别方法。10. A computer-readable storage medium, on which program instructions are stored, wherein, when the program instructions are executed by a processor, the risk identification method according to any one of claims 1 to 8 is implemented.
CN202310630270.XA 2023-05-30 2023-05-30 Risk identification method, risk identification equipment and computer readable storage medium Pending CN116703155A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310630270.XA CN116703155A (en) 2023-05-30 2023-05-30 Risk identification method, risk identification equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310630270.XA CN116703155A (en) 2023-05-30 2023-05-30 Risk identification method, risk identification equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN116703155A true CN116703155A (en) 2023-09-05

Family

ID=87828626

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310630270.XA Pending CN116703155A (en) 2023-05-30 2023-05-30 Risk identification method, risk identification equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN116703155A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118396651A (en) * 2024-04-16 2024-07-26 国网河南省电力公司漯河供电公司 Digital management and control system and method for power grid marketing business risk

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113313633A (en) * 2021-06-25 2021-08-27 西安紫光展锐科技有限公司 Training method and device of hyper-division network model and electronic equipment
CN115713396A (en) * 2022-10-28 2023-02-24 远光软件股份有限公司 Risk prediction method for capital operation, electronic equipment and computer storage medium

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113313633A (en) * 2021-06-25 2021-08-27 西安紫光展锐科技有限公司 Training method and device of hyper-division network model and electronic equipment
CN115713396A (en) * 2022-10-28 2023-02-24 远光软件股份有限公司 Risk prediction method for capital operation, electronic equipment and computer storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118396651A (en) * 2024-04-16 2024-07-26 国网河南省电力公司漯河供电公司 Digital management and control system and method for power grid marketing business risk

Similar Documents

Publication Publication Date Title
US20230360053A1 (en) System and method using multiple profiles and scores for assessing financial transaction risk
US10607284B2 (en) System and method to search and verify borrower information using banking and investment account data and process to systematically share information with lenders and government sponsored agencies for underwriting and securitization phases of the lending cycle
US7966256B2 (en) Methods and systems of predicting mortgage payment risk
CA2828751C (en) System and method for suspect entity detection and mitigation
US10319028B2 (en) Payment account monitoring system and method
US7536348B2 (en) Enhancing delinquent debt collection using statistical models of debt historical information and account events
US7556192B2 (en) Systems and methods for decisioning or approving a financial credit account based on a customer's check-writing behavior
US8645264B2 (en) Apparatus and methods for verifying a credit applicant's income that enhance a credit applicant's experience
US20220207295A1 (en) Predicting occurrences of temporally separated events using adaptively trained artificial intelligence processes
US20220398583A1 (en) Transaction reconciliation and deduplication
US20160086263A1 (en) System and method for locating and accessing account data to verify income
US20130073386A1 (en) Systems and methods for generating financial institution product offer proposals
US20120330819A1 (en) System and method for locating and accessing account data
US20150363875A1 (en) System and Method for Filtering and Analyzing Transaction Information
CN116738258B (en) User key data mining method based on data analysis
US20240378604A1 (en) Analytics rules engine for credit transaction stacking identification
US20080021813A1 (en) Method for scoring accounts for retention and marketing accounts based on retention and profitability
CN117709710A (en) Risk identification method, apparatus, device and storage medium
US20240378466A1 (en) Analytics rules engine for select transaction identification
US20230040705A1 (en) Risk management network
CN116703155A (en) Risk identification method, risk identification equipment and computer readable storage medium
US12093147B1 (en) Computer-based system for automatic repair of transaction information
WO2021127281A1 (en) Systems and methods for determining an affordability index
CN117372071A (en) Method, device, server and medium for determining intention clients of financial products
CN114240610B (en) Automatic fund collection method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Yang Fan

Inventor after: Wang Haixiang

Inventor after: Zhang Weizhong

Inventor after: Li Sixuan

Inventor after: Wei Yaohao

Inventor after: Lin Diehuan

Inventor before: Wang Haixiang

Inventor before: Zhang Weizhong

Inventor before: Li Sixuan

Inventor before: Wei Yaohao

Inventor before: Lin Diehuan

Inventor before: Yang Fan