CN115529581B - Off-network user identification method and device - Google Patents
Off-network user identification method and device Download PDFInfo
- Publication number
- CN115529581B CN115529581B CN202110712358.7A CN202110712358A CN115529581B CN 115529581 B CN115529581 B CN 115529581B CN 202110712358 A CN202110712358 A CN 202110712358A CN 115529581 B CN115529581 B CN 115529581B
- Authority
- CN
- China
- Prior art keywords
- communication
- network user
- communication object
- network
- object set
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W8/00—Network data management
- H04W8/18—Processing of user or subscriber data, e.g. subscribed services, user preferences or user profiles; Transfer of user or subscriber data
- H04W8/183—Processing at user equipment or user record carrier
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W24/00—Supervisory, monitoring or testing arrangements
- H04W24/08—Testing, supervising or monitoring using real traffic
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Information Transfer Between Computers (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及通信技术领域,尤其涉及一种离网用户识别方法及装置。The present invention relates to the field of communication technology, and in particular to an off-network user identification method and device.
背景技术Background technique
目前,国内通讯业务已经泛化饱和,当用户个性化的需求与运营商提供的服务不匹配时就容易造成用户流失,即造成现网用户离网的情况,而用户离网后由于变更了原有运营商的手机号码,造成运营商很难再对其进行二次营销。At present, domestic telecommunications services have become generalized and saturated. When the personalized needs of users do not match the services provided by operators, it is easy to cause user loss, that is, causing existing network users to leave the network. After users leave the network, they change their mobile phone numbers to the original operators, making it difficult for operators to carry out secondary marketing to them.
相关技术中,主要是对异网用户进行无差别的营销,即通过获取一定区域内的异网用户清单,对其进行无差别推广,然而,该方法由于对离网用户识别不够精准,导致转化率很低。In the related technology, the main method is to carry out indiscriminate marketing to off-network users, that is, by obtaining a list of off-network users in a certain area, indiscriminate promotion is carried out to them. However, this method is not accurate enough in identifying off-network users, resulting in a very low conversion rate.
发明内容Summary of the invention
本发明实施例提供一种离网用户识别方法及装置,以解决相关技术对离网用户识别不够精准,导致转化率很低的问题。The embodiments of the present invention provide an off-network user identification method and device to solve the problem that the related art is not accurate enough in identifying off-network users, resulting in a very low conversion rate.
第一方面,本发明实施例提供了一种离网用户识别方法,包括:In a first aspect, an embodiment of the present invention provides an off-network user identification method, comprising:
获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;Acquire a first communication object set of the off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency;
获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;Acquire all the communication objects of the first communication object in the first communication object set in the second preset time period to obtain a second communication object set;
获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;Acquire a third communication object set of each second communication object in the second communication object set within a third preset time period, wherein the third communication object set of each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency;
分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。The similarity between each third communication object set and the first communication object set is compared respectively, and the target second communication object corresponding to the third communication object set having the highest similarity to the first communication object set is determined as the changed object of the off-network user.
可选的,所述获取离网用户在第一预设时间段内的第一通讯对象集,包括:Optionally, obtaining a first communication object set of the off-network user within a first preset time period includes:
获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;Acquire communication data of the off-network user within a first preset time period, the communication data including a communication object and a communication frequency;
根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。According to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are on the same network, the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency are determined to obtain a first communication object set.
可选的,所述获取离网用户在第一预设时间段内的通讯数据之后,所述方法还包括:Optionally, after acquiring the communication data of the off-network user within the first preset time period, the method further includes:
根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;Constructing a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
所述根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,包括:The determining, according to the communication frequency between each communication object and the off-network user, and whether each communication object and the off-network user are on the same network, a same-network communication object whose communication frequency with the off-network user is higher than a first preset frequency, comprises:
剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The communication objects in different networks from the off-network user and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency are eliminated from the communication network hypergraph to obtain a close communication network hypergraph of the off-network user.
可选的,所述分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,包括:Optionally, the comparing the similarity of each third communication object set with the first communication object set respectively includes:
基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Based on the number of identical communication objects in the first communication object set and each third communication object set, the similarity between the off-network user and each second communication object is determined.
可选的,所述确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象之后,所述方法还包括:Optionally, after determining that the target second communication object corresponding to the third communication object set having the highest similarity to the first communication object set is the changed object of the off-network user, the method further includes:
确定所述离网用户的常驻位置信息;Determining the permanent location information of the off-network user;
将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The information of the target second communication object and the resident location information are pushed to the marketing terminal.
第二方面,本发明实施例还提供一种离网用户识别装置,包括:In a second aspect, an embodiment of the present invention further provides an off-network user identification device, comprising:
第一获取模块,用于获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;A first acquisition module, used to acquire a first communication object set of an off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency;
第二获取模块,用于获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;A second acquisition module, configured to acquire all communication objects of the first communication object in the first communication object set within a second preset time period, to obtain a second communication object set;
第三获取模块,用于获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;A third acquisition module, used to acquire a third communication object set of each second communication object in the second communication object set within a third preset time period, wherein the third communication object set of each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency;
第一确定模块,用于分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。The first determining module is used to compare the similarity of each third communication object set with the first communication object set respectively, and determine that the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set is the changed object of the off-network user.
可选的,所述第一获取模块包括:Optionally, the first acquisition module includes:
获取单元,用于获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;An acquisition unit, configured to acquire communication data of an off-network user within a first preset time period, wherein the communication data includes a communication object and a communication frequency;
确定单元,用于根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。The determination unit is used to determine the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency according to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are same-network users, so as to obtain a first communication object set.
可选的,所述第一获取模块还包括:Optionally, the first acquisition module further includes:
构建单元,用于根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;A construction unit, configured to construct a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
所述确定单元用于剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The determining unit is used to eliminate the communication objects in the communication network hypergraph that are in different networks from the off-network user, and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency, to obtain a close communication network hypergraph of the off-network user.
可选的,所述第一确定模块用于基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Optionally, the first determining module is used to determine the similarity between the off-network user and each second communication object based on the number of identical communication objects in the first communication object set and each third communication object set.
可选的,所述离网用户识别装置还包括:Optionally, the off-grid user identification device further includes:
第二确定模块,用于确定所述离网用户的常驻位置信息;A second determination module is used to determine the permanent location information of the off-network user;
推送模块,用于将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The push module is used to push the information of the target second communication object and the permanent location information to the marketing terminal.
第三方面,本发明实施例还提供一种离网用户识别装置,包括:收发机、存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如上所述的离网用户识别方法中的步骤。In a third aspect, an embodiment of the present invention further provides an off-network user identification device, comprising: a transceiver, a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps in the off-network user identification method as described above when executing the computer program.
第四方面,本发明实施例还提供一种计算机可读存储介质,所述计算机可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现如上所述的离网用户识别方法中的步骤。In a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the steps in the off-network user identification method as described above are implemented.
在本发明实施例中,获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。这样,通过基于离网用户的通讯数据,对离网用户变更后的信息进行精准识别,可以提高离网用户的识别准确率,进而可对离网用户进行针对性推广,提高转化率。In an embodiment of the present invention, a first communication object set of an off-network user within a first preset time period is obtained, wherein the first communication object set is a set of communication objects on the same network whose communication frequency with the off-network user is higher than the first preset frequency; all communication objects on different networks of the first communication objects in the first communication object set within a second preset time period are obtained to obtain a second communication object set; a third communication object set of each second communication object in the second communication object set within a third preset time period is obtained, wherein the third communication object set of each second communication object is a set of communication objects on different networks whose communication frequency with the second communication object is higher than the second preset frequency; each third communication object set is respectively compared with the first communication object set for similarity, and the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set is determined to be the object after the off-network user is changed. In this way, by accurately identifying the changed information of the off-network user based on the communication data of the off-network user, the identification accuracy of the off-network user can be improved, and then the off-network user can be promoted in a targeted manner to improve the conversion rate.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本发明实施例的技术方案,下面将对本发明实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the accompanying drawings required for use in the description of the embodiments of the present invention will be briefly introduced below. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For ordinary technicians in this field, other accompanying drawings can be obtained based on these accompanying drawings without paying creative labor.
图1是本发明实施例提供的离网用户识别方法的流程图;FIG1 is a flow chart of an off-network user identification method provided by an embodiment of the present invention;
图2是本发明实施例提供的离网用户的通讯社交网络超图的示意图;FIG2 is a schematic diagram of a communication social network hypergraph of an off-network user provided by an embodiment of the present invention;
图3是本发明实施例提供的离网用户的紧密社交网络超图的示意图;FIG3 is a schematic diagram of a close social network hypergraph of an off-network user provided by an embodiment of the present invention;
图4是本发明实施例提供的离网用户的紧密交际圈好友的通讯社交网络超图示意图;FIG4 is a schematic diagram of a communication social network hypergraph of close social circle friends of an off-network user provided by an embodiment of the present invention;
图5是本发明实施例提供的离网用户识别装置的结构图之一;FIG5 is a structural diagram of an off-network user identification device provided in an embodiment of the present invention;
图6是本发明实施例提供的离网用户识别装置的结构图之二;6 is a second structural diagram of an off-network user identification device provided in an embodiment of the present invention;
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.
参见图1,图1是本发明实施例提供的离网用户识别方法的流程图,如图1所示,包括以下步骤:Referring to FIG. 1 , FIG. 1 is a flow chart of an off-network user identification method provided by an embodiment of the present invention. As shown in FIG. 1 , the method comprises the following steps:
步骤101、获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合。Step 101: Acquire a first communication object set of an off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency.
上述离网用户可以是指注销或停机的目标运营商号码,目标运营商可以是需要进行离网用户识别的运营商。上述第一预设时间段可以是离网用户离网前一段时间,如离网前一个月或三个月。上述通讯对象可以是指与离网用户有通讯往来,如有通话、短信记录的对象。The off-network user may refer to a target operator number that is deregistered or shut down, and the target operator may be an operator that needs to identify the off-network user. The first preset time period may be a period of time before the off-network user leaves the network, such as one month or three months before leaving the network. The communication object may refer to an object that has communication with the off-network user, such as a call or text message record.
例如,想要识别第一运营商的离网用户时,可以获取该离网用户在离网前三个月内的通讯社交数据,并从中获得该离网用户的通讯对象集。For example, when you want to identify an off-network user of the first operator, you can obtain the communication and social data of the off-network user within three months before leaving the network, and obtain the communication object set of the off-network user from it.
在获得离网用户在第一预设时间段内的通讯对象集后,还可以进一步对其通讯对象集进行筛选剔除,以保留其交往较为密切的同网通讯对象,如去除其中与所述离网用户通讯频次较低的通讯对象,以及去除异网通讯对象,也即可以保留与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集,其中,所述第一预设频次可以是根据需求设定的频次,如将第一预设时间段内与离网用户通讯次数少于3次的通讯对象删除,或者,将近1个月内与离网用户没有通讯往来的通讯对象删除,所述异网通讯对象可以是指与所述离网用户所属的运营商不同的通讯对象,所述同网通讯对象则是指与所述离网用户所属的运营商相同的通讯对象。After obtaining the communication object set of the off-network user within the first preset time period, the communication object set can be further screened and eliminated to retain the same-network communication objects with which the user has closer interactions, such as removing communication objects with which the off-network user has a lower communication frequency, and removing communication objects from different networks, that is, the same-network communication objects with which the off-network user has a communication frequency higher than the first preset frequency can be retained to obtain the first communication object set, wherein the first preset frequency can be a frequency set according to demand, such as deleting communication objects that have communicated with the off-network user less than 3 times within the first preset time period, or deleting communication objects that have not communicated with the off-network user in nearly 1 month. The different-network communication objects may refer to communication objects of a different operator from that of the off-network user, and the same-network communication objects refer to communication objects of the same operator as that of the off-network user.
可选的,所述步骤101包括:Optionally, step 101 includes:
获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;Acquire communication data of the off-network user within a first preset time period, the communication data including a communication object and a communication frequency;
根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。According to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are on the same network, the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency are determined to obtain a first communication object set.
即一种实施方式中,可以先获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象、与各通讯对象的通讯频次、通讯方式等,其中,通讯对象包括普通用户和机构用户等,通讯方式包括电话、短信、5G消息等,所述通讯数据还可包括主动和被动方信息,如主动给其他用户拨打电话、主动发起5G消息、收到某机构的短信等。That is, in one implementation, the communication data of the off-network user within the first preset time period can be first obtained, and the communication data includes the communication object, the communication frequency with each communication object, the communication method, etc., wherein the communication object includes ordinary users and institutional users, etc., and the communication method includes telephone, text message, 5G message, etc. The communication data can also include active and passive party information, such as actively calling other users, actively initiating 5G messages, receiving text messages from an organization, etc.
然后,可根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,通讯对象是否为机构用户等,去除其中一些交往不密切的通讯对象,如去除机构用户、去除异网通讯对象、去除通讯频次较低的通讯对象,从而得到所述离网用户的同网紧密好友集,也即确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。这样,可快速确定离网用户的紧密通讯对象集。Then, according to the communication frequency between each communication object and the off-network user, and whether each communication object and the off-network user are users of the same network, whether the communication object is an institutional user, etc., some communication objects with whom there is not close contact can be removed, such as removing institutional users, removing communication objects on different networks, and removing communication objects with low communication frequency, so as to obtain the close friends set of the off-network user on the same network, that is, to determine the communication objects on the same network whose communication frequency with the off-network user is higher than the first preset frequency, and obtain the first communication object set. In this way, the close communication object set of the off-network user can be quickly determined.
可选的,所述获取离网用户在第一预设时间段内的通讯数据之后,所述方法还包括:Optionally, after acquiring the communication data of the off-network user within the first preset time period, the method further includes:
根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;Constructing a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
所述根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,包括:The determining, according to the communication frequency between each communication object and the off-network user, and whether each communication object and the off-network user are on the same network, a same-network communication object whose communication frequency with the off-network user is higher than a first preset frequency, comprises:
剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The communication objects in different networks from the off-network user and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency are eliminated from the communication network hypergraph to obtain a close communication network hypergraph of the off-network user.
一种实施方式中,在获得离网用户的通讯数据后,可以基于所述通讯数据,构建所述离网用户的通讯网络超图,通过通讯网络超图记录所述离网用户与哪些通讯对象存在通讯往来,以及具体的通讯方式、通讯频次等。In one implementation, after obtaining the communication data of an off-network user, a communication network hypergraph of the off-network user can be constructed based on the communication data. The communication network hypergraph records which communication objects the off-network user has communication exchanges with, as well as the specific communication methods, communication frequency, etc.
例如,如图2所示,可依据离网用户的通讯社交数据,构建离网用户的通讯社交网络超图,通讯社交网络超图可以通过结构图的形式进行表征,比如G=(V,E,W),其中G表示通讯社交网络超图,V是节点集合,即与离网用户建立有通讯交集的用户或者机构,E是边集合,表示与离网用户有通讯关系和往来,W是边的权重矩阵,表示通讯频次。这样,如图2所示,V={同网好友1、同网好友2、…、同网好友n、异网好友1、异网好友2、…、异网好友m、机构1、机构2、…、机构t},E={离网用户-同网好友1、离网用户-异网好友1、离网用户-机构1、…},W={主动拨打电话7次,接收消息5次、…}。For example, as shown in FIG2, a communication social network hypergraph of an off-network user can be constructed based on the communication social data of the off-network user. The communication social network hypergraph can be represented in the form of a structural diagram, such as G = (V, E, W), where G represents a communication social network hypergraph, V is a node set, that is, users or organizations that have established communication intersections with the off-network user, E is an edge set, indicating that there is a communication relationship and contact with the off-network user, and W is an edge weight matrix, indicating the communication frequency. In this way, as shown in FIG2, V = {same network friend 1, same network friend 2, ..., same network friend n, different network friend 1, different network friend 2, ..., different network friend m, organization 1, organization 2, ..., organization t}, E = {off-network user-same network friend 1, off-network user-different network friend 1, off-network user-organization 1, ...}, W = {actively dialed 7 times, received messages 5 times, ...}.
然后,可以对离网用户的通讯社交网络超图进行优化,得到离网用户的紧密社交网络超图,具体包括:删除异网好友用户节点;删除非真实用户节点,比如机构用户节点;按照通讯业务时间和频次等维度删除弱关系节点用户,比如删除1个月内没有通讯交际的同网好友用户节点,删除历史时间段内通话次数少于3次的同网好友用户等。如图2所示,虚线矩形框标识的区域即为需要删除的用户节点,优化后得到的离网用户的紧密社交网络超图如图3所示。Then, the communication social network hypergraph of the off-network user can be optimized to obtain a close social network hypergraph of the off-network user, including: deleting the user nodes of friends on different networks; deleting non-real user nodes, such as institutional user nodes; deleting weak relationship node users according to dimensions such as communication service time and frequency, such as deleting the user nodes of friends on the same network who have not communicated within 1 month, deleting the users of friends on the same network who have made less than 3 calls in the historical time period, etc. As shown in Figure 2, the area marked by the dotted rectangle is the user node that needs to be deleted, and the close social network hypergraph of the off-network user obtained after optimization is shown in Figure 3.
通过离网用户的紧密社交网络超图,可得到离网用户的紧密交际圈好友集合,即V′={同网好友1、同网好友2、…、同网好友t},对应到图3所示的离网用户的紧密社交网络超图,可确定V′={同网好友1、同网好友2、同网好友3}。Through the close social network hypergraph of the off-network user, we can obtain the set of friends in the close social circle of the off-network user, that is, V′={friend 1 on the same network, friend 2 on the same network, …, friend t on the same network}. Corresponding to the close social network hypergraph of the off-network user shown in Figure 3, we can determine V′={friend 1 on the same network, friend 2 on the same network, friend 3 on the same network}.
这样,通过构建离网用户的通讯社交网络超图,可以较为直观地清楚离网用户的通讯对象信息,便于对离网用户的通讯数据进行快速直观地分析。In this way, by constructing a communication social network hypergraph of off-grid users, the communication object information of off-grid users can be more intuitively understood, making it easier to quickly and intuitively analyze the communication data of off-grid users.
步骤102、获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集。Step 102: Acquire all communication objects in different networks of the first communication objects in the first communication object set within a second preset time period to obtain a second communication object set.
在获得离网用户的第一通讯对象集后,可以进一步获取所述第一通讯对象集中各第一通讯对象在第二预设时间段内的异网通讯对象,得到由所有第一通讯对象的异网通讯对象组成的第二通讯对象集,具体可以是通过获取各第一通讯对象在第二预设时间段内的通讯数据来确定各第一通讯对象在该时间段内的异网通讯对象,所述第二预设时间段可以是离网用户离网后的一段时间,如离网后一个月或三个月。After obtaining the first communication object set of the off-network user, you can further obtain the inter-network communication objects of each first communication object in the first communication object set within a second preset time period to obtain a second communication object set consisting of the inter-network communication objects of all the first communication objects. Specifically, you can determine the inter-network communication objects of each first communication object in the time period by obtaining the communication data of each first communication object in the second preset time period. The second preset time period can be a period of time after the off-network user goes offline, such as one month or three months after going offline.
该步骤中,通过获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,可以初步确定离网用户变更网络后的用户范围,即可根据离网前和离网后的同一用户会拥有基本相同的好友圈,来锁定离网用户的范围。In this step, by obtaining the inter-network communication objects of all the first communication objects in the first communication object set within the second preset time period, the user range of the off-network user after changing the network can be preliminarily determined. The range of the off-network user can be locked based on the fact that the same user before and after leaving the network will have basically the same circle of friends.
需说明的是,该步骤中,也可依据前述实施方式中确定的离网用户的紧密交际圈好友的通讯数据,构建离网用户的紧密交际圈好友的通讯社交网络超图,其通讯社交网络超图也可以通过结构图的形式进行表征,比如G′=(V,E,W),其中G′表示通讯社交网络超图,V是节点集合,即与离网用户的紧密交际圈好友建立有通讯交集的用户或者机构,E是边集合,表示与离网用户的紧密交际圈好友有通讯关系和往来,W是边的权重矩阵,表示各种通讯行为的数量及属性等。离网用户的紧密交际圈好友的通讯社交网络超图可如图4所示。这样,依据离网用户的紧密交际圈好友的通讯社交网络超图,可获取图4中所有异网用户节点集合,即V-异网={异网好友1、异网好友2、…、异网好友g},图4中虚线矩形框标识的区域即为异网用户节点。It should be noted that in this step, a communication social network hypergraph of the off-network user's close social circle friends can also be constructed based on the communication data of the off-network user's close social circle friends determined in the aforementioned implementation mode, and the communication social network hypergraph can also be represented in the form of a structural diagram, such as G′=(V, E, W), where G′ represents the communication social network hypergraph, V is a node set, that is, users or institutions with communication intersections established with the off-network user's close social circle friends, E is an edge set, indicating that there are communication relationships and exchanges with the off-network user's close social circle friends, and W is an edge weight matrix, indicating the number and attributes of various communication behaviors. The communication social network hypergraph of the off-network user's close social circle friends can be shown in Figure 4. In this way, based on the communication social network hypergraph of the off-network user's close social circle friends, the node set of all users in different networks in Figure 4 can be obtained, that is, V-different network={different network friend 1, different network friend 2, ..., different network friend g}, and the area marked by the dotted rectangular box in Figure 4 is the different network user node.
步骤103、获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合。Step 103: Obtain a third communication object set for each second communication object in the second communication object set within a third preset time period, wherein the third communication object set for each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency.
对于所述第二通讯对象集中的每个第二通讯对象,可以分别获取每个第二通讯对象在第三预设时间段内与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象,得到每个第二通讯对象的第三通讯对象集,其中,所述第三预设时间段可以是离网后的一段时间,如可以是离网后一个月或三个月,所述第三预设时间段可以与上述第二预设时间段相同,也可以不同。即该步骤中,可以分别获取每个第二通讯对象的紧密异网通讯对象集。For each second communication object in the second communication object set, the inter-network communication objects with which each second communication object communicates with the second communication object at a frequency higher than the second preset frequency within the third preset time period can be obtained respectively, to obtain the third communication object set for each second communication object, wherein the third preset time period can be a period of time after leaving the network, such as one month or three months after leaving the network, and the third preset time period can be the same as or different from the second preset time period. That is, in this step, the close inter-network communication object set of each second communication object can be obtained respectively.
也就是说,可依次对如图4中所示的异网用户节点,采用上述构建用户的紧密社交网络超图的方式进行计算,得到所有异网用户的紧密社交网络超图集合,即G-异网用户的紧密社交网络超图,G-异网用户的紧密社交网络超图={G-异网用户1的紧密社交网络超图、G-异网用户2的紧密社交网络超图、…、G-异网用户g的紧密社交网络超图}。That is to say, the nodes of heterogeneous users as shown in FIG. 4 can be calculated in turn using the method of constructing the user's close social network hypergraph, and a set of close social network hypergraphs of all heterogeneous users can be obtained, namely, a close social network hypergraph of G-heterogeneous users, where the close social network hypergraph of G-heterogeneous users = {G-the close social network hypergraph of heterogeneous user 1, G-the close social network hypergraph of heterogeneous user 2, ..., G-the close social network hypergraph of heterogeneous user g}.
步骤104、分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。Step 104: Compare the similarity of each third communication object set with the first communication object set respectively, and determine that the target second communication object corresponding to the third communication object set having the highest similarity with the first communication object set is the changed object of the off-network user.
本发明实施例中,由于离网用户变更后的号码很可能为第二通讯对象集中某个第二通讯对象,因此,可以通过比较离网用户与各第二通讯对象的紧密通讯对象,看是否存在较多的共同好友,来确定二者是否为离网前后的同一用户。In the embodiment of the present invention, since the changed number of the off-network user is likely to be a second communication object in the second communication object set, it is possible to determine whether the off-network user is the same user before and after leaving the network by comparing the close communication objects of the off-network user and each second communication object to see whether there are many common friends.
具体地,可以将每个第二通讯对象的紧密异网通讯对象与离网用户的紧密同网通讯对象进行比较,也即比较每个第三通讯对象集与第一通讯对象集的相似度,若某个第三通讯对象集与第一通讯对象集的相似度越大,则表明二者中的通讯对象越一致,对应地,该第三通讯对象集对应的第二通讯对象与所述离网用户为同一用户的可能性也越大,因此,可以在分别比较每个第三通讯对象集与所述第一通讯对象集后,将与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象确定为所述离网用户变更后的对象。Specifically, the close inter-network communication objects of each second communication object can be compared with the close intra-network communication objects of the off-network user, that is, the similarity between each third communication object set and the first communication object set can be compared. If the similarity between a third communication object set and the first communication object set is greater, it means that the communication objects in the two are more consistent. Correspondingly, the possibility that the second communication object corresponding to the third communication object set is the same user as the off-network user is also greater. Therefore, after comparing each third communication object set with the first communication object set respectively, the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set can be determined as the changed object of the off-network user.
可选的,所述分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,包括:Optionally, the comparing the similarity of each third communication object set with the first communication object set respectively includes:
基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Based on the number of identical communication objects in the first communication object set and each third communication object set, the similarity between the off-network user and each second communication object is determined.
一种实施方式中,可以通过比较所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,来确定所述离网用户与每个第二通讯对象的相似度,具体地,相同通讯对象也即共同好友的数量越多,判定二者相似度越高,反之相似度越低,例如,在比较所述第一通讯对象集与每个第三通讯对象集中各通讯对象时,发现所述第一通讯对象集与某个第三通讯对象集中号码相同的通讯对象数量最多时,可以确定该第三通讯对象集对应的第二通讯对象为所述离网用户变更后的号码。In one implementation, the similarity between the off-network user and each second communication object can be determined by comparing the number of identical communication objects in the first communication object set and each third communication object set. Specifically, the more identical communication objects, or common friends, the higher the similarity between the two, and vice versa. For example, when comparing the first communication object set with each communication object in each third communication object set, if it is found that the number of communication objects with the same number as that in the first communication object set and a certain third communication object set is the largest, it can be determined that the second communication object corresponding to the third communication object set is the changed number of the off-network user.
当然,在比较两个通讯对象集的相似度时,还可以进一步比较各第二通讯对象与共同好友的通讯频次,通讯频次越高,则表明相似度越高,倾向于是离网前后同一用户的可能性越大。Of course, when comparing the similarity of two communication object sets, we can further compare the communication frequency between each second communication object and the common friend. The higher the communication frequency, the higher the similarity, and the greater the possibility that it is the same user before and after leaving the network.
这样,通过基于两个通讯对象集中共同好友的数量,来确定离网用户与每个第二通讯对象的相似度,可以保证比较结果较为准确。In this way, by determining the similarity between the off-network user and each second communication object based on the number of common friends in the two communication object sets, it is possible to ensure that the comparison result is relatively accurate.
即在比较所述离网用户与每个第二通讯对象的相似度时,可以是将离网用户的紧密社交网络超图与G-异网用户的紧密社交网络超图中的每个子图逐一匹配,完成相似度计算,最为相似的即为离网用户节点。That is, when comparing the similarity between the off-network user and each second communication object, the off-network user's close social network hypergraph can be matched one by one with each subgraph in the close social network hypergraph of the G-different network user to complete the similarity calculation, and the most similar one is the off-network user node.
具体地,可记离网用户节点为x,对于每个异网用户节点y,可基于共同好友来计算两个用户节点的相似度,计算公式如下:其中,Sxy为相似度,Γ(x)为离网用户节点x的紧密同网好友节点的集合,Γ(y)为异网用户节点y的紧密异网好友节点的集合,k(x)为离网用户节点x的紧密同网好友数量,k(y)为异网用户节点y的紧密异网好友数量,即两个节点的共同好友的数量越多,且与共同好友的边的权重占比越大,则表明这两个节点越相似,也倾向于是离网前后同一用户的可能性越大。Specifically, the off-network user node can be recorded as x. For each off-network user node y, the similarity between the two user nodes can be calculated based on common friends. The calculation formula is as follows: Among them, Sxy is the similarity, Γ(x) is the set of close friends of the off-network user node x, Γ(y) is the set of close friends of the off-network user node y, k(x) is the number of close friends of the off-network user node x, k(y) is the number of close friends of the off-network user node y, that is, the more common friends two nodes have and the greater the weight of the edges with common friends, the more similar the two nodes are, and the greater the possibility that they are the same user before and after leaving the network.
可选的,所述步骤104之后,所述方法还包括:Optionally, after step 104, the method further includes:
确定所述离网用户的常驻位置信息;Determining the permanent location information of the off-network user;
将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The information of the target second communication object and the resident location information are pushed to the marketing terminal.
即在确定变更后的离网用户信息后,还可进一步获取离网用户的常驻位置信息,具体可以是根据离网用户在一定周期内,如离网前一个月使用的终端定位等位置数据,计算用户夜间常驻位置,进而判断出用户所在的网格信息或者小区信息。然后,可将所述离网用户的常驻位置信息和所述目标第二通讯对象的信息如手机号码发送给营销终端,如网格经理,以便于目标运营商针对识别出的离网用户展开精准的二次业务推广,保障当前用户量的规模。That is, after determining the changed off-network user information, the off-network user's permanent location information can be further obtained. Specifically, the user's nighttime permanent location can be calculated based on the terminal positioning data used by the off-network user within a certain period, such as one month before leaving the network, and then the grid information or cell information of the user can be determined. Then, the off-network user's permanent location information and the target second communication object's information, such as the mobile phone number, can be sent to a marketing terminal, such as a grid manager, so that the target operator can carry out accurate secondary business promotion for the identified off-network users and ensure the scale of the current user volume.
当然,为了提高业务质量,还可以进一步分析离网用户的后台数据,如根据离网前用户行为数据和网络质量等数据等进行分析,以获得离网用户的离网原因,并将离网原因一并推送给网格经理,便于后续针对性地提高业务服务质量,避免现网客户的流失。Of course, in order to improve service quality, the background data of off-grid users can be further analyzed, such as analysis based on user behavior data and network quality data before leaving the network, to obtain the reasons for leaving the network, and push the reasons to the grid manager to facilitate subsequent targeted improvement of service quality and avoid the loss of existing network customers.
本发明实施例的离网用户识别方法,获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。这样,通过基于离网用户的通讯数据,对离网用户变更后的信息进行精准识别,可以提高离网用户的识别准确率,进而可对离网用户进行针对性推广,提高转化率。The off-network user identification method of the embodiment of the present invention obtains the first communication object set of the off-network user within the first preset time period, wherein the first communication object set is a set of same-network communication objects whose communication frequency with the off-network user is higher than the first preset frequency; obtains the different-network communication objects of all the first communication objects in the first communication object set within the second preset time period to obtain the second communication object set; obtains the third communication object set of each second communication object in the second communication object set within the third preset time period, wherein the third communication object set of each second communication object is a set of different-network communication objects whose communication frequency with the second communication object is higher than the second preset frequency; and compares the similarity of each third communication object set with the first communication object set respectively, and determines that the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set is the object after the off-network user is changed. In this way, by accurately identifying the changed information of the off-network user based on the communication data of the off-network user, the identification accuracy of the off-network user can be improved, and then the off-network user can be promoted in a targeted manner to improve the conversion rate.
本发明实施例还提供了一种离网用户识别装置。参见图5,图5是本发明实施例提供的离网用户识别装置的结构图。由于离网用户识别装置解决问题的原理与本发明实施例中离网用户识别方法相似,因此该离网用户识别装置的实施可以参见方法的实施,重复之处不再赘述。The embodiment of the present invention further provides an off-network user identification device. See FIG5 , which is a structural diagram of the off-network user identification device provided by the embodiment of the present invention. Since the principle of solving the problem by the off-network user identification device is similar to the off-network user identification method in the embodiment of the present invention, the implementation of the off-network user identification device can refer to the implementation of the method, and the repeated parts will not be repeated.
如图5所示,离网用户识别装置500包括:As shown in FIG5 , the off-network user identification device 500 includes:
第一获取模块501,用于获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;A first acquisition module 501 is used to acquire a first communication object set of an off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency;
第二获取模块502,用于获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;A second acquisition module 502 is used to acquire all the first communication objects in the first communication object set within a second preset time period to obtain a second communication object set;
第三获取模块503,用于获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;A third acquisition module 503 is used to acquire a third communication object set of each second communication object in the second communication object set within a third preset time period, wherein the third communication object set of each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency;
第一确定模块504,用于分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。The first determining module 504 is used to compare the similarity between each third communication object set and the first communication object set, and determine that the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set is the changed object of the off-network user.
可选的,第一获取模块501包括:Optionally, the first acquisition module 501 includes:
获取单元,用于获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;An acquisition unit, configured to acquire communication data of an off-network user within a first preset time period, wherein the communication data includes a communication object and a communication frequency;
确定单元,用于根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。The determination unit is used to determine the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency according to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are same-network users, so as to obtain a first communication object set.
可选的,第一获取模块501还包括:Optionally, the first acquisition module 501 further includes:
构建单元,用于根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;A construction unit, configured to construct a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
所述确定单元用于剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The determining unit is used to eliminate the communication objects in the communication network hypergraph that are in different networks from the off-network user, and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency, to obtain a close communication network hypergraph of the off-network user.
可选的,第一确定模块504用于基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Optionally, the first determining module 504 is configured to determine the similarity between the off-network user and each second communication object based on the number of identical communication objects in the first communication object set and each third communication object set.
可选的,离网用户识别装置500还包括:Optionally, the off-network user identification device 500 further includes:
第二确定模块,用于确定所述离网用户的常驻位置信息;A second determination module is used to determine the permanent location information of the off-network user;
推送模块,用于将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The push module is used to push the information of the target second communication object and the permanent location information to the marketing terminal.
本发明实施例提供的离网用户识别装置,可以执行上述方法实施例,其实现原理和技术效果类似,本实施例此处不再赘述。The off-network user identification device provided in the embodiment of the present invention can execute the above method embodiment, and its implementation principle and technical effect are similar, which will not be repeated in this embodiment.
本发明实施例的离网用户识别装置500,获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。这样,通过基于离网用户的通讯数据,对离网用户变更后的信息进行精准识别,可以提高离网用户的识别准确率,进而可对离网用户进行针对性推广,提高转化率。The off-network user identification device 500 of the embodiment of the present invention obtains the first communication object set of the off-network user within the first preset time period, wherein the first communication object set is a set of communication objects on the same network whose communication frequency with the off-network user is higher than the first preset frequency; obtains the communication objects of different networks of all the first communication objects in the first communication object set within the second preset time period to obtain the second communication object set; obtains the third communication object set of each second communication object in the second communication object set within the third preset time period, wherein the third communication object set of each second communication object is a set of communication objects on different networks whose communication frequency with the second communication object is higher than the second preset frequency; respectively compares the similarity of each third communication object set with the first communication object set, and determines that the target second communication object corresponding to the third communication object set with the highest similarity to the first communication object set is the object after the off-network user is changed. In this way, by accurately identifying the changed information of the off-network user based on the communication data of the off-network user, the identification accuracy of the off-network user can be improved, and then the off-network user can be promoted in a targeted manner to improve the conversion rate.
本发明实施例还提供了一种离网用户识别装置。由于离网用户识别装置解决问题的原理与本发明实施例中离网用户识别方法相似,因此该离网用户识别装置的实施可以参见方法的实施,重复之处不再赘述。如图6所示,本发明实施例的离网用户识别装置,包括:处理器600,用于读取存储器620中的程序,执行下列过程:The embodiment of the present invention also provides an off-network user identification device. Since the principle of solving the problem by the off-network user identification device is similar to that of the off-network user identification method in the embodiment of the present invention, the implementation of the off-network user identification device can refer to the implementation of the method, and the repeated parts will not be repeated. As shown in FIG6, the off-network user identification device in the embodiment of the present invention includes: a processor 600, which is used to read the program in the memory 620 and execute the following process:
获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;Acquire a first communication object set of the off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency;
获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;Acquire all the communication objects of the first communication object in the first communication object set in the second preset time period to obtain a second communication object set;
获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;Acquire a third communication object set of each second communication object in the second communication object set within a third preset time period, wherein the third communication object set of each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency;
分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。The similarity between each third communication object set and the first communication object set is compared respectively, and the target second communication object corresponding to the third communication object set having the highest similarity to the first communication object set is determined as the changed object of the off-network user.
收发机610,用于在处理器600的控制下接收和发送数据。The transceiver 610 is configured to receive and send data under the control of the processor 600 .
其中,在图6中,总线架构可以包括任意数量的互联的总线和桥,具体由处理器600代表的一个或多个处理器和存储器620代表的存储器的各种电路链接在一起。总线架构还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口提供接口。收发机610可以是多个元件,即包括发送机和收发机,提供用于在传输介质上与各种其他装置通信的单元。处理器600负责管理总线架构和通常的处理,存储器620可以存储处理器600在执行操作时所使用的数据。In FIG. 6 , the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by processor 600 and various circuits of memory represented by memory 620 are linked together. The bus architecture may also link together various other circuits such as peripherals, voltage regulators, and power management circuits, which are well known in the art and are therefore not further described herein. The bus interface provides an interface. The transceiver 610 may be a plurality of components, namely, a transmitter and a transceiver, providing a unit for communicating with various other devices on a transmission medium. The processor 600 is responsible for managing the bus architecture and general processing, and the memory 620 may store data used by the processor 600 when performing operations.
可选的,处理器600还用于读取存储器620中的程序,执行如下步骤:Optionally, the processor 600 is further configured to read a program in the memory 620 and execute the following steps:
获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;Acquire communication data of the off-network user within a first preset time period, the communication data including a communication object and a communication frequency;
根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。According to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are on the same network, the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency are determined to obtain a first communication object set.
可选的,处理器600还用于读取存储器620中的程序,执行如下步骤:Optionally, the processor 600 is further configured to read a program in the memory 620 and execute the following steps:
根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;Constructing a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The communication objects in different networks from the off-network user and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency are eliminated from the communication network hypergraph to obtain a close communication network hypergraph of the off-network user.
可选的,处理器600还用于读取存储器620中的程序,执行如下步骤:Optionally, the processor 600 is further configured to read a program in the memory 620 and execute the following steps:
基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Based on the number of identical communication objects in the first communication object set and each third communication object set, the similarity between the off-network user and each second communication object is determined.
可选的,处理器600还用于读取存储器620中的程序,执行如下步骤:Optionally, the processor 600 is further configured to read a program in the memory 620 and execute the following steps:
确定所述离网用户的常驻位置信息;Determining the permanent location information of the off-network user;
将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The information of the target second communication object and the resident location information are pushed to the marketing terminal.
本发明实施例提供的离网用户识别装置,可以执行上述方法实施例,其实现原理和技术效果类似,本实施例此处不再赘述。The off-network user identification device provided in the embodiment of the present invention can execute the above method embodiment, and its implementation principle and technical effect are similar, which will not be repeated in this embodiment.
此外,本发明实施例的计算机可读存储介质,用于存储计算机程序,所述计算机程序可被处理器执行实现以下步骤:In addition, the computer-readable storage medium of the embodiment of the present invention is used to store a computer program, and the computer program can be executed by a processor to implement the following steps:
获取离网用户在第一预设时间段内的第一通讯对象集,其中,所述第一通讯对象集为与所述离网用户通讯频次高于第一预设频次的同网通讯对象的集合;Acquire a first communication object set of the off-network user within a first preset time period, wherein the first communication object set is a collection of same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency;
获取所述第一通讯对象集中所有第一通讯对象在第二预设时间段内的异网通讯对象,得到第二通讯对象集;Acquire all the communication objects of the first communication object in the first communication object set in the second preset time period to obtain a second communication object set;
获取所述第二通讯对象集中每个第二通讯对象在第三预设时间段内的第三通讯对象集,其中,每个第二通讯对象的第三通讯对象集为与该第二通讯对象通讯频次高于第二预设频次的异网通讯对象的集合;Acquire a third communication object set of each second communication object in the second communication object set within a third preset time period, wherein the third communication object set of each second communication object is a set of communication objects in different networks whose communication frequency with the second communication object is higher than the second preset frequency;
分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,并确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象。The similarity between each third communication object set and the first communication object set is compared respectively, and the target second communication object corresponding to the third communication object set having the highest similarity to the first communication object set is determined as the changed object of the off-network user.
可选的,所述获取离网用户在第一预设时间段内的第一通讯对象集,包括:Optionally, obtaining a first communication object set of the off-network user within a first preset time period includes:
获取离网用户在第一预设时间段内的通讯数据,所述通讯数据包括通讯对象和通讯频次;Acquire communication data of the off-network user within a first preset time period, the communication data including a communication object and a communication frequency;
根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,得到第一通讯对象集。According to the communication frequency between each communication object and the off-network user and whether each communication object and the off-network user are on the same network, the same-network communication objects whose communication frequency with the off-network user is higher than a first preset frequency are determined to obtain a first communication object set.
可选的,所述获取离网用户在第一预设时间段内的通讯数据之后,所述方法还包括:Optionally, after acquiring the communication data of the off-network user within the first preset time period, the method further includes:
根据所述通讯数据,构建所述离网用户的通讯网络超图,其中,所述通讯网络超图包括通讯对象和各通讯对象与所述离网用户的通讯频次;Constructing a communication network hypergraph of the off-network user according to the communication data, wherein the communication network hypergraph includes communication objects and communication frequencies between each communication object and the off-network user;
所述根据各通讯对象与所述离网用户的通讯频次,以及各通讯对象与所述离网用户是否为同网用户,确定与所述离网用户通讯频次高于第一预设频次的同网通讯对象,包括:The determining, according to the communication frequency between each communication object and the off-network user, and whether each communication object and the off-network user are on the same network, a same-network communication object whose communication frequency with the off-network user is higher than a first preset frequency, comprises:
剔除所述通讯网络超图中与所述离网用户异网的通讯对象,以及与所述离网用户通讯频次低于或等于所述第一预设频次的通讯对象,得到所述离网用户的紧密通讯网络超图。The communication objects in different networks from the off-network user and the communication objects whose communication frequency with the off-network user is lower than or equal to the first preset frequency are eliminated from the communication network hypergraph to obtain a close communication network hypergraph of the off-network user.
可选的,所述分别将每个第三通讯对象集与所述第一通讯对象集进行相似度比较,包括:Optionally, the comparing the similarity of each third communication object set with the first communication object set respectively includes:
基于所述第一通讯对象集与每个第三通讯对象集中相同通讯对象的数量,确定所述离网用户与每个第二通讯对象的相似度。Based on the number of identical communication objects in the first communication object set and each third communication object set, the similarity between the off-network user and each second communication object is determined.
可选的,所述确定与所述第一通讯对象集相似度最高的第三通讯对象集对应的目标第二通讯对象为所述离网用户变更后的对象之后,所述方法还包括:Optionally, after determining that the target second communication object corresponding to the third communication object set having the highest similarity to the first communication object set is the changed object of the off-network user, the method further includes:
确定所述离网用户的常驻位置信息;Determining the permanent location information of the off-network user;
将所述目标第二通讯对象的信息和所述常驻位置信息推送给营销终端。The information of the target second communication object and the resident location information are pushed to the marketing terminal.
在本申请所提供的几个实施例中,应该理解到,所揭露方法和装置,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed methods and devices can be implemented in other ways. For example, the device embodiments described above are only schematic. For example, the division of the units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be an indirect coupling or communication connection through some interfaces, devices or units, which can be electrical, mechanical or other forms.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理包括,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be physically included separately, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware or in the form of hardware plus software functional units.
上述以软件功能单元的形式实现的集成的单元,可以存储在一个计算机可读取存储介质中。上述软件功能单元存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述收发方法的部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The above-mentioned integrated unit implemented in the form of a software functional unit can be stored in a computer-readable storage medium. The above-mentioned software functional unit is stored in a storage medium, including a number of instructions for a computer device (which can be a personal computer, a server, or a network device, etc.) to perform some steps of the sending and receiving method described in each embodiment of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), disk or optical disk and other media that can store program codes.
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以作出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above is a preferred embodiment of the present invention. It should be pointed out that for ordinary technicians in this technical field, several improvements and modifications can be made without departing from the principles of the present invention. These improvements and modifications should also be regarded as the scope of protection of the present invention.
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110712358.7A CN115529581B (en) | 2021-06-25 | 2021-06-25 | Off-network user identification method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110712358.7A CN115529581B (en) | 2021-06-25 | 2021-06-25 | Off-network user identification method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115529581A CN115529581A (en) | 2022-12-27 |
CN115529581B true CN115529581B (en) | 2024-08-02 |
Family
ID=84693524
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110712358.7A Active CN115529581B (en) | 2021-06-25 | 2021-06-25 | Off-network user identification method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115529581B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101729682A (en) * | 2009-11-11 | 2010-06-09 | 南京联创科技集团股份有限公司 | Method for automatically tracing communication network users |
CN108712269A (en) * | 2018-05-30 | 2018-10-26 | 中国联合网络通信集团有限公司 | The method for retrieving and device of off-network user |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2467597B (en) * | 2009-02-10 | 2012-12-26 | Oracle Int Corp | Integrated communication system and method |
US20140280237A1 (en) * | 2013-03-18 | 2014-09-18 | Share This Inc. | Method and system for identifying sets of social look-alike users |
WO2017071369A1 (en) * | 2015-10-31 | 2017-05-04 | 华为技术有限公司 | Method and device for predicting user unsubscription |
CN107111651A (en) * | 2016-07-29 | 2017-08-29 | 深圳越界创新科技有限公司 | A kind of matching degree computational methods, device and user equipment |
CN108712287B (en) * | 2018-05-22 | 2020-12-29 | 同济大学 | VANET community discovery method based on node similarity |
-
2021
- 2021-06-25 CN CN202110712358.7A patent/CN115529581B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101729682A (en) * | 2009-11-11 | 2010-06-09 | 南京联创科技集团股份有限公司 | Method for automatically tracing communication network users |
CN108712269A (en) * | 2018-05-30 | 2018-10-26 | 中国联合网络通信集团有限公司 | The method for retrieving and device of off-network user |
Also Published As
Publication number | Publication date |
---|---|
CN115529581A (en) | 2022-12-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102083010B (en) | Method and equipment for screening user information | |
CN110337059B (en) | Analysis algorithm, server and network system for family relationship of user | |
CN104704524B (en) | Information disclosure system, information disclosure server, communication terminal, information disclosure method, and non-transitory computer-readable medium | |
CN103605791B (en) | Information transmission system and information-pushing method | |
US8249231B2 (en) | System and computer program product for predicting churners in a telecommunications network | |
CN113556430B (en) | Outbound system and outbound method | |
CN112100250A (en) | Data processing method and device | |
CN102438205B (en) | Method and system for pushing service based on action of mobile user | |
CN112752256A (en) | Client portrait label determination method, device, equipment and storage medium | |
CN114547002A (en) | Method, device, electronic device and storage medium for identification of takeaway number | |
CN115470867A (en) | Seat matching method, device, equipment and storage medium based on knowledge graph | |
CN112307075B (en) | User relationship identification method and device | |
CN111026818A (en) | Multi-object associated view construction method, terminal device and storage medium | |
CN110677269B (en) | Method and device for determining communication user relationship and computer readable storage medium | |
CN115529581B (en) | Off-network user identification method and device | |
CN108647315A (en) | Collecting method and system | |
CN112184481A (en) | Smart grid arrearage processing method and device combining RPA and AI | |
CN104573034A (en) | CDR call ticket based user group division method and system | |
CN115348544B (en) | Different network user positioning method, terminal equipment and storage medium | |
CN115526647A (en) | Target user identification method and device | |
CN103458411A (en) | Communication account analysis device and method | |
CN119271894A (en) | Information processing method and device | |
CN106899742A (en) | Message registration management method and terminal | |
CN109919811A (en) | Insurance agent's culture scheme generation method and relevant device based on big data | |
CN115988475B (en) | Method, device and storage medium for predicting portable user |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |