CN110377854A - User access activity information monitoring method and device, computer equipment - Google Patents
User access activity information monitoring method and device, computer equipment Download PDFInfo
- Publication number
- CN110377854A CN110377854A CN201910469619.XA CN201910469619A CN110377854A CN 110377854 A CN110377854 A CN 110377854A CN 201910469619 A CN201910469619 A CN 201910469619A CN 110377854 A CN110377854 A CN 110377854A
- Authority
- CN
- China
- Prior art keywords
- data
- webpage
- web page
- information
- sharing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Transfer Between Computers (AREA)
Abstract
本发明公开了一种用户访问行为信息监控方法及装置、计算机设备,涉及网页处理技术领域,主要目的在于解决现有网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控,且一般的监控系统体量较大,对数据库及服务器的压力过大,数据查询困难且效率较低的问题。包括发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的数据进行存储,并生成存储位置信息;根据用户访问行为信息及所述数据对应的存储位置信息生成监控日志。
The invention discloses a method and device for monitoring user access behavior information, and computer equipment, and relates to the technical field of web page processing. The monitoring system has a large volume, the pressure on the database and server is too large, the data query is difficult and the efficiency is low. Including sending a monitoring data acquisition request, and counting the web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server; / or webpage sharing data and/or webpage page data are classified and aggregated respectively, the data obtained after classification and aggregation is stored, and storage location information is generated; monitoring logs are generated according to user access behavior information and storage location information corresponding to the data. .
Description
技术领域technical field
本发明涉及一种网页处理技术领域,特别是涉及一种用户访问行为信息监控方法及装置、计算机设备。The present invention relates to the technical field of web page processing, in particular to a method and device for monitoring user access behavior information, and computer equipment.
背景技术Background technique
随着网页技术的不断优化,为了实时的对网页状态进行监控,一般情况下会对网页的流量进行监控,以此来确定用户浏览情况。目前,用户访问行为信息即为用户通过浏览网页所产生的全部访问行为数据,现有的网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控,无法很好的满足业务人员通过对数据的监控分析用户行为的目的;另一方面,一般的监控系统体量较大,对数据库及网页服务器的存储压力过大,使得数据查询困难,降低监控数据的效率。With the continuous optimization of web technology, in order to monitor the status of the web page in real time, the traffic of the web page is generally monitored to determine the user's browsing situation. At present, user access behavior information is all the access behavior data generated by users browsing web pages. The existing web browsing monitoring only monitors data such as traffic pv, independent visitor uv, etc. The purpose of data monitoring and analysis of user behavior; on the other hand, the general monitoring system has a large volume, and the storage pressure on the database and web server is too large, which makes data query difficult and reduces the efficiency of monitoring data.
发明内容SUMMARY OF THE INVENTION
有鉴于此,本发明提供一种用户访问行为信息监控方法及装置、计算机设备,主要目的在于解决现有网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控,无法很好的满足业务人员通过对数据的监控分析用户行为的目的,且一般的监控系统体量较大,对数据库及服务器的压力过大,数据查询困难且效率较低的问题。In view of this, the present invention provides a method and device for monitoring user access behavior information, and computer equipment, the main purpose of which is to solve the problem that the existing web browsing monitoring only monitors data such as visits pv, independent visitors uv, etc., which cannot be well satisfied. Business personnel analyze the purpose of user behavior through data monitoring, and the general monitoring system is large in size, the pressure on the database and server is too large, data query is difficult and the efficiency is low.
依据本发明一个方面,提供了一种用户访问行为信息监控方法,包括:According to one aspect of the present invention, a method for monitoring user access behavior information is provided, comprising:
发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;Send a monitoring data acquisition request, and count the web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server;
按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息;According to the data structure, the statistics of web page access data, and/or web page sharing data, and/or web page data are respectively classified and aggregated, and the web page access data, and/or web page sharing data, and/or web pages obtained after the classification and aggregation are aggregated. Store page data and generate storage location information;
根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。The monitoring log is generated according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
进一步地,所述发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据包括:Further, the sending of the monitoring data acquisition request, and the web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the statistics web server include:
当检测到页面加载指令时,向所述网页服务端发送网页访问数据获取请求,并统计所述网页服务端返回的网页id、终端设备号、终端设备信息、终端位置信息、阅读来源信息、分享层级信息;和/或,When a page loading instruction is detected, a request for obtaining webpage access data is sent to the webpage server, and the webpage id, terminal device number, terminal device information, terminal location information, reading source information, sharing information returned by the webpage server are counted. Tier information; and/or,
当检测到网页分享指令时,向所述网页服务端发送网页分享数据获取请求,统计所述网页服务端返回的网页id、终端设备号、分享去向信息,并对所述网页的分享层级信息进行累计;和/或,When a webpage sharing instruction is detected, a webpage sharing data acquisition request is sent to the webpage server, the webpage id, terminal device number, and sharing destination information returned by the webpage server are counted, and the sharing level information of the webpage is analyzed. cumulative; and/or,
当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,统计所述网页服务端返回的网页页面点击信息。When a webpage page click instruction is detected, a webpage page data acquisition request is sent to the webpage server, and the webpage page click information returned by the webpage server is counted.
进一步地,所述当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,统计所述网页服务端返回的网页页面点击信息包括:Further, when a webpage page click instruction is detected, a webpage page data acquisition request is sent to the webpage server, and statistics on webpage click information returned by the webpage server include:
当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,并接收网页页面数据获取请求响应,所述网页页面数据获取请求响应中携带网页页面元素点击信息、网页页面浏览时间累积信息;根据所述网页页面元素点击信息、所述网页页面浏览时间累积信息统计网页页面点击信息。When a webpage page click instruction is detected, a webpage page data acquisition request is sent to the webpage server, and a webpage page data acquisition request response is received, and the webpage page data acquisition request response carries webpage page element click information, webpage page browsing information Time accumulation information; according to the webpage page element click information and the webpage page browsing time accumulation information, the webpage page click information is counted.
进一步地,所述按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息包括:Further, according to the data structure, the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data are classified and aggregated respectively, and the webpage access data and/or webpage sharing data obtained after the classification and aggregation are aggregated. , and/or web page data, and generate storage location information including:
解析统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据的数据结构;Analyze the statistics of web page access data, and/or web page sharing data, and/or the data structure of web page data;
当解析的数据结构为简单类型数据结构,则按照时限需求对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据聚合存储至不同存储容量的容器中,生成存储位置信息,所述简单类型数据结构为仅进行数据量增量变化的数据结构;或,When the parsed data structure is a simple type data structure, the web page access data, and/or the web page sharing data, and/or the web page data are aggregated and stored in containers with different storage capacities according to time-limited requirements to generate Storage location information, the simple type data structure is a data structure that only performs incremental changes in the amount of data; or,
当解析的数据结构为复杂类型数据结构,则对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据按照预设冗余条件进行过滤,将过滤后的网页访问数据、和/或网页分享数据、和/或网页页面数据按照用户id、浏览id、时间戳进行缓存,生成存储位置信息,所述复杂类型数据结构为对数据形式、数据表达方式进行处理的数据结构。When the parsed data structure is a complex type of data structure, the web page access data, and/or the web page sharing data, and/or the web page page data are filtered according to preset redundancy conditions, and the filtered web page is filtered. Access data, and/or web page sharing data, and/or web page data are cached according to user id, browsing id, and timestamp to generate storage location information. The complex type data structure is for processing data forms and data expressions. data structure.
进一步地,所述当解析的数据结构为复杂类型数据结构,则对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据按照预设冗余条件进行过滤,将过滤后的网页访问数据、和/或网页分享数据、和/或网页页面数据按照用户id、浏览id、时间戳进行缓存,生成存储位置信息包括:Further, when the parsed data structure is a complex type data structure, the webpage access data, and/or the webpage sharing data, and/or the webpage page data are filtered according to preset redundancy conditions, Cache the filtered web page access data, and/or web page sharing data, and/or web page data according to the user id, browsing id, and timestamp, and generate storage location information including:
当解析的数据结构为复杂类型数据结构,则按照用户id、浏览id、时间戳判断所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据是否为过期数据;若为非过期数据,则将所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据分别存储至与所述用户id、所述浏览id、所述时间戳对应的缓存位置中,生成存储位置信息。When the parsed data structure is a complex type of data structure, determine whether the webpage access data, and/or the webpage sharing data, and/or the webpage page data is expired data according to the user id, browsing id, and timestamp; If it is non-expired data, store the webpage access data, and/or the webpage sharing data, and/or the webpage page data respectively corresponding to the user id, the browsing id, and the timestamp In the cache location of , generate storage location information.
进一步地,所述根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志之后,所述方法还包括:Further, after the monitoring log is generated according to the webpage access data, and/or the webpage sharing data, and/or the storage location information corresponding to the webpage page data, the method further includes:
通过消息队列发送延时消息,读取所述延时消息中的用户访问行为信息及延时时间,将所述延时消息的时间戳与缓存后的用户id、浏览id对应的时间戳进行对比,若缓存后的用户id、浏览id对应的时间戳小于所述延时消息的时间戳,则确定所述用户访问行为信息为冗余数据,若缓存后的用户id、浏览id对应的时间戳大于所述延时消息的时间戳,则缓存所述用户访问行为信息。Send a delayed message through the message queue, read the user access behavior information and delay time in the delayed message, and compare the timestamp of the delayed message with the timestamp corresponding to the cached user id and browsing id , if the timestamp corresponding to the cached user id and browsing id is smaller than the timestamp of the delayed message, it is determined that the user access behavior information is redundant data, if the timestamp corresponding to the cached user id and browsing id If the time stamp is greater than the time stamp of the delayed message, the user access behavior information is cached.
进一步地,所述当解析的数据结构为简单类型数据结构,则按照时限需求对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据聚合存储至不同存储容量的容器中,生成存储位置信息之前,还包括:Further, when the parsed data structure is a simple type data structure, the web page access data, and/or the web page sharing data, and/or the web page data are aggregated and stored in different storage capacities according to time limit requirements. In the container of , before generating the storage location information, it also includes:
配置具有不同存储有效期、不同访问时长的预设个数的容器,所述存储有效期的确定方式为存储有效期=需求的最近时间,所述访问时长的确定方式为访问时长=统计量的精度,所述预设个数的设定方式为预设个数=存储有效期/访问时长。Configure a preset number of containers with different storage validity periods and different access durations. The storage validity period is determined in a manner of storage validity period=the most recent time required, and the access duration is determined in the manner of access duration=accuracy of statistics, so The setting method of the preset number is preset number=storage validity period/access duration.
依据本发明另一个方面,提供了一种用户访问行为信息监控装置,包括:According to another aspect of the present invention, a device for monitoring user access behavior information is provided, comprising:
统计模块,用于发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;A statistics module, used to send a monitoring data acquisition request, and to count web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server;
存储模块,用于存储按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息;The storage module is used to store the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data according to the data structure, respectively classify and aggregate the webpage access data obtained after classification and aggregation, and/or webpage sharing. data, and/or web page data, and generate storage location information;
生成模块,用于根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。A generating module, configured to generate a monitoring log according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
进一步地,所述统计模块包括:Further, the statistical module includes:
第一统计单元,用于当检测到页面加载指令时,向所述网页服务端发送网页访问数据获取请求,并统计所述网页服务端返回的网页id、终端设备号、终端设备信息、终端位置信息、阅读来源信息、分享层级信息;和/或,The first statistical unit is configured to send a request for obtaining webpage access data to the webpage server when a page loading instruction is detected, and count the webpage id, terminal device number, terminal device information, and terminal location returned by the webpage server information, reading source information, sharing tier information; and/or,
第二统计单元,用于当检测到网页分享指令时,向所述网页服务端发送网页分享数据获取请求,统计所述网页服务端返回的网页id、终端设备号、分享去向信息,并对所述网页的分享层级信息进行累计;和/或,The second statistical unit is configured to send a webpage sharing data acquisition request to the webpage server when a webpage sharing instruction is detected, count the webpage id, terminal device number, and sharing destination information returned by the webpage server, and analyze all the sharing level information of the pages described above; and/or,
第三统计单元,用于当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,统计所述网页服务端返回的网页页面点击信息。A third statistics unit, configured to send a request for obtaining web page data to the web server when a web page click instruction is detected, and to count the web page click information returned by the web server.
进一步地,所述第三统计单元,具体用于当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,并接收网页页面数据获取请求响应,所述网页页面数据获取请求响应中携带网页页面元素点击信息、网页页面浏览时间累积信息;根据所述网页页面元素点击信息、所述网页页面浏览时间累积信息统计网页页面点击信息。Further, the third statistical unit is specifically configured to send a web page data acquisition request to the web server when a web page click instruction is detected, and receive a web page data acquisition request response, the web page data acquisition The request response carries webpage element click information and webpage page browsing time accumulation information; the webpage page click information is counted according to the webpage element click information and the webpage page browsing time accumulation information.
进一步地,所述存储模块包括:Further, the storage module includes:
解析单元,用于解析统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据的数据结构;Parsing unit, used for parsing the statistics of webpage access data, and/or webpage sharing data, and/or the data structure of webpage page data;
第一存储单元,用于当解析的数据结构为简单类型数据结构,则按照时限需求对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据聚合存储至不同存储容量的容器中,生成存储位置信息,所述简单类型数据结构为仅进行数据量增量变化的数据结构;或,The first storage unit is used to aggregate and store the web page access data, and/or the web page sharing data, and/or the web page data to different data structures according to time limit requirements when the parsed data structure is a simple type data structure. In the container of the storage capacity, the storage location information is generated, and the simple type data structure is a data structure that only performs incremental changes in the amount of data; or,
第二存储单元,用于当解析的数据结构为复杂类型数据结构,则对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据按照预设冗余条件进行过滤,将过滤后的网页访问数据、和/或网页分享数据、和/或网页页面数据按照用户id、浏览id、时间戳进行缓存,生成存储位置信息,所述复杂类型数据结构为对数据形式、数据表达方式进行处理的数据结构。The second storage unit is configured to, when the parsed data structure is a complex type of data structure, perform processing on the webpage access data, and/or the webpage sharing data, and/or the webpage page data according to preset redundancy conditions Filter, cache the filtered web page access data, and/or web page sharing data, and/or web page data according to the user id, browsing id, and timestamp to generate storage location information, and the complex type data structure is a pair of data in the form of , Data structure for data expression processing.
进一步地,所述第二存储单元,具体用于当解析的数据结构为复杂类型数据结构,则按照用户id、浏览id、时间戳判断所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据是否为过期数据;若为非过期数据,则将所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据分别存储至与所述用户id、所述浏览id、所述时间戳对应的缓存位置中,生成存储位置信息。Further, the second storage unit is specifically used for judging the webpage access data and/or the webpage sharing data according to the user id, browsing id and timestamp when the parsed data structure is a complex type data structure, and/or whether the web page data is expired data; if it is non-expired data, the web page access data, and/or the web page sharing data, and/or the web page data are respectively stored in the In the cache location corresponding to the user id, the browsing id, and the timestamp, storage location information is generated.
进一步地,所述装置还包括:Further, the device also includes:
确定模块,用于通过消息队列发送延时消息,读取所述延时消息中的用户访问行为信息及延时时间,将所述延时消息的时间戳与缓存后的用户id、浏览id对应的时间戳进行对比,若缓存后的用户id、浏览id对应的时间戳小于所述延时消息的时间戳,则确定所述用户访问行为信息为冗余数据,若缓存后的用户id、浏览id对应的时间戳大于所述延时消息的时间戳,则缓存所述用户访问行为信息。A determination module, configured to send a delayed message through a message queue, read user access behavior information and delay time in the delayed message, and correspond the timestamp of the delayed message with the cached user id and browsing id If the timestamp corresponding to the cached user id and browsing id is smaller than the timestamp of the delayed message, it is determined that the user access behavior information is redundant data. If the cached user id, browsing id If the timestamp corresponding to the id is greater than the timestamp of the delayed message, the user access behavior information is cached.
进一步地,所述装置还包括:Further, the device also includes:
配置模块,用于配置具有不同存储有效期、不同访问时长的预设个数的容器,所述存储有效期的确定方式为存储有效期=需求的最近时间,所述访问时长的确定方式为访问时长=统计量的精度,所述预设个数的设定方式为预设个数=存储有效期/访问时长。A configuration module, used to configure a preset number of containers with different storage validity periods and different access durations, the storage validity period is determined by the storage validity period=the most recent time required, and the access duration is determined by the access duration=statistics The preset number is set in the following manner: preset number=storage validity period/access duration.
根据本发明的又一方面,提供了一种存储介质,所述存储介质中存储有至少一可执行指令,所述可执行指令使处理器执行如上述用户访问行为信息监控方法对应的操作。According to another aspect of the present invention, a storage medium is provided, wherein the storage medium stores at least one executable instruction, and the executable instruction enables a processor to perform operations corresponding to the foregoing method for monitoring user access behavior information.
根据本发明的再一方面,提供了一种计算机设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;According to yet another aspect of the present invention, a computer device is provided, comprising: a processor, a memory, a communication interface, and a communication bus, and the processor, the memory, and the communication interface can communicate with each other through the communication bus. communication;
所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行上述用户访问行为信息监控方法对应的操作。The memory is used for storing at least one executable instruction, and the executable instruction enables the processor to perform operations corresponding to the above-mentioned method for monitoring user access behavior information.
借由上述技术方案,本发明实施例提供的技术方案至少具有下列优点:With the above technical solutions, the technical solutions provided by the embodiments of the present invention have at least the following advantages:
本发明提供了一种用户访问行为信息监控方法及装置、计算机设备,与现有技术网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控及监控系统体量较大,对数据库及服务器的压力过大相比,本发明实施例通过对网页服务端返回的网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计,将统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据进行分类聚合,分别方法至对应的存储位置中,并生成存储位置信息,根据存储位置信息生成监控日志,实现在进行监控数据查询时通过监控日志中的存储位置信息直接调取需要查询的数据,减少数据库及服务器的存储压力,使得在查询监控数据时,根据监控服务端中存储的数据进行查找,提高对用户访问行为的监控效率。The invention provides a method and device for monitoring user access behavior information, and computer equipment, which is different from the prior art web browsing monitoring only by monitoring data such as visits pv, independent visitors uv and the like, and the monitoring system is relatively large, and the database and Compared with the excessive pressure on the server, the embodiment of the present invention collects statistics on the web page access data, and/or web page sharing data, and/or web page page data returned by the web page server, and collects statistics on the web page access data, and/or Web page sharing data and/or web page data are classified and aggregated, respectively, into corresponding storage locations, and storage location information is generated, and monitoring logs are generated according to the storage location information. The location information directly retrieves the data that needs to be queried, which reduces the storage pressure on the database and the server, so that when querying the monitoring data, it can be searched according to the data stored in the monitoring server, and the monitoring efficiency of the user's access behavior is improved.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solutions of the present invention, in order to be able to understand the technical means of the present invention more clearly, it can be implemented according to the content of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and easy to understand , the following specific embodiments of the present invention are given.
附图说明Description of drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are for the purpose of illustrating preferred embodiments only and are not to be considered limiting of the invention. Also, the same components are denoted by the same reference numerals throughout the drawings. In the attached image:
图1示出了本发明实施例提供的一种用户访问行为信息监控方法流程图;1 shows a flowchart of a method for monitoring user access behavior information provided by an embodiment of the present invention;
图2示出了本发明实施例提供的另一种用户访问行为信息监控方法流程图;2 shows a flowchart of another method for monitoring user access behavior information provided by an embodiment of the present invention;
图3示出了本发明实施例提供的一种用户访问行为信息监控装置组成框图;3 shows a block diagram of a composition of an apparatus for monitoring user access behavior information provided by an embodiment of the present invention;
图4示出了本发明实施例提供的另一种用户访问行为信息监控装置组成框图;FIG. 4 shows a block diagram of the composition of another device for monitoring user access behavior information provided by an embodiment of the present invention;
图5示出了本发明实施例提供的一种计算机设备的结构示意图。FIG. 5 shows a schematic structural diagram of a computer device provided by an embodiment of the present invention.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided so that the present disclosure will be more thoroughly understood, and will fully convey the scope of the present disclosure to those skilled in the art.
本发明实施例提供了一种用户访问行为信息监控方法,如图1所示,该方法包括:An embodiment of the present invention provides a method for monitoring user access behavior information. As shown in FIG. 1 , the method includes:
101、发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据。101. Send a monitoring data acquisition request, and count webpage access data, and/or webpage sharing data, and/or webpage page data carried in the response to the monitoring data acquisition request returned by the webpage server.
对于本发明实施例,用户访问行为信息即为用户通过浏览网页所产生的全部访问行为数据,一般的,用户通过点击网页中的链接实现对网页中信息的访问,用户通过终端向服务端发送网页浏览的请求实现获取网页信息的目的,在此过程中,用户通过浏览网页则会生成访问行为。用户访问行为信息中包括网页访问数据、和/或网页分享数据、和/或网页页面数据,所述网页访问数据包括pv数据、uv数据,如十分钟访问量、每小时访问量、每日访问量、访问总量、各地域访问pv、各设备访问pv等,网页分享数据包括每小时分享量、每日分享量、分享总量、分享去向、传播层级等,传播层级为每分享层级对应的访问pv、uv量,所述网页页面数据包括页面访问深度信息、页面访问时长、页面流失率、页面元素点击等,本发明实施例不做具体限定。For the embodiment of the present invention, the user access behavior information is all the access behavior data generated by the user by browsing the web page. Generally, the user accesses the information in the web page by clicking the link in the web page, and the user sends the web page to the server through the terminal. The browsing request achieves the purpose of obtaining web page information. During this process, the user generates access behavior by browsing the web page. User access behavior information includes web page access data, and/or web page sharing data, and/or web page data, and the web page access data includes pv data, uv data, such as ten-minute visits, hourly visits, daily visits Page sharing data includes hourly sharing volume, daily sharing volume, total sharing volume, sharing destination, dissemination level, etc. The dissemination level corresponds to each sharing level The amount of access pv and uv, the webpage page data includes page access depth information, page access duration, page churn rate, page element clicks, etc., which are not specifically limited in the embodiment of the present invention.
其中,所述网页服务端为接收网页访问请求的服务端,可以为云端、也可以为本地服务端,本发明实施例不做具体限定。本发明实施例中,当前服务端为了对用户访问行为信息进行监控,向网页服务端发送网页访问数据获取请求,根据网页访问数据获取请求响应中携带的具体数据统计网页访问数据、和/或网页分享数据、和/或网页页面数据。所述网页访问数据获取请求由当前服务端发出,请求中携带有具体需要获取的数据,如访问某网址的用户数量等。The webpage server is a server that receives a webpage access request, and may be a cloud or a local server, which is not specifically limited in the embodiment of the present invention. In this embodiment of the present invention, in order to monitor the user access behavior information, the current server sends a webpage access data acquisition request to the webpage server, and counts the webpage access data and/or webpage according to the specific data carried in the response to the webpage access data acquisition request. Share data, and/or web page data. The webpage access data acquisition request is sent by the current server, and the request carries specific data to be acquired, such as the number of users accessing a certain website.
需要说明的是,对网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计可以包括对pv数据、uv数据等数据的计算,对每小时分享量、每日分享量、分享总量、分享去向、传播层级等数据的计算,对页面访问深度信息、页面访问时长、页面流失率、页面元素点击等数据的计算。本发明实施例中的用户访问行为信息存储至服务端中的数据库中,以便直接调取。It should be noted that the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data may include the calculation of pv data, uv data and other data. Calculation of data such as total amount, sharing destination, and dissemination level, and calculation of data such as page access depth information, page access duration, page churn rate, and page element clicks. The user access behavior information in the embodiment of the present invention is stored in the database in the server so as to be directly retrieved.
102、按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息。102. According to the data structure, the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data are respectively classified and aggregated, and the webpage access data and/or webpage sharing data obtained after the classification and aggregation are aggregated. or web page data for storage, and generate storage location information.
其中,所述分类聚合为对网页访问数据、和/或网页分享数据、和/或网页页面数据按照不同的业务需求,进行不同的分类处理,例如按照每10分钟将访问量进行分类聚合、按照市级别将访问量进行分类聚合,本发明实施例不做具体限定。另外,为了便于对分类聚合后的数据进行监控,将分类聚合后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别存储至不同的存储位置中,并根据存储的位置生成网页访问数据、和/或网页分享数据、和/或网页页面数据分别对应的存储位置信息。Wherein, the classification aggregation is to perform different classification processing on webpage access data, and/or webpage sharing data, and/or webpage page data according to different business requirements, for example, classifying and aggregating the number of visits every 10 minutes, according to The city level classifies and aggregates the traffic, which is not specifically limited in this embodiment of the present invention. In addition, in order to facilitate the monitoring of the classified and aggregated data, the classified and aggregated webpage access data, and/or webpage sharing data, and/or webpage page data are respectively stored in different storage locations, and generated according to the storage locations. The storage location information corresponding to the webpage access data, and/or the webpage sharing data, and/or the webpage page data respectively.
103、根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。103. Generate a monitoring log according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
其中,所述监控日志中按照不同的存储位置信息记录对应的网页访问数据、和/或网页分享数据、和/或网页页面数据,例如,1号存储位置存储有2015年1月2日每10分钟统计的用户访问网页a的数据量。Wherein, the monitoring log records corresponding web page access data, and/or web page sharing data, and/or web page data according to different storage location information. The amount of data that users visit webpage a in minutes.
本发明提供了一种用户访问行为信息监控方法,与现有技术网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控及监控系统体量较大,对数据库及服务器的压力过大相比,本发明实施例通过对网页服务端返回的网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计,将统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据进行分类聚合,分别方法至对应的存储位置中,并生成存储位置信息,根据存储位置信息生成监控日志,实现在进行监控数据查询时通过监控日志中的存储位置信息直接调取需要查询的数据,减少数据库及服务器的存储压力,使得在查询监控数据时,根据监控服务端中存储的数据进行查找,提高对用户访问行为的监控效率。The invention provides a method for monitoring user access behavior information, which is different from the prior art web browsing monitoring only by monitoring data such as the number of visits pv, independent visitors uv and the like, and the monitoring system has a large volume, and the pressure on the database and the server is too large. In contrast, the embodiment of the present invention collects statistics on webpage access data, and/or webpage sharing data, and/or webpage page data returned by the webpage server, and collects statistics on webpage access data, and/or webpage sharing data, and / or web page data is classified and aggregated, and the method is divided into the corresponding storage location, and the storage location information is generated, and the monitoring log is generated according to the storage location information, so as to realize the direct retrieval through the storage location information in the monitoring log when querying the monitoring data The data that needs to be queried reduces the storage pressure of the database and the server, so that when querying the monitoring data, it is searched according to the data stored in the monitoring server, and the monitoring efficiency of the user's access behavior is improved.
本发明实施例提供了另一种用户访问行为信息监控方法,如图2所示,该方法包括:An embodiment of the present invention provides another method for monitoring user access behavior information. As shown in FIG. 2 , the method includes:
201a、当检测到页面加载指令时,向所述网页服务端发送网页访问数据获取请求,并统计所述网页服务端返回的网页id、终端设备号、终端设备信息、终端位置信息、阅读来源信息、分享层级信息。201a, when a page loading instruction is detected, send a webpage access data acquisition request to the webpage server, and count the webpage id, terminal device number, terminal device information, terminal location information, and reading source information returned by the webpage server , share level information.
对于本发明实施例,客户端向网页服务端及当前服务端均发送页面加载指令,以使得当前服务端根据用户的访问行为进行监控,当当前服务端检测到客户端中用户发送的页面加载指令时,网页服务端对网页加载指令进行处理,向客户端反馈网页数据,因此,当前客户端向网页服务端发送网页访问数据获取请求,所述网页访问数据获取请求中携带有网页加载信息,从而根据返回的网页加载请求响应统计,统计的对象包括网页id、终端设备号、终端设备信息、终端位置信息、阅读来源信息、分享层级信息。For the embodiment of the present invention, the client sends a page loading instruction to both the web server and the current server, so that the current server monitors the user's access behavior. When the current server detects the page loading instruction sent by the user in the client When the web page server processes the web page loading instruction and feeds back web page data to the client, the current client sends a web page access data acquisition request to the web server, and the web page access data acquisition request carries web page loading information, thereby According to the returned webpage loading request response statistics, the statistical objects include webpage id, terminal device number, terminal device information, terminal location information, reading source information, and sharing level information.
需要说明的是,步骤201a中的统计为按照时间间隔计算网页id的个数、按照浏览的网页计算终端设备号的个数、按照浏览网页的内容计算终端设备信息即终端设备名称的个数、按照浏览的网页计算终端位置信息的个数、按照浏览的网页计算网页内容的来源个数、计算访问网页被分享的层数等,本发明实施例不做具体限定。It should be noted that the statistics in step 201a are calculating the number of web page IDs according to time intervals, calculating the number of terminal equipment numbers according to the webpages browsed, calculating terminal equipment information according to the content of the browsed webpages, that is, the number of terminal equipment names, The number of terminal location information is calculated according to the webpages browsed, the number of sources of webpage content is calculated according to the webpages browsed, the number of layers shared by the visited webpage is calculated, etc., which are not specifically limited in this embodiment of the present invention.
进一步地,与步骤201a并列的步骤201b、当检测到网页分享指令时,向所述网页服务端发送网页分享数据获取请求,统计所述网页服务端返回的网页id、终端设备号、分享去向信息,并对所述网页的分享层级信息进行累计。Further, in step 201b parallel to step 201a, when a webpage sharing instruction is detected, send a webpage sharing data acquisition request to the webpage server, and count the webpage id, terminal device number, and sharing destination information returned by the webpage server. , and accumulate the sharing level information of the webpage.
对于本发明实施例,客户端向网页服务端及当前服务端均发送页面分享指令,以使得当前服务端根据用户的访问行为进行监控,当当前服务端检测到客户端中用户发送的页面分享指令时,网页服务端对网页分享指令进行处理,向客户端反馈网页数据,因此,当前客户端向网页服务端发送网页分享数据获取请求,所述网页分享数据获取请求中携带有网页分享信息,从而根据返回的网页分享请求响应统计,统计的对象包括网页id、终端设备号、分享去向信息,并对所述网页的分享层级信息进行累计。网页的分享层级信息为网页被用户分享的传播层级个数,例如,网页a被用户1分享后,用户2浏览后进行分享,用户3浏览用户2分析的网页后进行分享,计算的层级个数为2级。For the embodiment of the present invention, the client sends a page sharing instruction to both the web server and the current server, so that the current server monitors the user's access behavior. When the current server detects the page sharing instruction sent by the user in the client When the web page server processes the web page sharing instruction and feeds back web page data to the client, the current client sends a web page sharing data acquisition request to the web server, and the web page sharing data acquisition request carries the web page sharing information, thereby According to the returned webpage sharing request response statistics, the statistical objects include webpage ID, terminal device number, sharing destination information, and the sharing level information of the webpage is accumulated. The sharing level information of a web page is the number of communication levels that the web page is shared by users. For example, after web page a is shared by user 1, user 2 browses and shares it, and user 3 browses the web page analyzed by user 2 and shares it. The calculated number of levels to level 2.
需要说明的是,步骤201b中的统计为计算分享网页的网页id个数、计算分享网页的终端设备号个数、计算网页分享的分析去向信息个数。It should be noted that the statistics in step 201b include calculating the number of web page IDs sharing the web page, calculating the number of terminal device numbers sharing the web page, and calculating the number of analysis destination information shared by the web page.
进一步地,与步骤201a并列的步骤201c、当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,统计所述网页服务端返回的网页页面点击信息。Further, in step 201c parallel to step 201a, when a webpage page click instruction is detected, a webpage page data acquisition request is sent to the webpage server, and the webpage page click information returned by the webpage server is counted.
对于本发明实施例,客户端向网页服务端及当前服务端均发送网页页面点击指令,以使得当前服务端根据用户的访问行为进行监控,当当前服务端检测到客户端中用户发送的网页页面点击指令时,网页服务端对网页页面点击指令进行处理,向客户端反馈网页数据,因此,当前客户端向网页服务端发送网页页面数据获取请求,所述网页页面数据获取请求中携带有网页页面信息,从而根据返回的网页页面请求响应统计,统计的对象为网页页面点击信息,如对网页页面各个元素的点击情况。For the embodiment of the present invention, the client sends a webpage page click instruction to both the webpage server and the current server, so that the current server monitors the user's access behavior. When the current server detects the webpage sent by the user in the client When the command is clicked, the web server processes the click command of the web page and feeds back the web page data to the client. Therefore, the current client sends a web page data acquisition request to the web server, and the web page data acquisition request carries the web page. Therefore, according to the returned web page request response statistics, the statistical object is web page click information, such as clicks on various elements of the web page.
对于本发明实施例,步骤201c具体可以为:当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,并接收网页页面数据获取请求响应,所述网页页面数据获取请求响应中携带网页页面元素点击信息、网页页面浏览时间累积信息;根据所述网页页面元素点击信息、所述网页页面浏览时间累积信息统计网页页面点击信息。For this embodiment of the present invention, step 201c may specifically be: when a webpage click instruction is detected, send a webpage data acquisition request to the webpage server, and receive a webpage data acquisition request response, the webpage data acquisition request The response carries webpage element click information and webpage page browsing time accumulation information; webpage page click information is counted according to the webpage element click information and the webpage page browsing time accumulation information.
对于本发明实施例,针对网页页面数据的统计,为当检测到客户端触发网页页面点击指令时,向网页服务端发送网页页面数据获取请求,以便对需求页面信息的收集,包括网页页面元素点击信息、网页页面浏览时间累积信息等。针对不同系统下的应用场景,对网页页面的数据收集方式不同,本发明实施例包括2中不同的统计方法,当为元素触发点击事件,首先判断元素是否已经被点击,是则只需要处理点击次数,否则获取点击元素id、名称、类型、描述,分别存储在对象中,放在对应的页面对象中。其中,ios系统下的交易平台、浏览器当用户进入页面时,判断是否为第一个页面,以及是否为第一次进入此页面,若不是,则记录进入此页面的时间戳,减去上一个页面离开的时间戳,求时间差,累加到上一个网页的停留时间,作为网页页面浏览时间累积信息,若不为第一次则不进行累积。另外,为了对此场景的网页件监控,先获取用户信息,获取信息、存放在全周对象中,每次进入页面或者切换页面时,发送全周的对象数据,当前服务端接收到数据后,若visibilitystate状态为visible时发送请求,获取当前的时间戳,减去上一次发送请求的时间戳,时间差累加在上一个页面的停留时间,作为网页页面浏览时间累积信息。进一步地,通过检测visibilitychange事件,判断visibilitystate状态,为visible时,页面是正在浏览状态,为hidden时,页面处于隐藏,不在浏览状态,确保visibilitychange只执行一次,执行多次会出现页面停留时间,全部累加在第一次隐藏的页面上,执行一次不会出现,以此监控是否已经进入页面,并判断进入页面时在对应的页面中是否累加时间、元素,若是则点击元素并判断是否点击过此元素,若是,则在对应的元素对象中累加点击次数,若没有点击过此元素,则在全局对象对应字段,添加对象,同时获取点击元素需要采集的信息,放置对象中,若没有进入页面时在对应的页面中累加时间、元素,则不做处理,只计算页面停留时间,并存储,而当监测没有进入页面时,在全局对象存储页面字段中添加一个对象,然后判断是否点击页面元素,若没有点击则只处理停留时间,若点击页面元素,则再次判断是否点击过此元素。此外,由于可能会产生跳页浏览的情况,或者返回继续浏览的情况,一定记录当前进入页面id,或者存储是第几项,以便对页面内的信息累加、添加,在轮调发送请求时,记录发送的时间戳,带上全周对象,因为用户不是按正常的程序退出,获取不到数据,这样只有离开的网页有误差,且误差非常小。针对安卓系统下的交易凭条、浏览器,在进入页面事件后,先获取用户信息、活动信息、存放在全局对象中,并记录当前页面的id,判断是否为已经进入的页面,若是则判断是否点击元素,若是,则判断元素是否以及点击过,若是累加点击次数,若没有点击过此元素,在对应全局对象中对应字段新加一个对象,采集点击元素的信息,放在此对象中,若没有点击过元素,则只处理页面停留时间并累加,若没有进入页面,则在全局对象中新加一条数据,存储页面信息,然后判断是否点击元素,若是则判断点击元素是否被点击过,若是,则需要累加点击次数,若没有点击过,则采集点击元素信息存储在对象中,并把对象方在全局对象对应的字段中,若没有点击元素,则只需要赋值页面停留时间。另外,当记录当前页面的id后,判断60秒之内是否有触发事件,如果有用户正在浏览,不做处理,如果没有则默认用户不在继续浏览页面,清除轮训发送接口请求。For the embodiment of the present invention, for the statistics of web page data, when it is detected that the client triggers a web page click instruction, a web page data acquisition request is sent to the web server, so as to collect the required page information, including web page element clicks information, accumulated information of web page browsing time, etc. For application scenarios under different systems, data collection methods for web pages are different. The embodiment of the present invention includes two different statistical methods. When a click event is triggered for an element, it is first determined whether the element has been clicked, and if so, only the click needs to be processed. The number of times, otherwise get the clicked element id, name, type, description, store them in the object respectively, and put them in the corresponding page object. Among them, the trading platform and browser under the ios system judge whether it is the first page and whether it is the first time to enter this page when the user enters the page. If not, record the timestamp of entering this page, minus the above The time stamp of a page leaving, find the time difference, and add it to the stay time of the previous web page as the cumulative information of the page browsing time. If it is not the first time, it will not be accumulated. In addition, in order to monitor the webpage software of this scenario, first obtain user information, obtain the information, and store it in the object of the whole week. Every time you enter the page or switch pages, the object data of the whole week is sent. If the request is sent when the visibility state is visible, the current timestamp is obtained, the timestamp of the last request is subtracted, and the time difference is accumulated in the stay time of the previous page, which is used as the accumulated information of the page browsing time. Further, by detecting the visibilitychange event, determine the visibilitystate state. When it is visible, the page is in the browsing state. When it is hidden, the page is hidden and not in the browsing state. Make sure that the visibilitychange is only executed once, and the page stay time will appear when executed multiple times. It is accumulated on the page that was hidden for the first time, and it will not appear when executed once, so as to monitor whether the page has been entered, and determine whether the time and elements are accumulated in the corresponding page when entering the page, if so, click on the element and determine whether it has been clicked. Element, if so, accumulate the number of clicks in the corresponding element object. If the element has not been clicked, add the object in the corresponding field of the global object, and obtain the information that needs to be collected for the clicked element, and place it in the object. If it does not enter the page If the time and elements are accumulated in the corresponding page, no processing is performed, only the page stay time is calculated and stored, and when the monitoring does not enter the page, an object is added to the global object storage page field, and then it is judged whether to click on the page element, If there is no click, only the dwell time is processed. If the page element is clicked, it is judged again whether the element has been clicked. In addition, due to the possibility of skipping page browsing, or returning to continue browsing, the id of the currently entered page must be recorded, or the number of items stored in order to accumulate and add the information in the page. When sending requests in rotation, Record the time stamp sent, and bring the object of the whole week, because the user does not exit according to the normal procedure and cannot obtain the data, so only the left webpage has errors, and the error is very small. For transaction receipts and browsers under the Android system, after entering the page event, first obtain user information and activity information, store them in the global object, and record the id of the current page to determine whether it is the entered page, and if so, determine whether Whether to click on the element, if so, judge whether the element has been clicked. If the number of clicks is accumulated, if the element has not been clicked, a new object is added to the corresponding field in the corresponding global object, and the information of the clicked element is collected and placed in this object. If the element has not been clicked, only the dwell time of the page is processed and accumulated. If the page is not entered, a new piece of data is added to the global object to store the page information, and then it is judged whether the element is clicked. If so, it is judged whether the clicked element has been clicked. If so, you need to accumulate the number of clicks. If there is no click, collect the click element information and store it in the object, and put the object in the field corresponding to the global object. If there is no click element, you only need to assign the page dwell time. In addition, after recording the id of the current page, it is determined whether there is a trigger event within 60 seconds. If there is a user browsing, no processing is performed. If not, the default user does not continue to browse the page, and the rotation training is cleared to send the interface request.
202、解析统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据的数据结构。202. Analyze the statistics of webpage access data, and/or webpage sharing data, and/or a data structure of webpage page data.
对于本发明实施例,为了便于对大量数据进行处理,解析网页访问数据、和/或网页分享数据、和/或网页页面数据的数据结构。其中,数据结构包括简单类型数据结构、复杂类型数据结构,简单类型数据结构为所述简单类型数据结构为仅进行数据量增量变化的数据结构,即只需进行简单增量的数据结构,例如单层级结构数据,pv、uv、分享总量等,多层级结构数据地域访问量、传播层级等;复杂类型数据结构为进行数据形式、数据表达方式进行处理的数据结构,即为需要进行处理的数据结构,如统计一定时间间隔的访问量、页面访问时长、页面访问深度等。For the embodiments of the present invention, in order to facilitate the processing of a large amount of data, the web page access data, and/or the web page sharing data, and/or the data structure of the web page page data are parsed. Wherein, the data structure includes a simple type data structure and a complex type data structure, and the simple type data structure is that the simple type data structure is a data structure that only needs to perform incremental changes in the amount of data, that is, a data structure that only needs to perform simple incremental changes, for example Single-level structure data, pv, uv, total share, etc., multi-level structure data regional access, dissemination level, etc.; complex type data structure is the data structure for processing data form and data expression, that is, it needs to be processed The data structure, such as statistics of visits at a certain time interval, page visit duration, page visit depth, etc.
需要说明的是,解析方法具体根据获取的到数据是否需要处理进行划分数据结构,本发明实施例不做具体限定。It should be noted that, the parsing method specifically divides the data structure according to whether the acquired data needs to be processed, which is not specifically limited in the embodiment of the present invention.
203a、当解析的数据结构为简单类型数据结构,则按照时限需求对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据聚合存储至不同存储容量的容器中,生成存储位置信息。203a. When the parsed data structure is a simple type data structure, the webpage access data, and/or the webpage sharing data, and/or the webpage page data are aggregated and stored in containers of different storage capacities according to time limit requirements to generate storage location information.
对于本发明实施例,为了准备对简单类型数据结构进行聚合存储,按照时限需求对网页访问数据、和/或网页分享数据、和/或网页页面数据进行聚合存储。所述时限需求可以为每日、每周等,本发明实施例不做具体限定。针对简单类型数据结构的数据进行的聚合为按照不同阶段进行增量计算,并分别存储至不同存储容量的容器中,对网页访问数据、和/或网页分享数据、和/或网页页面数据划分维度后,进行增量即可,例如单层级的数据,如pv总量为增加指定页面的访问量,多层级的数据,如pv每日量为增加指定页面的指定日期的访问量。然后通过不同的存储容器进行存储。For the embodiment of the present invention, in order to prepare for aggregated storage of simple data structures, aggregated storage of webpage access data, and/or webpage sharing data, and/or webpage page data is performed according to time limit requirements. The time limit requirement may be daily, weekly, etc., which is not specifically limited in the embodiment of the present invention. The aggregation of data of simple types of data structures is incremental calculation according to different stages, and stored in containers with different storage capacities respectively, and divides the dimensions of webpage access data, and/or webpage sharing data, and/or webpage page data After that, you can make increments. For example, single-level data, such as the total amount of PV, is to increase the number of visits to the specified page, and multi-level data, such as the daily volume of PV, is to increase the number of visits of the specified page on the specified date. It is then stored through different storage containers.
进一步地,步骤203a之前,本发明实施例还包括:配置具有不同存储有效期、不同访问时长的预设个数的容器,所述存储有效期的确定方式为存储有效期=需求的最近时间,所述访问时长的确定方式为访问时长=统计量的精度,所述预设个数的设定方式为预设个数=存储有效期/访问时长。Further, before step 203a, the embodiment of the present invention further includes: configuring a preset number of containers with different storage validity periods and different access durations. The determination method of the duration is access duration=accuracy of statistics, and the setting method of the preset number is preset number=storage validity period/access duration.
例如,配置20个容器,每个容器代表连续的30秒内的访问量,每个容器有10分钟有效期,有效期到时,会自动清空容器内的访问量,当某h5页面被访问时,根据当前的系统时间、计算出当前对应的容器编号,给对应编号的容器内的访问量+1。For example, configure 20 containers, each container represents the number of visits within 30 consecutive seconds, and each container has a validity period of 10 minutes. When the validity period expires, the traffic in the container will be automatically cleared. When a h5 page is accessed, according to Calculate the current system time, calculate the current corresponding container number, and add +1 to the number of visits in the container with the corresponding number.
需要说明的是,利用存储容器存储的数据,在进行查询时,查询出每个容器中的访问量,进行相加即可,相关公式:容器的有效期=需求的最近时间,如10分钟;容器对应的访问时长=统计量的精度,即时间的偏差值。容器个数=容器的有效期/容器对应的访问时长;当前时间对应的容器编号=(当前时间戳/统计量的精度)/(容器的有效期/统计量的精度)。It should be noted that, using the data stored in the storage container, when querying, the number of visits in each container can be queried and added. The relevant formula is: the validity period of the container = the most recent time required, such as 10 minutes; Corresponding access duration = precision of statistics, that is, the deviation value of time. The number of containers = the validity period of the container/the access duration corresponding to the container; the container number corresponding to the current time = (current timestamp/accuracy of statistics)/(validity period of the container/accuracy of statistics).
例如,编号No.001的h5页面在2018/11/11的浏览人数(uv)统计:a在h5页面加载时,会通知服务器No.001被用户张三访问;b服务器查询缓存数据库中记录,判断张三是否访问过No.001;c如访问过,则不进行uv的记录;d如未访问过,则根据当前服务器日期,如2018/11/11,使缓存数据库中No.001在2018/11/11的uv数加1;e将张三加入缓存数据库中,记录其已访问过No.001;f将No.001在2018/11/11的uv数通过异步的方式保存到数据库中。For example, the number of visitors (uv) statistics of the h5 page No. 001 on 2018/11/11: a When the h5 page is loaded, it will notify the server No. 001 to be accessed by the user Zhang San; b The server queries the records in the cache database, Determine whether Zhang San has visited No.001; c If he has visited, no uv record will be performed; d If he has not visited, then according to the current server date, such as 2018/11/11, make No.001 in the cache database in 2018 /11/11's uv number is incremented by 1; e adds Zhang San to the cache database to record that he has visited No.001; f saves No.001's uv number on 2018/11/11 to the database in an asynchronous manner .
203b、当解析的数据结构为复杂类型数据结构,则对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据按照预设冗余条件进行过滤,将过滤后的网页访问数据、和/或网页分享数据、和/或网页页面数据按照用户id、浏览id、时间戳进行缓存,生成存储位置信息。203b, when the parsed data structure is a complex type data structure, filter the webpage access data, and/or the webpage sharing data, and/or the webpage page data according to preset redundancy conditions, and filter the filtered data. The webpage access data, and/or webpage sharing data, and/or webpage page data are cached according to the user id, browsing id, and timestamp to generate storage location information.
对于本发明实施例,为了避免在进行数据存储时,大量重复的数据造成数据冗余,需要利用预设冗余条件对数据进行过滤,所述预设冗余条件为是否为过期数据,例如,包括在特定时间间隔内的数据是否重复,页面数据是否流失,页面流失的数据为用户点击网页后,网页未加载完毕时,用户离开网页所产生的数据。For the embodiment of the present invention, in order to avoid data redundancy caused by a large number of repeated data during data storage, it is necessary to filter the data by using a preset redundancy condition, and the preset redundancy condition is whether the data is outdated, for example, Including whether the data in a specific time interval is repeated, whether the page data is lost, the data of page loss is the data generated by the user leaving the web page after the user clicks the web page and the web page is not loaded.
对于本发明实施例,步骤203b具体可以为:当解析的数据结构为复杂类型数据结构,则按照用户id、浏览id、时间戳判断所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据是否为过期数据;若为非过期数据,则将所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据分别存储至与所述用户id、所述浏览id、所述时间戳对应的缓存位置中,生成存储位置信息。For this embodiment of the present invention, step 203b may specifically be as follows: when the parsed data structure is a complex type data structure, determine the webpage access data, and/or the webpage sharing data according to the user id, browsing id, and timestamp, and /or whether the web page data is expired data; if it is non-expired data, the web page access data, and/or the web page sharing data, and/or the web page data are respectively stored to the user with the user id, the browsing id, and the cache location corresponding to the timestamp, generate storage location information.
其中,所述过期数据为特定时间内数据出现重复,和/或为页面流失数据,即根据用户id、浏览id、时间戳判断特定时间内网页访问数据、和/或网页分享数据、和/或网页页面数据是否为过期数据,例如,一天中,用户id234432浏览网页时,记录的浏览网页对应的时间戳分别为2.31浏览网页a,2.36浏览网页a,5.06浏览网页v,一天中,浏览网页a的时间戳在5分钟特定的时间内,作为重复数据过滤掉。另外,当将网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储时,存储至于用户id、浏览id、时间戳id分别对应的存储位置中,例如,用户88676浏览网页2335是对应时间戳为4.45-5.36的数据存储至存储位置1中。本发明实施例中的存储位置信息可以为存储位置的名称,也可以为对存储位置进行随机生成的标签等,本发明实施例不做具体限定。Wherein, the expired data refers to the repetition of data within a specific time period, and/or is page loss data, that is, the web page access data and/or web page sharing data within a specific time period are determined according to the user id, browsing id, and timestamp, and/or Whether the web page data is expired data, for example, when user id234432 browses a web page in one day, the recorded time stamps corresponding to the browsed web page are 2.31 browse web page a, 2.36 browse web page a, 5.06 browse web page v, and in one day, browse web page a Timestamps within a specific time period of 5 minutes are filtered out as duplicates. In addition, when webpage access data, and/or webpage sharing data, and/or webpage page data are stored, they are stored in the storage locations corresponding to the user id, browsing id, and timestamp id, for example, user 88676 browses webpage 2335 is the data corresponding to timestamp 4.45-5.36 is stored in storage location 1. The storage location information in the embodiment of the present invention may be the name of the storage location, or may be a label randomly generated for the storage location, etc., which is not specifically limited in the embodiment of the present invention.
204、根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。204. Generate a monitoring log according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
本步骤与图1所示的步骤103方法相同,在此不再赘述。This step is the same as the method of step 103 shown in FIG. 1 , and will not be repeated here.
进一步地,为了避免在监控过程中出现网页延迟加载情况时,向网页服务端发送告警,并反馈给技术人员进行异常处理,本发明实施例还包括:通过消息队列发送延时消息,读取所述延时消息中的用户访问行为信息及延时时间,将所述延时消息的时间戳与缓存后的用户id、浏览id对应的时间戳进行对比,若缓存后的用户id、浏览id对应的时间戳小于所述延时消息的时间戳,则确定所述用户访问行为信息为冗余数据,若缓存后的用户id、浏览id对应的时间戳大于所述延时消息的时间戳,则缓存所述用户访问行为信息。Further, in order to avoid sending an alarm to the webpage server when the webpage is loaded with delay during the monitoring process, and feeding it back to the technician for abnormal processing, the embodiment of the present invention further includes: sending a delayed message through a message queue, reading all the information. User access behavior information and delay time in the delayed message, compare the timestamp of the delayed message with the timestamp corresponding to the cached user id and browsing id, if the cached user id and browsing id correspond to If the timestamp of the cached user id and browsing id are greater than the timestamp of the delayed message, then the user access behavior information is determined to be redundant data. The user access behavior information is cached.
对于本发明实施例,为了避免网页延迟加载对应的数据为冗余数据,将延时消息的时间戳与加载后生成访问数据中的用户id、浏览id的时间戳进行对比,若小于延时消息的时间戳,则确定位冗余数据,以使得对此类数据进行清除,若大于延时消息的时间戳,则为非冗余数据,将对应的户访问行为信息进行缓存。For the embodiment of the present invention, in order to prevent the data corresponding to the delayed loading of the webpage from being redundant data, the time stamp of the delayed message is compared with the time stamp of the user id and the browsing id in the access data generated after loading. If the timestamp is greater than the timestamp of the delayed message, it is non-redundant data, and the corresponding user access behavior information is cached.
进一步地,本发明实施例还可以包括:按照预设时间间隔发送监控信息。对于本发明实施例,为了及时上报监控的用户访问行为信息,按照预设时间间隔发送监控信息,所述监控信息中携带有所述监控日志,以使得监控人员从对应的存储位置中获取需要监测的用户访问行为信息。所述预设时间间隔可以为1天、1周,本发明实施例不做具体限定。Further, this embodiment of the present invention may further include: sending monitoring information according to preset time intervals. For the embodiment of the present invention, in order to report the monitored user access behavior information in time, the monitoring information is sent according to preset time intervals, and the monitoring information carries the monitoring log, so that the monitoring personnel can obtain the monitoring needs from the corresponding storage location. user access behavior information. The preset time interval may be 1 day or 1 week, which is not specifically limited in the embodiment of the present invention.
本发明提供了另一种用户访问行为信息监控方法,本发明实施例通过对网页服务端返回的网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计,将统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据进行分类聚合,分别方法至对应的存储位置中,并生成存储位置信息,根据存储位置信息生成监控日志,实现在进行监控数据查询时通过监控日志中的存储位置信息直接调取需要查询的数据,减少数据库及服务器的存储压力,使得在查询监控数据时,根据监控服务端中存储的数据进行查找,提高对用户访问行为的监控效率。The present invention provides another method for monitoring user access behavior information. In this embodiment of the present invention, statistics are performed on web page access data, and/or web page sharing data, and/or web page data returned by the web server, and the statistics of the web pages are collected. Access data, and/or web page sharing data, and/or web page data are classified and aggregated, respectively, to corresponding storage locations, and storage location information is generated, and monitoring logs are generated according to the storage location information, so as to realize the monitoring data query. Through the storage location information in the monitoring log, the data to be queried can be directly retrieved, reducing the storage pressure on the database and the server, so that when querying the monitoring data, the data stored in the monitoring server can be searched, and the monitoring efficiency of user access behavior can be improved. .
进一步的,作为对上述图1所示方法的实现,本发明实施例提供了一种用户访问行为信息监控装置,如图3所示,该装置包括:统计模块31、存储模块32、生成模块33。Further, as an implementation of the method shown in FIG. 1 , an embodiment of the present invention provides a device for monitoring user access behavior information. As shown in FIG. 3 , the device includes: a statistics module 31 , a storage module 32 , and a generation module 33 .
统计模块31,用于发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;A statistics module 31, configured to send a monitoring data acquisition request, and count web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server;
存储模块32,用于存储按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息;The storage module 32 is used to store the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data according to the data structure, respectively classify and aggregate the webpage access data obtained after classification and aggregation, and/or webpage share data, and/or web page data for storage, and generate storage location information;
生成模块33,用于根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。The generating module 33 is configured to generate a monitoring log according to the webpage access data, and/or the webpage sharing data, and/or the storage location information corresponding to the webpage page data.
本发明提供了一种用户访问行为信息监控装置,与现有技术网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控及监控系统体量较大,对数据库及服务器的压力过大相比,本发明实施例通过对网页服务端返回的网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计,将统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据进行分类聚合,分别方法至对应的存储位置中,并生成存储位置信息,根据存储位置信息生成监控日志,实现在进行监控数据查询时通过监控日志中的存储位置信息直接调取需要查询的数据,减少数据库及服务器的存储压力,使得在查询监控数据时,根据监控服务端中存储的数据进行查找,提高对用户访问行为的监控效率。The present invention provides a user access behavior information monitoring device, which is compared with the prior art web browsing monitoring only by monitoring data such as the number of visits pv, independent visitors uv and the like, and the monitoring system has a large volume and excessive pressure on the database and server In contrast, the embodiment of the present invention collects statistics on webpage access data, and/or webpage sharing data, and/or webpage page data returned by the webpage server, and collects statistics on webpage access data, and/or webpage sharing data, and / or web page data is classified and aggregated, and the method is divided into the corresponding storage location, and the storage location information is generated, and the monitoring log is generated according to the storage location information, so as to realize the direct retrieval through the storage location information in the monitoring log when querying the monitoring data The data that needs to be queried reduces the storage pressure of the database and the server, so that when querying the monitoring data, it is searched according to the data stored in the monitoring server, and the monitoring efficiency of the user's access behavior is improved.
进一步的,作为对上述图2所示方法的实现,本发明实施例提供了另一种用户访问行为信息监控装置,如图4所示,该装置包括:统计模块41、存储模块42、生成模块43、确定模块44、配置模块45。Further, as an implementation of the method shown in FIG. 2, an embodiment of the present invention provides another device for monitoring user access behavior information. As shown in FIG. 4, the device includes: a statistics module 41, a storage module 42, and a generation module 43. A determination module 44 and a configuration module 45.
统计模块41,用于发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;A statistics module 41, configured to send a monitoring data acquisition request, and count web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server;
存储模块42,用于存储按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息;The storage module 42 is used to store the statistics of webpage access data, and/or webpage sharing data, and/or webpage page data according to the data structure, respectively classify and aggregate the webpage access data obtained after the classification and aggregation, and/or webpage share data, and/or web page data for storage, and generate storage location information;
生成模块43,用于根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。The generating module 43 is configured to generate a monitoring log according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
进一步地,所述统计模块41包括:Further, the statistics module 41 includes:
第一统计单元4101,用于当检测到页面加载指令时,向所述网页服务端发送网页访问数据获取请求,并统计所述网页服务端返回的网页id、终端设备号、终端设备信息、终端位置信息、阅读来源信息、分享层级信息;和/或,The first statistics unit 4101 is configured to send a web page access data acquisition request to the web page server when a page loading instruction is detected, and count the web page id, terminal device number, terminal device information, terminal device returned by the web page server location information, reading source information, sharing tier information; and/or,
第二统计单元4102,用于当检测到网页分享指令时,向所述网页服务端发送网页分享数据获取请求,统计所述网页服务端返回的网页id、终端设备号、分享去向信息,并对所述网页的分享层级信息进行累计;和/或,The second statistics unit 4102 is configured to send a web page sharing data acquisition request to the web server when a web page sharing instruction is detected, count the web page id, terminal device number, and sharing destination information returned by the web server, and analyze the The sharing level information of the webpage is accumulated; and/or,
第三统计单元4103,用于当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,统计所述网页服务端返回的网页页面点击信息。The third statistical unit 4103 is configured to send a webpage page data acquisition request to the webpage server when a webpage page click instruction is detected, and count the webpage page click information returned by the webpage server.
进一步地,所述第三统计单元4103,具体用于当检测到网页页面点击指令时,向所述网页服务端发送网页页面数据获取请求,并接收网页页面数据获取请求响应,所述网页页面数据获取请求响应中携带网页页面元素点击信息、网页页面浏览时间累积信息;根据所述网页页面元素点击信息、所述网页页面浏览时间累积信息统计网页页面点击信息。Further, the third statistical unit 4103 is specifically configured to send a webpage page data acquisition request to the webpage server when a webpage page click instruction is detected, and receive a webpage page data acquisition request response, the webpage page data The acquisition request response carries webpage page element click information and webpage page browsing time accumulation information; and the webpage page click information is counted according to the webpage element click information and the webpage page browsing time accumulation information.
进一步地,所述存储模块42包括:Further, the storage module 42 includes:
解析单元4201,用于解析统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据的数据结构;A parsing unit 4201, configured to parse the statistics of webpage access data, and/or webpage sharing data, and/or the data structure of webpage page data;
第一存储单元4202,用于当解析的数据结构为简单类型数据结构,则按照时限需求对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据聚合存储至不同存储容量的容器中,生成存储位置信息,所述简单类型数据结构为仅进行数据量增量变化的数据结构;或,The first storage unit 4202 is used to aggregate and store the web page access data, and/or the web page sharing data, and/or the web page data to In containers of different storage capacities, storage location information is generated, and the simple type data structure is a data structure that only performs incremental changes in the amount of data; or,
第二存储单元4203,用于当解析的数据结构为复杂类型数据结构,则对所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据按照预设冗余条件进行过滤,将过滤后的网页访问数据、和/或网页分享数据、和/或网页页面数据按照用户id、浏览id、时间戳进行缓存,生成存储位置信息,所述复杂类型数据结构为对数据形式、数据表达方式进行处理的数据结构。The second storage unit 4203 is configured to, when the parsed data structure is a complex type data structure, store the webpage access data, and/or the webpage sharing data, and/or the webpage page data according to preset redundancy conditions Perform filtering, cache the filtered web page access data, and/or web page sharing data, and/or web page data according to the user id, browsing id, and timestamp, and generate storage location information. The data structure that is processed by the form and data representation.
进一步地,所述第二存储单元4203,具体用于当解析的数据结构为复杂类型数据结构,则按照用户id、浏览id、时间戳判断所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据是否为过期数据;若为非过期数据,则将所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据分别存储至与所述用户id、所述浏览id、所述时间戳对应的缓存位置中,生成存储位置信息。Further, the second storage unit 4203 is specifically used to judge the webpage access data and/or the webpage sharing data according to the user id, browsing id and timestamp when the parsed data structure is a complex type data structure. , and/or whether the web page data is expired data; if it is non-expired data, the web page access data, and/or the web page sharing data, and/or the web page data are respectively stored in the In the cache location corresponding to the user id, the browsing id, and the timestamp, storage location information is generated.
进一步地,所述装置还包括:Further, the device also includes:
确定模块44,用于通过消息队列发送延时消息,读取所述延时消息中的用户访问行为信息及延时时间,将所述延时消息的时间戳与缓存后的用户id、浏览id对应的时间戳进行对比,若缓存后的用户id、浏览id对应的时间戳小于所述延时消息的时间戳,则确定所述用户访问行为信息为冗余数据,若缓存后的用户id、浏览id对应的时间戳大于所述延时消息的时间戳,则缓存所述用户访问行为信息。The determining module 44 is configured to send the delayed message through the message queue, read the user access behavior information and the delay time in the delayed message, and compare the time stamp of the delayed message with the cached user id and browsing id The corresponding timestamps are compared. If the timestamps corresponding to the cached user id and browsing id are smaller than the timestamp of the delayed message, it is determined that the user access behavior information is redundant data. If the timestamp corresponding to the browsing id is greater than the timestamp of the delayed message, the user access behavior information is cached.
进一步地,所述装置还包括:Further, the device also includes:
配置模块45,用于配置具有不同存储有效期、不同访问时长的预设个数的容器,所述存储有效期的确定方式为存储有效期=需求的最近时间,所述访问时长的确定方式为访问时长=统计量的精度,所述预设个数的设定方式为预设个数=存储有效期/访问时长。The configuration module 45 is used to configure a preset number of containers with different storage validity periods and different access durations. The storage validity period is determined by the storage validity period=the most recent time required, and the access duration is determined by the access duration= The precision of the statistics, the preset number is set in the following manner: preset number = storage validity period/access duration.
本发明提供了另一种用户访问行为信息监控装置,与现有技术网页浏览监控仅仅通过对访问量pv,独立访客uv等数据进行监控及监控系统体量较大,对数据库及服务器的压力过大相比,本发明实施例通过对网页服务端返回的网页访问数据、和/或网页分享数据、和/或网页页面数据进行统计,将统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据进行分类聚合,分别方法至对应的存储位置中,并生成存储位置信息,根据存储位置信息生成监控日志,实现在进行监控数据查询时通过监控日志中的存储位置信息直接调取需要查询的数据,减少数据库及服务器的存储压力,使得在查询监控数据时,根据监控服务端中存储的数据进行查找,提高对用户访问行为的监控效率。The present invention provides another user access behavior information monitoring device, which is different from the prior art web browsing monitoring only by monitoring data such as visits pv, independent visitors uv and the like, and the monitoring system is relatively large, and the pressure on the database and the server is excessive. In contrast, in this embodiment of the present invention, statistics are performed on the webpage access data, and/or webpage sharing data, and/or webpage page data returned by the webpage server, and the collected webpage access data, and/or webpage sharing data, and/or web page data are classified and aggregated, respectively, to the corresponding storage location, and the storage location information is generated, and the monitoring log is generated according to the storage location information. Get the data that needs to be queried, reduce the storage pressure of the database and the server, so that when querying the monitoring data, search according to the data stored in the monitoring server, and improve the monitoring efficiency of user access behavior.
根据本发明一个实施例提供了一种存储介质,所述存储介质存储有至少一可执行指令,该计算机可执行指令可执行上述任意方法实施例中的用户访问行为信息监控方法。According to an embodiment of the present invention, a storage medium is provided, and the storage medium stores at least one executable instruction, and the computer-executable instruction can execute the method for monitoring user access behavior information in any of the foregoing method embodiments.
图5示出了根据本发明一个实施例提供的一种计算机设备的结构示意图,本发明具体实施例并不对计算机设备的具体实现做限定。FIG. 5 shows a schematic structural diagram of a computer device provided according to an embodiment of the present invention. The specific embodiment of the present invention does not limit the specific implementation of the computer device.
如图5所示,该计算机设备可以包括:处理器(processor)502、通信接口(Communications Interface)504、存储器(memory)506、以及通信总线508。As shown in FIG. 5 , the computer device may include: a processor (processor) 502 , a communications interface (Communications Interface) 504 , a memory (memory) 506 , and a communication bus 508 .
其中:处理器502、通信接口504、以及存储器506通过通信总线508完成相互间的通信。The processor 502 , the communication interface 504 , and the memory 506 communicate with each other through the communication bus 508 .
通信接口504,用于与其它设备比如客户端或其它服务器等的网元通信。The communication interface 504 is used to communicate with network elements of other devices such as clients or other servers.
处理器502,用于执行程序510,具体可以执行上述用户访问行为信息监控方法实施例中的相关步骤。The processor 502 is configured to execute the program 510, and may specifically execute the relevant steps in the foregoing embodiments of the method for monitoring user access behavior information.
具体地,程序510可以包括程序代码,该程序代码包括计算机操作指令。Specifically, the program 510 may include program code including computer operation instructions.
处理器502可能是中央处理器CPU,或者是特定集成电路ASIC(ApplicationSpecific Integrated Circuit),或者是被配置成实施本发明实施例的一个或多个集成电路。计算机设备包括的一个或多个处理器,可以是同一类型的处理器,如一个或多个CPU;也可以是不同类型的处理器,如一个或多个CPU以及一个或多个ASIC。The processor 502 may be a central processing unit (CPU), or an application specific integrated circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention. The one or more processors included in the computer equipment may be the same type of processors, such as one or more CPUs; or may be different types of processors, such as one or more CPUs and one or more ASICs.
存储器506,用于存放程序510。存储器506可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。The memory 506 is used to store the program 510 . Memory 506 may include high-speed RAM memory, and may also include non-volatile memory, such as at least one disk memory.
程序510具体可以用于使得处理器502执行以下操作:The program 510 can specifically be used to cause the processor 502 to perform the following operations:
发送监控数据获取请求,统计网页服务端返回的监控数据获取请求响应携带的网页访问数据、和/或网页分享数据、和/或网页页面数据;Send a monitoring data acquisition request, and count the web page access data, and/or web page sharing data, and/or web page data carried in the response to the monitoring data acquisition request returned by the web server;
按照数据结构对统计后的网页访问数据、和/或网页分享数据、和/或网页页面数据分别进行分类聚合,对分类聚合后得到的网页访问数据、和/或网页分享数据、和/或网页页面数据进行存储,并生成存储位置信息;According to the data structure, the statistics of web page access data, and/or web page sharing data, and/or web page data are respectively classified and aggregated, and the web page access data, and/or web page sharing data, and/or web pages obtained after the classification and aggregation are aggregated. Store page data and generate storage location information;
根据所述网页访问数据、和/或所述网页分享数据、和/或所述网页页面数据对应的存储位置信息生成监控日志。The monitoring log is generated according to the webpage access data, and/or the webpage sharing data, and/or storage location information corresponding to the webpage page data.
显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that the above-mentioned modules or steps of the present invention can be implemented by a general-purpose computing device, which can be centralized on a single computing device, or distributed in a network composed of multiple computing devices Alternatively, they may be implemented in program code executable by a computing device, such that they may be stored in a storage device and executed by the computing device, and in some cases, in a different order than here The steps shown or described are performed either by fabricating them separately into individual integrated circuit modules, or by fabricating multiple modules or steps of them into a single integrated circuit module. As such, the present invention is not limited to any particular combination of hardware and software.
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包括在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included within the protection scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910469619.XA CN110377854A (en) | 2019-05-31 | 2019-05-31 | User access activity information monitoring method and device, computer equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910469619.XA CN110377854A (en) | 2019-05-31 | 2019-05-31 | User access activity information monitoring method and device, computer equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110377854A true CN110377854A (en) | 2019-10-25 |
Family
ID=68249589
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910469619.XA Pending CN110377854A (en) | 2019-05-31 | 2019-05-31 | User access activity information monitoring method and device, computer equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110377854A (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111404960A (en) * | 2020-03-26 | 2020-07-10 | 军事科学院系统工程研究院网络信息研究所 | Attribute extraction method applied to heaven-earth integrated network access control system |
CN111813623A (en) * | 2020-06-29 | 2020-10-23 | 北京百度网讯科技有限公司 | A page monitoring method, device, electronic device and storage medium |
CN112241501A (en) * | 2020-09-17 | 2021-01-19 | 上海连尚网络科技有限公司 | A method and device for sharing information |
CN112286946A (en) * | 2020-09-25 | 2021-01-29 | 长沙市到家悠享网络科技有限公司 | Data processing method, server and medium |
CN112434001A (en) * | 2020-11-24 | 2021-03-02 | 杭州未名信科科技有限公司 | Multi-data-source configurable MongoDB data synchronization system supporting data preprocessing, electronic equipment and medium |
CN113010831A (en) * | 2021-04-06 | 2021-06-22 | 金宝贝网络科技(苏州)有限公司 | Thermodynamic diagram optimization method, thermodynamic diagram optimization system and storage medium |
CN113779374A (en) * | 2021-02-24 | 2021-12-10 | 北京京东振世信息技术有限公司 | Page query management method and device |
CN114510608A (en) * | 2022-01-13 | 2022-05-17 | 深圳市聚点数码科技有限公司 | Chain relation propagation tracing method and device |
CN115794587A (en) * | 2022-12-22 | 2023-03-14 | 上海哔哩哔哩科技有限公司 | Web page management method, platform, electronic device and storage medium |
CN116132248A (en) * | 2022-10-11 | 2023-05-16 | 马上消费金融股份有限公司 | Information pushing method and computer readable storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1791022A (en) * | 2005-12-26 | 2006-06-21 | 阿里巴巴公司 | Log analyzing method and system |
US20090112795A1 (en) * | 2007-10-30 | 2009-04-30 | Oracle International Corp. | Query statistics |
WO2013068530A2 (en) * | 2011-11-10 | 2013-05-16 | Koninklijke Kpn N.V. | Logically and end-user-specific physically storing an electronic file |
CN105260447A (en) * | 2015-10-09 | 2016-01-20 | 上海瀚之友信息技术服务有限公司 | Webpage data analysis method and system |
CN109474653A (en) * | 2018-02-09 | 2019-03-15 | 上海共启网络科技有限公司 | A kind of shopping website guest access monitoring statisticss and outer push system and method for standing |
-
2019
- 2019-05-31 CN CN201910469619.XA patent/CN110377854A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1791022A (en) * | 2005-12-26 | 2006-06-21 | 阿里巴巴公司 | Log analyzing method and system |
US20090112795A1 (en) * | 2007-10-30 | 2009-04-30 | Oracle International Corp. | Query statistics |
WO2013068530A2 (en) * | 2011-11-10 | 2013-05-16 | Koninklijke Kpn N.V. | Logically and end-user-specific physically storing an electronic file |
CN105260447A (en) * | 2015-10-09 | 2016-01-20 | 上海瀚之友信息技术服务有限公司 | Webpage data analysis method and system |
CN109474653A (en) * | 2018-02-09 | 2019-03-15 | 上海共启网络科技有限公司 | A kind of shopping website guest access monitoring statisticss and outer push system and method for standing |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111404960A (en) * | 2020-03-26 | 2020-07-10 | 军事科学院系统工程研究院网络信息研究所 | Attribute extraction method applied to heaven-earth integrated network access control system |
CN111813623A (en) * | 2020-06-29 | 2020-10-23 | 北京百度网讯科技有限公司 | A page monitoring method, device, electronic device and storage medium |
CN111813623B (en) * | 2020-06-29 | 2024-02-27 | 北京百度网讯科技有限公司 | Page monitoring method and device, electronic equipment and storage medium |
CN112241501A (en) * | 2020-09-17 | 2021-01-19 | 上海连尚网络科技有限公司 | A method and device for sharing information |
CN112286946A (en) * | 2020-09-25 | 2021-01-29 | 长沙市到家悠享网络科技有限公司 | Data processing method, server and medium |
CN112286946B (en) * | 2020-09-25 | 2024-01-12 | 长沙市到家悠享网络科技有限公司 | Data processing method, server and medium |
CN112434001B (en) * | 2020-11-24 | 2023-07-04 | 杭州未名信科科技有限公司 | Multiple data sources supporting data preprocessing Configurable MongoDB data synchronization system, electronic equipment and media |
CN112434001A (en) * | 2020-11-24 | 2021-03-02 | 杭州未名信科科技有限公司 | Multi-data-source configurable MongoDB data synchronization system supporting data preprocessing, electronic equipment and medium |
CN113779374A (en) * | 2021-02-24 | 2021-12-10 | 北京京东振世信息技术有限公司 | Page query management method and device |
CN113779374B (en) * | 2021-02-24 | 2023-09-26 | 北京京东振世信息技术有限公司 | Page query management method and device |
CN113010831A (en) * | 2021-04-06 | 2021-06-22 | 金宝贝网络科技(苏州)有限公司 | Thermodynamic diagram optimization method, thermodynamic diagram optimization system and storage medium |
CN114510608A (en) * | 2022-01-13 | 2022-05-17 | 深圳市聚点数码科技有限公司 | Chain relation propagation tracing method and device |
CN116132248A (en) * | 2022-10-11 | 2023-05-16 | 马上消费金融股份有限公司 | Information pushing method and computer readable storage medium |
CN116132248B (en) * | 2022-10-11 | 2024-05-24 | 马上消费金融股份有限公司 | Information pushing method and computer readable storage medium |
CN115794587A (en) * | 2022-12-22 | 2023-03-14 | 上海哔哩哔哩科技有限公司 | Web page management method, platform, electronic device and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110377854A (en) | User access activity information monitoring method and device, computer equipment | |
US10296435B2 (en) | Storage of mass data for monitoring | |
US8601112B1 (en) | Method and system for collecting and analyzing time-series data | |
US20100153431A1 (en) | Alert triggered statistics collections | |
CN101902505B (en) | A real-time statistical device and method for distributed DNS query logs | |
US10587707B2 (en) | Method and apparatus for monitoring website access data | |
CN107092544B (en) | Monitoring method and device | |
CN103718171B (en) | Optimize WEB with user's history to crawl | |
US20150248446A1 (en) | Method and system for collecting and analyzing time-series data | |
CN107707414A (en) | The monitoring system and method for CDN | |
US20170060769A1 (en) | Systems, devices and methods for generating locality-indicative data representations of data streams, and compressions thereof | |
US8775556B1 (en) | Automated segmentation and processing of web site traffic data over a rolling window of time | |
CN112751726B (en) | Data processing method and device, electronic equipment and storage medium | |
CN112671602B (en) | Data processing method, device, system, equipment and storage medium of edge node | |
CN103207882A (en) | Shop visiting data processing method and system | |
CN108509313A (en) | A kind of business monitoring method, platform and storage medium | |
CN101252462A (en) | Alarming page furbishing method as well as server and client end | |
CN106874319A (en) | The distributed statistical method and device of click volume | |
CN103916463B (en) | Network access statistical analysis method and system | |
CN112131198B (en) | Log analysis method and device and electronic equipment | |
CN110932935A (en) | Resource control method, apparatus, device and computer storage medium | |
CN112311901B (en) | Access quantity statistical method and system | |
CN101894119B (en) | Mass data storage system for monitoring | |
US8032797B1 (en) | Storage of mass data for monitoring | |
CN117221178A (en) | Traffic data processing method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191025 |
|
RJ01 | Rejection of invention patent application after publication |