[go: up one dir, main page]

CN108009215B - A kind of search results pages user behavior pattern assessment method, apparatus and system - Google Patents

A kind of search results pages user behavior pattern assessment method, apparatus and system Download PDF

Info

Publication number
CN108009215B
CN108009215B CN201711144282.2A CN201711144282A CN108009215B CN 108009215 B CN108009215 B CN 108009215B CN 201711144282 A CN201711144282 A CN 201711144282A CN 108009215 B CN108009215 B CN 108009215B
Authority
CN
China
Prior art keywords
user
browsing
behavior
page
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711144282.2A
Other languages
Chinese (zh)
Other versions
CN108009215A (en
Inventor
王红
胡晓红
周莹
于晓梅
房有丽
狄瑞彤
孟广婷
刘海燕
宋永强
王露潼
王倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Education Equipment Center Co ltd
Original Assignee
Shandong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Normal University filed Critical Shandong Normal University
Priority to CN201711144282.2A priority Critical patent/CN108009215B/en
Publication of CN108009215A publication Critical patent/CN108009215A/en
Application granted granted Critical
Publication of CN108009215B publication Critical patent/CN108009215B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种搜索结果页用户行为模式测评方法、装置及系统,该方法包括:接收用户信息,根据用户信息中认知风格进行用户类型划分;接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;可视化处理测试行为数据并进行初步定性分析;分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。通过多种类型的测试行为数据,分析搜索引擎结果页内在不同类型布局和不同类型用户下常见的用户行为模式,为优化页面布局,改善链接投放效果具有重要贡献。

The invention discloses a method, device and system for evaluating user behavior patterns on a search result page. The method includes: receiving user information, and classifying user types according to the cognitive style in the user information; receiving at least two user information in the search engine result page. different types of test behavior data; visually process test behavior data and conduct preliminary qualitative analysis; mine different types of users and different types of user behavior patterns under page layout for analysis, and evaluate user behavior patterns on search results pages. Through various types of test behavior data, the analysis of common user behavior patterns in search engine results pages under different types of layouts and different types of users has made important contributions to optimizing page layouts and improving link delivery effects.

Description

一种搜索结果页用户行为模式测评方法、装置及系统A method, device and system for evaluating user behavior patterns on search result pages

技术领域technical field

本发明属于搜索引擎结果页优化的技术领域,涉及一种搜索结果页用户行为模式测评方法、装置及系统,尤其是涉及一种基于多模特征的搜索结果页用户行为模式测评方法、装置及系统。The invention belongs to the technical field of search engine result page optimization, and relates to a search result page user behavior pattern evaluation method, device and system, in particular to a search result page user behavior pattern evaluation method, device and system based on multimodal features .

背景技术Background technique

信息时代,互联网已渗入到各方各面,不仅表现为人们生活方式的改变,也使得企业营销策略等发生了巨大变化。人们获取信息的方式不断增多,企业传播信息的方式也不断增多,而其中最为重要的系统就是搜索引擎(Search Engine)。搜索引擎是用户在有某种需求时通过搜索引擎的搜索框发出搜索任务,经由一定的策略和程序从网络上搜集信息,通过处理和组织后,按照一定的规则将相关信息以搜索引擎结果页(SERP:Search EngineResults Page)的固定模式展示给用户的应用系统。并且在搜索引擎结果页中常伴有搜索内容相关的商业推广,是互联网最为普遍的营销方式之一。由此可见,搜索引擎对人们的生活具有重要影响意义,合理的分析用户在搜索引擎结果页内的行为模式可以了解用户的日常行为习惯,更好的进行信息处理与商业推广的页面布局和排版设计。因此探索用户行为模式,不仅具有重要的研究价值,商业价值,也对系统设计人员具有重要指导意义。In the information age, the Internet has penetrated into all aspects, not only manifested as changes in people's lifestyles, but also brought about tremendous changes in corporate marketing strategies. There are more and more ways for people to obtain information, and the ways for enterprises to disseminate information are also increasing, and the most important system is the search engine (Search Engine). A search engine is a user who sends a search task through the search box of the search engine when there is a certain need, collects information from the Internet through certain strategies and procedures, and after processing and organizing, sends relevant information to the search engine result page according to certain rules. (SERP:Search EngineResults Page) The fixed mode is displayed to the user's application system. Moreover, commercial promotion related to search content is often accompanied by search engine result pages, which is one of the most common marketing methods on the Internet. It can be seen that search engines have an important impact on people's lives. Reasonable analysis of user behavior patterns in search engine results pages can help understand users' daily behaviors and habits, and better page layout and typesetting for information processing and business promotion design. Therefore, exploring user behavior patterns not only has important research value and commercial value, but also has important guiding significance for system designers.

现阶段,针对搜索引擎结果页内用户行为的研究已有很多,研究人员大多通过挖掘用户的网络行为日志,或者是通过记录用户的光标信息的方式获取用户行为信息,然后进行相关分析、处理及推荐。At this stage, there are many studies on user behavior in search engine results pages. Most researchers obtain user behavior information by mining users' network behavior logs or recording user cursor information, and then perform relevant analysis, processing and recommend.

但是,目前针对搜索引擎结果页内用户行为的研究主要存在以下缺点:However, the current research on user behavior in search engine results pages mainly has the following shortcomings:

(1)采用的信息源大多为主观性信息,不够准确;(1) Most of the information sources used are subjective information, which is not accurate enough;

(2)虽然部分研究人员意识到上下文的作用,却甚少分析搜索结果页内不同区域间的相互作用关系,甚少分析不同类型用户在不同布局下常用的行为模式。(2) Although some researchers are aware of the role of context, they seldom analyze the interaction relationship between different regions in the search results page, and seldom analyze the common behavior patterns of different types of users under different layouts.

综上所述,现有技术中针对搜索引擎结果页内如何科学、准确地测评用户行为模式,优化页面布局以提高用户体验的问题,尚缺乏行之有效的解决方案。To sum up, in the prior art, there is still no effective solution to the problem of how to scientifically and accurately evaluate user behavior patterns in search engine result pages and optimize page layout to improve user experience.

发明内容Contents of the invention

针对现有技术中存在的不足,解决现有技术针对搜索引擎结果页内如何科学、准确地测评用户行为模式,优化页面布局以提高用户体验的问题,本发明提供了一种搜索结果页用户行为模式测评方法、装置及系统,具体为一种基于多模特征的搜索结果页用户行为模式测评方法、装置及系统,通过多种类型的测试行为数据,分析搜索引擎结果页内在不同类型布局和不同类型用户下常见的用户行为模式,为优化页面布局,改善链接投放效果具有重要贡献。Aiming at the deficiencies in the prior art and solving the problems in the prior art on how to scientifically and accurately evaluate the user behavior pattern in the search engine result page and optimize the page layout to improve the user experience, the present invention provides a search result page user behavior Mode evaluation method, device and system, specifically a method, device and system for user behavior mode evaluation of search result pages based on multi-mode features, through various types of test behavior data, analysis of different types of layouts and different types of search engine result pages. It is a common user behavior pattern under different types of users, which makes an important contribution to optimizing the page layout and improving the effect of link delivery.

本发明的第一目的是提供一种搜索结果页用户行为模式测评方法。The first object of the present invention is to provide a method for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种搜索结果页用户行为模式测评方法,该方法包括:A method for evaluating user behavior patterns on a search result page, the method comprising:

接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information;

接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages;

可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis;

分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior patterns of different types of users and different types of users under the page layout are respectively excavated for analysis, and the user behavior patterns of search result pages are evaluated.

作为进一步的优选方案,所述接收用户信息前,根据视觉情况,剔除不适合眼动采集的用户,在剩余用户中随机选取若干用户进行搜索结果页用户行为模式测评。As a further preferred solution, before receiving user information, users who are not suitable for eye movement collection are eliminated according to visual conditions, and a number of users are randomly selected from the remaining users to evaluate user behavior patterns on the search result page.

作为进一步的优选方案,所述用户信息包括用户基本信息和用户认知信息;As a further preferred solution, the user information includes basic user information and user cognitive information;

所述用户基本信息包括用户的姓名、性别、年龄和职业;The basic user information includes the user's name, gender, age and occupation;

所述用户认知信息采用镶嵌图形测试法获取,包括认知风格。The user cognition information is obtained by mosaic graph test method, including cognition style.

作为进一步的优选方案,所述测试行为数据为引擎结果页内用户的浏览过程中所产生的信息源数据,包括但不限于光标数据和眼动数据;As a further preferred solution, the test behavior data is the information source data generated during the user's browsing process in the engine result page, including but not limited to cursor data and eye movement data;

所述光标数据为用户浏览搜索引擎结果页过程中获取的输入设备触发的光标事件;The cursor data is a cursor event triggered by an input device obtained during the user's browsing of the search engine result page;

所述眼动数据为用户浏览搜索引擎结果页过程中获取的眼动追踪信息。The eye-movement data is eye-tracking information acquired during a user's browsing of a search engine result page.

作为进一步的优选方案,所述可视化处理测试行为数据并进行初步定性分析的具体步骤包括:As a further preferred solution, the specific steps of visually processing test behavior data and performing preliminary qualitative analysis include:

将光标数据和眼动数据进行数据可视化;Data visualization of cursor data and eye movement data;

初步定性分析页面布局对所述眼动数据的影响。Preliminary qualitative analysis of the impact of page layout on the eye movement data.

作为进一步的优选方案,所述分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析的具体步骤包括:As a further preferred solution, the specific steps for analyzing different user types and different types of behavior patterns of users under different user types and page layouts include:

采用与测评行为数据类型数量相同的频繁模式挖掘法,Using the same frequent pattern mining method as the number of evaluation behavior data types,

挖掘不同用户类型、页面布局下的光标行为模式,根据光标数据,分析搜索结果页内不同链接与用户类型、页面布局间光标行为的关联模式,和不同用户类型、页面布局下各链接内光标行为参数的组合模式;Mining the cursor behavior patterns under different user types and page layouts, based on the cursor data, analyzing the correlation patterns between different links in the search results page and the cursor behaviors between user types and page layouts, as well as the cursor behaviors in links under different user types and page layouts Combination mode of parameters;

以及挖掘不同用户类型、页面布局下的眼动浏览模式,根据眼动数据得到用户浏览搜索引擎结果页过程中时序型信息,分析用户浏览搜索引擎结果页过程中各链接间的顺序关系。And mining the eye-movement browsing patterns under different user types and page layouts, according to the eye-movement data, the time-series information in the process of users browsing the search engine result pages is obtained, and the sequence relationship between the links in the process of users browsing the search engine result pages is analyzed.

作为进一步的优选方案,采用频繁项集挖掘算法挖掘不同用户类型、页面布局下的光标行为模式;As a further preferred solution, the frequent itemset mining algorithm is used to mine the cursor behavior patterns under different user types and page layouts;

采用定向频繁浏览模式挖掘算法挖掘不同用户类型、页面布局下的眼动浏览模式,所述定向频繁浏览模式挖掘算法用于挖掘不同页面布局下用户定向定长的眼动浏览模式,得到用户行为进程中的时序型信息。Using the directional frequent browsing pattern mining algorithm to mine eye movement browsing patterns under different user types and page layouts, the directional frequent browsing pattern mining algorithm is used to mine user directional and fixed-length eye movement browsing patterns under different page layouts to obtain user behavior processes Time series information in .

作为进一步的优选方案,所述定向频繁浏览模式挖掘法包括:As a further preferred solution, the directional frequent browsing pattern mining method includes:

根据所述测试行为数据得到用户在搜索引擎结果页的浏览序列数据;Obtain the browsing sequence data of the user on the search engine result page according to the test behavior data;

在浏览序列数据中添加采纳长度和其对应的支持度属性,并初始化;Add the adoption length and its corresponding support attribute in the browsing sequence data, and initialize it;

处理每个浏览序列的支持度,使其为零,得到新序列;Process the support of each browsing sequence to make it zero and get a new sequence;

判断新序列是否为频繁序列,输出频繁序列。Determine whether the new sequence is a frequent sequence, and output the frequent sequence.

作为进一步的优选方案,在该方法中,预设链接区域范围,由所述测试行为数据中提取出用户进入每条链接区域范围内的时间。As a further preferred solution, in this method, the range of the link area is preset, and the time when the user enters the range of each link area is extracted from the test behavior data.

作为进一步的优选方案,所述根据所述测试行为数据得到用户在搜索引擎结果页的浏览序列的具体步骤包括:As a further preferred solution, the specific steps of obtaining the browsing sequence of the user on the search engine result page according to the test behavior data include:

根据页面布局对所述测试行为数据进行粗分类;Roughly classifying the test behavior data according to the page layout;

根据用户进入每条链接区域范围内的时间的先后顺序进行排列,每个用户查看每个网页都对应一条浏览序列数据。Arrange according to the order of time when users enter the area of each link, and each user views each web page corresponding to a piece of browsing sequence data.

作为进一步的优选方案,所述处理每个浏览序列的支持度使其为零的具体步骤包括:As a further preferred solution, the specific steps of processing the support of each browsing sequence to make it zero include:

预设支持度阈值;Preset support threshold;

计算浏览序列数据首元素的支持度,将小于支持度阈值的首元素支持度置零并剔除该序列;Calculate the support of the first element of the browsing sequence data, set the support of the first element less than the support threshold to zero and eliminate the sequence;

对浏览序列数据按照首元素值进行排序后分类,创建与之相对应的队列,将序列按类别进入不同队列,并删除每个序列首元素;Sort and classify the browsing sequence data according to the value of the first element, create a queue corresponding to it, enter the sequence into different queues by category, and delete the first element of each sequence;

更新序列的采纳长度和其对应的支持度属性,直至每个序列中的元素都以支持度等于0结束为止。Update the adopted length of the sequence and its corresponding support attribute until the elements in each sequence end with support equal to 0.

作为进一步的优选方案,判断新序列是否为频繁序列的具体步骤为:As a further preferred solution, the specific steps for judging whether a new sequence is a frequent sequence are:

计算新序列的得分,所述得分为该序列的采纳长度与对应支持度的乘积;Calculating the score of the new sequence, the score being the product of the adopted length of the sequence and the corresponding support degree;

将得分进行排序,确定最大得分;Sort the scores to determine the maximum score;

最大得分对应的新序列为频繁浏览模式序列,否则,判定新序列为非频繁浏览模式序列。The new sequence corresponding to the maximum score is a frequent browsing pattern sequence; otherwise, it is determined that the new sequence is a non-frequent browsing pattern sequence.

本发明的第二目的是提供一种计算机可读存储介质。A second object of the present invention is to provide a computer-readable storage medium.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种计算机可读存储介质,其中存储有多条指令,所述指令适于由终端设备设备的处理器加载并执行以下处理:A computer-readable storage medium, in which a plurality of instructions are stored, and the instructions are adapted to be loaded by a processor of a terminal device and perform the following processing:

接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information;

接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages;

可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis;

分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior patterns of different types of users and different types of users under the page layout are respectively excavated for analysis, and the user behavior patterns of search result pages are evaluated.

本发明的第三目的是提供一种搜索结果页用户行为模式测评装置。The third object of the present invention is to provide a device for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种搜索结果页用户行为模式测评装置,采用互联网终端设备,包括处理器和计算机可读存储介质,处理器用于实现各指令;计算机可读存储介质用于存储多条指令,所述指令适于由处理器加载并执行以下处理:A device for evaluating user behavior patterns on a search result page, using Internet terminal equipment, including a processor and a computer-readable storage medium, the processor is used to implement instructions; the computer-readable storage medium is used to store multiple instructions, and the instructions are suitable for Loaded by the processor and performs the following processing:

接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information;

接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages;

可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis;

分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior patterns of different types of users and different types of users under the page layout are respectively excavated for analysis, and the user behavior patterns of search result pages are evaluated.

本发明的第四目的是提供一种搜索结果页用户行为模式测评系统。The fourth object of the present invention is to provide a system for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种搜索结果页用户行为模式测评系统,该系统基于上述一种搜索结果页用户行为模式测评方法,包括:用户信息采集装置、行为数据采集装置和行为模式测评装置;A search result page user behavior pattern assessment system, the system is based on the above search result page user behavior pattern assessment method, comprising: a user information collection device, a behavior data collection device and a behavior pattern assessment device;

所述用户信息采集装置,用于采集用户信息,并发送至行为模式测评装置;The user information collection device is used to collect user information and send it to the behavior pattern evaluation device;

所述行为数据采集装置,用于采集用户在搜索引擎结果页内的至少两种不同类型的测试行为数据,并发送至行为模式测评装置;The behavior data collection device is used to collect at least two different types of test behavior data of the user in the search engine result page, and send it to the behavior pattern evaluation device;

所述行为模式测评装置,用于接收用户信息,根据用户信息中认知风格进行用户类型划分;接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;可视化处理测试行为数据并进行初步定性分析;分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior pattern evaluation device is used to receive user information, classify user types according to the cognitive style in the user information; receive at least two different types of test behavior data of the user in the search engine result page; visually process the test behavior data and Carry out preliminary qualitative analysis; excavate different user types and different types of user behavior patterns under the page layout for analysis, and evaluate the user behavior patterns of the search results page.

本发明的有益效果:Beneficial effects of the present invention:

(1)本发明所述的一种搜索结果页用户行为模式测评方法、装置及系统,,获取用户在搜索引擎结果页内的至少两种不同类型的测试行为数据,例如,用户的眼动数据和光标数据,通过搜索引擎结果页内用户的浏览过程中所产生的至少两种信息源特征进行频繁行为模式挖掘,分析页面布局、用户类型与搜索引擎结果页以及各项链接间的组合模式,以及时序关系,本发明对改善搜索引擎结果页内链接布局方式,个性化推荐信息,以及广告投放效果具有重要意义。(1) A search result page user behavior pattern evaluation method, device and system according to the present invention, obtain at least two different types of test behavior data of the user in the search engine result page, for example, the user's eye movement data and cursor data, mining frequent behavior patterns through at least two information source characteristics generated during the user’s browsing process in the search engine result page, and analyzing the combination mode between page layout, user type, search engine result page and various links, As well as the timing relationship, the present invention is of great significance for improving the link layout mode in the search engine result page, personalized recommendation information, and advertisement delivery effect.

(2)本发明所述的一种搜索结果页用户行为模式测评方法、装置及系统,集搜索引擎结果页内的多种信息源的行为信息,提出一种新颖的、定向定长的频繁浏览模式挖掘算法挖掘用户浏览过程中的频繁时序模式,并利用频繁项集挖掘算法挖掘用户的光标行为模式,为精确分析各种布局与用户类型下链接间的关系,优化页面布局,提升用户体验提供重要依据。(2) A search result page user behavior pattern evaluation method, device and system according to the present invention collects behavior information of various information sources in the search engine result page, and proposes a novel, directional and fixed-length frequent browsing The pattern mining algorithm mines the frequent sequential patterns in the user's browsing process, and uses the frequent item set mining algorithm to mine the user's cursor behavior pattern, in order to accurately analyze the relationship between various layouts and links under user types, optimize the page layout, and improve user experience. Important reference.

附图说明Description of drawings

构成本申请的一部分的说明书附图用来提供对本申请的进一步理解,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。The accompanying drawings constituting a part of the present application are used to provide further understanding of the present application, and the schematic embodiments and descriptions of the present application are used to explain the present application, and do not constitute improper limitations to the present application.

图1为本发明中的方法流程图;Fig. 1 is the method flowchart among the present invention;

图2是本发明行为数据采集实验中搜索引擎结果页示例图;Fig. 2 is an example figure of the search engine result page in the behavioral data collection experiment of the present invention;

图3是本发明行为数据采集实验中搜索结果页区兴趣域划分图:Fig. 3 is a domain-of-interest division diagram of the search result page area in the behavioral data collection experiment of the present invention:

图4是本发明眼动行为扫视路径图;Fig. 4 is the saccade path diagram of the eye movement behavior of the present invention;

图5是本发明眼动行为热区图;Fig. 5 is the heat map of the eye movement behavior of the present invention;

图6是本发明眼动行为关键绩效指标图;Fig. 6 is a key performance indicator diagram of the eye movement behavior of the present invention;

图7是本发明认知因素作用下各链接点击行为关联规则图;Fig. 7 is an association rule diagram of clicking behaviors of each link under the action of cognitive factors in the present invention;

图8是本发明页面布局因素作用下各链接点击行为关联规则图;Fig. 8 is an association rule diagram of each link click behavior under the action of page layout factors in the present invention;

图9是本发明中的系统结构示意图。Fig. 9 is a schematic diagram of the system structure in the present invention.

具体实施方式:Detailed ways:

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

应该指出,以下详细说明都是例示性的,旨在对本申请提供进一步的说明。除非另有指明,本实施例使用的所有技术和科学术语具有与本申请所属技术领域的普通技术人员通常理解的相同含义。It should be pointed out that the following detailed description is exemplary and intended to provide further explanation to the present application. Unless otherwise specified, all technical and scientific terms used in this embodiment have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

需要注意的是,这里所使用的术语仅是为了描述具体实施方式,而非意图限制根据本申请的示例性实施方式。如在这里所使用的,除非上下文另外明确指出,否则单数形式也意图包括复数形式,此外,还应当理解的是,当在本说明书中使用术语“包含”和/或“包括”时,其指明存在特征、步骤、操作、器件、组件和/或它们的组合。It should be noted that the terminology used here is only for describing specific implementations, and is not intended to limit the exemplary implementations according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural, and it should also be understood that when the terms "comprising" and/or "comprising" are used in this specification, they mean There are features, steps, operations, means, components and/or combinations thereof.

针对现有技术中存在的不足,解决现有技术针对搜索引擎结果页内如何科学、准确地测评用户行为模式,优化页面布局以提高用户体验的问题,本发明提供了一种搜索结果页用户行为模式测评方法、装置及系统,通过多种类型的测试行为数据,分析搜索引擎结果页内在不同类型布局和不同类型用户下常见的用户行为模式,为优化页面布局,改善链接投放效果具有重要贡献。Aiming at the deficiencies in the prior art and solving the problems in the prior art on how to scientifically and accurately evaluate the user behavior pattern in the search engine result page and optimize the page layout to improve the user experience, the present invention provides a search result page user behavior The mode evaluation method, device and system, through various types of test behavior data, analyze common user behavior modes in different types of layouts and different types of users in search engine results pages, making important contributions to optimizing page layouts and improving link delivery effects.

在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。下面结合附图与实施例对本发明作进一步说明。In the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

实施例1:Example 1:

本实施例1的目的是提供一种搜索结果页用户行为模式测评方法。The purpose of Embodiment 1 is to provide a method for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

如图1所示,As shown in Figure 1,

一种搜索结果页用户行为模式测评方法,该方法包括:A method for evaluating user behavior patterns on a search result page, the method comprising:

步骤(1):接收用户信息,根据用户信息中认知风格进行用户类型划分;Step (1): Receive user information, and classify user types according to the cognitive style in the user information;

步骤(2):接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;Step (2): receiving at least two different types of test behavior data of the user in the search engine result page;

步骤(3):可视化处理测试行为数据并进行初步定性分析;Step (3): visually process test behavior data and conduct preliminary qualitative analysis;

步骤(4):分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。Step (4): Mining and analyzing different user types and user behavior patterns under different page layouts, and evaluating user behavior patterns on the search result page.

需要注意的是,在步骤(1)中所述接收用户信息前,根据视觉情况,剔除不适合眼动采集的用户,在剩余用户中随机选取若干用户进行搜索结果页用户行为模式测评。实验的被试用户为进行视觉情况筛选后的随机人群,裸眼或矫正视力为1.0以上,无影响眼动追踪实验进行数据采集工作的视觉疾病或障碍。It should be noted that before receiving user information in step (1), users who are not suitable for eye movement collection are eliminated according to visual conditions, and a number of users are randomly selected from the remaining users to evaluate user behavior patterns on the search results page. The test users in the experiment are random people who have been screened for visual conditions, with uncorrected or corrected vision of 1.0 or more, and no visual diseases or obstacles that affect the data collection work of the eye tracking experiment.

在本实施例中,共招募了63名被试用户,男女比例1:1.2,用户年龄在18-21岁之间,平均年龄19.7岁。为了使得测试结果贴近实际情况,选取不同专业的学生。除此之外,所有被试人员都要经过视觉情况筛选,剔除裸眼视力或矫正视力小于1.0,以及色盲、色弱等影响实验眼动数据收集的被试。In this embodiment, a total of 63 test users were recruited, the male to female ratio was 1:1.2, and the users were between 18 and 21 years old, with an average age of 19.7 years. In order to make the test results close to the actual situation, students of different majors are selected. In addition, all subjects were screened for visual conditions, and those with uncorrected or corrected vision less than 1.0, as well as those with color blindness and color weakness that affected the collection of experimental eye movement data were excluded.

在步骤(1)中,所述用户信息包括用户基本信息和用户认知信息;In step (1), the user information includes user basic information and user cognitive information;

所述用户基本信息包括用户的姓名、性别、年龄和职业;通过填写表格的方式采集用户基本信息;The basic user information includes the user's name, gender, age and occupation; the basic user information is collected by filling in a form;

所述用户认知信息采用镶嵌图形测试法获取,包括认知风格。根据用户信息中认知风格进行用户类型划分。The user cognition information is obtained by mosaic graph test method, including cognition style. Classify user types according to the cognitive style in user information.

在本实施例中,搜索任务的类别随机选取,尽可能覆盖不同类型,比如:电视机、游乐园等。搜索引擎选取中文百度搜索引擎。考虑到网络不确定因素的影响,因此爬取与搜索任务对应的搜索引擎结果页,然后对搜索结果页进行纯净化,即只保留十条结果链接以及右侧相关推荐。然后通过人工添加的方式在结果链接列表的上方、下方及右侧添加广告链接。本发明搜索结果页示例图2所示。按照链接划分成12个兴趣区域(十条结果链接所在兴趣区域1-10,广告链接所在兴趣区域AD,右侧相关推荐所在兴趣区域R)。本发明搜索结果页兴趣区域划分图如图3所示。其中L1、L2及L3分别表示广告位于结果链接列表上方、下方和右侧的商业推广布局情况。In this embodiment, the category of the search task is randomly selected, covering different types as much as possible, such as: TV, amusement park, etc. The search engine selects the Chinese Baidu search engine. Considering the impact of network uncertainties, the search engine result pages corresponding to the search tasks are crawled, and then the search result pages are purified, that is, only ten result links and relevant recommendations on the right are kept. Then add advertising links on the top, bottom and right side of the result link list by manually adding. An example of the search result page of the present invention is shown in FIG. 2 . According to the link, it is divided into 12 interest areas (interest area 1-10 where the ten result links are located, interest area AD where the advertisement link is located, and interest area R where the relevant recommendation is located on the right). The division diagram of interest areas on the search result page of the present invention is shown in FIG. 3 . Among them, L1, L2 and L3 represent the commercial promotion layout situation where the advertisement is located above, below and on the right side of the result link list respectively.

在步骤(2)中,所述测试行为数据为引擎结果页内用户的浏览过程中所产生的信息源数据,包括但不限于光标数据和眼动数据;在本实施例中,采集光标数据和眼动数据这两种信息源的行为信息。In step (2), the test behavior data is the information source data generated during the browsing process of the user in the engine result page, including but not limited to cursor data and eye movement data; in this embodiment, the cursor data and eye movement data are collected. Behavioral information from these two sources of information is eye-tracking data.

所述光标数据为用户浏览搜索引擎结果页过程中获取的输入设备触发的光标事件;在本实施例中,通过在搜索引擎结果页内嵌入JavaScript代码获取用户浏览搜索引擎结果页过程中所产生的光标数据。The cursor data is a cursor event triggered by an input device obtained during the user's browsing of the search engine result page; Cursor data.

需要注意的是,在本实施例中,输入设备可以针对搜索结果页一个元素或多个元素,触发一系列光标事件,包括但不限于,例如,光标左键单击、光标右键单击、光标滚轮滚动或光标移动控制光标移动等的光标事件,或者再例如,回车确认事件、菜单弹出事件或方向键滚动事件等键盘事件,或者再例如,触摸板滑动控制光标移动、触摸板左键单击或触摸板右键单击等触摸板事件,或者再例如,对于触摸式终端装置,触摸屏滑动控制页面移动或触摸屏单击等触摸屏事件。It should be noted that in this embodiment, the input device can trigger a series of cursor events for one element or multiple elements of the search result page, including but not limited to, for example, cursor left click, cursor right click, cursor Cursor events such as wheel scrolling or cursor movement to control cursor movement, or keyboard events such as enter confirmation events, menu pop-up events, or direction key scrolling events, or for example, touchpad sliding to control cursor movement, touchpad left button single touchpad events such as clicking or right-clicking on the touchpad, or, for example, for a touch-type terminal device, touchscreen events such as sliding to control page movement or touchscreen clicking.

所述眼动数据为用户浏览搜索引擎结果页过程中获取的眼动追踪信息。在本实施例中,运用眼动追踪方式获取用户浏览搜索引擎结果页过程中所产生的眼动数据。The eye-movement data is eye-tracking information acquired during a user's browsing of a search engine result page. In this embodiment, the eye movement data generated during the process of browsing the search engine result page by the user is acquired by means of eye movement tracking.

在本实施例中,眼动追踪装置为德国普升科技有限公司研发的SMI RED(Version2.5)眼动仪,选取的采样频率为120Hz。需要注意的是,使用该装置时,被试要求将头部固定在桌子边缘的U型支架内,实验过程中不可随意移动头部。正式实验前,对用户进行两次眼动校准工作,两次均达标才可继续进行实验。In this embodiment, the eye tracking device is the SMI RED (Version 2.5) eye tracker developed by Pusheng Technology Co., Ltd. of Germany, and the selected sampling frequency is 120 Hz. It should be noted that when using this device, the subjects were required to fix their heads in the U-shaped bracket on the edge of the table, and they were not allowed to move their heads arbitrarily during the experiment. Before the formal experiment, the user has to perform eye movement calibration work twice, and the experiment can only be continued if the standards are met twice.

在本实施例中,眼动行为的记录和初步定性分析过程使用眼动仪自带的IViewX,Experiment Center和BeGaze,频繁模式挖掘使用Matlab R2014a。In this embodiment, IViewX, Experiment Center and BeGaze that come with the eye tracker are used for the recording of eye movement behavior and preliminary qualitative analysis, and Matlab R2014a is used for frequent pattern mining.

在本实施例中,采集的所述测试行为数据具体步骤为:In this embodiment, the specific steps of collecting the test behavior data are:

步骤(2-1)实验前准备:需要向用户介绍实验的基本步骤,硬件设备的使用规则,实验期间的具体注意事项以及眼部校准工作;Step (2-1) Pre-experiment preparation: It is necessary to introduce the basic steps of the experiment, the rules of using hardware equipment, specific precautions during the experiment and eye calibration to the user;

步骤(2-2)采集眼动数据:在用户浏览搜索引擎结果页期间按照眼动追踪装置的使用规则要求被试,获取浏览过程中产生的眼动数据;Step (2-2) collecting eye movement data: during the user's browsing of the search engine result page, the subject is required to obtain the eye movement data generated during the browsing process in accordance with the rules of use of the eye tracking device;

步骤(2-3)采集光标行为数据:在用户需要浏览的搜索引擎结果页内嵌入相应到的Javascript代码,记录用户浏览过程产生的光标行为数据;Step (2-3) collecting cursor behavior data: embed the corresponding Javascript code in the search engine result page that the user needs to browse, and record the cursor behavior data generated during the user's browsing process;

在本实施例中,采集用户行为数据的场景设计为:In this embodiment, the scenario design for collecting user behavior data is:

为使实验结果更符合实际情况,本发明设计的测评场景是:给定用户一定量的搜索任务,被试通过屏幕显示了解搜索任务简介,理解信息后按动空格键开始搜索结果页浏览,浏览过程与实际生活一样,可以进行光标的滚动、滑动与点击。每个搜索结果页浏览结束后关闭浏览器,下一个搜索任务简介自动弹出。运用眼动追踪方式获取用户在浏览网页过程中无意识的注意信息,通过网页中嵌入的JavaScript代码获取用户的光标信息。采用6(搜索任务)×3(页面布局)×2(认知风格)的设计,为了防止用户疲劳,这里定义的搜索任务6项为最大值,用户可以根据自身情况随时停止。In order to make the experimental results more in line with the actual situation, the evaluation scene designed by the present invention is: given a certain amount of search tasks for the user, the subject understands the brief introduction of the search task through the screen display, presses the space bar after understanding the information to start browsing the search result page, browses The process is the same as real life, you can scroll, slide and click the cursor. Close the browser after browsing each search result page, and the next search task profile will pop up automatically. Use eye-tracking method to obtain the unconscious attention information of the user in the process of browsing the webpage, and obtain the user's cursor information through the JavaScript code embedded in the webpage. A design of 6 (search tasks)×3 (page layout)×2 (cognitive style) is adopted. In order to prevent user fatigue, the maximum number of search tasks defined here is 6, and users can stop at any time according to their own conditions.

在本实施例中,共计获得了319条用户浏览搜索结果页的眼动和光标行为信息,数据汇总如表1所示。In this embodiment, a total of 319 pieces of eye movement and cursor behavior information of users browsing the search result page are obtained, and the data summary is shown in Table 1.

表1Table 1

在步骤(3)中,所述可视化处理测试行为数据并进行初步定性分析的具体步骤包括:In step (3), the concrete steps of said visual processing test behavior data and carrying out preliminary qualitative analysis include:

步骤(3-1):将光标数据和眼动数据进行数据可视化;Step (3-1): Data visualization of cursor data and eye movement data;

步骤(3-2):初步定性分析页面布局对所述眼动数据的影响。Step (3-2): Preliminary qualitative analysis of the impact of the page layout on the eye movement data.

在本实施例中,眼动数据采用BeGaze眼动分析软件进行数据可视化,如图4所示的扫视路径图、图5所示的热区图,图6所示的关键绩效指标图。光标数据采用Excel表格展示,表2所示为点击行为纪要表,通过这些可视化数据图进行初步定性分析。可以发现用户对于搜索引擎结果页内页面上方注视较多,商业推广对用户的吸引力较强,但右侧广告对用户的吸引力相比上方和下方的商业推广较小。In this embodiment, the eye movement data is visualized using BeGaze eye movement analysis software, such as the saccade path diagram shown in FIG. 4 , the heat zone diagram shown in FIG. 5 , and the key performance indicator diagram shown in FIG. 6 . Cursor data is displayed in an Excel table, and Table 2 shows a summary table of click behaviors. Preliminary qualitative analysis is performed through these visual data graphs. It can be found that users pay more attention to the top of the search engine results page, and the commercial promotion is more attractive to users, but the advertisement on the right is less attractive to users than the commercial promotion at the top and bottom.

表2Table 2

11 L1L1 DependenceDependence 55 99 1010 22 L1L1 DependenceDependence 11 22 33 66 77 33 L1L1 IndependenceIndependence 22 55 1010 44 L1L1 IndependenceIndependence 22 44 66 88 RR 55 L1L1 DependenceDependence 11 33 44 99 ADAD 66 L1L1 IndependenceIndependence 22 55 77 99 77 L1L1 IndependenceIndependence 11 33 44 88 L1L1 DependenceDependence 22 99 ADAD 99 L1L1 DependenceDependence 11 33 44 77 1010 L2L2 DependenceDependence 33 44 66 77 1111 L2L2 DependenceDependence 77 88 1212 L2L2 DependenceDependence 22 66 RR 1313 L2L2 IndependenceIndependence 22 33 ADAD 1414 L2L2 IndependenceIndependence 33 44 66 88 99 ADAD 1515 L2L2 DependenceDependence 44 ADAD 1616 L2L2 IndependenceIndependence 22 33 ADAD 1717 L2L2 DependenceDependence 22 44 1818 L2L2 IndependenceIndependence 22 55 99 1919 L3L3 DependenceDependence 44 55 66 77 88 99 1010 2020 L3L3 IndependenceIndependence 11 22 33 44 77 88 1010 21twenty one L3L3 IndependenceIndependence 11 44 55 22twenty two L3L3 DependenceDependence 22 33 22 23twenty three L3L3 DependenceDependence 11 55 99 24twenty four L3L3 DependenceDependence 11 55 2525 L3L3 IndependenceIndependence 11 22 55 77 RR 2626 L3L3 IndependenceIndependence 44 99

在步骤(4)中,所述分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析的具体步骤包括:In step (4), the specific steps for analyzing the different types of behavior patterns of users under different user types and page layouts are analyzed, including:

采用与测评行为数据类型数量相同的频繁模式挖掘法,Using the same frequent pattern mining method as the number of evaluation behavior data types,

挖掘不同用户类型、页面布局下的光标行为模式,根据光标数据,分析搜索结果页内不同链接与用户类型、页面布局间光标行为的关联模式,和不同用户类型、页面布局下各链接内光标行为参数的组合模式;Mining the cursor behavior patterns under different user types and page layouts, based on the cursor data, analyzing the correlation patterns between different links in the search results page and the cursor behaviors between user types and page layouts, as well as the cursor behaviors in links under different user types and page layouts Combination mode of parameters;

以及挖掘不同用户类型、页面布局下的眼动浏览模式,根据眼动数据得到用户浏览搜索引擎结果页过程中时序型信息,分析用户浏览搜索引擎结果页过程中各链接间的顺序关系。And mining the eye-movement browsing patterns under different user types and page layouts, according to the eye-movement data, the time-series information in the process of users browsing the search engine result pages is obtained, and the sequence relationship between the links in the process of users browsing the search engine result pages is analyzed.

在本实施例中,采用频繁项集挖掘算法挖掘不同用户类型、页面布局下的光标行为模式;In this embodiment, the frequent itemset mining algorithm is used to mine cursor behavior patterns under different user types and page layouts;

采用定向频繁浏览模式挖掘算法挖掘不同用户类型、页面布局下的眼动浏览模式,所述定向频繁浏览模式挖掘算法用于挖掘不同页面布局下用户定向定长的眼动浏览模式,得到用户行为进程中的时序型信息。Using the directional frequent browsing pattern mining algorithm to mine eye movement browsing patterns under different user types and page layouts, the directional frequent browsing pattern mining algorithm is used to mine user directional and fixed-length eye movement browsing patterns under different page layouts to obtain user behavior processes Time series information in .

(4-1)光标行为模式分析:使用频繁模式挖掘算法分析搜索结果页内不同链接(包括十条结果链接、广告链接以及相关推荐)与两种因素(用户类型即用户认知方式、页面布局)间光标行为的关联模式,了解与两种因素紧密相关的链接及组合模式,建立认知方式、页面布局与各条链接间的关系;(4-1) Cursor behavior pattern analysis: use frequent pattern mining algorithm to analyze different links in the search results page (including ten result links, advertising links and related recommendations) and two factors (user type, namely user cognitive style, page layout) The correlation mode of the cursor behavior among the two factors, understand the link and combination mode closely related to the two factors, and establish the relationship between the cognitive mode, page layout and each link;

针对用户认知方式和页面布局两种因素分别展开讨论,光标行为参数较多,这里仅以具有代表性的点击特征为例:The two factors of user cognition and page layout are discussed separately. There are many cursor behavior parameters. Here we only take representative click characteristics as an example:

用户类型影响分析(即用户认知影响分析):根据点击行为数据,按照用户认知方式对数据进行分类,挖掘每类中的点击特征的频繁项集,分析用户认知风格和搜索引擎结果页内各链接之间的关联规则,了解不同认知下各链接内点击行为参数的组合模式。User type impact analysis (i.e. user cognition impact analysis): According to the click behavior data, classify the data according to the user cognition mode, mine the frequent item sets of click features in each category, and analyze the user cognition style and search engine result pages The association rules between the links in the website, and the combination mode of the click behavior parameters in the links under different cognitions.

页面布局影响分析:根据点击行为数据,按照页面布局即广告位置对数据进行分类,挖掘每类中的点击特征的频繁项集,分析页面布局和搜索引擎结果页内各链接之间的关联规则,了解不同位置下各链接内点击行为参数的组合模式;Page layout impact analysis: According to the click behavior data, classify the data according to the page layout, that is, the advertisement position, mine the frequent item sets of the click characteristics in each category, and analyze the page layout and the association rules between the links in the search engine result page, Understand the combination mode of click behavior parameters in each link under different positions;

在本实施例中,采用多种频繁模式挖掘的经典方法分析搜索引擎结果页内不同兴趣区域的点击行为和认知风格、广告位置间的频繁项集与关联规则。由于不同算法下的结果大同小异,因此只给出最经典的频繁模式挖掘Apriori算法下的运行结果。图7和图8分别表示计算出的搜索引擎结果页上各个兴趣区域的点击行为与不同认知风格、广告位置的关联规则(minSup=0.03,minConf=0.4)。为了方便展示,本发明将关联规则中项目之间的连接可视化为一个有向图。首先,需要生成规则前因和后项的邻接矩阵,该矩阵为稀疏矩阵,矩阵里的元素值为前因和后项间的关联度。然后,使关联规则中的前因作为有向图的前驱,关联规则中的后项作为有向图中相应前因的后件。以图8中加粗连接线为例,表示{7}{2}→{Independence},{7}→{Independence},{7}{2}→{Dependence}都是满足阈值的关联规则。In this embodiment, multiple classic methods of frequent pattern mining are used to analyze the click behavior and cognitive style of different interest areas in the search engine result page, frequent item sets and association rules between advertisement positions. Since the results of different algorithms are similar, only the running results of the most classic frequent pattern mining Apriori algorithm are given. Figure 7 and Figure 8 respectively show the calculated association rules (minSup=0.03, minConf=0.4) between the click behavior of each interest area on the search engine result page and different cognitive styles and advertisement positions. For the convenience of presentation, the present invention visualizes the connection between items in the association rule as a directed graph. First, it is necessary to generate an adjacency matrix of regular antecedents and subsequent items. This matrix is a sparse matrix, and the values of the elements in the matrix are the correlation degrees between antecedents and subsequent items. Then, let the antecedents in the association rules be the predecessors of the directed graph, and the latter items in the association rules be the consequents of the corresponding antecedents in the directed graph. Taking the bold connection line in Figure 8 as an example, it means that {7}{2}→{Independence}, {7}→{Independence}, {7}{2}→{Dependence} are all association rules that satisfy the threshold.

分析用户认知因素作用下,场独立型认知风格点击行为的频繁项集中项的维度要明显大于场依赖型认知风格。这说明场独立型用户在浏览网页时,主观能动性较强,不拘泥于一定范围,整体意识性强;场依赖型用户在此过程中有相对固定的浏览区域,模式也相对固定。Under the influence of user cognition factors, the dimensions of the items in the frequent itemset of field-independent cognitive style click behavior are significantly larger than those of field-dependent cognitive style. This shows that field-independent users have a strong subjective initiative when browsing the web, are not limited to a certain range, and have a strong overall awareness; field-dependent users have relatively fixed browsing areas and patterns in the process.

分析页面布局因素作用下,可以看出广告位于L1和L3位置时频繁项集中项的维度要明显大于广告位于L2。L1处的布局格式影响到的项最多,对广告的影响力也较大。L2处的布局格式影响到的项最少,但对广告的影响力较大。这与步骤(3)初步定性分析的结论相呼应。Analyzing the effects of page layout factors, it can be seen that the dimensions of items in the frequent itemset are significantly larger when the advertisement is located in L1 and L3 than when the advertisement is located in L2. The layout format at L1 affects the most items and has a greater influence on the advertisement. The layout format at L2 affects the least items, but has a greater influence on the ad. This echoes the conclusion of the preliminary qualitative analysis in step (3).

(4-2)眼动浏览模式分析:本实施例提出了定向频繁浏览模式挖掘算法DFBP(Directional Frequent Browsing Patterns),用于挖掘不同页面布局下用户定向定长的浏览模式,了解用户行为进程中的时序型信息,结合实际情况分析用户浏览过程中各链接间的顺序关系;(4-2) Analysis of eye-movement browsing patterns: This embodiment proposes a directional frequent browsing pattern mining algorithm DFBP (Directional Frequent Browsing Patterns), which is used to mine user-directed and fixed-length browsing patterns under different page layouts, and understand user behavior in the process. Time-series information, combined with the actual situation to analyze the sequence relationship between the links in the user's browsing process;

在所述步骤(4-2)中,眼动浏览模式分析的DFBP算法具体步骤为:In said step (4-2), the specific steps of the DFBP algorithm of eye movement browsing pattern analysis are:

步骤(4-2-1):依照广告位置对采集到的数据进行粗分类,每个类均根据采集到的用户首次进入每个兴趣区域内时间的先后顺序进行排列,每个用户查看每个网页都对应一条浏览模式序列QiStep (4-2-1): Roughly classify the collected data according to the location of the advertisement. Each category is arranged according to the order of the time when the collected users first entered each area of interest. Each user views each Each web page corresponds to a browsing mode sequence Q i ;

例如:广告位置,包括:网页的顶部、网页的中部或网页的底部;For example: ad placement, including: the top of the web page, the middle of the web page, or the bottom of the web page;

每个用户查看每个网页都对应一条浏览模式序列:Each page viewed by a user corresponds to a sequence of browsing patterns:

例如,打开百度,输入检索词,对应检索词有10条检索链接,编号分别是1-10,对于广告区域,定义为AD,检索词相关区域,定义为R;For example, open Baidu, enter the search term, there are 10 search links corresponding to the search term, and the numbers are 1-10 respectively. For the advertisement area, it is defined as AD, and the area related to the search term is defined as R;

假设,用户的眼睛浏览顺序分别是AD、1、2、3、12、4、5、6、8、7、9、10;那么该用户的浏览模式序列就是:11→1→2→3→12→4→5→6→8→7→9→10;Assume that the browsing order of the user's eyes is AD, 1, 2, 3, 12, 4, 5, 6, 8, 7, 9, 10; then the user's browsing pattern sequence is: 11→1→2→3→ 12→4→5→6→8→7→9→10;

假设,用户的眼睛浏览顺序分别是1、2、3、AD、4、5、6、7、8、9、10、R;那么该用户的浏览模式序列就是:1→2→3→AD→4→5→6→7→8→9→10→R;Assume that the browsing order of the user's eyes is 1, 2, 3, AD, 4, 5, 6, 7, 8, 9, 10, R; then the user's browsing mode sequence is: 1→2→3→AD→ 4→5→6→7→8→9→10→R;

假设,用户的眼睛浏览顺序分别是AD、2、3、4、5;那么该用户的浏览模式序列就是:AD→2→3→4→5;Assume that the browsing order of the user's eyes is AD, 2, 3, 4, 5; then the user's browsing pattern sequence is: AD→2→3→4→5;

步骤(4-2-2):为步骤(4-2-1)得到的所有数据添加三个属性并初始化:被采纳长度L=(l1,l2,…,lp),支持度序列S=(s1,s2,…,sp),l1=0,s1=0,支持度阈值s;Step (4-2-2): Add three attributes to all the data obtained in step (4-2-1) and initialize: adopted length L=(l 1 ,l 2 ,…,l p ), support sequence S=(s 1 ,s 2 ,…,s p ), l 1 =0, s 1 =0, support threshold s;

元素的支持度,举例:Element support, for example:

元素的支持度记为元素的频繁度,当一个数据集内有300条数据,首元素为11的有30条,此时首元素的支持度30;首元素一致的前提下,第二个元素为1的有10条,此时第二个元素的支持度为10;前两个元素一致的前提下,第三个元素为2的有7条,此时第三个元素的支持度为7,以此类推。The support degree of an element is recorded as the frequency of the element. When there are 300 pieces of data in a data set, there are 30 pieces of data whose first element is 11. At this time, the support degree of the first element is 30; on the premise that the first element is consistent, the second element There are 10 items for 1, and the support degree of the second element is 10 at this time; on the premise that the first two elements are consistent, there are 7 items for the third element of 2, and the support degree of the third element is 7 at this time , and so on.

被采纳长度,举例:Accepted length, for example:

被采纳长度,了解某序列的具备频繁条件的具体长度信息。序列从首元素开始计算其支持度,若大于支持度阈值则l1=1,L=(1),继续进行,否则l1=1,L=(0),结束;当l1=1时,计算第二个元素的支持度,若大于支持度阈值,则l2=2,L=(1,2),继续进行,否则保持l1=1,L=(1)不变,结束;当l2=2时,计算第三个元素的支持度,若大于支持度阈值则l3=3,L=(1,2,3),重复该步骤,否则保持不变,结束。Adopted length, understand the specific length information of a sequence with frequent conditions. The sequence calculates its support from the first element, if it is greater than the support threshold, then l 1 =1, L=(1), continue, otherwise l 1 =1, L=(0), end; when l 1 =1 , calculate the support degree of the second element, if it is greater than the support degree threshold, then l 2 =2, L=(1,2), continue, otherwise keep l 1 =1, L=(1) unchanged, end; When l 2 =2, calculate the support of the third element, if it is greater than the support threshold, then l 3 =3, L=(1,2,3), repeat this step, otherwise keep it unchanged, and end.

例如,浏览模式序列11→1→2→3→12→4→5→6→8→7→9→10,支持度阈值s=8。首元素AD的支持度为30>8,则l1=1,L=(1);第二个元素1的支持度为10>8,则l2=2,L=(1,2);第三个元素2的支持度为7<8,则保持l2=2,L=(1,2)不变,结束。For example, for the browsing pattern sequence 11→1→2→3→12→4→5→6→8→7→9→10, the support threshold s=8. The support degree of the first element AD is 30>8, then l 1 =1, L=(1); the support degree of the second element 1 is 10>8, then l 2 =2, L=(1,2); The support degree of the third element 2 is 7<8, then keep l 2 =2, L=(1,2) unchanged, and end.

支持度序列,举例:Support sequence, for example:

支持度序列,了解某序列的具体频繁度信息。从首元素开始计算第一个元素的支持度,当一个数据集内有300条数据,首元素为AD的有30条,此时首元素的支持度s1=30,S=(30);首元素一致的前提下,第二个元素为1的有10条,此时第二个元素的支持度为s2=10,S=(30,10);前两个元素一致的前提下,第三个元素为2的有7条,此时第三个元素的支持度为s3=7,7<s,结束,S=(30,10)保持不变,若第三个元素满足大于支持度阈值则以此类推。Support sequence, to understand the specific frequency information of a sequence. Calculate the support degree of the first element from the first element. When there are 300 pieces of data in a data set, and the first element is AD, there are 30 pieces. At this time, the support degree of the first element s 1 =30, S=(30); On the premise that the first element is consistent, there are 10 items whose second element is 1. At this time, the support degree of the second element is s 2 =10, S=(30,10); on the premise that the first two elements are consistent, There are 7 items with the third element being 2. At this time, the support degree of the third element is s 3 =7, 7<s, end, S=(30,10) remains unchanged, if the third element satisfies greater than And so on for the support threshold.

频繁度阈值,举例:Frequency threshold, for example:

频繁度阈值,通过经验调试出的,比较适合当前数据集大小的一个数值。当一个元素的支持度大于该数值时,即被认为该元素是频繁的,这里令s=8。The frequency threshold is a value that is more suitable for the size of the current data set, which is debugged through experience. When the support of an element is greater than this value, it is considered that the element is frequent, here let s=8.

步骤(4-2-3):计算序列首元素的支持度sj,若sj<s,则令序列sj=0并剔除该序列;Step (4-2-3): Calculate the support degree s j of the first element of the sequence, if s j <s, set the sequence s j =0 and delete the sequence;

例如,当前数据集内有30条数据的首元素是一样的,如AD,那么该首元素的支持度sj=30For example, if the first elements of 30 pieces of data in the current data set are the same, such as AD, then the support of the first element s j =30

计算序列首元素的支持度sj Calculate the support s j of the first element of the sequence

步骤(4-2-4):对剩余序列按照首元素值从大到小进行排序,创建与排序后每个首元素相对应的队列G1,G2,…,Gt,将序列按类别进入不同队列,并删除每个序列首元素;Step (4-2-4): Sort the remaining sequences according to the value of the first element from large to small, create a queue G 1 , G 2 ,…,G t corresponding to each first element after sorting, and sort the sequence by category Enter different queues and delete the first element of each sequence;

步骤(4-2-5):更新序列的L和S属性,为L和S分别添加lj+1=lj+1,sj+1,原始L=(l1,l2,…,lj),S=(s1,s2,…,sj),更新后为L=(l1,l2,…,lj,lj+1),S=(s1,s2,…,sj,sj+1);Step (4-2-5): Update the L and S attributes of the sequence, add l j+1 =l j +1, s j+1 for L and S respectively, the original L=(l 1 ,l 2 ,…, l j ), S=(s 1 ,s 2 ,…,s j ), after updating, it becomes L=(l 1 ,l 2 ,…,l j ,l j+1 ), S=(s 1 ,s 2 ,...,s j ,s j+1 );

步骤(4-2-6):重复步骤(4-2-3)和(4-2-4),直至每个序列中的元素都以支持度等于0结束为止;Step (4-2-6): Repeat steps (4-2-3) and (4-2-4) until the elements in each sequence end with support equal to 0;

步骤(4-2-7):计算每个序列的得分Fi=li*si,从Fi中找到最大得分maxFi,最大得分对应的序列为频繁浏览模式序列,输出浏览模式序列Qi,否则,判定序列为非频繁浏览模式序列。Step (4-2-7): Calculate the score F i =l i *s i of each sequence, find the maximum score maxF i from F i , the sequence corresponding to the maximum score is a frequent browsing pattern sequence, and output the browsing pattern sequence Q i , otherwise, it is determined that the sequence is a non-frequent browsing pattern sequence.

如表3所示的三种布局下使用DFBP算法得到的用户最常见的五种浏览模式。观察L1布局下的浏览序列,发现用户浏览初期如果检测到广告时,用户将优先检测广告,然后按序浏览;如果用户初期未检测广告,用户将按顺序浏览URL。L2布局时广告位于下方,用户自然按顺序浏览。L3布局时广告位于右侧,用户通常按照习惯先观察URL,但通常右侧广告图片大、颜色鲜艳,刺激力度大,所以在1、2号URL后,用户注意力将被右侧广告吸引。As shown in Table 3, the five most common browsing patterns of users obtained by using the DFBP algorithm under the three layouts. Observing the browsing sequence under the L1 layout, it is found that if the user detects an advertisement at the beginning of browsing, the user will first detect the advertisement and then browse in order; if the user does not detect the advertisement at the beginning, the user will browse the URL in order. In the L2 layout, the advertisement is located at the bottom, and users naturally browse in order. In the L3 layout, the advertisement is on the right side. Users usually observe the URL first according to the habit, but usually the advertisement picture on the right side is large, the color is bright, and the stimulus is strong. Therefore, after URL 1 and 2, the user’s attention will be attracted by the advertisement on the right side.

表3table 3

观察三种不同布局,可以看出不同布局下,用户浏览模式大致是呈现自上而下型。并且还可以发现,很多时候用户首先注视到的区域为2号位URL,之后反溯到1号位。结合实际,这种情况应该是由于用户在使用电脑进行搜索、浏览时,打开新网页会有一定的时间间隔,而这段间隔会使用户视线重置到屏幕中间区域,加之用户通过日常积累了解到结果主要在页面左侧排列,因此用户首先注视2号位的URL,了解后用户将重新按序阅读过程。Observing the three different layouts, it can be seen that under different layouts, the user browsing mode is roughly top-down. And it can also be found that in many cases, the area that the user first looks at is the URL of No. 2, and then backtracks to No. 1. Combined with reality, this situation should be due to the fact that when users use computers to search and browse, there will be a certain time interval when opening a new webpage, and this interval will cause the user's sight to reset to the middle area of the screen. The results are mainly arranged on the left side of the page, so the user first pays attention to the URL at No. 2, and after understanding, the user will read the process in order again.

本实施例将运用眼动追踪方式获取用户在浏览网页过程中无意识的注意信息,通过网页中嵌入的JavaScript代码获取用户的光标信息。本发明将提出一种新颖的、定向定长的定向频繁浏览模式挖掘算法DFBP用于挖掘频繁时序模式,并利用频繁项集挖掘算法挖掘用户的光标行为模式。通过眼动特征和光标特征,分析搜索引擎结果页内在不同类型布局和不同类型用户下常见的用户行为模式,为优化页面布局,改善链接投放效果具有重要贡献。In this embodiment, the eye tracking method is used to obtain the unconscious attention information of the user in the process of browsing the webpage, and the user's cursor information is obtained through the JavaScript code embedded in the webpage. The present invention proposes a novel, directional and fixed-length directional frequent browsing pattern mining algorithm DFBP for mining frequent sequential patterns, and uses the frequent itemset mining algorithm to mine user cursor behavior patterns. Through eye movement features and cursor features, the analysis of common user behavior patterns in different types of layouts and different types of users in search engine results pages has made important contributions to optimizing page layouts and improving link delivery effects.

实施例2:Example 2:

本实施例2的目的是提供一种计算机可读存储介质。The purpose of Embodiment 2 is to provide a computer-readable storage medium.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种计算机可读存储介质,其中存储有多条指令,所述指令适于由终端设备设备的处理器加载并执行以下处理:A computer-readable storage medium, in which a plurality of instructions are stored, and the instructions are adapted to be loaded by a processor of a terminal device and perform the following processing:

接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information;

接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages;

可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis;

分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior patterns of different types of users and different types of users under the page layout are respectively excavated for analysis, and the user behavior patterns of search result pages are evaluated.

在本实施例中,计算机可读记录介质的例子包括磁存储介质(例如,ROM,RAM,USB,软盘,硬盘等)、光学记录介质(例如,CD-ROM或DVD)、PC接口(例如,PCI、PCI-Expres、WiFi等)等。然而,本公开的各个方面不限于此。In this embodiment, examples of the computer-readable recording medium include magnetic storage media (for example, ROM, RAM, USB, floppy disk, hard disk, etc.), optical recording media (for example, CD-ROM or DVD), PC interfaces (for example, PCI, PCI-Express, WiFi, etc.), etc. However, aspects of the present disclosure are not limited thereto.

实施例3:Example 3:

本实施例3的目的是提供一种搜索结果页用户行为模式测评装置。The purpose of Embodiment 3 is to provide a device for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

一种搜索结果页用户行为模式测评装置,采用一种互联网终端设备,包括处理器和计算机可读存储介质,处理器用于实现各指令;计算机可读存储介质用于存储多条指令,所述指令适于由处理器加载并执行以下处理:A device for evaluating user behavior patterns on a search result page, using an Internet terminal device, including a processor and a computer-readable storage medium, the processor is used to implement instructions; the computer-readable storage medium is used to store multiple instructions, and the instructions is adapted to be loaded by the processor and perform the following processing:

接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information;

接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages;

可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis;

分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior patterns of different types of users and different types of users under the page layout are respectively excavated for analysis, and the user behavior patterns of search result pages are evaluated.

本领域技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算机装置来实现,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。本发明不限制于任何特定的硬件和软件的结合。Those skilled in the art should understand that each module or each step of the present invention described above can be realized by a general-purpose computer device, and optionally, they can be realized by a program code executable by the computing device, thereby, they can be stored in The storage device is executed by the computing device, or they are manufactured as individual integrated circuit modules, or multiple modules or steps among them are manufactured as a single integrated circuit module. The invention is not limited to any specific combination of hardware and software.

实施例4:Example 4:

本实施例4的目的是提供一种搜索结果页用户行为模式测评系统。The purpose of Embodiment 4 is to provide a system for evaluating user behavior patterns on a search result page.

为了实现上述目的,本发明采用如下一种技术方案:In order to achieve the above object, the present invention adopts the following technical scheme:

如图9所示,As shown in Figure 9,

一种搜索结果页用户行为模式测评系统,该系统基于上述一种搜索结果页用户行为模式测评方法,包括:用户信息采集装置、行为数据采集装置和行为模式测评装置;A search result page user behavior pattern assessment system, the system is based on the above search result page user behavior pattern assessment method, comprising: a user information collection device, a behavior data collection device and a behavior pattern assessment device;

所述用户信息采集装置,用于采集用户信息,并发送至行为模式测评装置;The user information collection device is used to collect user information and send it to the behavior pattern evaluation device;

所述行为数据采集装置,用于采集用户在搜索引擎结果页内的至少两种不同类型的测试行为数据,并发送至行为模式测评装置;The behavior data collection device is used to collect at least two different types of test behavior data of the user in the search engine result page, and send it to the behavior pattern evaluation device;

所述行为模式测评装置,用于接收用户信息,根据用户信息中认知风格进行用户类型划分;接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;可视化处理测试行为数据并进行初步定性分析;分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior pattern evaluation device is used to receive user information, classify user types according to the cognitive style in the user information; receive at least two different types of test behavior data of the user in the search engine result page; visually process the test behavior data and Carry out preliminary qualitative analysis; excavate different user types and different types of user behavior patterns under the page layout for analysis, and evaluate the user behavior patterns of the search results page.

所述用户信息采集装置是利用表格统计形式采集用户基本信息,并通过镶嵌图形测试获取用户认知信息。The user information collection device collects basic user information in the form of table statistics, and obtains user cognition information through a mosaic pattern test.

所述行为数据采集装置,具体包括:The behavioral data collection device specifically includes:

眼动数据采集模块,被配置为使用眼动追踪装置获取用户浏览搜索引擎结果页过程中所产生的眼动数据;在本实施例中,眼动追踪装置为德国普升科技有限公司研发的SMIRED(Version2.5)眼动仪,选取的采样频率为120Hz。The eye movement data acquisition module is configured to use the eye movement tracking device to obtain the eye movement data generated during the user's browsing of the search engine result page; in this embodiment, the eye movement tracking device is SMIRED developed by Germany Pusheng Technology Co., Ltd. (Version2.5) eye tracker, the selected sampling frequency is 120Hz.

光标数据采集模块,被配置为在搜索引擎结果页内嵌入JavaScript代码获取用户浏览搜索引擎结果页过程中所产生的光标数据。The cursor data collection module is configured to embed JavaScript code in the search engine result page to obtain cursor data generated during the user's browsing of the search engine result page.

所述行为模式测评装置,具体包括:The behavior pattern evaluation device specifically includes:

初步定性分析模块:将获取到的数据进行可视化处理,通过得到的可视化数据图进行初步定性分析不同因素、不同信息源下搜索结果页内各条链接间映射关系;Preliminary qualitative analysis module: Visualize the obtained data, and conduct preliminary qualitative analysis of the mapping relationship between different factors and links in the search result page under different information sources through the obtained visualized data graph;

光标行为模式分析模块,被配置为通过捕获的光标数据,使用频繁项集挖掘算法,挖掘不同用户类型和不同布局下常见的光标行为模式,并结合实际说明结果;The cursor behavior pattern analysis module is configured to use the frequent itemset mining algorithm to mine the common cursor behavior patterns under different user types and different layouts through the captured cursor data, and explain the results in combination with the actual situation;

眼动行为模式分析模块,被配置为根据获取的眼动数据,利用频繁浏览模式挖掘算法,挖掘不同用户类型和不同布局下具有时序性的用户常见浏览模式,分析用户认知方式和各项链接间的组合关系,并结合实际说明结果。The eye movement behavior pattern analysis module is configured to use the frequent browsing pattern mining algorithm based on the obtained eye movement data to mine the common browsing patterns of users with time sequence under different user types and different layouts, and analyze the user's cognitive mode and various links The combination relationship among them, and combined with the actual explanation results.

本发明的有益效果:Beneficial effects of the present invention:

(1)本发明所述的一种搜索结果页用户行为模式测评方法、装置及系统,,获取用户在搜索引擎结果页内的至少两种不同类型的测试行为数据,例如,用户的眼动数据和光标数据,通过搜索引擎结果页内用户的浏览过程中所产生的至少两种信息源特征进行频繁行为模式挖掘,分析页面布局、用户类型与搜索引擎结果页以及各项链接间的组合模式,以及时序关系,本发明对改善搜索引擎结果页内链接布局方式,个性化推荐信息,以及广告投放效果具有重要意义。(1) A search result page user behavior pattern evaluation method, device and system according to the present invention, obtain at least two different types of test behavior data of the user in the search engine result page, for example, the user's eye movement data and cursor data, mining frequent behavior patterns through at least two information source characteristics generated during the user’s browsing process in the search engine result page, and analyzing the combination mode between page layout, user type, search engine result page and various links, As well as the timing relationship, the present invention is of great significance for improving the link layout mode in the search engine result page, personalized recommendation information, and advertisement delivery effect.

(2)本发明所述的一种搜索结果页用户行为模式测评方法、装置及系统,集搜索引擎结果页内的多种信息源的行为信息,提出一种新颖的、定向定长的频繁浏览模式挖掘算法挖掘用户浏览过程中的频繁时序模式,并利用频繁项集挖掘算法挖掘用户的光标行为模式,为精确分析各种布局与用户类型下链接间的关系,优化页面布局,提升用户体验提供重要依据。(2) A search result page user behavior pattern evaluation method, device and system according to the present invention collects behavior information of various information sources in the search engine result page, and proposes a novel, directional and fixed-length frequent browsing The pattern mining algorithm mines the frequent sequential patterns in the user's browsing process, and uses the frequent item set mining algorithm to mine the user's cursor behavior pattern, in order to accurately analyze the relationship between various layouts and links under user types, optimize the page layout, and improve user experience. Important reference.

以上所述仅为本申请的优选实施例而已,并不用于限制本申请,对于本领域的技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。因此,本发明将不会被限制于本文所示的这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。The above descriptions are only preferred embodiments of the present application, and are not intended to limit the present application. For those skilled in the art, there may be various modifications and changes in the present application. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of this application shall be included within the protection scope of this application. Therefore, the present invention will not be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (14)

1.一种搜索结果页用户行为模式测评方法,其特征在于,该方法包括:1. A search result page user behavior pattern evaluation method, is characterized in that, the method comprises: 接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information; 接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages; 可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis; 分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式;用户不同类型的行为模式包括光标行为模式和眼动浏览模式;采用频繁项集挖掘算法挖掘不同用户类型、页面布局下的光标行为模式;Mining different user types and different types of user behavior patterns under the page layout for analysis, and evaluating user behavior patterns on the search result page; different types of user behavior patterns include cursor behavior patterns and eye movement browsing patterns; frequent itemset mining algorithms are used to mine Cursor behavior patterns under different user types and page layouts; 采用定向频繁浏览模式挖掘算法挖掘不同用户类型、页面布局下的眼动浏览模式,所述定向频繁浏览模式挖掘算法用于挖掘不同页面布局下用户定向定长的眼动浏览模式,得到用户行为进程中的时序型信息。Using the directional frequent browsing pattern mining algorithm to mine eye movement browsing patterns under different user types and page layouts, the directional frequent browsing pattern mining algorithm is used to mine user directional and fixed-length eye movement browsing patterns under different page layouts to obtain user behavior processes Time series information in . 2.如权利要求1所述的方法,其特征在于,所述接收用户信息前,根据视觉情况,剔除不适合眼动采集的用户,在剩余用户中随机选取若干用户进行搜索结果页用户行为模式测评。2. The method according to claim 1, wherein before receiving user information, users who are not suitable for eye movement collection are eliminated according to visual conditions, and a number of users are randomly selected from the remaining users to perform user behavior patterns on the search result page Evaluation. 3.如权利要求1所述的方法,其特征在于,3. The method of claim 1, wherein, 所述用户信息包括用户基本信息和用户认知信息;The user information includes basic user information and user cognitive information; 所述用户基本信息包括用户的姓名、性别、年龄和职业;The basic user information includes the user's name, gender, age and occupation; 所述用户认知信息采用镶嵌图形测试法获取,包括认知风格。The user cognition information is obtained by mosaic graph test method, including cognition style. 4.如权利要求1所述的方法,其特征在于,所述测试行为数据为引擎结果页内用户的浏览过程中所产生的信息源数据,包括但不限于光标数据和眼动数据;4. The method according to claim 1, wherein the test behavior data is information source data generated during the user's browsing process in the engine result page, including but not limited to cursor data and eye movement data; 所述光标数据为用户浏览搜索引擎结果页过程中获取的输入设备触发的光标事件;The cursor data is a cursor event triggered by an input device obtained during the user's browsing of the search engine result page; 所述眼动数据为用户浏览搜索引擎结果页过程中获取的眼动追踪信息。The eye-movement data is eye-tracking information acquired during a user's browsing of a search engine result page. 5.如权利要求4所述的方法,其特征在于,所述可视化处理测试行为数据并进行初步定性分析的具体步骤包括:5. method as claimed in claim 4, is characterized in that, the concrete step of described visual processing test behavior data and carrying out preliminary qualitative analysis comprises: 将光标数据和眼动数据进行数据可视化;Data visualization of cursor data and eye movement data; 初步定性分析页面布局对所述眼动数据的影响。Preliminary qualitative analysis of the impact of page layout on the eye movement data. 6.如权利要求5所述的方法,其特征在于,所述分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析的具体步骤包括:6. The method according to claim 5, characterized in that, the specific steps of analyzing the different types of behavior patterns of users under different user types and page layouts respectively include: 采用与测评行为数据类型数量相同的频繁模式挖掘法,Using the same frequent pattern mining method as the number of evaluation behavior data types, 挖掘不同用户类型、页面布局下的光标行为模式,根据光标数据,分析搜索结果页内不同链接与用户类型、页面布局间光标行为的关联模式,和不同用户类型、页面布局下各链接内光标行为参数的组合模式;Mining the cursor behavior patterns under different user types and page layouts, based on the cursor data, analyzing the correlation patterns between different links in the search results page and the cursor behaviors between user types and page layouts, as well as the cursor behaviors in links under different user types and page layouts Combination mode of parameters; 以及挖掘不同用户类型、页面布局下的眼动浏览模式,根据眼动数据得到用户浏览搜索引擎结果页过程中时序型信息,分析用户浏览搜索引擎结果页过程中各链接间的顺序关系。And mining the eye-movement browsing patterns under different user types and page layouts, according to the eye-movement data, the time-series information in the process of users browsing the search engine result pages is obtained, and the sequence relationship between the links in the process of users browsing the search engine result pages is analyzed. 7.如权利要求1所述的方法,其特征在于,所述定向频繁浏览模式挖掘法包括:7. The method according to claim 1, wherein the directional frequent browsing pattern mining method comprises: 根据所述测试行为数据得到用户在搜索引擎结果页的浏览序列数据;Obtain the browsing sequence data of the user on the search engine result page according to the test behavior data; 在浏览序列数据中添加采纳长度和其对应的支持度属性,并初始化;Add the adoption length and its corresponding support attribute in the browsing sequence data, and initialize it; 处理每个浏览序列的支持度,使其为零,得到新序列;Process the support of each browsing sequence to make it zero and get a new sequence; 判断新序列是否为频繁序列,输出频繁序列。Determine whether the new sequence is a frequent sequence, and output the frequent sequence. 8.如权利要求7所述的方法,其特征在于,8. The method of claim 7, wherein, 预设链接区域范围,由所述测试行为数据中提取出用户进入每条链接区域范围内的时间。The range of the link area is preset, and the time when the user enters the range of each link area is extracted from the test behavior data. 9.如权利要求7所述的方法,其特征在于,9. The method of claim 7, wherein, 所述根据所述测试行为数据得到用户在搜索引擎结果页的浏览序列的具体步骤包括:The specific steps of obtaining the browsing sequence of the user on the search engine result page according to the test behavior data include: 根据页面布局对所述测试行为数据进行粗分类;Roughly classifying the test behavior data according to the page layout; 根据用户进入每条链接区域范围内的时间的先后顺序进行排列,每个用户查看每个网页都对应一条浏览序列数据。Arrange according to the order of time when users enter the area of each link, and each user views each web page corresponding to a piece of browsing sequence data. 10.如权利要求7所述的方法,其特征在于,10. The method of claim 7, wherein, 所述处理每个浏览序列的支持度使其为零的具体步骤包括:The specific steps of processing the support of each browsing sequence to make it zero include: 预设支持度阈值;Preset support threshold; 计算浏览序列数据首元素的支持度,将小于支持度阈值的首元素支持度置零并剔除该序列;Calculate the support of the first element of the browsing sequence data, set the support of the first element less than the support threshold to zero and eliminate the sequence; 对浏览序列数据按照首元素值进行排序后分类,创建与之相对应的队列,将序列按类别进入不同队列,并删除每个序列首元素;Sort and classify the browsing sequence data according to the value of the first element, create a queue corresponding to it, enter the sequence into different queues by category, and delete the first element of each sequence; 更新序列的采纳长度和其对应的支持度属性,直至每个序列中的元素都以支持度等于0结束为止。Update the adopted length of the sequence and its corresponding support attribute until the elements in each sequence end with support equal to 0. 11.如权利要求7所述的方法,其特征在于,11. The method of claim 7, wherein, 判断新序列是否为频繁序列的具体步骤为:The specific steps for judging whether a new sequence is a frequent sequence are: 计算新序列的得分,所述得分为该序列的采纳长度与对应支持度的乘积;Calculating the score of the new sequence, the score being the product of the adopted length of the sequence and the corresponding support degree; 将得分进行排序,确定最大得分;Sort the scores to determine the maximum score; 最大得分对应的新序列为频繁浏览模式序列,否则,判定新序列为非频繁浏览模式序列。The new sequence corresponding to the maximum score is a frequent browsing pattern sequence; otherwise, it is determined that the new sequence is a non-frequent browsing pattern sequence. 12.一种计算机可读存储介质,其中存储有多条指令,其特征在于,所述指令适于由终端设备设备的处理器加载并执行以下处理:12. A computer-readable storage medium, wherein a plurality of instructions are stored, wherein the instructions are adapted to be loaded by a processor of a terminal device and perform the following processing: 接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information; 接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages; 可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis; 分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式;用户不同类型的行为模式包括光标行为模式和眼动浏览模式;采用频繁项集挖掘算法挖掘不同用户类型、页面布局下的光标行为模式;Mining different user types and different types of user behavior patterns under the page layout for analysis, and evaluating user behavior patterns on the search result page; different types of user behavior patterns include cursor behavior patterns and eye movement browsing patterns; frequent itemset mining algorithms are used to mine Cursor behavior patterns under different user types and page layouts; 采用定向频繁浏览模式挖掘算法挖掘不同用户类型、页面布局下的眼动浏览模式,所述定向频繁浏览模式挖掘算法用于挖掘不同页面布局下用户定向定长的眼动浏览模式,得到用户行为进程中的时序型信息。Using the directional frequent browsing pattern mining algorithm to mine eye movement browsing patterns under different user types and page layouts, the directional frequent browsing pattern mining algorithm is used to mine user directional and fixed-length eye movement browsing patterns under different page layouts to obtain user behavior processes Time series information in . 13.一种搜索结果页用户行为模式测评装置,采用互联网终端设备,包括处理器和计算机可读存储介质,处理器用于实现各指令;计算机可读存储介质用于存储多条指令,其特征在于,所述指令适于由处理器加载并执行以下处理:13. A search result page user behavior mode evaluation device, using Internet terminal equipment, including a processor and a computer-readable storage medium, the processor is used to implement each instruction; the computer-readable storage medium is used to store multiple instructions, characterized in that , the instructions are adapted to be loaded by the processor and perform the following processing: 接收用户信息,根据用户信息中认知风格进行用户类型划分;Receive user information, and classify user types according to the cognitive style in the user information; 接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;receiving at least two different types of test behavior data of users within search engine results pages; 可视化处理测试行为数据并进行初步定性分析;Visually process test behavior data and conduct preliminary qualitative analysis; 分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式;用户不同类型的行为模式包括光标行为模式和眼动浏览模式;采用频繁项集挖掘算法挖掘不同用户类型、页面布局下的光标行为模式;Mining different user types and different types of user behavior patterns under the page layout for analysis, and evaluating user behavior patterns on the search result page; different types of user behavior patterns include cursor behavior patterns and eye movement browsing patterns; frequent itemset mining algorithms are used to mine Cursor behavior patterns under different user types and page layouts; 采用定向频繁浏览模式挖掘算法挖掘不同用户类型、页面布局下的眼动浏览模式,所述定向频繁浏览模式挖掘算法用于挖掘不同页面布局下用户定向定长的眼动浏览模式,得到用户行为进程中的时序型信息。Using the directional frequent browsing pattern mining algorithm to mine eye movement browsing patterns under different user types and page layouts, the directional frequent browsing pattern mining algorithm is used to mine user directional and fixed-length eye movement browsing patterns under different page layouts to obtain user behavior processes Time series information in . 14.一种搜索结果页用户行为模式测评系统,如权利要求1-11任一项所述的一种搜索结果页用户行为模式测评方法,其特征在于,包括:用户信息采集装置、行为数据采集装置和行为模式测评装置;14. A search result page user behavior pattern evaluation system, a search result page user behavior pattern evaluation method as claimed in any one of claims 1-11, characterized in that it comprises: a user information collection device, a behavior data collection devices and behavioral pattern assessment devices; 所述用户信息采集装置,用于采集用户信息,并发送至行为模式测评装置;The user information collection device is used to collect user information and send it to the behavior pattern evaluation device; 所述行为数据采集装置,用于采集用户在搜索引擎结果页内的至少两种不同类型的测试行为数据,并发送至行为模式测评装置;The behavior data collection device is used to collect at least two different types of test behavior data of the user in the search engine result page, and send it to the behavior pattern evaluation device; 所述行为模式测评装置,用于接收用户信息,根据用户信息中认知风格进行用户类型划分;接收用户在搜索引擎结果页内的至少两种不同类型的测试行为数据;可视化处理测试行为数据并进行初步定性分析;分别挖掘不同用户类型、页面布局下的用户不同类型的行为模式进行分析,评价搜索结果页用户行为模式。The behavior pattern evaluation device is used to receive user information, classify user types according to the cognitive style in the user information; receive at least two different types of test behavior data of the user in the search engine result page; visually process the test behavior data and Carry out preliminary qualitative analysis; excavate different user types and different types of user behavior patterns under the page layout for analysis, and evaluate the user behavior patterns of the search results page.
CN201711144282.2A 2017-11-17 2017-11-17 A kind of search results pages user behavior pattern assessment method, apparatus and system Active CN108009215B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711144282.2A CN108009215B (en) 2017-11-17 2017-11-17 A kind of search results pages user behavior pattern assessment method, apparatus and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711144282.2A CN108009215B (en) 2017-11-17 2017-11-17 A kind of search results pages user behavior pattern assessment method, apparatus and system

Publications (2)

Publication Number Publication Date
CN108009215A CN108009215A (en) 2018-05-08
CN108009215B true CN108009215B (en) 2018-11-06

Family

ID=62052794

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711144282.2A Active CN108009215B (en) 2017-11-17 2017-11-17 A kind of search results pages user behavior pattern assessment method, apparatus and system

Country Status (1)

Country Link
CN (1) CN108009215B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109245938B (en) * 2018-10-10 2021-06-25 上海尚往网络科技有限公司 Method and equipment for executing resource configuration operation of user
CN109885746A (en) * 2019-01-17 2019-06-14 平安城市建设科技(深圳)有限公司 Page Dynamic Distribution method, apparatus, equipment and storage medium
CN113569178B (en) * 2021-07-19 2024-07-23 上海淇玥信息技术有限公司 Method and device for optimizing external chain delivery based on user behavior analysis and electronic equipment
CN114201412B (en) * 2022-02-16 2022-05-06 广东数源智汇科技有限公司 Evaluation method for thousand-person and thousand-face degrees of search engine

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102496012A (en) * 2011-12-14 2012-06-13 上海海事大学 Device and method for discovering potential demands based on eye movement tracking and historical behavior
CN103177170A (en) * 2011-12-21 2013-06-26 中国移动通信集团四川有限公司 Hotspot analysis method and hotspot analysis system used for collecting eye movement of user
CN103440328A (en) * 2013-09-03 2013-12-11 暨南大学 User classification method based on mouse behaviors
CN103488507A (en) * 2013-09-18 2014-01-01 北京思特奇信息技术股份有限公司 User behavior trajectory playback method

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8458171B2 (en) * 2009-01-30 2013-06-04 Google Inc. Identifying query aspects
CN102279786B (en) * 2011-08-25 2015-11-25 百度在线网络技术(北京)有限公司 A kind of method of monitoring and measuring application program effective access amount and device
CN103823905A (en) * 2014-03-18 2014-05-28 北京奇虎科技有限公司 Method and device for marking URL in search result page
CN106844720A (en) * 2017-02-09 2017-06-13 郑州云海信息技术有限公司 A kind of method and device for searching for data processing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102496012A (en) * 2011-12-14 2012-06-13 上海海事大学 Device and method for discovering potential demands based on eye movement tracking and historical behavior
CN103177170A (en) * 2011-12-21 2013-06-26 中国移动通信集团四川有限公司 Hotspot analysis method and hotspot analysis system used for collecting eye movement of user
CN103440328A (en) * 2013-09-03 2013-12-11 暨南大学 User classification method based on mouse behaviors
CN103488507A (en) * 2013-09-18 2014-01-01 北京思特奇信息技术股份有限公司 User behavior trajectory playback method

Also Published As

Publication number Publication date
CN108009215A (en) 2018-05-08

Similar Documents

Publication Publication Date Title
US7596552B2 (en) Method and system for extracting web data
CN107885857A (en) A kind of search results pages user&#39;s behavior pattern mining method, apparatus and system
CN109829733B (en) False comment detection system and method based on shopping behavior sequence data
CN108009215B (en) A kind of search results pages user behavior pattern assessment method, apparatus and system
US9069872B2 (en) Relating web page change with revisitation patterns
Clarkson et al. Resultmaps: Visualization for search interfaces
CN103838885A (en) Advertisement-putting-oriented potential user searching and user model ordering method
Diaz et al. Robust models of mouse movement on dynamic web search results pages
Lagun et al. Viewser: Enabling large-scale remote user studies of web search examination and interaction
CN106920129A (en) A kind of network advertisement effect evaluation system and its method that tracking is moved based on eye
Williams et al. Detecting good abandonment in mobile search
US20090299975A1 (en) System and method for document analysis, processing and information extraction
WO2017190610A1 (en) Target user orientation method and device, and computer storage medium
CN106127546A (en) A kind of Method of Commodity Recommendation based on the big data in intelligence community
US9344507B2 (en) Method of processing web access information and server implementing same
CN107798563A (en) Internet advertising effect assessment method and system based on multi-modal feature
US20180096067A1 (en) Creation and optimization of resource contents
CN107783945A (en) A kind of search result web page notice assessment method and device based on the dynamic tracking of eye
CN110096681A (en) Contract terms analysis method, device, equipment and readable storage medium storing program for executing
CN115017200A (en) Search result sorting method and device, electronic equipment and storage medium
CN105512224A (en) Search engine user satisfaction automatic assessment method based on cursor position sequence
Liu et al. Predicting search user examination with visual saliency
JP4827900B2 (en) Questionnaire result analysis support apparatus and method
TWI642017B (en) Method and apparatus for evaluating target group
JP6320353B2 (en) Digital marketing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20210825

Address after: Room 507, Golden Times Square, 9999 jingshidong Road, Lixia District, Jinan City, Shandong Province, 250000

Patentee after: Shandong Education Equipment Center Co.,Ltd.

Address before: 250014 No. 88 East Wenhua Road, Shandong, Ji'nan

Patentee before: SHANDONG NORMAL University