CN102955829B - For the method being ranked up to resource items, device and equipment - Google Patents
For the method being ranked up to resource items, device and equipment Download PDFInfo
- Publication number
- CN102955829B CN102955829B CN201110254123.4A CN201110254123A CN102955829B CN 102955829 B CN102955829 B CN 102955829B CN 201110254123 A CN201110254123 A CN 201110254123A CN 102955829 B CN102955829 B CN 102955829B
- Authority
- CN
- China
- Prior art keywords
- input sequence
- information
- time
- timeliness
- resource
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明提供一种用于对资源项进行排序的方法、装置和设备,根据本发明的方案根据来自用户的输入序列来获取与该输入序列相关的时效信息;并根据所述时效信息是否满足预定条件,判断所述输入序列是否具有时效性;并当判断所述输入序列具有时效性时,对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。与现有技术相比,本发明具有以下优点:1)针对基于具有时效性的输入序列进行检索所得的资源项进行时效性排序,以将可能与该输入序列的时效性相关的资源项排序提前,更好地满足了用户的突发检索需求;2)根据资源项的新鲜度来执行时效性排序,从而获得更好的时效性排序结果。
The present invention provides a method, device and equipment for sorting resource items. According to the scheme of the present invention, the timeliness information related to the input sequence is obtained according to the input sequence from the user; and according to whether the timeliness information meets the predetermined condition, judging whether the input sequence is time-sensitive; and when judging that the input sequence is time-sensitive, sorting the resource items retrieved based on the input sequence according to timeliness, so as to provide the sorted resource items to said user. Compared with the prior art, the present invention has the following advantages: 1) Time-sensitive sorting is performed on the resource items retrieved based on the time-sensitive input sequence, so as to advance the sorting of resource items that may be related to the time-sensitive input sequence , to better meet the user's sudden retrieval needs; 2) perform time-sensitive sorting according to the freshness of resource items, so as to obtain better time-sensitive sorting results.
Description
技术领域technical field
本发明涉及计算机技术领域,尤其涉及用于对资源项进行排序的方法、装置和设备。The present invention relates to the field of computer technology, in particular to a method, device and equipment for sorting resource items.
背景技术Background technique
现有技术中,往往因为某些突发事件而产生与该等突发事件相关的突发检索需求。这种需求具有不可预期、短时间爆发、检索需求集中的特点,故现有技术中尚缺乏能够根据突发检索需求来向用户提供时效性排序结果的方案。In the prior art, sudden retrieval requirements related to some unexpected events are often generated due to these unexpected events. This kind of demand has the characteristics of unpredictable, short-term burst, and concentrated retrieval demand. Therefore, there is still a lack of solutions in the prior art that can provide users with time-sensitive sorting results according to sudden retrieval demand.
发明内容Contents of the invention
本发明的目的是提供一种用于对资源项进行排序的方法、装置和设备。The object of the present invention is to provide a method, device and equipment for sorting resource items.
根据本发明的一个方面,提供一种用于对资源项进行排序的方法,其中,该方法包括以下步骤:According to one aspect of the present invention, there is provided a method for sorting resource items, wherein the method includes the following steps:
a根据来自用户的输入序列来获取与该输入序列相关的时效信息;a Acquiring time-sensitive information related to the input sequence according to the input sequence from the user;
b根据所述时效信息是否满足预定条件,判断所述输入序列是否具有时效性;b. Judging whether the input sequence has timeliness according to whether the timeliness information satisfies a predetermined condition;
其中,该方法还包括以下步骤:Wherein, the method also includes the following steps:
x当判断所述输入序列具有时效性时,对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。x When it is judged that the input sequence has timeliness, perform timeliness sorting on the resource items retrieved based on the input sequence, so as to provide the sorted resource items to the user.
根据本发明的另一个方面,还提供了一种用于对资源项进行排序的排序装置,其中,该排序装置包括:According to another aspect of the present invention, a sorting device for sorting resource items is also provided, wherein the sorting device includes:
时效信息获取装置,用于根据来自用户的输入序列来获取与该输入序列相关的时效信息;A timeliness information acquisition device, configured to acquire timeliness information related to the input sequence according to the input sequence from the user;
第一判断装置,用于根据所述时效信息是否满足预定条件,判断所述输入序列是否具有时效性;The first judging means is used for judging whether the input sequence has timeliness according to whether the timeliness information satisfies a predetermined condition;
时效排序装置,用于当判断所述输入序列具有时效性时,对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。The timeliness sorting device is configured to perform timeliness sorting on the resource items retrieved based on the input sequence when it is judged that the input sequence is timeliness, so as to provide the sorted resource items to the user.
与现有技术相比,本发明具有以下优点:1)能够识别具有时效性的输入序列,并能够针对基于该具有时效性的输入序列进行检索所得的资源项进行时效性排序,以将可能与该输入序列的时效性相关的资源项排序提前,更好地满足了用户的突发检索需求;2)根据资源项的新鲜度来执行时效性排序,从而获得更好的时效性排序结果。Compared with the prior art, the present invention has the following advantages: 1) It can identify the time-sensitive input sequence, and can perform time-sensitive sorting on the resource items retrieved based on the time-sensitive input sequence, so as to sort the resource items that may be related to The timeliness-related resource items of the input sequence are sorted in advance, which better meets the user's sudden retrieval needs; 2) The timeliness sorting is performed according to the freshness of the resource items, so as to obtain better timeliness sorting results.
附图说明Description of drawings
通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本发明的其它特征、目的和优点将会变得更明显:Other characteristics, objects and advantages of the present invention will become more apparent by reading the detailed description of non-limiting embodiments made with reference to the following drawings:
图1为本发明一个方面的用于对资源项进行排序的方法流程图;Fig. 1 is a flowchart of a method for sorting resource items according to one aspect of the present invention;
图2为本发明一个优选实施例的用于对资源项进行排序的方法流程图;Fig. 2 is a flowchart of a method for sorting resource items according to a preferred embodiment of the present invention;
图3为本发明一个方面的用于对资源项进行排序的排序装置结构示意图;3 is a schematic structural diagram of a sorting device for sorting resource items according to one aspect of the present invention;
图4为本发明一个优选实施例的用于对资源项进行排序的排序装置结构示意图。Fig. 4 is a schematic structural diagram of a sorting device for sorting resource items according to a preferred embodiment of the present invention.
附图中相同或相似的附图标记代表相同或相似的部件。The same or similar reference numerals in the drawings represent the same or similar components.
具体实施方式detailed description
下面结合附图对本发明作进一步详细描述。The present invention will be described in further detail below in conjunction with the accompanying drawings.
图1为本发明一个方面的用于对资源项进行排序的方法流程图。其中,根据本发明的方法主要通过计算机设备中的排序装置来完成,该排序装置包括但不限于安装在计算机设备中且能够实现本发明方案的诸如应用模块、操作系统、处理控制器等的装置。其中,该计算机设备包括但不限于:1)用户设备;2)网络设备。所述用户设备包括但不限于计算机、智能手机、PDA等;所述网络设备包括但不限于单个网络服务器、多个网络服务器组成的服务器组或基于云计算(Cloud Computing)的由大量计算机或网络服务器构成的云,其中,云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。Fig. 1 is a flowchart of a method for sorting resource items according to one aspect of the present invention. Among them, the method according to the present invention is mainly completed by the sorting device in the computer equipment, which includes but not limited to devices such as application modules, operating systems, processing controllers, etc. installed in the computer equipment and capable of implementing the solutions of the present invention . Wherein, the computer equipment includes but not limited to: 1) user equipment; 2) network equipment. The user equipment includes but is not limited to computers, smart phones, PDAs, etc.; the network equipment includes but is not limited to a single network server, a server group composed of multiple network servers, or a large number of computers or networks based on cloud computing (Cloud Computing). A cloud composed of servers, among them, cloud computing is a kind of distributed computing, a super virtual computer composed of a group of loosely coupled computer sets.
在步骤S1中,排序装置根据来自用户的输入序列来获取与该输入序列相关的时效信息。In step S1, the sorting device acquires timeliness information related to the input sequence according to the input sequence from the user.
具体地,排序装置获取其他装置或设备提供的输入序列,或者,实时获取来自用户的输入序列,并根据该输入序列来获取与该输入序列相关的时效信息。其中,排序装置获取与该输入序列相关的时效信息的方式包括但不限于:Specifically, the sorting device obtains an input sequence provided by other devices or devices, or obtains an input sequence from a user in real time, and obtains time-sensitive information related to the input sequence according to the input sequence. Among them, the way for the sorting device to obtain the timeliness information related to the input sequence includes but is not limited to:
1)排序装置根据该输入序列来在预建立的用户需求突增关键词库中进行查询,并将查询所得的查询结果作为所述时效信息。其中,该查询结果中包括以下任一项:a)用于表示能够在用户需求突增关键词库查询获得该输入序列或该输入序列包含的关键词的标识信息,例如,“Y”、“1”等;b)用于表示未能够在用户需求突增关键词库查询获得该输入序列或该输入序列包含的关键词的标识信息,例如,“N”、“O”等。优选地,该查询结果中还包括以下至少一项:a)在用户需求突增关键词库查询所得的该用户需求突增关键词库中的突增关键词;b)与查询所得的突增关键词相关的信息,例如,输入序列或关键词的用户需求变化强度等。1) The sorting device performs a query in the pre-established user demand sudden increase keyword database according to the input sequence, and uses the query result obtained from the query as the time-sensitive information. Wherein, the query result includes any of the following items: a) It is used to indicate that the input sequence or the keywords contained in the input sequence can be obtained by querying the user demand sudden keyword database, for example, "Y", " 1" etc.; b) is used to indicate that the input sequence or the identification information of the keywords contained in the input sequence cannot be obtained from the user demand sudden keyword database query, for example, "N", "O" and so on. Preferably, the query result also includes at least one of the following: a) the sudden increase keyword in the user demand sudden increase keyword library obtained by querying the user demand sudden increase keyword database; Keyword-related information, for example, input sequence or user demand change intensity of keywords, etc.
2)排序装置根据所述输入序列来获取与该输入序列相关的时效特征信息,以作为所述时效信息。其中,所述时效特征信息包括能够用于与该输入序列相关且能够用于判断该输入序列是否为具有时效性的输入序列的特征信息。优选地,所述时效特征信息包括但不限于以下至少下一项:2) The sorting device acquires time-sensitive characteristic information related to the input sequence according to the input sequence as the time-sensitive information. Wherein, the time-sensitive feature information includes feature information that can be used to correlate with the input sequence and can be used to judge whether the input sequence is a time-sensitive input sequence. Preferably, the time-sensitive feature information includes but not limited to at least the following item:
a)基于输入序列进行检索来获得的资源项数量随时间的变化信息;例如,该资源项数量随时间的变化信息包括如下信息:基于所述输入序列进行检索来获得的资源项中包含50个在前96至前120小时内建立的资源项、48个在前72至前96小时内建立的资源项、127个在前48小时至前72小时内建立的资源项、309个在前24至前48小时内建立的资源项以及1045个在24小时内建立的资源项;又例如,进行检索所得的资源项数量随时间的变化信息包括各个预定时间段内建立的资源信息数量的相对其相邻时间段内建立的资源信息数量的变化值等。优选地,排序装置通过对基于输入序列进行检索所得的资源项的建立或更新时间进行统计分析,来获得该资源项数量随时间的变化信息。a) information on the change in the number of resource items over time obtained by searching based on the input sequence; for example, the information on the change in the number of resource items over time includes the following information: the resource items obtained by searching based on the input sequence contain 50 Resource items created within the previous 96 to 120 hours, 48 resource items created within the previous 72 to 96 hours, 127 resource items created within the previous 48 hours to the previous 72 hours, 309 resource items created within the previous 24 to Resource items created within the previous 48 hours and 1045 resource items created within 24 hours; as another example, the information on the change in the number of resource items over time includes the relative quantity of resource information created within each predetermined time period. The change value of the amount of resource information established in the adjacent time period, etc. Preferably, the sorting device obtains information on changes in the number of resource items over time by statistically analyzing the creation or update time of the resource items retrieved based on the input sequence.
b)输入序列和/或输入序列包含的关键词的被检索次数随时间的变化信息;例如,在各个预定时间段内或预定时间点上,输入序列或输入序列包含的关键词的被检索次数;又例如,各个预定时间段或预定时间点相对其相邻时间段或时间点的输入序列或关键词的被检索次数的差值等。优选地,排序装置基于输入序列或输入序列包含的关键词来在本地存储设备中进行查询,以获得预存储的与输入序列或输入序列包含的关键词对应的被检索次数随时间的变化信息;或者,排序装置向其他装置或设备发出请求,以获得与输入序列或输入序列包含的关键词对应的被检索次数随时间的变化信息;或者,排序装置通过在预定时间段内对输入序列或输入序列包含的关键词的被检索次数进行监控及统计,来获得该被检索次数随时间的变化信息。b) information on the number of times the input sequence and/or keywords included in the input sequence are retrieved over time; for example, the input sequence or the number of times the keywords included in the input sequence are retrieved within each predetermined time period or at a predetermined time point ; Another example is the difference between each predetermined time period or predetermined time point relative to its adjacent time period or time point's input sequence or the number of times keywords are retrieved. Preferably, the sorting device performs a query in the local storage device based on the input sequence or keywords contained in the input sequence, so as to obtain pre-stored information on changes in the number of searches over time corresponding to the input sequence or keywords contained in the input sequence; Or, the sorting device sends a request to other devices or equipment to obtain information on the number of searches over time corresponding to the input sequence or keywords contained in the input sequence; The search times of the keywords contained in the sequence are monitored and counted to obtain the change information of the search times over time.
c)基于对所述输入序列进行语义分析来获得的该输入序列的突发语义特征信息。其中,突发语义特征信息包括突发特征关键词和/或突发特征语义结构。例如,突发语义特征信息包括诸如“最新、今天、本周、最近、某年某月某日”等时间关键词,或者包括诸如“地震、车祸、逝世、着火、砍人事件、自杀门”等突发事件关键词;又例如,突发语义特征结构包括诸如专有名词+普通名词等结构;再例如,突发语义特征信息包括诸如“福岛地震”等同时包含突发特征关键词以及突发特征语义结构的语义特征信息。优选地,优选地,排序装置基于输入序列来在本地存储设备中进行查询,以获得预存储的与输入序列对应的突发语义特征信息;或者,排序装置向其他装置或设备发出请求,以获得与输入序列对应的突发语义特征信息;或者,排序装置对该输入序列进行语义分析,来获得该输入序列的突发语义特征信息。c) Semantic burst feature information of the input sequence obtained based on semantic analysis of the input sequence. Wherein, the burst semantic feature information includes burst feature keywords and/or burst feature semantic structure. For example, the burst semantic feature information includes time keywords such as "latest, today, this week, recent, certain year, month, and day", or includes time keywords such as "earthquake, car accident, death, fire, hacking incident, suicide door" and other emergency keywords; for another example, the emergency semantic feature structure includes structures such as proper nouns + common nouns; for another example, the emergency semantic feature information includes such as "Fukushima earthquake" and other emergency feature keywords and Semantic feature information of burst feature semantic structure. Preferably, preferably, the sorting device performs a query in the local storage device based on the input sequence to obtain pre-stored burst semantic feature information corresponding to the input sequence; or, the sorting device sends a request to other devices or devices to obtain The burst semantic feature information corresponding to the input sequence; or, the sorting device performs semantic analysis on the input sequence to obtain the burst semantic feature information of the input sequence.
3)排序装置先根据所述输入序列在预建立的用户需求突增关键词库中查询与该输入序列有关的突增关键词;当查询到与该输入序列有关的突增关键词时,将查询结果作为所述时效信息;当未能查询到与该输入序列有关的突增关键词时,排序装置根据所述输入序列来获取与该输入序列相关的时效特征信息,以作为所述时效信息的步骤。其中,排序装置根据所述输入序列来获取与该输入序列相关的时效特征信息的方式与前述2)中排序装置获取时效特征信息方式相同或相似,在此不再赘述。3) The sorting device first inquires the sudden increase keyword related to the input sequence in the pre-established user demand sudden increase keyword library according to the input sequence; when the sudden increase keyword related to the input sequence is found, it will The query result is used as the timeliness information; when the sudden increase keyword related to the input sequence cannot be found, the sorting device obtains the timeliness feature information related to the input sequence according to the input sequence as the timeliness information A step of. Wherein, the manner in which the sorting device obtains the time-sensitive feature information related to the input sequence according to the input sequence is the same as or similar to the manner in which the sorting device obtains the time-sensitive feature information in 2) above, and will not be repeated here.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据来自用户的输入序列来获取与该输入序列相关的时效信息的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any time-sensitive information related to the input sequence obtained according to the input sequence from the user All implementation methods should be included within the scope of the present invention.
接着,在步骤S2中,排序装置根据时效信息是否满足预定条件,判断输入序列是否具有时效性。其中,所述预定条件包括基于能够被确定为具有时效性的输入序列来获得的时效信息所需要满足的条件;例如,当时效信息包括在预建立的用户需求突增关键词库中进行查询所得的查询结果时,预定条件包括查询结果中包含“Y”或“1”等标识信息,当时效信息包括时效特征信息时,预定条件包括时效特征信息所需要满足的条件等,优选地,基于输入序列进行检索来获得的资源项数量随时间的增长越快、输入序列或输入序列包含的关键词的被检索次数随时间的增长越快、突发语义特征信息中包含的突发语义特征越强,则排序装置将输入序列判断为具有时效性的输入序列的可能性越高。Next, in step S2, the sorting device judges whether the input sequence has timeliness according to whether the timeliness information satisfies a predetermined condition. Wherein, the predetermined conditions include the conditions that need to be satisfied by the timeliness information obtained based on the input sequence that can be determined to be timeliness; When the query results are obtained, the predetermined conditions include identification information such as "Y" or "1" in the query results, and when the timeliness information includes timeliness feature information, the predetermined conditions include conditions that the timeliness feature information needs to meet, etc. Preferably, based on the input The faster the number of resource items retrieved by the sequence increases with time, the faster the number of retrievals of the input sequence or keywords contained in the input sequence increases with time, and the stronger the burst semantic feature contained in the burst semantic feature information , the sorting device is more likely to judge the input sequence as a time-sensitive input sequence.
例如,预定条件包括查询结果中包含标识信息“Y”或“1”,在步骤S1中,排序装置基于输入序列获得的时效信息包括包含标识信息“Y”的查询结果;则排序装置根据该时效信息满足预定条件,判断输入序列具有时效性。For example, the predetermined condition includes that the query result contains the identification information "Y" or "1". In step S1, the timeliness information obtained by the sorting device based on the input sequence includes the query result containing the identification information "Y"; then the sorting device according to the timeliness The information satisfies the predetermined condition, and it is judged that the input sequence has timeliness.
又例如,预定条件包括任一时间段中建立的资源项数量相对其前一时间段中建立的资源项数量的增长率超过50%;在步骤S1中,排序装置获得的基于输入序列进行检索来获得的资源项数量随时间的变化信息包括如下信息:基于所述输入序列进行检索来获得的资源项中包含50个在前96至前120小时内建立的资源项、48个在前72至前96小时内建立的资源项、127个在前48小时至前72小时内建立的资源项、309个在前24至前48小时内建立的资源项以及1045个在24小时内建立的资源项;则排序装置分析资源项数量随时间的变化信息,确定各个时间段中建立的资源项数量相对其前一时间段中建立的资源项数量的增长率,并根据在前48小时至前72小时内建立的资源项数量相对前72至前96小时内建立的资源项数量的增长率62.2%大于第一预定增长阈值50%,判断输入序列具有时效性。For another example, the predetermined condition includes that the growth rate of the number of resource items established in any time period relative to the number of resource items established in the previous time period exceeds 50%; in step S1, the sorting device retrieves based on the input sequence to obtain The obtained change information of the number of resource items over time includes the following information: among the resource items retrieved based on the input sequence, there are 50 resource items established within the previous 96 to 120 hours, 48 resource items established within the previous 72 to Resource items created within 96 hours, 127 resource items created within the previous 48 hours to the previous 72 hours, 309 resource items created within the previous 24 to 48 hours, and 1045 resource items created within 24 hours; Then the sorting device analyzes the change information of the resource item quantity over time, determines the growth rate of the resource item quantity established in each time period relative to the resource item quantity established in the previous time period, and according to The 62.2% growth rate of the number of resource items created relative to the number of resource items created in the previous 72 to 96 hours is greater than the first predetermined growth threshold of 50%, and it is judged that the input sequence has timeliness.
再例如,预定条件包括以下两项条件:i)包括输入序列中包含突发语义特征关键词;ii)输入序列和其包含的关键词的被检索次数在任意两个时间段中随时间的增长率均超过100%;在步骤S1中,排序装置获得输入序列“地震门”中包含突发特征关键词“地震”,并获得输入序列“地震门”以及输入序列包含的关键词“地震”的被检索次数随时间的变化信息如下表1所示:For another example, the predetermined condition includes the following two conditions: i) including the sudden semantic feature keywords contained in the input sequence; ii) the number of searches of the input sequence and the keywords contained in it increases with time in any two time periods The rate exceeds 100%; in step S1, the sorting device obtains the input sequence "earthquake gate" containing the sudden characteristic keyword "earthquake", and obtains the input sequence "earthquake gate" and the keyword "earthquake" contained in the input sequence. The change information of the number of searches over time is shown in Table 1 below:
表1Table 1
则排序装置对输入序列“地震门”以及输入序列包含的关键词“地震”的被检索次数随时间的变化信息进行分析,确定输入序列和其包含的关键词的被检索次数在任意两个时间段中随时间的增长率均小于100%;则排序装置判断输入序列“地震门”不具有时效性。Then the sorting device analyzes the input sequence "earthquake gate" and the time-dependent change information of the retrieval times of the keyword "earthquake" contained in the input sequence, and determines that the retrieval times of the input sequence and the keywords contained in it are between any two times. The growth rates over time in the segment are all less than 100%; then the sorting device judges that the input sequence "earthquake gate" has no timeliness.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据时效信息是否满足预定条件来判断输入序列是否具有时效性的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any method that judges whether the input sequence has timeliness based on whether the timeliness information satisfies the predetermined condition Implementation methods should all be included within the scope of the present invention.
在步骤S3中,当判断所述输入序列具有时效性时,排序装置对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。In step S3, when it is judged that the input sequence is time-sensitive, the sorting device performs time-sensitive sorting on the resource items retrieved based on the input sequence, so as to provide the sorted resource items to the user.
其中,排序装置对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给用户的方式包括但不限于:Wherein, the sorting device sorts the resource items retrieved based on the input sequence in a time-sensitive manner, so as to provide the sorted resource items to the user, including but not limited to:
1)排序装置由基于所述输入序列进行检索来获得的多个资源项中选择一部分资源项,并将所选择的资源项置于预定的排序位置,再对未选择的资源项进行排序;接着,排序装置将排序后的资源项提供给用户。其中,排序装置能够根据诸如资源项的建立或更新时间、资源项的点击率、预确定的资源项的质量、资源项与输入序列的匹配度等多种因素来选择前述一部分资源项。1) The sorting device selects a part of resource items from a plurality of resource items retrieved based on the input sequence, and places the selected resource items in a predetermined sorting position, and then sorts unselected resource items; then , the sorting device provides the sorted resource items to the user. Wherein, the sorting device can select the above-mentioned part of resource items according to various factors such as establishment or update time of resource items, click rate of resource items, predetermined quality of resource items, matching degree of resource items and input sequence, etc.
例如,预定的排序位置为前三位排序位置,排序装置基于所述输入序列进行检索来获得多个资源项,并由该多个资源项中选择建立时间短于24小时且与输入序列的匹配度大于预定匹配度阈值的资源项,接着,排序装置进一步由所选择的资源项中选择点击率前三位的资源项,并将该点击率前三位的资源项按照点击率由高至低的顺序置于前三位排序位置;接着,排序装置采用现有技术来对其余资源项进行排序,并将所确定的前三位资源项以及排序后的其他资源项提供给用户。For example, the predetermined sorting position is the top three sorting positions, and the sorting device searches based on the input sequence to obtain multiple resource items, and selects from the multiple resource items the one whose establishment time is shorter than 24 hours and matches the input sequence The resource items whose degree of matching is greater than the predetermined matching degree threshold, then, the sorting device further selects the resource items with the top three click-through rates from the selected resource items, and sorts the resource items with the top three click-through rates according to the click-through rate from high to low The sequence is placed in the top three sorting positions; then, the sorting device uses the existing technology to sort the remaining resource items, and provides the determined top three resource items and other resource items after sorting to the user.
2)排序装置由基于所述输入序列进行检索来获得的多个资源项中选择一部分资源项,并对所选择的资源项进行加权处理,以根据加权处理后的处理结果来进行排序,并将排序后的资源项提供给用户。该通过加权处理来进行排序的方式将在参照图2所示实施例中予以详述,在此不予赘述。2) The sorting device selects a part of resource items from a plurality of resource items retrieved based on the input sequence, and performs weighting processing on the selected resource items to sort according to the weighted processing results, and The sorted resource items are provided to the user. The manner of sorting through weighting processing will be described in detail with reference to the embodiment shown in FIG. 2 , and will not be repeated here.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何当判断所述输入序列具有时效性时,对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that when it is judged that the input sequence is time-sensitive, any The resource items retrieved in sequence are time-sensitively sorted, so as to provide the sorted resource items to the user, all of which should be included in the scope of the present invention.
需要进一步说明的是,当判断所述输入序列具有时效性时,排序装置或其他装置对基于所述输入序列进行检索来获得的资源项进行现有技术的排序处理,以将排序后的资源项提供给所述用户;或者,当判断所述输入序列具有时效性时,排序装置不执行任何操作。It should be further explained that, when it is judged that the input sequence has timeliness, the sorting device or other devices perform prior art sorting processing on the resource items retrieved based on the input sequence, so that the sorted resource items provided to the user; or, when it is judged that the input sequence is time-sensitive, the sorting device does not perform any operation.
作为优选方式之一,排序装置根据所需要展现的资源项数量以及所需要展现的资源项排序位置,确定该需要展现的资源项排序位置所对应的资源项。例如,用户设备向搜索引擎发送用于进行查询的输入序列,则搜索引擎中的排序装置根据该用户设备一次性所能够呈现的资源项数量为10,由基于该输入序列来获得的资源项中选择进行选择,并确定前10位排序位置对应的资源项来反馈给用户设备,以使用户设备将排序前10位的资源项提供给用户;接着,当用户设备向搜索引擎发送翻页请求,搜索引擎再次由基于该输入序列来获得的资源项中进行选择,并确定排序11至20位对应的资源项来反馈给用户设备,以使用户设备将该排序11至20位的资源项提供给用户。As one of the preferred manners, the sorting device determines the resource item corresponding to the sorted position of the resource item to be displayed according to the number of resource items to be displayed and the sorted position of the resource item to be displayed. For example, if the user equipment sends an input sequence for querying to the search engine, the sorting device in the search engine can present 10 resource items based on the number of resource items that the user equipment can present at one time, and the resource items obtained based on the input sequence Select to make a selection, and determine the resource items corresponding to the top 10 ranking positions to feed back to the user equipment, so that the user equipment will provide the top 10 resource items to the user; then, when the user equipment sends a page turning request to the search engine, The search engine again selects from the resource items obtained based on the input sequence, and determines the resource items corresponding to ranks 11 to 20 to feed back to the user equipment, so that the user equipment provides the resource items ranked 11 to 20 to user.
作为优选方式之一,排序装置对基于输入序列来获得的资源项进行排序后,根据需要展现给用户的资源项排序位置,来选择部分排序位置的资源项提供给用户。例如,用户设备向搜索引擎发送用于进行查询的输入序列,则搜索引擎中的排序装置对基于输入序列来获得的资源项进行排序后,基于默认的一次提供10个排序后的资源项的规则,将排序前10位的资源项提供给用户设备,以使用户设备将排序前10位的资源项提供给用户;接着,当用户设备项搜索引擎发送提供第3页资源项的请求,则搜索引擎根据第3页对应的资源项为21至30位资源项,来将排序第21至30位的资源项提供给用户设备,以使用户设备将排序第21至30位的资源项的资源项提供给用户。As one of the preferred manners, after sorting the resource items obtained based on the input sequence, the sorting device selects some resource items in the sorted positions and provides them to the user according to the sorted position of the resource items presented to the user as required. For example, when the user equipment sends an input sequence for query to the search engine, the sorting device in the search engine sorts the resource items obtained based on the input sequence, and then provides 10 sorted resource items at a time based on the default rule , provide the top 10 resource items to the user device, so that the user device will provide the top 10 resource items to the user; then, when the user device item search engine sends a request to provide resource items on the third page, the search The engine provides the resource items ranked 21st to 30th to the user equipment according to the resource items corresponding to the 3rd page being 21st to 30th resource items, so that the user equipment will sort the resource items of the 21st to 30th resource items provided to the user.
作为本发明的优选方案之一,在本发明中对包含于预建立的用户需求突增关键词库中的至少一个突增关键词分别执行以下步骤S41(图未示)、步骤S42(图未示)以及步骤S43(图未示)。As one of the preferred solutions of the present invention, in the present invention, the following steps S41 (not shown in the figure), step S42 (not shown in the figure) are respectively performed for at least one sudden increase keyword contained in the pre-established user demand sudden increase keyword library shown) and step S43 (not shown).
在步骤S41中,排序装置获取一个或多个用户对基于当前处理的突增关键词或包含该当前处理的突增关键词的输入序列来获得的排序后的资源项的点击信息。优选地,该点击信息中包括基于当前处理的突增关键词或包含该当前处理的突增关键词的输入序列来获得的排序后的各个资源项的点击量和/或点击率。In step S41 , the sorting device acquires click information of one or more users on the sorted resource items obtained based on the currently processed sudden increase keyword or an input sequence containing the currently processed sudden increase keyword. Preferably, the click information includes the click amount and/or click rate of each sorted resource item obtained based on the currently processed sudden increase keyword or an input sequence containing the currently processed sudden increase keyword.
具体地,排序装置获取其他装置提供的点击信息;或者,排序装置通过在预定时间段内对用户基于前述排序后的资源项的点击行为进行监控并统计,来获得点击信息。Specifically, the sorting device obtains click information provided by other devices; or, the sorting device obtains the click information by monitoring and counting the user's click behavior based on the sorted resource items within a predetermined period of time.
接着,在步骤S42中,排序装置根据前述步骤S41中所得的点击信息,判断该突增关键词是否时效性已过。Next, in step S42, the sorting device judges whether the timeliness of the sudden increase keyword has expired according to the click information obtained in step S41.
其中,排序装置判断突增关键词是否时效性已过的方式包括但不限于:Among them, the way for the sorting device to judge whether the timeliness of the sudden increase keyword has expired includes but is not limited to:
1)排序装置根据前述步骤S3中被指定至预定排序位置或者被选择进行加权处理的资源项的点击率或点击量,并结合其他资源项的点击率或点击量间,来获得判断评价值;并根据该判断评价值是否大于或小于第一预定判断阈值,来判断当前处理的突增关键词是否时效性已过。1) The sorting device obtains the judgment evaluation value according to the click-through rate or click-through rate of the resource item assigned to the predetermined sorting position or selected for weighting processing in the aforementioned step S3, combined with the click-through rate or click-through rate of other resource items; And according to whether the judging evaluation value is greater than or smaller than the first predetermined judging threshold, it is judged whether the timeliness of the currently processed sudden increase keyword has expired.
例如,排序装置在步骤S41中获得的用户对基于突增关键词“5月CPI”进行检索所得的7个资源项的点击量如下表2所示:For example, the number of clicks obtained by the sorting device on the seven resource items retrieved based on the sudden increase keyword "May CPI" obtained by the sorting device in step S41 is shown in Table 2 below:
表2Table 2
其中,资源项A和E为被指定至预定排序位置的资源项,资源项D为被选择进行加权处理的资源项,则排序装置将资源项A、D和E的点击量平均值与资源项B、C、F和G的点击量平均值的差值733作为判断评价值,并根据判断评价值大于第一预定判断阈值500,判断当前处理的突增关键词时效性未过。Wherein, resource items A and E are resource items assigned to predetermined sorting positions, and resource item D is a resource item selected for weighting processing, then the sorting device compares the average value of hits of resource items A, D, and E with the resource item The difference 733 of the mean values of clicks of B, C, F and G is used as the judgment evaluation value, and according to the judgment evaluation value being greater than the first predetermined judgment threshold 500, it is judged that the timeliness of the sudden increase keyword currently processed has not expired.
2)排序装置根据各个资源项中建立或更新时间短于第一预定时间阈值的资源项的点击信息,并结合各个资源项中建立或更新时间长于第二预定时间阈值的资源项的点击信息,来获得判断评价值;并根据该判断评价值是否大于或小于第二预定判断阈值,来判断当前处理的突增关键词是否时效性已过。2) The sorting device is based on the click information of the resource items whose establishment or update time is shorter than the first predetermined time threshold in each resource item, combined with the click information of the resource items whose establishment or update time is longer than the second predetermined time threshold among each resource item, to obtain a judgment evaluation value; and according to whether the judgment evaluation value is greater than or smaller than a second predetermined judgment threshold, it is judged whether the timeliness of the currently processed sudden increase keyword has expired.
例如,排序装置将各个资源项中建立或更新时间短于3天的资源项的点击信息与建立或更新时间长于90天的资源项的点击信息的比值作为判断评价值,并根据该判断评价值是否大于或小于第二预定判断阈值,来判断当前处理的突增关键词是否时效性已过。For example, the sorting device takes the ratio of the click information of the resource items whose establishment or update time is shorter than 3 days and the click information of the resource items whose establishment or update time is longer than 90 days in each resource item as the judgment evaluation value, and according to the judgment evaluation value Whether it is greater than or less than a second predetermined judgment threshold is used to judge whether the timeliness of the currently processed sudden keyword has expired.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据点击信息,判断突增关键词是否时效性已过的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any implementation of judging whether the timeliness of a sudden increase keyword has expired based on the click information methods, should be included within the scope of the present invention.
在步骤S43中,当判断该突增关键词时效性已过,排序装置在所述用户需求突增关键词库中删除该突增关键词。In step S43, when it is judged that the timeliness of the sudden increase keyword has expired, the sorting device deletes the sudden increase keyword from the user demand sudden increase keyword database.
优选地,当判断该突增关键词时效性未过,排序装置不对该突增关键词执行操作。Preferably, when it is judged that the timeliness of the sudden increase keyword has not expired, the sorting device does not perform any operation on the sudden increase keyword.
作为本实施例的优选方案之一,所述步骤S2进一步包括步骤S21(图未示)以及步骤S22(图未示)。As one of the preferred solutions of this embodiment, the step S2 further includes step S21 (not shown in the figure) and step S22 (not shown in the figure).
在步骤S21中,排序装置根据所述时效特征信息是否满足预定条件,判断所述输入序列是否具有时效性。其中,排序装置根据时效特征信息是否满足预定条件来判断输入序列是否具有时效性的步骤已在前述步骤S2中予以详述,在此不再赘述。In step S21, the sorting device judges whether the input sequence is time-sensitive according to whether the time-sensitive feature information satisfies a predetermined condition. Wherein, the step of the sorting device judging whether the input sequence has timeliness according to whether the timeliness feature information satisfies the predetermined condition has been described in detail in the aforementioned step S2, and will not be repeated here.
在步骤S22中,当判断所述输入序列具有时效性时,排序装置将所述输入序列和/或所述输入序列中的关键词加入所述用户需求突增关键词库中。优选地,排序装置还将与输入序列对应的时效特征信息加入所述用户需求突增关键词库中;更优选地,排序装置还将基于时效特征信息来获得的信息,例如,资源项数量或输入序列的被检索次数随时间的增长率等,加入所述用户需求突增关键词库中。In step S22, when it is judged that the input sequence is time-sensitive, the sorting device adds the input sequence and/or keywords in the input sequence into the user demand sudden increase keyword library. Preferably, the sorting device also adds time-sensitive feature information corresponding to the input sequence into the user demand sudden increase keyword library; more preferably, the sorting device also adds information obtained based on the time-sensitive feature information, for example, the number of resource items or The growth rate of the search times of the input sequence over time, etc., are added to the keyword database for sudden increase in user demand.
根据本发明的方法能够识别具有时效性的输入序列,并能够针对基于该具有时效性的输入序列进行检索所得的资源项进行时效性排序,以将可能与该输入序列的时效性相关的资源项排序提前,更好地满足了用户的突发检索需求。The method according to the present invention can identify the input sequence with timeliness, and can perform timeliness sorting on the resource items retrieved based on the input sequence with timeliness, so as to sort the resource items that may be related to the timeliness of the input sequence The sorting is advanced to better meet the user's sudden retrieval needs.
图2为本发明一个优选实施例的用于对资源项进行排序的方法流程图。根据本实施例的方法包括步骤S1、步骤S2以及步骤S3。其中,步骤S3进一步包括时效性排序步骤S31。Fig. 2 is a flowchart of a method for sorting resource items according to a preferred embodiment of the present invention. The method according to this embodiment includes step S1, step S2 and step S3. Wherein, step S3 further includes a timeliness sorting step S31.
步骤S1以及步骤S2已在参照图1所示实施例中予以详述,并以引用的方式包含于此,不再赘述。Step S1 and step S2 have been described in detail with reference to the embodiment shown in FIG. 1 , and are included here by reference, and will not be repeated here.
接着,在步骤S3中,当判断所述输入序列具有时效性时,排序装置对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。本实施例中,步骤S3中的时效性排序步骤S31包括根据所述各个资源项的新鲜度相关信息,由其中选择至少一个资源项,并对所选择的至少一个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序的步骤。Next, in step S3, when it is judged that the input sequence is time-sensitive, the sorting device performs time-sensitive sorting on the resource items retrieved based on the input sequence, so as to provide the sorted resource items to the user . In this embodiment, the timeliness sorting step S31 in step S3 includes selecting at least one resource item from among them according to the freshness-related information of each resource item, and performing weighting processing on the selected at least one resource item, so that according to The step of sorting the processing results after the weighting processing.
其中,所述新鲜度相关信息包括能够用于确定各个资源项的建立或更新时间是否属于输入序列的时效期间的相关信息。优选地,新鲜度相关信息包括以下至少一项:Wherein, the freshness related information includes related information that can be used to determine whether the establishment or update time of each resource item belongs to the aging period of the input sequence. Preferably, the freshness-related information includes at least one of the following:
1)资源项的建立或更新时间以及预确定的输入序列的用户需求突增时间。其中,排序装置能够获取预存储的各个资源项的建立或更新时间,例如,排序装置或其他装置在抓取各个资源项的同时,存储各个资源项的建立或更新时间,以供后续使用;其中,该建立或更新时间可为各个资源项在其网络中的建立或更新时间,也可为排序装置或其他装置抓取并存储或更新各个资源项的时间。其中,在步骤S31前,所述预确定的输入序列的用户需求突增时间能够由排序装置或其他装置根据以下至少一项因素来确定:a)基于输入序列进行检索来获得的资源项数量随时间的变化信息,该资源项数量随时间的变化信息已在参照图1所示实施例中对时效特征信息进行说明的部分予以详述,在此不再赘述;排序装置或其他装置通过根据前述资源项数量随时间的变化信息来确定资源项数量随时间的增长强度,以确定用户需求突增时间,例如,排序装置或其他装置将资源项数量随时间的增长强度第一次超过第二预定增长阈值的时间段的起始时间点作为用户需求突增时间;b)输入序列和/或输入序列包含的关键词的被检索次数随时间的变化信息,该被检索次数随时间的变化信息已在参照图1所示实施例中对时效特征信息进行说明的部分予以详述,在此不再赘述;排序装置或其他装置通过根据前述被检索次数随时间的变化信息来确定被检索次数随时间的增长强度,以确定用户需求突增时间。1) The creation or update time of the resource item and the user demand surge time of the predetermined input sequence. Wherein, the sorting device can obtain the establishment or update time of each resource item stored in advance, for example, the sorting device or other devices store the establishment or update time of each resource item while grabbing each resource item, for subsequent use; , the establishment or update time may be the establishment or update time of each resource item in its network, or the time when the sorting device or other devices capture and store or update each resource item. Wherein, before step S31, the user demand surge time of the predetermined input sequence can be determined by a sorting device or other devices according to at least one of the following factors: a) The number of resource items obtained by searching based on the input sequence varies with Time change information, the change information of the number of resource items over time has been described in detail in the part of the description of the time-sensitive feature information in the embodiment shown in Figure 1, and will not be repeated here; The change information of the number of resource items over time is used to determine the growth intensity of the number of resource items over time, so as to determine the time of sudden increase in user demand. The starting time point of the time period of the growth threshold is used as the user demand sudden increase time; b) the input sequence and/or the information on the change of the number of searches over time of the keywords contained in the input sequence, and the information on the change of the number of searches over time has been In the embodiment shown in FIG. 1 , the part of the timeliness feature information is described in detail, and will not be repeated here; the sorting device or other devices determine the number of times retrieved over time according to the change information of the number of times retrieved over time. The growth intensity of the user to determine the time of sudden increase in user demand.
具体地,排序装置由基于输入序列进行检索来获得的各个资源项中选择建立或更新时间晚于用户需求突增时间的资源项,并对所选择的资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。Specifically, the sorting device selects the resource items whose establishment or update time is later than the user demand surge time from among the resource items retrieved based on the input sequence, and performs weighting processing on the selected resource items, so that according to the weighting processing The final processing results are sorted.
例如,预确定的用户需求突增时间为5月12日,则排序装置由由基于输入序列进行检索来获得的各个资源项中选择建立时间晚于5月12日的资源项,并根据预定的加权值来对所选择的各个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。For example, if the predetermined user demand surge time is May 12, the sorting device selects the resource items whose creation time is later than May 12 from among the resource items obtained by searching based on the input The weighted value is used to carry out weighting processing on each selected resource item, so as to sort according to the processing result after the weighting processing.
2)所述输入序列的历史检索记录中包含的资源项。2) The resource items included in the historical retrieval records of the input sequence.
具体地,排序装置由基于输入序列进行检索来获得的资源项中选择未包含于该输入序列的历史检索记录中的资源项,并对所选择的资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。由于所选择的资源项未包含于数据序列的历史检索记录中,因此,可认为所选择的资源项的建立时间晚于历史检索记录的最后更新时间,故基于历史检索记录来选择的资源项的建立或更新时间应处于输入序列的时效期间。Specifically, the sorting device selects resource items that are not included in the historical retrieval records of the input sequence from the resource items obtained by searching based on the input sequence, and performs weighting processing on the selected resource items, so that according to the weighted processing The processing results are sorted. Since the selected resource item is not included in the historical retrieval record of the data sequence, it can be considered that the creation time of the selected resource item is later than the last update time of the historical retrieval record, so the resource item selected based on the historical retrieval record The creation or update time should be within the aging period of the input sequence.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据所述各个资源项的新鲜度相关信息,由其中选择至少一个资源项,并对所选择的至少一个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, rather than limiting the present invention. Those skilled in the art should understand that any information related to the freshness of each resource item is selected from which at least One resource item, and performing weighting processing on at least one selected resource item, so as to sort according to the processing result after the weighting processing, should be included in the scope of the present invention.
需要进一步说明的是,排序装置可先选择需要所有加权处理的资源项并进行加权处理,然后再对所有进过加权处理以及未经过加权处理的资源项进行排序;或者,排序装置每获得一个需要加权处理的资源项,即对该资源项进行加权处理,并将该资源项插入已排序资源项中。It should be further explained that the sorting device can first select resource items that require all weighted processing and perform weighted processing, and then sort all resource items that have undergone weighted processing and that have not undergone weighted processing; The weighted resource item is to perform weighted processing on the resource item and insert the resource item into the sorted resource item.
作为本实施例的优选方案之一,本实施例的方法还包括步骤S5以及步骤S6,前述步骤S31中的加权处理步骤进一步包括根据加权值,对所选择的需要进行加权处理的资源项进行加权处理的步骤。As one of the preferred solutions of this embodiment, the method of this embodiment further includes steps S5 and S6, and the weighting processing step in step S31 further includes weighting the selected resource items that need to be weighted according to the weighting value processing steps.
在步骤S2之后执行的步骤S5中,排序装置根据步骤S1中获得的时效信息,确定输入序列的用户需求变化强度。In step S5 executed after step S2, the sorting device determines the change intensity of the user demand of the input sequence according to the aging information obtained in step S1.
具体地,排序装置确定输入序列的用户需求变化强度的方式包括但不限于:Specifically, the way for the sorting device to determine the intensity of change in user demand of the input sequence includes but is not limited to:
1)当时效信息中包括排序装置在用户需求突增关键词库中查询所得的查询结果,且该查询结果包含查询所得的突增关键词的用户需求变化强度时,排序装置根据查询结果中的突增关键词的用户需求变化强度来确定输入序列的用户需求变化强度。1) When the timeliness information includes the query result obtained by the sorting device in the user demand sudden increase keyword database, and the query result includes the user demand change intensity of the sudden increase keyword obtained from the query, the sorting device according to the query result The user demand change strength of the sudden increase keyword is used to determine the user demand change strength of the input sequence.
例如,输入序列为“福岛地震怎么样了”,在步骤S1中,排序装置获得的查询结果中包含突增关键词“福岛地震”对应的用户需求变化强度为3级,则排序装置直接将突增关键词“福岛地震”对应的用户需求变化强度作为输入序列“福岛地震怎么样了”的用户需求变化强度。For example, if the input sequence is "how about the Fukushima earthquake", in step S1, the query result obtained by the sorting device contains the sudden increase keyword "Fukushima earthquake" and the corresponding user demand change intensity is level 3, then the sorting device directly The user demand change intensity corresponding to the sudden increase keyword "Fukushima earthquake" is used as the user demand change intensity of the input sequence "how about the Fukushima earthquake".
又例如,输入序列为“福岛地震核电站”,在步骤S1中,排序装置获得的查询结果中包含突增关键词“福岛地震”对应的用户需求变化强度为4.76,突增关键词“核电站”对应的用户需求变化强度为2.35,则排序装置将突增关键词对应的用户需求变化强度之和7.11作为输入序列“福岛地震核电站”的用户需求变化强度。For another example, if the input sequence is "Fukushima Earthquake Nuclear Power Plant", in step S1, the query result obtained by the sorting device includes the sudden increase keyword "Fukushima Earthquake" corresponding to a user demand change intensity of 4.76, and the sudden increase keyword "Nuclear Power Plant The user demand change intensity corresponding to "is 2.35, then the sorting device takes the sum of user demand change intensity corresponding to the sudden increase keyword as 7.11 as the user demand change intensity of the input sequence "Fukushima Earthquake Nuclear Power Plant".
2)当时效信息中包括时效特征信息,则排序装置根据时效特征信息来确定输入序列的用户需求变化强度。2) The timeliness information includes timeliness feature information, and the sorting device determines the change intensity of the user demand of the input sequence according to the timeliness feature information.
例如,时效特征信息包括基于输入序列进行检索来获得的资源项数量随时间的变化信息,则排序装置根据预确定的诸如支持向量机、贝叶斯模型、最大熵模型等机器学习模型来确定输入序列的用户需求变化强度。其中,可通过将大量已确定用户需求变化强度以及已获得资源项数量随时间的变化信息的输入序列对机器学习模型进行训练,来获得用于确定用户需求变化强度的机器学习模型。For example, the time-sensitive feature information includes information on the change of the number of resource items over time obtained by searching based on the input sequence, and the sorting device determines the input The intensity of user demand change for the sequence. Wherein, the machine learning model for determining the change intensity of the user demand can be obtained by training the machine learning model with a large number of input sequences that have determined the change intensity of the user demand and the information on the change of the number of resource items over time.
又例如,在步骤S1中,排序装置基于输入序列“福岛地震”获得的时效特征信息包括以下两项:1)包含突发特征关键词“地震”以及突发特征语义结构“专有名词+普通名词”的突发语义特征信息;2)输入序列“福岛地震”的被检索次数随时间的变化信息,该变化信息如下表3所示:For another example, in step S1, the time-sensitive feature information obtained by the sorting device based on the input sequence "Fukushima Earthquake" includes the following two items: 1) contains the burst feature keyword "earthquake" and the burst feature semantic structure "proper noun + 2) the change information of the retrieved times of the input sequence "Fukushima Earthquake" over time, the change information is shown in Table 3 below:
表3table 3
则排序装置逐一计算相邻两个时间点的被检索次数差值,并获得最大差值=15247-2045=13202,接着,排序装置获取该最大差值相对最大被检索次数的比例=13202/17896=73.77%,接着,排序装置确定73.77%属于预定范围70-80%,并获得预定范围70-80%对应的用户需求变化强度为第二级。并且,排序装置根据当突发语义特征信息同时包含突发特征关键词以及突发特征语义结构时,该突发语义特征信息对应的等级调整方式为上调一级的预定规则,确定包含突发特征关键词“地震”以及突发特征语义结构“专有名词+普通名词”的突发语义特征信息对应的等级调整规则为上调一级。则排序装置根据基于突发语义特征信息确定的等级调整规则来调整基于被检索次数随时间的变化信息来获得的等级,最终确定输入序列“福岛地震”的用户需求变化强度为第一级。Then the sorting device calculates the difference value of the number of searches at two adjacent time points one by one, and obtains the maximum difference = 15247-2045 = 13202, and then, the sorting device obtains the ratio of the maximum difference to the maximum number of searches = 13202/17896 =73.77%, then, the sorting device determines that 73.77% belongs to the predetermined range of 70-80%, and obtains the user demand change intensity corresponding to the predetermined range of 70-80% as the second level. Moreover, the sorting device determines that the burst feature information contains burst feature keywords and the burst feature semantic structure according to the predetermined rule that the level adjustment method corresponding to the burst semantic feature information is to raise one level. The level adjustment rule corresponding to the burst semantic feature information of the keyword "earthquake" and the burst feature semantic structure "proper noun + common noun" is one level up. Then the sorting device adjusts the grade obtained based on the change information of the number of retrieved times over time according to the grade adjustment rule determined based on the burst semantic feature information, and finally determines that the user demand change intensity of the input sequence "Fukushima earthquake" is the first grade.
优选地,排序装置将输入序列及其用户需求变化强度加入用户需求突增关键词库中。Preferably, the sorting device adds the input sequence and its user demand change intensity to the user demand sudden increase keyword library.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据时效信息来确定输入序列的用户需求变化强度的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that any implementation method for determining the intensity of changes in user needs of an input sequence based on time-sensitive information, All should be included within the scope of the present invention.
接着,在步骤S6中,排序装置根据该用户需求变化强度,确定加权值。Next, in step S6, the sorting device determines a weighted value according to the change intensity of the user demand.
例如,当用户需求变化强度为一个数值时,直接将该数值作为加权值,或者,将该数值乘以预定的调节因子,并将乘积作为加权值等。For example, when the change intensity of the user's demand is a numerical value, the numerical value is directly used as the weighted value, or the numerical value is multiplied by a predetermined adjustment factor, and the product is used as the weighted value, etc.
又例如,当用户需求变化强度为一个等级时,获取预定的与该等级相应的加权值等。For another example, when the change intensity of the user demand is at a level, a predetermined weighted value corresponding to the level is obtained.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据该用户需求变化强度,确定加权值的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that any implementation of determining the weighted value according to the change intensity of the user's needs should be included within the scope of the present invention.
接着,排序装置执行包含步骤S31的步骤S3,本实施例中,步骤S31中的加权处理步骤进一步包括根据步骤S6中确定的加权值,对所选择的需要进行加权处理的资源项进行加权处理的步骤。Next, the sorting device executes step S3 including step S31. In this embodiment, the weighting processing step in step S31 further includes performing weighting processing on the selected resource items that need to be weighted according to the weighting value determined in step S6. step.
优选地,步骤S31中的加权处理步骤还包括根据步骤S6中确定的加权值,并结合所选择的资源项中每个资源项的建立或更新时间,对所选择的资源项进行加权处理的步骤。Preferably, the weighting processing step in step S31 further includes the step of performing weighting processing on the selected resource items according to the weight value determined in step S6 and in combination with the establishment or update time of each resource item in the selected resource items .
具体地,排序装置根据各个资源项的建立或更新时间,来调整加权值,并根据调整后的加权值来来对所选择的资源项进行加权处理。Specifically, the sorting device adjusts the weight value according to the establishment or update time of each resource item, and performs weight processing on the selected resource item according to the adjusted weight value.
例如,排序装置在步骤S6中获得的加权值为2.3,且排序装置获得一个需要进行加权处理的资源项的建立时间为5月24日13点,则排序装置根据该建立时间与当前时间5月24日20点的时间间距小于预定时间间隔1天,来将加权值上调0.1,以获得调整后的加权值2.4,并根据调整后的加权值2.4,来对该资源项进行加权处理。For example, the weighted value obtained by the sorting device in step S6 is 2.3, and the establishment time of a resource item that needs to be weighted is obtained by the sorting device at 13 o'clock on May 24th, then the sorting device will calculate according to the establishment time and the current time in May If the time interval at 20:00 on the 24th is less than the predetermined time interval of 1 day, the weighted value is increased by 0.1 to obtain an adjusted weighted value of 2.4, and the resource item is weighted according to the adjusted weighted value of 2.4.
又例如,排序装置在步骤S6中获得的加权值为0.1,且排序装置获得一个需要进行加权处理的资源项的建立时间为5月24日,接着,排序装置确定该建立时间与输入序列对应的用户需求突增时间5月22的时间间隔为2天,则排序装置将加权值与时间间隔的积0.2作为调整后的加权值,来对该资源项进行加权处理。For another example, the weighted value obtained by the sorting device in step S6 is 0.1, and the establishment time of a resource item that needs to be weighted is obtained by the sorting device as May 24. Then, the sorting device determines that the establishment time corresponds to the input sequence The time interval between the user demand surge time May 22 is 2 days, and the sorting device uses the product of the weighted value and the time interval of 0.2 as the adjusted weighted value to perform weighting processing on the resource item.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据加权值,并结合所选择的资源项中每个资源项的建立或更新时间,对所选择的资源项进行加权处理的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any combination of resource items selected according to weighted values The establishment or update time of the resource item, and the implementation manner of weighting the selected resource items should all be included in the scope of the present invention.
根据本实施例的方法,进一步根据资源项的新鲜度来执行时效性排序,从而获得更好的时效性排序结果。According to the method of this embodiment, timeliness sorting is further performed according to the freshness of resource items, so as to obtain a better timeliness sorting result.
图3为本发明一个方面的用于对资源项进行排序的排序装置结构示意图。根据本发明的排序装置包括时效信息获取装置1、第一判断装置2以及时效排序装置3。Fig. 3 is a schematic structural diagram of a sorting device for sorting resource items according to one aspect of the present invention. The ranking device according to the present invention includes a timeliness information acquisition device 1 , a first judging device 2 and a timeliness sorting device 3 .
时效信息获取装置1根据来自用户的输入序列来获取与该输入序列相关的时效信息。The aging information acquiring device 1 acquires aging information related to the input sequence according to the input sequence from the user.
具体地,时效信息获取装置1获取其他装置或设备提供的输入序列,或者,实时获取来自用户的输入序列,并根据该输入序列来获取与该输入序列相关的时效信息。其中,时效信息获取装置1获取与该输入序列相关的时效信息的方式包括但不限于:Specifically, the timeliness information acquiring device 1 acquires an input sequence provided by other devices or equipment, or acquires an input sequence from a user in real time, and acquires timeliness information related to the input sequence according to the input sequence. Wherein, the methods for obtaining the timeliness information related to the input sequence by the timeliness information acquisition device 1 include but are not limited to:
1)时效信息获取装置1根据该输入序列来在预建立的用户需求突增关键词库中进行查询,并将查询所得的查询结果作为所述时效信息。其中,该查询结果中包括以下任一项:a)用于表示能够在用户需求突增关键词库查询获得该输入序列或该输入序列包含的关键词的标识信息,例如,“Y”、“1”等;b)用于表示未能够在用户需求突增关键词库查询获得该输入序列或该输入序列包含的关键词的标识信息,例如,“N”、“0”等。优选地,该查询结果中还包括以下至少一项:a)在用户需求突增关键词库查询所得的该用户需求突增关键词库中的突增关键词;b)与查询所得的突增关键词相关的信息,例如,输入序列或关键词的用户需求变化强度等。1) The timeliness information acquisition device 1 searches the pre-established user demand sudden increase keyword database according to the input sequence, and takes the query result obtained from the query as the timeliness information. Wherein, the query result includes any of the following items: a) It is used to indicate that the input sequence or the keywords contained in the input sequence can be obtained by querying the user demand sudden keyword database, for example, "Y", " 1" etc.; b) It is used to indicate that the input sequence or the identification information of the keywords contained in the input sequence cannot be obtained from the user demand sudden keyword database query, for example, "N", "0" and so on. Preferably, the query result also includes at least one of the following: a) the sudden increase keyword in the user demand sudden increase keyword library obtained by querying the user demand sudden increase keyword database; Keyword-related information, for example, input sequence or user demand change intensity of keywords, etc.
2)时效信息获取装置1中包含第一子获取装置(图未示),该第一子获取装置根据所述输入序列来获取与该输入序列相关的时效特征信息,以作为所述时效信息。其中,所述时效特征信息包括能够用于与该输入序列相关且能够用于判断该输入序列是否为具有时效性的输入序列的特征信息。优选地,所述时效特征信息包括但不限于以下至少下一项:2) The timeliness information acquisition device 1 includes a first sub-acquisition device (not shown in the figure), and the first sub-acquisition device acquires timeliness feature information related to the input sequence according to the input sequence as the timeliness information. Wherein, the time-sensitive feature information includes feature information that can be used to correlate with the input sequence and can be used to judge whether the input sequence is a time-sensitive input sequence. Preferably, the time-sensitive feature information includes but not limited to at least the following item:
a)基于输入序列进行检索来获得的资源项数量随时间的变化信息;例如,该资源项数量随时间的变化信息包括如下信息:基于所述输入序列进行检索来获得的资源项中包含50个在前96至前120小时内建立的资源项、48个在前72至前96小时内建立的资源项、127个在前48小时至前72小时内建立的资源项、309个在前24至前48小时内建立的资源项以及1045个在24小时内建立的资源项;又例如,进行检索所得的资源项数量随时间的变化信息包括各个预定时间段内建立的资源信息数量的相对其相邻时间段内建立的资源信息数量的变化值等。优选地,第一子获取装置通过对基于输入序列进行检索所得的资源项的建立或更新时间进行统计分析,来获得该资源项数量随时间的变化信息。a) information on the change in the number of resource items over time obtained by searching based on the input sequence; for example, the information on the change in the number of resource items over time includes the following information: the resource items obtained by searching based on the input sequence contain 50 Resource items created within the previous 96 to 120 hours, 48 resource items created within the previous 72 to 96 hours, 127 resource items created within the previous 48 hours to the previous 72 hours, 309 resource items created within the previous 24 to Resource items created within the previous 48 hours and 1045 resource items created within 24 hours; as another example, the information on the change in the number of resource items over time includes the relative quantity of resource information created within each predetermined time period. The change value of the amount of resource information established in the adjacent time period, etc. Preferably, the first sub-acquisition means obtains information on changes in the number of resource items over time by statistically analyzing the creation or update time of the resource items retrieved based on the input sequence.
b)输入序列和/或输入序列包含的关键词的被检索次数随时间的变化信息;例如,在各个预定时间段内或预定时间点上,输入序列或输入序列包含的关键词的被检索次数;又例如,各个预定时间段或预定时间点相对其相邻时间段或时间点的输入序列或关键词的被检索次数的差值等。优选地,第一子获取装置基于输入序列或输入序列包含的关键词来在本地存储设备中进行查询,以获得预存储的与输入序列或输入序列包含的关键词对应的被检索次数随时间的变化信息;或者,第一子获取装置向其他装置或设备发出请求,以获得与输入序列或输入序列包含的关键词对应的被检索次数随时间的变化信息;或者,第一子获取装置通过在预定时间段内对输入序列或输入序列包含的关键词的被检索次数进行监控及统计,来获得该被检索次数随时间的变化信息。b) information on the number of times the input sequence and/or keywords included in the input sequence are retrieved over time; for example, the input sequence or the number of times the keywords included in the input sequence are retrieved within each predetermined time period or at a predetermined time point ; Another example is the difference between each predetermined time period or predetermined time point relative to its adjacent time period or time point's input sequence or the number of times keywords are retrieved. Preferably, the first sub-acquisition means searches the local storage device based on the input sequence or the keywords contained in the input sequence, so as to obtain the pre-stored data of the times of retrieval corresponding to the input sequence or the keywords contained in the input sequence over time. Change information; or, the first sub-acquisition means sends a request to other devices or equipment to obtain information on the number of searches over time corresponding to the input sequence or the keywords contained in the input sequence; or, the first sub-acquisition means through the Monitoring and counting the number of retrievals of the input sequence or keywords included in the input sequence within a predetermined period of time to obtain information on changes in the retrieval frequency over time.
c)基于对所述输入序列进行语义分析来获得的该输入序列的突发语义特征信息。其中,突发语义特征信息包括突发特征关键词和/或突发特征语义结构。例如,突发语义特征信息包括诸如“最新、今天、本周、最近、某年某月某日”等时间关键词,或者包括诸如“地震、车祸、逝世、着火、砍人事件、自杀门”等突发事件关键词;又例如,突发语义特征结构包括诸如专有名词+普通名词等结构;再例如,突发语义特征信息包括诸如“福岛地震”等同时包含突发特征关键词以及突发特征语义结构的语义特征信息。优选地,第一子获取装置基于输入序列来在本地存储设备中进行查询,以获得预存储的与输入序列对应的突发语义特征信息;或者,第一子获取装置向其他装置或设备发出请求,以获得与输入序列对应的突发语义特征信息;或者,第一子获取装置对该输入序列进行语义分析,来获得该输入序列的突发语义特征信息。c) Semantic burst feature information of the input sequence obtained based on semantic analysis of the input sequence. Wherein, the burst semantic feature information includes burst feature keywords and/or burst feature semantic structure. For example, the burst semantic feature information includes time keywords such as "latest, today, this week, recent, certain year, month, and day", or includes time keywords such as "earthquake, car accident, death, fire, hacking incident, suicide door" and other emergency keywords; for another example, the emergency semantic feature structure includes structures such as proper nouns + common nouns; for another example, the emergency semantic feature information includes such as "Fukushima earthquake" and other emergency feature keywords and Semantic feature information of burst feature semantic structure. Preferably, the first sub-acquisition means searches the local storage device based on the input sequence to obtain pre-stored burst semantic feature information corresponding to the input sequence; or, the first sub-acquisition means sends a request to other means or equipment to obtain burst semantic feature information corresponding to the input sequence; or, the first sub-acquisition means performs semantic analysis on the input sequence to obtain burst semantic feature information of the input sequence.
3)时效信息获取装置1中包括查询装置(图未示)、定义装置(图未示)以及第二子获取装置(图未示)。查询装置根据所述输入序列在预建立的用户需求突增关键词库中查询与该输入序列有关的突增关键词;当查询到与该输入序列有关的突增关键词时,定义装置将查询结果作为所述时效信息;当未能查询到与该输入序列有关的突增关键词时,第一子获取装置根据所述输入序列来获取与该输入序列相关的时效特征信息,以作为所述时效信息的步骤。其中,第一子获取装置根据所述输入序列来获取与该输入序列相关的时效特征信息的方式与前述2)中第一子获取装置获取时效特征信息的方式相同或相似,在此不再赘述。3) The aging information acquisition device 1 includes a query device (not shown in the figure), a definition device (not shown in the figure) and a second sub-acquisition device (not shown in the figure). The query device queries the sudden increase keywords related to the input sequence in the pre-established user demand sudden increase keyword library according to the input sequence; when the sudden increase keywords related to the input sequence are found, the definition device will query The result is used as the timeliness information; when the sudden increase keyword related to the input sequence cannot be found, the first sub-acquisition means acquires the timeliness feature information related to the input sequence according to the input sequence, as the Steps for aging information. Wherein, the manner in which the first sub-acquisition means acquires the time-sensitive feature information related to the input sequence according to the input sequence is the same as or similar to the way in which the first sub-acquisition means acquires the time-sensitive feature information in 2) above, and will not be repeated here. .
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据来自用户的输入序列来获取与该输入序列相关的时效信息的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any time-sensitive information related to the input sequence obtained according to the input sequence from the user All implementation methods should be included within the scope of the present invention.
接着,第一判断装置2根据时效信息是否满足预定条件,判断输入序列是否具有时效性。其中,所述预定条件包括基于能够被确定为具有时效性的输入序列来获得的时效信息所需要满足的条件;例如,当时效信息包括在预建立的用户需求突增关键词库中进行查询所得的查询结果时,预定条件包括查询结果中包含“Y”或“1”等标识信息,当时效信息包括时效特征信息时,预定条件包括时效特征信息所需要满足的条件等,优选地,基于输入序列进行检索来获得的资源项数量随时间的增长越快、输入序列或输入序列包含的关键词的被检索次数随时间的增长越快、突发语义特征信息中包含的突发语义特征越强,则排序装置将输入序列判断为具有时效性的输入序列的可能性越高。Next, the first judging means 2 judges whether the input sequence has timeliness according to whether the timeliness information satisfies a predetermined condition. Wherein, the predetermined conditions include the conditions that need to be satisfied by the timeliness information obtained based on the input sequence that can be determined to be timeliness; When the query results are obtained, the predetermined conditions include identification information such as "Y" or "1" in the query results, and when the timeliness information includes timeliness feature information, the predetermined conditions include conditions that the timeliness feature information needs to meet, etc. Preferably, based on the input The faster the number of resource items retrieved by the sequence increases with time, the faster the number of retrievals of the input sequence or keywords contained in the input sequence increases with time, and the stronger the burst semantic feature contained in the burst semantic feature information , the sorting device is more likely to judge the input sequence as a time-sensitive input sequence.
例如,预定条件包括查询结果中包含标识信息“Y”或“1”,时效信息获取装置1基于输入序列获得的时效信息包括包含标识信息“Y”的查询结果;则第一判断装置2根据该时效信息满足预定条件,判断输入序列具有时效性。For example, the predetermined condition includes that the query result contains the identification information "Y" or "1", and the timeliness information obtained by the aging information acquisition device 1 based on the input sequence includes the query result containing the identification information "Y"; then the first judging device 2 according to the The timeliness information satisfies the predetermined condition, and it is judged that the input sequence has timeliness.
又例如,预定条件包括任一时间段中建立的资源项数量相对其前一时间段中建立的资源项数量的增长率超过50%;时效信息获取装置1获得的基于输入序列进行检索来获得的资源项数量随时间的变化信息包括如下信息:基于所述输入序列进行检索来获得的资源项中包含50个在前96至前120小时内建立的资源项、48个在前72至前96小时内建立的资源项、127个在前48小时至前72小时内建立的资源项、309个在前24至前48小时内建立的资源项以及1045个在24小时内建立的资源项;则第一判断装置2分析资源项数量随时间的变化信息,确定各个时间段中建立的资源项数量相对其前一时间段中建立的资源项数量的增长率,并根据在前48小时至前72小时内建立的资源项数量相对前72至前96小时内建立的资源项数量的增长率62.2%大于第一预定增长阈值50%,判断输入序列具有时效性。For another example, the predetermined condition includes that the growth rate of the number of resource items established in any time period relative to the number of resource items established in the previous time period exceeds 50%; The change information of the number of resource items over time includes the following information: among the resource items retrieved based on the input sequence, there are 50 resource items established within the previous 96 to 120 hours, 48 resource items established within the previous 72 to 96 hours 127 resource items created within the previous 48 hours to the previous 72 hours, 309 resource items created within the previous 24 to 48 hours, and 1045 resource items created within 24 hours; then A judging device 2 analyzes the change information of the quantity of resource items over time, determines the growth rate of the quantity of resource items established in each time period relative to the quantity of resource items established in the previous time period, and according to the number of resource items established in the previous 48 hours to the previous 72 hours The 62.2% growth rate of the number of resource items created within the previous 72 to 96 hours relative to the number of resource items created within the previous 96 hours is greater than the first predetermined growth threshold of 50%, and it is judged that the input sequence has timeliness.
再例如,预定条件包括以下两个条件:i)输入序列中包含突发语义特征关键词;ii)输入序列和其包含的关键词的被检索次数在任意两个时间段中随时间的增长率均超过100%。时效信息获取装置获得输入序列“地震门”中包含突发特征关键词“地震”,并获得输入序列“地震门”以及输入序列包含的关键词“地震”的被检索次数随时间的变化信息如下表4所示:For another example, the predetermined conditions include the following two conditions: i) the input sequence contains sudden semantic feature keywords; ii) the number of searches of the input sequence and the keywords contained in it is the growth rate over time in any two time periods Both exceeded 100%. The time-sensitive information acquisition device obtains the input sequence "earthquake gate" containing the sudden characteristic keyword "earthquake", and obtains the input sequence "earthquake gate" and the information on the number of retrievals of the keyword "earthquake" contained in the input sequence over time as follows Table 4 shows:
表4Table 4
则第一判断装置2对输入序列“地震门”以及输入序列包含的关键词“地震”的被检索次数随时间的变化信息进行分析,确定输入序列和其包含的关键词的被检索次数在任意两个时间段中随时间的增长率均小于100%;则第一判断装置2判断输入序列“地震门”不具有时效性。Then the first judging device 2 analyzes the input sequence "earthquake gate" and the time-dependent change information of the retrieved times of the keyword "earthquake" contained in the input sequence, and determines that the input sequence and the retrieved times of the keywords contained in it are within any The growth rates over time in the two time periods are both less than 100%; then the first judging device 2 judges that the input sequence "earthquake gate" has no timeliness.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据时效信息是否满足预定条件来判断输入序列是否具有时效性的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any method that judges whether the input sequence has timeliness based on whether the timeliness information satisfies the predetermined condition Implementation methods should all be included within the scope of the present invention.
当判断所述输入序列具有时效性时,时效排序装置3对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。When it is judged that the input sequence has timeliness, the timeliness sorting device 3 performs timeliness sorting on the resource items retrieved based on the input sequence, so as to provide the sorted resource items to the user.
其中,时效排序装置3对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给用户的方式包括但不限于:Wherein, the timeliness sorting device 3 performs timeliness sorting on the resource items retrieved based on the input sequence, and the ways of providing the sorted resource items to the user include but not limited to:
1)时效排序装置3由基于所述输入序列进行检索来获得的多个资源项中选择一部分资源项,并将所选择的资源项置于预定的排序位置,再对未选择的资源项进行排序;接着,排序装置将排序后的资源项提供给用户。其中,时效排序装置3能够根据诸如资源项的建立或更新时间、资源项的点击率、预确定的资源项的质量、资源项与输入序列的匹配度等多种因素来选择前述一部分资源项。1) The timeliness sorting device 3 selects a part of resource items from a plurality of resource items retrieved based on the input sequence, and places the selected resource items in a predetermined sorting position, and then sorts the unselected resource items ; Next, the sorting device provides the sorted resource items to the user. Among them, the timeliness sorting device 3 can select the above-mentioned part of resource items according to various factors such as establishment or update time of resource items, click rate of resource items, predetermined quality of resource items, matching degree of resource items and input sequence, etc.
例如,预定的排序位置为前三位排序位置,排序装置基于所述输入序列进行检索来获得多个资源项,时效排序装置3由该多个资源项中选择建立时间短于24小时且与输入序列的匹配度大于预定匹配度阈值的资源项,接着,时效排序装置3进一步由所选择的资源项中选择点击率前三位的资源项,并将该点击率前三位的资源项按照点击率由高至低的顺序置于前三位排序位置;接着,时效排序装置3采用现有技术来对其余资源项进行排序;接着,排序装置将所确定的前三位资源项以及排序后的其他资源项提供给用户。For example, the predetermined sorting position is the first three sorting positions, and the sorting device searches based on the input sequence to obtain multiple resource items, and the timeliness sorting device 3 selects from the multiple resource items that the construction time is shorter than 24 hours and is consistent with the input The resource items whose matching degree of the sequence is greater than the predetermined matching degree threshold, then, the timeliness sorting device 3 further selects the resource items with the top three click rates from the selected resource items, and sorts the resource items with the top three click rates according to the click rate. Rates are placed in the top three sorting positions in order from high to low; then, the aging sorting device 3 uses the prior art to sort the remaining resource items; then, the sorting device sorts the determined top three resource items and the sorted Other resource items are provided to the user.
2)时效排序装置3由基于所述输入序列进行检索来获得的多个资源项中选择一部分资源项,并对所选择的资源项进行加权处理,以根据加权处理后的处理结果来进行排序,并将排序后的资源项提供给用户。该通过加权处理来进行排序的方式将在参照图2所示实施例中予以详述,在此不予赘述。2) The timeliness sorting device 3 selects a part of resource items from a plurality of resource items retrieved based on the input sequence, and performs weighting processing on the selected resource items, so as to sort according to the weighted processing results, And provide the sorted resource items to the user. The manner of sorting through weighting processing will be described in detail with reference to the embodiment shown in FIG. 2 , and will not be repeated here.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何当判断所述输入序列具有时效性时,对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that when it is judged that the input sequence is time-sensitive, any The resource items retrieved in sequence are time-sensitively sorted, so as to provide the sorted resource items to the user, all of which should be included in the scope of the present invention.
需要进一步说明的是,当判断所述输入序列具有时效性时,排序装置或其他装置对基于所述输入序列进行检索来获得的资源项进行现有技术的排序处理,以将排序后的资源项提供给所述用户;或者,当判断所述输入序列具有时效性时,排序装置不执行任何操作。It should be further explained that, when it is judged that the input sequence has timeliness, the sorting device or other devices perform prior art sorting processing on the resource items retrieved based on the input sequence, so that the sorted resource items provided to the user; or, when it is judged that the input sequence is time-sensitive, the sorting device does not perform any operation.
作为优选方式之一,时效排序装置3根据所需要展现的资源项数量以及所需要展现的资源项排序位置,确定该需要展现的资源项排序位置所对应的资源项。例如,用户设备向搜索引擎发送用于进行查询的输入序列,则搜索引擎中的时效排序装置3根据该用户设备一次性所能够呈现的资源项数量为10,由基于该输入序列来获得的资源项中选择进行选择,并确定前10位排序位置对应的资源项来反馈给用户设备,以使用户设备将排序前10位的资源项提供给用户;接着,当用户设备向搜索引擎发送翻页请求,时效排序装置3再次由基于该输入序列来获得的资源项中进行选择,并确定排序11至20位对应的资源项来反馈给用户设备,以使用户设备将该排序11至20位的资源项提供给用户。As one of the preferred ways, the timeliness sorting device 3 determines the resource item corresponding to the resource item sorting position that needs to be presented according to the number of resource items that need to be presented and the sorting position of the resource items that need to be presented. For example, if the user equipment sends an input sequence for querying to the search engine, then the timeliness sorting device 3 in the search engine can present 10 resource items at one time according to the user equipment, and the resource items obtained based on the input sequence Item selection, and determine the resource items corresponding to the top 10 ranking positions to feed back to the user equipment, so that the user equipment will provide the top 10 resource items to the user; then, when the user equipment sends the search engine to turn the page request, the timeliness sorting device 3 selects again from the resource items obtained based on the input sequence, and determines the resource items corresponding to the ranks 11 to 20 to feed back to the user equipment, so that the user equipment ranks the resource items of the ranks 11 to 20 Resource items are offered to users.
作为优选方式之一,时效排序装置3对基于输入序列来获得的资源项进行排序后,根据需要展现给用户的资源项排序位置,来选择部分排序位置的资源项提供给用户。例如,用户设备向搜索引擎发送用于进行查询的输入序列,则搜索引擎中的时效排序装置3对基于输入序列来获得的资源项进行排序后,基于默认的一次提供10个排序后的资源项的规则,将排序前10位的资源项提供给用户设备,以使用户设备将排序前10位的资源项提供给用户;接着,当用户设备项搜索引擎发送提供第3页资源项的请求,则时效排序装置3根据第3页对应的资源项为21至30位资源项,来将排序第21至30位的资源项提供给用户设备,以使用户设备将排序第21至30位的资源项的资源项提供给用户。As one of the preferred ways, after sorting the resource items obtained based on the input sequence, the timeliness sorting device 3 selects some resource items in the sorted position and provides them to the user according to the sorting positions of the resource items presented to the user as required. For example, if the user equipment sends an input sequence for querying to the search engine, the timeliness sorting device 3 in the search engine sorts the resource items obtained based on the input sequence, and provides 10 sorted resource items at a time based on the default According to the rule, provide the top 10 resource items to the user equipment, so that the user equipment will provide the top 10 resource items to the user; then, when the user equipment item search engine sends a request to provide resource items on the third page, Then the timeliness sorting device 3 provides the resource items ranked 21st to 30th to the user equipment according to the resource items corresponding to the 3rd page being 21st to 30th resource items, so that the user equipment will sort the 21st to 30th resource items The item's resource item is provided to the user.
作为本发明的优选方案之一,根据本发明的排序装置还包括更新装置(图未示),该更新装置根据其包含的点击信息获取装置(图未示)、第二判断装置(图未示)以及删除装置(图未示),来对包含于所述用户需求突增关键词库中的至少一个突增关键词分别执行操作。As one of the preferred solutions of the present invention, the sorting device according to the present invention also includes an update device (not shown in the figure), which updates according to the click information acquisition device (not shown in the figure) and the second judging device (not shown in the figure) included in the update device. ) and a deleting device (not shown in the figure), to respectively perform operations on at least one sudden increase keyword contained in the user demand sudden increase keyword library.
点击信息获取装置获取一个或多个用户对基于当前处理的突增关键词或包含该当前处理的突增关键词的输入序列来获得的排序后的资源项的点击信息。优选地,该点击信息中包括基于当前处理的突增关键词或包含该当前处理的突增关键词的输入序列来获得的排序后的各个资源项的点击量和/或点击率。The click information acquiring means acquires click information of one or more users on sorted resource items obtained based on the currently processed sudden increase keyword or an input sequence containing the currently processed sudden increase keyword. Preferably, the click information includes the click amount and/or click rate of each sorted resource item obtained based on the currently processed sudden increase keyword or an input sequence containing the currently processed sudden increase keyword.
具体地,点击信息获取装置获取其他装置提供的点击信息;或者,点击信息获取装置通过在预定时间段内对用户基于前述排序后的资源项的点击行为进行监控并统计,来获得点击信息。Specifically, the click information acquisition means acquires click information provided by other devices; or, the click information acquisition means monitors and counts click behaviors of users based on the sorted resource items within a predetermined period of time to obtain click information.
接着,第二判断装置根据前述步骤S41中所得的点击信息,判断该突增关键词是否时效性已过。Next, the second judging means judges, according to the click information obtained in the aforementioned step S41, whether the timeliness of the sudden increase keyword has expired.
其中,第二判断装置判断突增关键词是否时效性已过的方式包括但不限于:Wherein, the way for the second judging device to judge whether the timeliness of the sudden increase keyword has expired includes but is not limited to:
1)第二判断装置根据被时效排序装置3指定至预定排序位置或者选择进行加权处理的资源项的点击率或点击量,并结合其他资源项的点击率或点击量间,来获得判断评价值;并根据该判断评价值是否大于或小于第一预定判断阈值,来判断当前处理的突增关键词是否时效性已过。1) The second judging means obtains the judging evaluation value according to the click-through rate or click-through rate of the resource item assigned to the predetermined sorting position or selected to be weighted by the aging sorting means 3, combined with the click-through rate or click-through rate of other resource items ; and according to whether the judgment evaluation value is greater than or less than the first predetermined judgment threshold, it is judged whether the timeliness of the sudden increase keyword currently processed has expired.
例如,点击信息获取装置获得的用户对基于突增关键词“5月CPI”进行检索所得的7个资源项的点击量如下表5所示:For example, the amount of clicks obtained by the user on the seven resource items retrieved based on the sudden increase keyword "CPI in May" obtained by clicking on the information acquisition device is shown in Table 5 below:
表5table 5
其中,资源项A和E为被指定至预定排序位置的资源项,资源项D为被选择进行加权处理的资源项,则第二判断装置将资源项A、D和E的点击量平均值与资源项B、C、F和G的点击量平均值的差值733作为判断评价值,并根据判断评价值大于第一预定判断阈值500,判断当前处理的突增关键词时效性未过。Wherein, resource items A and E are resource items assigned to a predetermined sorting position, and resource item D is a resource item selected for weighting processing, then the second judging means compares the average value of hits of resource items A, D, and E with The difference 733 of the average value of hits of resource items B, C, F, and G is used as the judgment evaluation value, and according to the judgment evaluation value being greater than the first predetermined judgment threshold 500, it is judged that the timeliness of the sudden increase keyword currently processed has not expired.
2)第二判断装置根据各个资源项中建立或更新时间短于第一预定时间阈值的资源项的点击信息,并结合各个资源项中建立或更新时间长于第二预定时间阈值的资源项的点击信息,来获得判断评价值;并根据该判断评价值是否大于或小于第二预定判断阈值,来判断当前处理的突增关键词是否时效性已过。2) The second judging means is based on the click information of resource items whose establishment or update time is shorter than the first predetermined time threshold in each resource item, combined with the click information of resource items whose establishment or update time is longer than the second predetermined time threshold in each resource item information to obtain a judgment evaluation value; and according to whether the judgment evaluation value is greater than or less than a second predetermined judgment threshold, it is judged whether the timeliness of the currently processed sudden increase keyword has expired.
例如,第二判断装置将各个资源项中建立或更新时间短于3天的资源项的点击信息与建立或更新时间长于90天的资源项的点击信息的比值作为判断评价值,并根据该判断评价值是否大于或小于第二预定判断阈值,来判断当前处理的突增关键词是否时效性已过。For example, the second judging means takes the ratio of the click information of the resource items whose establishment or update time is shorter than 3 days and the click information of the resource items whose establishment or update time is longer than 90 days in each resource item as the judgment evaluation value, and according to the judgment Whether the evaluation value is greater than or less than a second predetermined judgment threshold is used to judge whether the timeliness of the currently processed sudden keyword has expired.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据点击信息,判断突增关键词是否时效性已过的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any implementation of judging whether the timeliness of a sudden increase keyword has expired based on the click information methods, should be included within the scope of the present invention.
当判断该突增关键词时效性已过,删除装置在所述用户需求突增关键词库中删除该突增关键词。When it is judged that the timeliness of the sudden increase keyword has expired, the deleting device deletes the sudden increase keyword in the user demand sudden increase keyword database.
优选地,当判断该突增关键词时效性未过,排序装置不对该突增关键词执行操作。Preferably, when it is judged that the timeliness of the sudden increase keyword has not expired, the sorting device does not perform any operation on the sudden increase keyword.
作为本实施例的优选方案之一,第一判断装置2进一步包括子判断装置(图未示)以及添加装置(图未示)。As one of the preferred solutions of this embodiment, the first judging device 2 further includes a sub-judging device (not shown in the figure) and an adding device (not shown in the figure).
子判断装置根据所述时效特征信息是否满足预定条件,判断所述输入序列是否具有时效性。其中,子判断装置根据时效特征信息是否满足预定条件来判断输入序列是否具有时效性的方式与第一判断装置2根据时效特征信息是否满足预定条件来判断输入序列是否具有时效性的方式相同或相似,在此不再赘述。The sub-judgment means judges whether the input sequence is time-sensitive according to whether the time-sensitive characteristic information satisfies a predetermined condition. Wherein, the manner in which the sub-judgment means judges whether the input sequence is time-sensitive according to whether the time-sensitive feature information satisfies the predetermined condition is the same or similar to the way that the first judging means 2 judges whether the input sequence is time-sensitive according to whether the time-sensitive feature information satisfies the predetermined condition , which will not be repeated here.
当判断所述输入序列具有时效性时,添加装置将所述输入序列和/或所述输入序列中的关键词加入所述用户需求突增关键词库中。优选地,添加装置还将与输入序列对应的时效特征信息加入所述用户需求突增关键词库中;更优选地,添加装置还将基于时效特征信息来获得的信息,例如,资源项数量或输入序列的被检索次数随时间的增长率等,加入所述用户需求突增关键词库中。When judging that the input sequence has timeliness, the adding device adds the input sequence and/or keywords in the input sequence to the user demand sudden increase keyword library. Preferably, the adding means also adds the time-sensitive feature information corresponding to the input sequence into the user demand sudden increase keyword library; more preferably, the adding means also adds information obtained based on the time-sensitive feature information, for example, the number of resource items or The growth rate of the search times of the input sequence over time, etc., are added to the keyword database for sudden increase in user demand.
根据本发明的排序装置能够识别具有时效性的输入序列,并能够针对基于该具有时效性的输入序列进行检索所得的资源项进行时效性排序,以将可能与该输入序列的时效性相关的资源项排序提前,更好地满足了用户的突发检索需求。The sorting device according to the present invention can identify the input sequence with timeliness, and can perform timeliness sorting on the resource items retrieved based on the input sequence with timeliness, so as to sort resources that may be related to the timeliness of the input sequence Items are sorted in advance, which better meets the user's sudden retrieval needs.
图4为本发明一个优选实施例的用于对资源项进行排序的排序装置结构示意图。根据本实施例的方法包括时效信息获取装置1、第一判断装置2以及时效排序装置3。其中,时效排序装置3进一步包括子排序装置31。Fig. 4 is a schematic structural diagram of a sorting device for sorting resource items according to a preferred embodiment of the present invention. The method according to this embodiment includes an aging information acquiring means 1 , a first judging means 2 and an aging sorting means 3 . Wherein, the aging sorting device 3 further includes a sub-sorting device 31 .
时效信息获取装置1以及第一判断装置2已在参照图2所示实施例中予以详述,并以引用的方式包含于此,不再赘述。The aging information acquiring device 1 and the first judging device 2 have been described in detail with reference to the embodiment shown in FIG. 2 , and are included here by reference, and will not be repeated here.
当判断所述输入序列具有时效性时,时效排序装置3对基于所述输入序列进行检索来获得的资源项进行时效性排序,以将排序后的资源项提供给所述用户。本实施例中,时效排序装置3中的子排序装置31执行所述时效性排序的操作,具体地,子排序装置31根据所述各个资源项的新鲜度相关信息,由其中选择至少一个资源项,并对所选择的至少一个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序的步骤。When it is judged that the input sequence has timeliness, the timeliness sorting device 3 performs timeliness sorting on the resource items retrieved based on the input sequence, so as to provide the sorted resource items to the user. In this embodiment, the sub-sorting device 31 in the timeliness sorting device 3 performs the operation of the timeliness sorting, specifically, the sub-sorting device 31 selects at least one resource item from among them according to the freshness related information of each resource item , and performing weighting processing on at least one selected resource item, so as to perform a step of sorting according to a processing result after the weighting processing.
其中,所述新鲜度相关信息包括能够用于确定各个资源项的建立或更新时间是否属于输入序列的时效期间的相关信息。优选地,新鲜度相关信息包括以下至少一项:Wherein, the freshness related information includes related information that can be used to determine whether the establishment or update time of each resource item belongs to the aging period of the input sequence. Preferably, the freshness-related information includes at least one of the following:
1)资源项的建立或更新时间以及预确定的输入序列的用户需求突增时间。其中,排序装置能够获取预存储的各个资源项的建立或更新时间,例如,排序装置或其他装置在抓取各个资源项的同时,存储各个资源项的建立或更新时间,以供后续使用;其中,该建立或更新时间可为各个资源项在其网络中的建立或更新时间,也可为排序装置或其他装置抓取并存储或更新各个资源项的时间。其中,在子排序装置31执行操作前,所述预确定的输入序列的用户需求突增时间能够由排序装置或其他装置根据以下至少一项因素来确定:a)基于输入序列进行检索来获得的资源项数量随时间的变化信息,该资源项数量随时间的变化信息已在参照图3所示实施例中对时效特征信息进行说明的部分予以详述,在此不再赘述;排序装置或其他装置通过根据前述资源项数量随时间的变化信息来确定资源项数量随时间的增长强度,以确定用户需求突增时间,例如,排序装置或其他装置将资源项数量随时间的增长强度第一次超过第二预定增长阈值的时间段的起始时间点作为用户需求突增时间;b)输入序列和/或输入序列包含的关键词的被检索次数随时间的变化信息,该被检索次数随时间的变化信息已在参照图3所示实施例中对时效特征信息进行说明的部分予以详述,在此不再赘述;排序装置或其他装置通过根据前述被检索次数随时间的变化信息来确定被检索次数随时间的增长强度,以确定用户需求突增时间。1) The creation or update time of the resource item and the user demand surge time of the predetermined input sequence. Wherein, the sorting device can obtain the establishment or update time of each resource item stored in advance, for example, the sorting device or other devices store the establishment or update time of each resource item while grabbing each resource item, for subsequent use; , the establishment or update time may be the establishment or update time of each resource item in its network, or the time when the sorting device or other devices capture and store or update each resource item. Wherein, before the sub-sorting device 31 executes the operation, the user demand surge time of the predetermined input sequence can be determined by the sorting device or other devices according to at least one of the following factors: a) the time obtained by searching based on the input sequence The change information of the number of resource items over time, the information about the change of the number of resource items over time has been described in detail in the part of the description of the time-sensitive feature information in the embodiment shown in Figure 3, and will not be repeated here; the sorting device or other The device determines the growth intensity of the number of resource items over time according to the change information of the number of resource items over time, so as to determine the time of sudden increase in user demand. For example, the sorting device or other devices determine the growth intensity of the number of resource items The starting time point of the time period exceeding the second predetermined growth threshold is taken as the user demand sudden increase time; b) information on the number of searches over time of the input sequence and/or keywords contained in the input sequence, the number of searches over time The change information has been described in detail in the part of the description of the time-sensitive feature information in the embodiment shown in FIG. 3 , and will not be repeated here; The growth intensity of retrieval times over time to determine the time when user demand surges.
具体地,子排序装置31由基于输入序列进行检索来获得的各个资源项中选择建立或更新时间晚于用户需求突增时间的资源项,并对所选择的资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。Specifically, the sub-sorting device 31 selects the resource items whose establishment or update time is later than the user demand surge time from among the resource items retrieved based on the input sequence, and performs weighting processing on the selected resource items to obtain according to the The processing results after weighting processing are sorted.
例如,预确定的用户需求突增时间为5月12日,则子排序装置31由由基于输入序列进行检索来获得的各个资源项中选择建立时间晚于5月12日的资源项,并根据预定的加权值来对所选择的各个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。For example, if the predetermined user demand surge time is May 12, the sub-sorting device 31 selects the resource items whose creation time is later than May 12 from among the resource items obtained by searching based on the input sequence, and according to The selected resource items are weighted according to a predetermined weight value, so as to be sorted according to the weighted processing results.
2)所述输入序列的历史检索记录中包含的资源项。2) The resource items included in the historical retrieval records of the input sequence.
具体地,子排序装置31由基于输入序列进行检索来获得的资源项中选择未包含于该输入序列的历史检索记录中的资源项,并对所选择的资源项进行加权处理,以根据该加权处理后的处理结果来进行排序。由于所选择的资源项未包含于数据序列的历史检索记录中,因此,可认为所选择的资源项的建立时间晚于历史检索记录的最后更新时间,故基于历史检索记录来选择的资源项的建立或更新时间应处于输入序列的时效期间。Specifically, the sub-sorting device 31 selects resource items that are not included in the historical retrieval records of the input sequence from the resource items retrieved based on the input sequence, and performs weighting processing on the selected resource items, so that according to the weighted The processed results are sorted. Since the selected resource item is not included in the historical retrieval record of the data sequence, it can be considered that the creation time of the selected resource item is later than the last update time of the historical retrieval record, so the resource item selected based on the historical retrieval record The creation or update time should be within the aging period of the input sequence.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据所述各个资源项的新鲜度相关信息,由其中选择至少一个资源项,并对所选择的至少一个资源项进行加权处理,以根据该加权处理后的处理结果来进行排序的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, rather than limiting the present invention. Those skilled in the art should understand that any information related to the freshness of each resource item is selected from which at least One resource item, and performing weighting processing on at least one selected resource item, so as to sort according to the processing result after the weighting processing, should be included in the scope of the present invention.
需要进一步说明的是,子排序装置31可先选择需要所有加权处理的资源项并进行加权处理,然后再对所有进过加权处理以及未经过加权处理的资源项进行排序;或者,子排序装置31每获得一个需要加权处理的资源项,即对该资源项进行加权处理,并将该资源项插入已排序资源项中。It should be further explained that the sub-sorting device 31 may first select resource items that require all weighted processing and perform weighting processing, and then sort all resource items that have undergone weighted processing and have not undergone weighted processing; or, the sub-sorting device 31 Each time a resource item that needs to be weighted is obtained, the resource item is weighted, and the resource item is inserted into the sorted resource items.
作为本实施例的优选方案之一,本实施例的排序装置还包括强度确定装置以及加权值确定装置,前述子排序装置31进一步包括执行加权处理操作的加权处理装置(图未示),该加权处理装置根据加权值,对所选择的需要进行加权处理的资源项进行加权处理的步骤。As one of the preferred schemes of this embodiment, the sorting device of this embodiment also includes a strength determining device and a weighted value determining device, and the aforementioned sub-sorting device 31 further includes a weighting processing device (not shown) that performs a weighting processing operation, the weighting A step in which the processing device performs weighting processing on the selected resource items that need to be weighted according to the weighting value.
强度确定装置根据时效信息获取装置1获得的时效信息,确定输入序列的用户需求变化强度。The intensity determination means determines the change intensity of the user demand of the input sequence according to the aging information obtained by the aging information acquiring means 1 .
具体地,强度确定装置确定输入序列的用户需求变化强度的方式包括但不限于:Specifically, the means for determining the strength of the user demand change of the input sequence include but not limited to:
1)当时效信息中包括排序装置在用户需求突增关键词库中查询所得的查询结果,且该查询结果包含查询所得的突增关键词的用户需求变化强度时,强度确定装置根据查询结果中的突增关键词的用户需求变化强度来确定输入序列的用户需求变化强度。1) When the timeliness information includes the query result obtained by the sorting device in the user demand sudden increase keyword database, and the query result includes the user demand change intensity of the query sudden increase keyword, the strength determination device according to the query result The user demand change strength of the sudden increase keyword is used to determine the user demand change strength of the input sequence.
例如,输入序列为“福岛地震怎么样了”,时效信息获取装置1获得的查询结果中包含突增关键词“福岛地震”对应的用户需求变化强度为3级,则强度确定装置直接将突增关键词“福岛地震”对应的用户需求变化强度作为输入序列“福岛地震怎么样了”的用户需求变化强度。For example, if the input sequence is "how about the Fukushima earthquake", and the query result obtained by the time-sensitive information acquisition device 1 contains the sudden increase keyword "Fukushima earthquake", the user demand change intensity corresponding to level 3 is 3, then the intensity determination device will directly The user demand change intensity corresponding to the sudden increase keyword "Fukushima earthquake" is used as the user demand change intensity of the input sequence "how about the Fukushima earthquake".
又例如,输入序列为“福岛地震核电站”,时效信息获取装置1获得的查询结果中包含突增关键词“福岛地震”对应的用户需求变化强度为4.76,突增关键词“核电站”对应的用户需求变化强度为2.35,则强度确定装置将突增关键词对应的用户需求变化强度之和7.11作为输入序列“福岛地震核电站”的用户需求变化强度。For another example, the input sequence is "Fukushima Earthquake Nuclear Power Plant", and the query result obtained by the time-sensitive information acquisition device 1 includes the sudden increase keyword "Fukushima Earthquake", the user demand change intensity is 4.76, and the sudden increase keyword "Nuclear Power Plant" corresponds to If the user demand change intensity is 2.35, the intensity determination device takes the sum of user demand change intensity corresponding to the sudden increase keyword as 7.11 as the user demand change intensity of the input sequence "Fukushima Earthquake Nuclear Power Plant".
2)当时效信息中包括时效特征信息,则强度确定装置根据时效特征信息来确定输入序列的用户需求变化强度。2) The timeliness information includes timeliness feature information, and the intensity determining means determines the change intensity of the user demand of the input sequence according to the timeliness feature information.
例如,时效特征信息包括基于输入序列进行检索来获得的资源项数量随时间的变化信息,则强度确定装置根据预确定的诸如支持向量机、贝叶斯模型、最大熵模型等机器学习模型来确定输入序列的用户需求变化强度。其中,可通过将大量已确定用户需求变化强度以及已获得资源项数量随时间的变化信息的输入序列对机器学习模型进行训练,来获得用于确定用户需求变化强度的机器学习模型。For example, the time-sensitive feature information includes information on the change of the number of resource items over time obtained by searching based on the input sequence, and the strength determination device determines the The intensity of user demand change for the input sequence. Wherein, the machine learning model for determining the change intensity of the user demand can be obtained by training the machine learning model with a large number of input sequences that have determined the change intensity of the user demand and the information on the change of the number of resource items over time.
又例如,时效信息获取装置1基于输入序列“福岛地震”获得的时效特征信息包括以下两项:1)包含突发特征关键词“地震”以及突发特征语义结构“专有名词+普通名词”的突发语义特征信息;2)输入序列“福岛地震”的被检索次数随时间的变化信息,该变化信息如下表6所示:For another example, the time-sensitive feature information obtained by the time-sensitive information acquisition device 1 based on the input sequence "Fukushima earthquake" includes the following two items: 1) contains the burst feature keyword "earthquake" and the burst feature semantic structure "proper noun + common noun" 2) The change information of the retrieved times of the input sequence "Fukushima Earthquake" over time, the change information is shown in Table 6 below:
表6Table 6
则强度确定装置逐一计算相邻两个时间点的被检索次数差值,并获得最大差值=15247-2045=13202,接着,强度确定装置获取该最大差值相对最大被检索次数的比例=13202/17896=73.77%,接着,强度确定装置确定73.77%属于预定范围70-80%,并获得预定范围70-80%对应的用户需求变化强度为第二级。并且,强度确定装置根据当突发语义特征信息同时包含突发特征关键词以及突发特征语义结构时,该突发语义特征信息对应的等级调整方式为上调一级的预定规则,确定包含突发特征关键词“地震”以及突发特征语义结构“专有名词+普通名词”的突发语义特征信息对应的等级调整规则为上调一级。则强度确定装置根据基于突发语义特征信息确定的等级调整规则来调整基于被检索次数随时间的变化信息来获得的等级,最终确定输入序列“福岛地震”的用户需求变化强度为第一级。Then the strength determination device calculates the difference value of the number of times retrieved at two adjacent time points one by one, and obtains the maximum difference = 15247-2045 = 13202, and then, the strength determination device obtains the ratio of the maximum difference to the maximum number of times retrieved = 13202 /17896=73.77%, then, the intensity determining means determines that 73.77% belongs to the predetermined range of 70-80%, and obtains the user demand change intensity corresponding to the predetermined range of 70-80% as the second level. Moreover, the intensity determination device determines that the burst semantic feature information contains burst feature keywords and burst feature semantic structure according to the predetermined rule that the level adjustment mode corresponding to the burst semantic feature information is to increase one level. The level adjustment rule corresponding to the burst semantic feature information of the feature keyword "earthquake" and the burst feature semantic structure "proper noun + common noun" is to raise one level. Then the intensity determination device adjusts the level obtained based on the change information of the number of retrieved times over time according to the level adjustment rule determined based on the burst semantic feature information, and finally determines that the user demand change intensity of the input sequence "Fukushima earthquake" is the first level .
优选地,排序装置将输入序列及其用户需求变化强度加入用户需求突增关键词库中。Preferably, the sorting device adds the input sequence and its user demand change intensity into the user demand sudden increase keyword library.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据时效信息来确定输入序列的用户需求变化强度的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that any implementation method for determining the intensity of changes in user needs of an input sequence based on time-sensitive information, All should be included within the scope of the present invention.
接着,加权值确定装置根据该用户需求变化强度,确定加权值。Next, the weighted value determining means determines the weighted value according to the change intensity of the user's demand.
例如,当用户需求变化强度为一个数值时,直接将该数值作为加权值,或者,将该数值乘以预定的调节因子,并将乘积作为加权值等。For example, when the change intensity of the user's demand is a numerical value, the numerical value is directly used as the weighted value, or the numerical value is multiplied by a predetermined adjustment factor, and the product is used as the weighted value, etc.
又例如,当用户需求变化强度为一个等级时,获取预定的与该等级相应的加权值等。For another example, when the change intensity of the user demand is at a level, a predetermined weighted value corresponding to the level is obtained.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据该用户需求变化强度,确定加权值的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solutions of the present invention, and are not intended to limit the present invention. Those skilled in the art should understand that any implementation of determining the weighted value according to the change intensity of the user's needs should be included within the scope of the present invention.
接着,时效排序装置3执行操作,本实施例中,包含于子排序装置31中的加权处理装置根据加权确定装置确定的加权值,对所选择的需要进行加权处理的资源项进行加权处理的步骤。Next, the aging sorting device 3 performs an operation. In this embodiment, the weighting processing device included in the sub-sorting device 31 performs weighting processing on the selected resource items that need to be weighted according to the weight value determined by the weighting determination device. .
优选地,加权处理装置还包括子加权处理装置,子加权处理装置根据加权确定装置确定的加权值,并结合所选择的资源项中每个资源项的建立或更新时间,对所选择的资源项进行加权处理的步骤。Preferably, the weighting processing device further includes a sub-weighting processing device, and the sub-weighting processing device calculates the selected resource item according to the weight value determined by the weight determining device and in combination with the establishment or update time of each resource item in the selected resource item. Steps to carry out the weighting process.
具体地,排序装置根据各个资源项的建立或更新时间,来调整加权值,并根据调整后的加权值来来对所选择的资源项进行加权处理。Specifically, the sorting device adjusts the weight value according to the establishment or update time of each resource item, and performs weight processing on the selected resource item according to the adjusted weight value.
例如,加权确定装置获得的加权值为2.3,且需要进行加权处理的资源项的建立时间为5月24日13点,则子加权处理装置根据该建立时间与当前时间5月24日20点的时间间距小于预定时间间隔1天,来将加权值上调0.1,以获得调整后的加权值2.4,并根据调整后的加权值2.4,来对该资源项进行加权处理。For example, if the weight value obtained by the weight determination device is 2.3, and the establishment time of the resource item that needs to be weighted is 13 o'clock on May 24, then the sub-weight processing device will calculate the value based on the establishment time and the current time at 20 o'clock on May 24. If the time interval is less than 1 day, the weighted value is increased by 0.1 to obtain an adjusted weighted value of 2.4, and the resource item is weighted according to the adjusted weighted value of 2.4.
又例如,加权确定装置获得的加权值为0.1,且需要进行加权处理的资源项的建立时间为5月24日,接着,子加权处理装置确定该建立时间与输入序列对应的用户需求突增时间5月22的时间间隔为2天,则子加权处理装置将加权值与时间间隔的积0.2作为调整后的加权值,来对该资源项进行加权处理。For another example, the weighted value obtained by the weight determination device is 0.1, and the establishment time of the resource item that needs to be weighted is May 24, and then, the sub-weight processing device determines that the establishment time corresponds to the user demand sudden increase time of the input sequence If the time interval of May 22 is 2 days, the sub-weighting processing device uses 0.2, the product of the weight value and the time interval, as the adjusted weight value to perform weighting processing on the resource item.
需要说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明的限制,本领域技术人员应该理解,任何根据加权值,并结合所选择的资源项中每个资源项的建立或更新时间,对所选择的资源项进行加权处理的实现方式,均应包含在本发明的范围内。It should be noted that the above examples are only to better illustrate the technical solution of the present invention, rather than limit the present invention. Those skilled in the art should understand that any combination of resource items selected according to weighted values The establishment or update time of the resource item, and the implementation manner of weighting the selected resource items should all be included in the scope of the present invention.
根据本实施例的排序装置,进一步根据资源项的新鲜度来执行时效性排序,从而获得更好的时效性排序结果。According to the sorting apparatus of this embodiment, time-sensitive sorting is further performed according to the freshness of resource items, so as to obtain better time-sensitive sorting results.
对于本领域技术人员而言,显然本发明不限于上述示范性实施例的细节,而且在不背离本发明的精神或基本特征的情况下,能够以其他的具体形式实现本发明。因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本发明的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本发明内。不应将权利要求中的任何附图标记视为限制所涉及的权利要求。此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。系统权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。第一,第二等词语用来表示名称,而并不表示任何特定的顺序。It will be apparent to those skilled in the art that the invention is not limited to the details of the above-described exemplary embodiments, but that the invention can be embodied in other specific forms without departing from the spirit or essential characteristics of the invention. Accordingly, the embodiments should be regarded in all points of view as exemplary and not restrictive, the scope of the invention being defined by the appended claims rather than the foregoing description, and it is therefore intended that the scope of the invention be defined by the appended claims rather than by the foregoing description. All changes within the meaning and range of equivalents of the elements are embraced in the present invention. Any reference sign in a claim should not be construed as limiting the claim concerned. In addition, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or devices stated in the system claims may also be realized by one unit or device through software or hardware. The words first, second, etc. are used to denote names without implying any particular order.
Claims (21)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110254123.4A CN102955829B (en) | 2011-08-30 | 2011-08-30 | For the method being ranked up to resource items, device and equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110254123.4A CN102955829B (en) | 2011-08-30 | 2011-08-30 | For the method being ranked up to resource items, device and equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102955829A CN102955829A (en) | 2013-03-06 |
CN102955829B true CN102955829B (en) | 2017-11-03 |
Family
ID=47764639
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110254123.4A Active CN102955829B (en) | 2011-08-30 | 2011-08-30 | For the method being ranked up to resource items, device and equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102955829B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103164527A (en) * | 2013-03-27 | 2013-06-19 | 百度在线网络技术(北京)有限公司 | Retrieval method and retrieval equipment |
CN103984757B (en) * | 2014-05-29 | 2016-08-24 | 奇飞翔艺(北京)软件有限公司 | Search results pages is inserted the method and system of news information entry |
WO2015176624A1 (en) * | 2014-05-19 | 2015-11-26 | 北京奇虎科技有限公司 | Sudden timeliness search term identification method and system |
CN108810056B (en) * | 2017-05-04 | 2021-05-14 | 腾讯科技(北京)有限公司 | Information pushing method and device |
CN110119936A (en) * | 2019-04-30 | 2019-08-13 | 上海钒兆钛智能科技有限公司 | It is a kind of for determining the method and system of the degree of balance of digital resource item |
CN111913995A (en) * | 2020-07-08 | 2020-11-10 | 广州汇才创智科技有限公司 | Programming learning resource pushing method based on artificial intelligence and programming learning platform |
CN116578602B (en) * | 2023-06-15 | 2024-03-12 | 深圳计算科学研究院 | Time sequence ordering method and device |
CN117077802B (en) * | 2023-06-15 | 2024-07-02 | 深圳计算科学研究院 | A method and device for sorting and predicting time series data |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101246499A (en) * | 2008-03-27 | 2008-08-20 | 腾讯科技(深圳)有限公司 | Network information search method and system |
CN102073684A (en) * | 2010-12-22 | 2011-05-25 | 百度在线网络技术(北京)有限公司 | Method and device for excavating search log and page search method and device |
CN102073699A (en) * | 2010-12-20 | 2011-05-25 | 百度在线网络技术(北京)有限公司 | Method, device and equipment for improving search result based on user behaviors |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090265328A1 (en) * | 2008-04-16 | 2009-10-22 | Yahool Inc. | Predicting newsworthy queries using combined online and offline models |
-
2011
- 2011-08-30 CN CN201110254123.4A patent/CN102955829B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101246499A (en) * | 2008-03-27 | 2008-08-20 | 腾讯科技(深圳)有限公司 | Network information search method and system |
CN102073699A (en) * | 2010-12-20 | 2011-05-25 | 百度在线网络技术(北京)有限公司 | Method, device and equipment for improving search result based on user behaviors |
CN102073684A (en) * | 2010-12-22 | 2011-05-25 | 百度在线网络技术(北京)有限公司 | Method and device for excavating search log and page search method and device |
Also Published As
Publication number | Publication date |
---|---|
CN102955829A (en) | 2013-03-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102955829B (en) | For the method being ranked up to resource items, device and equipment | |
US8150841B2 (en) | Detecting spiking queries | |
EP3577574B1 (en) | Content search engine | |
US10776431B2 (en) | System and method for recommending content based on search history and trending topics | |
WO2017121251A1 (en) | Information push method and device | |
CN105631707A (en) | Advertisement click rate estimation method based on decision tree, application recommendation method and device | |
US10664481B2 (en) | Computer system programmed to identify common subsequences in logs | |
US9245014B2 (en) | Clustering of feedback reports | |
CN104142999A (en) | Search result display method and device | |
CN112818230B (en) | Content recommendation method, device, electronic equipment and storage medium | |
CN109255000B (en) | Dimension management method and device for label data | |
US8788477B1 (en) | Identifying addresses and titles of authoritative web pages by analyzing search queries in query logs | |
EP2389624A1 (en) | Sampling analysis of search queries | |
CN111061954B (en) | Search result sorting method and device and storage medium | |
US20110184940A1 (en) | System and method for detecting changes in the relevance of past search results | |
CN107291939A (en) | The clustering match method and system of hotel information | |
US10552438B2 (en) | Triggering method for instant search | |
WO2015185020A1 (en) | Information category obtaining method and apparatus | |
CN103425650A (en) | Recommendation searching method and recommendation searching system | |
CN115858815A (en) | Method for determining mapping information, advertisement recommendation method, device, equipment and medium | |
JP5488031B2 (en) | Search device | |
CN110674387B (en) | Method, apparatus and computer storage medium for data search | |
CN105989152B (en) | Method, device and system for monitoring service quality of search engine | |
CN108352032A (en) | Systems and methods for returning prioritized content | |
CN112100441A (en) | Video recommendation method, electronic device and computer-readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |