[go: up one dir, main page]

HK1192035B - Method for providing search result, and apparatus thereof - Google Patents

Method for providing search result, and apparatus thereof Download PDF

Info

Publication number
HK1192035B
HK1192035B HK14105111.7A HK14105111A HK1192035B HK 1192035 B HK1192035 B HK 1192035B HK 14105111 A HK14105111 A HK 14105111A HK 1192035 B HK1192035 B HK 1192035B
Authority
HK
Hong Kong
Prior art keywords
product information
determined
price
standard deviation
average
Prior art date
Application number
HK14105111.7A
Other languages
Chinese (zh)
Other versions
HK1192035A (en
Inventor
李嘉森
姚建强
帅朝谦
Original Assignee
阿里巴巴集团控股有限公司
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Publication of HK1192035A publication Critical patent/HK1192035A/en
Publication of HK1192035B publication Critical patent/HK1192035B/en

Links

Description

一种提供搜索结果的方法及装置Method and device for providing search results

技术领域Technical Field

本申请涉及通信技术领域,尤其涉及一种提供搜索结果的方法及装置。The present application relates to the field of communication technology, and in particular to a method and device for providing search results.

背景技术Background Art

目前,大多购物网站都可以向用户提供搜索功能,用户可以通过该搜索功能搜索其想要的商品的商品信息。具体的,用户向购物网站的服务器发送搜索词,服务器基于接收到的搜索词,搜索与该搜索词相关的商品信息,并将搜索到的商品信息提供给用户。Currently, most shopping websites offer users a search function that allows them to search for product information of desired products. Specifically, the user sends a search term to the shopping website's server. The server searches for product information related to the received search term based on the received search term and provides the searched product information to the user.

在实际应用中,由于用户通常比较关注的是排序比较靠前的商品信息,因此服务器搜索到商品信息之后,需要对搜索到的商品信息进行排序,并将排序后的商品信息提供给用户。而由于一般用户对商品信息中包含的价格比较关注,因此,在现有技术中,服务器在对搜索到的商品信息进行排序时,可以按照所包含的价格由高到低或由低到高的顺序对搜索到的商品信息进行排序。In actual applications, users are generally more interested in product information that is ranked higher. Therefore, after searching for product information, the server needs to sort the searched product information and provide the sorted product information to the user. Since users are generally more interested in the prices included in the product information, in the prior art, the server can sort the searched product information in descending or descending order based on the prices included.

但是,对于同一种商品而言,不同的卖家制定的该商品的出售价格也有所不同。例如,作为买家的用户向服务器发送的搜索词为某品牌的跑鞋,在服务器搜索到的商品信息中,10个商品信息中包含的价格为20元、20个商品信息中包含的价格为100元、5个商品信息中包含的价格为300元。而实际上该品牌的跑鞋的价格应该是120元,显然,作为买家的用户会直观的认为以100元出售该跑鞋更加合理,从而关注这20个所包含的价格为100元的商品信息。However, different sellers may set different prices for the same product. For example, a buyer might search for a certain brand of running shoes. Among the product information retrieved by the server, 10 products have a price of 20 yuan, 20 have a price of 100 yuan, and 5 have a price of 300 yuan. However, the actual price of the brand of running shoes should be 120 yuan. Obviously, the buyer would intuitively think that 100 yuan is more reasonable, and thus pay attention to the 20 products with a price of 100 yuan.

然而,现有技术中服务器只能单一的按照所包含的价格由高到低或由低到高的顺序对搜索到的商品信息进行排序,假设服务器按照价格由低到高的顺序对上例中搜索到的商品信息进行排序,则排序结果为:10个包含的价格为20元的商品信息、20个包含的价格为100元的商品信息、5个包含的价格为300元的商品信息。However, in the prior art, the server can only sort the searched product information in order of the prices contained, from high to low or from low to high. Assuming that the server sorts the searched product information in the above example in order of prices from low to high, the sorting result is: 10 product information containing a price of 20 yuan, 20 product information containing a price of 100 yuan, and 5 product information containing a price of 300 yuan.

显然,服务器并未将这20个包含的价格为100元的商品信息排在靠前的位置,而是将10个包含的价格为20元的商品信息排在了靠前的位置,因此服务器将排序后的商品信息提供给用户后,排序靠前的商品信息对应的商品往往并不是用户认为价格合理的商品,从而用户就会不断的设置筛选条件进行重新搜索,导致服务器的压力增大。Obviously, the server did not rank the 20 products priced at 100 yuan at the front, but instead ranked the 10 products priced at 20 yuan at the front. Therefore, after the server provides the sorted product information to the user, the products corresponding to the product information at the front are often not the products that the user thinks are reasonably priced. As a result, the user will constantly set filtering conditions and re-search, which increases the pressure on the server.

发明内容Summary of the Invention

本申请实施例提供一种提供搜索结果的方法及装置,用以解决现有技术中服务器将排序后的商品信息提供给用户后,用户仍需要不断的设置筛选条件进行重新搜索,导致服务器的压力较大的问题。The embodiments of the present application provide a method and apparatus for providing search results, which is used to solve the problem in the prior art that after the server provides sorted product information to the user, the user still needs to continuously set filtering conditions to re-search, causing great pressure on the server.

本申请实施例提供的一种提供搜索结果的方法,包括:An embodiment of the present application provides a method for providing search results, including:

服务器基于接收到的搜索词搜索相关的商品信息;并The server searches for relevant product information based on the received search term; and

查找预先确定的所述搜索词对应的价格最优值;以及Finding a predetermined optimal price corresponding to the search term; and

分别确定搜索到的每个商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照分别确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。The absolute value of the difference between the price contained in each searched product information and the found optimal price is determined respectively, and the searched product information is sorted and provided in ascending order according to the respectively determined absolute values.

本申请实施例提供的一种提供搜索结果的装置,包括:An embodiment of the present application provides a device for providing search results, including:

搜索模块,用于基于接收到的搜索词搜索相关的商品信息;A search module, used to search for relevant product information based on the received search terms;

最优值确定模块,用于查找预先确定的所述搜索词对应的价格最优值;An optimal value determination module, configured to find a predetermined optimal price value corresponding to the search term;

提供模块,用于分别确定搜索到的商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照分别确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。A module is provided for respectively determining the absolute value of the difference between the price contained in the searched product information and the found optimal price value, and sorting and providing the searched product information in ascending order according to the respectively determined absolute values.

本申请实施例提供一种提供搜索结果的方法及装置,该方法服务器基于接收到的搜索词进行搜索时,查找预先确定的该搜索词对应的价格最优值,分别确定搜索到的每个商品信息中包含的价格与该价格最优值的差值的绝对值,并按照确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。通过上述方法,服务器向用户提供的商品信息中,排序靠前的商品信息即为包含的价格与价格最优值相差较小的商品信息,与现有技术中单一按照包含的价格从高到低或从低到高的顺序提供搜索结果的方法相比,可以有效减少用户设置筛选条件并重新搜索的次数,这减轻了服务器的压力。An embodiment of the present application provides a method and device for providing search results. When the server performs a search based on a received search term, the method searches for a predetermined optimal price value corresponding to the search term, determines the absolute value of the difference between the price contained in each searched product information and the optimal price value, and sorts and provides the searched product information in order of the determined absolute values from small to large. Through the above method, among the product information provided by the server to the user, the product information with the highest sorting order is the product information whose contained price has the smallest difference with the optimal price value. Compared with the method in the prior art that provides search results in a single order from high to low or from low to high according to the contained price, the method can effectively reduce the number of times the user sets the filter conditions and searches again, thereby reducing the pressure on the server.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本申请实施例提供的提供搜索结果的过程;FIG1 is a process for providing search results according to an embodiment of the present application;

图2为本申请实施例提供的拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数的过程;FIG2 is a process of fitting the probability density function of the price contained in the product information searched based on the search term provided by an embodiment of the present application;

图3为本申请实施例提供的一种提供搜索结果的装置结构示意图。FIG3 is a schematic diagram of the structure of a device for providing search results provided in an embodiment of the present application.

具体实施方式DETAILED DESCRIPTION

由于现有技术中服务器只能单一的按照包含的价格由高到低或由低到高的顺序对搜索到的商品信息进行排序并提供,因此,现有技术中服务器提供的排序靠前的商品信息所包含的价格往往与大多用户所认为的合理价格相差甚远,这就会导致用户需要不断的设置筛选条件进行重新搜索,导致服务器的压力增大。Since the server in the prior art can only sort and provide the searched product information in order of the prices contained therein from high to low or from low to high, the prices contained in the top-ranked product information provided by the server in the prior art are often far from what most users consider to be reasonable prices. This will cause users to constantly set filtering conditions and search again, which increases the pressure on the server.

本申请实施例服务器预先确定搜索词对应的价格最优值,该搜索词对应的价格最优值即为在基于该搜索词所能搜索到的商品信息所包含的价格中,被大多用户所认为的合理价格,在基于搜索词进行搜索时,按照搜索到的商品信息中包含的价格与预先确定的该搜索词对应的价格最优值的差值的绝对值越小,排序越靠前的方式,对搜索到的商品信息进行排序并提供,从而向用户提供的排序靠前的商品信息即为包含的价格与该搜索词对应的价格最优值较为接近的商品信息,以降低用户通过设置筛选条件进行重新搜索的次数,减轻服务器的压力。In the embodiment of the present application, the server predetermines the optimal price value corresponding to the search term, which is the price that most users consider to be reasonable among the prices contained in the product information that can be searched based on the search term. When searching based on the search term, the searched product information is sorted and provided in such a manner that the smaller the absolute value of the difference between the price contained in the searched product information and the predetermined optimal price value corresponding to the search term, the higher the ranking. The product information with a higher ranking provided to the user is the product information whose price is closer to the optimal price value corresponding to the search term, so as to reduce the number of times the user re-searches by setting filtering conditions and alleviate the pressure on the server.

下面结合说明书附图,对本申请实施例进行详细描述。The embodiments of the present application are described in detail below with reference to the accompanying drawings.

图1为本申请实施例提供的提供搜索结果的过程,具体包括以下步骤:FIG1 is a process for providing search results according to an embodiment of the present application, which specifically includes the following steps:

S101:服务器基于接收到的搜索词搜索相关的商品信息。S101: The server searches for relevant product information based on the received search term.

与现有技术中类似的,用户想要搜索某种商品的商品信息时,向服务器发送相应的搜索词,服务器接收到用户发送的搜索词后,基于接收到的该搜索词,搜索与该搜索词相关的商品信息。Similar to the prior art, when a user wants to search for product information of a certain product, he sends the corresponding search term to the server. After the server receives the search term sent by the user, it searches for product information related to the search term based on the received search term.

S102:查找预先确定的该搜索词对应的价格最优值。S102: Finding a predetermined optimal price value corresponding to the search term.

在本申请实施例中,服务器预先针对搜索日志中记录的每个搜索词都确定了相应的价格最优值,因此服务器基于接收到的该搜索词进行搜索后,查找预先确定的该搜索词对应的价格最优值。In an embodiment of the present application, the server has previously determined a corresponding optimal price value for each search term recorded in the search log. Therefore, after the server performs a search based on the received search term, it searches for the predetermined optimal price value corresponding to the search term.

其中,预先确定该搜索词对应的价格最优值的方法可以为:根据基于该搜索词所能搜索到的所有商品信息中包含的价格,将该所有商品信息中包含的价格的平均价格作为该搜索词对应的价格最优值,或者将该所有商品信息中包含的价格的中位数作为该搜索词对应的价格最优值。当然,也可以人为设定该搜索词对应的价格最优值。The method for predetermining the optimal price value corresponding to the search term may be: taking the average of the prices of all product information that can be searched based on the search term as the optimal price value corresponding to the search term, or taking the median of the prices of all product information as the optimal price value corresponding to the search term. Of course, the optimal price value corresponding to the search term may also be manually set.

S103:分别确定搜索到的每个商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照分别确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。S103: Determine the absolute value of the difference between the price included in each searched product information and the found optimal price, sort the searched product information in ascending order of the determined absolute values, and provide the results.

也即,在对搜索到的商品信息进行排序时,按照所包含的价格与价格最优值的差值的绝对值越小,排序越靠前的方式进行排序,由于预先确定的该搜索词对应的价格最优值更加贴近大多数用户认为的以该搜索词搜索到的商品的合理价格,因此,按照步骤S103中的排序方式对搜索到的商品信息进行排序并提供给用户后,可以有效减少用户设置筛选条件进行重新搜索的次数,从而减轻服务器的压力。That is, when sorting the searched product information, the smaller the absolute value of the difference between the included price and the optimal price value, the higher the ranking. Since the predetermined optimal price value corresponding to the search term is closer to what most users consider to be a reasonable price for the product searched with the search term, after the searched product information is sorted according to the sorting method in step S103 and provided to the user, the number of times the user sets the filtering conditions to re-search can be effectively reduced, thereby reducing the pressure on the server.

在本申请实施例中,服务器需要预先针对搜索日志中记录的每个使用过的搜索词确定对应的价格最优值,确定某个搜索词对应的价格最优值的方法具体为,服务器查找搜索日志中记录的过去每次基于该搜索词进行搜索时,搜索到的商品信息中满足指定条件的商品信息,并根据查找到的商品信息中包含的价格,确定该搜索词对应的价格最优值。其中,满足指定条件的商品信息包括被用户点击的商品信息。当然,满足指定条件的商品信息还可以包括被用户收藏的商品信息、订购了相应商品的商品信息、分享的商品信息,以及其他用户执行了指定操作的商品信息。并且,服务器在查找过去每次基于该搜索词搜索到的满足指定条件的商品信息时,可以查找过去的设定时间段内每次基于该搜索词进行搜索时,搜索到的商品信息中满足指定条件的商品信息。In an embodiment of the present application, the server needs to determine the corresponding optimal price value for each used search term recorded in the search log in advance. The method for determining the optimal price value corresponding to a certain search term is specifically that the server searches the search log for product information that meets the specified conditions in the product information searched for each time a search was performed based on the search term in the past, and determines the optimal price value corresponding to the search term based on the price contained in the product information found. Among them, the product information that meets the specified conditions includes the product information clicked by the user. Of course, the product information that meets the specified conditions can also include product information collected by the user, product information for which the corresponding product has been ordered, product information shared, and product information for which other users have performed specified operations. In addition, when the server searches for product information that meets the specified conditions that was searched for each time based on the search term in the past, it can search for product information that meets the specified conditions in the product information that was searched for each time a search was performed based on the search term within a set time period in the past.

在实际应用中,服务器将基于该搜索词搜索到的商品信息提供给用户后,用户关注的往往是其认为包含的价格比较合理的商品信息,对于其认为包含的价格比较合理的商品信息,用户通常会进行点击操作以详细查看该商品信息,而对于其认为包含的价格不合理的商品信息,用户通常则不会进行点击操作而忽略该商品信息。因此,服务器根据过去每次基于该搜索词搜索到的商品信息中被点击的商品信息中包含的价格,确定该搜索词对应的价格最优值。In actual applications, after the server provides the user with product information found based on the search term, the user will often focus on product information that they believe has a reasonable price. For those product information that they believe has a reasonable price, the user will usually click on it to view the product information in detail. However, for product information that they believe has an unreasonable price, the user will usually not click on it and will ignore it. Therefore, the server determines the optimal price corresponding to the search term based on the prices of the product information that was clicked in each of the product information searches based on the search term.

例如,假设要确定搜索词“1g显卡”对应的价格最优值,则服务器在搜索日志中查找过去以“1g显卡”为搜索词进行的搜索行为。假设查找到3次搜索行为,则进一步查找进行这3次搜索行为时每次搜索到的商品信息中满足指定条件的商品信息,也即查找每次以“1g显卡”为搜索词进行搜索后,搜索到的商品信息中被用户点击的商品信息。假设查找到第1次搜索到的商品信息中被点击的商品信息为商品A的信息,第2次搜索到的商品信息中被点击的商品信息为商品B的信息,第3次搜索到的商品信息中被点击的商品信息为商品C的信息,则服务器根据查找到的该商品A的信息中包含的价格、商品B的信息中包含的价格、商品C的信息中包含的价格,确定该搜索词“1g显卡”对应的价格最优值。其中,可以将查找到的商品信息中包含的价格的平均值确定为该搜索词对应的价格最优值,也即将商品A的信息、商品B的信息、商品C的信息中分别包含的价格的平均值作为该搜索词对应的价格最优值。For example, assuming that the optimal price corresponding to the search term "1g graphics card" is to be determined, the server searches the search log for past searches using the search term "1g graphics card." Assuming that three searches are found, the server further searches for product information that meets the specified conditions among the product information found during each of the three searches. In other words, the server searches for product information that was clicked by the user after each search using the search term "1g graphics card." Assuming that the product information clicked in the first search is product A, the product information clicked in the second search is product B, and the product information clicked in the third search is product C, the server determines the optimal price corresponding to the search term "1g graphics card" based on the prices contained in the found information about product A, the prices contained in the found information about product B, and the prices contained in the found information about product C. Among them, the average value of the prices contained in the found product information can be determined as the optimal price value corresponding to the search term, that is, the average value of the prices contained in the information of product A, product B, and product C respectively can be used as the optimal price value corresponding to the search term.

当然,在根据查找到的商品信息中包含的价格确定该搜索词对应的价格最优值时,可根据实际情况将查找到的商品信息中包含的价格转换为统一的度量价格。例如,根据实际情况将查找到的商品信息中包含的价格转换为每个商品的价格,或者每斤商品的价格,或者每升商品的价格,或者每平方米商品的价格等等。Of course, when determining the optimal price corresponding to the search term based on the prices contained in the found product information, the prices contained in the found product information can be converted into a unified metric price according to actual circumstances. For example, the prices contained in the found product information can be converted into the price per product, or the price per kilogram, or the price per liter, or the price per square meter, etc. according to actual circumstances.

通过上述方法服务器可以更加准确的确定出该搜索词对应的价格最优值,使后续基于该搜索词进行搜索时,通过图1所示的步骤S103对搜索到的商品信息进行排序并提供后,可以进一步减少用户设置筛选条件进行重新搜索的次数,以进一步减轻服务器的压力。Through the above method, the server can more accurately determine the optimal price value corresponding to the search term, so that when subsequent searches are performed based on the search term, after sorting and providing the searched product information through step S103 shown in Figure 1, the number of times the user sets filtering conditions and re-searches can be further reduced, thereby further reducing the pressure on the server.

考虑到在实际应用中存在很多相似的搜索词,例如“1g显卡”和“显卡1g”,采用这两个搜索词搜索时,搜索到的商品信息是基本相同的,因此,服务器可以将搜索日志中记录的搜索词进行整理,将相似的搜索词整合为一个归一化搜索词,例如,将搜索词“1g显卡”和搜索词“显卡1g”整合为归一化搜索词“1g显卡”。在查找过去每次以该搜索词搜索到的商品信息中满足指定条件的商品信息时,则基于该搜索词的归一化搜索词在搜索日志中进行查找。Considering that there are many similar search terms in actual applications, such as "1g graphics card" and "graphics card 1g", the product information found when searching with these two search terms is basically the same. Therefore, the server can organize the search terms recorded in the search log and combine similar search terms into a normalized search term. For example, the search terms "1g graphics card" and "graphics card 1g" can be combined into the normalized search term "1g graphics card". When searching for product information that meets the specified conditions among the product information found in previous searches using this search term, the search log is searched based on the normalized search term of this search term.

例如,将搜索日志中记录的搜索词“1g显卡”和搜索词“显卡1g”整合为归一化搜索词“1g显卡”后,如果要确定搜索词“显卡1g”对应的价格最优值,则确定该搜索词“显卡1g”的归一化搜索词为“1g显卡”,查找搜索日志中记录的过去每次基于“1g显卡”进行搜索时,搜索到的被用户点击的商品信息,并根据查找到的商品信息中包含的价格确定搜索词“显卡1g”对应的价格最优值。For example, after integrating the search terms "1g graphics card" and "graphics card 1g" recorded in the search log into the normalized search term "1g graphics card", if you want to determine the optimal price value corresponding to the search term "graphics card 1g", then determine the normalized search term of the search term "graphics card 1g" as "1g graphics card", and search the search log for the product information clicked by the user each time the search was based on "1g graphics card" in the past, and determine the optimal price value corresponding to the search term "graphics card 1g" based on the price contained in the found product information.

较佳的,由于在实际应用中,对于以同一个搜索词搜索到的商品信息而言,搜索到的各商品信息中包含的价格的概率密度分布大致上是符合混合双高斯概率分布的,因此,服务器在根据查找到的各个满足指定条件的商品信息中包含的价格,确定该搜索词对应的价格最优值时,可以根据查找到的商品信息中包含的价格,采用混合双高斯模型拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数,并根据拟合的概率密度函数,确定概率密度最大时对应的价格作为该搜索词对应的价格最优值。Preferably, since in actual applications, for product information searched with the same search term, the probability density distribution of the prices contained in each searched product information roughly conforms to the mixed double Gaussian probability distribution, therefore, when the server determines the optimal price value corresponding to the search term based on the prices contained in each product information found that meets the specified conditions, it can use the mixed double Gaussian model to fit the probability density function of the prices contained in the product information searched based on the search term based on the prices contained in the found product information, and determine the price corresponding to the maximum probability density as the optimal price value corresponding to the search term based on the fitted probability density function.

其中,采用混合双高斯模型拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数的过程具体如图2所示,图2为本申请实施例提供的拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数的过程,具体包括以下步骤:Among them, the process of using the mixed double Gaussian model to fit the probability density function of the price contained in the product information searched based on the search term is specifically shown in Figure 2. Figure 2 is a process of fitting the probability density function of the price contained in the product information searched based on the search term provided by an embodiment of the present application, which specifically includes the following steps:

S201:根据查找到的商品信息中包含的价格,确定查找到的商品信息中包含的价格的总平均值。S201: Determine a total average value of the prices included in the found product information based on the prices included in the found product information.

例如,要确定搜索词“1g显卡”对应的价格最优值,服务器在搜索日志中查找过去以“1g显卡”为搜索词进行的搜索行时,搜索到的商品信息中满足指定条件的商品信息。假设共查找到n个满足指定条件的商品信息,其中,第i个商品信息中包含的价格记为xi,则确定这n个查找到的商品信息中包含的价格的总平均值For example, to determine the optimal price for the search term "1g graphics card", the server searches the search log for past searches with the search term "1g graphics card" and finds product information that meets the specified criteria. Assuming that n product information that meets the specified criteria is found, and the price contained in the i-th product information is denoted as x i , then determine the total average price of these n found product information.

S202:根据查找到的商品信息中包含的价格以及确定的总平均值,确定查找到的商品信息中包含的价格的总标准差。S202: Determine a total standard deviation of the prices included in the found product information based on the prices included in the found product information and the determined total average value.

继续沿用上例,确定了n个商品信息中包含的价格的总平均值之后,采用公式确定这n个商品信息中包含的价格的总标准差σ。Continuing with the above example, after determining the total average of the prices contained in the n product information, the formula is used to determine the total standard deviation σ of the prices contained in these n product information.

较佳的,对于查找到的商品信息中包含的价格,如果某个商品信息中包含的价格与确定的总平均值相差较大,则该商品信息中包含的价格就是异常数据,异常数据会影响后续拟合的准确性,因此,本申请实施例中服务器还可以剔除查找到的各商品信息包含的价格中的异常数据。具体的,继续沿用上例,针对查找到的n个商品信息中的第i个商品信息中包含的价格xi,如果的值大于3,则确定该第i个商品信息中包含的价格xi为异常数据,剔除该第i个商品信息中包含的价格xi。或者,如果的值不在设定范围内,则确定该第i个商品信息中包含的价格xi为异常数据,剔除该第i个商品信息中包含的价格xi,该设定范围例如可以为0.2~1.8。Preferably, for the prices contained in the found product information, if the price contained in a certain product information differs significantly from the determined overall average, then the price contained in the product information is abnormal data, and abnormal data will affect the accuracy of subsequent fitting. Therefore, in the embodiment of the present application, the server can also remove abnormal data from the prices contained in each found product information. Specifically, continuing with the above example, for the price x i contained in the i-th product information among the n found product information, if the value is greater than 3, then the price x i contained in the i-th product information is determined to be abnormal data, and the price x i contained in the i-th product information is removed. Alternatively, if the value is not within a set range, then the price x i contained in the i-th product information is determined to be abnormal data, and the price x i contained in the i-th product information is removed. The set range can be , for example, 0.2 to 1.8.

S203:将确定的总平均值与第一设定参数的乘积作为第一平均值,将确定的总平均值与第二设定参数的乘积作为第二平均值,其中,第一设定参数大于1,第二设定参数小于1。S203: taking the product of the determined total average value and a first setting parameter as a first average value, and taking the product of the determined total average value and a second setting parameter as a second average value, wherein the first setting parameter is greater than 1 and the second setting parameter is less than 1.

继续沿用上例,确定的第一平均值确定的第二平均值其中,a为第一设定参数,b为第二设定参数,本申请实施例中只要保证a和b中一个大于1,另一个小于1即可,例如a为0.3,b为1.3。Continuing with the above example, the first average value is determined and the second average value is determined, where a is the first setting parameter and b is the second setting parameter. In the embodiment of the present application, it is sufficient to ensure that one of a and b is greater than 1 and the other is less than 1, for example, a is 0.3 and b is 1.3.

S204:将确定的总标准差同时作为第一标准差和第二标准差。S204: The determined total standard deviation is used as both the first standard deviation and the second standard deviation.

继续沿用上例,确定的第一标准差σ1=σ,确定的第二标准差σ2=σ。Continuing with the above example, the first standard deviation σ 1 is determined to be σ, and the second standard deviation σ 2 is determined to be σ.

S205:针对查找到的每个商品信息中包含的价格,采用公式确定每个价格对应的中间值。 S205: For each price contained in the product information found, use a formula to determine the median value corresponding to each price.

其中,p为初始设定的权值。初始设定的权值可以设定为0.5。Where p is the initial weight value, which can be set to 0.5.

继续沿用上例,针对查找到的n个商品信息中的第i个商品信息中包含的价格xi,采用上述公式确定该第i个商品信息中包含的价格xi对应的中间值gamma(i)。Continuing with the above example, for the price x i contained in the i-th product information among the n product information found, the above formula is used to determine the median value gamma(i) corresponding to the price x i contained in the i-th product information.

S206:根据确定的每个价格对应的中间值,采用公式重新确定第一平均值,采用公式重新确定第二平均值。S206: Re-determine the first average value using a formula according to the determined median value corresponding to each price, and re-determine the second average value using a formula.

S207:根据重新确定的第一平均值和第二平均值,采用公式 重新确定第一标准差,采用公式重新确定第二标准差。 S207: Based on the newly determined first and second mean values, use the formula to re-determine the first standard deviation and the formula to re-determine the second standard deviation.

其中,公式和中的 mu1和mu2为步骤S206中重新确定的第一平均值和第二平均值。 Wherein, mu 1 and mu 2 in the formula are the first average value and the second average value re-determined in step S206.

S208:采用公式重新确定权值。S208: Re-determine the weight using a formula.

其中,n为查找到的商品信息的数量。Where n is the number of product information found.

S209:判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件,若是,则执行步骤S211,否则执行步骤S210。S209: Determine whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions. If so, execute step S211; otherwise, execute step S210.

在本申请实施例中,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件的方法具体为:判断重新确定的第一平均值与上一次确定的第一平均值的差值在设定范围内,判断重新确定的第二平均值与上一次确定的第二平均值的差值在设定范围内,判断重新确定的第一标准差与上一次确定的第一标准差的差值在设定范围内,判断重新确定的第二标准差与上一次确定的第二标准差的差值在设定范围内;当上述四个判断中的至少一个判断结果为是时,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件。In an embodiment of the present application, the method for determining whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions is specifically as follows: determining whether the difference between the re-determined first average value and the first average value determined last time is within the set range, determining whether the difference between the re-determined second average value and the second average value determined last time is within the set range, determining whether the difference between the re-determined first standard deviation and the first standard deviation determined last time is within the set range, and determining whether the difference between the re-determined second standard deviation and the second standard deviation determined last time is within the set range; when at least one of the above four judgments is yes, determining whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions.

当然,也可以当上述四个判断中的至少两个或所有判断结果均为是时,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件。Of course, when the results of at least two or all of the above four judgments are yes, it is also possible to judge whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions.

S210:根据重新确定的第一平均值、第二平均值、第一标准差、第二标准差、权值,重新确定每个价格对应的中间值,返回步骤S206。S210: Re-determine the median value corresponding to each price based on the re-determined first average value, second average value, first standard deviation, second standard deviation, and weight, and return to step S206.

也即,基于重新确定的第一平均值、第二平均值、第一标准差、第二标准差、权值, 采用公式重新确定查找到 的每个商品信息中包含的价格对应的中间值,并根据重新确定的每个中间值继续确定第一 平均值、第二平均值、第一标准差、第二标准差和权值,直至确定的第一平均值、第二平均 值、第一标准差、第二标准差满足设定条件为止。 That is, based on the newly determined first average, second average, first standard deviation, second standard deviation, and weight, the formula is used to re-determine the median price for each product found. Based on each newly determined median, the first average, second average, first standard deviation, second standard deviation, and weight are then determined until the determined first average, second average, first standard deviation, and second standard deviation meet the specified conditions.

S211:根据重新确定的第一平均值、第二平均值、第一标准差、第二标准、权值,确 定函数作为拟合的基于所 述搜索词搜索到的商品信息中包含的价格的概率密度函数。 S211: Based on the newly determined first mean, second mean, first standard deviation, second standard deviation, and weight, a function is determined as a fitted probability density function of the prices contained in the product information found based on the search term.

继续沿用上例,当得到满足设定条件的第一平均值、第二平均值、第一标准差、第 二标准差时,根据满足设定条件的第一平均值、第二平均值、第一标准差、第二标准差,以及 最终确定的权值得到函数 即为拟合的基于该搜索词“1g显卡”搜索到的商品信息中包含的价格的概率密度函数。 Continuing with the above example, when the first mean, second mean, first standard deviation, and second standard deviation satisfy the set conditions, the function obtained based on the first mean, second mean, first standard deviation, and second standard deviation, as well as the final weights, is the fitted probability density function of the prices contained in the product information found based on the search term "1g graphics card."

在后续的过程中,则可以确定概率密度最大时对应的价格作为该搜索词“1g显卡”对应的价格最优值,也即得到的上述函数取最大函数值f(x)max时对应的自变量x的值,就是该搜索词“1g显卡”对应的价格最优值。这是因为,得到上述概率密度函数是基于查找到的满足指定条件的商品信息中包含的价格确定的,而满足指定条件的商品信息是被用户点击的商品信息,因此上述概率密度函数反映了当以该搜索词“1g显卡”进行搜索时,用户点击搜索到的商品信息中包含不同价格的商品信息的概率,点击包含某个价格的商品信息的概率越大,说明认为该价格是合理价格的用户越多,因此将概率密度最大时对应的价格作为该搜索词对应的价格最优值,可以更加准确的确定出该搜索词对应的价格最优值,以进一步减少用户设置筛选条件进行重新搜索的次数,减轻服务器的压力。In the subsequent process, the price corresponding to the maximum probability density can be determined as the optimal price value corresponding to the search term "1g graphics card". That is, the value of the independent variable x corresponding to the maximum function value f(x) max of the above function is the optimal price value corresponding to the search term "1g graphics card". This is because the above probability density function is determined based on the prices contained in the product information found that meets the specified conditions, and the product information that meets the specified conditions is the product information clicked by the user. Therefore, the above probability density function reflects the probability of users clicking on product information containing different prices in the searched product information when searching with the search term "1g graphics card". The greater the probability of clicking on product information containing a certain price, the more users believe that the price is a reasonable price. Therefore, the price corresponding to the maximum probability density is used as the optimal price value corresponding to the search term. This can more accurately determine the optimal price value corresponding to the search term, further reducing the number of times users set filtering conditions and re-search, and reducing the pressure on the server.

其中,在得到拟合的基于该搜索词搜索到的商品信息中包含的价格的概率密度函 数之后,由于该函数的最大 值一定是在mu1和mu2之间出现,因此确定该函数取最大函数值f(x)max时对应的自变量x的值 的方法可以为:将mu1到mu2的范围划分为若干个子范围,例如,可以将mu1到mu2的范围划分 为n个子范围,分别为mu1到mu1+d、mu1+d到mu1+2d、mu1+2d到mu1+3d……、mu1+(n-1)d到mu2,其 中,在划分的每个子范围中随机选择一个x的值,根据得到的概率密度函数确定选择的每个x的值对应 的函数值f(x),将确定的每个函数值f(x)中最大的函数值所对应的x的值确定为该函数取 最大函数值f(x)max时对应的自变量x的值,也即该搜索词对应的价格最优函数。 After obtaining the fitted probability density function of the prices contained in the product information searched based on the search term, since the maximum value of the function must appear between mu 1 and mu 2 , the method for determining the value of the independent variable x corresponding to when the function takes the maximum function value f(x) max can be: dividing the range of mu 1 to mu 2 into several sub-ranges. For example, the range of mu 1 to mu 2 can be divided into n sub-ranges, namely mu 1 to mu 1 +d, mu 1 +d to mu 1 +2d, mu 1 +2d to mu 1 +3d, ..., mu 1 +(n-1)d to mu 2 , wherein a value of x is randomly selected in each divided sub-range, and the function value f(x) corresponding to each selected value of x is determined according to the obtained probability density function. The value of x corresponding to the largest function value among each determined function value f(x) is determined as the value of the independent variable x corresponding to when the function takes the maximum function value f(x) max , that is, the optimal price function corresponding to the search term.

较佳的,由于统计学中如果要确定某些样本的概率密度分布符合混合双高斯概率分布,就要求不同样本的数量要大于一定阈值,因此,本申请实施例中在采用图2所示的过程拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数之前,还要确定查找到的满足指定条件且所包含的不同价格的商品信息的数量大于设定阈值。Preferably, since in statistics, if one wants to determine whether the probability density distribution of certain samples conforms to a mixed double Gaussian probability distribution, the number of different samples is required to be greater than a certain threshold. Therefore, in the embodiment of the present application, before using the process shown in FIG2 to fit the probability density function of the prices contained in the product information searched based on the search term, it is also necessary to determine that the number of product information found that meets the specified conditions and contains different prices is greater than the set threshold.

另外,考虑到实际应用中基于某些搜索词搜索到的商品信息中包含的价格往往较高,采用如图2所示的过程拟合基于该搜索词搜索到的商品信息中包含的价格的概率密度函数时所需要的计算量较大,因此,在上述步骤S201中,可以将查找到的每个商品信息中包含的价格进行取对数运算,将得到的结果重新作为每个商品信息中包含的价格。In addition, considering that the prices contained in the product information searched based on certain search terms are often high in actual applications, the amount of calculation required when fitting the probability density function of the prices contained in the product information searched based on the search terms using the process shown in Figure 2 is large. Therefore, in the above step S201, the price contained in each product information found can be logarithmically operated, and the result obtained can be used as the price contained in each product information.

例如,针对查找到的n个满足指定条件的商品信息中的第i个商品信息中包含的价格xi,可以将log2xi的值重新作为该第i个商品信息中包含的价格,后续则根据重新确定的每个商品信息中包含的价格进行如图2所示的拟合过程。假设根据图2所示的过程最终确定出该搜索词对应的价格最优值为X,则还要将2X重新作为确定的该搜索词对应的价格最优值。For example, for the price x i contained in the i-th product information among the n products that meet the specified conditions, the value of log 2 x i can be re-used as the price contained in the i-th product information. Subsequently, the fitting process shown in Figure 2 is performed based on the re-determined prices contained in each product information. Assuming that the optimal price value corresponding to the search term is ultimately determined to be X according to the process shown in Figure 2, 2 X is also re-determined as the optimal price value corresponding to the search term.

在得到了该搜索词对应的价格最优值后,还可以对得到的该搜索词的价格最优值进行人工调整,并保存该搜索词的价格最优值,用以在后续基于该搜索词进行搜索时,查找该搜索词的价格最优值,并根据搜索到的各商品信息中包含的价格以及该搜索词对应的价格最优值对搜索到的商品进行排序并提供。当然,本申请实施例中服务器对搜索到的商品信息进行排序时,除了根据搜索到的各商品信息中包含的价格以及该搜索词对应的价格最优值进行排序之外,还可以综合搜索到的商品信息与该搜索词的相关性、搜索到的商品信息对应的商品的质量、提供搜索到的商品信息对应的卖家的质量等因素对搜索到的商品信息进行排序。After obtaining the optimal price value corresponding to the search term, the obtained optimal price value of the search term can also be manually adjusted, and the optimal price value of the search term can be saved to find the optimal price value of the search term when performing a subsequent search based on the search term, and the searched products can be sorted and provided according to the prices contained in the searched product information and the optimal price value corresponding to the search term. Of course, in the embodiment of the present application, when the server sorts the searched product information, in addition to sorting according to the prices contained in the searched product information and the optimal price value corresponding to the search term, it can also sort the searched product information based on factors such as the relevance of the searched product information to the search term, the quality of the product corresponding to the searched product information, and the quality of the seller corresponding to the searched product information.

服务器也可以仅当接收到用户发送的按照价格最优值进行排序的请求时,才根据搜索到的各商品信息中包含的价格以及该搜索词对应的价格最优值对搜索到的商品信息进行排序并提供,未接收到按照价格最优值进行排序的请求时,则按照默认的排序规则对搜索到的商品信息进行排序并提供。The server may also sort and provide the searched product information based on the prices contained in each searched product information and the optimal price value corresponding to the search term only when it receives a request from the user to sort by the optimal price value. If no request to sort by the optimal price value is received, the searched product information will be sorted and provided according to the default sorting rules.

图3为本申请实施例提供的一种提供搜索结果的装置结构示意图,具体包括:FIG3 is a schematic diagram of a structure of a device for providing search results according to an embodiment of the present application, which specifically includes:

搜索模块301,用于基于接收到的搜索词搜索相关的商品信息;Search module 301, for searching for relevant product information based on received search terms;

最优值确定模块302,用于查找预先确定的所述搜索词对应的价格最优值;The optimal value determination module 302 is used to find the predetermined optimal price value corresponding to the search term;

提供模块303,用于分别确定搜索到的每个商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照分别确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。The providing module 303 is used to respectively determine the absolute value of the difference between the price contained in each searched product information and the found optimal price value, and sort and provide the searched product information in ascending order according to the respectively determined absolute values.

所述最优值确定模块302具体用于,查找搜索日志中记录的过去每次基于所述搜索词进行搜索时,搜索到的商品信息中满足指定条件的商品信息,其中,满足指定条件的商品信息包括被用户点击的商品信息;根据查找到的商品信息中包含的价格,确定所述搜索词对应的价格最优值。The optimal value determination module 302 is specifically used to search for product information that meets specified conditions in each past search based on the search term, as recorded in the search log, wherein the product information that meets the specified conditions includes product information clicked by the user; and determine the optimal price value corresponding to the search term based on the price contained in the found product information.

所述最优值确定模块302具体用于,根据查找到的商品信息中包含的价格,采用混合双高斯模型拟合基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数,根据拟合的概率密度函数,确定概率密度最大时对应的价格作为所述搜索词对应的价格最优值。The optimal value determination module 302 is specifically used to use a mixed double Gaussian model to fit the probability density function of the prices contained in the product information searched based on the search term, and determine the price corresponding to the maximum probability density as the optimal price value corresponding to the search term based on the fitted probability density function.

所述最优值确定模块302具体包括:The optimal value determination module 302 specifically includes:

初始化单元3021,用于根据查找到的商品信息中包含的价格,确定查找到的商品信息中包含的价格的总平均值,根据查找到的商品信息中包含的价格以及确定的总平均值,确定查找到的商品信息中包含的价格的总标准差,将确定的总平均值与第一设定参数的乘积作为第一平均值,将确定的总平均值与第二设定参数的乘积作为第二平均值,其中,所述第一设定参数大于1,所述第二设定参数小于1;将确定的总标准差同时作为第一标准差和第二标准差;Initialization unit 3021 is configured to determine, based on the prices included in the found product information, a grand average of the prices included in the found product information; determine, based on the prices included in the found product information and the determined grand average, a grand standard deviation of the prices included in the found product information; multiply the determined grand average by a first setting parameter as a first grand average; and multiply the determined grand average by a second setting parameter as a second grand average, wherein the first setting parameter is greater than 1 and the second setting parameter is less than 1; and use the determined grand standard deviation as both the first standard deviation and the second standard deviation;

训练单元3022,用于针对查找到的每个商品信息中包含的价格,采用公式确定每个价格对应的中间 值,其中,p为初始设定的权值,mu1为第一平均值,mu2为第二平均值,σ1为第一标准差,σ2为 第二标准差,xi为查找到的第i个商品信息中包含的价格,gamma(i)为查找到的第i个商品 信息中包含的价格对应的中间值;根据确定的每个价格对应的中间值,采用公式重新确定第一平均值,采用公式重新确定第二 平均值;根据重新确定的第一平均值和第二平均值,采用公式 重新确定第一标准差,采用公式重新确定第二标准差;采用公 式重新确定权值,其中,n为查找到的商品信息的数量;判断重新确定的第 一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件;当判断结果为否时,根 据重新确定的第一平均值、第二平均值、第一标准差、第二标准差、权值,重新确定每个价格 对应的中间值,并根据重新确定的每个中间值继续确定第一平均值、第二平均值、第一标准 差、第二标准差和权值,直至确定的第一平均值、第二平均值、第一标准差、第二标准差满足 所述设定条件为止; The training unit 3022 is used to determine the median value corresponding to each price contained in each product information found using the formula, where p is the initial weight, mu 1 is the first mean value, mu 2 is the second mean value, σ 1 is the first standard deviation, σ 2 is the second standard deviation, and x i is the price contained in the i-th product information found, gamma(i) is the median value corresponding to the price contained in the i-th product information found; based on the determined median value corresponding to each price, the first average value is re-determined using the formula, and the second average value is re-determined using the formula; based on the re-determined first average value and second average value, the first standard deviation is re-determined using the formula, and the second standard deviation is re-determined using the formula; the weight is re-determined using the formula, wherein n is the number of product information found; it is judged whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions; when the judgment result is no, the median value corresponding to each price is re-determined based on the re-determined first average value, second average value, first standard deviation, second standard deviation, and weight, and the first average value, second average value, first standard deviation, second standard deviation, and weight are continued to be determined based on each re-determined median value until the determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions;

函数确定单元3023,用于当所述训练单元3022判断重新确定的第一平均值、第二平均 值、第一标准差、第二标准差满足设定条件时,根据重新确定的第一平均值、第二平均值、第一 标准差、第二标准、权值,确定函数 作为拟合的基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数。 Function determination unit 3023 is configured to, when the training unit 3022 determines that the newly determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions, determine a function based on the newly determined first average value, second average value, first standard deviation, second standard deviation, and weights as a fitted probability density function of the prices contained in the product information retrieved based on the search term.

所述训练单元3022具体用于,当判断重新确定的第一平均值与上一次确定的第一平均值的差值在设定范围内时,或者,当判断重新确定的第二平均值与上一次确定的第二平均值的差值在设定范围内时,或者,当判断重新确定的第一标准差与上一次确定的第一标准差的差值在设定范围内时,或者,当判断重新确定的第二标准差与上一次确定的第二标准差的差值在设定范围内时,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差满足所述设定条件。The training unit 3022 is specifically used to determine whether the re-determined first average value, the second average value, the first standard deviation, and the second standard deviation meet the set conditions when the difference between the re-determined first average value and the last determined first average value is within the set range, or when the difference between the re-determined second average value and the last determined second average value is within the set range, or when the difference between the re-determined first standard deviation and the last determined first standard deviation is within the set range, or when the difference between the re-determined second standard deviation and the last determined second standard deviation is within the set range.

本申请实施例提供一种提供搜索结果的方法及装置,该方法服务器基于接收到的搜索词进行搜索时,查找预先确定的该搜索词对应的价格最优值,分别确定搜索到的每个商品信息中包含的价格与该价格最优值的差值的绝对值,并按照确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供。通过上述方法,服务器向用户提供的商品信息中,排序靠前的商品信息即为包含的价格与价格最优值相差较小的商品信息,与现有技术中单一按照包含的价格从高到低或从低到高的顺序提供搜索结果的方法相比,可以有效减少用户设置筛选条件并重新搜索的次数,这减轻了服务器的压力。An embodiment of the present application provides a method and device for providing search results. When the server performs a search based on a received search term, the method searches for a predetermined optimal price value corresponding to the search term, determines the absolute value of the difference between the price contained in each searched product information and the optimal price value, and sorts and provides the searched product information in order of the determined absolute values from small to large. Through the above method, among the product information provided by the server to the user, the product information with the highest sorting order is the product information whose contained price has the smallest difference with the optimal price value. Compared with the method in the prior art that provides search results in a single order from high to low or from low to high according to the contained price, the method can effectively reduce the number of times the user sets the filter conditions and searches again, thereby reducing the pressure on the server.

显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。Obviously, those skilled in the art may make various changes and modifications to this application without departing from the spirit and scope of this application. Thus, if these modifications and variations of this application fall within the scope of the claims of this application and their equivalents, this application is intended to include these modifications and variations.

Claims (10)

1.一种提供搜索结果的方法,其特征在于,包括:1. A method for providing search results, characterized in that it includes: 服务器基于接收到的搜索词搜索相关的商品信息,所述商品信息包括商品的价格;并The server searches for relevant product information based on the received search terms, including product prices; and 从预先针对搜索日志中记录的每个搜索词确定的相应的价格最优值中,查找接收到的所述搜索词对应的价格最优值,其中,一个搜索词相应的价格最优值是通过查找搜索日志记录的过去基于所述一个搜索词进行搜索时,搜索到的满足指定条件的商品信息包含的价格确定的;以及From the pre-determined optimal price values for each search term recorded in the search log, the optimal price value corresponding to the received search term is retrieved, wherein the optimal price value corresponding to a search term is determined by looking up the prices of product information that meets specified conditions and were found in past searches based on the search term, as recorded in the search log; and 分别确定搜索到的每个商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照分别确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供;Determine the absolute value of the difference between the price contained in each searched product information and the optimal price found, and sort and provide the searched product information in ascending order of the determined absolute values; 其中,预先确定的搜索词对应的价格最优值是用户认为的以该搜索词搜索到的商品的合理价格。The optimal price corresponding to the pre-determined search term is the price that the user considers reasonable for the products found using that search term. 2.如权利要求1所述的方法,其特征在于,确定所述搜索词对应的价格最优值,具体包括:2. The method as described in claim 1, characterized in that determining the optimal price value corresponding to the search term specifically includes: 所述服务器查找搜索日志中记录的过去每次基于所述搜索词进行搜索时,搜索到的商品信息中满足指定条件的商品信息,其中,满足指定条件的商品信息包括被用户点击的商品信息;The server searches the search log for product information that meets specified conditions from the product information found during each past search based on the search term. The product information that meets the specified conditions includes product information that has been clicked by the user. 根据查找到的商品信息中包含的价格,确定所述搜索词对应的价格最优值。Based on the prices contained in the found product information, determine the optimal price value corresponding to the search term. 3.如权利要求2所述的方法,其特征在于,根据查找到的商品信息中包含的价格,确定所述搜索词对应的价格最优值,具体包括:3. The method as described in claim 2, characterized in that determining the optimal price value corresponding to the search term based on the price contained in the found product information specifically includes: 根据查找到的商品信息中包含的价格,采用混合双高斯模型拟合基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数;Based on the prices contained in the found product information, a mixture of Gaussian models is used to fit the probability density function of the prices contained in the product information searched based on the search terms; 根据拟合的概率密度函数,确定概率密度最大时对应的价格作为所述搜索词对应的价格最优值。Based on the fitted probability density function, the price corresponding to the maximum probability density is determined as the optimal price value for the search term. 4.如权利要求3所述的方法,其特征在于,采用混合双高斯模型拟合基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数,具体包括:4. The method as described in claim 3, characterized in that, fitting the probability density function of the price contained in the product information searched based on the search term using a mixture of Gaussian models, specifically including: 根据查找到的商品信息中包含的价格,确定查找到的商品信息中包含的价格的总平均值;Based on the prices contained in the found product information, determine the overall average of the prices contained in the found product information; 根据查找到的商品信息中包含的价格以及确定的总平均值,确定查找到的商品信息中包含的价格的总标准差;Based on the prices contained in the found product information and the determined total average, determine the total standard deviation of the prices contained in the found product information; 将确定的总平均值与第一设定参数的乘积作为第一平均值,将确定的总平均值与第二设定参数的乘积作为第二平均值,其中,所述第一设定参数大于1,所述第二设定参数小于1;The product of the determined total average value and the first set parameter is used as the first average value, and the product of the determined total average value and the second set parameter is used as the second average value, wherein the first set parameter is greater than 1 and the second set parameter is less than 1; 将确定的总标准差同时作为第一标准差和第二标准差;The determined total standard deviation is used as both the first and second standard deviations. 针对查找到的每个商品信息中包含的价格,采用公式确定每个价格对应的中间值,其中,p为初始设定的权值,mu1为第一平均值,mu2为第二平均值,σ1为第一标准差,σ2为第二标准差,xi为查找到的第i个商品信息中包含的价格,gamma(i)为查找到的第i个商品信息中包含的价格对应的中间值;For each product information found, the price contained in the product information is determined by a formula, where p is the initially set weight, mu 1 is the first average value, mu 2 is the second average value, σ 1 is the first standard deviation, σ 2 is the second standard deviation, xi is the price contained in the i-th product information found, and gamma(i) is the median value corresponding to the price contained in the i-th product information found. 根据确定的每个价格对应的中间值,采用公式重新确定第一平均值,采用公式重新确定第二平均值;Based on the median value corresponding to each determined price, the first average value is redetermined using a formula, and the second average value is redetermined using a formula. 根据重新确定的第一平均值和第二平均值,采用公式重新确定第一标准差,采用公式重新确定第二标准差;Based on the newly determined first and second averages, the first standard deviation is redetermined using the formula, and the second standard deviation is redetermined using the formula. 采用公式重新确定权值,其中,n为查找到的商品信息的数量;The weights are redefined using a formula, where n is the number of product information items found. 判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件;Determine whether the newly determined first mean, second mean, first standard deviation, and second standard deviation meet the set conditions; 若是,则根据重新确定的第一平均值、第二平均值、第一标准差、第二标准、权值,确定函数作为拟合的基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数;If so, then based on the newly determined first average, second average, first standard deviation, second standard deviation, and weights, a function is determined as the probability density function of the price contained in the product information searched based on the search term. 否则,根据重新确定的第一平均值、第二平均值、第一标准差、第二标准差、权值,重新确定每个价格对应的中间值,并根据重新确定的每个中间值继续确定第一平均值、第二平均值、第一标准差、第二标准差和权值,直至确定的第一平均值、第二平均值、第一标准差、第二标准差满足所述设定条件为止。Otherwise, based on the newly determined first average, second average, first standard deviation, second standard deviation, and weight, the median value corresponding to each price is re-determined, and based on each newly determined median value, the first average, second average, first standard deviation, second standard deviation, and weight are continued to be determined until the determined first average, second average, first standard deviation, and second standard deviation satisfy the set conditions. 5.如权利要求4所述的方法,其特征在于,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差满足所述设定条件,具体包括:5. The method as described in claim 4, characterized in that determining whether the newly determined first average, second average, first standard deviation, and second standard deviation satisfy the set conditions specifically includes: 判断重新确定的第一平均值与上一次确定的第一平均值的差值在设定范围内;或者Determine if the difference between the newly determined first average and the previously determined first average is within a set range; or 判断重新确定的第二平均值与上一次确定的第二平均值的差值在设定范围内;或者Determine if the difference between the newly determined second average and the previously determined second average is within a set range; or 判断重新确定的第一标准差与上一次确定的第一标准差的差值在设定范围内;或者Determine if the difference between the newly determined first standard deviation and the previously determined first standard deviation is within the set range; or 判断重新确定的第二标准差与上一次确定的第二标准差的差值在设定范围内。Determine if the difference between the newly determined second standard deviation and the previously determined second standard deviation is within the set range. 6.一种提供搜索结果的装置,其特征在于,包括:6. An apparatus for providing search results, characterized in that it comprises: 搜索模块,用于基于接收到的搜索词搜索相关的商品信息,所述商品信息包括商品的价格;The search module is used to search for relevant product information based on received search terms, the product information including the price of the product; 最优值确定模块,用于从预先针对搜索日志中记录的每个搜索词确定的相应的价格最优值中,查找接收到的所述搜索词对应的价格最优值,其中,一个搜索词相应的价格最优值是通过查找搜索日志记录的过去基于所述一个搜索词进行搜索时,搜索到的满足指定条件的商品信息包含的价格确定的;The optimal value determination module is used to find the optimal price value corresponding to the received search term from the corresponding optimal price values pre-determined for each search term recorded in the search log. The optimal price value corresponding to a search term is determined by looking up the prices of goods that meet specified conditions and were found when searching based on the search term in the past, as recorded in the search log. 提供模块,用于分别确定搜索到的每个商品信息中包含的价格与查找到的价格最优值的差值的绝对值,按照确定的绝对值由小到大的顺序对搜索到的商品信息进行排序并提供;The module provides a method to determine the absolute value of the difference between the price contained in each searched product information and the optimal price found, and to sort and provide the searched product information in ascending order of the determined absolute values. 其中,预先确定的搜索词对应的价格最优值是用户认为的以该搜索词搜索到的商品的合理价格。The optimal price corresponding to the pre-determined search term is the price that the user considers reasonable for the products found using that search term. 7.如权利要求6所述的装置,其特征在于,所述最优值确定模块具体用于,查找搜索日志中记录的过去每次基于所述搜索词进行搜索时,搜索到的商品信息中满足指定条件的商品信息,其中,满足指定条件的商品信息包括被用户点击的商品信息;根据查找到的商品信息中包含的价格,确定所述搜索词对应的价格最优值。7. The apparatus as described in claim 6, wherein the optimal value determination module is specifically configured to: search the search log for product information that meets specified conditions in the product information searched each time a search was performed based on the search term in the past, wherein the product information that meets the specified conditions includes product information clicked by the user; and determine the optimal price value corresponding to the search term based on the price contained in the searched product information. 8.如权利要求7所述的装置,其特征在于,所述最优值确定模块具体用于,根据查找到的商品信息中包含的价格,采用混合双高斯模型拟合基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数,根据拟合的概率密度函数,确定概率密度最大时对应的价格作为所述搜索词对应的价格最优值。8. The apparatus as described in claim 7, wherein the optimal value determination module is specifically configured to, based on the price contained in the found product information, use a mixture of Gaussian models to fit the probability density function of the price contained in the product information searched based on the search term, and determine the price corresponding to the maximum probability density as the optimal price value corresponding to the search term based on the fitted probability density function. 9.如权利要求8所述的装置,其特征在于,所述最优值确定模块具体包括:9. The apparatus as described in claim 8, wherein the optimal value determination module specifically comprises: 初始化单元,用于根据查找到的商品信息中包含的价格,确定查找到的商品信息中包含的价格的总平均值,根据查找到的商品信息中包含的价格以及确定的总平均值,确定查找到的商品信息中包含的价格的总标准差,将确定的总平均值与第一设定参数的乘积作为第一平均值,将确定的总平均值与第二设定参数的乘积作为第二平均值,其中,所述第一设定参数大于1,所述第二设定参数小于1;将确定的总标准差同时作为第一标准差和第二标准差;An initialization unit is configured to determine the total average price of the found product information based on the prices contained in the found product information; determine the total standard deviation of the prices contained in the found product information based on the prices contained in the found product information and the determined total average price; multiply the determined total average price by a first set parameter as a first average price; multiply the determined total average price by a second set parameter as a second average price; wherein the first set parameter is greater than 1 and the second set parameter is less than 1; and use the determined total standard deviation as both the first standard deviation and the second standard deviation. 训练单元,用于针对查找到的每个商品信息中包含的价格,采用公式确定每个价格对应的中间值,其中,p为初始设定的权值,mu1为第一平均值,mu2为第二平均值,σ1为第一标准差,σ2为第二标准差,xi为查找到的第i个商品信息中包含的价格,gamma(i)为查找到的第i个商品信息中包含的价格对应的中间值;根据确定的每个价格对应的中间值,采用公式重新确定第一平均值,采用公式重新确定第二平均值;根据重新确定的第一平均值和第二平均值,采用公式重新确定第一标准差,采用公式重新确定第二标准差;采用公式重新确定权值,其中,n为查找到的商品信息的数量;判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差是否满足设定条件;当判断结果为否时,根据重新确定的第一平均值、第二平均值、第一标准差、第二标准差、权值,重新确定每个价格对应的中间值,并根据重新确定的每个中间值继续确定第一平均值、第二平均值、第一标准差、第二标准差和权值,直至确定的第一平均值、第二平均值、第一标准差、第二标准差满足所述设定条件为止;The training unit is used to determine the median value corresponding to each price in each found product information using a formula, where p is the initially set weight, mu1 is the first average, mu2 is the second average, σ1 is the first standard deviation, σ2 is the second standard deviation, and x... Let i represent the price contained in the i-th product information found, and gamma(i) represent the median value corresponding to the price contained in the i-th product information found. Based on the determined median value corresponding to each price, a first average value is re-determined using a formula, and a second average value is re-determined using a formula. Based on the re-determined first average value and second average value, a first standard deviation is re-determined using a formula, and a second standard deviation is re-determined using a formula. The weights are re-determined using a formula, where n represents the number of product information found. It is determined whether the re-determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions. If the determination result is negative, based on the re-determined first average value, second average value, first standard deviation, second standard deviation, and weights, the median value corresponding to each price is re-determined, and based on each re-determined median value, the first average value, second average value, first standard deviation, second standard deviation, and weights are further determined until the determined first average value, second average value, first standard deviation, and second standard deviation meet the set conditions. 函数确定单元,用于当所述训练单元判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差满足设定条件时,根据重新确定的第一平均值、第二平均值、第一标准差、第二标准、权值,确定函数作为拟合的基于所述搜索词搜索到的商品信息中包含的价格的概率密度函数。The function determination unit is used to determine a function as the probability density function of the price contained in the product information searched based on the search term, when the training unit determines that the re-determined first average, second average, first standard deviation, and second standard deviation meet the set conditions. 10.如权利要求9所述的装置,其特征在于,所述训练单元具体用于,当判断重新确定的第一平均值与上一次确定的第一平均值的差值在设定范围内时,或者,当判断重新确定的第二平均值与上一次确定的第二平均值的差值在设定范围内时,或者,当判断重新确定的第一标准差与上一次确定的第一标准差的差值在设定范围内时,或者,当判断重新确定的第二标准差与上一次确定的第二标准差的差值在设定范围内时,判断重新确定的第一平均值、第二平均值、第一标准差、第二标准差满足所述设定条件。10. The apparatus of claim 9, wherein the training unit is specifically configured to: determine that the newly determined first average, second average, first standard deviation, and second standard deviation satisfy the set conditions when the difference between the newly determined first average and the previously determined first average is within a set range, or when the difference between the newly determined second average and the previously determined second average is within a set range, or when the difference between the newly determined first standard deviation and the previously determined first standard deviation is within a set range, or when the difference between the newly determined second standard deviation and the previously determined second standard deviation is within a set range.
HK14105111.7A 2014-05-30 Method for providing search result, and apparatus thereof HK1192035B (en)

Publications (2)

Publication Number Publication Date
HK1192035A HK1192035A (en) 2014-08-08
HK1192035B true HK1192035B (en) 2021-01-22

Family

ID=

Similar Documents

Publication Publication Date Title
TWI518529B (en) Search result sorting method and device
JP5860456B2 (en) Determination and use of search term weighting
JP5575902B2 (en) Information retrieval based on query semantic patterns
JP5693746B2 (en) Product information ranking
US9934293B2 (en) Generating search results
US8880513B2 (en) Presentation of items based on a theme
US20160070762A1 (en) Topic Profile Query Creation
EP2842060A1 (en) Recommending keywords
CN112579854A (en) Information processing method, device, equipment and storage medium
CN103279486A (en) Method and device for providing related searches
WO2015185020A1 (en) Information category obtaining method and apparatus
CN110427546A (en) A method and device for displaying information
CN109948048A (en) A method and system for searching, sorting and displaying commodities
CN107169821B (en) Big data query recommendation method and system
CN114064728B (en) Data query method and device, electronic equipment, medium and product
CN103514187B (en) Method and device for providing search results
CN106909647B (en) Data retrieval method and device
CN104331823B (en) It is determined that the method and device for the middle keyword reservation price that releases news
CN110110267B (en) Method and device for extracting object features and searching for objects
US9646094B2 (en) System and method for performing a multiple pass search
HK1192035B (en) Method for providing search result, and apparatus thereof
CN103631832A (en) Service object ordering method, service object searching method and related device
CN111831796B (en) A method, device, electronic device and storage medium for processing user requests
US20160063109A1 (en) Query-breadth selected search result sorting mechanism
TWI486799B (en) A method and a device for determining a weight value of a search word, a search result generating method, and a device