HK40043934B - Search method and apparatus, and computer device - Google Patents
Search method and apparatus, and computer device Download PDFInfo
- Publication number
- HK40043934B HK40043934B HK42021034080.8A HK42021034080A HK40043934B HK 40043934 B HK40043934 B HK 40043934B HK 42021034080 A HK42021034080 A HK 42021034080A HK 40043934 B HK40043934 B HK 40043934B
- Authority
- HK
- Hong Kong
- Prior art keywords
- category
- search
- content
- index
- categories
- Prior art date
Links
Description
技术领域Technical Field
本申请涉及人工智能技术领域以及搜索查询技术领域,特别是涉及一种搜索方法、装置和计算机设备。This application relates to the fields of artificial intelligence technology and search query technology, and in particular to a search method, apparatus and computer device.
背景技术Background Technology
随着科学技术的飞速发展,互联网平台已经成为当今社会查询信息的主流平台。在互联网平台中具有海量的数据,搜索引擎的出现方便了用户更快地在海量的数据中找到满足自己需求的内容。With the rapid development of science and technology, the internet has become the mainstream platform for information retrieval in today's society. The internet platform contains massive amounts of data, and the emergence of search engines makes it easier for users to find content that meets their needs more quickly within this vast amount of data.
传统的搜索引擎平台,在用户进行搜索查询时,是根据搜索词中包括的关键词提示一些关联词,以帮助用户快速输入所要查询的词语。然而,传统方法中,仅局限于对用户输入的词语进行辅助提示。这样一来,用户无法在搜索输入过程中,了解更多的搜索相关信息,具有一定的局限性,因而,导致输入过程中表达的信息量比较低。Traditional search engine platforms suggest related terms based on keywords in the search query to help users quickly input their desired words. However, this traditional method is limited to providing auxiliary suggestions based solely on the user's input. As a result, users cannot learn more about the search-related information during the input process, leading to a relatively low amount of information conveyed during the input.
发明内容Summary of the Invention
基于此,有必要针对上述技术问题,提供一种能够提高在输入过程中表达的信息量的搜索方法、装置、计算机设备和存储介质。Therefore, it is necessary to provide a search method, apparatus, computer device, and storage medium that can improve the amount of information expressed during the input process, in order to address the above-mentioned technical problems.
一种搜索方法,所述方法包括:A search method, the method comprising:
展示查询页面;所述查询页面中显示有输入区;The query page is displayed; the query page includes an input area.
当监听到在所述输入区中输入搜索词时,则在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容;When a search term is detected in the input area, aggregated suggestions corresponding to the entered search term are displayed in real time on the query page.
其中,所述聚合提示内容,包括多个索引类目和在各所述索引类目下的与所述搜索词相关的类目内容;所述多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The aggregated suggestion content includes multiple index categories and category content related to the search term under each index category; the multiple index categories are index categories supported by the search engine and are related to the query intent of the search term.
一种搜索装置,其特征在于,所述装置包括:A search device, characterized in that the device comprises:
展示模块,用于展示查询页面;所述查询页面中显示有输入区;The display module is used to display the query page; the query page displays an input area.
监听模块,用于监听在所述输入区中输入的搜索词;A listening module is used to listen for search terms entered in the input area;
所述展示模块还用于当所述监听模块监听到在所述输入区中输入搜索词时,则在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容;The display module is also used to display aggregated prompts corresponding to the entered search term in real time on the query page when the monitoring module detects that a search term has been entered in the input area.
其中,所述聚合提示内容,包括多个索引类目和在各所述索引类目下的与所述搜索词相关的类目内容;所述多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The aggregated suggestion content includes multiple index categories and category content related to the search term under each index category; the multiple index categories are index categories supported by the search engine and are related to the query intent of the search term.
在一个实施例中,输入区为搜索框。展示模块还用于当监听到在所述搜索框中输入搜索词时,则在所述搜索框下展示下拉框,并在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示。In one embodiment, the input area is a search box. The display module is also configured to, when a search term is detected being entered into the search box, display a drop-down list below the search box, and in the drop-down list, display the category content related to the search term under the multiple index categories in sequence.
在一个实施例中,展示模块还用于在所述下拉框中,按照所述类目内容与用户画像之间的匹配度由高到低的顺序,展示所述类目内容,且对应于所述类目内容展示相应的索引类目;所述用户画像,是指输入所述搜索词的用户的用户画像。In one embodiment, the display module is further configured to display the category content in the drop-down box in descending order of matching degree between the category content and the user profile, and to display the corresponding index category corresponding to the category content; the user profile refers to the user profile of the user who inputs the search term.
在一个实施例中,展示模块还用于当所述监听模块监听到对所述聚合提示内容中展示的类目内容的触发操作时,则跳转至与所触发的类目内容对应的页面;其中,跳转至的所述页面,符合所触发的类目内容所属的索引类目。In one embodiment, the display module is further configured to, when the listening module detects a triggering operation on the category content displayed in the aggregated prompt content, redirect to the page corresponding to the triggered category content; wherein the page redirected to corresponds to the index category to which the triggered category content belongs.
在一个实施例中,聚合提示内容,还包括与所述搜索词相关的意图关联提示词。展示模块还用于当监听模块监听到针对所述意图关联提示词的触发操作时,则跳转进入至第一搜索结果页;其中,在所述第一搜索结果页中,展示有以所触发的意图关联提示词为搜索条件,所搜索到的搜索结果。In one embodiment, the aggregated prompt content further includes intent-related prompt words associated with the search term. The display module is also configured to, when the monitoring module detects a triggering operation for the intent-related prompt word, redirect to a first search results page; wherein, the first search results page displays the search results obtained using the triggered intent-related prompt word as the search condition.
在一个实施例中,聚合提示内容,还包括搜索结果查看入口。展示模块还用于当监听模块监听到针对所述搜索结果查看入口的触发操作时,则跳转进入至第二搜索结果页;其中,在所述第二搜索结果页中,展示有以所述搜索词为搜索条件,所搜索得到的搜索结果。In one embodiment, the aggregated prompt content further includes a search result viewing entry. The display module is also configured to redirect to a second search result page when the listening module detects a trigger operation on the search result viewing entry; wherein, the second search result page displays the search results obtained using the search term as the search criteria.
一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现以下步骤:A computer device includes a memory and a processor, the memory storing a computer program, and the processor executing the computer program performing the following steps:
展示查询页面;所述查询页面中显示有输入区;The query page is displayed; the query page includes an input area.
当监听到在所述输入区中输入搜索词时,则在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容;When a search term is detected in the input area, aggregated suggestions corresponding to the entered search term are displayed in real time on the query page.
其中,所述聚合提示内容,包括多个索引类目和在各所述索引类目下的与所述搜索词相关的类目内容;所述多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The aggregated suggestion content includes multiple index categories and category content related to the search term under each index category; the multiple index categories are index categories supported by the search engine and are related to the query intent of the search term.
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:A computer-readable storage medium having a computer program stored thereon, the computer program performing the following steps when executed by a processor:
展示查询页面;所述查询页面中显示有输入区;The query page is displayed; the query page includes an input area.
当监听到在所述输入区中输入搜索词时,则在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容;When a search term is detected in the input area, aggregated suggestions corresponding to the entered search term are displayed in real time on the query page.
其中,所述聚合提示内容,包括多个索引类目和在各所述索引类目下的与所述搜索词相关的类目内容;所述多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The aggregated suggestion content includes multiple index categories and category content related to the search term under each index category; the multiple index categories are index categories supported by the search engine and are related to the query intent of the search term.
上述搜索方法、装置、计算机设备和存储介质,在监听到查询页面中的输入区中输入的搜索词时,实时展示包括搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目,以及各索引类目下的搜索词相关的类目内容的聚合提示内容。即,在搜索输入过程中,既能够提示与搜索词相关的内容,又能够提示搜索引擎所支持的索引类目,从而能够让用户知晓搜索引擎相关的信息,因此,相较于仅对输入的词语进行辅助提示以外,能够提供更多的搜索相关信息,提高了所表达的信息量。The aforementioned search method, apparatus, computer equipment, and storage medium, upon detecting a search term entered in the input area of the query page, display in real time aggregated suggestions including index categories supported by the search engine that are relevant to the search intent of the search term, as well as category content related to the search term under each index category. That is, during the search input process, it can suggest both content related to the search term and the index categories supported by the search engine, thereby enabling the user to understand relevant search engine information. Therefore, compared to simply providing auxiliary suggestions for the entered words, it can provide more search-related information and increase the amount of information conveyed.
一种搜索方法,所述方法包括:A search method, the method comprising:
接收所发送至的在查询页面中输入的搜索词;Receive the search terms entered on the query page;
从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目;Filter multiple index categories that are relevant to the query intent of the search terms from the index categories supported by the search engine;
获取在筛选的所述多个索引类目下的、且与所述搜索词相关的类目内容;Obtain the category content related to the search term under the filtered multiple index categories;
根据筛选的所述索引类目和所述类目内容,生成与所述搜索词对应的聚合提示内容并返回;所述聚合提示内容,用于指示在所述查询页面中,展示所述聚合提示内容。Based on the selected index categories and category content, aggregated suggestion content corresponding to the search term is generated and returned; the aggregated suggestion content is used to indicate that the aggregated suggestion content is displayed on the query page.
一种搜索装置,所述装置包括:A search device, the device comprising:
接收模块,用于接收所发送至的在查询页面中输入的搜索词;The receiving module is used to receive the search terms entered on the query page.
索引类目筛选模块,用于从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目;The index category filtering module is used to filter multiple index categories that are related to the query intent of the search terms from the index categories supported by the search engine.
获取模块,用于获取在筛选的多个索引类目下的、且与搜索词相关的类目内容;The acquisition module is used to acquire category content that is related to the search term under multiple filtered index categories;
聚合提示模块,用于根据筛选的所述索引类目和所述类目内容,生成与所述搜索词对应的聚合提示内容并返回;所述聚合提示内容,用于指示在所述查询页面中,展示所述聚合提示内容。The aggregation suggestion module is used to generate and return aggregation suggestion content corresponding to the search term based on the filtered index category and the category content; the aggregation suggestion content is used to indicate that the aggregation suggestion content is displayed on the query page.
在一个实施例中,获取模块还用于在筛选的所述多个索引类目下,召回与所述搜索词相关的候选类目内容;当同一索引类目下的候选类目内容为多个时,则按照各所述候选类目内容与所述搜索词之间的文本相关性由高到低的顺序,从同一索引类目下的多个候选类目内容中,筛选前预设数量的候选类目内容,作为用于生成聚合提示内容的类目内容。In one embodiment, the acquisition module is further configured to recall candidate category content related to the search term under the filtered plurality of index categories; when there are multiple candidate category contents under the same index category, the preset number of candidate category contents are selected from the multiple candidate category contents under the same index category in descending order of text relevance between each candidate category content and the search term, and used as category content for generating aggregated prompt content.
在一个实施例中,获取模块还用于根据所述搜索词,确定所筛选的各索引类目所分别对应的召回方式;针对每个筛选的索引类目,分别所述索引类目对应的召回方式,召回所述索引类目下与所述搜索词相关的类目内容。In one embodiment, the acquisition module is further configured to determine the recall method corresponding to each of the selected index categories based on the search term; and for each selected index category, recall the category content related to the search term under the index category according to the recall method corresponding to the index category.
在一个实施例中,聚合提示模块还用于按照筛选的所述索引类目所对应的类目意图强度由高到低的顺序,对所述索引类目进行排序,或,按照所述类目内容与所述搜索词之间的相关度由高到低的顺序,对所述类目内容进行排序;根据排序后的索引类目和所述索引类目下的所述类目内容,生成与所述搜索词对应的聚合提示内容并返回。In one embodiment, the aggregation suggestion module is further configured to sort the index categories in descending order of category intent intensity corresponding to the filtered index categories, or sort the category content in descending order of relevance between the category content and the search term; and generate and return aggregate suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
在一个实施例中,聚合提示模块还用于获取输入所述搜索词的用户的用户画像;确定所述类目内容和所述用户画像之间的匹配度;按照所述匹配度由高到低的顺序,对所述类目内容进行排序;根据排序后的索引类目和所述索引类目下的所述类目内容,生成与所述搜索词对应的聚合提示内容并返回。In one embodiment, the aggregation suggestion module is further configured to obtain a user profile of the user who inputs the search term; determine the matching degree between the category content and the user profile; sort the category content in descending order of the matching degree; and generate and return aggregate suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现以下步骤:A computer device includes a memory and a processor, the memory storing a computer program, and the processor executing the computer program performing the following steps:
接收所发送至的在查询页面中输入的搜索词;Receive the search terms entered on the query page;
从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目;Filter multiple index categories that are relevant to the query intent of the search terms from the index categories supported by the search engine;
获取在筛选的多个索引类目下的、且与搜索词相关的类目内容;Retrieve category content related to the search term from multiple filtered index categories;
根据筛选的所述索引类目和所述类目内容,生成与所述搜索词对应的聚合提示内容并返回;所述聚合提示内容,用于指示在所述查询页面中,展示所述聚合提示内容。Based on the selected index categories and category content, aggregated suggestion content corresponding to the search term is generated and returned; the aggregated suggestion content is used to indicate that the aggregated suggestion content is displayed on the query page.
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:A computer-readable storage medium having a computer program stored thereon, the computer program performing the following steps when executed by a processor:
接收所发送至的在查询页面中输入的搜索词;Receive the search terms entered on the query page;
从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目;Filter multiple index categories that are relevant to the query intent of the search terms from the index categories supported by the search engine;
获取在筛选的多个索引类目下的、且与搜索词相关的类目内容;Retrieve category content related to the search term from multiple filtered index categories;
根据筛选的所述索引类目和所述类目内容,生成与所述搜索词对应的聚合提示内容并返回;所述聚合提示内容,用于指示在所述查询页面中,展示所述聚合提示内容。Based on the selected index categories and category content, aggregated suggestion content corresponding to the search term is generated and returned; the aggregated suggestion content is used to indicate that the aggregated suggestion content is displayed on the query page.
上述搜索方法、装置、计算机设备和存储介质,在获取到输入的搜索词时,能够实时从搜索引擎所支持的索引类目中,筛选出与搜索词的查询意图相关的索引类目,以及获取各索引类目下的与搜索词相关的类目内容,生成聚合提示内容并返回至终端,以使终端在输入过程中,既能够提示与搜索词相关的内容,又能够提示搜索引擎所支持的索引类目,从而能够让用户知晓搜索引擎相关的信息,因此,相较于仅对输入的词语进行辅助提示以外,能够提供更多的搜索相关信息,提高了所表达的信息量。The aforementioned search method, apparatus, computer equipment, and storage medium, upon receiving the input search term, can in real time filter index categories related to the query intent of the search term from the index categories supported by the search engine, and obtain the category content related to the search term under each index category, generate aggregated prompt content, and return it to the terminal. This allows the terminal to provide prompts related to the search term and the index categories supported by the search engine during the input process, thereby enabling the user to know relevant information about the search engine. Therefore, compared to simply providing auxiliary prompts for the input words, it can provide more search-related information and increase the amount of information conveyed.
附图说明Attached Figure Description
图1为一个实施例中搜索方法的应用环境图;Figure 1 is an application environment diagram of the search method in one embodiment;
图2为一个实施例中搜索方法的流程示意图;Figure 2 is a flowchart illustrating the search method in one embodiment;
图3为一个实施例中展示聚合提示内容的界面示意图;Figure 3 is a schematic diagram of the interface displaying the aggregated prompt content in one embodiment;
图4为一个实施例中实体库和词表的组成内容示意图;Figure 4 is a schematic diagram of the composition of the entity library and the thesaurus in one embodiment;
图5为一个实施例中索引类目筛选示意图;Figure 5 is a schematic diagram of index category filtering in one embodiment;
图6为一个实施例中召回方式示意图;Figure 6 is a schematic diagram of a recall method in one embodiment;
图7为另一个实施例中搜索方法的流程示意图;Figure 7 is a flowchart illustrating the search method in another embodiment;
图8为一个实施例中搜索方法的原理示意图;Figure 8 is a schematic diagram of the principle of the search method in one embodiment;
图9为一个实施例中搜索装置的结构框图;Figure 9 is a structural block diagram of the search device in one embodiment;
图10为另一个实施例中搜索装置的结构框图;Figure 10 is a structural block diagram of the search device in another embodiment;
图11为一个实施例中计算机设备的内部结构图;Figure 11 is an internal structure diagram of a computer device in one embodiment;
图12为另一个实施例中计算机设备的内部结构图。Figure 12 is an internal structural diagram of a computer device in another embodiment.
具体实施方式Detailed Implementation
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。To make the objectives, technical solutions, and advantages of this application clearer, the following detailed description is provided in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative and not intended to limit the scope of this application.
本申请提供的搜索方法,可以应用于如图1所示的应用环境中。其中,终端102通过网络与服务器104进行通信。其中,终端102可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备,服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The search method provided in this application can be applied to the application environment shown in Figure 1. In this application, terminal 102 communicates with server 104 via a network. Terminal 102 can be, but is not limited to, various personal computers, laptops, smartphones, tablets, and portable wearable devices. Server 104 can be a standalone server or a server cluster consisting of multiple servers.
终端102可以展示查询页面;所述查询页面中显示有输入区。当监听到在所述输入区中输入搜索词时,终端102则可以将监听到的搜索词发送至服务器104。服务器104则可以从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目,并在筛选的所述多个索引类目下,召回与所述搜索词相关的类目内容。服务器104可以根据筛选的所述索引类目和所述类目内容,生成与所述搜索词对应的聚合提示内容并返回至终端102。终端102则可以在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容。Terminal 102 can display a query page, which includes an input area. When a search term is detected in the input area, terminal 102 can send the detected search term to server 104. Server 104 can then filter multiple index categories related to the search intent of the search term from the index categories supported by the search engine, and retrieve category content related to the search term under the filtered multiple index categories. Server 104 can generate aggregated suggestion content corresponding to the search term based on the filtered index categories and the category content, and return it to terminal 102. Terminal 102 can then display the aggregated suggestion content corresponding to the input search term in real time on the query page.
需要说明的是,本申请各实施例中的搜索方法中,搜索词与聚合提示内容是实时对应展示的,即随着搜索词的输入进度的变化,展示的聚合提示内容也会随之变化。It should be noted that in the search methods of various embodiments of this application, the search terms and aggregated prompts are displayed in real time, that is, as the input progress of the search terms changes, the displayed aggregated prompts will also change accordingly.
可以理解,本申请各实施例中的搜索方法,相当于使用了人工智能技术来自动提示与搜索词对应的聚合提示内容。It is understood that the search methods in the various embodiments of this application are equivalent to using artificial intelligence technology to automatically suggest aggregated prompts corresponding to the search terms.
人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。换句话说,人工智能是计算机科学的一个综合技术,它企图了解智能的实质,并生产出一种新的能以人类智能相似的方式做出反应的智能机器。人工智能也就是研究各种智能机器的设计原理与实现方法,使机器具有感知、推理与决策的功能。Artificial Intelligence (AI) is the theory, methods, technology, and application systems that use digital computers or machines controlled by digital computers to simulate, extend, and expand human intelligence, perceive the environment, acquire knowledge, and use that knowledge to achieve optimal results. In other words, AI is a comprehensive technology within computer science that attempts to understand the essence of intelligence and produce a new kind of intelligent machine that can react in a way similar to human intelligence. AI studies the design principles and implementation methods of various intelligent machines, enabling them to possess the functions of perception, reasoning, and decision-making.
人工智能技术是一门综合学科,涉及领域广泛,既有硬件层面的技术也有软件层面的技术。人工智能基础技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理技术、操作/交互系统、机电一体化等技术。人工智能软件技术主要包括计算机视觉技术、语音处理技术、自然语言处理技术以及机器学习/深度学习等几大方向。Artificial intelligence (AI) is a comprehensive discipline encompassing a wide range of fields, including both hardware and software technologies. Fundamental AI technologies generally include sensors, dedicated AI chips, cloud computing, distributed storage, big data processing, operating/interactive systems, and mechatronics. AI software technologies primarily include computer vision, speech processing, natural language processing, and machine learning/deep learning.
可以理解,本申请各实施例中的搜索方法,相当于使用了人工智能技术中的自然语言处理技术,来识别搜索词的查询意图,以展示与该查询意图相关的多个索引类目和相应召回的类目内容,从而自动提示与搜索词对应的聚合提示内容。It is understood that the search methods in the various embodiments of this application are equivalent to using natural language processing technology in artificial intelligence to identify the query intent of search terms, so as to display multiple index categories and corresponding recalled category content related to the query intent, thereby automatically prompting aggregated prompt content corresponding to the search terms.
自然语言处理(Nature Language processing,NLP)是计算机科学领域与人工智能领域中的一个重要方向。它研究能实现人与计算机之间用自然语言进行有效通信的各种理论和方法。自然语言处理是一门融语言学、计算机科学、数学于一体的科学。因此,这一领域的研究将涉及自然语言,即人们日常使用的语言,所以它与语言学的研究有着密切的联系。自然语言处理技术通常包括文本处理、语义理解、机器翻译、机器人问答、知识图谱等技术。Natural Language Processing (NLP) is an important field within computer science and artificial intelligence. It studies the theories and methods for enabling effective communication between humans and computers using natural language. NLP is a science that integrates linguistics, computer science, and mathematics. Therefore, research in this field involves natural language—the language people use in daily life—and thus it has a close relationship with linguistic research. NLP techniques typically include text processing, semantic understanding, machine translation, question answering, and knowledge graphs.
在一个实施例中,如图2所示,提供了一种搜索方法,以该方法应用于图1中的终端为例进行说明,包括以下步骤:In one embodiment, as shown in Figure 2, a search method is provided. Taking the application of this method to the terminal in Figure 1 as an example, the method includes the following steps:
步骤202,展示查询页面;所述查询页面中显示有输入区。Step 202: Display the query page; the query page displays an input area.
其中,查询页面,是供用户输入搜索词(query),以触发搜索查询的页面。The query page is where users can enter search terms to trigger a search query.
输入区,是用于接收输入语句的区域。The input area is the area used to receive input statements.
在一个实施例中,输入区可以包括搜索框。在其他实施例中,输入区还可以包括选择按钮,即用于只需要点击选择即可以输入搜索词,这种情况下就不需要一个专门供用户输入的搜索框,也可以实现搜索词的输入。In one embodiment, the input area may include a search box. In other embodiments, the input area may also include selection buttons, allowing users to enter search terms simply by clicking to select them. In this case, a dedicated search box is not required for user input, and search terms can still be entered.
步骤204,当监听到在输入区中输入搜索词时,则在查询页面中,实时展示与输入的搜索词对应的聚合提示内容;其中,聚合提示内容,包括多个索引类目和在各索引类目下的与搜索词相关的类目内容;多个索引类目,是搜索引擎所支持的索引类目中的、且与搜索词的查询意图相关的索引类目。Step 204: When a search term is detected in the input area, aggregated suggestions corresponding to the entered search term are displayed in real time on the query page. The aggregated suggestions include multiple index categories and category content related to the search term under each index category. The multiple index categories are those supported by the search engine and are related to the query intent of the search term.
可以理解,聚合提示内容,是指由多条提示内容聚合而成的内容。It is understandable that aggregated prompt content refers to content composed of multiple prompt messages.
本申请实施例中,聚合提示内容包括多个索引类目和在各索引类目下的与该搜索词相关的类目内容。可以理解,多个索引类目下的类目内容也属于多条不同的提示内容,因此,可以称为聚合提示内容。In this embodiment, the aggregated suggestion content includes multiple index categories and category content related to the search term under each index category. It can be understood that the category content under multiple index categories also constitutes multiple different suggestion contents; therefore, it can be called aggregated suggestion content.
可以理解,对于有时效性的搜索词来说,在不同时间所查询到的聚合提示内容可能不同。比如,搜索词是一个具有新闻时效性的关键词,那时效性期间,聚合提示内容应该展现一条最相关的新闻结果;当时效性过期后,可以展现其他相关结果。It's understandable that for time-sensitive search terms, the aggregated suggestions might differ at different times. For example, if the search term is a news-sensitive keyword, during that time period, the aggregated suggestions should display the most relevant news result; after the time period expires, other relevant results can be displayed.
聚合提示内容中展示的多个索引类目,是搜索引擎所支持的索引类目中的、且与搜索词的查询意图相关的索引类目。The multiple index categories displayed in the aggregated suggestion content are index categories supported by the search engine and are related to the query intent of the search terms.
其中,索引类目,即能够索引查询的类别或类型。搜索引擎所支持的索引类目,是指在搜索引擎中搜索查询时,能够反馈查询内容的类目。即能够从该索引类目中查询与搜索词相关的内容。The index category refers to the category or type that can be indexed for queries. The index categories supported by a search engine are those categories that provide the search results when a query is performed within the search engine. In other words, content related to the search terms can be found from these index categories.
在一个实施例中,索引类目包括子应用(即小程序)索引类目、公众账号索引类目、工具索引类目、多媒体内容索引类目(比如,音频、视频和文章等)、资讯索引类目(比如,新闻)、娱乐索引类目(比如,音乐、游戏和小说等)以及问答索引类目中的至少一种。In one embodiment, the index categories include at least one of the following: sub-application (i.e., mini-program) index category, public account index category, tool index category, multimedia content index category (e.g., audio, video, and articles), information index category (e.g., news), entertainment index category (e.g., music, games, and novels), and question and answer index category.
其中,子应用,是能够在母应用提供的环境中实现的应用程序。母应用是承载子应用的应用程序,为子应用的实现提供环境。母应用是原生应用程序。原生应用程序是可直接运行于操作系统的应用程序。子应用不用下载即可基于母应用运行,属于轻量级应用。比如,微信即为母应用,微信小程序即为运行于微信这个原生的母应用中的子应用。In this context, a child application is an application that can run within the environment provided by the parent application. The parent application is the application that hosts the child application, providing the environment for its implementation. The parent application is a native application, which can run directly on the operating system. Child applications can run based on the parent application without being downloaded, and are considered lightweight applications. For example, WeChat is the parent application, and WeChat Mini Programs are child applications running within the native parent application of WeChat.
在一个实施例中,子应用可以包括即时通信子应用、移动支付子应用和游戏子应用等多种场景下的子应用,只要其满足基于母应用运行的无需下载安装的轻量级应用这一特性,即可以称为子应用。In one embodiment, a sub-application may include sub-applications in various scenarios such as instant messaging sub-applications, mobile payment sub-applications, and game sub-applications. As long as it meets the characteristic of being a lightweight application that runs on the parent application without the need for downloading or installation, it can be called a sub-application.
公众账号是指公众平台中的应用账号,可以实现与公众平台中特定群体进行文字、图片、语音和视频的交流沟通。在一个实施例中,公众账号可以是社交平台中的公众服务号(比如,微信平台中的公众服务号),也可以是一些第三方支付平台中的服务账号等。A public account refers to an application account on a public platform, enabling communication with specific groups within the platform through text, images, voice, and video. In one embodiment, a public account can be a public service account on a social media platform (such as a public service account on WeChat), or a service account on some third-party payment platforms, etc.
需要说明的是,搜索引擎所支持的索引类目可以根据搜索引擎自身的需求确定,不同搜索引擎所支持的可以索引类目不同。It should be noted that the indexable categories supported by a search engine can be determined according to the search engine's own needs, and different search engines support different indexable categories.
现结合例子对搜索引擎支持的索引类目进行说明。假设,搜索引擎支持子应用索引类目,在输入搜索词A时,如果子应用索引类目是与搜索词A的查询意图相关的索引类目,那么,则可以从子应用索引类目中查找与搜索词A相关的内容,即查询到与搜索词A相关的子应用。又比如,搜索引擎支持公众账号索引类目,如果公众账号索引类目是与搜索词A的查询意图相关的索引类目,那么,则可以从公众账号索引类目中查找与搜索词A相关的内容,即查询到与搜索词A相关的公众账号。The following examples illustrate the index categories supported by search engines. Suppose a search engine supports sub-application index categories. When searching for term A, if the sub-application index category is relevant to the query intent of term A, then content related to term A can be found within that sub-application index category; that is, sub-applications related to term A can be retrieved. Similarly, if a search engine supports public account index categories, and this category is relevant to the query intent of term A, then content related to term A can be found within that category; that is, public accounts related to term A can be retrieved.
具体地,用户可以在输入区中输入搜索词,终端可以实时监听输入区中的输入事件,当监听到用户在输入区中输入搜索词的输入事件时,则可以将监听到的搜索词发送至服务器。服务器则可以从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目,并在筛选的所述多个索引类目下,召回与搜索词相关的类目内容。服务器可以根据筛选的多个索引类目和相应召回的类目内容,生成与搜索词对应的聚合提示内容并返回至终端。终端则可以在查询页面中,实时展示与输入的搜索词对应的聚合提示内容。Specifically, users can enter search terms in the input area. The terminal can monitor input events in the input area in real time. When the input event of a user entering a search term is detected, the terminal sends the detected search term to the server. The server can then filter multiple index categories related to the search intent from the index categories supported by the search engine, and retrieve category content related to the search term under the filtered index categories. Based on the filtered index categories and the corresponding retrieved category content, the server generates aggregated suggestion content corresponding to the search term and returns it to the terminal. The terminal can then display the aggregated suggestion content corresponding to the entered search term in real time on the query page.
在一个实施例中,服务器可以预先基于聚合分析构建不同类别的数据库(比如,实体库、新闻词表、问答词表和模式词表等),然后基于构建的数据库,从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目,以及基于构建的数据库,从在筛选的所述多个索引类目下,召回与搜索词相关的类目内容。在其他实施例中,服务器还可以直接进行意图识别分析,确定与搜索词的查询意图相关的多个索引类目,而不需要预先基于聚合分析构建不同类别的数据库。服务器还可以根据离线大数据,挖掘搜索词与类目内容的映射关系以召回类目内容,或者,直接使用搜索引擎召回类目内容。对此并不限定。In one embodiment, the server can pre-build different types of databases (e.g., entity databases, news terminologies, question-and-answer terminologies, and pattern terminologies) based on aggregate analysis. Then, based on the built databases, it filters multiple index categories related to the query intent of the search term from the index categories supported by the search engine. Furthermore, based on the built databases, it retrieves category content related to the search term from the filtered multiple index categories. In other embodiments, the server can also directly perform intent recognition analysis to determine multiple index categories related to the query intent of the search term, without needing to pre-build different types of databases based on aggregate analysis. The server can also mine the mapping relationship between search terms and category content based on offline big data to retrieve category content, or directly use the search engine to retrieve category content. This is not limited to any particular approach.
在一个实施例中,服务器可以生成内容聚合卡片并返回至终端,聚合提示内容展示于内容聚合卡片中。终端可以在查询页面中的预设位置处展示内容聚合卡片,以展示内容聚合卡片中与输入的搜索词对应的聚合提示内容。In one embodiment, the server can generate a content aggregation card and return it to the terminal, displaying aggregated suggestions within the content aggregation card. The terminal can then display the content aggregation card at a preset location on the search page to show the aggregated suggestions corresponding to the entered search terms.
需要说明的是,聚合提示内容包括但不限于多个索引类目和在各索引类目下的与搜索词相关的类目内容。即聚合提示内容中还可以包括其他内容,从而聚合更多的内容。It should be noted that the aggregated suggestion content includes, but is not limited to, multiple index categories and category content related to the search terms under each index category. In other words, the aggregated suggestion content can also include other content, thus aggregating more information.
在一个实施例中,服务器可以对召回的各类目内容进行排序,根据排序后的类目内容及相应的索引类目生成聚合提示内容,并返回至终端。In one embodiment, the server can sort the recalled category content, generate aggregated prompt content based on the sorted category content and the corresponding index category, and return it to the terminal.
上述搜索方法,在监听到查询页面中的输入区中输入的搜索词时,实时展示包括搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目,以及各索引类目下的搜索词相关的类目内容的聚合提示内容。即,在搜索输入过程中,既能够提示与搜索词相关的内容,又能够提示搜索引擎所支持的索引类目,从而能够让用户知晓搜索引擎相关的信息,因此,相较于仅对输入的词语进行辅助提示以外,能够提供更多的搜索相关信息,提高了所表达的信息量。The aforementioned search method, upon detecting a search term entered in the input area of the query page, displays in real-time aggregated suggestions, including index categories supported by the search engine that are relevant to the search intent of the term, as well as category content related to the search term within each index category. In other words, during the search input process, it provides suggestions not only for content related to the search term but also for the index categories supported by the search engine, thereby informing the user of relevant search engine information. Therefore, compared to simply providing auxiliary suggestions for the entered words, it offers more search-related information and increases the amount of information conveyed.
在一个实施例中,输入区为搜索框。步骤204中当监听到在所述输入区中输入搜索词时,则在所述查询页面中,实时展示与输入的搜索词对应的聚合提示内容,包括:当监听到在所述搜索框中输入搜索词时,则在所述搜索框下展示下拉框,并在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示。In one embodiment, the input area is a search box. In step 204, when a search term is detected being entered in the input area, aggregated suggestions corresponding to the entered search term are displayed in real time on the query page. This includes: when a search term is detected being entered in the search box, a drop-down list is displayed below the search box, and the category content related to the search term under the multiple index categories is displayed sequentially in the drop-down list.
其中,按序展示,是指多个索引类目下的与搜索词相关的类目内容之间按顺序排列展示。需要说明的是,这里的按序展示,是指不同索引类目的类目内容之间的排列顺序。In this context, "displaying in sequence" refers to arranging the category content related to the search term within multiple index categories in a specific order. It's important to note that "displaying in sequence" here refers to the order in which the category content is arranged across different index categories.
在一个实施例中,不同索引类目下展示的类目内容,可以按照类目内容与用户画像之间的匹配度、类目内容所属索引类目对应的类目意图强度、以及类目内容与搜索词之间的相关度中的至少一种进行排序。In one embodiment, the category content displayed under different index categories can be sorted according to at least one of the following: the matching degree between the category content and the user profile, the category intent strength corresponding to the index category to which the category content belongs, and the relevance between the category content and the search term.
可以理解,在同一个索引类目下展示的类目内容可以为预设数量。预设数量可以为至少一个。同一个索引类目下展示的类目内容,可以是按照类目内容与搜索词之间的文本相关性由高到低的顺序,筛选出来的预设数量的类目内容。预设数量为可以理解,当预设数量为1时,则在同一个索引类目下展示的是,该索引类目下与搜索词之间的文本相关性最高的类目内容。It is understandable that the number of category contents displayed under the same index category can be a preset number. The preset number can be at least one. The category contents displayed under the same index category can be a preset number of category contents selected according to the order of textual relevance between the category contents and the search term from high to low. The preset number can be understood as follows: when the preset number is 1, the category contents displayed under the same index category have the highest textual relevance to the search term.
需要说明的是,类目内容皆是按照相应的索引类目进行分类展示,即同一索引类目的类目内容皆与所属的索引类目对应展示。It should be noted that the category content is displayed according to the corresponding index category, that is, the category content of the same index category is displayed according to the corresponding index category.
上述实施例中,对类目内容进行排序后生成聚合提示内容,提高了提示的准确性和可参考性。In the above embodiments, the aggregated prompts are generated after sorting the category content, which improves the accuracy and reference value of the prompts.
在一个实施例中,所述在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示包括:在所述下拉框中,按照所述类目内容与用户画像之间的匹配度由高到低的顺序,展示所述类目内容,且对应于所述类目内容展示相应的索引类目。In one embodiment, displaying the category content related to the search term under the multiple index categories in the drop-down box in sequence includes: displaying the category content in the drop-down box in descending order of matching degree between the category content and the user profile, and displaying the corresponding index category corresponding to the category content.
其中,所述用户画像,是指输入所述搜索词的用户的用户画像。多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The user profile refers to the user profile of the user who inputs the search term. Multiple index categories are those index categories supported by the search engine that are related to the query intent of the search term.
在一个实施例中,输入用户搜索词的用户,可以是登录账号的用户或者选定的用于进行搜索操作的用户。In one embodiment, the user who enters the search term can be a user with a logged-in account or a user selected to perform the search operation.
具体地,服务器可以确定多个索引类目下与搜索词相关的类目内容与用户画像之间的匹配度,然后,按照匹配度由高到低的顺序,对类目内容进行排序,将排序后的类目内容和所属的索引类目,对应返回至终端。终端则可以在下拉框中,按照类目内容与用户画像之间的匹配度由高到低的顺序,展示类目内容,且对应于类目内容展示相应的索引类目。Specifically, the server can determine the matching degree between category content related to the search term and the user profile under multiple index categories. Then, it sorts the category content in descending order of matching degree and returns the sorted category content and its corresponding index category to the terminal. The terminal can then display the category content in a drop-down menu, ordered from highest to lowest matching degree between the category content and the user profile, and display the corresponding index category for each category content.
可以理解,索引类目之间的排列顺序与所述索引类目下的类目内容之间的排列顺序是一致的。It is understood that the order in which the index categories are arranged is consistent with the order in which the content of the categories under the index categories is arranged.
在一个实施例中,服务器可以通过预先训练的匹配模型,对类目内容进行语义分析,提取语义特征,然后将提取的语义特征和用户画像之间进行匹配度计算,得到各类目内容与用户画像之间的匹配度。In one embodiment, the server can perform semantic analysis on category content using a pre-trained matching model, extract semantic features, and then calculate the matching degree between the extracted semantic features and the user profile to obtain the matching degree between each category of content and the user profile.
可以理解,这样一来,针对同一搜索词、不同用户提示的类目内容的优先顺序不同,从而个性化提示。This is understandable, as it allows for different users to receive different priority levels of category content for the same search term, thus enabling personalized suggestions.
为了便于理解个性化提示,现举例说明。比如,针对同一搜索词“**直播”,得到类目内容为与该直播相关的公众账号、小程序、描述该实现直播的技术的科技文章和相应的直播视频画面。如果用户是互联网行业的技术人员(比如,其经常浏览科技博文或者经常查看与直播相关的技术资料),那么,在其搜索“**直播”时,则会将描述该实现直播的技术的科技文章排在下拉框的第一位,因为,该科技文章是与该用户画像的匹配度最高的类目内容。假设用户是专业的直播主,那么,在其搜索“**直播”时,则小程序会排在科技文章之前。又假如用户是喜欢看直播的普通用户,那么,在其搜索“**直播”时,则可以将小程序或直播视频画面排在前面,将科技文章排在最后。To better illustrate personalized suggestions, let's look at an example. For instance, for the same search term "**live stream," the results would include public accounts, mini-programs, tech articles describing the technology used to enable the live stream, and corresponding live stream video footage. If the user is a tech professional (e.g., someone who frequently browses tech blogs or reviews live stream-related technical materials), the tech article describing the technology used to enable the live stream would appear first in the dropdown menu, as it best matches their profile. If the user is a professional live streamer, the mini-program would appear before the tech article. Conversely, if the user is a casual live streamer, the mini-program or live stream video footage would appear first, with the tech article last.
上述实施例中,通过不同索引类目下的类目内容与用户画像之间的匹配度,来对类目内容进行排序,从而能够在输入过程中,针对同一搜索词、不同用户进行个性化提示,即不同用户提示的类目内容的优先顺序不同,提高了输入过程中提示的准确性。In the above embodiments, the category content is sorted by the matching degree between the category content under different index categories and the user profile, so that personalized prompts can be given to different users for the same search term during the input process. That is, the priority of the category content prompted to different users is different, which improves the accuracy of prompts during the input process.
在一个实施例中,所述在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示包括:在所述下拉框中,按照所述多个索引类目所对应的类目意图强度由高到低的顺序,展示所述多个索引类目和所述多个索引类目下的所述类目内容。In one embodiment, displaying the category content related to the search term under the plurality of index categories in the drop-down box in sequence includes: displaying the plurality of index categories and the category content under the plurality of index categories in the drop-down box according to the category intent intensity corresponding to the plurality of index categories from high to low.
其中,类目意图强度,用于表征索引类目与搜索词的查询意图之间的相关程度。Category intent strength is used to characterize the degree of correlation between the index category and the query intent of the search term.
在一个实施例中,类目意图强度,可以根据类目意图分确定。即,类目意图强度可以是类目意图分本身,或者根据类目意图分计算得到。In one embodiment, the category intent strength can be determined based on the category intent score. That is, the category intent strength can be the category intent score itself, or it can be calculated based on the category intent score.
具体地,服务器可以分析多个索引类目与搜索词的查询意图之间的相关性,得到各个索引类目对应的类目意图强度,然后,按照类目意图强度由高到低的顺序,对索引类目下的类目内容进行排序,将排序后的类目内容和所属的索引类目,对应返回至终端。终端则可以在下拉框中,按照多个索引类目所对应的类目意图强度由高到低的顺序,展示多个索引类目和多个索引类目下的类目内容。Specifically, the server can analyze the relevance between multiple index categories and the query intent of search terms to obtain the category intent strength corresponding to each index category. Then, it sorts the category content under each index category in descending order of category intent strength and returns the sorted category content and its corresponding index category to the terminal. The terminal can then display multiple index categories and their category content in a drop-down menu, ordered from highest to lowest category intent strength.
可以理解,这样一来,可以将与用户查询意图最相关的索引类目和相应的类目内容推荐给用户,提高推荐准确性。This is understandable, as it allows the system to recommend the most relevant index categories and corresponding content to users, thereby improving recommendation accuracy.
在一个实施例中,所述在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示包括:在所述下拉框中,按照所述类目内容与搜索词之间的相关度由高到低的顺序,展示所述类目内容,且对应于所述类目内容展示相应的索引类目。所述相关度,根据所述类目内容所对应的索引类目的类目意图强度、所述类目内容和所述搜索词之间的文本相关性、以及所述类目内容的时效性中的至少一种确定。In one embodiment, displaying the category content related to the search term under the plurality of index categories in the drop-down list in sequence includes: displaying the category content in the drop-down list according to the order of high to low relevance between the category content and the search term, and displaying the corresponding index category corresponding to the category content. The relevance is determined based on at least one of the following: the category intent strength of the index category corresponding to the category content, the textual relevance between the category content and the search term, and the timeliness of the category content.
其中,相关度,用于表征类目内容和搜索词之间的相关性。Among them, relevance is used to characterize the correlation between category content and search terms.
需要说明的是,相关度可以用于表征类目内容和搜索词之间的文本相关性,也可以用于表征类目内容和搜索词之间的综合相关性。It should be noted that relevance can be used to characterize the textual relevance between category content and search terms, or it can be used to characterize the overall relevance between category content and search terms.
在一个实施例中,相关度,可以根据所述类目内容所对应的索引类目的类目意图强度、所述类目内容和所述搜索词之间的文本相关性、以及所述类目内容的时效性中的至少一种确定。In one embodiment, relevance can be determined based on at least one of the following: the category intent strength of the index category corresponding to the category content, the textual relevance between the category content and the search term, and the timeliness of the category content.
在一个实施例中,服务器可以对类目内容和所述搜索词之间的文本相关性进行评分,得到文本相关分、对类目内容的时效性进行评分得到内容时效分、以及获取类目内容所对应的索引类目的类目意图分,然后对文本相关分、内容时效分以及类目意图分进行加权求和,得到综合分数作为类目内容和搜索词之间的相关度。In one embodiment, the server can score the text relevance between category content and the search term to obtain a text relevance score, score the timeliness of category content to obtain a content timeliness score, and obtain the category intent score of the index category corresponding to the category content. Then, the server can perform a weighted summation of the text relevance score, content timeliness score, and category intent score to obtain a comprehensive score as the relevance between the category content and the search term.
可以理解,在时效性维度评分时,时效性强的类目内容的内容时效分高于时效性弱的类目内容。比如,对于新闻内容来说,与搜索词匹配的新闻中,最新的新闻比已经过时的新闻的内容时效分要高。It's understandable that when scoring based on timeliness, content from categories with higher timeliness scores higher than content from categories with lower timeliness. For example, in the case of news content, the most recent news matching the search term scores higher than outdated news.
具体地,服务器可以按照相关度由高到低的顺序,对各类目内容进行排序,将排序后的类目内容和所属的索引类目,对应返回至终端。终端则可以在下拉框中,按照所述类目内容与搜索词之间的相关度由高到低的顺序,展示所述类目内容,且对应于所述类目内容展示相应的索引类目。Specifically, the server can sort the content of each category in descending order of relevance, and return the sorted category content and its corresponding index category to the terminal. The terminal can then display the category content in a drop-down menu, in descending order of relevance between the category content and the search term, and display the corresponding index category for each category content.
在一个实施例中,所述聚合提示内容,还包括与所述搜索词相关的意图关联提示词和搜索结果查看入口中的至少一种。可以理解,当生成内容聚合卡片时,内容聚合卡片中除了展示多个索引类目和相应召回的类目内容之外,还可以展示意图关联提示词和搜索结果查看入口中的至少一种。In one embodiment, the aggregated prompt content further includes at least one of intent-related prompts and a search result viewing entry point. It can be understood that when generating a content aggregation card, in addition to displaying multiple index categories and corresponding recalled category content, the content aggregation card may also display at least one of intent-related prompts and a search result viewing entry point.
其中,所述意图关联提示词,是与所述查询意图相关的提示词。The intent-related prompts are prompts related to the query intent.
在一个实施例中,终端将搜索词发送至服务器后,服务器除了从搜索引擎所支持的索引类目中筛选与所述搜索词的查询意图相关的索引类目及进行相应类目内容的召回处理以外,还可以查找与该查询意图相关的意图关联提示词。In one embodiment, after the terminal sends the search term to the server, in addition to filtering the index categories related to the query intent of the search term from the index categories supported by the search engine and performing the corresponding category content recall process, the server can also search for intent-related prompt words related to the query intent.
在一个实施例中,意图关联提示词,可以包括关联关键词和扩展关键词中的至少一种。关联关键词,是指包括搜索词中的关键词的词语。即,关联关键词中有一部分内容是搜索词中的关键词。扩展关键词,是搜索词中的关键词中不包括的、但与查询意图相关的关键词。即,扩展关键词中不包括搜索词中的关键词。In one embodiment, the intent-related prompt may include at least one of related keywords and extended keywords. Related keywords are words that include keywords from the search term. That is, a portion of the related keywords is a keyword from the search term. Extended keywords are keywords that are not included in the search term but are relevant to the query intent. That is, extended keywords do not include keywords from the search term.
可以理解,原生关键词和扩展关键词,都可以作为用户在输入搜索词时的意图关联提示词,以对用户的搜索查询提供辅助。It is understandable that both native keywords and extended keywords can serve as intent-related suggestions when users enter search terms, thus assisting their search queries.
在一个实施例中,服务器还可以生成搜索结果查看入口。搜索结果查看入口,用于引导进入搜索结果页查看完整搜索结果的入口。In one embodiment, the server can also generate a search result viewing entry point. This entry point directs users to the search results page to view the complete search results.
在一个实施例中,输入区为搜索框。当监听到在输入区中输入搜索词时,则可以在搜索框下方预设位置处展示内容聚合卡片,以实时展示与输入的搜索词对应的聚合提示内容。In one embodiment, the input area is a search box. When a search term is detected in the input area, a content aggregation card can be displayed at a preset position below the search box to show aggregated suggestions corresponding to the entered search term in real time.
图3为一个实施例中展示聚合提示内容的界面示意图。参照图3,虚线框302中所示的即为搜索框。在搜索框中输入“微信公开课”这一搜索词。可以理解,图3中虚线框304中的内容即为内容聚合卡片中展示的聚合提示内容。虚线框306中的“公众号”、“小程序”、“文章”和“视频”即为搜索引擎所支持的索引类目中的、且与搜索词的查询意图相关的索引类目,其各自后面对应的内容即为在该索引类目下的与搜索词相关的类目内容。比如,“公众号”索引类目后的“微信公开课”即为召回的与搜索词“微信公开课”相关的公众账号信息。即,通过搜索发现存在一个叫做“微信公开课”的公众账号。同样地,从图3可知,通过搜索发现存在“微信公开课+”的小程序、“***微信公开课PRO演讲”的文章,以及“2020微信公开课”的视频。图3中“相关搜索”即表示符合查询意图的提示词搜索结果,“相关搜索”后面的“微信公开课PRO”以及“微信短内容”即为意图关联提示词。其中,“微信公开课PRO”即为关联关键词,其包括搜索词“微信公开课”这一关键词。“微信短内容”即为扩展关键词,其不包括搜索词“微信公开课”这一关键词,但“微信短内容”与查询意图相关的扩展出的关键词。图3中的“探索更多搜索结果”即为搜索结果查看入口。即为可以在内容聚合卡片底部展示搜索结果查看入口。Figure 3 is a schematic diagram of the interface displaying aggregated prompts in one embodiment. Referring to Figure 3, the dashed box 302 shows the search box. Enter the search term "WeChat Open Class" in the search box. It can be understood that the content in dashed box 304 in Figure 3 is the aggregated prompt content displayed in the content aggregation card. The "Official Accounts," "Mini Programs," "Articles," and "Videos" in dashed box 306 are index categories supported by the search engine that are related to the query intent of the search term. The content following each category is the category content related to the search term under that index category. For example, "WeChat Open Class" after the "Official Accounts" index category is the retrieved public account information related to the search term "WeChat Open Class." That is, a public account called "WeChat Open Class" is found through the search. Similarly, as shown in Figure 3, a search reveals the existence of the "WeChat Open Class+" mini program, the article "***WeChat Open Class PRO Speech," and the video "2020 WeChat Open Class." In Figure 3, "Related Searches" represents search results for keywords that match the query intent. "WeChat Open Class PRO" and "WeChat Short Content" following "Related Searches" are intent-related keywords. "WeChat Open Class PRO" is a related keyword that includes the search term "WeChat Open Class." "WeChat Short Content" is an expanded keyword that does not include the search term "WeChat Open Class," but is related to the query intent. "Explore More Search Results" in Figure 3 is the entry point to view search results. This means the search results can be viewed at the bottom of the content aggregation card.
上述实施例中,在聚合提示内容中,除了有多条强相关的类目及类目内容以外,还展示有意图关联提示词和/或搜索结果查看入口,从而,还可以通过意图关联提示词的智能提示,缩短用户的输入路径,提高搜索效率。此外,当展示搜索结果查看入口时,还可以在聚合提示内容中提示的内容(类目内容和意图关联提示词)不符合用户需求时,进入搜索结果页,从而看到完整搜索结果,提高搜索成功率。In the above embodiments, in addition to multiple strongly related categories and category content, the aggregated suggestion content also displays intent-related suggestion words and/or search result viewing entry points. Therefore, intelligent suggestions using intent-related suggestion words can shorten the user's input path and improve search efficiency. Furthermore, when displaying the search result viewing entry point, if the content suggested in the aggregated suggestion content (category content and intent-related suggestion words) does not meet the user's needs, the user can enter the search results page to see the complete search results, thus improving the search success rate.
在一个实施例中,所述聚合提示内容,还包括与所述搜索词相关的意图关联提示词。该方法还包括:当监听到对聚合提示内容中展示的类目内容的触发操作时,则跳转至与所触发的类目内容对应的页面;其中,跳转至的页面,符合所触发的类目内容所属的索引类目。In one embodiment, the aggregated suggestion content further includes intent-related suggestion words associated with the search term. The method also includes: when a triggering operation on category content displayed in the aggregated suggestion content is detected, redirecting to the page corresponding to the triggered category content; wherein the redirected page matches the index category to which the triggered category content belongs.
可以理解,用户可以对展示的聚合提示内容进行触发(比如点击按压等触发操作)。It's understandable that users can trigger the displayed aggregated prompts (such as by clicking or pressing them).
具体地,用户可以对聚合提示内容中展示的类目内容进行触发操作,终端当监听到对聚合提示内容中展示的类目内容的触发操作时,则跳转至与所触发的类目内容对应的页面。Specifically, users can trigger operations on the category content displayed in the aggregated prompts. When the terminal detects a trigger operation on the category content displayed in the aggregated prompts, it will redirect to the page corresponding to the triggered category content.
在一个实施例中,终端当监听到对聚合提示内容中展示的类目内容的触发操作时,可以向服务器请求所要跳转的页面信息。服务器则可以确定所触发的类目内容对应的页面信息,并将页面信息反馈至终端。终端则根据该页面信息,从查询页面跳转至与所触发的类目内容对应的页面。In one embodiment, when a terminal detects a triggering action on category content displayed in an aggregated prompt, it can request the page information to be navigated to from the server. The server can then determine the page information corresponding to the triggered category content and send the page information back to the terminal. The terminal then navigates from the query page to the page corresponding to the triggered category content based on this page information.
可以理解,跳转至的页面,符合所触发的类目内容所属的索引类目。比如,触发的是子应用索引类目下的类目内容,则跳转至的页面即为符合子应用索引类目的页面。结合图3举例,假设触发的是小程序索引类目下的“微信公开课+”这一类目内容,则可以跳转至“微信公开课+”这一小程序页面。It's understandable that the page redirected to matches the index category to which the triggered content belongs. For example, if the triggered content is from a sub-application index category, then the page redirected to will match that sub-application index category. Referring to Figure 3, assuming the triggered content is from the "WeChat Open Class+" category under the Mini Program index category, then the user will be redirected to the "WeChat Open Class+" Mini Program page.
在一个实施例中,与所触发的类目内容对应的页面,可以包括所触发的类目内容的落地页和账号入口页中的任意一种。其中,落地页,又称着陆页或引导页。在互联网中,落地页是利用搜索引擎搜索后显示给用户的网页。账号入口,即为用于登录账号的页面。In one embodiment, the page corresponding to the triggered category content can include either a landing page for the triggered category content or an account entry page. A landing page, also known as a landing page or onboarding page, is a webpage displayed to users after a search engine result is obtained on the internet. An account entry page is the page used to log in to an account.
上述实施例中,在输入过程中,通过聚合提示内容提示与查询意图相关的类目内容,用户对提示的类目内容触发后,直接跳转至所触发的类目内容对应的页面,能够更快捷地进入到具体页面,而不需要先进入搜索结果页再选择具体页面进入,提高了搜索查询效率。In the above embodiments, during the input process, aggregated prompts suggest category content related to the query intent. After the user triggers the suggested category content, they are directly redirected to the page corresponding to the triggered category content, enabling them to enter the specific page more quickly, without having to enter the search results page first and then select the specific page, thus improving search efficiency.
在一个实施例中,所述聚合提示内容,还包括搜索结果查看入口。该方法还包括:当监听到针对意图关联提示词的触发操作时,则跳转进入至第一搜索结果页;其中,在第一搜索结果页中,展示有以所触发的意图关联提示词为搜索条件,所搜索到的搜索结果。In one embodiment, the aggregated prompt content further includes a search result viewing entry. The method also includes: when a triggering operation targeting an intent-related prompt word is detected, redirecting to a first search result page; wherein the first search result page displays search results retrieved using the triggered intent-related prompt word as the search condition.
具体地,用户还可以触发(比如,点击)聚合提示内容中的意图关联提示词。终端当监听到针对意图关联提示词的触发操作时,则可以以该意图关联提示词为搜索条件,向服务器发起搜索请求,服务器则可以以所触发的意图关联提示词为搜索条件进行搜索,并根据搜索结果生成第一搜索结果页反馈至终端。终端则可以从查询页面跳转至第一搜索结果页。Specifically, users can also trigger (e.g., click) intent-related prompts within the aggregated prompt content. When the terminal detects a trigger action targeting an intent-related prompt, it can send a search request to the server using that prompt as a search criterion. The server can then perform a search using the triggered intent-related prompt as the search criterion and generate a first search results page, which is then sent back to the terminal. The terminal can then navigate from the query page to the first search results page.
上述实施例中,通过意图关联提示词的智能提示,不需要输入完整的语句,直接选择意图关联提示词即可进行搜索查询,缩短了用户的输入路径,提高了搜索效率。In the above embodiments, through intelligent suggestions based on intent association keywords, users can directly select intent association keywords to perform search queries without needing to input complete sentences, thus shortening the user's input path and improving search efficiency.
在一个实施例中,该方法还包括:当监听到针对搜索结果查看入口的触发操作时,则跳转进入至第二搜索结果页;其中,在第二搜索结果页中,展示有以搜索词为搜索条件,所搜索得到的搜索结果。In one embodiment, the method further includes: when a trigger operation targeting the search results viewing entry is detected, redirecting to a second search results page; wherein, the second search results page displays the search results obtained using the search term as the search criteria.
具体地,用户还可以触发(比如,点击)聚合提示内容中的搜索结果查看入口。终端当监听到针对搜索结果查看入口的触发操作时,则可以以输入的该搜索词为搜索条件,向服务器发起搜索请求,服务器则可以以搜索词为搜索条件进行搜索,并根据搜索结果生成第二搜索结果页反馈至终端。终端则可以从查询页面跳转至第二搜索结果页。Specifically, users can also trigger (for example, by clicking) the search results viewing entry in the aggregated prompt content. When the terminal detects a trigger action targeting the search results viewing entry, it can send a search request to the server using the entered search term as the search criteria. The server can then perform a search using the search term as the search criteria and generate a second search results page, which is then sent back to the terminal. The terminal can then jump from the query page to the second search results page.
上述实施例中,当提示的内容不符合需求时,可以直接触发搜索结果查看入口来查看完整搜索结果,提高搜索成功率。In the above embodiments, when the suggested content does not meet the requirements, the search results viewing entry can be directly triggered to view the complete search results, thereby improving the search success rate.
在一个实施例中,聚合提示内容中的多个索引类目,是通过对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分,并根据各索引类目所对应的类目意图分,从搜索引擎所支持的索引类目中筛选出的索引类目。In one embodiment, the multiple index categories in the aggregated prompt content are obtained by performing category intent analysis on the search terms to obtain the category intent score corresponding to each index category supported by the search engine, and then filtering the index categories from the index categories supported by the search engine based on the category intent score corresponding to each index category.
其中,类目意图分,用于表征搜索词的查询意图与索引类目之间的相关程度。Among them, the category intent score is used to characterize the degree of relevance between the query intent of the search term and the index category.
可以理解,搜索词的查询意图与索引类目之间的相关程度的高低,与类目意图分正相关。索引类目对应的类目意图分越高,表示搜索词的查询意图与该索引类目之间的相关程度越高,反之,索引类目对应的类目意图分越低,表示搜索词的查询意图与该索引类目之间的相关程度越低。It's understandable that the relevance between the search term's query intent and the indexed category is positively correlated with the category intent score. A higher category intent score indicates a higher relevance between the search term's query intent and that indexed category; conversely, a lower category intent score indicates a lower relevance.
具体地,终端将搜索词发送至服务器之后,搜索词,用于指示服务器对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分,并根据各索引类目所对应的类目意图分,从搜索引擎所支持的索引类目中筛选出的索引类目。即,根据类目意图分,从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的索引类目。Specifically, after the terminal sends the search terms to the server, the search terms instruct the server to perform category intent analysis on the search terms, obtain the category intent score corresponding to each index category supported by the search engine, and then select the index categories from the index categories supported by the search engine based on the category intent score. That is, based on the category intent score, the server selects index categories related to the query intent of the search terms from the index categories supported by the search engine.
在一个实施例中,终端将搜索词发送至服务器之后,搜索词,用于指示服务器将搜索词输入预先训练的分类模型中,预测该搜索词在搜索引擎支持的各索引类目下对应的分数,作为搜索引擎支持的各索引类目所对应的类目意图分。In one embodiment, after the terminal sends the search term to the server, the search term is used to instruct the server to input the search term into a pre-trained classification model to predict the score corresponding to the search term under each index category supported by the search engine, which is used as the category intent score corresponding to each index category supported by the search engine.
在一个实施例中,服务器中预先构建了不同类别的数据库。每个数据库中预先存储有至少一种索引类目的类目内容,同一种索引类目下的类目内容存储在同一个数据库中。各类目内容具有对应的索引类目标注。可以理解,索引类目标注所表征的索引类目为搜索引擎所支持的索引类目。In one embodiment, different types of databases are pre-built in the server. Each database pre-stores category content for at least one index category, with category content under the same index category stored in the same database. Each category of content has a corresponding index category annotation. It can be understood that the index category represented by the index category annotation is the index category supported by the search engine.
在一个实施例中,数据库中,有些类目内容可能已经有索引类目标签,有些类目内容没有索引类目标签。这种情况下,就可以通过人工添加、分析用户行为日志以及模式串匹配处理,来为没有索引类目标签的类目内容添加相应的索引类目标签。In one embodiment, some categories in the database may already have indexed category tags, while others may not. In this case, the appropriate indexed category tags can be added to the categories that do not have them by manual addition, analysis of user behavior logs, and pattern string matching.
在一个实施例中,服务器可以从用户行为日志中,提取用户在历史搜索过程中所点击的索引类目,并将该点击的索引类目作为其所搜索到的类目内容的索引类目标签。In one embodiment, the server can extract the index categories clicked by the user during the historical search process from the user behavior log, and use the clicked index categories as the index category tags for the category content searched by the user.
在一个实施例中,模式串匹配处理,是指将没有添加索引类目标签的类目内容提取固定查询模式串,然后将该固定查询模式串与已经添加索引类目标签的类目内容进行匹配,将匹配到的类目内容所对应的索引类目标签,作为该没有添加索引类目标签的类目内容的索引类目标签。In one embodiment, pattern string matching processing refers to extracting a fixed query pattern string from the category content that has not been indexed by a target tag, then matching the fixed query pattern string with the category content that has been indexed by a target tag, and using the target tag corresponding to the matched category content as the target tag of the category content that has not been indexed by a target tag.
服务器可以预先根据数据库中存储的预设类目内容和相应的索引类目标注,进行机器学习训练,以训练得到分类模型。然后,服务器可以将终端发送的搜索词输入该预先训练的分类模型中,预测该搜索词在搜索引擎支持的各索引类目下对应的分数,即得到搜索引擎支持的各索引类目所对应的类目意图分。The server can pre-train a classification model using machine learning based on preset category content and corresponding index category annotations stored in the database. Then, the server can input the search terms sent by the terminal into this pre-trained classification model to predict the score of the search term under each index category supported by the search engine, thus obtaining the category intent score corresponding to each index category supported by the search engine.
在一个实施例中,分类模型可以是基于FastText或XGBoost的分类模型。In one embodiment, the classification model can be a classification model based on FastText or XGBoost.
在一个实施例中,预先构建的不同类别的数据库,可以包括实体库(即分类数据库)和词表。In one embodiment, pre-built databases of different categories may include entity libraries (i.e., classification databases) and vocabularies.
其中,实体是客观存在并可相互区别的事物。就数据库而言,实体往往指某类事物的集合。把每一类数据对象的个体称为实体。可以理解,实体库,是预先通过对预设类目内容聚合分类,构建的分类数据库。属于同一索引类目的类目内容被聚合在同一实体库中。In this context, an entity is an objectively existing and distinguishable thing. In the context of databases, an entity often refers to a collection of things of a certain kind. Each individual data object of a category is called an entity. An entity database can be understood as a pre-built categorized database constructed by aggregating and classifying content according to preset categories. Category content belonging to the same index category is aggregated in the same entity database.
可以理解,实体库中的数据内容即为名称。It is understandable that the data content in the entity database is the name.
在一个实施例中,实体库可以包括账号实体库和名称实体库。比如,公众账号、小程序账号类的名称可以分别组成两个账号实体库,音乐名称和小说名称等也可以分别组成对应的名称实体库。可以理解,实体库的聚合分类,可以根据实际需求进行设定,对此不做限定。可以理解,每种实体库都可以为一个或多个。In one embodiment, the entity library may include an account entity library and a name entity library. For example, the names of public accounts and mini-program accounts can each form two separate account entity libraries, and music titles and novel titles can also form corresponding name entity libraries. It is understood that the aggregation and categorization of the entity libraries can be set according to actual needs and is not limited thereto. It is also understood that each type of entity library can consist of one or more entities.
以图4为例,图4中的示例中,“账号库”、“音乐曲库”以及“游戏名”即为不同的实体库的名称(即不同的索引类目),“微信运动”以及“王者荣耀”等即为实体库中具体存储的名称(即类目内容)。Taking Figure 4 as an example, in the example in Figure 4, "Account Library", "Music Library" and "Game Name" are the names of different entity libraries (i.e., different index categories), while "WeChat Sports" and "Honor of Kings" are the specific names stored in the entity libraries (i.e., category content).
词表,是预先通过对预设类目内容聚合分类得到的。属于同一索引类目的类目内容被聚合在同一词表中。A thesaurus is a pre-defined collection of content from pre-defined categories. Content belonging to the same index category is grouped into the same thesaurus.
在一个实施例中,词表可以包括模式词表、资讯词表、问答词表、工具词表和词条词表中的至少一种。可以理解,每种词表都可以为一个或多个。In one embodiment, the thesaurus may include at least one of a pattern thesaurus, an information thesaurus, a question-and-answer thesaurus, a tool thesaurus, and an entry thesaurus. It is understood that each type of thesaurus may be one or more.
在一个实施例中,服务器可以根据人工预先配置的固定查询模式串。也可以从预设类目内容进行聚合分析,提取固定查询模式串。从而根据固定查询模式串得到模式词表。可以理解,固定查询模式串支持正则等匹配方式(即满足通用的正则匹配表达式)。如图4所示,“天气模式”以及“路线查询”都是有固定查询模式串的,所以,可以分别构建相应的词表。比如,天气模式的固定查询模式串为“**天气”,那么,在查询不同的城市或地区时,只需要替换掉**中的内容即可,所以是具有固定查询模式串的,因而,可以针对天气模式创建相应的词表。In one embodiment, the server can extract fixed query pattern strings based on manually pre-configured fixed query pattern strings. Alternatively, it can perform aggregation analysis on preset category content to extract fixed query pattern strings, thereby obtaining a pattern vocabulary based on these strings. It can be understood that fixed query pattern strings support matching methods such as regular expressions (i.e., satisfying general regular expression matching expressions). As shown in Figure 4, both "Weather Pattern" and "Route Query" have fixed query pattern strings, so corresponding vocabulary lists can be built for them. For example, the fixed query pattern string for the Weather Pattern is "**Weather". Therefore, when querying different cities or regions, only the content in ** needs to be replaced. Thus, it has a fixed query pattern string, and a corresponding vocabulary list can be created for the Weather Pattern.
在一个实施例中,服务器可以对热点资讯标题和内容作聚类分析得到新闻主题词,或者获取人工编辑新闻主题词,然后根据提取的新闻主题词构建新闻词表。如图4所示,对于热点资讯提取新闻主题词得到“新冠肺炎”。In one embodiment, the server can perform cluster analysis on the titles and content of trending news to obtain news keywords, or obtain manually edited news keywords, and then construct a news thesaurus based on the extracted news keywords. As shown in Figure 4, the news keyword extracted from trending news is "COVID-19".
在一个实施例中,服务器可以问答结果标题和内容中提取关键词,得到问答关键词,或获取人工编辑的问答关键词,然后根据获取的问答关键词,构建问答词表。如图4所示,医疗问答中提取问答关键词得到“感冒怎么办”。In one embodiment, the server can extract keywords from the question-and-answer result title and content to obtain question-and-answer keywords, or obtain manually edited question-and-answer keywords, and then construct a question-and-answer thesaurus based on the obtained question-and-answer keywords. As shown in Figure 4, extracting question-and-answer keywords from medical questions yields "What to do about a cold".
在一个实施例中,服务器可以对词条词库中的词条进行聚合分析,得到词条词表。如图4所示,“景点百科”和“人物百科”即为聚合分析的结果,其中,“故宫”即为具体的词条内容。In one embodiment, the server can perform aggregation analysis on entries in the thesaurus to obtain an entry thesaurus. As shown in Figure 4, "Scenic Spots Encyclopedia" and "Biographies Encyclopedia" are the results of the aggregation analysis, where "Forbidden City" is the specific entry content.
上述实施例中,通过对搜索词进行类目意图分析,并根据各索引类目所对应的类目意图分,从搜索引擎所支持的索引类目中筛选出的索引类目。能够更为准确地筛选出符合用户查询意图的索引类目,提高了输入过程中提示的准确性和相关性。In the above embodiments, by performing category intent analysis on search terms and filtering index categories from those supported by the search engine based on the category intent scores corresponding to each index category, index categories that match the user's query intent can be filtered more accurately, improving the accuracy and relevance of suggestions during the input process.
在一个实施例中,聚合提示内容中的多个索引类目,是对搜索引擎支持的各索引类目所对应的类目意图分进行排序后,筛选出的类目意图分大于或等于预设意图分阈值的前预设数量的索引类目。In one embodiment, the multiple index categories in the aggregated prompt content are the first preset number of index categories whose category intent scores are greater than or equal to a preset intent score threshold after sorting the category intent scores corresponding to each index category supported by the search engine.
可以理解,本实施例中,搜索词,还用于指示服务器对搜索引擎支持的各索引类目所对应的类目意图分进行排序,筛选类目意图分大于或等于预设意图分阈值的前预设数量的索引类目。预设数量可以根据实际需要设定。It is understood that in this embodiment, the search term is also used to instruct the server to sort the category intent scores corresponding to each index category supported by the search engine, and to filter out the top preset number of index categories whose category intent scores are greater than or equal to a preset intent score threshold. The preset number can be set according to actual needs.
具体地,服务器对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分之后,可以对搜索引擎支持的各索引类目所对应的类目意图分按照由高到低的顺序进行排序,然后筛选类目意图分大于或等于预设意图分阈值的前预设数量的索引类目。需要说明的是,筛选出的索引类目,需要同时满足两个条件,即该索引类目所对应的类目意图分要大于或等于预设意图分阈值,而且,该索引类目的排序要在前预设位次。Specifically, the server performs category intent analysis on the search terms, obtaining the category intent score corresponding to each index category supported by the search engine. Then, it sorts these category intent scores from highest to lowest, and selects the top preset number of index categories whose category intent scores are greater than or equal to a preset threshold. It should be noted that the selected index categories must simultaneously meet two conditions: their category intent score must be greater than or equal to the preset threshold, and they must be ranked within the top preset positions.
图5为一个实施例中索引类目筛选示意图。参照图5,对于搜索词“微信公开课”来说,按照其在搜索引擎所支持的索引类目下的类目意图分由高到低的顺序进行排序。假设,预设意图分阈值为0.5,前4个索引类目(即预设数量为4),那么,则可以筛选出“公众账号”、“小程序”、“新闻”以及“视频”这4个索引类目。Figure 5 is a schematic diagram of index category filtering in one embodiment. Referring to Figure 5, for the search term "WeChat Open Class", the categories are sorted in descending order of their category intent scores under the index categories supported by the search engine. Assuming that the preset intent score threshold is 0.5, and the first 4 index categories (i.e., the preset quantity is 4), then the 4 index categories "Public Accounts", "Mini Programs", "News", and "Videos" can be filtered out.
上述实施例中,对搜索引擎支持的各索引类目所对应的类目意图分进行排序后,筛选出的类目意图分大于或等于预设意图分阈值的前预设数量的索引类目。能够更为准确地、快速地筛选出符合用户查询意图的索引类目,提高了输入过程中提示的准确性和相关性。In the above embodiments, after sorting the category intent scores corresponding to each index category supported by the search engine, the top preset number of index categories with category intent scores greater than or equal to a preset intent score threshold are selected. This enables more accurate and faster selection of index categories that match the user's query intent, improving the accuracy and relevance of prompts during the input process.
在一个实施例中,所述类目内容,是根据所筛选的各索引类目所分别对应的召回方式,召回的所筛选的各索引类目下与所述搜索词相关的提示内容。In one embodiment, the category content is the prompt content related to the search term under each of the selected index categories, which is retrieved according to the retrieval method corresponding to each of the selected index categories.
可以理解,本实施例中,搜索词,还用于指示服务器根据搜索词,确定所筛选的各索引类目所分别对应的召回方式,并分别按照对应的召回方式,召回所筛选的各索引类目下与搜索词相关的类目内容。It is understood that in this embodiment, the search term is also used to instruct the server to determine the recall method corresponding to each of the selected index categories based on the search term, and to recall the category content related to the search term under each of the selected index categories according to the corresponding recall method.
需要说明的是,服务器可以将召回的类目内容全部返回,也可以对召回的类目内容进行进阶筛选,进一步筛选出与搜索词的相关度更高的类目内容,然后将筛选出的类目内容返回至终端。It should be noted that the server can return all the recalled category content, or it can perform further filtering on the recalled category content to select category content with higher relevance to the search terms, and then return the filtered category content to the terminal.
具体地,服务器在筛选出索引类目之后,还可以根据搜索词,确定所筛选的各索引类目所分别对应的召回方式。可以理解,不同的索引类目可以采用不同的召回方式,即,用各类别分别对应的召回方式,召回相应的类目内容。Specifically, after filtering the index categories, the server can also determine the corresponding recall method for each filtered index category based on the search terms. In other words, different index categories can use different recall methods; that is, the corresponding category content is retrieved using the recall method specific to each category.
在一个实施例中,召回方式包括实体库召回方式、词表召回方式、以及搜索引擎召回方式中的至少一种。In one embodiment, the recall method includes at least one of entity database recall, vocabulary recall, and search engine recall.
其中,实体库召回方式,用于指示从索引类目所对应的实体库中,召回与搜索词相关的类目内容。Among them, the entity database recall method is used to indicate the recall of category content related to the search term from the entity database corresponding to the index category.
词表召回方式,用于指示从索引类目所对应的词表中,召回与搜索词相关的类目内容。The term recall method is used to indicate the retrieval of category content related to the search term from the terminology corresponding to the indexed category.
搜索引擎召回方式,用于指示从搜索引擎中,召回与搜索词相关的类目内容。Search engine recall methods are used to instruct the retrieval of category content related to the search term from the search engine.
可以理解,可以预先针对每个索引类目设置对应的召回方式。一个索引类目可以对应至少一种召回方式。当一个索引类目对应多个召回方式时,可以随机选取一种召回方式进行召回处理,也可以按照预设先后顺序选取召回方式,当前一个召回方式无法成功召回时,则按照后一个召回方式,召回该索引类目下与搜索词相关的提示内容。It's understandable that a corresponding recall method can be pre-set for each index category. An index category can correspond to at least one recall method. When an index category corresponds to multiple recall methods, one method can be randomly selected for recall processing, or recall methods can be selected in a preset order. If the previous recall method fails, the next recall method will be used to retrieve the relevant prompts for the search term under that index category.
需要说明的是,同一召回方式下也可以包括多种子召回方式。同一召回方式下的子召回方式之间也有优先顺序。It should be noted that a single recall method can include multiple sub-recall methods. There is also a priority order among the sub-recall methods under the same recall method.
在一个实施例中,实体库召回方式可以包括直接召回和创建前缀树召回这两种子召回方式。其中,直接召回,是指直接从实体库中命中召回的方式。创建前缀树召回,是指根据索引类目所属的实体库中的名称创建前缀树,并将搜索词与前缀树中名称前缀进行匹配召回的方式。In one embodiment, the entity database retrieval method may include two sub-retrieval methods: direct retrieval and prefix tree creation retrieval. Direct retrieval refers to directly retrieving data from the entity database. Prefix tree creation retrieval refers to creating a prefix tree based on the names in the entity database to which the indexed category belongs, and then matching the search terms with the prefixes of the names in the prefix tree for retrieval.
在一个实施例中,词表召回方式可以包括与模式词表中固定查询模式串匹配召回的子召回方式,以及从具体词表进行召回的子召回方式。可以理解,与模式词表中固定查询模式串匹配召回,相当于模糊匹配召回,能够提高召回效率,所以优先于从具体词表进行召回。In one embodiment, the vocabulary-based recall method may include a sub-recall method that retrieves responses by matching them with a fixed query pattern string in the pattern vocabulary, and a sub-recall method that retrieves responses from a specific vocabulary. It can be understood that recalling responses by matching them with a fixed query pattern string in the pattern vocabulary is equivalent to fuzzy matching recall, which can improve recall efficiency, and therefore is preferred over recalling responses from a specific vocabulary.
在其他实施例中,还可以通过其他召回方式,比如离线聚合结果召回等召回方式进行召回处理。In other embodiments, recall processing can also be carried out through other recall methods, such as offline aggregation result recall.
图6为一个实施例中召回方式示意图。参照图6,每个索引类目可以对应一个或多个召回方式。比如,公众账号/小程序/音乐/游戏这些索引类目,对应的召回方式包括实体库召回方式和搜索引擎召回方式。其中,实体库召回方式中又包括直接召回和创建前缀(trie)树召回这两种子召回方式。从图6可知,优先采用直接召回方式,从实体库中直接召回对应结果(即类目内容),如果无法直接命中召回,则采用创建前缀树召回的方式,如果仍无法召回对应结果,则可以对实体库名称建立倒排索引,通过搜索引擎系统召回,即采用搜索引擎召回方式进行召回。Figure 6 illustrates a recall method in one embodiment. Referring to Figure 6, each index category can correspond to one or more recall methods. For example, index categories such as public accounts/mini-programs/music/games correspond to entity database recall and search engine recall. The entity database recall method further includes two sub-recall methods: direct recall and trie tree recall. As shown in Figure 6, the direct recall method is preferred, directly recalling the corresponding results (i.e., category content) from the entity database. If direct recall fails, the trie tree recall method is used. If the corresponding results still cannot be recalled, an inverted index can be built for the entity database name, and the search engine system can be used for recall, i.e., the search engine recall method is used.
再比如,工具查询索引类目,对应的召回方式包括词表召回方式和搜索引擎召回方式。其中,词表召回方式又包括与模式词表中固定查询模式串匹配召回的子召回方式,以及从具体工具词表进行召回的子召回方式。从图6可知,优先采用与模式词表中固定查询模式串匹配召回的子召回方式,如果无法直接召回对应结果,则从工具词表中召回对应结果,如果仍无法召回对应结果,则可以对工具类别名称建立倒排索引,通过搜索引擎系统召回,即采用搜索引擎召回方式进行召回。For example, when querying index categories for tools, the corresponding recall methods include terminology recall and search engine recall. The terminology recall method includes sub-recall methods that match results with fixed query pattern strings in the pattern terminology, and sub-recall methods that retrieve results from specific tool terms. As shown in Figure 6, the sub-recall method that matches results with fixed query pattern strings in the pattern terminology is used first. If the corresponding results cannot be retrieved directly, the corresponding results are retrieved from the tool terminology. If the corresponding results still cannot be retrieved, an inverted index is created for the tool category name, and the results are retrieved through the search engine system; that is, the search engine recall method is used.
其中,倒排索引是以文档作为关键索引。表中关键字所对应的记录表项记录了出现这个字或词的所有文档,一个表项就是一个字表段,它记录该文档的标识和字符在该文档中出现的位置情况。可以理解,对实体库名称建立倒排索引,相当于以实体库名称作为关键索引。对工具类别名称建立倒排索引,相当于以工具类别名称作为关键索引。这里不再对图6中的每个索引类目进行一一解释说明,皆与已经解释说明的索引类目的召回原理相同。Inverted indexes use documents as the key index. The record entries corresponding to the keywords in the table record all documents containing that word or phrase. Each entry is a word segment, recording the document's identifier and the position of the character within that document. It can be understood that creating an inverted index for an entity database name is equivalent to using the entity database name as the key index. Similarly, creating an inverted index for a tool category name is equivalent to using the tool category name as the key index. The specific index categories in Figure 6 will not be explained in detail here, as the retrieval principle is the same as that of the already explained index categories.
上述实施例中,不同的索引类目可以采用不同的召回方式,提高了召回准确性及召回效率。In the above embodiments, different index categories can adopt different recall methods, which improves recall accuracy and recall efficiency.
如图7所示,在一个实施例中,提供了一种搜索方法,以该搜索方法应用于图1中的服务器为例进行说明,该方法包括以下步骤:As shown in Figure 7, in one embodiment, a search method is provided. Taking the application of this search method to the server in Figure 1 as an example, the method includes the following steps:
步骤702,接收所发送至的在查询页面中输入的搜索词。Step 702: Receive the search terms entered on the query page.
步骤704,从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目。Step 704: Select multiple index categories from the index categories supported by the search engine that are related to the query intent of the search term.
步骤706,获取在筛选的多个索引类目下的、且与搜索词相关的类目内容。Step 706: Obtain category content related to the search term from the multiple filtered index categories.
步骤708,根据筛选的索引类目和类目内容,生成与搜索词对应的聚合提示内容并返回;聚合提示内容,用于指示在查询页面中,展示聚合提示内容。Step 708: Based on the selected index categories and category content, generate and return aggregated suggestion content corresponding to the search terms; the aggregated suggestion content is used to indicate whether to display aggregated suggestion content on the query page.
在一个实施例中,聚合提示内容还包括与搜索词相关的意图关联提示词和搜索结果查看入口中的至少一种;其中,意图关联提示词,用于被触发后跳转至以意图关联提示词为搜索条件进行搜索,得到的搜索结果页;搜索结果查看入口,用于被触发后跳转至以搜索词为搜索条件进行搜索,得到的搜索结果页。In one embodiment, the aggregated prompt content further includes at least one of intent-related prompts associated with the search term and a search result viewing entry; wherein, the intent-related prompts are used to redirect to the search results page obtained by searching with the intent-related prompts as search criteria after being triggered; and the search results viewing entry is used to redirect to the search results page obtained by searching with the search term as search criteria after being triggered.
在一个实施例中,从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目包括:对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分;根据各索引类目所对应的类目意图分,对搜索引擎所支持的索引类目进行筛选;筛选出的索引类目所对应的类目意图分,高于未筛选出的索引类目所对应的类目意图分。In one embodiment, filtering multiple index categories related to the query intent of a search term from the index categories supported by the search engine includes: performing category intent analysis on the search term to obtain the category intent score corresponding to each index category supported by the search engine; filtering the index categories supported by the search engine based on the category intent score corresponding to each index category; and ensuring that the category intent score corresponding to the filtered index categories is higher than the category intent score corresponding to the unfiltered index categories.
在一个实施例中,对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分包括:将搜索词输入预先训练的分类模型中,预测该搜索词在搜索引擎支持的各索引类目下对应的分数,作为搜索引擎支持的各索引类目所对应的类目意图分。In one embodiment, performing category intent analysis on search terms to obtain category intent scores corresponding to each index category supported by the search engine includes: inputting the search term into a pre-trained classification model, predicting the score corresponding to the search term under each index category supported by the search engine, and using this score as the category intent score corresponding to each index category supported by the search engine.
在一个实施例中,服务器可以预先根据数据库中存储的预设类目内容和相应的索引类目标注,进行机器学习训练,以训练得到分类模型。In one embodiment, the server can perform machine learning training in advance based on the preset category content and corresponding index category annotations stored in the database to train a classification model.
在一个实施例中,根据各索引类目所对应的类目意图分,对搜索引擎所支持的索引类目进行筛选包括:对搜索引擎支持的各索引类目所对应的类目意图分进行排序,筛选类目意图分大于或等于预设意图分阈值的前预设数量的索引类目。In one embodiment, filtering the index categories supported by the search engine based on the category intent score corresponding to each index category includes: sorting the category intent scores corresponding to each index category supported by the search engine, and filtering the top preset number of index categories whose category intent scores are greater than or equal to a preset intent score threshold.
在一个实施例中,步骤706获取在筛选的所述多个索引类目下的、且与所述搜索词相关的类目内容包括:在筛选的所述多个索引类目下,召回与所述搜索词相关的候选类目内容;当同一索引类目下的候选类目内容为多个时,则按照各所述候选类目内容与所述搜索词之间的文本相关性由高到低的顺序,从同一索引类目下的多个候选类目内容中,筛选前预设数量的候选类目内容,作为用于生成聚合提示内容的类目内容。In one embodiment, step 706, obtaining category content related to the search term under the filtered plurality of index categories, includes: recalling candidate category content related to the search term under the filtered plurality of index categories; when there are multiple candidate category contents under the same index category, selecting a predetermined number of candidate category contents from the multiple candidate category contents under the same index category in descending order of text relevance between each candidate category content and the search term, as category content for generating aggregated prompt content.
其中,前预设数量可以为前一个或至少两个。需要说明的是,本申请各实施例中的“多个”即为至少两个。文本相关性,是指类目内容这一文本与搜索词这一文本之间的相关性。The preset number can be one or at least two. It should be noted that "multiple" in the embodiments of this application means at least two. Text relevance refers to the relevance between the text of the category content and the text of the search term.
可以理解,当同一索引类目下的候选类目内容为多个时,按照文本相关性由高到低的顺序,从中筛选前预设数量的候选类目内容,作为用于生成聚合提示内容的类目内容,能够更为准确地推荐用户想要搜索的类目内容,提高了推荐准确性。It is understandable that when there are multiple candidate category contents under the same index category, the first preset number of candidate category contents are selected in descending order of text relevance as the category contents used to generate aggregated suggestion content. This can more accurately recommend the category contents that users want to search for, thus improving the accuracy of recommendations.
在一个实施例中,根据筛选的索引类目和类目内容,生成与搜索词对应的聚合提示内容并返回包括:按照筛选的索引类目所对应的类目意图强度由高到低的顺序,对索引类目进行排序;根据排序后的索引类目和索引类目下的类目内容,生成与搜索词对应的聚合提示内容并返回。In one embodiment, generating and returning aggregated suggestion content corresponding to the search term based on the selected index categories and category content includes: sorting the index categories in descending order of category intent intensity; and generating and returning aggregated suggestion content corresponding to the search term based on the sorted index categories and category content under the index categories.
具体地,服务器可以分析多个索引类目与搜索词的查询意图之间的相关性,得到各个索引类目对应的类目意图强度,然后,按照类目意图强度由高到低的顺序,对索引类目下的类目内容进行排序,根据排序后的索引类目和索引类目下的类目内容,生成与搜索词对应的聚合提示内容,并将该聚合提示内容返回至终端。Specifically, the server can analyze the correlation between multiple index categories and the query intent of search terms to obtain the category intent strength corresponding to each index category. Then, it sorts the category content under the index category in descending order of category intent strength. Based on the sorted index categories and the category content under the index categories, it generates aggregated suggestion content corresponding to the search term and returns the aggregated suggestion content to the terminal.
在一个实施例中,根据筛选的索引类目和类目内容,生成与搜索词对应的聚合提示内容并返回包括:按照类目内容与搜索词之间的相关度由高到低的顺序,对类目内容进行排序;根据排序后的索引类目和索引类目下的类目内容,生成与搜索词对应的聚合提示内容并返回。In one embodiment, generating and returning aggregated suggestion content corresponding to the search term based on the selected index categories and category content includes: sorting the category content in descending order of relevance between the category content and the search term; generating and returning aggregated suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
具体地,服务器可以根据所述类目内容所对应的索引类目的类目意图强度、所述类目内容和所述搜索词之间的文本相关性、以及所述类目内容的时效性中的至少一种,分析类目内容与搜索词之间的相关度。服务器可以按照类目内容与搜索词之间的相关度由高到低的顺序,对类目内容进行排序。服务器可以根据排序后的索引类目和索引类目下的类目内容,生成与搜索词对应的聚合提示内容,并将生成的聚合提示内容返回至终端。Specifically, the server can analyze the relevance between category content and search terms based on at least one of the following: the category intent strength of the index category corresponding to the category content, the textual relevance between the category content and the search term, and the timeliness of the category content. The server can sort the category content in descending order of relevance between the category content and the search term. Based on the sorted index categories and the category content under each index category, the server can generate aggregated suggestion content corresponding to the search term and return the generated aggregated suggestion content to the terminal.
在一个实施例中,服务器可以对类目内容和所述搜索词之间的文本相关性进行评分,得到文本相关分、对类目内容的时效性进行评分得到内容时效分、以及获取类目内容所对应的索引类目的类目意图分,然后对文本相关分、内容时效分以及类目意图分进行加权求和,得到综合分数作为类目内容和搜索词之间的相关度。In one embodiment, the server can score the text relevance between category content and the search term to obtain a text relevance score, score the timeliness of category content to obtain a content timeliness score, and obtain the category intent score of the index category corresponding to the category content. Then, the server can perform a weighted summation of the text relevance score, content timeliness score, and category intent score to obtain a comprehensive score as the relevance between the category content and the search term.
在一个实施例中,根据筛选的索引类目和类目内容,生成与搜索词对应的聚合提示内容并返回,包括:获取输入搜索词的用户的用户画像;确定类目内容和用户画像之间的匹配度;按照匹配度由高到低的顺序,对类目内容进行排序;根据排序后的索引类目和索引类目下的类目内容,生成与搜索词对应的聚合提示内容并返回。In one embodiment, based on the selected index categories and category content, aggregated suggestion content corresponding to the search term is generated and returned, including: obtaining the user profile of the user who inputs the search term; determining the matching degree between the category content and the user profile; sorting the category content in descending order of matching degree; and generating and returning aggregated suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
其中,所述用户画像,是指输入所述搜索词的用户的用户画像。多个索引类目,是搜索引擎所支持的索引类目中的、且与所述搜索词的查询意图相关的索引类目。The user profile refers to the user profile of the user who inputs the search term. Multiple index categories are those index categories supported by the search engine that are related to the query intent of the search term.
在一个实施例中,输入用户搜索词的用户,可以是登录账号的用户或者选定的用于进行搜索操作的用户。In one embodiment, the user who enters the search term can be a user with a logged-in account or a user selected to perform the search operation.
具体地,服务器可以确定多个索引类目下与搜索词相关的类目内容与用户画像之间的匹配度,然后,按照匹配度由高到低的顺序,对类目内容进行排序,将排序后的类目内容和所属的索引类目,对应返回至终端。终端则可以在下拉框中,按照类目内容与用户画像之间的匹配度由高到低的顺序,展示类目内容,且对应于类目内容展示相应的索引类目。Specifically, the server can determine the matching degree between category content related to the search term and the user profile under multiple index categories. Then, it sorts the category content in descending order of matching degree and returns the sorted category content and its corresponding index category to the terminal. The terminal can then display the category content in a drop-down menu, ordered from highest to lowest matching degree between the category content and the user profile, and display the corresponding index category for each category content.
可以理解,索引类目之间的排列顺序与所述索引类目下的类目内容之间的排列顺序是一致的。It is understood that the order in which the index categories are arranged is consistent with the order in which the content of the categories under the index categories is arranged.
在一个实施例中,服务器可以通过预先训练的匹配模型,对类目内容进行语义分析,提取语义特征,然后将提取的语义特征和用户画像之间进行匹配度计算,得到各类目内容与用户画像之间的匹配度。In one embodiment, the server can perform semantic analysis on category content using a pre-trained matching model, extract semantic features, and then calculate the matching degree between the extracted semantic features and the user profile to obtain the matching degree between each category of content and the user profile.
可以理解,这样一来,针对同一搜索词、不同用户提示的类目内容的优先顺序不同,从而个性化提示。This is understandable, as it allows for different users to receive different priority levels of category content for the same search term, thus enabling personalized suggestions.
上述实施例中,通过不同索引类目下的类目内容与用户画像之间的匹配度,来对类目内容进行排序,从而能够在输入过程中,针对同一搜索词、不同用户进行个性化提示,即不同用户提示的类目内容的优先顺序不同,提高了输入过程中提示的准确性。In the above embodiments, the category content is sorted by the matching degree between the category content under different index categories and the user profile, so that personalized prompts can be given to different users for the same search term during the input process. That is, the priority of the category content prompted to different users is different, which improves the accuracy of prompts during the input process.
在一个实施例中,在筛选的多个索引类目下,召回与搜索词相关的类目内容包括:根据搜索词,确定所筛选的各索引类目所分别对应的召回方式;针对每个筛选的索引类目,分别索引类目对应的召回方式,召回索引类目下与搜索词相关的类目内容。In one embodiment, recalling category content related to the search term under multiple filtered index categories includes: determining the recall method corresponding to each filtered index category based on the search term; and for each filtered index category, recalling the category content related to the search term under the index category using the corresponding recall method.
在一个实施例中,召回方式包括实体库召回方式、词表召回方式、以及搜索引擎召回方式中的至少一种。In one embodiment, the recall method includes at least one of entity database recall, vocabulary recall, and search engine recall.
在一个实施例中,服务器可以对召回的各类目内容进行排序,根据排序后的类目内容及相应的索引类目生成聚合提示内容,并返回至终端。In one embodiment, the server can sort the recalled category content, generate aggregated prompt content based on the sorted category content and the corresponding index category, and return it to the terminal.
在一个实施例中,服务器可以按照各索引类目对应的类目意图分由高到低的顺序,对各索引类目下的类目内容进行排序。服务器也可以根据召回的类目内容与搜索词之间文本相关性和类目内容的时效性中的至少一种,对类目内容进行评分,并按照评分分数由高到低的顺序,对类目内容进行排序。可以理解,服务器还可以将评分分数和类目意图分结合起来(比如,加权求和),确定综合分数,并按照综合分数由高到低的顺序,对类目内容进行排序。对此不作限定。In one embodiment, the server can sort the category content under each index category according to the category intent score, from highest to lowest. The server can also score the category content based on at least one of the textual relevance between the recalled category content and the search terms, and the timeliness of the category content, and sort the category content according to the score, from highest to lowest. It is understood that the server can also combine the score and the category intent score (e.g., weighted summation) to determine a comprehensive score, and sort the category content according to the comprehensive score, from highest to lowest. This is not limited.
可以理解,在时效性维度评分时,时效性强的类目内容评分高于时效性弱的类目内容。比如,对于新闻内容来说,与搜索词匹配的新闻中,最新的新闻比已经过时的新闻的评分要高。It's understandable that when scoring based on timeliness, content in categories with higher timeliness scores higher than content in categories with lower timeliness. For example, in the case of news content, the most recent news items matching the search terms score higher than outdated news items.
上述搜索方法、装置、计算机设备和存储介质,在获取到输入的搜索词时,能够实时从搜索引擎所支持的索引类目中,筛选出与搜索词的查询意图相关的索引类目,以及各索引类目下召回搜索词相关的类目内容,生成聚合提示内容并返回至终端,以使终端在输入过程中,既能够提示与搜索词相关的内容,又能够提示搜索引擎所支持的索引类目,从而能够让用户知晓搜索引擎相关的信息,因此,相较于仅对输入的词语进行辅助提示以外,能够提供更多的搜索相关信息,提高了所表达的信息量。The aforementioned search method, apparatus, computer equipment, and storage medium, upon receiving the input search term, can in real time filter index categories related to the query intent of the search term from the index categories supported by the search engine, as well as recall category content related to the search term under each index category, generate aggregated prompt content, and return it to the terminal. This allows the terminal to provide both content related to the search term and the index categories supported by the search engine during the input process, thereby enabling the user to be aware of search engine-related information. Therefore, compared to simply providing auxiliary prompts for the input words, it can provide more search-related information and increase the amount of information conveyed.
图8为一个实施例中搜索方法的原理示意图。参照图8,用户在终端的查询页面中的搜索框中输入搜索词,终端将搜索词发送至服务器,由服务器根据预先建立的不同类别的数据库(比如,实体库、新闻聚合的新闻词表、模式词表以及问答词表等),对搜索词进行类目意图分析,然后根据分析结果,筛选与搜索词的查询意图相关的索引类目。服务器可以确定所筛选的索引类目对应的召回方式(不同索引类目具有不同的召回方式),根据确定的召回方式,再次结合建立的数据库(比如,实体库、新闻聚合的新闻词表、模式词表以及问答词表等),在选取的索引类目下召回类目内容(即召回索引类目结果)。然后,服务器可以将召回的类目内容进行排序,组合得到聚合提示内容,生成内容聚合卡片,终端可以在搜索框下实时展示该内容聚合卡片。Figure 8 is a schematic diagram illustrating the principle of the search method in one embodiment. Referring to Figure 8, the user enters a search term in the search box on the terminal's query page. The terminal sends the search term to the server, which performs category intent analysis on the search term based on pre-established databases of different categories (e.g., entity database, news terminology for news aggregation, pattern terminology, and question-and-answer terminology). Based on the analysis results, the server filters index categories related to the query intent of the search term. The server can determine the recall method corresponding to the filtered index category (different index categories have different recall methods). Based on the determined recall method, and again combining it with the established databases (e.g., entity database, news terminology for news aggregation, pattern terminology, and question-and-answer terminology), the server recalls category content (i.e., recalls index category results) under the selected index category. Then, the server can sort the recalled category content, combine it to obtain aggregated prompt content, and generate a content aggregation card. The terminal can display this content aggregation card in real time below the search box.
应该理解的是,虽然上述流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,上述流程图中的至少一部分步骤可以包括多个步骤或者多个阶段,这些步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤中的步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowchart above are shown sequentially as indicated by the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated herein, there is no strict order restriction on the execution of these steps, and they can be executed in other orders. Moreover, at least some steps in the flowchart above may include multiple steps or multiple stages. These steps or stages are not necessarily completed at the same time, but can be executed at different times. The execution order of these steps or stages is not necessarily sequential, but can be performed alternately or in turn with other steps or at least some of the steps or stages in other steps.
在一个实施例中,如图9所示,提供了一种搜索装置,该装置可以采用软件模块或硬件模块,或者是二者的结合成为计算机设备的一部分,该装置具体包括:展示模块902和监听模块904,其中:In one embodiment, as shown in FIG9, a search device is provided. This device can be a software module, a hardware module, or a combination of both, integrated into a computer device. Specifically, the device includes a display module 902 and a listening module 904, wherein:
展示模块902,用于展示查询页面;查询页面中显示有输入区。Display module 902 is used to display the query page; the query page displays an input area.
监听模块904,用于监听在输入区中输入的搜索词。The listening module 904 is used to listen for search terms entered in the input area.
展示模块902还用于当监听模块904监听到在输入区中输入搜索词时,则在查询页面中,实时展示与输入的搜索词对应的聚合提示内容。The display module 902 is also used to display aggregated suggestions corresponding to the entered search term in real time on the query page when the listening module 904 detects that a search term has been entered in the input area.
其中,聚合提示内容,包括多个索引类目和在各索引类目下的与搜索词相关的类目内容;多个索引类目,是搜索引擎所支持的索引类目中的、且与搜索词的查询意图相关的索引类目。The aggregated suggestion content includes multiple index categories and category content related to the search terms under each index category; the multiple index categories are index categories supported by the search engine that are related to the query intent of the search terms.
在一个实施例中,输入区为搜索框。展示模块902还用于当监听到在所述搜索框中输入搜索词时,则在所述搜索框下展示下拉框,并在所述下拉框中,将所述多个索引类目下的与所述搜索词相关的类目内容按序展示。In one embodiment, the input area is a search box. The display module 902 is further configured to, when a search term is detected being entered into the search box, display a drop-down list below the search box, and in the drop-down list, display the category content related to the search term under the plurality of index categories in sequence.
在一个实施例中,展示模块902还用于在所述下拉框中,按照所述类目内容与用户画像之间的匹配度由高到低的顺序,展示所述类目内容,且对应于所述类目内容展示相应的索引类目;所述用户画像,是指输入所述搜索词的用户的用户画像。In one embodiment, the display module 902 is further configured to display the category content in the drop-down box in descending order of matching degree between the category content and the user profile, and to display the corresponding index category corresponding to the category content; the user profile refers to the user profile of the user who inputs the search term.
在一个实施例中,展示模块902还用于当监听模块904监听到对聚合提示内容中展示的类目内容的触发操作时,则跳转至与所触发的类目内容对应的页面;其中,跳转至的页面,符合所触发的类目内容所属的索引类目。In one embodiment, the display module 902 is further configured to, when the listening module 904 detects a trigger operation on the category content displayed in the aggregated prompt content, jump to the page corresponding to the triggered category content; wherein the page to which the jump is made conforms to the index category to which the triggered category content belongs.
在一个实施例中,聚合提示内容,还包括与搜索词相关的意图关联提示词和搜索结果查看入口中的至少一种;其中,意图关联提示词,是与查询意图相关的提示词。In one embodiment, the aggregated prompt content further includes at least one of intent-related prompts associated with the search term and a search result viewing entry; wherein, intent-related prompts are prompts associated with the query intent.
在一个实施例中,聚合提示内容,还包括与所述搜索词相关的意图关联提示词。展示模块902还用于当监听模块904监听到针对意图关联提示词的触发操作时,则跳转进入至第一搜索结果页;其中,在第一搜索结果页中,展示有以所触发的意图关联提示词为搜索条件,所搜索到的搜索结果。In one embodiment, the aggregated prompt content further includes intent-related prompt words associated with the search term. The display module 902 is also configured to, when the listening module 904 detects a triggering operation for the intent-related prompt word, redirect to the first search results page; wherein, the first search results page displays the search results obtained using the triggered intent-related prompt word as the search condition.
在一个实施例中,聚合提示内容,还包括搜索结果查看入口。展示模块902还用于当监听模块904监听到针对搜索结果查看入口的触发操作时,则跳转进入至第二搜索结果页;其中,在第二搜索结果页中,展示有以搜索词为搜索条件,所搜索得到的搜索结果。In one embodiment, the aggregated prompt content also includes a search result viewing entry. The display module 902 is further configured to, when the listening module 904 detects a trigger operation on the search result viewing entry, redirect to a second search result page; wherein, the second search result page displays the search results obtained using the search term as the search criteria.
在一个实施例中,聚合提示内容中的多个索引类目,是通过对搜索词进行类目意图分析,得到搜索引擎支持的各索引类目所对应的类目意图分,并根据各索引类目所对应的类目意图分,从搜索引擎所支持的索引类目中筛选出的索引类目。In one embodiment, the multiple index categories in the aggregated prompt content are obtained by performing category intent analysis on the search terms to obtain the category intent score corresponding to each index category supported by the search engine, and then filtering the index categories from the index categories supported by the search engine based on the category intent score corresponding to each index category.
在一个实施例中,类目内容,是根据所筛选的各索引类目所分别对应的召回方式,召回的所筛选的各索引类目下与搜索词相关的提示内容。In one embodiment, the category content is the prompt content related to the search term retrieved under each of the selected index categories according to the recall method corresponding to each of the selected index categories.
在一个实施例中,召回方式包括实体库召回方式、词表召回方式、以及搜索引擎召回方式中的至少一种;实体库召回方式,用于指示从索引类目所对应的实体库中,召回与搜索词相关的类目内容;实体库,是预先通过对预设类目内容聚合分类,构建的分类数据库;属于同一索引类目的类目内容被聚合在同一实体库中;词表召回方式,用于指示从索引类目所对应的词表中,召回与搜索词相关的类目内容;词表,是预先通过对预设类目内容聚合分类,构建的词表;属于同一索引类目的类目内容被聚合在同一词表中。In one embodiment, the recall method includes at least one of entity database recall, terminology recall, and search engine recall. The entity database recall method is used to indicate the recall of category content related to the search term from the entity database corresponding to the index category. The entity database is a classification database pre-built by aggregating and classifying preset category content; category content belonging to the same index category is aggregated in the same entity database. The terminology recall method is used to indicate the recall of category content related to the search term from the terminology corresponding to the index category. The terminology is a terminology pre-built by aggregating and classifying preset category content; category content belonging to the same index category is aggregated in the same terminology.
在一个实施例中,索引类目包括子应用索引类目、公众账号索引类目、工具索引类目、多媒体内容索引类目、以及问答索引类目中的至少一种。In one embodiment, the index categories include at least one of the following: sub-application index category, public account index category, tool index category, multimedia content index category, and question and answer index category.
在一个实施例中,如图10所示,提供了一种搜索装置,该装置可以采用软件模块或硬件模块,或者是二者的结合成为计算机设备的一部分,该装置具体包括:接收模块1002、索引类目筛选模块1004、获取模块1006和聚合提示模块1008,其中:In one embodiment, as shown in FIG10, a search device is provided. This device can be a software module, a hardware module, or a combination of both, integrated into a computer device. Specifically, the device includes: a receiving module 1002, an index category filtering module 1004, an acquisition module 1006, and an aggregation suggestion module 1008, wherein:
接收模块1002,用于接收所发送至的在查询页面中输入的搜索词。The receiving module 1002 is used to receive the search terms entered on the query page.
索引类目筛选模块1004,用于从搜索引擎所支持的索引类目中,筛选与搜索词的查询意图相关的多个索引类目。The index category filtering module 1004 is used to filter multiple index categories that are related to the query intent of the search terms from the index categories supported by the search engine.
获取模块1006,用于获取在筛选的多个索引类目下的、且与搜索词相关的类目内容。The acquisition module 1006 is used to acquire category content related to the search term under multiple filtered index categories.
聚合提示模块1008,用于根据筛选的索引类目和类目内容,生成与搜索词对应的聚合提示内容并返回;聚合提示内容,用于指示在查询页面中,展示聚合提示内容。The aggregation suggestion module 1008 is used to generate and return aggregation suggestion content corresponding to the search terms based on the filtered index categories and category content; the aggregation suggestion content is used to indicate whether to display the aggregation suggestion content on the query page.
在一个实施例中,聚合提示内容还包括与搜索词相关的意图关联提示词和搜索结果查看入口中的至少一种;其中,意图关联提示词,用于被触发后跳转至以意图关联提示词为搜索条件进行搜索,得到的搜索结果页;搜索结果查看入口,用于被触发后跳转至以搜索词为搜索条件进行搜索,得到的搜索结果页。In one embodiment, the aggregated prompt content further includes at least one of intent-related prompts associated with the search term and a search result viewing entry; wherein, the intent-related prompts are used to redirect to the search results page obtained by searching with the intent-related prompts as search criteria after being triggered; and the search results viewing entry is used to redirect to the search results page obtained by searching with the search term as search criteria after being triggered.
在一个实施例中,获取模块1006还用于在筛选的所述多个索引类目下,召回与所述搜索词相关的候选类目内容;当同一索引类目下的候选类目内容为多个时,则按照各所述候选类目内容与所述搜索词之间的文本相关性由高到低的顺序,从同一索引类目下的多个候选类目内容中,筛选前预设数量的候选类目内容,作为用于生成聚合提示内容的类目内容。In one embodiment, the acquisition module 1006 is further configured to recall candidate category content related to the search term under the filtered plurality of index categories; when there are multiple candidate category contents under the same index category, the preset number of candidate category contents are selected from the multiple candidate category contents under the same index category in descending order of text relevance between each candidate category content and the search term, and used as category content for generating aggregated prompt content.
在一个实施例中,获取模块1006还用于根据搜索词,确定所筛选的各索引类目所分别对应的召回方式;针对每个筛选的索引类目,分别索引类目对应的召回方式,召回索引类目下与搜索词相关的类目内容。In one embodiment, the acquisition module 1006 is further configured to determine the recall method corresponding to each of the selected index categories based on the search term; for each selected index category, the recall method corresponding to the index category is used to recall the category content related to the search term under the index category.
在一个实施例中,聚合提示模块1008还用于按照筛选的所述索引类目所对应的类目意图强度由高到低的顺序,对所述索引类目进行排序,或,按照所述类目内容与所述搜索词之间的相关度由高到低的顺序,对所述类目内容进行排序;根据排序后的索引类目和所述索引类目下的所述类目内容,生成与所述搜索词对应的聚合提示内容并返回。In one embodiment, the aggregation suggestion module 1008 is further configured to sort the index categories in descending order of category intent intensity corresponding to the filtered index categories, or sort the category content in descending order of relevance between the category content and the search term; and generate and return aggregate suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
在一个实施例中,聚合提示模块1008还用于获取输入所述搜索词的用户的用户画像;确定所述类目内容和所述用户画像之间的匹配度;按照所述匹配度由高到低的顺序,对所述类目内容进行排序;根据排序后的索引类目和所述索引类目下的所述类目内容,生成与所述搜索词对应的聚合提示内容并返回。In one embodiment, the aggregation suggestion module 1008 is further configured to obtain a user profile of the user who inputs the search term; determine the matching degree between the category content and the user profile; sort the category content in descending order of the matching degree; and generate and return aggregate suggestion content corresponding to the search term based on the sorted index categories and the category content under the index categories.
关于搜索装置的具体限定可以参见上文中对于搜索方法的限定,在此不再赘述。上述搜索装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。Specific limitations regarding the search device can be found in the limitations of the search method above, and will not be repeated here. Each module in the aforementioned search device can be implemented entirely or partially through software, hardware, or a combination thereof. These modules can be embedded in or independent of the processor in the computer device in hardware form, or stored in the memory of the computer device in software form, so that the processor can call and execute the operations corresponding to each module.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是终端,其内部结构图可以如图11所示。该计算机设备包括通过系统总线连接的处理器、存储器、通信接口、显示屏和输入装置。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的通信接口用于与外部的终端进行有线或无线方式的通信,无线方式可通过WIFI、运营商网络、NFC(近场通信)或其他技术实现。该计算机程序被处理器执行时以实现一种搜索方法。该计算机设备的显示屏可以是液晶显示屏或者电子墨水显示屏,该计算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in Figure 11. The computer device includes a processor, memory, communication interface, display screen, and input device connected via a system bus. The processor of the computer device provides computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and internal memory. The non-volatile storage medium stores an operating system and computer programs. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The communication interface of the computer device is used for wired or wireless communication with external terminals. Wireless communication can be achieved through Wi-Fi, carrier networks, NFC (Near Field Communication), or other technologies. When the computer program is executed by the processor, it implements a search method. The display screen of the computer device may be a liquid crystal display (LCD) or an e-ink display. The input device of the computer device may be a touch layer covering the display screen, or buttons, a trackball, or a touchpad located on the casing of the computer device, or an external keyboard, touchpad, or mouse, etc.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图12所示。该计算机设备包括通过系统总线连接的处理器、存储器和网络接口。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机程序和数据库。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的数据库用于存储搜索数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种搜索方法。In one embodiment, a computer device, which may be a server, is provided, and its internal structure diagram is shown in Figure 12. The computer device includes a processor, memory, and a network interface connected via a system bus. The processor provides computing and control capabilities. The memory includes a non-volatile storage medium and internal memory. The non-volatile storage medium stores an operating system, computer programs, and a database. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The database stores search data. The network interface communicates with external terminals via a network connection. When the computer program is executed by the processor, it implements a search method.
本领域技术人员可以理解,图11和12中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art will understand that the structures shown in Figures 11 and 12 are merely block diagrams of some structures related to the present application and do not constitute a limitation on the computer device to which the present application is applied. Specific computer devices may include more or fewer components than shown in the figures, or combine certain components, or have different component arrangements.
在一个实施例中,还提供了一种计算机设备,包括存储器和处理器,存储器中存储有计算机程序,该处理器执行计算机程序时实现上述各方法实施例中的步骤。In one embodiment, a computer device is also provided, including a memory and a processor, wherein the memory stores a computer program, and the processor executes the computer program to implement the steps in the above method embodiments.
在一个实施例中,提供了一种计算机可读存储介质,存储有计算机程序,该计算机程序被处理器执行时实现上述各方法实施例中的步骤。In one embodiment, a computer-readable storage medium is provided storing a computer program that, when executed by a processor, implements the steps in the above method embodiments.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-Only Memory,ROM)、磁带、软盘、闪存或光存储器等。易失性存储器可包括随机存取存储器(Random Access Memory,RAM)或外部高速缓冲存储器。作为说明而非局限,RAM可以是多种形式,比如静态随机存取存储器(Static Random Access Memory,SRAM)或动态随机存取存储器(Dynamic Random Access Memory,DRAM)等。Those skilled in the art will understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer program instructing related hardware. The computer program can be stored in a non-volatile computer-readable storage medium, and when executed, it can include the processes of the embodiments of the above methods. Any references to memory, storage, databases, or other media used in the embodiments provided in this application can include at least one of non-volatile and volatile memory. Non-volatile memory can include read-only memory (ROM), magnetic tape, floppy disk, flash memory, or optical storage, etc. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM can be in various forms, such as static random access memory (SRAM) or dynamic random access memory (DRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined in any way. For the sake of brevity, not all possible combinations of the technical features in the above embodiments are described. However, as long as there is no contradiction in the combination of these technical features, they should be considered to be within the scope of this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The embodiments described above are merely illustrative of several implementation methods of this application, and while the descriptions are relatively specific and detailed, they should not be construed as limiting the scope of the invention patent. It should be noted that those skilled in the art can make various modifications and improvements without departing from the concept of this application, and these all fall within the protection scope of this application. Therefore, the protection scope of this patent application should be determined by the appended claims.
Claims (20)
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK40043934A HK40043934A (en) | 2021-09-24 |
| HK40043934B true HK40043934B (en) | 2024-03-01 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112749328B (en) | Searching method, searching device and computer equipment | |
| US11645317B2 (en) | Recommending topic clusters for unstructured text documents | |
| JP7252914B2 (en) | Method, apparatus, apparatus and medium for providing search suggestions | |
| CN109271574A (en) | A kind of hot word recommended method and device | |
| US9846744B2 (en) | Media discovery and playlist generation | |
| US10599643B2 (en) | Template-driven structured query generation | |
| US6944609B2 (en) | Search results using editor feedback | |
| US9665643B2 (en) | Knowledge-based entity detection and disambiguation | |
| US10108720B2 (en) | Automatically providing relevant search results based on user behavior | |
| US20020073079A1 (en) | Method and apparatus for searching a database and providing relevance feedback | |
| US20090094223A1 (en) | System and method for classifying search queries | |
| US20170228372A1 (en) | System and method for querying questions and answers | |
| CN102968465B (en) | Network information service platform and the search service method based on this platform thereof | |
| US20090313217A1 (en) | Systems and methods for classifying search queries | |
| WO2021111400A1 (en) | System and method for enabling a search platform to users | |
| US20150160847A1 (en) | System and method for searching through a graphic user interface | |
| CN119557500B (en) | A method and system for accurate search of Internet massive data based on AI technology | |
| JP4375626B2 (en) | Search service system and method for providing input order of keywords by category | |
| Spitz et al. | EVELIN: Exploration of event and entity links in implicit networks | |
| Sakthi Murugan et al. | Ontology based information retrieval-an analysis | |
| Ahamed et al. | Deduce user search progression with feedback session | |
| HK40043934B (en) | Search method and apparatus, and computer device | |
| JP2006139484A (en) | Information retrieval method and system, and computer program | |
| HK40043934A (en) | Search method and apparatus, and computer device | |
| JP7438272B2 (en) | Method, computer device, and computer program for generating blocks of search intent units |