CN107436907A - Web text classification integration method and device - Google Patents
- Publication number
- CN107436907A (CN201610366352.8A)
- Authority
- CN
- China
- Prior art keywords
- information
- text
- classification
- user
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Embodiments of the present invention provide a method and device for classifying and integrating network texts. The method includes: receiving a collection instruction input by a user; storing content information or URL information of the currently displayed text according to the collection instruction; obtaining identification information and classification information of the currently displayed text; and, according to the classification information of the currently displayed text, adding an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text. By classifying and storing texts that come from different application software, the embodiments make it easy for a user to query by category: the user can find a text by remembering any one of its source, type, or collection time. This improves the efficiency with which the user finds text information and, in turn, the efficiency with which the user reads it.
Description
Technical field
The embodiments of the present invention relate to the field of communication technologies, and in particular to a method and device for classifying and integrating network texts.
Background
With the development of smart terminals, users can install various application software on a smart terminal and use each application to obtain different network information or enjoy different network services.
Text information is one kind of network information, and users can read text information on the network through application software. For example, a user logs in to Weibo to read text shared by other Weibo users, or logs in to QQ to read text uploaded or forwarded by QQ friends. When the user finds text of interest in an application, the user saves it to the favorites of that application.
Because text of interest is saved within the application where it was found, when there are many applications, each of them stores different texts. When the user later wants to read a saved text, the user may be unable to locate where it was saved, which reduces the efficiency of reading text information.
Summary of the invention
Embodiments of the present invention provide a method and device for classifying and integrating network texts, so as to improve the efficiency with which users read text information.
One aspect of the embodiments of the present invention provides a network text classification and integration method, including:
receiving a collection instruction input by a user;
storing content information or URL information of the currently displayed text according to the collection instruction;
obtaining identification information and classification information of the currently displayed text;
according to the classification information of the currently displayed text, adding an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text.
Another aspect of the embodiments of the present invention provides a network text classification and integration device, including:
a receiving module, configured to receive a collection instruction input by a user;
a storage module, configured to store content information or URL information of the currently displayed text according to the collection instruction;
an acquisition module, configured to obtain identification information and classification information of the currently displayed text;
a classification table creation module, configured to add, according to the classification information of the currently displayed text, an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text.
The network text classification and integration method and device provided by the embodiments of the present invention classify and store texts that come from different application software, which makes it easy for users to query by category. A user can find a text by remembering any one of its source, type, or collection time. This improves the efficiency with which the user finds text information and, in turn, the efficiency with which the user reads it.
Brief description of the drawings
FIG. 1 is a flowchart of a network text classification and integration method provided by an embodiment of the present invention;
FIG. 2 is a flowchart of a network text classification and integration method provided by another embodiment of the present invention;
FIG. 3 is a structural diagram of a network text classification and integration device provided by an embodiment of the present invention;
FIG. 4 is a structural diagram of a network text classification and integration device provided by another embodiment of the present invention.
Detailed description
FIG. 1 is a flowchart of the network text classification and integration method provided by an embodiment of the present invention. The embodiment addresses the following problem: after finding text of interest in an application, the user saves it within that application; when there are many applications, each of them stores different texts, and a user who wants to read a saved text may be unable to locate where it was saved, which reduces reading efficiency. To address this, the embodiment provides a network text classification and integration method with the following steps:
Step S101: receive a collection instruction input by a user.
In this embodiment, the user holds a terminal device on which multiple applications are installed. Through each application the user can access the network and obtain texts from it, such as journal entries, blog posts, and novels. If the text being read interests the user but there is not enough time to finish it, the user can, in order to reread it or continue reading later, tap the unified collection button in the user interface provided by the application. The unified collection button may be a specific key on the terminal device or a combination of keys, for example the volume key together with the power key. The button works across multiple applications on the terminal device: text found in WeChat can be saved with it, and so can text found in Weibo. When the user presses the unified collection button, the terminal device generates a collection instruction.
Step S102: store content information or URL information of the currently displayed text according to the collection instruction.
The terminal device stores the text currently displayed on the screen according to the collection instruction. There are two storage modes: storing the content information of the text, or storing its URL information, such as a link. After the user taps the unified collection button, the terminal device displays a prompt asking whether to save the content information or the URL information. If the user wants to read the original text next time and does not want a later change to the link to make the original unreachable, the user chooses to save the content information. If the user wants to read the updated text, the user chooses to save the URL information. Saving the URL makes it easy to follow new versions of the text, but the saved URL may no longer lead to the original if the link changes. The terminal device stores the content information or URL information in memory according to the mode the user selected, and assigns a unique identification number to the content information or URL information of each text.
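The two storage modes and the unique identification number can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `SavedText`, `TextStore`, and the `mode` values `"content"` and `"url"` are hypothetical, and an in-memory dictionary stands in for the terminal device's memory.

```python
from dataclasses import dataclass
from itertools import count
from typing import Dict, Optional


@dataclass
class SavedText:
    """One saved text; exactly one of content/url is set (hypothetical layout)."""
    text_id: int
    content: Optional[str] = None  # set when the user chose to keep the content
    url: Optional[str] = None      # set when the user chose to keep only the link


class TextStore:
    """In-memory stand-in for the terminal device's memory (an assumption)."""

    def __init__(self) -> None:
        self._next_id = count(1)             # source of unique identification numbers
        self._items: Dict[int, SavedText] = {}

    def save(self, mode: str, content: str, url: str) -> int:
        """Store either the content or the URL and return the new identification number."""
        if mode not in ("content", "url"):
            raise ValueError("mode must be 'content' or 'url'")
        text_id = next(self._next_id)
        if mode == "content":
            item = SavedText(text_id, content=content)
        else:
            item = SavedText(text_id, url=url)
        self._items[text_id] = item
        return text_id

    def get(self, text_id: int) -> SavedText:
        return self._items[text_id]
```

Keeping the mode explicit at save time mirrors the prompt shown to the user: the choice between a stable copy and an up-to-date link is made once, per text.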
Step S103: obtain identification information and classification information of the currently displayed text.
The classification information includes at least one of the following: source information, type information, and time information. The identification information of the text includes key information of the text. Obtaining the identification information and classification information of the currently displayed text includes: obtaining key information from the content information of the currently displayed text, where the key information includes at least one of title information, author information, and content keywords; obtaining the source information of the text from the URL information of the currently displayed text; analyzing the content information of the currently displayed text to determine the type information of the text; and obtaining the time information of the text from its collection time and publication time.
After step S102, the identification information and classification information of the currently displayed text are obtained. The identification information is the key information of the text, and the classification information includes at least source information, type information, and time information. The terminal device obtains the title information, author information, content keywords, and similar information from the content information of the currently displayed text, and uses the title information, author information, and content keywords as the key information of the text.
Texts published on different media platforms have different domain names in their links. For example, the domain part of a text link on WeChat is usually mp.weixin.qq.com, and the domain part of a text link on Weibo is usually mp.weibo.cn. The source of the currently displayed text can therefore be determined by parsing the domain part of its link.
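Determining the source from the domain part of a link can be sketched in a few lines of Python. The mapping below contains only the two domains named in the text; the function name `source_from_url` and the `"unknown"` fallback are assumptions made for illustration.

```python
from urllib.parse import urlparse

# Domain-to-source mapping, limited to the two examples given in the description.
SOURCE_BY_DOMAIN = {
    "mp.weixin.qq.com": "WeChat",
    "mp.weibo.cn": "Weibo",
}


def source_from_url(url: str) -> str:
    """Return the source platform for a link, or "unknown" if the domain is not mapped."""
    host = (urlparse(url).hostname or "").lower()
    return SOURCE_BY_DOMAIN.get(host, "unknown")
```

For example, `source_from_url("https://mp.weixin.qq.com/s/abc")` yields `"WeChat"`. A real device would need a longer mapping and a policy for unmapped domains.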
In addition, the terminal device provides existing categories for texts. The user can store the currently displayed text in one of the existing categories, or create a new category on the terminal device and store the text there. The terminal device can also perform text analysis and topic detection on the currently displayed text to determine its category, for example finance, technology, art, or sports, and store the text in the category so determined.
The terminal device can also record the time at which the user saved the text and the time at which the text was published on the network.
Step S104: according to the classification information of the currently displayed text, add an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text.
In this embodiment, each classification of texts has a corresponding classification table. Specifically, source, type, and time each have classification tables. For example, the source classification table covers texts from each application, such as Weibo, WeChat, and QQ; the type classification table covers texts of various types and subjects; and the time classification table can order all saved texts by collection time.
In this embodiment, the currently displayed text may belong to several classifications at once. For example, a finance text from Weibo saved on January 1, 2016 belongs to the Weibo, finance, and time classifications, so an entry is added to the classification table of each of them. Each entry includes the identification information of the text, such as its key information. That is, Weibo has a classification table with multiple entries, each containing the key information of a text from Weibo; finance has a classification table with multiple entries, each containing the key information of a finance text; and time has a classification table with multiple entries, each containing the collection time and the key information of a text. The entries in the time table can be arranged from the latest collection time to the earliest, or from the earliest to the latest.
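A minimal Python sketch of adding one text to several classification tables at once follows. The names `Entry` and `ClassificationTables` and the field layout are hypothetical; the time table is kept newest first, which is one of the two orderings the description allows.

```python
from collections import defaultdict
from datetime import date
from typing import DefaultDict, List, NamedTuple, Tuple


class Entry(NamedTuple):
    """Key information stored in a table entry (hypothetical layout)."""
    text_id: int
    title: str
    author: str
    keywords: Tuple[str, ...]
    saved_on: date


class ClassificationTables:
    """One table per source, one per type, and a single time-ordered table."""

    def __init__(self) -> None:
        self.by_source: DefaultDict[str, List[Entry]] = defaultdict(list)
        self.by_type: DefaultDict[str, List[Entry]] = defaultdict(list)
        self.by_time: List[Entry] = []

    def add(self, entry: Entry, source: str, text_type: str) -> None:
        """Add the same entry to every table the text belongs to."""
        self.by_source[source].append(entry)
        self.by_type[text_type].append(entry)
        self.by_time.append(entry)
        # Keep the time table sorted from latest to earliest collection time.
        self.by_time.sort(key=lambda e: e.saved_on, reverse=True)
```

Storing the same lightweight entry in each table, rather than the text itself, keeps the tables small; the identification number links each entry back to the stored content or URL.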
By classifying and storing texts that come from different application software, this embodiment makes it easy for users to query by category. A user can find a text by remembering any one of its source, type, or collection time. This improves the efficiency with which the user finds text information and, in turn, the efficiency with which the user reads it.
FIG. 2 is a flowchart of a network text classification and integration method provided by another embodiment of the present invention. Building on the embodiment corresponding to FIG. 1, the method provided by this embodiment has the following steps:
Step S201: receive a collection instruction input by a user.
Step S202: store content information or URL information of the currently displayed text according to the collection instruction.
Step S203: obtain identification information and classification information of the currently displayed text.
Step S204: according to the classification information of the currently displayed text, add an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text.
Steps S201 to S204 are the same as steps S101 to S104 respectively, and the details are not repeated here.
Step S205: receive classification information input by the user.
When the user needs to find a saved text, the user determines the classification information of the target text from memory. For example, the user may remember only that the target text is about finance, or only that it was saved from WeChat, or only that it was saved a week ago. The user then needs only to pick, based on any one remembered fact about the target text, one of the classifications available on the terminal device, which include source classification, type classification, time classification, and so on.
Step S206: obtain and display a classification table according to the classification information, where the classification table includes the identification information of the texts corresponding to the classification information, so that the user can select a target text according to the identification information of the texts.
The terminal device provides a view for each classification, and the user obtains the classification table for a classification simply by selecting its view. For example, if the user selects the source classification, the terminal device displays the classification table for sources, which includes the source information and key information of the texts. If the user then specifies WeChat as the source, the terminal device displays only the entries for texts from WeChat. Each entry includes the key information of a text, namely its title information, author information, and content keywords, and the user can identify the target text from the key information of the listed texts.
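Selecting a view and narrowing it to one value can be sketched as a two-level lookup. This is an illustrative Python sketch, not the device's actual data model: the nested-dictionary layout, the sample entries in `TABLES`, and the function name `entries_for` are all assumptions.

```python
from typing import Dict, List

# category -> value -> list of entries; each entry holds a text's key information.
Tables = Dict[str, Dict[str, List[dict]]]

# Made-up sample data for illustration only.
TABLES: Tables = {
    "source": {
        "WeChat": [{"id": 1, "title": "Rates outlook", "author": "A", "keywords": ["finance"]}],
        "Weibo": [{"id": 2, "title": "Match report", "author": "B", "keywords": ["sports"]}],
    },
    "type": {
        "finance": [{"id": 1, "title": "Rates outlook", "author": "A", "keywords": ["finance"]}],
        "sports": [{"id": 2, "title": "Match report", "author": "B", "keywords": ["sports"]}],
    },
}


def entries_for(tables: Tables, category: str, value: str) -> List[dict]:
    """Return the entries shown when the user picks a view and then a value in it."""
    return list(tables.get(category, {}).get(value, []))
```

Calling `entries_for(TABLES, "source", "WeChat")` returns only the entries for texts saved from WeChat, which the device would then render as title, author, and keywords.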
Step S207: according to the target text selected by the user, obtain and display the content information of the target text.
In this embodiment, there are two ways to obtain and display the content information of the target text selected by the user. In the first, the identification number of the target text is determined from the key information of the target text selected by the user, and the content information of the target text is obtained and displayed according to the identification number. In the second, the identification number of the target text is determined from the key information of the target text selected by the user, the URL information of the target text is obtained according to the identification number, and the content information of the target text is obtained and displayed according to the URL information.
As described in the embodiment above, a text is stored on the terminal device in one of two ways, either as content information or as URL information, and each text has a unique identification number. In this embodiment, the key information of a text also uniquely identifies it. The terminal device establishes in advance a correspondence between the key information of each text and its identification number, and determines the identification number of the target text from the key information of the target text selected by the user. If the memory holds the content information of the target text, the terminal device reads the content information from memory according to the identification number and displays it directly on the screen. If the memory holds the URL information of the target text, the terminal device reads the URL information from memory according to the identification number, accesses the corresponding server according to the URL information, obtains from the server the content information of the target text corresponding to the URL information, and displays that content information on the screen.
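The two retrieval paths can be sketched as a single lookup function in Python. Everything here is illustrative: `open_text`, the `(title, author)` key, and the dictionary-based store are hypothetical, and the network access is passed in as a `fetch` callable so the sketch makes no assumption about how the server is reached.

```python
from typing import Callable, Dict, Tuple

Key = Tuple[str, str]  # (title, author), assumed to identify a text uniquely


def open_text(
    key: Key,
    id_by_key: Dict[Key, int],
    store: Dict[int, dict],
    fetch: Callable[[str], str],
) -> str:
    """Resolve key information to an identification number, then to displayable content."""
    text_id = id_by_key[key]   # correspondence established in advance
    record = store[text_id]
    if record.get("content") is not None:
        # First path: the content itself was saved, so read it from memory.
        return record["content"]
    # Second path: only the URL was saved, so ask the server for the current content.
    return fetch(record["url"])
```

Injecting `fetch` keeps the lookup logic testable without a network and leaves error handling for dead links, which the description identifies as the weakness of URL storage, to the caller.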
In this embodiment, storing the content information of the target text on the terminal device prevents broken links, while storing the URL information of the target text ensures that the user reads the most recently updated version, which keeps the target text current.
FIG. 3 is a structural diagram of a network text classification and integration device provided by an embodiment of the present invention. The device can execute the processing flow provided by the method embodiment. As shown in FIG. 3, the network text classification and integration device 30 includes a receiving module 31, a storage module 32, an acquisition module 33, and a classification table creation module 34. The receiving module 31 is configured to receive a collection instruction input by a user. The storage module 32 is configured to store content information or URL information of the currently displayed text according to the collection instruction. The acquisition module 33 is configured to obtain identification information and classification information of the currently displayed text. The classification table creation module 34 is configured to add, according to the classification information of the currently displayed text, an entry to the classification table corresponding to the classification information, where the entry includes the identification information of the currently displayed text.
The network text classification and integration device provided by this embodiment can be used to execute the method embodiment shown in FIG. 1, and its specific functions are not repeated here.
By classifying and storing texts that come from different application software, this embodiment makes it easy for users to query by category. A user can find a text by remembering any one of its source, type, or collection time. This improves the efficiency with which the user finds text information and, in turn, the efficiency with which the user reads it.
图4为本发明另一实施例提供的网络文本分类整合装置的结构图。在上述实施例的基础上,所述分类信息包括如下至少一种:来源信息、类型信息、时间信息;所述文本的识别信息包括所述文本的关键信息;获取模块33具体用于根据所述当前显示的文本的内容信息获取关键信息,所述关键信息包括如下至少一种:标题信息、作者信息、内容关键字;根据所述当前显示的文本的网址信息获取所述文本的来源信息;分析所述当前显示的文本的内容信息确定所述文本的类型信息;根据所述当前显示的文本的收藏时间、发布时间获取所述文本的时间信息。FIG. 4 is a structural diagram of a network text classification integration device provided by another embodiment of the present invention. On the basis of the above embodiments, the classification information includes at least one of the following: source information, type information, and time information; the identification information of the text includes key information of the text; the acquisition module 33 is specifically configured to The content information of the currently displayed text obtains key information, and the key information includes at least one of the following: title information, author information, and content keywords; obtains the source information of the text according to the URL information of the currently displayed text; analyzes The content information of the currently displayed text determines the type information of the text; and the time information of the text is acquired according to the collection time and publishing time of the currently displayed text.
所述接收模块31还用于接收用户输入的分类信息;所述获取模块33还用于依据所述分类信息获取分类表,所述分类表包括所述分类信息对应的文本的识别信息,以使所述用户依据所述文本的识别信息选择目标文本;依据用户选择的所述目标文本,获取所述目标本文的内容信息;所述网络文本分类整合装置30还包括显示模块35,所述显示模块35用于显示分类表;显示所述目标本文的内容信息。The receiving module 31 is also used to receive the classification information input by the user; the acquisition module 33 is also used to obtain a classification table according to the classification information, and the classification table includes the identification information of the text corresponding to the classification information, so that The user selects the target text according to the identification information of the text; according to the target text selected by the user, the content information of the target text is obtained; the network text classification integration device 30 also includes a display module 35, and the display module 35 is used to display the classification table; display the content information of the target article.
所述获取模块33具体用于依据用户选择的所述目标文本的关键信息确定所述目标文本的标识号;依据所述标识号获取所述目标本文的内容信息。The obtaining module 33 is specifically configured to determine the identification number of the target text according to the key information of the target text selected by the user; and obtain the content information of the target text according to the identification number.
所述获取模块33具体用于依据用户选择的所述目标文本的关键信息确定所述目标文本的标识号;依据所述标识号获取所述目标本文的网址信息;依据所述网址信息获取所述目标本文的内容信息。The obtaining module 33 is specifically used to determine the identification number of the target text according to the key information of the target text selected by the user; obtain the website address information of the target text according to the identification number; obtain the website address information according to the website information. The content information of the target article.
The web text classification integration device provided by this embodiment of the present invention can be used to execute the method embodiment shown in FIG. 2 above; its specific functions are not repeated here.
In this embodiment of the present invention, storing the content information of the target text on the terminal device prevents link failure, and storing the web address information of the target text on the terminal device ensures that the target text the user reads is the most recently updated version, which improves the timeliness of the target text.
In summary, the embodiments of the present invention classify and store texts from various application software, which makes it convenient for the user to search by category. The user only needs to remember any one of the source, the type, or the collection time of a text that has been read in order to find that text, which improves the efficiency with which the user finds text information and therefore the efficiency with which the user reads it. Storing the content information of the target text on the terminal device prevents link failure, and storing the web address information of the target text on the terminal device ensures that the target text the user reads is the most recently updated version, which improves the timeliness of the target text.
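The classified storage summarized above can be illustrated with a small sketch in which each collected text is entered into one classification table per category (source, type, time). The class name `ClassificationIndex` and its methods are hypothetical, the source is assumed to be the host part of the web address, the type is assumed to be supplied by some separate content analysis step, and the time category is assumed to have day granularity; none of these details is specified in the application.

```python
from collections import defaultdict
from datetime import date
from typing import Dict, List, Tuple
from urllib.parse import urlparse

Entry = Tuple[int, str]  # (identification number, key information such as a title)


class ClassificationIndex:
    """Classification tables keyed by (category kind, category value)."""

    def __init__(self) -> None:
        self._tables: Dict[Tuple[str, str], List[Entry]] = defaultdict(list)

    def add(self, text_id: int, title: str, url: str,
            text_type: str, collected_on: date) -> None:
        """Add one entry per classification category for a collected text."""
        entry = (text_id, title)
        source = urlparse(url).netloc  # assumption: source = host of the web address
        self._tables[("source", source)].append(entry)
        self._tables[("type", text_type)].append(entry)
        self._tables[("time", collected_on.isoformat())].append(entry)

    def lookup(self, kind: str, value: str) -> List[Entry]:
        """Return the classification table for one category value (empty if none)."""
        return list(self._tables.get((kind, value), []))


index = ClassificationIndex()
index.add(1, "Title A", "https://news.example.com/a/1", "technology", date(2016, 5, 27))
index.add(2, "Title B", "https://blog.example.org/b", "technology", date(2016, 5, 28))
```

With this layout, remembering any single attribute is enough to narrow the search: `index.lookup("type", "technology")` lists both texts, while `index.lookup("time", "2016-05-27")` lists only the first.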
In the several embodiments provided by the present invention, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in an actual implementation. For instance, multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist separately as a physical unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
An integrated unit implemented in the form of software functional units may be stored in a computer-readable storage medium. The software functional units are stored in a storage medium and include a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to execute some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
Those skilled in the art will clearly understand that, for convenience and brevity of description, only the division into the functional modules described above is used as an example. In practical applications, the above functions may be assigned to different functional modules as needed; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above. For the specific working process of the device described above, refer to the corresponding process in the foregoing method embodiments, which is not repeated here.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.
Claims (10)
- 1. A web text classification integration method, characterized by comprising: receiving a collection instruction input by a user; storing, according to the collection instruction, the content information or web address information of the currently displayed text; obtaining identification information and classification information of the currently displayed text; and adding, according to the classification information of the currently displayed text, an entry to the classification table corresponding to the classification information, the entry including the identification information of the currently displayed text.
- 2. The method according to claim 1, characterized in that the classification information includes at least one of the following: source information, type information, and time information; the identification information of the text includes key information of the text; and obtaining the identification information and classification information of the currently displayed text comprises: obtaining key information according to the content information of the currently displayed text, the key information including at least one of the following: title information, author information, and content keywords; obtaining the source information of the text according to the web address information of the currently displayed text; analyzing the content information of the currently displayed text to determine the type information of the text; and obtaining the time information of the text according to the collection time and publication time of the currently displayed text.
- 3. The method according to claim 2, characterized in that, after adding the entry to the classification table corresponding to the classification information according to the classification information of the currently displayed text, the method further comprises: receiving classification information input by the user; obtaining and displaying a classification table according to the classification information, the classification table including the identification information of the texts corresponding to the classification information, so that the user selects a target text according to the identification information of the texts; and obtaining and displaying the content information of the target text according to the target text selected by the user.
- 4. The method according to claim 3, characterized in that obtaining and displaying the content information of the target text according to the target text selected by the user comprises: determining the identification number of the target text according to the key information of the target text selected by the user; and obtaining and displaying the content information of the target text according to the identification number.
- 5. The method according to claim 3, characterized in that obtaining and displaying the content information of the target text according to the target text selected by the user comprises: determining the identification number of the target text according to the key information of the target text selected by the user; obtaining the web address information of the target text according to the identification number; and obtaining and displaying the content information of the target text according to the web address information.
- 6. A web text classification integration device, characterized by comprising: a receiving module, configured to receive a collection instruction input by a user; a storage module, configured to store, according to the collection instruction, the content information or web address information of the currently displayed text; an acquisition module, configured to obtain identification information and classification information of the currently displayed text; and a classification table creation module, configured to add, according to the classification information of the currently displayed text, an entry to the classification table corresponding to the classification information, the entry including the identification information of the currently displayed text.
- 7. The web text classification integration device according to claim 6, characterized in that the classification information includes at least one of the following: source information, type information, and time information; the identification information of the text includes key information of the text; and the acquisition module is specifically configured to: obtain key information according to the content information of the currently displayed text, the key information including at least one of the following: title information, author information, and content keywords; obtain the source information of the text according to the web address information of the currently displayed text; analyze the content information of the currently displayed text to determine the type information of the text; and obtain the time information of the text according to the collection time and publication time of the currently displayed text.
- 8. The web text classification integration device according to claim 7, characterized in that the receiving module is further configured to receive classification information input by the user; the acquisition module is further configured to obtain a classification table according to the classification information, the classification table including the identification information of the texts corresponding to the classification information, so that the user selects a target text according to the identification information of the texts, and to obtain the content information of the target text according to the target text selected by the user; and the web text classification integration device further comprises a display module, configured to display the classification table and to display the content information of the target text.
- 9. The web text classification integration device according to claim 8, characterized in that the acquisition module is specifically configured to determine the identification number of the target text according to the key information of the target text selected by the user, and to obtain the content information of the target text according to the identification number.
- 10. The web text classification integration device according to claim 8, characterized in that the acquisition module is specifically configured to determine the identification number of the target text according to the key information of the target text selected by the user, to obtain the web address information of the target text according to the identification number, and to obtain the content information of the target text according to the web address information.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610366352.8A CN107436907A (en) | 2016-05-27 | 2016-05-27 | Web text classification integration method and device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610366352.8A CN107436907A (en) | 2016-05-27 | 2016-05-27 | Web text classification integration method and device |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN107436907A (en) | 2017-12-05 |
Family
ID=60454541
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201610366352.8A Pending CN107436907A (en) | 2016-05-27 | 2016-05-27 | Web text classification integration method and device |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN107436907A (en) |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102298614A (en) * | 2011-07-29 | 2011-12-28 | 百度在线网络技术(北京)有限公司 | Method for determining collection category of page collection information and device and equipment |
| CN102486791A (en) * | 2010-12-06 | 2012-06-06 | 腾讯科技(深圳)有限公司 | Method and server for intelligently classifying bookmarks |
| CN102799610A (en) * | 2012-06-01 | 2012-11-28 | 北京奇乐客科技有限公司 | Method and system for collecting network information |
| CN102929963A (en) * | 2012-10-11 | 2013-02-13 | 北京百度网讯科技有限公司 | Setting method and system of website type |
| CN103324669A (en) * | 2013-05-20 | 2013-09-25 | 北京奇虎科技有限公司 | Method and client for processing web page bookmark |
| CN103559288A (en) * | 2013-11-08 | 2014-02-05 | 惠州Tcl移动通信有限公司 | Method and mobile terminal for intelligent collecting and sharing |
2016
- 2016-05-27 CN CN201610366352.8A patent/CN107436907A/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102486791A (en) * | 2010-12-06 | 2012-06-06 | 腾讯科技(深圳)有限公司 | Method and server for intelligently classifying bookmarks |
| CN102298614A (en) * | 2011-07-29 | 2011-12-28 | 百度在线网络技术(北京)有限公司 | Method for determining collection category of page collection information and device and equipment |
| CN102799610A (en) * | 2012-06-01 | 2012-11-28 | 北京奇乐客科技有限公司 | Method and system for collecting network information |
| CN102929963A (en) * | 2012-10-11 | 2013-02-13 | 北京百度网讯科技有限公司 | Setting method and system of website type |
| CN103324669A (en) * | 2013-05-20 | 2013-09-25 | 北京奇虎科技有限公司 | Method and client for processing web page bookmark |
| CN103559288A (en) * | 2013-11-08 | 2014-02-05 | 惠州Tcl移动通信有限公司 | Method and mobile terminal for intelligent collecting and sharing |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9152722B2 (en) | | Augmenting online content with additional content relevant to user interest |
| US11036744B2 (en) | | Personalization of news articles based on news sources |
| US9317613B2 (en) | | Large scale entity-specific resource classification |
| US9300755B2 (en) | | System and method for determining information reliability |
| US9268873B2 (en) | | Landing page identification, tagging and host matching for a mobile application |
| TWI479339B (en) | | Theme-updated system, computer readable storage medium and device |
| US10825110B2 (en) | | Entity page recommendation based on post content |
| US20110167053A1 (en) | | Visual and multi-dimensional search |
| TW201514845A (en) | | Title and body extraction from web page |
| US9660947B1 (en) | | Method and apparatus for filtering undesirable content based on anti-tags |
| US10176265B2 (en) | | Awareness engine |
| CN103136228A (en) | | Image search method and image search device |
| CN110232126B (en) | | Hot spot mining method and server and computer-readable storage medium |
| JPWO2003046764A1 (en) | | Information analysis method and apparatus |
| US20120330932A1 (en) | | Presenting supplemental content in context |
| US9460165B2 (en) | | Retrieval device, retrieval system, retrieval method, retrieval program, and computer-readable recording medium storing retrieval program |
| WO2014180130A1 (en) | | Method and system for recommending contents |
| CN107911448B (en) | | Content pushing method and device |
| CN104090923B (en) | | Method and device for displaying rich media information in a browser |
| CN104090757A (en) | | Method and device for displaying rich media information in browser |
| CN113407818B (en) | | Automatic Information Retrieval |
| CN113821596A (en) | | Information recommendation method and device, computer equipment and storage medium |
| US20130132368A1 (en) | | Large scale analytical reporting from web content |
| CN108280102B (en) | | Internet surfing behavior recording method and device and user terminal |
| CN103631796A (en) | | Web site classification management method and electronic device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | RJ01 | Rejection of invention patent application after publication | Application publication date: 20171205 |