
CN101853253A - Device and method for managing multimedia content in mobile terminal - Google Patents


Info

Publication number
CN101853253A
Authority
CN
China
Prior art keywords
unit
multimedia content
text data
text
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200910128310A
Other languages
Chinese (zh)
Inventor
朱璇
史媛媛
邓菁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Samsung Telecommunications Technology Research Co Ltd
Samsung Electronics Co Ltd
Original Assignee
Beijing Samsung Telecommunications Technology Research Co Ltd
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Samsung Telecommunications Technology Research Co Ltd and Samsung Electronics Co Ltd
Priority to CN200910128310A
Publication of CN101853253A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An apparatus and method for managing multimedia content in a mobile terminal are provided. The apparatus includes: a data acquisition unit that converts multimedia content into text data and extracts, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content; a database building unit that builds a multimedia content database based on the keyword values obtained by the data acquisition unit; a storage unit that stores the multimedia content database built by the database building unit; a query input unit that recognizes query information input by a user as text data and extracts key search terms from the recognized text data; a search unit that, based on the search terms extracted by the query input unit, searches the multimedia content database stored in the storage unit for multimedia content related to the search terms; and a search result output unit that outputs the search results of the search unit to the user.

Description

Device and method for managing multimedia content in a mobile terminal

Technical Field

The present invention relates to a device and method for managing multimedia content in a mobile terminal. More specifically, it relates to a device and method that uniformly convert multimedia content into text data and thereby build a multimedia content database through which the multimedia content can be managed and queried.

Background Art

With the development of communication technology and digital signal processing technology, mobile terminals can perform various functions beyond basic voice calls, such as taking photos, sending messages, sending and receiving e-mail, and GPS (Global Positioning System) positioning. Using these functions generates and transmits a large amount of multimedia content or information of various kinds, such as short messages, e-mails, photos, and voice calls. Because each kind of multimedia content has its own data format, it is difficult to manage information uniformly across different kinds of multimedia content.

In addition, mobile terminals are now so widespread that they have become the main means by which people contact one another. As a result, not only is a large amount of contact information stored in the address book of a mobile terminal, but the various interpersonal relationships among contacts are also reflected in the multimedia content itself. For example, an e-mail sent from A to B may mention C, while a group photo of B and C may be stored in B's mobile terminal. Likewise, a voice call or short message between two contacts may mention a further contact. However, existing mobile terminals cannot properly reflect or query the interpersonal relationships embodied in such multimedia content. A technical solution is therefore needed that can reflect the interpersonal relationships among contacts by managing the multimedia content in a mobile terminal.

Summary of the Invention

An object of the present invention is to provide a device and method for managing various kinds of media information in a mobile terminal. With the device and method, the various kinds of multimedia information are converted into a unified text form and searched on the basis of that unified form, and the interpersonal relationships among contacts that are embodied in a large amount of multimedia information are also well reflected.

According to one aspect of the present invention, there is provided a device for managing multimedia content in a mobile terminal, including: a data acquisition unit for converting multimedia content into text data and extracting, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content; a database building unit for building a multimedia content database based on the keyword values obtained by the data acquisition unit; a storage unit for storing the multimedia content database built by the database building unit; a query input unit for recognizing query information input by a user as text data and extracting key search terms from the recognized text data; a search unit for searching, based on the search terms extracted by the query input unit, the multimedia content database stored in the storage unit for multimedia content related to the search terms; and a search result output unit for outputting the search results of the search unit to the user.

The data acquisition unit includes: a text conversion unit for converting multimedia content into text data; and a text analysis unit for extracting, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content.

The text conversion unit includes at least one of the following: a short message conversion unit for converting short messages into text data; an e-mail conversion unit for converting e-mails into text data; a voice call conversion unit for converting voice calls into text data; a scene classification and clustering unit for classifying a photo according to predetermined categories, finding photos similar to that photo through clustering, and recording the scene classification and clustering results of the photo as text data; a face recognition unit for recognizing the facial features of each person in a photo and the total number of people, and recording the face recognition results as text data; and an electronic map conversion unit for converting position information from a positioning system into text data with reference to an electronic map.

The text analysis unit includes: a word segmentation unit for dividing the text body output by the text conversion unit into a plurality of words; a part-of-speech determination unit for determining the parts of speech of the words produced by the word segmentation unit; a person name extraction unit for extracting person names from the text body according to the output of the part-of-speech determination unit; a time extraction unit for extracting time words from the text body according to the output of the part-of-speech determination unit; an important word extraction unit for extracting important words from the text body according to the output of the part-of-speech determination unit; and an other keyword value extraction unit for extracting keyword values from the information, other than the text body, output by the text conversion unit.

The device further includes a time information parsing unit for converting the time words extracted by the time extraction unit into time information.

The device further includes a synonym generation unit for generating synonyms of the important words extracted by the important word extraction unit.

The database building unit builds the multimedia content database based on keywords, keyword values, and related attribute descriptions.

The database building unit also builds a contact database, and the contact records of the contact database are updated along with the multimedia content database.

The query input unit includes: a text recognition unit for converting query information that the user inputs through a keyboard or handwriting pad into text data; a text capture unit and an optical character recognition (OCR) unit for capturing and recognizing character data on a queried photo; a face recognition unit for recognizing the facial features of each person in a query photo and recording the face recognition results as text data; and a speech recognition unit for converting a voice query input by the user into text data.

The search result output unit outputs the search results of the search unit to the user in a visualized manner.

The search unit, based on the search terms extracted by the query input unit and with reference to the contact database, searches the multimedia content database stored in the storage unit for multimedia content related to the search terms.

The search unit searches for the mutual relationships of contacts in the multimedia content based on the multimedia content database and the contact database.

If the query input unit extracts multiple search terms, the search unit searches on those terms according to a predetermined logical relationship or a logical relationship set by the user.

According to another aspect of the present invention, there is provided a method for managing multimedia content in a mobile terminal, including: converting multimedia content into text data and extracting, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content; building a multimedia content database based on the keyword values; storing the built multimedia content database; recognizing query information input by a user as text data and extracting key search terms from the recognized text data; searching, based on the extracted search terms, the stored multimedia content database for multimedia content related to the search terms; and outputting the search results to the user.

The step of converting multimedia content into text data includes at least one of the following steps: converting a short message into text data; converting an e-mail into text data; converting a voice call into text data; classifying a photo according to predetermined categories, finding photos similar to that photo through clustering, and recording the classification and clustering results of the photo as text data; recognizing the facial features of each person in a photo and the total number of people, and recording the recognition results as text data; and converting position information from a positioning system into text data with reference to an electronic map.

The step of extracting keyword values from the converted text data according to the predetermined keywords includes: dividing the text body into a plurality of words; determining the parts of speech of the words; extracting person names from the text body according to the parts of speech of the words; extracting time words from the text body according to the parts of speech of the words; extracting important words from the text body according to the parts of speech of the words; and extracting keyword values from information other than the text body.

The method further includes converting the extracted time words into time information.

The method further includes generating synonyms of the extracted important words.

A multimedia content record in the multimedia content database includes keywords, keyword values, and related attribute descriptions.

The method further includes building a contact database, the contact records of which are updated along with the multimedia content database.

The step of recognizing the query information input by the user as text data includes at least one of the following steps: converting query information that the user inputs through a keyboard or handwriting pad into text data; capturing and recognizing character data on a queried photo; recognizing the facial features of each person in a query photo and recording the recognition results as text data; and converting a voice query input by the user into text data.

In the step of outputting the search results to the user, the search results are output in a visualized manner.

The method further includes searching, based on the extracted search terms and with reference to the contact database, the stored multimedia content database for multimedia content related to the search terms.

The method further includes searching for the mutual relationships of contacts in the multimedia content based on the multimedia content database and the contact database.

The method further includes, if multiple search terms are extracted, searching on those terms according to a predetermined logical relationship or a logical relationship set by the user.

Brief Description of the Drawings

The above and/or other objects and advantages of the present invention will become clearer from the following description of the embodiments, taken in conjunction with the accompanying drawings, in which:

FIG. 1 is a block diagram showing a device for managing multimedia content in a mobile terminal according to an exemplary embodiment of the present invention;

FIG. 2 is a flowchart showing a method of managing multimedia content in a mobile terminal according to an exemplary embodiment of the present invention;

FIG. 3 is a diagram showing the detailed structure of the data acquisition unit in the device shown in FIG. 1;

FIG. 4 shows a photo input to the data acquisition unit shown in FIG. 3;

FIG. 5 shows an example of contact photos stored in the mobile terminal;

FIG. 6 is a diagram showing the detailed structure of the query input unit in the device shown in FIG. 1; and

FIG. 7 shows an example of search results output by the search result output unit in the device shown in FIG. 1.

Detailed Description

Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, in which like reference numerals refer to like parts throughout. The embodiments are described below with reference to the drawings in order to explain the present invention.

FIG. 1 is a block diagram showing a device for managing multimedia content in a mobile terminal according to an exemplary embodiment of the present invention. As shown in FIG. 1, the device includes: a data acquisition unit 10 for converting multimedia content into text data and extracting, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content; a database building unit 20 for building a multimedia content database based on the keyword values obtained by the data acquisition unit 10; a storage unit 30 for storing the multimedia content database built by the database building unit 20; a query input unit 40 for recognizing query information input by a user as text data and extracting key search terms from the recognized text data; a search unit 50 for searching, based on the search terms extracted by the query input unit 40, the multimedia content database stored in the storage unit 30 for multimedia content related to the search terms; and a search result output unit 60 for outputting the search results of the search unit 50 to the user.

An example of implementing the method for managing multimedia content according to the present invention with the device shown in FIG. 1 is described below with reference to FIG. 2.

FIG. 2 is a flowchart showing a method of managing multimedia content in a mobile terminal according to an exemplary embodiment of the present invention. Referring to FIG. 2, in step S100 the data acquisition unit 10 converts multimedia content into text data and extracts, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content. In step S200 the database building unit 20 builds a multimedia content database based on the keyword values obtained by the data acquisition unit 10 and stores the database in the storage unit 30. In step S300 the query input unit 40 recognizes query information input by the user as text data and extracts key search terms from the recognized text data. In step S400 the search unit 50, based on the search terms extracted by the query input unit 40, searches the multimedia content database stored in the storage unit 30 for multimedia content related to the search terms. Then, in step S500, the search result output unit 60 outputs the search results of the search unit 50 to the user.
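The flow of steps S100 to S500 can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the class and method names are hypothetical, "conversion" is the identity on plain strings, and keyword values are simply the lowercase words of the text, which is far cruder than the units described in this document.

```python
from dataclasses import dataclass, field


@dataclass
class ContentManager:
    """Toy stand-in for units 10 to 60; all names are hypothetical."""
    database: list = field(default_factory=list)  # plays the role of storage unit 30

    def acquire(self, text: str) -> dict:
        # S100: "convert" content to text and extract keyword values (here: lowercase words).
        return {"text": text, "values": set(text.lower().split())}

    def build(self, items: list) -> None:
        # S200: build the multimedia content database and keep it in storage.
        self.database = [self.acquire(item) for item in items]

    def parse_query(self, query: str) -> set:
        # S300: recognize the query as text and extract search terms.
        return set(query.lower().split())

    def search(self, terms: set) -> list:
        # S400: keep records that share at least one keyword value with the query.
        return [rec["text"] for rec in self.database if rec["values"] & terms]

    def run(self, query: str) -> list:
        # S300 -> S400 -> S500 (the returned list is the "output").
        return self.search(self.parse_query(query))
```

For instance, after `build(["movie tonight with Xiaoli", "meeting at noon"])`, calling `run("movie")` returns only the first item.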

The components of the device shown in FIG. 1 and their specific operations are described below with reference to FIGS. 3 to 7.

FIG. 3 is a diagram showing the detailed structure of the data acquisition unit 10 in the device shown in FIG. 1. As an example, the data acquisition unit 10 shown in FIG. 3 may include: a text conversion unit 101 for converting multimedia content into text data; and a text analysis unit 102 for extracting, from the converted text data, keyword values corresponding to predetermined keywords that reflect key attributes of the multimedia content.

According to an exemplary embodiment of the present invention, the text conversion unit 101 converts various kinds of multimedia content into text data. As an example, for short messages the text conversion unit 101 includes a short message conversion unit 1011 for converting short messages into text data, and for e-mails it includes an e-mail conversion unit 1012 for converting e-mails into text data. The short message conversion unit 1011 and the e-mail conversion unit 1012 are designed to convert the complete short message or e-mail into data in text format, including the sender, recipient, CC recipients, title, body, and sending or receiving time.

For voice calls, the text conversion unit 101 includes a voice call conversion unit 1013 for converting voice calls into text data. The voice call conversion unit 1013 can be built with existing speech recognition technology. For example, it may recognize the content of a voice call based on a pre-built language model and acoustic model and convert that content into text data. Besides the content of the call, the voice call conversion unit 1013 also converts the caller, the called party, the time, and so on into text data.

For photos, the text conversion unit 101 includes a scene classification and clustering unit 1014, which classifies a photo according to predetermined categories and finds photos similar to it through clustering. The predetermined categories are preset photo categories, for example portraits (single-person photos, group photos, collective photos), scenery (landscapes, beaches, sky), and objects (flowers, still life). Clustering uses common pattern recognition algorithms: features are extracted from the photo, the similarity between photos is measured according to some distance measure, and photos separated by small distances are grouped into one class. The scene classification and clustering unit 1014 records the scene classification and clustering results of the photo as text data. The text conversion unit 101 also includes a face recognition unit 1015 for recognizing the facial features of each person in a photo and the total number of people. Preferably, the people in the photo being converted are identified with reference to the photos in the contact database (described later) or to a specific portrait database. The face recognition unit 1015 records the face recognition results as text data.

Preferably, the text conversion unit 101 further includes an electronic map conversion unit 1016 for converting position information from a positioning system into text data with reference to an electronic map, so that the address at which multimedia content was generated or received is available as text data. Those skilled in the art will understand that the units above are only examples and do not limit the present invention. The kinds of multimedia content are in fact flexible and varied, a corresponding text conversion unit can be provided for each kind of multimedia content that needs to be managed, and any existing technique that achieves the text conversion function can be applied to the present invention.
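The clustering idea described for unit 1014 (group photos whose feature vectors are close under some distance measure) can be sketched as follows. The source does not name a specific algorithm, so this sketch swaps in a simple greedy "leader" clustering with Euclidean distance; the feature vectors, the threshold, and the function name are all hypothetical.

```python
import math


def cluster_photos(features, threshold):
    """Greedy leader clustering: each photo joins the first cluster whose
    leader (first member) lies within `threshold`, else starts a new cluster.
    `features` is a list of equal-length numeric tuples, one per photo.
    Returns a list of clusters, each a list of photo indices."""
    clusters = []
    for i, vec in enumerate(features):
        for members in clusters:
            leader = features[members[0]]
            if math.dist(vec, leader) <= threshold:
                members.append(i)
                break
        else:
            # No existing cluster is close enough: this photo becomes a new leader.
            clusters.append([i])
    return clusters
```

The resulting index groups could then be written out as text (for example a cluster label per photo), which is the form in which the scene classification and clustering results are recorded. The outcome of a greedy scheme like this depends on input order, so it is only meant to make the distance-threshold idea concrete.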

According to an exemplary embodiment of the present invention, the text analysis unit 102 includes: a word segmentation unit 1021 for dividing the text body output by the text conversion unit 101 into a plurality of words; a part-of-speech determination unit 1022 for determining the parts of speech of the words produced by the word segmentation unit 1021; a person name extraction unit 1023 for extracting person names from the text body according to the output of the part-of-speech determination unit 1022; a time extraction unit 1024 for extracting time words from the text body according to the output of the part-of-speech determination unit 1022, preferably followed by a time information parsing unit 1026 for converting the time words extracted by the time extraction unit 1024 into time information; an important word extraction unit 1025 for extracting important words from the text body according to the output of the part-of-speech determination unit 1022, preferably followed by a synonym generation unit (not shown) that generates synonyms of the important words extracted by the important word extraction unit 1025 so that searching is more intelligent and comprehensive; and an other keyword value extraction unit 1027 for extracting keyword values from the information, other than the text body, output by the text conversion unit 101 (such as the sender, recipient, CC recipients, title, sending or receiving time, and place).
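The chain of segmentation, part-of-speech tagging, and extraction can be sketched in Python. The source does not specify the segmentation or tagging method, so this sketch assumes a tiny hand-written lexicon and forward maximum matching, a common baseline for Chinese word segmentation. The lexicon, the tag names, and the function names are hypothetical; the sample sentence is the original Chinese short message from the example used later in this document.

```python
# Hypothetical lexicon mapping each known word to a part-of-speech tag.
LEXICON = {
    "小丽": "name", "晚上": "time", "电影": "noun",
    "有空": "verb", "一起": "adverb", "去": "verb", "看": "verb",
}
MAX_LEN = max(len(word) for word in LEXICON)


def segment(text):
    """Forward maximum matching: at each position take the longest lexicon
    word; unknown characters become single tokens tagged 'other'."""
    tokens, i = [], 0
    while i < len(text):
        for n in range(min(MAX_LEN, len(text) - i), 0, -1):
            word = text[i:i + n]
            if word in LEXICON:
                tokens.append((word, LEXICON[word]))
                i += n
                break
        else:
            tokens.append((text[i], "other"))
            i += 1
    return tokens


def extract(text):
    """Pick person names, time words, and important words (here: nouns) by tag."""
    tokens = segment(text)
    return {
        "names": [w for w, tag in tokens if tag == "name"],
        "times": [w for w, tag in tokens if tag == "time"],
        "important": [w for w, tag in tokens if tag == "noun"],
    }
```

Treating every noun as an "important word" is a simplification made for this sketch; the source leaves the selection criterion open.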

As described above, the data acquisition unit 10 converts the various kinds of multimedia content into text data, extracts from the converted text data the keyword values corresponding to the predetermined keywords that reflect key attributes of the multimedia content, and then sends the data acquisition result obtained for each item of multimedia content to the database building unit 20. The database building unit 20 then builds the multimedia content database based on the output of the data acquisition unit 10. As an example, each item of multimedia content is recorded in the multimedia content database in terms of the predetermined keywords that reflect its key attributes, the corresponding keyword values, and related attribute descriptions. Each multimedia content record has its own index, which can take the form "letter + number", where the letter indicates the media type and the number is the sequence number of the record within that media type.
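A minimal sketch of the "letter + number" index scheme follows. The source does not say which letter stands for which media type, so the mapping below is purely an assumption, as are the class and method names.

```python
from collections import defaultdict

# Hypothetical letter codes; the source only says the letter denotes the media type.
TYPE_LETTERS = {
    "short message": "S",
    "email": "E",
    "voice call": "V",
    "photo": "P",
}


class IndexAllocator:
    """Hands out indexes such as 'S1', 'S2', 'P1', counting per media type."""

    def __init__(self):
        self._counters = defaultdict(int)

    def next_index(self, media_type):
        letter = TYPE_LETTERS[media_type]
        self._counters[letter] += 1
        return f"{letter}{self._counters[letter]}"
```

Keeping a separate counter per letter is what makes the number a sequence number within the media type rather than across the whole database.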

For example, consider the short message "Xiaoli, are you free this evening? Shall we go to see a movie?" received from Zhang San at the International Trade Center at 19:55 on August 3, 2008. After this item of multimedia content is processed by the data acquisition unit 10, the following is obtained: the receiving time of the short message is "2008-8-3 19:55", the receiving place is "International Trade Center", the sender is "Zhang San", the person name in the body is "Xiaoli", and the time in the body is "evening", which the time information parsing unit 1026 can resolve to "2008-8-3 18:00-24:00". The important word in the body is "movie" (电影), its part of speech is noun, and the synonym generation unit can generate the synonym "film" (影片) for it. Based on these keywords and keyword values, the database building unit 20 generates the corresponding multimedia content record in the multimedia content database, as shown in Table 1.
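How a time word such as "evening" might be resolved against the message's receiving time can be sketched as below. Only the "evening" range (18:00-24:00) comes from the example; the lookup table design and the function name are assumptions, and 24:00 is represented as 00:00 of the following day because Python's `datetime` has no hour 24.

```python
from datetime import datetime, time, timedelta

# Hypothetical lookup table; only the "evening" entry is taken from the example.
TIME_WORDS = {"evening": (18, 24)}


def resolve_time_word(word, reference):
    """Map a time word to a (start, end) pair on the reference date."""
    start_hour, end_hour = TIME_WORDS[word]
    midnight = datetime.combine(reference.date(), time.min)
    return (midnight + timedelta(hours=start_hour),
            midnight + timedelta(hours=end_hour))
```

With the receiving time of the example message as the reference, "evening" resolves to an interval starting at 18:00 on 2008-8-3 and ending at midnight.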

Table 1. Example record of a received short message in the multimedia content database

As shown in Table 1, the keyword <media type> has the keyword value "short message", and its related attribute description indicates whether the short message was sent or received; in this example it was received. The keyword <sender><person name> has the keyword value "Zhang San". Its related attribute description first indicates whether "Zhang San" is a contact of the mobile terminal: "contact" = Y if so, and "contact" = N if not. Preferably, when "contact" = Y, the related attribute description further gives the index of that contact in the contact database (described later). The keyword <time> has the keyword value "2008-8-3 19:55". The keyword <location> has the keyword value "International Trade Center", and its related attribute description indicates whether this is the sending location or the receiving location; in this example it is the receiving location. The keyword <body><person name> has the keyword value "Xiaoli", and its related attribute description has the same content as that of <sender><person name>. The keyword <body><time> has the keyword value "evening"; preferably, its related attribute description gives the parsed result of the time word, "2008-8-3 18:00-24:00". The keyword <body><important word> has the keyword value "movie"; its related attribute description indicates the part of speech of "movie" and, preferably, also its synonym, "film".
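As a rough illustration, the record described for Table 1 could be held in a structure of keyword, keyword value, and related attribute description. The class and field names below are hypothetical, the indices "s1" and "n1" are made up, and marking Zhang San as a contact is an assumption; the contact flag for "Xiaoli" is left out because the text does not state it.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    keyword: tuple                                   # keyword path, e.g. ("sender", "person name")
    value: str                                       # keyword value
    attributes: dict = field(default_factory=dict)   # related attribute description


@dataclass
class ContentRecord:
    index: str
    entries: list = field(default_factory=list)

    def values_for(self, *keyword):
        """Return every keyword value stored under the given keyword path."""
        return [e.value for e in self.entries if e.keyword == keyword]


# The received short message from the example (index and contact data are assumed).
table1 = ContentRecord("s1", [
    Entry(("media type",), "short message", {"direction": "received"}),
    Entry(("sender", "person name"), "Zhang San", {"contact": "Y", "contact index": "n1"}),
    Entry(("time",), "2008-8-3 19:55"),
    Entry(("location",), "International Trade Center", {"attribute": "receiving location"}),
    Entry(("body", "person name"), "Xiaoli"),
    Entry(("body", "time"), "evening", {"parsed": "2008-8-3 18:00-24:00"}),
    Entry(("body", "important word"), "movie", {"part of speech": "noun", "synonyms": ["film"]}),
])
```

Keeping the attribute description as a free-form mapping reflects the statement that keywords and their attributes can be added or removed per media type.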

Similarly, consider the short message "The movie tickets are booked, see you this evening" sent to Zhang San from the International Trade Center at 20:00 on August 3, 2008. After processing by the data acquisition unit 10, the following is obtained: the sending time of the short message is "2008-8-3 20:00"; the sending location is "International Trade Center"; the receiver is "Zhang San"; the time in the body is "evening", which the time information parsing unit 1026 can parse as "2008-8-3 18:00-24:00"; the important word in the body is "movie", a noun, for which the synonym generation unit can generate the synonym "film"; and another important word in the body is "movie ticket", also a noun. Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 2.

Table 2. Example record of a sent short message in the multimedia content database

Figure B2009101283100D0000091

As another example, consider the e-mail titled "Express delivery has arrived", received from Li Si at the International Trade Center at 9:38 on September 5, 2008 and copied to Xinxin, with the body "Hello Xiaoli! The express delivery you sent me has arrived, and all the goods are intact. Thank you! Li Si". After processing by the data acquisition unit 10, the following is obtained: the receiving time of the e-mail is "2008-9-5 9:38"; the receiving location is "International Trade Center"; the sender is "Li Si"; the receiver is "Xiaoli"; the cc recipient is "Xinxin"; the important word in the title is "express delivery", a noun; the person names in the body are "Xiaoli" and "Li Si"; the important word in the body is "express delivery", a noun; and another important word in the body is "goods", also a noun. Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 3.

Table 3. Example record of a received e-mail in the multimedia content database

Figure B2009101283100D0000101

Similarly, consider the e-mail titled "Recommending a good movie", sent to Xinxin from Wangjing New City at 19:20 on August 4, 2008, with the body "I went to see 'xxx' with Zhang San yesterday and it was quite good. You could go and see it with Xiaoan when you have time". After processing by the data acquisition unit 10, the following is obtained: the sending time of the e-mail is "2008-8-4 19:20"; the sending location is "Wangjing New City"; the sender is "Xiaoli"; the receiver is "Xinxin"; the important word in the title is "movie"; the person names in the body are "Zhang San" and "Xiaoan"; and the time in the body is "yesterday", which the time information parsing unit 1026 can parse as "2008-8-3 00:00-24:00". Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 4.

Table 4. Example record of a sent e-mail in the multimedia content database

Figure B2009101283100D0000102
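The time-word parsing used in the examples ("evening" in the short messages, "yesterday" in this e-mail) can be sketched as below. The description does not say how the time information parsing unit 1026 works internally; the function name `resolve_time_word` is hypothetical, and only the two rules that appear in the examples are encoded, relative to the date on which the content was sent or received.

```python
from datetime import date, timedelta


def resolve_time_word(word: str, reference: date) -> str:
    """Map a relative time word to a "Y-M-D HH:MM-HH:MM" range string."""
    if word == "evening":
        # Same day as the message, 18:00 to 24:00.
        day, span = reference, "18:00-24:00"
    elif word == "yesterday":
        # The whole day before the message date.
        day, span = reference - timedelta(days=1), "00:00-24:00"
    else:
        raise ValueError(f"no rule for time word: {word!r}")
    return f"{day.year}-{day.month}-{day.day} {span}"
```

For instance, `resolve_time_word("yesterday", date(2008, 8, 4))` yields the range given for the e-mail above.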

As another example, consider the voice call placed to Li Si from the International Trade Center from 10:15 to 10:19 on September 5, 2008: "Me: Hello, is that Li Si? Li Si: Hello, is that Xiaoli? Me: The loan you transferred has arrived in my account. Thanks for your trouble. Li Si: You're welcome. Me: Let's get in touch again when we're free." After processing by the data acquisition unit 10, the following is obtained: the start time of the voice call is "2008-9-5 10:15"; the end time is "2008-9-5 10:19"; the sending location is "International Trade Center"; the sender is "Xiaoli"; the receiver is "Li Si"; the person names in the body are "Li Si" and "Xiaoli"; and the important word in the body is "loan", a noun. Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 5.

Table 5. Example record of an outgoing voice call in the multimedia content database

Figure B2009101283100D0000112

In Table 5, because a voice call lasts for a period of time, the keyword "time" has two specific moments; the related attribute description of the time therefore has to indicate which moment is the start time and which is the end time.

As another example, consider the photo shown in (a) of FIG. 4, taken at 14:44 on September 1, 2008 at the Chang'an Grand Theater, with Xinxin on the left and Xiaoan on the right. After the photo is processed by the data acquisition unit 10, the following is obtained: the shooting time is "2008-9-1 14:44"; the shooting location is "Chang'an Grand Theater"; the photo category is "portrait"; the two people in the photo are "Xinxin" and "Xiaoan"; and the classification and clustering unit 1014 can also identify that two photos similar to this one exist. Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 6.

Table 6. Example record of the photo shown in (a) of FIG. 4 in the multimedia content database

Figure B2009101283100D0000121

In Table 6, "quantity" indicates the number of people in the portrait, and p1 and p2 are the indices in the multimedia content database of the two photos similar to this photo.

Similarly, consider the photo shown in (b) of FIG. 4, taken at 10:00 on October 3, 2008 at the Basongcuo Scenic Area. After the photo is processed by the data acquisition unit 10, the following is obtained: the shooting time is "2008-10-3 10:00"; the shooting location is "Basongcuo Scenic Area"; the photo category is "landscape"; and the classification and clustering unit 1014 can also identify that one photo similar to this one exists. Based on these keywords and keyword values, the database building unit 20 generates a corresponding multimedia content record in the multimedia content database, as shown in Table 7.

Table 7. Example record of the photo shown in (b) of FIG. 4 in the multimedia content database

| Keyword | Keyword value | Related attribute description |
|---|---|---|
| <media type> | photo | |
| <photo category> | landscape | |
| <similar photos> | 1 | "1" = p3 |
| <time> | 2008-10-3 10:00 | |
| <location> | Basongcuo Scenic Area | "attribute" = shooting location |

In Table 7, p3 is the index in the multimedia content database of the photo similar to this photo.

The keywords of the records above can be set flexibly according to the type of multimedia content and the search requirements; for example, some keywords can be removed and new ones added. The present invention is not limited to the examples given in Tables 1 to 7. The database building unit 20 stores the multimedia content database built as described above in the storage unit 30.

As Tables 1 to 7 show, the multimedia content in a mobile terminal often involves connections among contacts: the relationships between different contacts are reflected in, for example, the content of short messages, the cc recipients and content of e-mails, the content of calls, and the people in photos. Because the embodiments of the present invention convert this multimedia content into text data and build a corresponding database, these relationships between contacts are clearly reflected, which makes it easier for the user of the mobile terminal to manage his or her interpersonal resources.

To further associate the interpersonal relationships involved in the multimedia content with the terminal user's contacts, the present invention optionally uses the database building unit 20 to establish a relationship between the contact database and the multimedia content database. Specifically, each contact record in the contact database additionally records information about the multimedia content and the other contacts that the contact is involved with.

For example, the photo of the contact Xinxin in the contact database is shown in (a) of FIG. 5, and in this embodiment Xinxin's index in the contact database is "n3". Each contact in the contact database is recorded by predetermined keywords and their corresponding keyword values. In addition to one or more conventional keywords such as name, avatar, mobile phone, office phone, home phone, ringtone, e-mail, and address, the embodiments of the present invention define the keywords "communication record", "media record", and "association record" specifically for the multimedia content database. "Communication record" gives the number of times the contact appears as <sender>, <receiver>, or <cc recipient> in the records of the multimedia content database, and in which records. "Media record" gives the number of times the contact appears in other positions, such as <title>, <body>, or <portrait>, and in which records. "Association record" gives, based on the records in the multimedia content database, the number of times the contact has been associated with each other contact: whenever the contact and another contact are associated in a multimedia content record, the variable counting the associations between the two is incremented by 1. The keyword values of "communication record", "media record", and "association record" are thus updated in real time as multimedia content records are added. Specifically, the database building unit 20 is responsible both for building the multimedia content database and for updating the contact database as records are added to the multimedia content database.
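The update of the three contact keywords when a multimedia content record is added might look like the following sketch. The names `ContactRecord` and `update_contacts` are hypothetical, and treating every pair of contacts that appear in the same record as one association is an assumed reading of "associated in a multimedia content record".

```python
from collections import Counter
from dataclasses import dataclass, field
from itertools import permutations


@dataclass
class ContactRecord:
    name: str
    communication: list = field(default_factory=list)      # records where contact is sender/receiver/cc
    media: list = field(default_factory=list)              # records where contact is in title/body/portrait
    association: Counter = field(default_factory=Counter)  # other contact index -> association count


def update_contacts(contacts, record_index, parties, mentions):
    """Update contact records for one newly added multimedia content record.

    contacts: dict mapping a contact index (e.g. "n3") to its ContactRecord
    parties:  contact indices appearing as sender, receiver or cc recipient
    mentions: contact indices appearing in the title, body or portrait
    """
    for idx in parties:
        contacts[idx].communication.append(record_index)
    for idx in mentions:
        contacts[idx].media.append(record_index)
    involved = sorted(set(parties) | set(mentions))
    # Each ordered pair (a, b) increments a's counter for b exactly once.
    for a, b in permutations(involved, 2):
        contacts[a].association[b] += 1
```

The "quantity" value of each keyword is then simply the length of the corresponding list.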

As an example, Tables 8 and 9 below show the records "Xinxin" and "Xiaoan" in the contact database; the index of the record "Xinxin" is n3 and the index of the record "Xiaoan" is n4.

Table 8. The record "Xinxin" in the contact database

| Keyword | Keyword value |
|---|---|
| <name> | Xinxin |
| <avatar> | As shown in (a) of FIG. 5 |
| <mobile phone> | 13800000000 |
| <office phone> | 65799989 |
| <home phone> | 89825643 |
| <ringtone> | Xxx.mp3 |
| <e-mail> | Xinxin@abc.com |
| <address> | Room x, Unit x, Building xx, xx Community, xx Road, xx District, Xx City |
| <communication record> | "quantity" = 35, "1" = e8, "2" = s2, "3" = c9, ... |
| <media record> | "quantity" = 28, "1" = p5, "2" = e20, "3" = c9, ... |
| <association record> | "n1" = 10, "n2" = 3, "n4" = 20, "n5" = 0, ... |

In Table 8, the keyword value of <communication record> indicates that the contact appears 35 times as <sender>, <receiver>, or <cc recipient> in the records of the multimedia content database, namely in records e8, s2, c9, and so on in that order. Here e8, s2, and c9 are the indices of the relevant records in the multimedia content database; in this index notation the first letter denotes the category of the multimedia content and the number that follows is the serial number. The keyword value of <media record> indicates that the contact appears 28 times in other positions such as <title>, <body>, or <portrait>, namely in records p5, e20, c9, and so on. The keyword value of <association record> indicates, based on the records in the multimedia content database, the number of times the contact has been associated with other contacts: 10 times with the first contact n1, 3 times with the second contact n2, 20 times with the fourth contact n4, and 0 times with the fifth contact n5, where n1, n2, n4, and n5 are the indices of those contacts in the contact database.

Table 9. The record "Xiaoan" in the contact database

| Keyword | Keyword value |
|---|---|
| <name> | Xiaoan |
| <avatar> | As shown in (b) of FIG. 5 |
| <mobile phone> | 13400000000 |
| <office phone> | 82956321 |
| <home phone> | 63215487 |
| <ringtone> | Yyy.mp3 |
| <e-mail> | Xiaoan@xyz.com |
| <address> | Room y, Unit y, Building yy, yy Community, yy Road, yy District, Yy City |
| <communication record> | "quantity" = 2, "1" = s60, "2" = c2 |
| <media record> | "quantity" = 8, "1" = p5, "2" = e20, "3" = s9, ... |
| <association record> | "n1" = 0, "n2" = 1, "n4" = 20, "n5" = 0, ... |

The meanings of the keywords and keyword values are the same as in Table 8 and are not described again here.

Optionally, the database building unit 20 may also store the contact database in the storage unit 30.

FIG. 6 is a diagram showing the detailed structure of the query input unit 40 in the device for managing multimedia content in a mobile terminal shown in FIG. 1. As shown in FIG. 6, the query input unit 40 according to an exemplary embodiment of the present invention may include: a text recognition unit 401, which converts query information entered by the user through a keyboard or handwriting pad into text data; a text capture unit 402 and an OCR (optical character recognition) unit 403, which capture and recognize character data on a queried photo; a face recognition unit 404, which is similar to the face recognition unit 1015 in the data acquisition unit 10 and recognizes the facial features of the people in a query photo (specifically, it can identify each person in the photo by reference to the photos in the contact database or to a dedicated portrait database, and records the result of face recognition as text data); and a speech recognition unit 405, which converts a spoken query from the user into text data and works similarly to the voice call conversion unit 1013. These units are only examples and the present invention is not limited to them; the query input may be simplified, or further input methods may be added. Once the user's query has been recognized as text data, the search term extraction unit 406 extracts the key search terms from it. The search term extraction unit 406 works similarly to the text analysis unit 102 of the data acquisition unit 10, extracting the key content of the query text and other information as search terms. A search term may be, for example, a person name, a time, an important word, or other information.

The search unit 50 looks up the corresponding multimedia content records in the database stored in the storage unit 30, based on the key search terms extracted by the query input unit 40. Specifically, the search unit 50 can search the keyword values of the multimedia content records for items related to the search terms, divide the results in a particular way (for example by time, location, or media type), and output the processed results to the search result output unit 60. The search result output unit 60 may present the search results in a visual form.
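A minimal sketch of this lookup and grouping step follows. It assumes a much simplified record layout (a plain mapping from keyword to value or list of values) and exact matching of the search term; the record indices, the key names, and the functions `matches` and `search_grouped` are all hypothetical.

```python
from collections import defaultdict

# Simplified records loosely based on the examples; indices are made up.
RECORDS = [
    {"index": "s1", "media type": "short message", "sender": "Zhang San",
     "time": "2008-08-03 19:55", "body words": ["movie"]},
    {"index": "e1", "media type": "email", "receiver": "Xinxin",
     "time": "2008-08-04 19:20", "title words": ["movie"],
     "body names": ["Zhang San", "Xiaoan"]},
    {"index": "p1", "media type": "photo", "portrait": ["Xinxin", "Xiaoan"],
     "time": "2008-09-01 14:44"},
]


def matches(record, term):
    """True if the term equals any keyword value in the record."""
    for key, value in record.items():
        if key == "index":
            continue  # the index is an identifier, not a keyword value
        values = value if isinstance(value, list) else [value]
        if term in values:
            return True
    return False


def search_grouped(records, term, group_by="media type"):
    """Return the indices of matching records, grouped by the chosen keyword."""
    groups = defaultdict(list)
    for record in records:
        if matches(record, term):
            groups[record[group_by]].append(record["index"])
    return dict(groups)
```

Passing a different `group_by` keyword, such as a location field, would divide the same results along another dimension.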

An example of searching according to an exemplary embodiment of the present invention is described below with reference to FIG. 7, which shows examples of search results output by the search result output unit 60 in the device for managing multimedia content in a mobile terminal shown in FIG. 1.

Specifically, in (a) of FIG. 7 the search term extracted by the query input unit 40 is "Zhang San". The search unit 50 can search the multimedia content database for records whose <sender>, <receiver>, or <cc recipient> is "Zhang San", sort them by time, and output the sorted results to the search result output unit 60, which displays the screen shown in (a) of FIG. 7. Alternatively, if "Zhang San" is a contact of the mobile terminal and a contact database according to the present invention has been built, the relevant multimedia records can be found through the "communication record" item of Zhang San's record in the contact database. These approaches are only examples; those skilled in the art can use the multimedia content database (and the contact database) in other ways to search for various content and to filter and sort it differently. For example, under the same conditions, the relevant multimedia records can also be found through the "media record" item of Zhang San's record in the contact database, and the search results output.

As another example, in (b) of FIG. 7 the search term extracted by the query input unit 40 is "movie". The search unit 50 can search the multimedia content database for records whose <title> or <body> contains "movie" or its synonym "film", sort them by time, and output the sorted results to the search result output unit 60, which displays the screen shown in (b) of FIG. 7.
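The synonym-aware word search can be sketched as follows. The sketch assumes that each record stores the synonyms of its important words alongside the words themselves, in line with the related attribute description of <body><important word>; the record layout, the indices, the ascending sort order, and the function name `search_words` are assumptions.

```python
from datetime import datetime

# Important words from title and body, with stored synonyms; indices are made up.
RECORDS = [
    {"index": "e1", "time": datetime(2008, 8, 4, 19, 20),
     "words": ["movie"], "synonyms": ["film"]},
    {"index": "s2", "time": datetime(2008, 8, 3, 20, 0),
     "words": ["movie", "movie ticket"], "synonyms": ["film"]},
    {"index": "s1", "time": datetime(2008, 8, 3, 19, 55),
     "words": ["movie"], "synonyms": ["film"]},
    {"index": "e2", "time": datetime(2008, 9, 5, 9, 38),
     "words": ["express delivery", "goods"], "synonyms": []},
]


def search_words(records, term):
    """Indices of records whose words or stored synonyms include the term, oldest first."""
    hits = [r for r in records if term in r["words"] or term in r["synonyms"]]
    return [r["index"] for r in sorted(hits, key=lambda r: r["time"])]
```

Because the synonym is stored with the record, a query for "film" finds the same records as a query for "movie".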

As another example, in (c) of FIG. 7 the search term extracted by the query input unit 40 is "Zhang San", "Zhang San" is a contact of the mobile terminal, and a contact database according to the present invention has been built. In this case, the search unit 50 can obtain the frequency of contact between "Zhang San" and other contacts from the "association record" of the record "Zhang San" in the contact database. Preferably, such contact may be direct or indirect (that is, Zhang San is connected to another contact through one of his associated contacts); obtaining both direct and indirect contacts gives a more complete picture of the interpersonal relationships involving Zhang San. The search unit 50 outputs the contact frequencies to the search result output unit 60, which may present them visually, for example as the screen shown in (c) of FIG. 7. There, Zhang San is connected to each contact by a line whose thickness represents the contact frequency: thicker lines join contacts who are in touch more often, and thinner lines join contacts who are in touch less often. The contact frequencies among the contacts other than Zhang San can be shown as well, so that (c) of FIG. 7 displays a fairly complete diagram of interpersonal relationships centered on Zhang San.
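One possible way to derive the line thicknesses and the indirect contacts from the "association record" values is sketched below. The linear scaling, the width limits, and the function names are assumptions chosen for illustration; the description only requires that more frequent contact be drawn with a thicker line.

```python
def line_widths(association, min_width=1.0, max_width=6.0):
    """Scale contact frequencies linearly to line widths; pairs with zero contact get no line."""
    counts = {k: v for k, v in association.items() if v > 0}
    if not counts:
        return {}
    top = max(counts.values())
    return {k: min_width + (max_width - min_width) * v / top for k, v in counts.items()}


def indirect_contacts(records, person):
    """Contacts reachable only through one of the person's direct contacts."""
    direct = {k for k, v in records[person].items() if v > 0}
    indirect = set()
    for d in direct:
        for k, v in records.get(d, {}).items():
            if v > 0 and k != person and k not in direct:
                indirect.add(k)
    return indirect
```

Here `records` maps each contact index to its association counts, so the same data yields both the lines drawn from the central contact and the lines among the other contacts.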

In particular, if the query input unit 40 extracts multiple search terms, which may be of different kinds such as person names, times, locations, and important words, the user can be asked through the search result output unit 60 to select the appropriate search conditions in turn; that is, the logical relationship between the search terms can be set by the user. Alternatively, the logical relationship can be preset; for example, multiple search terms may be combined with logical AND by default. Once the logical relationship has been set, the search unit 50 searches for the corresponding multimedia content.
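Combining the per-term results under a chosen logical relationship can be sketched as set operations on record indices. The function name `combine` and the string operators are hypothetical; the default of AND follows the example given in the text.

```python
def combine(result_sets, operator="AND"):
    """Combine per-term sets of record indices; "AND" intersects, "OR" unions."""
    if not result_sets:
        return set()
    sets = [set(s) for s in result_sets]
    if operator == "AND":
        return set.intersection(*sets)
    if operator == "OR":
        return set.union(*sets)
    raise ValueError(f"unsupported operator: {operator!r}")
```

A user-selected relationship would simply be passed as the `operator` argument in place of the default.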

According to the present invention, the multimedia content of a mobile terminal can be stored in the multimedia content database in the form of text data, which reflects each item of content and the implicit relationships among items, so that the user can conveniently run a variety of searches against the database. The present invention also builds a contact database related to the multimedia content database, which reflects the state of contact among contacts in real time and helps users keep track of their interpersonal resources. The management device and method according to the present invention run on the terminal itself and need no additional server, so they are relatively easy to implement and low in cost.

The embodiments of the present invention described above are only examples, and the present invention is not limited to them. Those skilled in the art will understand that any search scheme supported by a database built from the information obtained by converting multimedia content into text data can be applied to the present invention. In other words, by converting multimedia content into text data the present invention preserves the relevant information and builds search relationships on it, so that the user can manage the multimedia content of the mobile terminal more conveniently and gain a full picture of the relationships among contacts. Those skilled in the art will recognize that changes may be made to these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the claims and their equivalents.

Claims (25)

1. A device for managing multimedia content in a mobile terminal, comprising:
a data acquisition unit for converting multimedia content into text data and extracting keyword values from the converted text data according to predetermined keywords that reflect key attributes of the multimedia content;
a database establishment unit for building a multimedia content database based on the keyword values obtained by the data acquisition unit;
a storage unit for storing the multimedia content database built by the database establishment unit;
a query input unit for recognizing query information input by a user as text data and extracting key search terms from the recognized text data;
a search unit for searching the multimedia content database stored in the storage unit for multimedia content related to the search terms extracted by the query input unit; and
a search result output unit for outputting the search results of the search unit to the user.

2. The device of claim 1, wherein the data acquisition unit comprises:
a text conversion unit for converting multimedia content into text data; and
a text analysis unit for extracting keyword values from the converted text data according to the predetermined keywords that reflect key attributes of the multimedia content.

3. The device of claim 2, wherein the text conversion unit comprises at least one of:
a short message conversion unit for converting short messages into text data;
an email conversion unit for converting emails into text data;
a voice call conversion unit for converting voice calls into text data;
a scene classification and clustering unit for classifying a photo according to predetermined categories, obtaining photos similar to that photo through clustering, and recording the scene classification and clustering results of the photo as text data;
a face recognition unit for recognizing the facial features of each person in a photo and the total number of people, and recording the face recognition result as text data; and
an electronic map conversion unit for converting position information from a positioning system into text data with reference to an electronic map.

4. The device of claim 3, wherein the text analysis unit comprises:
a word segmentation unit for dividing the text body output by the text conversion unit into a plurality of words;
a part-of-speech determination unit for determining the parts of speech of the words produced by the word segmentation unit;
a person name extraction unit for extracting person names from the text body according to the output of the part-of-speech determination unit;
a time extraction unit for extracting time words from the text body according to the output of the part-of-speech determination unit;
an important word extraction unit for extracting important words from the text body according to the output of the part-of-speech determination unit; and
an other keyword value extraction unit for extracting keyword values from information, other than the text body, output by the text conversion unit.

5. The device of claim 4, further comprising a time information parsing unit for converting the time words extracted by the time extraction unit into time information.

6. The device of claim 4, further comprising a synonym generation unit for generating synonyms for the important words extracted by the important word extraction unit.

7. The device of claim 1, wherein the database establishment unit builds the multimedia content database based on keywords, keyword values, and related attribute descriptions.

8. The device of claim 1, wherein the database establishment unit further builds a contact database, and the contact records of the contact database are updated along with the multimedia content database.

9. The device of claim 1, wherein the query input unit comprises:
a text recognition unit for converting query information input by the user through a keyboard or handwriting tablet into text data;
a text capture unit and an optical character recognition (OCR) unit for capturing and recognizing character data on a queried photo;
a face recognition unit for recognizing the facial features of each person in a query photo and recording the face recognition result as text data; and
a speech recognition unit for converting a voice query input by the user into text data.

10. The device of claim 1, wherein the search result output unit outputs the search results of the search unit to the user in a visualized manner.

11. The device of claim 8, wherein the search unit, based on the search terms extracted by the query input unit and with reference to the contact database, searches the multimedia content database stored in the storage unit for multimedia content related to the search terms.

12. The device of claim 8, wherein the search unit searches for the relationships among contacts in the multimedia content based on the multimedia content database and the contact database.

13. The device of claim 1, wherein, if the query input unit extracts a plurality of search terms, the search unit searches on the plurality of search terms according to a predetermined logical relationship or a logical relationship set by the user.

14. A method for managing multimedia content in a mobile terminal, comprising:
converting multimedia content into text data and extracting keyword values from the converted text data according to predetermined keywords that reflect key attributes of the multimedia content;
building a multimedia content database based on the keyword values;
storing the multimedia content database;
recognizing query information input by a user as text data and extracting key search terms from the recognized text data;
searching the stored multimedia content database for multimedia content related to the extracted search terms; and
outputting the search results to the user.

15. The method of claim 14, wherein the step of converting multimedia content into text data comprises at least one of the following steps:
converting short messages into text data;
converting emails into text data;
converting voice calls into text data;
classifying a photo according to predetermined categories, obtaining photos similar to that photo through clustering, and recording the classification and clustering results of the photo as text data;
recognizing the facial features of each person in a photo and the total number of people, and recording the recognition result as text data; and
converting position information from a positioning system into text data with reference to an electronic map.

16. The method of claim 14, wherein the step of extracting keyword values from the converted text data according to predetermined keywords that reflect key attributes of the multimedia content comprises:
dividing the text body into a plurality of words;
determining the parts of speech of the plurality of words;
extracting person names from the text body according to the parts of speech of the plurality of words;
extracting time words from the text body according to the parts of speech of the plurality of words;
extracting important words from the text body according to the parts of speech of the plurality of words; and
extracting keyword values from information other than the text body.

17. The method of claim 16, further comprising converting the extracted time words into time information.

18. The method of claim 16, further comprising generating synonyms for the extracted important words.

19. The method of claim 14, wherein the multimedia content records in the multimedia content database include keywords, keyword values, and related attribute descriptions.

20. The method of claim 14, further comprising building a contact database, wherein the contact records of the contact database are updated along with the multimedia content database.

21. The method of claim 14, wherein the step of recognizing query information input by the user as text data comprises at least one of the following steps:
converting query information input by the user through a keyboard or handwriting tablet into text data;
capturing and recognizing character data on a queried photo;
recognizing the facial features of each person in a query photo and recording the recognition result as text data; and
converting a voice query input by the user into text data.

22. The method of claim 14, wherein, in the step of outputting the search results to the user, the search results are output to the user in a visualized manner.

23. The method of claim 20, further comprising searching, based on the extracted search terms and with reference to the contact database, the stored multimedia content database for multimedia content related to the search terms.

24. The method of claim 20, further comprising searching for the relationships among contacts in the multimedia content based on the multimedia content database and the contact database.

25. The method of claim 14, further comprising, if a plurality of search terms are extracted, searching on the plurality of search terms according to a predetermined logical relationship or a logical relationship set by the user.
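The method of claims 14 and 25 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the function names `extract_keyword_values`, `build_database`, and `search` are hypothetical, the sample items are invented, and the regular-expression tokenizer is only a stand-in for the word segmentation and part-of-speech analysis recited in claim 16 (Chinese text in particular would need a real word segmenter).

```python
import re
from collections import defaultdict


def extract_keyword_values(text):
    """Toy stand-in for the text analysis step: lowercase word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def build_database(items):
    """Map each keyword value to the ids of the items that contain it."""
    index = defaultdict(set)
    for item_id, text in items.items():
        for value in extract_keyword_values(text):
            index[value].add(item_id)
    return index


def search(index, terms, logic="and"):
    """Combine the per-term matches with a logical AND or OR."""
    hits = [index.get(term.lower(), set()) for term in terms]
    if not hits:
        return set()
    if logic == "and":
        return set.intersection(*hits)
    return set.union(*hits)


# Hypothetical items, assumed to be already converted to text data.
items = {
    "sms-1": "Dinner with Alice on Friday",
    "mail-7": "Alice sent the Friday report",
    "call-3": "Bob asked about dinner",
}
index = build_database(items)
both = search(index, ["Alice", "Friday"], logic="and")
either = search(index, ["Alice", "dinner"], logic="or")
```

Here `logic` plays the role of the predetermined or user-set logical relationship: `"and"` keeps only items that match every search term, while `"or"` keeps items that match any of them.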

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910128310A CN101853253A (en) 2009-03-30 2009-03-30 Device and method for managing multimedia content in mobile terminal

Publications (1)

Publication Number Publication Date
CN101853253A true CN101853253A (en) 2010-10-06

Family

ID=42804751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910128310A Pending CN101853253A (en) 2009-03-30 2009-03-30 Device and method for managing multimedia content in mobile terminal

Country Status (1)

Country Link
CN (1) CN101853253A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1350685A (en) * 1999-03-09 2002-05-22 皇家菲利浦电子有限公司 Method with a plurality of speech recognizers
US20060005123A1 (en) * 2004-06-30 2006-01-05 Fujitsu Limited Information retrieval terminal
US20070005705A1 (en) * 2005-06-17 2007-01-04 Hung-Chih Yu System and method of dynamically displaying an associated message in a message
CN101292282A (en) * 2005-08-29 2008-10-22 沃伊斯博克斯科技公司 Mobile system and method supporting natural language man-machine interaction

Cited By (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102637183A (en) * 2011-02-12 2012-08-15 北京千橡网景科技发展有限公司 Method and device for recommending friends to user in social network
WO2012139471A1 (en) * 2011-04-13 2012-10-18 青岛海信移动通信技术股份有限公司 Method and device for parsing mms information
US10050914B2 (en) 2011-04-13 2018-08-14 Hisense Mobile Communications Technology Co., Ltd. Method and device for parsing MMS information
CN102164353B (en) * 2011-04-13 2013-08-28 青岛海信移动通信技术股份有限公司 Multimedia message service (MMS) information resolution method and equipment
CN102164353A (en) * 2011-04-13 2011-08-24 青岛海信移动通信技术股份有限公司 Multimedia message service (MMS) information resolution method and equipment
CN102193994A (en) * 2011-04-22 2011-09-21 武汉大学 Method for searching Web services according to non-functional requirements of user
CN102193994B (en) * 2011-04-22 2013-07-24 武汉大学 Method for searching Web services according to non-functional requirements of user
CN102866987A (en) * 2011-07-06 2013-01-09 三星电子株式会社 Apparatus and method for transmitting message in mobile terminal
CN103139364B (en) * 2011-12-02 2014-10-22 联想移动通信科技有限公司 Method for automatically updating address book and device, mobile communication terminal
CN103139364A (en) * 2011-12-02 2013-06-05 联想移动通信科技有限公司 Method for automatically updating address book and device, mobile communication terminal
CN103297582A (en) * 2012-02-24 2013-09-11 联想(北京)有限公司 Method for processing voice communication content and electronic devices
CN104620258A (en) * 2012-09-25 2015-05-13 株式会社东芝 Document classification assisting apparatus, method and program
CN103020168B (en) * 2012-11-27 2016-05-25 国网辽宁省电力有限公司电力科学研究院 Electrical measurement supervision report automatic generatioin system
CN103020168A (en) * 2012-11-27 2013-04-03 辽宁省电力有限公司电力科学研究院 System and method for automatically generating reports during power monitoring and supervision
CN103869948A (en) * 2012-12-14 2014-06-18 联想(北京)有限公司 Voice command processing method and electronic device
CN103280217B (en) * 2013-05-02 2016-05-04 锤子科技(北京)有限公司 A kind of audio recognition method of mobile terminal and device thereof
CN103280217A (en) * 2013-05-02 2013-09-04 锤子科技(北京)有限公司 Voice identification method and device of mobile terminal
US9502035B2 (en) 2013-05-02 2016-11-22 Smartisan Digital Co., Ltd. Voice recognition method for mobile terminal and device thereof
CN104375997A (en) * 2013-08-13 2015-02-25 腾讯科技(深圳)有限公司 Method and device for adding note information to instant messaging audio information
CN105706085B (en) * 2013-09-27 2019-03-22 诺基亚技术有限公司 The visual representation of role identification and station location marker
CN105706085A (en) * 2013-09-27 2016-06-22 诺基亚技术有限公司 Visual representation of a character identity and a location identity
CN104951426A (en) * 2014-03-26 2015-09-30 联想移动通信软件(武汉)有限公司 Taken photo classified processing method and device and terminal equipment
CN104239449A (en) * 2014-09-01 2014-12-24 百度在线网络技术(北京)有限公司 Method and device for representing information
CN104239449B (en) * 2014-09-01 2018-11-20 百度在线网络技术(北京)有限公司 Information display method and device
CN104408162B (en) * 2014-12-05 2017-10-31 国家电网公司 A kind of multimedia system and processing method for being used to form text index
CN104408162A (en) * 2014-12-05 2015-03-11 国家电网公司 Multimedia system for forming text indexing and multimedia processing method
CN106257439A (en) * 2015-06-19 2016-12-28 Tcl集团股份有限公司 Multimedia file storage method and apparatus in multimedia player
CN106257439B (en) * 2015-06-19 2020-01-14 Tcl集团股份有限公司 Multimedia file storage method and device in multimedia player
WO2017092280A1 (en) * 2015-11-30 2017-06-08 乐视控股(北京)有限公司 Multimedia photo generation method, apparatus and device, and mobile phone
CN105893419A (en) * 2015-11-30 2016-08-24 乐视致新电子科技(天津)有限公司 Generation device, device and equipment of multimedia photo, and mobile phone
US9898847B2 (en) 2015-11-30 2018-02-20 Shanghai Sunson Activated Carbon Technology Co., Ltd. Multimedia picture generating method, device and electronic device
CN105528450A (en) * 2015-12-23 2016-04-27 北京奇虎科技有限公司 Method and device for naming photo album
CN106506847A (en) * 2016-11-23 2017-03-15 北京小米移动软件有限公司 New message prompt method and device
CN106970913A (en) * 2017-05-12 2017-07-21 湖南中周至尚信息技术有限公司 The extracting method and device of a kind of time
CN107526781A (en) * 2017-07-25 2017-12-29 无锡天脉聚源传媒科技有限公司 A kind of information search method and device
CN108595600A (en) * 2018-04-18 2018-09-28 努比亚技术有限公司 Photo classification method, mobile terminal and readable storage medium
CN108595600B (en) * 2018-04-18 2023-12-15 努比亚技术有限公司 Photo classification method, mobile terminal and readable storage medium
CN109348071A (en) * 2018-12-25 2019-02-15 努比亚技术有限公司 Call-information display methods, terminal and computer readable storage medium
CN111460274A (en) * 2019-01-18 2020-07-28 北京字节跳动网络技术有限公司 Information processing method and device
CN111460274B (en) * 2019-01-18 2023-04-28 北京字节跳动网络技术有限公司 Information processing method and device
CN109977239A (en) * 2019-03-31 2019-07-05 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN109977239B (en) * 2019-03-31 2023-08-18 联想(北京)有限公司 Information processing method and electronic equipment

Similar Documents

Publication Publication Date Title
CN101853253A (en) Device and method for managing multimedia content in mobile terminal
CN101547249B (en) Mobile termination and information classification management method thereof
CN100492356C (en) Method, system and device for managing media items
US9031953B2 (en) Method and system to curate media collections
CN112069326B (en) Knowledge graph construction method, device, electronic device and storage medium
CN102741835B (en) Method, device or system for image processing
US8831951B2 (en) Verbal labels for electronic messages
US20040044536A1 (en) Providing common contact discovery and management to electronic mail users
CN101330657B (en) An address book system and its implementation method
CN102214188A (en) Communication terminal and content storing and managing method thereof
WO2012016457A1 (en) Method and system for selecting data source
CN102752433A (en) Method for generating terminal and call recording file names
US20160004770A1 (en) Generation and use of an email frequent word list
CN106936971A (en) A kind of incoming person's information presentation system and reminding method
CN113656650A (en) Data fusion method and device, electronic device and storage medium
CN101826077B (en) Method and device for obtaining relation contact person record
WO2019152197A1 (en) Automatic image classification in electronic communications
CN101420489A (en) Communication device and data enquiry method
JP4920471B2 (en) Mail data classification device, mail data classification program, and mail data classification method
CN106209605B (en) Method and equipment for processing attachment in network information
US10733981B2 (en) Digital messaging system
CN117407565A (en) Resource processing methods, devices, electronic equipment and storage media
CN112307085B (en) Data processing method, device, electronic device and storage medium
KR100556597B1 (en) Field attribute determination system and method of raw data, message transmission system and method for transmitting message by merging analysis data which has determined field attribute of raw data
CN119003450A (en) Data processing method and related device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20101006