[go: up one dir, main page]

CN101404026A - Crawler system construction method for video-previewing search engine - Google Patents

Crawler system construction method for video-previewing search engine Download PDF

Info

Publication number
CN101404026A
CN101404026A CNA2008101808250A CN200810180825A CN101404026A CN 101404026 A CN101404026 A CN 101404026A CN A2008101808250 A CNA2008101808250 A CN A2008101808250A CN 200810180825 A CN200810180825 A CN 200810180825A CN 101404026 A CN101404026 A CN 101404026A
Authority
CN
China
Prior art keywords
video
search engine
hyperlink
crawler system
video search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008101808250A
Other languages
Chinese (zh)
Inventor
杨溥
郭军
陈�光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications filed Critical Beijing University of Posts and Telecommunications
Priority to CNA2008101808250A priority Critical patent/CN101404026A/en
Publication of CN101404026A publication Critical patent/CN101404026A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种可预览视频搜索引擎的爬虫系统的构建方法,该方法包括下列步骤:(1)超链接映射成列表;(2)检测列表状态;(3)摘要图片处理;(4)视频处理;(5)视频标题处理。通过应用本发明所描述的方法,可以为可预览视频搜索引擎的爬虫系统提供通用的设计方法;可以为可预览视频搜索引擎提供预览型数据集,简化可预览视频搜索引擎的其他部分的设计和开发,大幅度地降低可预览视频搜索引擎爬虫系统和可预览视频搜索引擎的开发成本。

Figure 200810180825

The invention discloses a method for constructing a crawler system capable of previewing a video search engine. The method comprises the following steps: (1) hyperlinks are mapped into a list; (2) list status is detected; (3) summary image processing; (4) Video processing; (5) Video title processing. By applying the method described in the present invention, a general design method can be provided for the crawler system of a video search engine that can be previewed; preview data sets can be provided for a video search engine that can be previewed, simplifying the design and implementation of other parts of the video search engine that can be previewed Development, greatly reducing the development cost of the previewable video search engine crawler system and the previewable video search engine.

Figure 200810180825

Description

可预览视频搜索引擎的爬虫系统的构建方法 Method for constructing a crawler system capable of previewing video search engines

技术领域 Technical field

本发明涉及网络数据采集系统的构建方法,尤其涉及一种可预览视频搜索引擎的爬虫系统的构建方法。The invention relates to a method for constructing a network data acquisition system, and in particular to a method for constructing a crawler system capable of previewing a video search engine.

背景技术 Background technology

随着信息时代的到来和影像视频技术的发展,影像视频由于有着无可比拟的优势和强烈的视觉冲击力而吸引着越来越多的人们欣赏。但是由于视频的数据量巨大和普遍网络带宽的限制,人们很难方便地在本机观看视频。正是由于这个主要原因,广域网上纷纷建立起许多视频网站,实行视频数据的在线播放来使得人们方便快捷的实时欣赏视频。但是随着视频网站视频数据量的激增,人们很难简单快捷地在广域网上找到所希望的视频,因此视频的搜索引擎就孕育而生。虽然视频搜索引擎能够带来极大地便利,但是视频不像文本信息那样易于识别,而且在线视频为了播放的流畅性也需要下载缓冲视频数据,加之视频数据量大,占用较多的带宽,且用户带宽和流量都是有限的,因此,用户希望在打开视频网页之前可以进行预判断,是否此视频是所要找的,是否值得去观看。若不是所需的,就不必去浪费时间和带宽去观看视频。因此视频搜索引擎的可预览性受到热切关注。With the advent of the information age and the development of video technology, video has attracted more and more people to appreciate it because of its incomparable advantages and strong visual impact. However, due to the huge amount of video data and the limitation of general network bandwidth, it is difficult for people to watch videos conveniently on their local computers. It is precisely because of this main reason that many video websites have been established on the wide area network to implement online playback of video data so that people can enjoy videos conveniently and quickly in real time. However, with the surge in the amount of video data on video websites, it is difficult for people to find the desired video on the wide area network simply and quickly, so video search engines are born. Although video search engines can bring great convenience, videos are not as easy to identify as text information, and online videos also need to download and buffer video data for smooth playback. In addition, the amount of video data is large, which occupies more bandwidth, and the user's bandwidth and traffic are limited. Therefore, users hope to make a pre-judgment before opening the video webpage whether the video is what they are looking for and whether it is worth watching. If it is not what you need, there is no need to waste time and bandwidth to watch the video. Therefore, the previewability of video search engines has received keen attention.

由于视频网站都包含视频的摘要图片和视频名称,通过摘要图片和视频名称就能够集中的反映视频的视觉主要内容,用户可以通过摘要图片和名称对视频进行预览和判断。因此视频的预览性数据的采集在构建可预览性视频搜索引擎的过程中是重中之重。目前,还没有一种系统的行之有效的视频预览性数据的采集系统构建方法。本发明通过引入超链接映射列表技术和基于该映射列表的查找技术来有效地采集视频的预览性数据。Since video websites all contain summary pictures and video names of videos, the main visual content of the video can be reflected in a concentrated manner through the summary pictures and video names, and users can preview and judge the video through the summary pictures and names. Therefore, the collection of preview data of the video is a top priority in the process of building a previewable video search engine. At present, there is no systematic and effective method for constructing a collection system for video preview data. The present invention effectively collects the preview data of the video by introducing a hyperlink mapping list technology and a search technology based on the mapping list.

发明内容 Contents of the invention

针对现有技术存在的问题,本发明的目的是提供一种可预览视频搜索引擎的爬虫系统的构建方法。In view of the problems existing in the prior art, the purpose of the present invention is to provide a method for constructing a crawler system that can preview a video search engine.

为达到上述目的,本发明的方法包括下列步骤:To achieve the above object, the method of the present invention comprises the following steps:

(1)超链接映射成列表;(1) Hyperlinks are mapped into lists;

(2)检测列表状态;(2) Check list status;

(3)摘要图片处理;(3) Abstract image processing;

(4)视频处理;(4) Video processing;

(5)视频标题处理。(5) Video title processing.

上述方法中,步骤(3)进一步包括:In the above method, step (3) further comprises:

(31)在超链接映射列表中,查找摘要图片;(31) In the hyperlink mapping list, search for the summary image;

(32)下载存储摘要图片。(32) Download and store summary images.

上述方法中,步骤(4)进一步包括:In the above method, step (4) further comprises:

(41)在超链接映射列表中,查找视频;(41) Search for the video in the hyperlink mapping list;

(42)下载存储视频;(42) Download and store videos;

上述方法中,步骤(5)进一步包括:In the above method, step (5) further comprises:

(51)下载视频播放页面;(51) Download video playback page;

(52)提取存储视频标题。(52) Extract the stored video title.

本发明的有益效果在于,通过应用本发明所描述的方法,可以为可预览视频搜索引擎的爬虫系统提供通用的设计方法;可以为可预览视频搜索引擎提供预览型数据集,简化可预览视频搜索引擎的其他部分的设计和开发,大幅度地降低可预览视频搜索引擎爬虫系统和可预览视频搜索引擎的开发成本。The beneficial effect of the present invention is that, by applying the method described in the present invention, a general design method can be provided for the crawler system of the previewable video search engine; a preview data set can be provided for the previewable video search engine, thereby simplifying the design and development of other parts of the previewable video search engine, and greatly reducing the development cost of the previewable video search engine crawler system and the previewable video search engine.

结合附图,本发明的其他特点和优点可以从下面通过举例来对本发明的原理进行解释的优选实施方式的说明中变得更清楚。Other features and advantages of the present invention will become more apparent from the following description of preferred embodiments, which illustrate the principles of the present invention by way of example, in conjunction with the accompanying drawings.

附图说明 Description of the attached figure

图1是根据本发明的一个实施方式的方法的流程图。FIG. 1 is a flow chart of a method according to one embodiment of the present invention.

具体实施方式 Specific implementation method

下面将结合附图对本发明的具体实施方式进行详细描述。The specific implementation modes of the present invention will be described in detail below with reference to the accompanying drawings.

图1是根据本发明的一个实施方式的方法的流程图。该流程开始于步骤101,需要指出的是以下所提及的视频网站仅仅是举例,具体的视频网站不构成对本发明的限制。然后在步骤102中,分析视频网页的所有超链接,并且将所有超链接按照在网页源代码中从上到下从左到右的顺序逐一提取出来,最终将其映射成为一个列表。需要说明的是起始网页应当是包含视频超链接丰富的web网页,如视频的播放页面等,这仅仅是最优举例,起始视频网页的不同不构成对本发明的限制。FIG1 is a flow chart of a method according to an embodiment of the present invention. The process starts at step 101. It should be noted that the video websites mentioned below are only examples, and the specific video websites do not constitute a limitation to the present invention. Then in step 102, all hyperlinks of the video web page are analyzed, and all hyperlinks are extracted one by one in the order from top to bottom and from left to right in the web page source code, and finally mapped into a list. It should be noted that the starting web page should be a web page rich in video hyperlinks, such as a video playback page, etc. This is only an optimal example, and the difference in the starting video web page does not constitute a limitation to the present invention.

超链接映射成列表,一种实施方式是从视频网页的构建结构进行分析抽取成表。下面通过举例来进一步说明。The hyperlinks are mapped into a list. One implementation method is to analyze and extract the list from the structure of the video web page. This is further explained below by taking an example.

<a    href=″http://www.tudou.com/programs/view/c74iyYGuDIc/″title=″多米诺骨牌新记录″target=″new″class=″inner″><imgsrc=″http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg″alt=″多米诺骨牌新记录″width=″120″height=″90″class=″pack_clipImg″</a><a    href=″http://www.tudou.com/programs/view/c74iyYGuDIc/″title=″New Record of Dominoes″target=″new″class=″inner″><imgsrc=″http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg″alt=″New Record of Dominoes″width=″120″height=″90″class=″pack_clipImg″</a>

以上为一个视频网页的一段包含视频超链接的源代码。其中包含两个超链接,分别为:The above is a source code of a video webpage that contains video hyperlinks. It contains two hyperlinks, namely:

http://www.tudou.com/programs/view/c74iyYGuDIc/http://www.tudou.com/programs/view/c74iyYGuDIc/

http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpghttp://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg

第一个为指向视频播放页面的超链接地址,第二个为该视频所对应的摘要图片超链接地址。视频网站构建结构的特点是指向视频播放页面的超链接与视频所对应的摘要图片超链接是紧挨着的,而且都以html标记语言来标记,由如上代码片段可以看出,两个超链接之间没有任何其他超链接,并且指向视频播放页面的超链接以href=标记,视频所对应的摘要图片超链接以img src=标记。因此,对于一个包含视频的网页,可以通过正则表达式匹配href=和img src=标记来查找网页中所有的超链接,比如上例将The first one is the hyperlink address pointing to the video playback page, and the second one is the hyperlink address of the summary image corresponding to the video. The characteristic of the video website construction structure is that the hyperlink pointing to the video playback page and the hyperlink to the summary image corresponding to the video are adjacent, and both are marked with html markup language. As can be seen from the above code snippet, there is no other hyperlink between the two hyperlinks, and the hyperlink pointing to the video playback page is marked with href=, and the hyperlink to the summary image corresponding to the video is marked with img src=. Therefore, for a web page containing a video, you can use regular expressions to match the href= and img src= tags to find all hyperlinks in the web page. For example, in the above example,

href=″http://www.tudou.com/programs/view/c74iyYGuDIc/″和imgsrc=″http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg″查找出来,然后将所有的超链接按照查找的顺序列出,即生成超链接映射列表,最后将当前网页的超链接映射列表放入原来的超链接映射列表末尾。一个映射表的存储实施方式是通过文本形式,直接将当前映射列表写入原来的超链接映射列表末尾。需要指出的是文本形式仅仅是举例,还有关系型数据库等存储形式,具体的存储形式不构成对本发明的限制。以上是超链接映射成列表的实施例,其他不同的实施例子不构成对本发明的限制。href="http://www.tudou.com/programs/view/c74iyYGuDIc/" and imgsrc="http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg" are used to search for them, and then all the hyperlinks are listed in the order of search, that is, a hyperlink mapping list is generated, and finally the hyperlink mapping list of the current web page is put at the end of the original hyperlink mapping list. One storage implementation method of a mapping table is to write the current mapping list directly to the end of the original hyperlink mapping list in text form. It should be pointed out that the text form is only an example, and there are other storage forms such as relational databases. The specific storage form does not constitute a limitation to the present invention. The above is an embodiment of mapping hyperlinks into a list, and other different implementation examples do not constitute a limitation to the present invention.

步骤102之后,流程进入步骤103。After step 102 , the process proceeds to step 103 .

在步骤103,分析检测超链接映射列表状态。一个检测超链接映射列表状态的具体实施方式是通过在处理过的表内超链接标记,从标记处累加一位,看是否是空的。若是空的,则说明映射列表全部处理完了;若不是空的,则说明映射列表没有处理完。以上是分析检测超链接映射列表状态的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 103, the state of the hyperlink mapping list is analyzed and detected. A specific implementation method for detecting the state of the hyperlink mapping list is to add one bit from the mark of the hyperlink in the processed table to see if it is empty. If it is empty, it means that the mapping list has been processed completely; if it is not empty, it means that the mapping list has not been processed completely. The above is an implementation method for analyzing and detecting the state of the hyperlink mapping list, and other different implementation methods do not constitute a limitation to the present invention.

若没处理完,则流程进入步骤104;若全部处理完,则流程进入步骤110。If the processing is not completed, the process proceeds to step 104; if the processing is completed, the process proceeds to step 110.

在步骤104,对步骤102中生成的超链接映射列表进行摘要图片超链接查找。一个查找摘要图片的具体实施方式是通过字符串匹配,如步骤102中的代码片段的例子,在超链接映射列表中匹配字符串img src=,其后面的内容:In step 104, the hyperlink map list generated in step 102 is searched for a summary image hyperlink. A specific implementation method for searching for summary images is through string matching, such as the example of the code snippet in step 102, matching the string img src = in the hyperlink map list, and the following content:

http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg就是摘要图片的超链接。以上是查找摘要图片的一个实施方式,其他不同的实施方式不构成对本发明的限制。http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg is the hyperlink of the abstract image. The above is an implementation method for searching for abstract images, and other different implementation methods do not constitute a limitation to the present invention.

步骤104之后,流程进入步骤105。After step 104 , the process proceeds to step 105 .

在步骤105,下载存储在步骤104中被查找到的摘要图片。一个实施例是运用关联性数据库系统存储下载的摘要图片,这样便于数据的管理。以上是下载存储摘要图片的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 105, the summary picture found in step 104 is downloaded and stored. One embodiment is to use a relational database system to store the downloaded summary pictures, which is convenient for data management. The above is an implementation method for downloading and storing summary pictures, and other different implementation methods do not constitute a limitation of the present invention.

步骤105之后,流程进入步骤106。After step 105 , the process proceeds to step 106 .

在步骤106,对步骤105下载存储的摘要图片所对应的视频的超链接在步骤102生成的超链接映射列表中进行查找。一个查找对应视频的具体实施方式是首先通过字符串匹配定位摘要图片的超链接,然后在此摘要图片超链接的位置向前匹配一个超链接即可。以上具体实施方式基于这样的原理:在视频网站的构建中,视频的超链接和所对应的摘要图片的超链接在一起前后相连,中间无任何其他超链接,且视频超链接在图片超链接的前面。如步骤102中的代码片段的例子,在超链接映射列表中,两个超链接是紧紧挨着的。由步骤104可知道摘要图片的超链接,在超链接映射列表匹配定位:img src=″http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg″,接着向前匹配标记href=,就得到与其对应的视频的超链接:http://www.tudou.com/programs/view/c74iyYGuDIc/。以上是查找对应视频的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 106, the hyperlink of the video corresponding to the summary picture downloaded and stored in step 105 is searched in the hyperlink mapping list generated in step 102. A specific implementation method of searching for the corresponding video is to first locate the hyperlink of the summary picture by string matching, and then match a hyperlink forward at the position of the hyperlink of the summary picture. The above specific implementation method is based on such a principle: in the construction of the video website, the hyperlink of the video and the hyperlink of the corresponding summary picture are connected together front and back, without any other hyperlink in the middle, and the video hyperlink is in front of the picture hyperlink. As the example of the code fragment in step 102, in the hyperlink mapping list, the two hyperlinks are closely adjacent. From step 104, the hyperlink of the summary image can be known, and the hyperlink mapping list is matched and located: img src = "http://i01.img.tudou.com/data/imgs/i/023/746/281/m10.jpg", and then the tag href = is matched forward to obtain the hyperlink of the corresponding video: http://www.tudou.com/programs/view/c74iyYGuDIc/. The above is an implementation method of searching for the corresponding video, and other different implementation methods do not constitute a limitation to the present invention.

步骤106之后,流程进入步骤107。After step 106 , the process proceeds to step 107 .

在步骤107,下载存储在步骤106中被查找到的视频。一个实施例是首先通过转址技术,得到真实的视频地址,然后运用关联性数据库系统存储下载的视频,可以将其插入到在步骤105中存储的摘要图片数据之后,这样便可以得到两者的关联数据集。以上是下载存储视频的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 107, the video found in step 106 is downloaded. In one embodiment, the real video address is first obtained by redirection technology, and then the downloaded video is stored using a relational database system, which can be inserted after the summary image data stored in step 105, so that the associated data set of the two can be obtained. The above is an implementation method for downloading and storing videos, and other different implementation methods do not constitute a limitation of the present invention.

步骤107之后,流程进入步骤108。After step 107 , the process proceeds to step 108 .

在步骤108,下载视频播放页面,即对步骤106中被查找的视频超链接进行下载处理。一个下载视频播放页面的具体实施方式是通过向超链接所对应的主机发送数据请求。如步骤102中的例子,向www.tudou.com主机发送programs/view/c74iyYGuDIc数据请求而下载数据。以上是下载视频播放页面的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 108, the video playing page is downloaded, that is, the video hyperlink found in step 106 is downloaded. A specific implementation method of downloading the video playing page is to send a data request to the host corresponding to the hyperlink. For example, in step 102, a data request for programs/view/c74iyYGuDIc is sent to the www.tudou.com host to download data. The above is an implementation method of downloading the video playing page, and other different implementation methods do not constitute a limitation of the present invention.

步骤108之后,流程进入步骤109。After step 108 , the process proceeds to step 109 .

在步骤109,提取存储该视频的标题,即对步骤108中被下载视频播放页面进行查找标题标记<title>。一个提取存储该视频的标题的具体实施方式是通过字符串查找在该播放页面中匹配<title>。如步骤102中的例子,在如下视频播放页面中:http://www.tudou.com/programs/view/c74iyYGuDIc/查找<title>,可得到<title>荷兰多米诺骨牌新记录</title>,中间的部分就是该视频的标题,提取中间部分,然后运用关联性数据库系统存储视频标题,可以将其插入到在步骤105中存储的摘要图片数据之前,这样便可以得到三者的关联数据集。以上是提取存储该视频的标题的一个实施方式,其他不同的实施方式不构成对本发明的限制。In step 109, the title of the video is extracted and stored, that is, the title tag <title> is searched for the video playback page downloaded in step 108. A specific implementation method for extracting and storing the title of the video is to match <title> in the playback page through a string search. As in the example in step 102, in the following video playback page: http://www.tudou.com/programs/view/c74iyYGuDIc/, <title>New Record of Dutch Dominoes</title> can be obtained, and the middle part is the title of the video. The middle part is extracted, and then the video title is stored using a relational database system, which can be inserted before the summary image data stored in step 105, so that the three related data sets can be obtained. The above is an implementation method for extracting and storing the title of the video, and other different implementation methods do not constitute a limitation of the present invention.

步骤109之后,将步骤108中下载的视频播放页面进行步骤102处理。After step 109, the video playback page downloaded in step 108 is processed in step 102.

在步骤110,系统结束。At step 110, the system ends.

以上结合附图描述了本发明的具体实施方式,各种举例说明不对发明的实质内容构成限制,本发明不限于上面提供的实施细节,可以在不脱离本发明特征的情况下以另外的实施例实现。所属技术领域的普通技术人员在阅读了说明书后可以对以前所述的具体实施方式做修改或变形,而不背离发明的实质和范围。The above describes the specific implementation of the present invention in conjunction with the accompanying drawings. Various examples do not limit the essential content of the invention. The present invention is not limited to the implementation details provided above, and can be implemented in other embodiments without departing from the characteristics of the present invention. After reading the specification, ordinary technicians in the relevant technical field can make modifications or deformations to the specific implementations described above without departing from the essence and scope of the invention.

Claims (4)

1.一种可预览视频搜索引擎的爬虫系统的构建方法,其特征在于包括下列步骤:1. a kind of construction method that can preview the crawler system of video search engine, it is characterized in that comprising the following steps: (1)超链接映射成列表;(1) Hyperlinks are mapped into lists; (2)检测列表状态;(2) Check the status of the list; (3)摘要图片处理;(3) Abstract image processing; (4)视频处理;(4) Video processing; (5)视频标题处理。(5) Video title processing. 2.根据权利要求1所述的可预览视频搜索引擎的爬虫系统的构建方法,其特征在于:步骤(3)进一步包括:2. the construction method of the crawler system that can preview video search engine according to claim 1, is characterized in that: step (3) further comprises: (31)在超链接映射列表中,查找摘要图片;(31) In the hyperlink mapping list, search for the summary picture; (32)下载存储摘要图片。(32) Download and store summary pictures. 3.根据权利要求1所述的可预览视频搜索引擎的爬虫系统的构建方法,其特征在于:步骤(4)进一步包括:3. the construction method of the crawler system that can preview video search engine according to claim 1, is characterized in that: step (4) further comprises: (41)在超链接映射列表中,查找视频;(41) in the hyperlink mapping list, find the video; (42)下载存储视频。(42) Download and store video. 4.根据权利要求1所述的可预览视频搜索引擎的爬虫系统的构建方法,其特征在于:步骤(5)进一步包括:4. the construction method of the crawler system that can preview video search engine according to claim 1, is characterized in that: step (5) further comprises: (51)下载视频播放页面;(51) Download the video playback page; (52)提取存储视频标题。(52) extract and store the video title.
CNA2008101808250A 2008-11-25 2008-11-25 Crawler system construction method for video-previewing search engine Pending CN101404026A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008101808250A CN101404026A (en) 2008-11-25 2008-11-25 Crawler system construction method for video-previewing search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008101808250A CN101404026A (en) 2008-11-25 2008-11-25 Crawler system construction method for video-previewing search engine

Publications (1)

Publication Number Publication Date
CN101404026A true CN101404026A (en) 2009-04-08

Family

ID=40538038

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008101808250A Pending CN101404026A (en) 2008-11-25 2008-11-25 Crawler system construction method for video-previewing search engine

Country Status (1)

Country Link
CN (1) CN101404026A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102325225A (en) * 2011-09-20 2012-01-18 北京鹏润鸿途科技有限公司 Method and device for playing video of mobile phone website
CN102982161A (en) * 2012-12-05 2013-03-20 北京奇虎科技有限公司 Method and device for acquiring webpage information
CN116881501A (en) * 2016-09-23 2023-10-13 奥多比公司 Providing relevant video scenes in response to a video search query

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102325225A (en) * 2011-09-20 2012-01-18 北京鹏润鸿途科技有限公司 Method and device for playing video of mobile phone website
CN102982161A (en) * 2012-12-05 2013-03-20 北京奇虎科技有限公司 Method and device for acquiring webpage information
CN116881501A (en) * 2016-09-23 2023-10-13 奥多比公司 Providing relevant video scenes in response to a video search query

Similar Documents

Publication Publication Date Title
US12086177B2 (en) System and method for labeling objects for use in vehicle movement
US9372926B2 (en) Intelligent video summaries in information access
KR101475126B1 (en) System and method of inclusion of interactive elements on a search results page
Chen et al. Geotracker: geospatial and temporal RSS navigation
CN102841920B (en) Method and device for extracting webpage frame information
US9971790B2 (en) Generating descriptive text for images in documents using seed descriptors
CN103514234B (en) A kind of page info extracting method and device
US20110191321A1 (en) Contextual display advertisements for a webpage
US9069794B1 (en) Determining location information for images using landmark, caption, and metadata location data
KR20010112686A (en) System and method for facilitating internet search by providing web document layout image and web site structure
CN101446954A (en) Wide area network crawler system for a video website
JP5226784B2 (en) Method and apparatus for providing moving image search service
CN105072460B (en) A kind of information labeling and correlating method based on video content element, system and equipment
CN101169786B (en) A walk-through method for pictures and jump pages of web application pages
CN105407359A (en) Intelligent television programme retrieving and recommending system based on classification label system
TW201030541A (en) Method and system to realize downloading network data into multimedia player
WO2009086730A1 (en) Information system for indexing, search, storage and display control of linked data
RU2399090C2 (en) System and method for real time internet search of multimedia content
CN106874502A (en) A kind of method of video search, device and terminal
CN102819613B (en) RSS information paging grasping system and method
US20090313558A1 (en) Semantic Image Collection Visualization
CN101404026A (en) Crawler system construction method for video-previewing search engine
KR101248186B1 (en) System for generating blog using each content in search result page and method thereof
CN104504070B (en) A search method and device
TWI238333B (en) Website information capturing system and method

Legal Events

Date Code Title Description
C06 Publication
C57 Notification of unclear or unknown address
DD01 Delivery of document by public notice

Addressee: Xu Weiran

Document name: Notification of Passing Preliminary Examination of the Application for Invention

PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090408