[go: up one dir, main page]

CN101763432A - Method for constructing lightweight webpage dynamic view - Google Patents

Method for constructing lightweight webpage dynamic view Download PDF

Info

Publication number
CN101763432A
CN101763432A CN201010033724A CN201010033724A CN101763432A CN 101763432 A CN101763432 A CN 101763432A CN 201010033724 A CN201010033724 A CN 201010033724A CN 201010033724 A CN201010033724 A CN 201010033724A CN 101763432 A CN101763432 A CN 101763432A
Authority
CN
China
Prior art keywords
page
script
embedded
document
dynamic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201010033724A
Other languages
Chinese (zh)
Inventor
张慧琳
诸葛建伟
宋程昱
韩心慧
龚晓锐
邹维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN201010033724A priority Critical patent/CN101763432A/en
Publication of CN101763432A publication Critical patent/CN101763432A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

本发明公开了一种轻量级网页动态视图快速构建方法,属于计算机应用技术领域。本方法为:1)提取待分析页面的本地脚本和静态内嵌链接;2)从静态内嵌链接中提取出静态内嵌脚本;3)利用脚本执行引擎动态执行本地脚本和静态内嵌脚本,识别出该页面的动态内嵌链接;4)将动态内嵌链接所指向的文档和静态内嵌链接所指向的文档构成一内嵌文档集合;5)提取该页面与内嵌文档集合中的文档的引用-被引用关系;6)将识别出的静态内嵌页面和动态内嵌页面分别作为待分析页面,重复步骤1)~5),根据每次得到的内嵌文档集合和引用-被引用关系,建立页面动态视图。本发明以较少的时间代价和较小的系统代价完成页面动态视图的构建。

The invention discloses a method for quickly constructing a dynamic view of a lightweight web page, and belongs to the technical field of computer applications. The method is as follows: 1) extracting the local script and the static embedded link of the page to be analyzed; 2) extracting the static embedded script from the static embedded link; 3) using the script execution engine to dynamically execute the local script and the static embedded script, Identifying the dynamic embedded link of the page; 4) forming an embedded document set from the document pointed to by the dynamic embedded link and the document pointed to by the static embedded link; 5) extracting the document in the page and the embedded document set 6) Take the identified static embedded pages and dynamic embedded pages as the pages to be analyzed respectively, repeat steps 1) to 5), and according to the embedded document collection and cited-cited relationship, and establish a dynamic view of the page. The invention completes the construction of the page dynamic view with less time cost and less system cost.

Description

一种轻量级网页动态视图快速构建方法 A Lightweight Webpage Dynamic View Rapid Construction Method

技术领域technical field

本发明属于计算机应用技术领域,具体涉及一种将页面静态解析和脚本动态执行相结合,进行低交互式网页动态视图快速构建的方法。The invention belongs to the technical field of computer applications, and in particular relates to a method for quickly constructing low-interaction dynamic webpage views by combining static analysis of pages and dynamic execution of scripts.

背景技术Background technique

早期的web页面仅仅简单包含一些文本和图片等内容,随着近年来web技术的发展,网页编写人员常常利用一些内嵌链接和结合脚本运行,以一种无需用户交互的方式将更多活跃的元素和更多精彩的内容在页面中展现出来。Early web pages simply included some text and pictures. With the development of web technology in recent years, web page writers often use some embedded links and combined scripts to run more active pages in a way that does not require user interaction. Elements and more exciting content are displayed on the page.

内嵌链接是HTML页面中一类带有src属性的特殊的超链接形式,其特点是,当浏览器访问带有内嵌链接的页面时,无需用户点击,链接指向的内容就会被自动加载,如带有src属性的<iframe>、<frame>指向的是一个内嵌页面,<script>的src指向的是一个内嵌脚本文件。页面中的脚本作为一种客户端即时执行代码,大大提高了网页的浏览速度和交互能力,使得网页和用户之间实现了一种实时性的关系,常常通过脚本动态执行来生成一些内嵌链接并将其载入页面。Embedded links are a special form of hyperlinks with src attributes in HTML pages. Its characteristic is that when a browser accesses a page with an embedded link, the content pointed to by the link will be automatically loaded without the user clicking. , such as <iframe> with src attribute, <frame> points to an embedded page, and the src of <script> points to an embedded script file. The script in the page is used as a client to execute the code in real time, which greatly improves the browsing speed and interactive ability of the webpage, and realizes a real-time relationship between the webpage and the user. Often, some embedded links are generated through the dynamic execution of the script. and load it into the page.

本发明抽象出网页动态视图的概念,其对应于用户访问页面时,浏览器自动加载内嵌链接指向的内嵌文档的层次结构图。如附图1所示,用户访问页面X时,即使无进一步交互,图中所有内嵌文档均会被浏览器自动加载或执行。网页通过动态视图带给了用户一种更丰富的体验,然而却也给一些工作带来了挑战。例如网页木马的漏洞攻击代码可以存在于网页动态视图中任意位置而被浏览器自动加载并执行,进行网页木马的全面检测必须要基于网页动态视图;例如一些舆情分析,也必须基于网页动态视图来对网页进行全面的内容分析。The present invention abstracts the concept of webpage dynamic view, which corresponds to when a user visits a page, the browser automatically loads the hierarchical structure diagram of the embedded document pointed to by the embedded link. As shown in Figure 1, when a user visits page X, all embedded documents in the figure will be automatically loaded or executed by the browser even if there is no further interaction. Web pages bring users a richer experience through dynamic views, but it also brings challenges to some tasks. For example, the vulnerability attack code of a webpage Trojan can exist anywhere in the dynamic view of the webpage and be automatically loaded and executed by the browser. A comprehensive detection of a webpage Trojan must be based on the dynamic view of the webpage; for example, some public opinion analysis must also be based on the dynamic view of the webpage. Perform comprehensive content analysis of web pages.

用户访问页面时,该页面的动态视图会在浏览器中完成自动装载。因此可以采用一种高交互式的方法获取网页动态视图,即监控浏览器浏览页面时的行为,捕获到用户无交互情况下的所有文档访问内容,但是该种方法往往有较大的时间代价和系统代价。When a user visits a page, a dynamic view of that page is automatically loaded in the browser. Therefore, a highly interactive method can be used to obtain the dynamic view of the webpage, that is, to monitor the behavior of the browser when browsing the page, and capture all the document access content without user interaction, but this method often has a large time cost and system cost.

发明内容Contents of the invention

本发明对目前页面元素自动加载机理进行了深入分析,目的在于提供一种轻量级网页动态视图快速构建方法,其将页面静态解析和脚本动态执行相结合来模拟浏览器,能够在占用较少系统资源的同时快速地构建出网页动态视图。The present invention deeply analyzes the automatic loading mechanism of current page elements, and aims to provide a method for quickly constructing a lightweight web page dynamic view, which combines static analysis of pages and dynamic execution of scripts to simulate a browser, and can take up less time. While saving system resources, quickly build a dynamic view of the web page.

为了便于说明,本发明引入如下概念:For ease of description, the present invention introduces the following concepts:

1.内嵌链接:指含有src属性的一类特殊超链接,其所指向的是不同类型的内嵌文档,如<iframe>/<frame>类内嵌页面以及<script>内嵌脚本等,内嵌文档无需用户点击即可被浏览器自动加载;1. Embedded link: refers to a special type of hyperlink containing the src attribute, which points to different types of embedded documents, such as <iframe>/<frame> embedded pages and <script> embedded scripts, etc. Embedded documents can be automatically loaded by the browser without user clicking;

2.静态内嵌链接:在页面源文件中以静态标签形式指明的内嵌链接;2. Static embedded link: an embedded link specified in the form of a static tag in the page source file;

3.动态内嵌链接:通过脚本动态执行而自动生成的内嵌链接;3. Dynamic embedded link: an embedded link automatically generated through the dynamic execution of the script;

4.本地脚本:在<script>和</script>之间的脚本内容,相对于带有src的内嵌脚本而言;4. Local script: the script content between <script> and </script>, relative to the embedded script with src;

5.网页动态视图:页面被访问时,内嵌链接被自动加载的层次结构图(如附图1),其有如下三方面特征:5. Web page dynamic view: when the page is accessed, the hierarchical structure diagram of the embedded link is automatically loaded (as shown in Figure 1), which has the following three characteristics:

a)网页动态视图由被访问页面和该页面被访问时被浏览器自动载入的内嵌页面、内嵌脚本等内嵌链接所指向的文档组成;a) The web page dynamic view is composed of the visited page and the documents pointed to by embedded links such as embedded pages and embedded scripts automatically loaded by the browser when the page is accessed;

b)网页动态视图中的内嵌链接或者在页面源文件中以静态标签的形式指明或者通过脚本动态执行而生成;b) The embedded link in the dynamic view of the web page is either specified in the form of a static tag in the page source file or generated through the dynamic execution of the script;

c)网页动态视图有一定的层次结构——即内嵌页面同样也有内嵌链接。c) The dynamic view of the web page has a certain hierarchical structure—that is, embedded pages also have embedded links.

综合5中的a)、b)、c)三点:网页动态视图对应于用户访问页面时,浏览器对内嵌文档自动加载层次结构图。网页动态视图是一个树形结构,其中待分析文档和内嵌文档作为结点集,文档之间的引用-被引用关系作为边,待分析文档为树的根结点。用户访问某页面时,该页面动态视图中的所有内嵌文档都被浏览器按照顺序自动加载。Synthesize the three points of a), b) and c) in 5: the dynamic view of the webpage corresponds to when the user visits the page, the browser automatically loads the hierarchical structure diagram for the embedded document. The web page dynamic view is a tree structure, in which the document to be analyzed and the embedded document are used as node sets, the reference-reference relationship between documents is used as an edge, and the document to be analyzed is the root node of the tree. When a user visits a certain page, all embedded documents in the dynamic view of the page are automatically loaded by the browser in order.

本发明通过低交互式地模拟浏览器,分析出页面被浏览器访问时所有被载入的内嵌文档和引用-被引用关系,从而构建出网页的动态视图。主要的思想是通过页面静态解析结合脚本动态执行来获取当前页面中的内嵌链接,通过一定的策略来进一步递归分析内嵌页面,最终构建出页面被浏览器访问时相对应的网页动态视图。The invention simulates the browser in a low-interactive manner, analyzes all loaded embedded documents and reference-reference relationships when the page is accessed by the browser, and thus constructs a dynamic view of the web page. The main idea is to obtain embedded links in the current page through static analysis of the page combined with dynamic script execution, further recursively analyze the embedded page through a certain strategy, and finally construct a dynamic view of the webpage corresponding to the page accessed by the browser.

本发明提出的低交互式网页动态视图快速构建方法主要包括以下几个步骤:The low-interaction web page dynamic view rapid construction method proposed by the present invention mainly includes the following steps:

(1)将待分析页面指定为当前页面(1) Designate the page to be analyzed as the current page

(2)通过页面静态解析来对当前页面进行本地脚本解析和静态内嵌链接识别,其具体处理方法如下:(2) Perform local script analysis and static embedded link identification on the current page through static analysis of the page. The specific processing method is as follows:

a)本地脚本解析:提取本地脚本,交由步骤(3)待处理a) Local script parsing: extract the local script and submit it to step (3) to be processed

b)静态内嵌链接识别,识别出所有的静态内嵌链接并进一步进行如下操作:b) Static embedded link identification, identifying all static embedded links and further performing the following operations:

i.将识别出的静态内嵌脚本,交由步骤(3)待处理i. Submit the identified static embedded script to step (3) for processing

ii.将识别出的静态内嵌页面,交由步骤(5)待处理ii. Submit the identified static embedded page to step (5) for processing

(3)以脚本执行引擎为核心,进行脚本动态执行来识别出当前页面的动态内嵌链接,主要有如下两方面工作:(3) With the script execution engine as the core, the script is dynamically executed to identify the dynamic embedded link of the current page, mainly in the following two aspects:

a)进行DOM模拟,将其作为脚本执行的上下文环境,其主要分为如下两个部分:a) Perform DOM simulation and use it as the context environment for script execution, which is mainly divided into the following two parts:

i.用脚本语言模拟浏览器提供给脚本的应用程序接口(API)i. Use a scripting language to simulate the application programming interface (API) provided by the browser to the script

ii.用脚本语言模拟浏览器自动构建出与页面相对应的DOM树(DocumentObject Model,即文档对象模型)ii. Simulate the browser with a scripting language to automatically build a DOM tree (DocumentObject Model, Document Object Model) corresponding to the page

b)动态内嵌链接识别:在与内嵌链接自动加载相关的关键函数中捕获到动态生成的内嵌链接并进一步进行如下处理:b) Dynamic embedded link identification: Capture the dynamically generated embedded link in the key functions related to the automatic loading of embedded links and further process it as follows:

i.识别出的脚本,在步骤(3)中即时执行i. The identified script is executed immediately in step (3)

ii.识别出的动态内嵌页面,交由步骤(5)处理ii. The identified dynamic embedded pages are processed by step (5)

(4)步骤(2)和步骤(3)分析出了当前页面的静态内嵌链接和动态内嵌链接,这些内嵌链接指明的文档构成一个内嵌文档集合,当前页面和内嵌文档集合中的每个文档有引用和被引用关系(4) Steps (2) and (3) analyze the static embedded links and dynamic embedded links of the current page. The documents specified by these embedded links constitute an embedded document collection, and the current page and the embedded document collection Each document has a reference and a referenced relationship

(5)通过一定的内嵌页面递归分析策略来分析当前页面的内嵌页面:对(2)-b)-ii分析出的静态内嵌页面和(3)-b)-ii分析出的动态内嵌页面,逐个将其指定为当前页面,并回到步骤(2)进行分析(5) Analyze the embedded page of the current page through a certain embedded page recursive analysis strategy: the static embedded page analyzed by (2)-b)-ii and the dynamic analysis analyzed by (3)-b)-ii Embedded pages, designate them as the current page one by one, and return to step (2) for analysis

(6)根据每次步骤(4)所得到的内嵌文档集合和引用-被引用关系,完成页面动态视图构建。(6) According to the embedded document collection and reference-reference relationship obtained in step (4) each time, the page dynamic view construction is completed.

本发明的优点和积极效果如下:Advantage of the present invention and positive effect are as follows:

1.将页面静态解析和脚本动态执行结合,全面分析出页面被浏览器访问时所有被载入的内嵌文档和各个文档之间的引用-被引用关系。1. Combining static analysis of pages with dynamic execution of scripts, it can comprehensively analyze all loaded embedded documents and the reference-reference relationship between each document when the page is accessed by the browser.

2.轻量级低交互式模拟浏览器,以较少的时间代价和较小的系统代价完成页面动态视图的构建。2. Lightweight and low-interaction simulates the browser, and completes the construction of the dynamic view of the page with less time cost and less system cost.

附图说明Description of drawings

图1页面的动态视图示例An example of a dynamic view of a page in Figure 1

图2本发明一种轻量级网页动态视图快速构建方法的结构图Fig. 2 is a structural diagram of a method for quickly constructing a lightweight web page dynamic view in the present invention

具体实施方式Detailed ways

以下结合附图2,对本发明中方法进行详细说明,一个具体实施将数据结构1的实例集合作为网页动态视图的结构化表示,Below in conjunction with accompanying drawing 2, the method among the present invention is described in detail, and a concrete implementation uses the example set of data structure 1 as the structured expression of web page dynamic view,

                                 struct node_link{struct node_link{

                               Node_Type child_type;Node_Type child_type;

                                   string parentURL;string parentURL;

                                   string childURL;string childURL;

                         }                    }

                                 数据结构1定义Definition of data structure 1

其中enum Node_Type{InlineScript,InlinePage...}where enum Node_Type{InlineScript, InlinePage...}

该实施例的步骤如下:The steps of this embodiment are as follows:

(1)用JavaScript脚本语言模拟浏览器提供给脚本的各种DOM API,将代码放入defineDOMAPI.js文件中,其中:(1) Use the JavaScript scripting language to simulate various DOM APIs provided by the browser to the script, and put the code into the defineDOMAPI.js file, where:

a)定义window.open(),通过print方式将参数中的url输出;a) Define window.open(), and output the url in the parameter through print;

b)定义document.write(),其调用回调函数对其参数内容进行进一步解析,解析出的脚本内容通过执行eval函数在当前上下文即时执行,解析出的内嵌链接通过print方式输出;b) Define document.write(), which calls the callback function to further analyze the content of its parameters. The parsed script content is executed immediately in the current context by executing the eval function, and the parsed embedded link is output through print;

c)定义onmouseover、onmouseout、onload事件自动触发;c) Define onmouseover, onmouseout, onload events to trigger automatically;

d)DOM API中的其余属性赋值为默认值,其余函数定义为空函数体;d) The rest of the attributes in the DOM API are assigned default values, and the rest of the functions are defined as empty function bodies;

(2)将待分析文档的URL设定为当前页面的URL,集合S为空(2) Set the URL of the document to be analyzed as the URL of the current page, and the collection S is empty

(3)以当前页面的URL为名新建文件夹,并下载该页面,基于SAX(Sample API forXML),即“XML简单API”,对该页面进行静态解析,在解析过程中:(3) Create a new folder with the URL of the current page as the name, and download the page, based on SAX (Sample API for XML), that is, "XML simple API", perform static analysis on the page, during the analysis process:

a)每解析到一个标签,写出根据(1)中定义的API构建与该标签相对应DOM对象的JavaScript语句,将该JavaScript语句加入当前页面的buildDOMTree变量(string类型)中;a) Every time a label is parsed, write out the JavaScript statement that builds the DOM object corresponding to the label according to the API defined in (1), and add the JavaScript statement to the buildDOMTree variable (string type) of the current page;

b)每解析出静态内嵌链接,将src指向的URL放入当前页面的inline_linking变量(list类型)中;b) Put the URL pointed to by src into the inline_linking variable (list type) of the current page every time a static inline link is parsed;

此外:also:

c)解析出的本地脚本,放入当前页面的jscript变量(string类型)中;c) The parsed local script is put into the jscript variable (string type) of the current page;

d)解析到带有src的<script>,将其内容下载并放入当前页面的jscript变量中;d) parse to <script> with src, download its content and put it into the jscript variable of the current page;

e)解析到带有src的<iframe>或<frame>,将src指向的URL放入当前页面的inlinepage变量(List类型)e) Parse to <iframe> or <frame> with src, and put the URL pointed to by src into the inlinepage variable (List type) of the current page

(4)进行脚本动态执行,具体步骤为首先在当前目录下新建临时空文件tmp.js,接下来:(4) Perform script dynamic execution. The specific steps are to first create a temporary empty file tmp.js in the current directory, and then:

a)将(1)中的defineDOMAPI.js文件中内容写入tmp.js中a) Write the content in the defineDOMAPI.js file in (1) to tmp.js

b)将(3)-a)中的当前页面的buildDOMTree写入tmp.js中b) Write the buildDOMTree of the current page in (3)-a) into tmp.js

c)将(3)中的当前页面的jscript写入tmp.js中c) Write the jscript of the current page in (3) into tmp.js

d)用脚本引擎执行tmp.js,并将结果放入当前目录下的output.js文件中d) Execute tmp.js with the script engine, and put the result into the output.js file in the current directory

(5)主程序的merge模块提取output.js文件中的内嵌链接,放入当前页面的inline_linking变量,并将<iframe>或<frame>的src指向的URL放入当前页面的inlinepage变量中;(5) The merge module of the main program extracts the embedded link in the output.js file, puts it into the inline_linking variable of the current page, and puts the URL pointed to by the src of <iframe> or <frame> into the inlinepage variable of the current page;

(6)步骤(3)-b)和步骤(5)得到的inline_linking变量为当前页面中的静态内嵌链接和动态内嵌链接集合,下一步:根据当前页面的URL和inline_linking中的URL,构造数据结构1的不同实例,并将这些实例放入集合S(6) The inline_linking variable obtained in step (3)-b) and step (5) is a set of static inline links and dynamic inline links in the current page. Next step: according to the URL of the current page and the URL in inline_linking, construct different instances of the data structure 1, and put those instances into the set S

(7)递归分析当前页面的内嵌页面,即依次将当前页面inlinepage变量中的URL设定为当前页面的URL,回到步骤(3);(7) recursively analyze the embedded pages of the current page, promptly the URL in the current page inlinepage variable is set to the URL of the current page successively, and get back to step (3);

(8)集合S代表带分析页面的网页动态视图的数据表示(8) The set S represents the data representation of the dynamic view of the web page with the analysis page

尽管为说明目的公开了本发明的具体实施例和附图,其目的在于帮助理解本发明的内容并据以实施,但是本领域的技术人员可以理解:在不脱离本发明及所附的权利要求的精神和范围内,各种替换、变化和修改都是可能的。本发明不应局限于本说明书最佳实施例和附图所公开的内容,本发明要求保护的范围以权利要求书界定的范围为准。Although specific embodiments and drawings of the present invention are disclosed for the purpose of illustration, the purpose is to help understand the content of the present invention and implement it accordingly, but those skilled in the art can understand that: without departing from the present invention and the appended claims Various substitutions, changes and modifications are possible within the spirit and scope of . The present invention should not be limited to the content disclosed in the preferred embodiment of the specification and the accompanying drawings, and the protection scope of the present invention is subject to the scope defined in the claims.

Claims (6)

1. a method for constructing lightweight webpage dynamic view the steps include:
1) the local script of the extraction page to be analyzed and static inline link;
2) from described static inline link, extract the static inline script;
3) utilize script executing engine dynamically to carry out described local script and described static inline script, identify the dynamic embedded link of this page;
4) document and described static inline link document pointed described dynamic embedded link is pointed constitute an embedded collection of document;
That 5) extracts document in this page and the described embedded collection of document quotes-is cited relation;
6) the static inline page that will identify according to the link of described static inline and the dynamic embedded page that identifies according to described dynamic embedded link are respectively as the page to be analyzed, repeating step 1)~5), according to the embedded collection of document that obtains and quote-be cited relation at every turn, set up page dynamic view.
2. the method for claim 1 is characterized in that described page dynamic view is a tree structure, and document wherein to be analyzed and embedded document are set of node; Quoting and be cited that to close be the limit between the document, document to be analyzed is the root node of tree.
3. method as claimed in claim 1 or 2, when it is characterized in that described script executing engine is carried out described local script and described static inline script, embedded link to the dynamic generation that captures is handled, its method is: identify the script of the embedded link of described dynamic generation, and utilize described script executing engine to carry out this script; Identify the embedded link dynamic embedded page pointed of described dynamic generation simultaneously, and with this page as the page to be analyzed.
4. method as claimed in claim 1 or 2, it is characterized in that described script executing engine utilizes DOM Document Object Model as the script executing environment, its method is: utilize script simulation browser to offer the application programming interfaces of script, utilize script simulation browser to construct the document object model tree corresponding with the page to be analyzed then.
5. method as claimed in claim 4 is characterized in that adopting JavaScript script simulation browser to offer the application programming interfaces of the various document object model tree of script.
6. the method for claim 1 is characterized in that based on SAX the described page to be analyzed being carried out static state resolves, and extracts the local script and the static inline link of this page to be analyzed.
CN201010033724A 2010-01-05 2010-01-05 Method for constructing lightweight webpage dynamic view Pending CN101763432A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010033724A CN101763432A (en) 2010-01-05 2010-01-05 Method for constructing lightweight webpage dynamic view

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010033724A CN101763432A (en) 2010-01-05 2010-01-05 Method for constructing lightweight webpage dynamic view

Publications (1)

Publication Number Publication Date
CN101763432A true CN101763432A (en) 2010-06-30

Family

ID=42494596

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010033724A Pending CN101763432A (en) 2010-01-05 2010-01-05 Method for constructing lightweight webpage dynamic view

Country Status (1)

Country Link
CN (1) CN101763432A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102346780A (en) * 2011-10-18 2012-02-08 北龙中网(北京)科技有限责任公司 Method and device for acquiring webpage address
CN102955854A (en) * 2012-11-06 2013-03-06 北京中娱在线网络科技有限公司 Webpage presenting method and device based on HTML5 (Hypertext Markup Language 5) protocol
CN103530289A (en) * 2012-07-02 2014-01-22 腾讯科技(深圳)有限公司 Webpage displaying method and device
CN103635897A (en) * 2011-06-23 2014-03-12 微软公司 Dynamically updating a running page
CN104169898A (en) * 2011-12-28 2014-11-26 英特尔公司 Method and apparatus for streaming metadata between devices using javaScript and HTML5
CN109960769A (en) * 2019-03-15 2019-07-02 广州视源电子科技股份有限公司 Webpage view display method and device, computer equipment and storage medium
US10540416B2 (en) 2011-06-23 2020-01-21 Microsoft Technology Licensing, Llc Linking source code to running element
CN117910438A (en) * 2024-03-13 2024-04-19 江苏中威科技软件系统有限公司 Dynamic format file DLF generation device

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103635897B (en) * 2011-06-23 2019-04-26 微软技术许可有限责任公司 The method that dynamic update is carried out to the operation page
US10540416B2 (en) 2011-06-23 2020-01-21 Microsoft Technology Licensing, Llc Linking source code to running element
CN103635897A (en) * 2011-06-23 2014-03-12 微软公司 Dynamically updating a running page
US10534830B2 (en) 2011-06-23 2020-01-14 Microsoft Technology Licensing, Llc Dynamically updating a running page
CN102346780A (en) * 2011-10-18 2012-02-08 北龙中网(北京)科技有限责任公司 Method and device for acquiring webpage address
CN102346780B (en) * 2011-10-18 2016-01-06 北龙中网(北京)科技有限责任公司 The acquisition methods of web page address and acquisition device
CN104169898A (en) * 2011-12-28 2014-11-26 英特尔公司 Method and apparatus for streaming metadata between devices using javaScript and HTML5
US9848032B2 (en) 2011-12-28 2017-12-19 Intel Corporation Method and apparatus for streaming metadata between devices using JavaScript and HTML5
CN104169898B (en) * 2011-12-28 2018-04-27 英特尔公司 Method and apparatus for transmitting metadata as a stream between devices using JavaScript and HTML5
CN103530289A (en) * 2012-07-02 2014-01-22 腾讯科技(深圳)有限公司 Webpage displaying method and device
CN103530289B (en) * 2012-07-02 2018-06-22 腾讯科技(深圳)有限公司 Webpage display process and device
WO2014071749A1 (en) * 2012-11-06 2014-05-15 北京中娱在线网络科技有限公司 Html5-protocol-based webpage presentation method and device
CN102955854B (en) * 2012-11-06 2015-11-25 搜游网络科技(北京)有限公司 A kind of webpage exhibiting method based on HTML5 agreement and device
CN102955854A (en) * 2012-11-06 2013-03-06 北京中娱在线网络科技有限公司 Webpage presenting method and device based on HTML5 (Hypertext Markup Language 5) protocol
CN109960769A (en) * 2019-03-15 2019-07-02 广州视源电子科技股份有限公司 Webpage view display method and device, computer equipment and storage medium
CN109960769B (en) * 2019-03-15 2021-08-31 广州视源电子科技股份有限公司 Webpage view display method and device, computer equipment and storage medium
CN117910438A (en) * 2024-03-13 2024-04-19 江苏中威科技软件系统有限公司 Dynamic format file DLF generation device

Similar Documents

Publication Publication Date Title
CN108304498B (en) Webpage data acquisition method and device, computer equipment and storage medium
CN104200166B (en) Script-based website vulnerability scanning method and system
CN101127038B (en) System and method for downloading website static web pages
US8196039B2 (en) Relevant term extraction and classification for Wiki content
CN102184184B (en) Method for acquiring webpage dynamic information
WO2016173200A1 (en) Malicious website detection method and system
CN101763432A (en) Method for constructing lightweight webpage dynamic view
CN101562618B (en) A method and device for detecting internet horses
US8424004B2 (en) High performance script behavior detection through browser shimming
CN109144567B (en) Cross-platform webpage rendering method and device, server and storage medium
CN102662966B (en) A theme-oriented method and system for acquiring dynamic page content
CN103268361A (en) Method, device and system for extracting hidden URLs in web pages
CN104408204A (en) Method and device for obtaining webpage page link address
KR20080053293A (en) Server-Side Initial Content Rendering for Client Script Web Pages
CN102880607A (en) network dynamic content capturing method and network dynamic content crawler system
CN112612943A (en) Asynchronous processing framework-based data crawling method with automatic testing function
CN114297700B (en) Dynamic and static combined mobile application privacy protocol extraction method and related equipment
CN114546381A (en) Front-end page code file generation method, device, electronic device and storage medium
CN106294885A (en) A kind of data collection towards isomery webpage and mask method
CN100543741C (en) System and method for automatically downloading and filtering web pages
CN103853717A (en) Web crawler
CN106991144B (en) Method and system for customizing data crawling workflow
CN110851678A (en) Method and device for crawling data
CN118740675A (en) Network supportability testing method, device, equipment, medium and program product
CN112749351B (en) Link address determination method, device, computer readable storage medium and equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20100630