CN107316203B - Method and device for detecting display information, computer readable storage medium and terminal

Info

Publication number
CN107316203B
Authority
CN
China
Prior art keywords
nested
webpage
display information
nested webpage
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710387660.3A
Other languages
Chinese (zh)
Other versions
CN107316203A (en)
Inventor
汤奇峰
巫君平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zamplus Advertising Shanghai Co ltd
Original Assignee
Zamplus Advertising Shanghai Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zamplus Advertising Shanghai Co ltd
Priority to CN201710387660.3A
Publication of CN107316203A
Application granted
Publication of CN107316203B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements
    • G06Q30/0242 Determining effectiveness of advertisements
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A method and device for detecting display information, a computer-readable storage medium, and a terminal are provided. The method comprises the following steps: determining a nested webpage set in the top-level page in which a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises the windows of all nested webpages nested in the top-level page; sequentially accessing, through the first nested webpage, the windows of the nested webpages in the nested webpage set; if the access is successful, judging whether the information presented by the accessed nested webpage is display information of the same type as the information to be displayed; and if so, acquiring at least one item of statistical data of the nested webpage. The technical scheme of the invention enables detection of display information so that its display effect can be evaluated.

Description

Method and device for detecting display information, computer readable storage medium and terminal
Technical Field
The present invention relates to the field of information processing technologies, and in particular, to a method and an apparatus for detecting display information, a computer-readable storage medium, and a terminal.
Background
When display information, such as an advertisement or a notification, is delivered through a demand-side platform (DSP) advertisement delivery system, the DSP obtains display positions from a media service platform (SSP) (also called a media alliance) by real-time bidding. Once a display position has been successfully obtained, the DSP needs to detect the display effect of the other display positions shown on the same SSP (or web page) at that time. This data can be used to analyze the quality of each SSP's display positions and the delivery effect and delivery strategy of other DSPs, so that the DSP can adjust its own delivery strategy and optimize the effect of its display information.
In the prior art, one way to obtain display effect information for other display positions is to use a crawler to access the SSP and crawl the information of the relevant display positions. Another way is for the DSP to access the Document Object Model (DOM) elements of the parent page, find the iframe elements of the other display positions, and then access the DOM elements of the webpage nested in each iframe element to acquire information. For example, the webpage nested in iframe1, which presents display information 1, finds its parent page, then finds iframe2, which presents display information 2, among the children of the parent page, and accesses the webpage nested in iframe2 to acquire information.
However, obtaining information with a crawler has drawbacks. Because crawling is machine behavior, other DSPs can filter out crawler traffic by technical means during bidding, so information acquisition through the crawler fails; and because of the crawler's resource limits (e.g., IP resources and cookie resources), the information it obtains has a high error rate. When information is acquired through iframes, the display information that a DSP renders through the SSP is confined to a secure sandbox (namely, the iframe), and access to DOM elements is not allowed across domains, so the display effect of other display positions cannot be detected and evaluated.
Disclosure of Invention
The technical problem addressed by the invention is how to detect display information so that its display effect can be evaluated.
In order to solve the above technical problem, an embodiment of the present invention provides a method for detecting display information, which includes: determining a nested webpage set in the top-level page in which a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises the windows of all nested webpages nested in the top-level page; sequentially accessing, through the first nested webpage, the windows of the nested webpages in the nested webpage set; if the access is successful, judging whether the information presented by the accessed nested webpage is display information of the same type as the information to be displayed; and if so, acquiring at least one item of statistical data of the nested webpage.
Optionally, the judging whether the information presented by the nested webpage is display information of the same type as the information to be displayed includes: acquiring the URL address of the nested webpage; and if the URL address of the nested webpage includes a set field, determining that the information presented by the nested webpage is display information.
Optionally, the at least one statistical datum is selected from: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage.
Optionally, the window of the nested webpage is a window object of the nested webpage.
Optionally, the acquiring at least one statistical data of the nested web page includes: determining the size of the nested webpage according to the attribute information of the window object of the nested webpage.
Optionally, the acquiring at least one statistical data of the nested web page includes: acquiring HTML content of the window object of the nested webpage; and acquiring the material data in one or more of the following ways: rendering the HTML content to obtain material data of the nested webpage; and acquiring the URL address of a picture or of a video file in the HTML content to be used as material data of the nested webpage.
Optionally, the acquiring at least one statistical data of the nested web page includes: acquiring HTML content of the window object of the nested webpage; acquiring a target URL address in the HTML content; accessing the target URL address and acquiring a target page domain name; and extracting source data of the nested webpage from the target page domain name; or, rendering the HTML content to obtain a rendering result; accessing the rendering result and acquiring a target page domain name; and extracting source data of the nested webpage from the target page domain name.
Optionally, the acquiring at least one statistical data of the nested web page includes: determining click data of the nested webpage based on a click event bound to the window object of the nested webpage; or, within a set time interval, if the document's active element (document.activeElement) changes to an inner-layer iframe element, adding one to the number of clicks, and determining the final number of clicks as the click data of the nested webpage.
Optionally, the acquiring at least one statistical data of the nested web page includes: if at least a part of the nested webpage is in a visible area, adding one to the visible count, and determining the final visible count as the visible data of the nested webpage, wherein the visible area is within the screen range in which the nested webpage is presented.
The embodiment of the invention also discloses a detection device for the display information, which comprises: the nested webpage set determining module is suitable for determining a nested webpage set in a top-level page where a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises windows of all nested webpages nested in the top-level page; the access module is suitable for sequentially accessing the windows of the nested webpages in the nested webpage set through the first nested webpage; the judging module is suitable for judging whether the information presented by the nested webpage is the display information of the same type as the information to be displayed if the access is successful; and the acquisition module is suitable for acquiring at least one statistical data of the nested webpage if the display information is judged to be the display information.
Optionally, the determining module includes: the URL address acquisition unit is suitable for acquiring the URL address of the nested webpage; and the judging unit is suitable for determining the information presented by the nested webpage to be display information if the URL address of the nested webpage comprises a set field.
Optionally, the at least one statistical datum is selected from: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage.
Optionally, the window of the nested webpage is a window object of the nested webpage.
Optionally, the obtaining module includes: a size determining unit, adapted to determine the size of the nested webpage according to the attribute information of the window object of the nested webpage.
Optionally, the obtaining module includes: a first HTML content acquisition unit, adapted to acquire HTML content of the window object of the nested webpage; a first rendering unit, adapted to render the HTML content to obtain material data of the nested webpage; and an address acquisition unit, adapted to acquire the URL address of a picture or of a video file in the HTML content to serve as material data of the nested webpage.
Optionally, the obtaining module includes: the second HTML content acquisition unit is suitable for acquiring the HTML content of the window object of the nested webpage; the target URL address acquisition unit is suitable for acquiring a target URL address in the HTML content; the access unit is suitable for accessing the target URL address and acquiring a target page domain name; the first extraction unit is suitable for extracting the source data of the nested webpage from the target page domain name; the second rendering unit is suitable for rendering the HTML content to obtain a rendering result; the target domain name acquisition unit is suitable for accessing the rendering result and acquiring a target page domain name; and the second extraction unit is suitable for extracting the source data of the nested webpage from the target page domain name.
Optionally, the obtaining module includes: a first click data determining unit, adapted to determine click data of the nested webpage based on a click event bound to the window object of the nested webpage; and a second click data determining unit, adapted to add one to the number of clicks if, within a set time interval, the document's active element (document.activeElement) changes to an inner-layer iframe element, and to determine the final number of clicks as the click data of the nested webpage.
Optionally, the obtaining module includes: a visibility data determination unit adapted to add one to a visibility number if at least a portion of the nested web page is in a visible area, and determine a final visibility number as visibility data of the nested web page, wherein the visible area is within a screen range in which the nested web page is rendered.
The embodiment of the invention also discloses a computer readable storage medium, wherein a computer instruction is stored on the computer readable storage medium, and the computer instruction executes the steps of the detection method of the display information when running.
The embodiment of the invention also discloses a terminal which comprises a memory and a processor, wherein the memory is stored with a computer instruction capable of running on the processor, and the processor executes the step of the detection method of the display information when running the computer instruction.
Compared with the prior art, the technical scheme of the embodiment of the invention has the following beneficial effects:
In the technical scheme of the invention, a nested webpage set is determined in the top-level page in which a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises the windows of all nested webpages nested in the top-level page; the windows of the nested webpages in the set are sequentially accessed through the first nested webpage; if the access is successful, it is judged whether the information presented by the accessed nested webpage is display information of the same type as the information to be displayed; and if so, at least one item of statistical data of the nested webpage is acquired. A successful access indicates that the first nested webpage and the accessed nested webpage are in the same domain, and the statistical data acquired once the type check passes is used for display effect evaluation. Compared with the prior art, the technical scheme of the invention avoids the iframe access approach and reduces the risk of access being refused, which helps ensure that statistical data is obtained, realizes detection of the display information, and improves its detection efficiency.
Further, the at least one statistical data is selected from: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage. According to the technical scheme, various types of statistical data are adopted, the display effect of the display information presented by the nested webpage can be evaluated from multiple dimensions, and the detection efficiency of the display information can be further improved.
Drawings
Fig. 1 is a flowchart of a method for detecting display information according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a device for detecting display information according to an embodiment of the present invention.
Detailed Description
As described in the background, obtaining information with a crawler has drawbacks. Because crawling is machine behavior, other DSPs can filter out crawler traffic by technical means during bidding, so information acquisition through the crawler fails; and because of the crawler's resource limits (e.g., IP resources and cookie resources), the information it obtains has a high error rate. When information is acquired through frames (iframes), the display information rendered by a DSP is confined to a secure sandbox (i.e., the iframe), and mutual access to DOM elements is not allowed across domains, so the display effect of other display positions cannot be detected and evaluated.
The inventor of the application analyzed the problem and determined that, since webpages in the same domain (including one nested webpage and another nested webpage, and a parent webpage and its child webpage) can access each other, all nested webpages can be identified and access to each can be attempted, after which display position information of different dimensions is collected to evaluate the delivery effect of the other display positions.
According to the technical scheme, a nested webpage set is determined through the top-level page in which the first nested webpage is located, and the windows of the nested webpages in the set are then sequentially accessed through the first nested webpage. A successful access indicates that the first nested webpage and the accessed nested webpage are in the same domain; when the information presented by the accessed nested webpage is judged to be display information of the same type as the information to be displayed, at least one item of statistical data of the nested webpage is acquired for display effect evaluation. Compared with the prior art, the technical scheme of the invention avoids the iframe access approach and reduces the risk of access being refused, which helps ensure that statistical data is obtained, realizes detection of the display information, and improves its detection efficiency.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail below.
Fig. 1 is a flowchart of a detection method for presentation information according to an embodiment of the present invention.
The method for detecting display information shown in Fig. 1 may include the following steps:
Step S101: determining a nested webpage set in the top-level page in which a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises the windows of all nested webpages nested in the top-level page;
Step S102: sequentially accessing, through the first nested webpage, the windows of the nested webpages in the nested webpage set;
Step S103: if the access is successful, judging whether the information presented by the accessed nested webpage is display information of the same type as the information to be displayed;
Step S104: if so, acquiring at least one item of statistical data of the nested webpage.
In a specific implementation, in step S101, a nested webpage set is determined in the top-level page in which the first nested web page presenting the information to be displayed is located. A nested web page is a web page that is nested within another web page. In particular, web pages may have a multi-level nesting relationship, and the parent page of the first nested web page may itself be a nested web page. The top-level page is a page without a parent page, i.e., a page that is not itself nested. In this embodiment, all web pages nested in the top-level page can be determined through the top-level page. Each web page corresponds to a web page window, so the windows of all nested web pages in the top-level page, namely the nested webpage set, can be determined through the top-level page. The nested webpage set is determined because the information of a nested web page can be acquired through its window.
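As an illustrative sketch only (it does not appear in the original disclosure), step S101 can be pictured as a depth-first walk over the frame tree starting at the top-level window. The `FrameWindow` type and the function name `collectNestedWindows` are hypothetical; they model just the array-like `length` and indexed child-frame access that a browser window object exposes.

```typescript
// Hypothetical structural type: only the array-like frame access of a window.
// In a browser, `window.top` has this shape (child frames at win[0], win[1], ...).
interface FrameWindow {
  readonly length: number; // number of direct child frames
  readonly [index: number]: FrameWindow; // windows of the direct child frames
}

// Depth-first walk gathering the window of every page nested under `top`.
// The top-level window itself is not included in the result.
function collectNestedWindows(top: FrameWindow): FrameWindow[] {
  const result: FrameWindow[] = [];
  const visit = (win: FrameWindow): void => {
    for (let i = 0; i < win.length; i++) {
      const child = win[i];
      if (child === undefined) continue; // defensive: skip a missing slot
      result.push(child);
      visit(child); // nested pages may nest further pages
    }
  };
  visit(top);
  return result;
}
```

In a page, the first nested web page would call this with its own `window.top`; the resulting array corresponds to the nested webpage set described above.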
In a specific implementation, in step S102, the windows of the nested web pages in the nested webpage set are sequentially accessed through the first nested web page. Specifically, due to the restriction on cross-domain access, the first nested web page can only access web pages in the same domain as itself. If the access fails, the accessed nested web page is in a different domain from the first nested web page. More specifically, if the access raises an error, the access has failed; if no error is raised, the access is successful.
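The success-or-error test of step S102 might be sketched as follows. This is an assumption-laden illustration, not the patented implementation: `NestedWindow`, `tryReadUrl`, and `accessibleUrls` are hypothetical names, and reading `location.href` is used as the probe because browsers throw a security error when a script reads it on a cross-origin window.

```typescript
// Hypothetical structural type: only the part of a window this sketch reads.
interface NestedWindow {
  readonly location: { readonly href: string };
}

// Returns the URL of `target` when the read succeeds, or null when it throws.
// A thrown error is interpreted as "different domain", as in step S102.
function tryReadUrl(target: NestedWindow): string | null {
  try {
    return target.location.href;
  } catch {
    return null;
  }
}

// Visits the windows in order and keeps the URLs of those that were readable.
function accessibleUrls(targets: readonly NestedWindow[]): string[] {
  const urls: string[] = [];
  for (const target of targets) {
    const url = tryReadUrl(target);
    if (url !== null) urls.push(url);
  }
  return urls;
}
```

Only the windows that survive this probe proceed to the type check of step S103.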
It should be noted that, in this embodiment, sequentially accessing the windows of the nested web pages in the nested web page set through the first nested web page means that the information of the nested web pages is obtained through the first nested web page.
In a specific implementation, in step S103, if the access is successful, it needs to be determined whether the information presented by the nested web page is the same type of display information as the information to be displayed. That is, the statistical data of the nested web page can be obtained for evaluating the display effect only when the information presented by the nested web page is the display information of the same type as the information to be displayed, otherwise, the obtained statistical data is invalid. Therefore, in order to improve the detection efficiency and avoid acquiring invalid data, the type of information presented by the nested web page is determined in step S103.
For example, if the type of the information to be displayed is an advertisement and the information presented by the nested web page is also an advertisement, the statistical data of the nested web page can be acquired to evaluate the advertisement delivery effect; if the information presented by the nested web page is a notification, statistical data acquired from the nested web page would be invalid.
In a specific implementation, in step S104, if it is determined that the information presented by the nested web page is the presentation information, at least one statistical data of the nested web page is acquired. In other words, when the determination result in step S103 indicates that the information presented by the nested web page is the display information of the same type as the information to be displayed, at least one statistical data of the nested web page is obtained for evaluating the impression effect of the nested web page (i.e., the display position).
In this embodiment, the information to be displayed is presented in the first nested web page (i.e., the display position). To evaluate the display effect of the first nested web page, the display effect of web pages in the same domain as the first nested web page can be evaluated, and the delivery strategy adjusted accordingly, so that the display information presented in the first nested web page next time achieves a better display effect.
The embodiment of the invention avoids the iframe access approach and reduces the risk of access being refused, which helps ensure that statistical data is obtained and improves the detection efficiency of the display information.
Preferably, step S103 may include the steps of: acquiring a URL address of the nested webpage; and if the URL address of the nested webpage comprises a set field, determining that the information presented by the nested webpage is display information. Specifically, if the window of the nested webpage is successfully accessed through the first nested webpage, the information of the accessed nested webpage can be acquired; in order to determine whether the information presented by the nested web page is the same type of display information as the information to be displayed, the URL address of the nested web page may be acquired. And if the URL address of the nested webpage comprises a set field, determining that the information presented by the nested webpage is the display information, otherwise, determining that the information presented by the nested webpage is not the display information.
Those skilled in the art will appreciate that the set field may be configured according to different application environments and different nested web pages. For example, in the Baidu SSP, the URL address of the nested web page includes the set field BAIDU_SSP_define; in the Taobao SSP, the URL address of the nested web page includes alicdn.com or taobaocdn.com.
It can be understood that other criteria can be combined when judging whether the information presented by the nested web page is display information of the same type as the information to be displayed: acquiring the URL address and the HyperText Markup Language (HTML) content of the nested webpage; and if the URL address of the nested webpage includes a set field and the HTML content includes set information, determining that the information presented by the nested webpage is display information. For example, in the Google SSP, the URL address of the nested web page includes doubleclick.net or googlesyndication.com/safeframe, and the HTML code of the nested web page includes doubleclick.net/pagead/adview or "Google provided ad".
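The URL and HTML checks described above could be sketched as a simple substring test. The `SlotRule` shape and the function `isDisplayInfo` are hypothetical names introduced for illustration; the rule used in the usage note takes its fields from the Taobao example in the text, and treating a rule without HTML markers as URL-only is an assumption of this sketch.

```typescript
// Hypothetical rule shape: fields to look for in the URL, and optionally in the HTML.
interface SlotRule {
  urlFields: string[]; // set fields, any one of which must occur in the URL
  htmlMarkers?: string[]; // optional set information, any one of which must occur in the HTML
}

// True when the URL contains a set field and, if markers are configured,
// the HTML content contains at least one of them.
function isDisplayInfo(url: string, html: string, rule: SlotRule): boolean {
  const urlMatches = rule.urlFields.some((field) => url.indexOf(field) !== -1);
  if (!urlMatches) return false;
  const markers = rule.htmlMarkers;
  if (markers === undefined || markers.length === 0) return true; // URL-only rule
  return markers.some((marker) => html.indexOf(marker) !== -1);
}
```

For instance, with `{ urlFields: ["alicdn.com", "taobaocdn.com"] }` a nested page served from an alicdn.com address would be judged to present display information, while a page from an unrelated domain would not.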
Preferably, the window of the nested webpage is a window object of the nested webpage. In particular, when the nested web page is displayed in a browser, the browser creates a corresponding window object for the nested web page, and the information of the nested webpage is acquired through the attribute information of the window object. For example, the URL address of the nested web page may be obtained through the attribute information window.location.
It will be appreciated by those skilled in the art that the window object is provided with attribute information that characterizes it. For example, the attribute information innerHeight represents the height of the document display area of the window; the attribute information innerWidth represents the width of the document display area of the window; and the attribute information length indicates the number of frames in the window. Further attributes are not described here.
Preferably, the at least one statistical datum is selected from: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage. According to the embodiment of the invention, by adopting various types of statistical data, the display effect of the display information presented by the nested webpage can be evaluated from multiple dimensions, and the detection efficiency of the display information can be further improved.
Preferably, step S104 may include the step of: determining the size of the nested webpage according to the attribute information of the window object of the nested webpage. In particular, the nested web page size may be obtained through the attribute information window.document of the window object (i.e., the document object) of the nested web page. The nested web page size may include the width and the height of the nested web page. More specifically, the width and height of the nested web page may be obtained from at least one of the following attribute information: innerWidth, innerHeight, document.body.offsetWidth, document.body.offsetHeight.
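A minimal sketch of the size determination follows. The names `SizedWindow`, `PageSize`, and `nestedPageSize` are hypothetical, and the preference order (window viewport first, body box as fallback when the viewport dimension is zero) is an assumption of this sketch rather than something the disclosure specifies.

```typescript
// Hypothetical structural type covering the attributes named in the text.
interface SizedWindow {
  readonly innerWidth: number;
  readonly innerHeight: number;
  readonly document: {
    readonly body: { readonly offsetWidth: number; readonly offsetHeight: number } | null;
  };
}

interface PageSize {
  width: number;
  height: number;
}

// Assumed policy: use innerWidth/innerHeight when positive,
// otherwise fall back to the body's offsetWidth/offsetHeight.
function nestedPageSize(win: SizedWindow): PageSize {
  const body = win.document.body;
  const bodyWidth = body === null ? 0 : body.offsetWidth;
  const bodyHeight = body === null ? 0 : body.offsetHeight;
  return {
    width: win.innerWidth > 0 ? win.innerWidth : bodyWidth,
    height: win.innerHeight > 0 ? win.innerHeight : bodyHeight,
  };
}
```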
Preferably, step S104 may include the steps of: acquiring HTML content of the window object of the nested webpage; and rendering the HTML content to obtain the material data of the nested webpage. The material data of the nested web page refers to the material that makes up the information presented by the nested web page. Specifically, the HTML content can be acquired through the attribute information of the window object of the nested webpage, and rendering the HTML content yields the visually displayed result, which can be used as the material data of the nested webpage for evaluating the delivery effect. More specifically, a headless browser may be used to render HTML content in batches and then save screenshots. This embodiment is suitable for acquiring HTML5 material and iframe-nested material.
Preferably, step S104 may include the steps of: acquiring HTML content of the window object of the nested webpage; and acquiring the URL address of a picture or of a video file in the HTML content to be used as material data of the nested webpage. Specifically, the URL addresses of the pictures and Flash files in the HTML content can be parsed out, saved, and summarized. This embodiment is suitable for acquiring picture material and Flash material.
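The URL parsing described above might look like the following sketch. It uses a regular expression over the HTML string purely for illustration; the function name `extractMaterialUrls` and the choice of tags (`img`, `video`, `source`, `embed`) are assumptions, and a real page script would more likely query the nested document's elements directly, which handles malformed markup better.

```typescript
// Collects the distinct `src` URLs of picture, video, and embedded-file tags.
// Assumption: attribute values are quoted; unquoted values are not handled.
function extractMaterialUrls(html: string): string[] {
  const urls: string[] = [];
  const tagPattern = /<(img|video|source|embed)\b[^>]*?\bsrc\s*=\s*["']([^"']+)["']/gi;
  let match: RegExpExecArray | null;
  while ((match = tagPattern.exec(html)) !== null) {
    const url = match[2];
    // Keep each URL once so the saved summary has no duplicates.
    if (url !== undefined && urls.indexOf(url) === -1) urls.push(url);
  }
  return urls;
}
```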
Preferably, step S104 may include the steps of: acquiring HTML content of the window object of the nested webpage; acquiring a target URL address in the HTML content; accessing the target URL address and acquiring a target page domain name; and extracting the source data of the nested webpage from the domain name of the target page. Specifically, the href attributes of all the a elements can be analyzed through HTML content, the href attribute of the a element is usually used for specifying the URL address of a hyperlink target, a final target page domain name is obtained by accessing the URL address in the href attribute, and the source data of the nested web page can be obtained through the domain name. For example, the href attribute of the a element is < a href ═ http:// baidu.com >, where http:// baidu.com indicates a target URL address, and after accessing the URL address, advertiser information can be extracted. The embodiment can be applied to the acquisition of the source data of the nested webpage with the A element.
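The parsing half of this embodiment can be sketched as below. The functions `extractLinkTargets` and `hostnameOf` are hypothetical names; the sketch only extracts href values and reads the host from a URL string. Actually accessing the target URL to follow redirects to the final page, as the text describes, requires a network request and is deliberately left out.

```typescript
// Collects the href values of all <a> elements (quoted attribute values only).
function extractLinkTargets(html: string): string[] {
  const targets: string[] = [];
  const anchorPattern = /<a\b[^>]*?\bhref\s*=\s*["']([^"']+)["']/gi;
  let match: RegExpExecArray | null;
  while ((match = anchorPattern.exec(html)) !== null) {
    const href = match[1];
    if (href !== undefined) targets.push(href);
  }
  return targets;
}

// Reads the host part of an absolute URL, or returns null if there is none.
// Simplification: URLs carrying user info ("user@host") are not handled.
function hostnameOf(url: string): string | null {
  const match = /^[a-z][a-z0-9+.-]*:\/\/([^\/?#:]+)/i.exec(url);
  if (match === null) return null;
  const host = match[1];
  return host === undefined ? null : host.toLowerCase();
}
```

Applied to the example in the text, `extractLinkTargets('<a href="http://baidu.com">')` yields the target URL address, and `hostnameOf` on that address yields the domain from which source data would be derived.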
Preferably, step S104 may include the steps of: acquiring HTML content of the window object of the nested webpage; rendering the HTML content to obtain a rendering result; accessing the rendering result and acquiring a target page domain name; and extracting the source data of the nested webpage from the target page domain name. Specifically, after the HTML content is rendered and the rendering result is accessed, the browser jumps to the target page, and the source data of the nested webpage can be extracted from the target page domain name. More specifically, a headless browser may be used to render the HTML content in batches, followed by click-through access. This embodiment can be applied to the acquisition of the source data of a nested webpage that has no a element.
Preferably, step S104 may include the steps of: determining click data of the nested webpage based on a click event bound to the window object of the nested webpage; or, within a set time interval, if the file activation element is changed into an inner-layer iframe element, adding one to the number of clicks, and determining that the final number of clicks is the click data of the nested webpage. In particular, click data may be used to evaluate click effectiveness. The relevant click action can be obtained through a click event bound to the window object of the nested webpage; if the click event is triggered, a click is deemed to have occurred, so that click data is obtained.
However, since the rendering of the nested webpage generally uses a further layer of iframe nesting (DSP iframe), and that iframe and the first nested webpage are cross-domain, the click data cannot be obtained by using the click event. In this case, whether the nested webpage is clicked or not may be determined by a file activation element (document. When the nested webpage is clicked, the file activation element is changed to the inner-layer iframe element. More specifically, whether the file activation element has changed to the inner-layer iframe can be checked within a set time interval, for example, within 200 milliseconds; if it has, the nested webpage has been clicked, and the number of clicks is increased by one.
It should be noted that the time length of the set time interval may be adaptively modified and configured according to the actual application environment, for example, 100 ms or 300 ms, which is not limited in the embodiment of the present invention.
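The interval-based click counting described above can be illustrated by a small simulation. The following Python sketch assumes that the file activation element is sampled once per set time interval and that each sample is represented by a string identifier; the function name `count_iframe_clicks` and the identifiers used are hypothetical, and the "file activation element" is assumed here to correspond to the active (focused) element that a browser reports for a document.

```python
from typing import Iterable, Optional


def count_iframe_clicks(samples: Iterable[str], iframe_id: str) -> int:
    """Counts clicks from periodic samples of the active element.

    Each sample is the identifier of the element that was active when the
    set time interval elapsed. A click is counted whenever the active
    element changes from something else to the inner-layer iframe.
    """
    clicks = 0
    previous: Optional[str] = None
    for active in samples:
        # A change is only observable from the second sample onwards.
        if previous is not None and active == iframe_id and previous != iframe_id:
            clicks += 1
        previous = active
    return clicks
```

In this sketch the length of the set time interval (for example 100, 200, or 300 milliseconds) only affects how often samples are taken, not how they are counted.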
Preferably, step S104 may include the steps of: and if at least one part of the nested webpage is in a visible area, adding one to the visible number, and determining a final visible number as visible data of the nested webpage, wherein the visible area is within a screen range for presenting the nested webpage. Specifically, the visible data of the nested web page indicates whether the nested web page is in an area visible to a user, and the visible data of the nested web page is one of important indicators for evaluating the display effect of the nested web page.
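One simple way to express the condition "at least one part of the nested webpage is in a visible area" is a rectangle overlap test. The following Python sketch is illustrative only: the `Rect` type, the coordinate convention (origin at the top-left corner of the screen), and the function names are assumptions and are not the timing-based technique of the specific implementation.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    width: float
    height: float


def is_partly_visible(page: Rect, screen: Rect) -> bool:
    """True if the page rectangle overlaps the screen rectangle at all."""
    overlap_w = min(page.left + page.width, screen.left + screen.width) - max(page.left, screen.left)
    overlap_h = min(page.top + page.height, screen.top + screen.height) - max(page.top, screen.top)
    return overlap_w > 0 and overlap_h > 0


def count_visible(pages: Iterable[Rect], screen: Rect) -> int:
    """Adds one to the visible number for each page that is partly visible."""
    return sum(1 for page in pages if is_partly_visible(page, screen))
```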
In a specific implementation, the nested webpage may be delivered to a terminal device for display, and the terminal device may have a central processing unit (CPU). When the CPU processes an animation, it performs an animation rendering operation on the animation. The CPU processing resources consumed by the animation rendering operation differ depending on whether the animation is in a visible area or an invisible area, so the timing with which the animation plays also differs. Exploiting this characteristic, at least one animation is created in the area where the nested webpage is located, the time interval between two playing actions of each animation is detected, and the interval is compared with a set time, thereby determining whether the display information is in the visible area.
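The comparison against the set time might be sketched as follows in Python. The direction of the comparison is an assumption: the sketch supposes that an animation in an invisible area plays less often, so that an interval longer than the set time indicates invisibility. The function name `is_in_visible_area` and the millisecond timestamps are likewise hypothetical.

```python
from typing import Sequence


def is_in_visible_area(play_times_ms: Sequence[float], set_time_ms: float) -> bool:
    """Decides visibility from the timestamps of successive playing actions.

    Assumption: in an invisible area the animation plays less often, so any
    interval between two playing actions that exceeds the set time is taken
    to mean that the animation is not in the visible area.
    """
    if len(play_times_ms) < 2:
        raise ValueError("at least two playing actions are needed")
    intervals = [later - earlier for earlier, later in zip(play_times_ms, play_times_ms[1:])]
    return all(interval <= set_time_ms for interval in intervals)
```

When several animations are created in the area of the nested webpage, the same check could be applied to each animation separately.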
Fig. 2 is a schematic structural diagram of an apparatus for detecting display information according to an embodiment of the present invention.
The apparatus 20 for detecting display information shown in fig. 2 may include a nested web page set determining module 201, an accessing module 202, a determining module 203, and an obtaining module 204.
The nested webpage set determining module 201 is adapted to determine a nested webpage set in a top-level page where a first nested webpage presenting information to be displayed is located, where the nested webpage set includes windows of all nested webpages nested in the top-level page; the access module 202 is adapted to sequentially access windows of nested webpages in the nested webpage set through the first nested webpage; the judging module 203 is adapted to judge, if the access is successful, whether the information presented by the nested webpage is display information of the same type as the information to be displayed; the obtaining module 204 is adapted to obtain at least one statistical data of the nested webpage if the information is judged to be display information.
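The cooperation of the four modules can be illustrated with a simplified model. In the following Python sketch, `NestedWindow`, `AccessDenied`, and `detect_display_information` are hypothetical names, a refused access is modeled as an exception, and the only statistic gathered is the size; none of this is prescribed by the embodiment.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


class AccessDenied(Exception):
    """Raised when a nested window refuses access (for example, cross-domain)."""


@dataclass
class NestedWindow:
    url: str
    width: int
    height: int
    accessible: bool = True

    def read_url(self) -> str:
        if not self.accessible:
            raise AccessDenied(self.url)
        return self.url


def detect_display_information(
    windows: Iterable[NestedWindow],
    is_display_information: Callable[[str], bool],
) -> List[Dict[str, object]]:
    """Visits each window in turn and gathers statistics for display information."""
    results: List[Dict[str, object]] = []
    for window in windows:  # access module: sequential access
        try:
            url = window.read_url()
        except AccessDenied:
            continue  # access failed: skip this window
        if is_display_information(url):  # judging module
            # obtaining module: here only the size is recorded
            results.append({"url": url, "size": (window.width, window.height)})
    return results
```

The judging rule is passed in as a function so that the traversal itself stays independent of how display information is recognized.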
Compared with the prior art, the embodiment of the invention avoids the approach of accessing the iframe, which reduces the risk of access being refused, thereby ensuring that the statistical data can be obtained and improving the detection efficiency of the display information.
Preferably, the judging module 203 may include a URL address acquiring unit 2031 and a judging unit 2032. The URL address obtaining unit 2031 is adapted to obtain a URL address of the nested web page; the decision unit 2032 is adapted to determine that the information presented by the nested web page is presentation information if the URL address of the nested web page comprises a set field.
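The rule applied by the judging unit might be sketched as follows in Python. Treating "includes a set field" as a plain substring match is an assumption of this sketch, as are the function name `url_has_set_field` and the example field values.

```python
from typing import Iterable


def url_has_set_field(url: str, set_fields: Iterable[str]) -> bool:
    """True if the URL address contains at least one of the set fields."""
    return any(field in url for field in set_fields if field)
```

For instance, with the hypothetical set field `"dsp.example/ad"`, the address `http://dsp.example/ad?slot=1` would be judged to present display information, while `http://other.example/widget` would not.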
Preferably, the at least one statistical datum is selected from: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage. According to the embodiment of the invention, by adopting various types of statistical data, the display effect of the display information presented by the nested webpage can be evaluated from multiple dimensions, and the detection efficiency of the display information can be further improved.
Preferably, the window of the nested webpage is a window object of the nested webpage.
Preferably, the obtaining module 204 may include a size determining unit 2041, and the size determining unit 2041 is adapted to determine the size of the nested web page according to the attribute information of the window object of the nested web page.
Preferably, the obtaining module 204 may include a first HTML content obtaining unit 2042 and a first rendering unit 2043. The first HTML content obtaining unit 2042 is adapted to obtain HTML content of the window object of the nested webpage; the first rendering unit 2043 is adapted to render the HTML content to obtain material data of the nested webpage.
Preferably, the obtaining module 204 may further include an address obtaining unit 2044, and the address obtaining unit 2044 is adapted to obtain a URL address of a picture in the HTML content or a URL address of a video file as the material data of the nested web page.
Preferably, the obtaining module 204 may include a second HTML content obtaining unit 2045, a target URL address obtaining unit 2046, an accessing unit 2047, and a first extracting unit 2048. The second HTML content obtaining unit 2045 is adapted to obtain HTML content of the window object of the nested web page; the target URL address obtaining unit 2046 is adapted to obtain a target URL address in the HTML content; the accessing unit 2047 is adapted to access the target URL address and obtain a target page domain name; the first extracting unit 2048 is adapted to extract the source data of the nested web page from the target page domain name.
Preferably, the obtaining module 204 may include a second rendering unit 2049, a target domain name obtaining unit 2050, and a second extracting unit 2051. The second rendering unit 2049 is adapted to render the HTML content to obtain a rendering result; the target domain name obtaining unit 2050 is adapted to access the rendering result and obtain a target page domain name; the second extraction unit 2051 is adapted to extract the source data of the nested web page from the target page domain name.
Preferably, the obtaining module 204 may include a first click data determination unit 2052 and a second click data determination unit 2053. The first click data determining unit 2052 is adapted to determine click data of the nested web page based on a click event bound by a window object of the nested web page; the second click data determination unit 2053 is adapted to, within a set time interval, increment the number of clicks by one if the file activation element is changed to the inner iframe element, and determine that the final number of clicks is the click data of the nested web page.
Preferably, the obtaining module 204 may comprise a visible data determining unit 2054, the visible data determining unit 2054 being adapted to add one to the visible number if at least a part of the nested web page is in a visible area, and to determine the final visible number as the visible data of the nested web page, wherein the visible area is within a screen area where the nested web page is rendered.
For more details on the operation principle and the operation mode of the apparatus 20 for detecting display information, reference may be made to the related description of fig. 1, and details are not repeated here.
The embodiment of the invention also discloses a readable storage medium, wherein computer instructions are stored on the readable storage medium, and when the computer instructions are executed, the steps of the detection method for the display information shown in fig. 1 can be executed. The storage medium may include ROM, RAM, magnetic or optical disks, etc.
The embodiment of the invention also discloses user equipment, which can comprise a memory and a processor, wherein the memory stores computer instructions capable of running on the processor. The processor, when executing the computer instructions, may perform the steps of the method for detecting display information shown in fig. 1. The user equipment includes, but is not limited to, a mobile phone, a computer, a tablet computer, and other terminal equipment.
Although the present invention is disclosed above, the present invention is not limited thereto. Various changes and modifications may be effected therein by one skilled in the art without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (18)

1. A method for detecting display information, comprising:
determining a nested webpage set in a top-level page where a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises windows of all nested webpages nested in the top-level page;
sequentially accessing windows of the nested web pages in the nested web page set through the first nested web page;
if the access is successful, judging whether the information presented by the nested webpage is the display information of the same type as the information to be displayed; the judging whether the information presented by the nested webpage is the display information of the same type as the information to be displayed comprises the following steps: acquiring a URL address of the nested webpage; if the URL address of the nested webpage comprises a set field, determining that the information presented by the nested webpage is display information;
and if the information is judged to be display information, acquiring at least one statistical data of the nested webpage.
2. The method for detecting display information according to claim 1, wherein said at least one statistical data is selected from the group consisting of: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage.
3. The method for detecting display information according to claim 2, wherein the window of the nested webpage is a window object of the nested webpage.
4. The method for detecting the display information according to claim 3, wherein the obtaining at least one statistic of the nested web pages comprises:
and determining the size of the nested webpage according to the attribute information of the window object of the nested webpage.
5. The method for detecting the display information according to claim 3, wherein the obtaining at least one statistic of the nested web pages comprises:
acquiring HTML content of the window object of the nested webpage;
acquiring the material data by one or more of the following modes: rendering the HTML content to obtain material data of the nested webpage; and acquiring the URL address of the picture in the HTML content or the URL address of the video file to be used as material data of the nested webpage.
6. The method for detecting the display information according to claim 3, wherein the obtaining at least one statistic of the nested web pages comprises:
acquiring HTML content of the window object of the nested webpage;
acquiring a target URL address in the HTML content, accessing the target URL address, acquiring a target page domain name, and extracting source data of the nested webpage from the target page domain name;
or rendering the HTML content to obtain a rendering result, accessing the rendering result, acquiring a domain name of a target page, and extracting the source data of the nested webpage from the domain name of the target page.
7. The method for detecting the display information according to claim 3, wherein the obtaining at least one statistic of the nested web pages comprises:
determining click data of the nested webpage based on a click event bound by a window object of the nested webpage;
or, within a set time interval, if the file activation element is changed into an inner-layer iframe element, adding one to the number of clicks, and determining that the final number of clicks is click data of the nested webpage.
8. The method for detecting the display information according to claim 3, wherein the obtaining at least one statistic of the nested web pages comprises:
and if at least one part of the nested webpage is in a visible area, adding one to the visible number, and determining a final visible number as visible data of the nested webpage, wherein the visible area is within a screen range for presenting the nested webpage.
9. A detection device for displaying information, comprising:
the nested webpage set determining module is suitable for determining a nested webpage set in a top-level page where a first nested webpage presenting information to be displayed is located, wherein the nested webpage set comprises windows of all nested webpages nested in the top-level page;
the access module is suitable for sequentially accessing the windows of the nested webpages in the nested webpage set through the first nested webpage;
the judging module is suitable for judging whether the information presented by the nested webpage is the display information of the same type as the information to be displayed if the access is successful; the judging module comprises: the URL address acquisition unit is suitable for acquiring the URL address of the nested webpage; the judging unit is suitable for determining the information presented by the nested webpage to be display information if the URL address of the nested webpage comprises a set field;
and the acquisition module is suitable for acquiring at least one statistical data of the nested webpage if the information is judged to be display information.
10. The apparatus for detecting display information according to claim 9, wherein said at least one statistical data is selected from the group consisting of: the size of the nested webpage, the material data of the nested webpage, the source data of the nested webpage, the click data of the nested webpage, and the visible data of the nested webpage.
11. The apparatus for detecting display information according to claim 10, wherein the window of the nested webpage is a window object of the nested webpage.
12. The apparatus for detecting display information according to claim 11, wherein the obtaining module comprises:
and the size determining unit is suitable for determining the size of the nested webpage according to the attribute information of the window object of the nested webpage.
13. The apparatus for detecting display information according to claim 11, wherein the obtaining module comprises:
the first HTML content acquisition unit is suitable for acquiring HTML content of the window object of the nested webpage;
the first rendering unit is suitable for rendering the HTML content to obtain material data of the nested webpage;
and the address acquisition unit is suitable for acquiring the URL address of the picture in the HTML content or the URL address of the video file to serve as the material data of the nested webpage.
14. The apparatus for detecting display information according to claim 11, wherein the obtaining module comprises:
the second HTML content acquisition unit is suitable for acquiring the HTML content of the window object of the nested webpage;
the target URL address acquisition unit is suitable for acquiring a target URL address in the HTML content;
the access unit is suitable for accessing the target URL address and acquiring a target page domain name;
the first extraction unit is suitable for extracting the source data of the nested webpage from the target page domain name;
the second rendering unit is suitable for rendering the HTML content to obtain a rendering result;
the target domain name acquisition unit is suitable for accessing the rendering result and acquiring a target page domain name;
and the second extraction unit is suitable for extracting the source data of the nested webpage from the target page domain name.
15. The apparatus for detecting display information according to claim 11, wherein the obtaining module comprises:
the first click data determining unit is suitable for determining click data of the nested webpage based on a click event bound by a Window object of the nested webpage;
and the second click data determining unit is suitable for adding one to the click number if the file activation element is changed into an inner-layer iframe element within a set time interval, and determining that the final click number is the click data of the nested webpage.
16. The apparatus for detecting display information according to claim 11, wherein the obtaining module comprises:
a visibility data determination unit adapted to add one to a visibility number if at least a portion of the nested web page is in a visible area, and determine a final visibility number as visibility data of the nested web page, wherein the visible area is within a screen range in which the nested web page is rendered.
17. A computer-readable storage medium having stored thereon computer instructions, wherein the computer instructions are executable to perform the steps of the method for detecting display information according to any one of claims 1 to 8.
18. A terminal comprising a memory and a processor, the memory having stored thereon computer instructions executable on the processor, wherein the processor executes the computer instructions to perform the steps of the method for detecting display information according to any one of claims 1 to 8.
CN201710387660.3A 2017-05-26 2017-05-26 Method and device for detecting display information, computer readable storage medium and terminal Active CN107316203B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710387660.3A CN107316203B (en) 2017-05-26 2017-05-26 Method and device for detecting display information, computer readable storage medium and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710387660.3A CN107316203B (en) 2017-05-26 2017-05-26 Method and device for detecting display information, computer readable storage medium and terminal

Publications (2)

Publication Number Publication Date
CN107316203A CN107316203A (en) 2017-11-03
CN107316203B true CN107316203B (en) 2020-10-20

Family

ID=60181469

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710387660.3A Active CN107316203B (en) 2017-05-26 2017-05-26 Method and device for detecting display information, computer readable storage medium and terminal

Country Status (1)

Country Link
CN (1) CN107316203B (en)

Also Published As

Publication number Publication date
CN107316203A (en) 2017-11-03

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant