US20200133986A1 - Network graph parser - Google Patents
Network graph parser Download PDFInfo
- Publication number
- US20200133986A1 US20200133986A1 US16/654,048 US201916654048A US2020133986A1 US 20200133986 A1 US20200133986 A1 US 20200133986A1 US 201916654048 A US201916654048 A US 201916654048A US 2020133986 A1 US2020133986 A1 US 2020133986A1
- Authority
- US
- United States
- Prior art keywords
- user
- network site
- data
- objects
- additional
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9577—Optimising the visualization of content, e.g. distillation of HTML documents
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/221—Parsing markup language streams
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/20—Drawing from basic elements, e.g. lines or circles
- G06T11/206—Drawing of charts or graphs
Definitions
- Embodiments of the present disclosure relate generally to pattern detection and, more particularly, but not by way of limitation, to manipulating data via a network graph parser to expose previously undetected patterns.
- a code repository website allows users to publish software code projects to the website so that other users can access, view, edit, or otherwise use the published software code. Identifying how different projects (e.g., software coding projects) are related to one another is currently impractical because the project data on the code repository websites is largely unstructured.
- FIG. 1 is a block diagram illustrating a networked system in which a network graph parser can be implemented, according to some example embodiments.
- FIG. 2 is a block diagram showing functional components provided within the network graph parser, according to some example embodiments.
- FIG. 3 shows a flow diagram for generating node data for export, according to some example embodiments.
- FIG. 4 shows a flow diagram for parsing node data from multiple selected entities, according to some example embodiments.
- FIGS. 5A and 5B show example visualizations of node data, according to some example embodiments.
- FIG. 6 shows a flow diagram for selecting entities, according to some example embodiments.
- FIG. 7 shows example visualizations from node data of different selected entities, according to some example embodiments.
- FIG. 8 shows an example flow diagram for processing entity data, according to some example embodiments.
- FIG. 9 shows example visualizations, according to some example embodiments.
- FIG. 10 shows an example flow diagram for receiving filter instructions, according to some example embodiments.
- FIG. 11 shows example visualization and user interface elements for filtering node data, according to some example embodiments.
- FIG. 12 shows a flow diagram for filtering node data, according to some example embodiments.
- FIGS. 13A and 13B show example visualization and user interface elements for filtering node data, according to some example embodiments.
- FIG. 14 shows an example flow diagram for analysis and export of node data, according to some example embodiments.
- FIGS. 15A and 15B show example user interfaces for processing network graphs using a network graph parser, according to some example embodiments.
- FIG. 16 illustrates a diagrammatic representation of a machine in the form of a computer system within which a set of instructions may be executed for causing the machine to perform any one or more of the methodologies discussed herein, according to an example embodiment.
- a network graph parser is implemented to parse data from websites (e.g., code repository websites) into human understandable patterns.
- the code repository websites are websites or network-based publication platforms (e.g., Internet forums) that allow users to publish data viewable by other users of the website or platform.
- a software developer can create a project page on a code repository site and publish his/her code for the project to the project page. Other uses may navigate to the project page, view, download, or modify the code for the projects.
- the network graph parser is installed as a browser plugin of an Internet browser application.
- a data analyst may navigate to a given page on a repository website projects, such as a page created or associated with the project or a contributor.
- the analyst may then trigger the parse operation by selecting a browser plugin button.
- the parse operation goes through the page and saves data on the page and on related pages.
- the network graph parser may identify links to projects listed on the repository website.
- the network graph parser may navigate to each of the projects.
- the saved data may be used to generate a visual representation (e.g., a network graph) of the collected data.
- the data analyst may manipulate the visual representation to explore patterns. Further, the data analyst may hone down onto specific subsets by issuing filter instructions. For example, the data analyst may filter out any connections that don't have at least two connections to other nodes. Contributors may have connections to one another by working together on the same coding project, as an example.
- the various filter instructions expose previously invisible patterns in the network graph.
- the honed down data containing the pattern can then be exported over a network to a data analysis server for further analysis, according to some example embodiments.
- FIG. 1 is a block diagram depicting a networked system 100 comprising an electronic device 110 , and one or more components external to the electronic device 110 .
- these external components include a database system 10 , network 120 , and a plurality of repository servers 130 - 1 to 130 - n , that host repository websites.
- the electronic device 110 is a client device, such as a personal computer, a tablet computer, a personal digital assistant (PDA), a mobile phone, a smart-phone, or any other web-enabled computing device with a processor and a memory.
- the electronic device 110 has installed thereon a web browser application (e.g., web browser 1632 in FIG. 16 ), on which is installed a network graph parser.
- a web browser application e.g., web browser 1632 in FIG. 16
- the network graph parser is integrated into the web browser application as a plugin or browser extension.
- Each of the plurality of repository servers 130 - 1 to 130 - n comprises hardware and software.
- Each of the plurality of repository servers 130 - 1 to 130 - n is able to communicate with the electronic device 110 via the network 120 .
- some of the plurality of repository servers 130 - 1 to 130 - n can be a part of a cloud, which can include, for example, one or more networked servers.
- networked servers may be termed a data center or a server farm.
- Such data centers currently are maintained by various communication network service providers.
- Network 120 can be, for example, the Internet, an intranet, a local area network, a wide area network, a campus area network, a metropolitan area network, an extranet, a private extranet, or a combination of any of these or other appropriate networks.
- the electronic device 110 is separate from the external database system 10 but connected thereto by a link.
- the database system 10 may be disposed in an air-gapped, high-side environment, where the database system 10 is physically isolated from the network 120 and the electronic device 110 , such that a higher level of classified information can be maintained in the database system 10 .
- the electronic device 110 may be implemented by one or more specially configured computing devices.
- the electronic device 110 may be hard-wired to perform the operations, techniques, etc. described herein.
- the electronic device 110 can include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the operations, techniques, etc. described herein.
- ASICs application-specific integrated circuits
- FPGAs field programmable gate arrays
- the electronic device 110 can include one or more general purpose hardware processors (including processor circuitry) programmed to perform such features of the present disclosure pursuant to program instructions in firmware, memory, other storage, or a combination.
- the electronic device 110 can also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the methods and other features.
- the electronic device 110 can be generally controlled and coordinated by operating system software, such as iOS, Android, Blackberry, Chrome OS, Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, VxWorks, or a proprietary operating system.
- operating system software such as iOS, Android, Blackberry, Chrome OS, Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, VxWorks, or a proprietary operating system.
- the operating system controls and schedules computer processes for execution, perform memory management, provide file system, networking, I/O services, and provides a user interface functionality, such as a graphical user interface (“GUI”), among other things.
- GUI graphical user interface
- FIG. 2 shows internal functional components of the network graph parser 234 , according to some example embodiments.
- the network graph parser is implemented as a plug-in or browser extension for a web browser.
- the network graph parser 234 comprises an interface engine 210 , a parse engine 220 , a node data engine 230 , a visualization engine 240 , and an export engine 250 .
- the interface engine 210 is configured to interface with the browser 1632 as a plugin. Further, the interface engine 210 is configured to interface with entities outside the electronic device 110 , such as the repository server 130 .
- the parse engine 220 is configured to parse node data from a code project webpage. Node data is data of an object associated with a given project.
- the node data is user data (e.g., data of software developers) associated with a given project, where each user may be involved in several different software projects.
- the node data includes code portions (e.g., classes, functions) shared between different projects. For example, two different projects may share an Optical Character Recognition (OCR) class, and the OCR class code can be used as a node associated with each software project's network graph, as discussed in further detail below.
- OCR Optical Character Recognition
- metadata describing an object e.g., code portion
- the parse engine 220 is configured to send node data requests for the users of the repository website.
- the repository website can receive the requests and issue responses including the requested node data.
- the node data engine 230 is configured to process the node data received via the code projects webpage (e.g., via spidering) or received from the repository website.
- the node data engine 230 can receive filter instructions from a user and cull (e.g., refine) the node data by removing data of users that do not meet the requirements of the filter instruction, as explained in further detail below.
- the visualization engine 240 is configured to use the initial node data or the refined node data and generate different types of visualizations for display on the display screen of the electronic device 110 .
- the visualizations may include a network graph, a histogram, graphs such bar charts or data plots, and other visualizations.
- the export engine 250 is configured to export the refined dataset to an analysis server for further analysis.
- FIG. 3 is a flowchart representing an exemplary method 300 performed by an electronic device for collecting and analyzing data from repository website systems, according to some example embodiments. While the flowchart discloses the following operations in a particular order, it will be appreciated that at least some of the operations can be moved, modified, or deleted where appropriate, consistent with the teachings of the present disclosure.
- a user can utilize an electronic device (e.g., electronic device 110 ) that comprises a web browser 1632 , for example, GoogleTM ChromeTM, MozillaTM FirefoxTM, MicrosoftTM Internet ExplorerTM, etc.
- the web browser 1632 is usable to access web content (e.g. provided by the repository servers 130 - 1 to 130 - n ) via a network (e.g., network 120 ), such as the Internet or an intranet.
- a network e.g., network 120
- a network graph parser 234 is installed as a plugin in the web browser 1632 of the electronic device.
- the network graph parser 234 may be termed a browser extension, according to some example embodiments.
- the network graph parser 234 extends the functionality of the web browser 1632 , as is described in detail below.
- the network graph parser 234 may be authored using a web technologies such as HTML, JavaScript, or CSS (Cascading Style Sheets).
- the interface engine 210 accesses communication and node data related content from a repository server 130 (e.g., one of the repository servers 130 - 1 to 130 - n ) using the web browser 1632 .
- a repository server 130 e.g., one of the repository servers 130 - 1 to 130 - n
- the interface engine 210 accesses communication and node data related content from a repository server 130 (e.g., one of the repository servers 130 - 1 to 130 - n ) using the web browser 1632 .
- a repository server 130 e.g., one of the repository servers 130 - 1 to 130 - n .
- the repository server 130 is accessed through the browser 1632 causing sending of a request (e.g. an HTTP request) to the repository server 130 (in particular to a webserver included as part thereof.
- a request e.g. an HTTP request
- the information received from the repository webservice comprises a projects webpage showing different coding or software projects associated with a user of the repository webservice.
- node data is a user profile and relates to an entity that is included as part of the repository network service provided by the repository server 130 .
- an entity typically relates to an individual programmer, but may relate to an organization, for instance a business or other group.
- a software developer profile includes at least a unique identifier (the identifier uniquely identifies the entity on the repository service), a name for the entity (typically a string of text, perhaps alphanumeric characters) and a plurality of links between the entity and other entities that form part of the repository webservice.
- the links may be bidirectional in nature. For example, two software developers may collaborate on the same code project. Because the two developers work on the same coding project, they may be bidirecitonally linked under the assumption that each knows of the other as a fellow coder (e.g., team member, colleague) on the project.
- the links may alternatively be unidirectional, e.g., the first software developer receives updates published by the second software developer but the second software developer does not receive updates published by the first software developer.
- the data stored on the repository website indicates the type of communication activity between the users.
- the node data may include an indication that a first user commented on a pending code update on a project page on the repository website.
- the links may indicate the another entity by including a identifier that is unique to the other entity.
- repository webservices provide an identifier that is an alphanumeric string.
- the string may be known to the entity and other users (e.g. it may be their username) or it may be a system-generated identifier which does not need to be known to the user (e.g. a string such as “exampleidentifier$43*”).
- the profile may also include a uniform resource locator (URL) that is unique to the entity.
- URL uniform resource locator
- a user profile of the repository website may also have other information associated with pre-defined fields, for instance ‘high school attended’, ‘place of residence’, ‘place of work’, ‘undergraduate study subject’, etc.
- the profile may also have other content such as photographs, videos, comments or profile text, etc.
- Profile content may be associated with particular dates (and as such may appear in a timeline on a user's profile page) or may not be dependent on a date (and so may not generally appear on a timeline).
- profile content may be associated with geotagged data.
- user profiles are imported in response to user input.
- a first profile is imported by the network graph parser 234 in response to the user selecting a first entity in the repository service. This may occur for instance by the user selecting a hyperlink in a code projects webpage provided by the repository server 130 .
- the code projects webpage may be provided by the repository server 130 in response to the user entering text, e.g. the whole or part of a name on an entity, into a search field of a webpage provided by the by the repository server 130 .
- the code projects webpage displays coding projects of the first entity, where each of the coding projects has its own projects webpage, which can be spidered as described above.
- the network graph parser 234 upon selection of the first entity, sends a request to the repository server 130 identifying the first entity.
- the repository server 130 provides the code projects webpage of the first entity, which is then parsed by the network graph parser 234 .
- the network graph parser 234 extracts node data from the code projects webpage by accessing the source code (e.g., markup language) of the code projects webpage and then extracting the node data listed in the source code.
- the received node data is stored in volatile memory (e.g. RAM) allocated to the browser 1632 , but is not stored in permanent memory, e.g. ROM.
- the user selects a second entity. This may occur for instance by the user selecting a hyperlink relating to the second entity in a second code projects webpage provided by the repository server 130 .
- the network graph parser 234 sends a request to the repository server 130 identifying the second entity.
- the repository server 130 provides a second code projects webpage that lists all the coding projects for the second entity on the repository server.
- the parse engine 220 parses the source code of the second code projects webpage to extract additional node data of the users associated with the second entity (e.g., users that have worked on the same coding project as the second entity).
- the received profile is stored in volatile memory (e.g. RAM) allocated to the browser 1632 , but is not stored in permanent memory, e.g. ROM.
- the network graph parser 234 is configured to automatically import node data for entities to which the first and second entities are linked, e.g., for which links from the first and second entities exist.
- the parse engine 220 is configured to import such node data by sending requests to the repository server 130 identifying the further entities and navigating to the code projects webpages of the entities.
- the node data engine 230 transforms the contributor data parsed from the projects webpage from a first format into a second format.
- the underlying source code of the projects webpage may be a markup language, such as HTML.
- the node data parsed from the projects webpage may also be in the markup language format.
- the node data engine 230 is configured to transform the node data from the markup language format to an attribute-value format, such as JSON (JavaScript Object Notation).
- JSON JavaScript Object Notation
- the visualization engine 240 creates a visual representation from the parsed node data (e.g., node data in the attribute-value format).
- the visual representation is generated as a network graph in an additional tab of the browser 1632 .
- the network graph includes a collection of nodes connected by edges. Each node corresponds to a user from one of the projects listed on a code projects webpage, and connections between individual nodes may be visually represented as lines, for example straight lines.
- two nodes are connected on the repository server if each of the nodes are associated with the same coding project.
- the graph may lend itself to be further processed, analyzed and manipulated by an analyst or other user. The details regarding operation 340 are explained in more detail later.
- the export engine 250 exports the graph formed from the operation 340 to the database system 10 .
- the database system 10 is connected to the electronic device 110 (as shown in FIG. 1 ), according to some example embodiments. Further, according to some example embodiments, the database system 10 is implemented as a backend system disposed in an air-gapped, high-side environment, separated from the network 120 and the electronic device 110 . The database system 10 may be dedicated to receive data for further analysis. Therefore, network graph parser 234 of operation 340 can be used to collect and pre-process the data such that it is compatible with the database system 10 .
- FIG. 4 is a flowchart representing a method 400 performed by the network graph parser 234 for importing node data of the first and second entities from the repository server 130 and for creating a graph, according to some example embodiments.
- the method 400 is an example of sub-operations performed to complete operations 330 and 340 of FIG. 3 discussed above.
- FIGS. 5A and 5B show an example of the graph created by the exemplary method 400 .
- the interface engine 210 receives selection of first entity through a user input, for instance through a bookmark, favorite, or through selection of an option provided in a list of search results.
- the interface engine 210 requests the profile of the first entity. This involves the network graph parser 234 accessing the repository server 130 via the network 120 and in particular accessing the first entity (e.g., projects webpage of the first entity) in the repository server 130 .
- the network graph parser 234 may send an HTTP request to the repository server 130 , the request including the unique identifier of the first entity.
- the network graph parser 234 receives the profile or projects webpage of first entity.
- the profile is for example received as an HTTP response.
- the profile includes a name for the first entity and details of connections of the first entity.
- the connections define links to other entities, and include unique identifiers for the other entities.
- one or more webpages of the first entity may be exposed through automatic scrolling of the one or more webpages. For example, a top portion of a first entity's webpage may be initially retrieved, and further portions below the top portions may be auto populated by script as those portions are scrolled to. In some example embodiments, the auto populated scrolled-to portions are received at operation 430 .
- the visualization engine 240 displays a graph relating to first entity.
- the network graph parser 234 may display a group or ‘cloud’ of nodes, each node relating to an entity.
- the node relating to the first entity is displayed with different visible characteristics to nodes for other entities. For instance, it may be a different color or size. All the nodes for entities linked to the first entity are shown as being connected by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the first entity. In some embodiments, connections between nodes other than connections between the first entity and other nodes may not be displayed in the graph.
- a further entity (e.g., a second entity of operation 460 ) need not be specified for links between nodes to be created.
- an entity associated with a given code repository page may be identified (e.g., at operation 410 ).
- the code repository page may list other coding projects with which the entity is involved (e.g., develops code).
- Each coding project may list other further entities associated with the given project.
- the additional projects and additional entities can all automatically be included in a single network graph, according to some example embodiments.
- FIG. 5A shows an example of parsing profiles of users.
- a node 501 is displayed as an empty circle.
- the node 501 corresponds to the first entity.
- Each of ten nodes displayed as a group around the node 501 represents a different entity to which the first entity is linked or connected, as identified from the profile of the first entity.
- Each such node in the group (other than the first node 501 ) is connected to the node 501 with a respective straight line, which represents a link between the corresponding two entities.
- a group of nodes connected to the first node 501 once by a single link in the created graph may be represented as being enclosed within a dotted circle, as shown in the right panel of FIG. 5A .
- the network graph parser 234 begins requesting profiles of entities linked to by the first entity.
- the profiles are parsed from a code projects webpage of the first entity.
- users associated with the first entity may be displayed in a projects webpage.
- the underlying markup language of the code projects webpage can be parsed to extract the username, user profile URL, and other information for each of the users associated with the first entity.
- profiles of the entities are stored as they are received.
- the profiles are stored in non-volatile memory that is allocated to the browser 1632 .
- Profiles may continue to be requested and saved as a background task whilst the network graph parser 234 performs other tasks.
- the interface engine 210 receives selection of a second entity. This may occur as described above in relation to receiving selection of the first entity.
- the network graph parser 234 receives the profile of the second entity, after requesting the profile of the second entity.
- the profile is for example received as part of an http response.
- the profile includes at least a name for the second entity and details of connections of the second entity.
- the connections define links to other entities, and include unique identifiers for the other entities.
- the visualization engine 240 displays a graph relating to the first and second entities.
- the network graph parser 234 may display three groups (or clouds) of nodes 510 , 520 , 530 , each node relating to an entity.
- the nodes 501 and 502 relating to the first and second entities are displayed with different visible characteristics to nodes for other entities. For instance, they may be a different color or size.
- Each node of the first group 530 of nodes corresponds to an entity linked to in the profiles of both the first and second entities.
- Each node of the second group 510 of nodes corresponds to an entity linked to by the profile of the first entity but not by the profile of the second entity.
- Each node of the third group 520 of nodes corresponds to an entity linked to by the profile of the second entity but not by the profile of the first entity. All the nodes for entities connected to the first entity are shown as being connected to the node 501 by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the first entity. All the nodes for entities connected to the second entity are shown as being connected to the node 502 by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the second entity. Connections between nodes other than connections between one of the node 501 and the node 502 and other nodes are not displayed in the graph.
- the visualization engine 240 creates a new graph after removing the graph as shown in FIG. 5A .
- the network graph parser 234 may augment and rearrange the graph created at operation 440 .
- An example of the created graph at operation 490 is shown in FIG. 5B .
- Nodes 501 and 502 represent the first entity and the second entity, respectively.
- Third group 510 comprising seven nodes (represented as filled circles), correspond to the entities linked to by both the first entity and the second entity.
- the second group 510 comprise only three nodes because seven nodes previously in the sole group now belong to the first group 530 .
- the profile of the second entity includes links to fourteen entities.
- Seven entities linked to in the profile for the second entity belong to the first group 530 , and the other seven entities belong to the third group 520 .
- the nodes of the group from FIG. 5A is now split and rearranged into two groups, namely second group 510 and first group 530 . Therefore, in displaying nodes corresponding to the entities linked to by the first entity 501 and second entity 502 , accessed by the network graph parser 234 , they are grouped into three groups: the second group 510 linked only to the first entity 501 , the third group 520 linked only to the second entity 502 , and the first group 530 linked both to the first entity 501 and second entity 502 .
- the interface engine 210 begins requesting profiles of entities linked to by the second entity.
- the operations of 450 and 495 e.g., requests for profiles of related entities
- the operations of 450 and 495 are performed automatically by the network graph parser.
- the network graph parser 234 automatically retrieves and sends the profile information of the specified first entity but also retrieves and sends profile information of entities related to the first entity automatically (e.g., without the user manually initiating the request for profile information of the related entities).
- profiles of the entities are stored as they are received.
- the profiles are stored in volatile memory, e.g. the RAM 1606 , that is allocated to the browser 1632 .
- Profiles may continue to be requested and saved as a background task whilst the network graph parser 234 performs other tasks.
- the display operations of method 400 e.g., operations 440 and 490
- the display operations of method 400 are bypassed until the some or all of the information collection operations (e.g., operations 410 , 420 , 430 , 450 , 455 , 460 , 470 , 480 , 495 , and 497 ) are completed.
- FIG. 6 is a flowchart showing a method 600 performed by the network graph parser 234 for importing further seed entities from the electronic repository website system hosted from the repository server 130 and for creating a graph, according to some example embodiments. This may correspond to at least part of operations 330 and 340 of FIG. 3 , as discussed above.
- FIG. 7 shows an example of the graph created by the exemplary method 600 .
- the plugin Prior to operation 610 , the plugin is processing two or more entities, for instance as is shown in FIG. 5B and is present at the end of the flowchart of FIG. 4 .
- the interface engine 210 receives selection of further entity through a user input, for instance through a bookmark, favorite or through selection of an option provided in a list of search results.
- the interface engine 210 requests or parses the profile of the further entity. This is similar to operation 420 . This involves the interface engine 210 accessing the repository server 130 via the network 120 and accessing first entity in one of the electronic repository webservice system in the repository server 130 .
- the interface engine 210 may send an HTTP request to the repository server 130 , the request including the unique identifier of the first entity.
- the network connection parser can parse a code projects webpage to extract profile information of user connected to the second entity.
- the interface engine 210 receives the profile of the further entity. This is similar to operation 430 .
- the profile is for example received as part of an HTTP response.
- the profile includes at least a name for the first entity and details of connections of the further entity.
- the connections define links to other entities, and include unique identifiers for the other entities.
- the visualization engine 240 displays a graph relating to all the selected entities.
- the visualization engine 240 may cause display of multiple groups (or clouds) of nodes, each node relating to an entity.
- Each group relates to a collection of nodes that have the same connections to the selected entities. Where there are three selected entities, there are seven groups.
- Each node of the first group of nodes corresponds to an entity linked to in the profiles of both the first and second entities, but not the third entity.
- Each node of the second group of nodes corresponds to an entity linked to by the profile of the first entity but not by the profile of the second or third entities.
- Each node of the third group of nodes corresponds to an entity linked to by the profile of the second entity but not by the profile of the first or third entities.
- Each node of the fourth group of nodes corresponds to an entity linked to in the profiles of both the first and third entities, but not the second entity.
- Each node of the fifth group of nodes corresponds to an entity linked to by the profile of the second and third entities but not by the profile of the first entity.
- Each node of the sixth group of nodes corresponds to an entity linked to by the profiles of the second and third entities but not by the profile of the first entity.
- Each node of the seventh group corresponds to an entity linked to by each of the first, second and third entities.
- One or more of the groups may not exist, if there are no nodes that meet the criteria for that group (these groups might be said to have zero nodes).
- the nodes relating to the selected entities are displayed with different visible characteristics to nodes for other entities. For instance, they may be a different color or size. All the nodes for entities connected to the one of the selected entities are shown as being connected by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the selected entity. Where a non-selected node has links to multiple selected entities, there is a line for each such connection. In some embodiments, connections between two nodes that relate to non-selected entities may be hidden or not displayed in the graph. In some example embodiments, the graph may simplify or de-clutter the graph by hiding links between nodes and/or nodes based upon whether a give node or one of its neighbors is selected. For example, if the user selects a given node, the visualization engine may only display notes that are directly linked to the given node.
- the interface engine 210 begins requesting profiles of entities linked to by the further entity.
- the user manually requests the profiles of entities linked to by the further entity.
- profiles of the entities are stored as they are received.
- the profiles may be stored in volatile memory that is allocated to the browser 1632 .
- Profiles may continue to be requested and saved as a background task whilst the network graph parser 234 performs other tasks.
- the operation may check whether another entity has been selected by the user. If so, the operation returns to operation 620 , where the profile for the further selected entity is requested. Further, in some example embodiments, the selections of additional entities are processed in batches.
- the user can select a plurality of entities, then request their information as a batch process (e.g., as part of a single request).
- the display operation of method 600 may be bypassed or delayed until other operations are complete.
- operation 650 an information collection related operation
- operation 640 a display related operation
- the information collected at operation 650 may be stored to memory and operation 640 is bypassed and a display is never generated).
- FIG. 7 shows a screenshot of an example of graph generated by the visualization engine 240 , according to some example embodiments.
- six nodes 701 , 702 , 703 , 704 , 705 and 706 correspond to six entities that have been selected by a user.
- the nodes 701 , 702 , 703 , 704 , 705 and 706 corresponding to user-selected entities are displayed as empty circles. Entities linked to by the selected entities are represented as nodes, and are displayed as filled circles.
- a line connects each node pair representing linked entities if at least one of the linked entities is a user-selected entity. I
- nodes are grouped together depending on which one combination of the six user-selected entities they are linked to.
- the group of nodes 711 correspond to non user-selected entities linked to two of the user-selected entities 701 and 705 .
- Node 712 corresponds to the only non user-selected entity linked to by the user-selected entities 702 , 703 and 706 , hence the node 712 forms a group on its own.
- the group of nodes 713 includes nodes relating to entities linked to user-selected entities 703 and 706 .
- Each group is displayed separated from other groups, e.g. with a gap between the groups which is visibly significantly larger than the gaps between adjacent nodes forming part of a single group.
- each node is generated from node data from the same code repository website.
- some of the nodes are generated from node data from a first code repository website and some of the nodes are generated from node data from a second code repository website different from the first. In this way, an analyst user can determine relationships between nodes (e.g., software project data) across different code repository websites.
- FIG. 8 is a flowchart representing an exemplary method 800 performed by the network graph parser 234 to generate a list in the form of a histogram from the imported profiles.
- the method 800 is performed when a graph relating to at least one selected entity is provided for display by the network graph parser 234 and when the profiles for all of the selected entities and the entities linked to the selected entities have been received from the repository server 130 .
- the histogram may be provided in response to a user input selecting a histogram option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc.
- FIG. 9 shows an example of a graph 900 generated by the method 600 . It also shows an example of a histogram 990 created by the exemplary method 800 . It further shows a profile viewer 995 generated by selecting a node displayed in the graph 900 .
- the graph 900 , the histogram 990 , and the profile viewer 995 are displayed at the same time on different parts of the display 212 , for instance in the layout shown in the Figure.
- the graph 900 has been generated from the imported profiles of three user-selected entities, corresponding to displayed nodes 901 , 902 and 903 . As explained in FIG.
- the circles 910 , 920 and 930 represents first, second, third group of nodes corresponding to entities directly linked only to the user-selected entity nodes 901 , 902 , 903 , respectively.
- the network graph parser 234 selects one of the fields of a profile relating to one of the selected entities 901 , 902 , 903 .
- the profile contains fields of information common to all or many of the profiles such as place of birth, birth year, high school, and place of work.
- the node data engine 230 searches in all or selected imported profiles for profiles which have the same information in the same field.
- the node data engine 230 identifies which fields of the profile of the selected entity are populated.
- the plugin extracts the information (text, numbers or text and numbers) from the profile and searches the corresponding field of all the other profiles for the same information. Since the profiles for the entities are stored in the volatile memory allocated to the browser 1632 , this searching can be relatively fast.
- the node data engine 230 generates a record indicating any other entity which has the same information in the same field of the profile.
- the record is made in the working (volatile) memory 206 allocated to the web browser 1632 .
- the node data engine 230 determines whether there are other fields in the profile for the selected entity that include information and that have yet to be processed. If there are such other fields, then the method proceeds to operation 850 , where another field is selected, before the method returns to operation 820 . If all the fields have been processed, the method proceeds to operation 860 .
- the node data engine 230 determines whether all the selected entities have been processed. If not, then the next entity is selected for processing at operation 870 and the method then returns to operation 810 . If so, then at operation 880 the visualization engine 240 generates a histogram from the processed data. According to some embodiments, operation 880 is reached only when all completed fields for the selected entities (the entities which have been selected by a user in the method 300 , the method 400 or the method 600 ).
- operation 880 involves identifying counting the number of profiles with the same information in the same field, and forming a list. The list may ordered according to the count of profiles or by a value of the field.
- the histogram is displayed on a display screen of electronic device 110 at operation 890 .
- Operations 810 to 880 may be performed by the network graph parser 234 without the user having requested a histogram, according to some example embodiments. In this case, however, the histogram may be displayed at operation 890 only in response to the option having been selected by the user. In FIG. 9 , an example of such a histogram 990 is shown.
- the items in the profile description information shared by more than one entities in the graphs were A university, B high school, C high school, living in D city, living in E city, working at F company, working at G company and self-employment.
- the interface engine 210 receives a user input selecting one of the items.
- the user input may be in the form of the user clicking on the row of the histogram 990 .
- the user input may be in the form of moving cursors to indicate the desired entry in the histogram.
- the user input has been received for ‘Lives in E city,’ which is shared by five entities corresponding to nodes displayed in the graph.
- five nodes corresponding to the five entities sharing the profile description information ‘Lives in E city’ are highlighted.
- the five entities are treated as being participants in the “Lives in E city” group”; that is, the user's have the attribute of living in E city.
- the highlighted entities are represented by the differently colored nodes 904 , 905 , 906 , 907 and 908 .
- any one of the nodes in the graph 900 may be selected by the user using the input device 214 and the cursor control 216 .
- the profile 995 of the entity corresponding to the nodes may be displayed near the graph 900 .
- the profile view 995 may be generated and displayed near the graph 900 .
- the information included in the profile view 995 is present in the volatile memory allocated to the browser 1632 because the profile information was retrieved from the repository server 130 during performance of the method 400 , the method 600 or the method 800 .
- FIG. 10 is a flowchart showing a method 1000 performed by the network graph parser 234 of the electronic device 110 to provide a search facility which can be used to search the profiles of the imported entities, according to some example embodiments.
- the search facility may be provided in response to a user input selecting a search facility option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc.
- the network graph parser 234 may generate a search tool 1150 which can receive a user input for a keyword.
- the keyword ‘E city’ is input into a text entry box provided by the search tool 1150 .
- the keyword ‘E city’ corresponds to a group of users that live in the city called ‘E city”.
- FIG. 11 shows an example of a graph 1100 generated by the method 600 and an example of a search tool 1150 generated and operated by the exemplary method 1000 .
- a graph 1100 generated by the method 600 is shown in the left panel of FIG. 11 .
- the graph 1100 is similar to the graph 900 in FIG. 9 .
- the graph 1100 is generated from the imported lists of three accessed entities 1101 , 1102 and 1103 .
- the search tool 1150 may provide any form of user interface element that can receive the input of the user from the input device 214 .
- the search tool 1150 may provide a text box into which a user can type alphanumeric characters such as a word or words.
- the node data engine 230 may search in the profiles of the imported entities in the generated graph 1100 which have an entry that matches with the keyword input in the search tool 1150 . This is performed by searching the information in the profiles as stored in the working volatile memory allocated to the browser 1632 .
- the method proceeds to operation 1040 .
- the corresponding nodes in the graph 1100 are highlighted via the visualization engine 240 . If not, the result of search is reported at operation 1050 . In the example of FIG. 11 , five entities 1104 , 1105 , 1106 , 1107 and 1108 , are highlighted as a result of the search for the keyword ‘E City.’
- FIG. 12 is a flowchart showing a method 1200 performed by the network graph parser 234 of the electronic device 110 to filter the data associated with the entities in a plotted graph to produce a reduced graph, according to some example embodiments.
- FIG. 13 shows examples of graphs 1300 generated by the method 600 and examples of reduced graphs 1310 (e.g., a refined visual representation) and 1320 generated by the exemplary method 1200 .
- the filter instruction may be provided in response to a user input selecting a filter option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc. If the number of entities displayed in the graph 1300 is large, the graph may be of limited use to an analyst.
- the filtering method 1200 allows the isolation of the most significant entities and the removal of less significant entities. Such operation of filtering or reducing data may lead to more efficient, focused and targeted approach in repository website user analysis. This applies to analysis using the network graph parser 234 and to subsequent analysis after export to the database system 10 . Furthermore, trimming the graph before exporting data to the database system 10 may prevent the personal profile data of only marginally relevant or irrelevant individuals unnecessarily entering into the database system 10 for analysis. It may also provide regulation compliance advantages since information relating to fewer entities is imported into the database system 10 .
- the interface engine 210 generates a user interface element 1350 configured to receive a user input specifying a connection parameter, such as a minimum number of links that is of interest to the user (e.g., a level of connectedness). Limiting the minimum number of links may assist in selecting the entities with the most meaningful connections in the network represented in the graph 1300 .
- the user interface element 1350 may receive the user input via the input device 1614 or the cursor control 1616 .
- the node data engine 230 identifies the entities linked to other entities by the number of connections specified by the user input at operation 1210 . All of the connections in FIG. 13 correspond to links to one of the selected entities 1301 , 1302 and 1303 . Therefore, the number of links of an entity in the example of FIG. 13 only corresponds to the number of connections to the user-selected entities 1301 , 1302 and 1303 .
- the user input may be “2” or both “2” and “3”.
- the user input of both “2” and “3”, as shown in FIG. 13A may instruct the node data engine 230 to identify the entities with two and three links to selected entities.
- the user input of “3”, as shown in FIG. 13B may cause the network graph parser 234 to identify only the entities with links to three selected entities.
- the node data engine 230 searches the nodes (e.g., underlying node data in JSON format) corresponding to the identified entities.
- the nodes e.g., underlying node data in JSON format
- the entities having two links with the user-selected entities 1301 , 1302 and 1303 , corresponding to group of node 1305 have been highlighted as empty circles.
- the entities that are not identified at operation 1230 and that are not the user-selected entities 1301 , 1302 and 1303 may be removed from the graph 1300 , according to some example embodiments. This may be achieved by the network graph parser 234 receiving a user input to ‘inverse select’ the other entities that are not highlighted At operation 1230 , and then receiving an input to delete the selected nodes/entities, the delete input being received via the input device 1614 or the cursor control 1616 . Alternatively, the network graph parser 234 may receive a user input (e.g., a filter instruction) to remove all the entities except the highlighted entities at operation 1230 and the user-selected entities 1301 , 1302 and 1303 .
- a user input e.g., a filter instruction
- FIG. 13A shows an example of a graph 1310 reduced from the graph 1300 according to the method 1200 .
- the graph 1310 shows only the user-selected entities 1301 , 1302 and 1303 and the entities that are linked to two or three of the accessed entities, groups of nodes 1304 , 1305 , 1306 and 1307 .
- the graph 1320 shows an example of a graph trimmed from the graph 1300 according to the exemplary method 1200 .
- the graph 1320 shows only the user-selected entities 1301 , 1302 and 1303 and the entities that are linked to all three of the user-selected entities, namely the group of nodes 1305 .
- the reduced graphs 1310 or 1320 and/or associated profile description information may be exported to the database system 10 via export engine 250 .
- FIGS. 13A and 13B it is appreciated that the operations may first be performed on the underlying data used to generate the graphs. That is, the graph 1300 may be generated from initial node data collected from a connections page. A connection parameter may be received from the user that specifies the number of connections required to remain in the node data. Nodes not meeting the attribute specified by the connection parameter are removed. The resulting refined node dataset is then used to generate graph 1310 .
- FIG. 14 is a flowchart representing an exemplary method 1400 performed by an electronic device 110 to export the data associated with the entities in the reduced graphs 1310 and 1320 . This may correspond to operation 350 discussed above in relation to FIG. 3 .
- the network graph parser 234 may receive a user input which instructs the network graph parser 234 to export the reduced graphs 1310 or 1320 and associated data such as profile description information of the entities corresponding to the nodes displayed in the graphs 1310 or 1320 .
- the interface engine 210 receives a user input specifying an analysis description.
- the analysis description may be free text. It may relate to the origin, the history and the description of the data and the details regarding the repository website analysis performed.
- the analysis description may assist in generating trails such that it can be monitored that the performed analysis complies with any rules or regulations that may be relevant in the specific field of analysis.
- the analysis description also may be useful in case multiple sets of reduced and processed graphs are generated from different starting accessed entities, for example. If a specific entities appear in multiple sets of graphs, the analysis description of each graph may provide additional information therefore provide compounding value of multiple investigations.
- the network graph parser 234 may export the data to the database system 10 via export engine 250 .
- Operation 1430 may involve exporting data relating to entities corresponding to nodes displayed in the graph to the database system 10 without exporting data relating to entities corresponding to nodes not displayed in the graph.
- the reduced graph and the associated data may be transformed according to the specific ontology of the deployment for further analysis.
- profiles for the user-selected entities are sourced from the same electronic repository website service provider, the scope is not limited to this.
- profiles for an entity may be retrieved from two or more different repository servers 130 - 1 to 130 - n .
- the entity would ordinarily have different identities or usernames on the different electronic repository websites.
- the profiles can be determined by the network graph parser 234 to be related to the same entity by information included in either profile or in both profiles, or may be entered into the network graph parser 234 by the user of the network graph parser 234 .
- two or more different entities from different electronic repository servers 130 may be selected by the user of the network graph parser 234 as seed entities.
- information in profiles for linked to entities may be used to connect profiles in one or more of the repository servers (e.g., repository server 130 - 1 ) to corresponding profiles for the same entities in another repository server (e.g., repository server 130 - 2 ).
- FIG. 15A shows an example browser 1500 for parsing node data using the network graph parser 234 , according to some example embodiments.
- an analyst user navigates to the user profile of a user on a code repository website.
- the analyst user navigates to the URL 1505 (“repository/joan.labrador/”), which is a projects webpage of the software developer “Joan Labrador”.
- the analyst user is a user attempting to identify patterns between software projects and the software developer is a user that uploads the source code to a project webpage of a given software project.
- the projects webpage displays the user's uploaded software or project data 1510 as display elements (e.g., boxes, static text, hyperlinks).
- the title for each of the projects may contain a hyperlink that links to the project page for the corresponding project.
- “Smartwatch Exercise App” may be a hyperlink that links to a project page for that project.
- the project page for “Smartwatch Exercise App” may display source code uploaded by the software developer “Joan Labrador”.
- the project page may further contain links to the user profile pages of the seventeen developers that work on that project.
- the projects webpage is received as HTTP data from the repository server 130 .
- the webpage is generated from underlying source code in a format, such as HTML.
- the analyst user selects a plugin button 1515 which, as displayed, is integrated into the browser 1500 .
- the interface engine 210 displays a popup window 1520 having different parse options.
- the first option “Graph” parses all users associated with the user “Joan Labrador” and creates a visualization from the data as discussed above.
- the second option “Add to graph” adds Joan Labrador as a second entity.
- the analyst user may have selected a first user to parse (e.g., collect node data of related developers), and then want to select Joan Labrador as a further entity to parse (e.g., collect node data of developers related to Joan Labrador to add to the graph).
- a first user to parse e.g., collect node data of related developers
- Joan Labrador e.g., collect node data of developers related to Joan Labrador to add to the graph
- the network graph parser 234 parses the source code that generates the projects webpage to extract node data from Joan's projects as discussed above.
- the parse engine 220 can identify each of Joan's projects, including (1) “Smartwatch Exercise App”, (2) Java Note Taking client”, and (3) “Acme Corp. Enterprise CRM System”.
- the parse engine can navigate to the project page for each of the projects to identify users associated with Joan.
- the parse engine 220 can user the hyperlink “Smartwatch Exercise App” to navigate to the project page for that project.
- the parse engine can then identify user profile links on the project page (e.g., the 17 developers working on the “Smartwatch Exercise App” project) and navigate to the user pages to collect node data such as user name, profile page URL, for each of the associated users.
- the parse engine may perform similar operations to collect node data for the users associated with the other two code projects.
- the resulting data can then be used to generate visualizations, as shown in FIG. 15B .
- FIG. 15B displays a user interface 1550 showing a visualization 1555 generated from the node data of users associated with Joan Labrador through one or more coding or software projects. Each circle or node corresponds to a user associated with Joan through a project.
- the user interface 1550 may open in a second tab of the browser 1550 .
- the user interface 1550 includes a main area in which the visualization is displayed, and a right bar area 1570 . For example, selecting one of the buttons may display the user interface element 1350 ( FIG. 13B ) which the analyst user can use to specify a connection parameter. Father, as illustrated in the example of FIG.
- the right bar area 1570 can be used to show parsed node data 1557 of the selected entity “Joan Labrador.”
- the parsed node data 1557 may be parsed or extracted from the underlying source code of the webpage displayed in FIG. 15A (e.g. a user profile page).
- the corresponding node data for the node is shown in the right bar area 1570 .
- the right bar area may be used to show other types of visualizations, such as the histogram 990 , instead of the node data.
- the analyst can then user the histogram to select groups to modify the visualization 1555 .
- the network graph parser spiders to one or more hyperlink for each users listed in a project page and to collect parsed node data similar to Joan's parsed node data 1557 .
- FIG. 16 is a block diagram that illustrates a computer system 1600 , which may constitute the electronic device 110 , according to some example embodiments.
- computer system 1600 includes a bus 1602 or other communication mechanism for communicating information, and one or more hardware processors 1604 (including processor circuitry), coupled with bus 1602 for processing information.
- One or more hardware processors 1604 can be, for example, one or more general purpose microprocessors, each including processor circuitry.
- Computer system 1600 also includes a main memory 1606 , such as a random access memory (RAM) or other dynamic storage device, coupled to bus 1602 for storing information and instructions to be executed by processor 1604 .
- main memory 1606 such as a random access memory (RAM) or other dynamic storage device
- Main memory 1606 also can be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 1604 . Such instructions, when stored in non-transitory storage media accessible to one or more processors 1604 , render computer system 1600 into a special-purpose machine that is customized to perform the operations specified in the instructions. Main memory 1606 may also be used for temporarily storing the whole of part of applications, such as the web browser 1632 , including the network graph parser 234 , while they are being executed by the electronic device 110 . As illustrated in FIG. 2 , the network graph parser 234 may be integrated or installed into the web browser 1632 . For example, the network graph parser 234 may be installed as a plugin or extension of the web browser 1632 .
- the main memory 1606 is a volatile memory in that data stored therein is lost when power is no longer provided to the memory 1606 .
- the main memory 1606 is used to temporarily store information that is being processed by software applications, including the web browser 1632 and the network graph parser 234 .
- information that is temporarily stored includes webpages and ancillary content that is received from the repository servers 130 - 1 to 130 - n .
- information that is temporarily stored also includes information parsed from webpages by the network graph parser 234 and information derived from such received information by the plugin, as is described in detail below.
- Computer system 1600 further includes a read only memory (ROM) 1608 or other static storage device coupled to bus 1602 for storing static information and instructions for processor 1604 .
- the ROM 1608 is used for permanent storage of applications such as the web browser 1632 , including the network graph parser 234 , when the electronic device is not powered on and/or when the applications are not being executed by the processor 1604 .
- the storage is of the computer code or instructions that constitute the applications.
- a storage device 1610 such as a magnetic disk, optical disk, or USB thumb drive (Flash drive), etc., is provided and coupled to bus 1602 for storing information and instructions.
- Computer system 1600 can be coupled via bus 1602 to a display 1612 , such as an LCD or plasma display, or a touchscreen or cathode ray tube (CRT), for displaying information to a computer user.
- a display 1612 such as an LCD or plasma display, or a touchscreen or cathode ray tube (CRT)
- An input device 1614 is coupled to bus 1602 for communicating information and command selections to processor 1604 .
- cursor control 1616 is Another type of user input device, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 1604 and for controlling cursor movement on display 1612 .
- cursor control such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 1604 and for controlling cursor movement on display 1612 .
- the same direction information and command selections as cursor control may be implemented via receiving touches on a touch screen without a cursor.
- the processor 1604 under control of software and/or operating system
- non-transitory media refers to any media storing data and/or instructions that cause a machine to operate in a specific fashion. Such non-transitory media can comprise non-volatile media and/or volatile media.
- Non-volatile media includes, for example, optical or magnetic disks, such as storage device 1610 .
- Volatile media includes dynamic memory, such as main memory 1606 .
- non-transitory media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge, and networked versions of the same.
- Non-transitory media is distinct from, but can be used in conjunction with, transmission media.
- Transmission media participates in transferring information between storage media.
- transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 1602 .
- Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
- Various forms of media can be involved in carrying one or more sequences of one or more instructions to processor 1604 for execution.
- the instructions can initially be carried on a magnetic disk or solid state drive of a remote computer.
- the remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem.
- a modem local to computer system 1600 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal.
- An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 1602 .
- Bus 1602 carries the data to main memory 206 , from which processor 1604 retrieves and executes the instructions.
- the instructions received by main memory 1606 can optionally be stored on storage device 1610 either before or after execution by processor 1604 .
- Computer system 1600 also includes a communication interface 1618 coupled to bus 1602 .
- Communication interface 1618 provides a two-way data communication coupling to a network link 1621 that is connected to a local network 1622 .
- communication interface 1618 can be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line.
- ISDN integrated services digital network
- communication interface 1618 can be a local area network (LAN) card to provide a data communication connection to a compatible LAN.
- LAN local area network
- Wireless links can also be implemented.
- communication interface 1618 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
- Network link 1621 typically provides data communication through one or more networks to other data devices.
- network link 1621 can provide a connection through local network 1622 to data equipment operated by an Internet Service Provider (ISP) 1626 .
- ISP 1626 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 1628 .
- Internet 1628 uses electrical, electromagnetic or optical signals that carry digital data streams.
- the signals through the various networks and the signals on network link 1621 and through communication interface 1618 , which carry the digital data to and from computer system 1600 are example forms of transmission media.
- Computer system 1600 can send messages and receive data, including program code, through the network(s), network link 1621 and communication interface 1618 .
- a server 1627 might transmit a requested code for an application program through Internet 1628 , ISP 1626 , local network 1622 and communication interface 1618 .
- the received code can be executed by processor 1604 as it is received, and/or stored in storage device 1610 , or other non-volatile storage for later execution.
- the network graph parser 234 is integrated into the web browser 1632 to form part of the web browser 1632 .
- the user can first download the network graph parser 234 from an appropriate web site or other source (e.g. portable storage such as a thumb drive or a storage device on a local network) and then can proceed to install the network graph parser 234 .
- an appropriate web site or other source e.g. portable storage such as a thumb drive or a storage device on a local network
- a typical network graph parser 234 is designed to be compatible to a specific web browser 1632 (e.g., GoogleTM ChromeTM, MozillaTM FirefoxTM, MicrosoftTM Internet ExplorerTM, etc.)
- the network graph parser 234 can become a part of the web browser 1632 automatically after the network graph parser 234 is installed.
- various actions are described as being performed by the network graph parser 234 and/or the web browser 1632 . It will be appreciated that this is shorthand for computer program instructions that form part of the network graph parser 234 or the browser 1632 , as the case may be, being executed by the processor 1604 and causing the processor 1604 to take the action. In doing so, some or all of the computer code/instructions constituting the network graph parser 1634 and the browser 1632 are copied from the ROM 1608 and stored in the main memory 206 , which is a volatile memory, such that the computer code/instructions constituting the network graph parser 234 and the browser 1632 can be executed by the processor 1604 .
- the processor 204 In executing the computer code/instructions constituting the network graph parser 234 and the browser 1632 , the processor 204 is controlled to store data (other than the computer code/instructions constituting the network graph parser 234 and the browser 1632 ) temporarily in the main memory 1606 .
- the main memory 1606 is volatile memory and as such data stored therein is lost when the main memory 1606 is de-powered.
- Engines can constitute either software engines (e.g., code embodied on a machine-readable medium) or hardware engines.
- a “hardware module” is a tangible unit capable of performing certain operations and can be configured or arranged in a certain physical manner.
- one or more computer systems e.g., a standalone computer system, a client computer system, or a server computer system
- one or more hardware engines of a computer system e.g., a processor or a group of processors
- software e.g., an application or application portion
- a hardware engines can be implemented mechanically, electronically, or any suitable combination thereof.
- a hardware engines can include dedicated circuitry or logic that is permanently configured to perform certain operations.
- a hardware engines can be a special-purpose processor, such as a Field-Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC).
- a hardware engines may also include programmable logic or circuitry that is temporarily configured by software to perform certain operations.
- a hardware engines can include software executed by a general-purpose processor or other programmable processor. Once configured by such software, hardware modules become specific machines (or specific components of a machine) uniquely tailored to perform the configured functions and are no longer general-purpose processors. It will be appreciated that the decision to implement a hardware engines mechanically, in dedicated and permanently configured circuitry, or in temporarily configured circuitry (e.g., configured by software) can be driven by cost and time considerations.
- hardware engine should be understood to encompass a tangible entity, be that an entity that is physically constructed, permanently configured (e.g., hardwired), or temporarily configured (e.g., programmed) to operate in a certain manner or to perform certain operations described herein.
- “hardware-implemented engine” refers to a hardware module. Considering embodiments in which hardware engines are temporarily configured (e.g., programmed), each of the hardware modules need not be configured or instantiated at any one instance in time. For example, where a hardware module comprises a general-purpose processor configured by software to become a special-purpose processor, the general-purpose processor may be configured as respectively different special-purpose processors (e.g., comprising different hardware modules) at different times. Software accordingly configures a particular processor or processors, for example, to constitute a particular hardware module at one instance of time and to constitute a different hardware module at a different instance of time.
- Hardware modules can provide information to, and receive information from, other hardware modules. Accordingly, the described hardware modules can be regarded as being communicatively coupled. Where multiple hardware modules exist contemporaneously, communications can be achieved through signal transmission (e.g., over appropriate circuits and buses) between or among two or more of the hardware modules. In embodiments in which multiple hardware modules are configured or instantiated at different times, communications between such hardware modules may be achieved, for example, through the storage and retrieval of information in memory structures to which the multiple hardware modules have access. For example, one hardware module can perform an operation and store the output of that operation in a memory device to which it is communicatively coupled. A further hardware module can then, at a later time, access the memory device to retrieve and process the stored output. Hardware modules can also initiate communications with input or output devices, and can operate on a resource (e.g., a collection of information).
- a resource e.g., a collection of information
- processors that are temporarily configured (e.g., by software) or permanently configured to perform the relevant operations. Whether temporarily or permanently configured, such processors constitute processor-implemented modules that operate to perform one or more operations or functions described herein.
- processor-implemented module refers to a hardware module implemented using one or more processors.
- the methods described herein can be at least partially processor-implemented, with a particular processor or processors being an example of hardware.
- a particular processor or processors being an example of hardware.
- the operations of a method can be performed by one or more processors or processor-implemented modules.
- the one or more processors may also operate to support performance of the relevant operations in a “cloud computing” environment or as a “software as a service” (SaaS).
- SaaS software as a service
- at least some of the operations may be performed by a group of computers (as examples of machines including processors), with these operations being accessible via a network (e.g., the Internet) and via one or more appropriate interfaces (e.g., an Application Program Interface (API)).
- API Application Program Interface
- processors may be distributed among the processors, not only residing within a single machine, but deployed across a number of machines.
- the processors or processor-implemented modules can be located in a single geographic location (e.g., within a home environment, an office environment, or a server farm). In other example embodiments, the processors or processor-implemented modules are distributed across a number of geographic locations.
- FIGS. 1-15 are implemented in some embodiments in the context of a machine and an associated software architecture.
- the sections below describe representative software architecture and machine (e.g., hardware) architecture that are suitable for use with the disclosed embodiments.
- inventive subject matter has been described with reference to specific example embodiments, various modifications and changes may be made to these embodiments without departing from the broader scope of embodiments of the present disclosure.
- inventive subject matter may be referred to herein, individually or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single disclosure or inventive concept if more than one is, in fact, disclosed.
- the term “or” may be construed in either an inclusive or exclusive sense. Moreover, plural instances may be provided for resources, operations, or structures described herein as a single instance. Additionally, boundaries between various resources, operations, modules, engines, and data stores are somewhat arbitrary, and particular operations are illustrated in a context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within a scope of various embodiments of the present disclosure. In general, structures and functionality presented as separate resources in the example configurations may be implemented as a combined structure or resource. Similarly, structures and functionality presented as a single resource may be implemented as separate resources. These and other variations, modifications, additions, and improvements fall within a scope of embodiments of the present disclosure as represented by the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
- This application is a continuation of U.S. patent application Ser. No. 15/642,820, filed Jul. 6, 2017, which claims priority to U.S. Provisional Patent Application Ser. No. 62/448,081, filed Jan. 19, 2017, the disclosure of which are incorporated herein in their entireties by reference.
- Embodiments of the present disclosure relate generally to pattern detection and, more particularly, but not by way of limitation, to manipulating data via a network graph parser to expose previously undetected patterns.
- A code repository website allows users to publish software code projects to the website so that other users can access, view, edit, or otherwise use the published software code. Identifying how different projects (e.g., software coding projects) are related to one another is currently impractical because the project data on the code repository websites is largely unstructured.
- Various ones of the appended drawings merely illustrate example embodiments of the present disclosure and should not be considered as limiting its scope.
-
FIG. 1 is a block diagram illustrating a networked system in which a network graph parser can be implemented, according to some example embodiments. -
FIG. 2 is a block diagram showing functional components provided within the network graph parser, according to some example embodiments. -
FIG. 3 shows a flow diagram for generating node data for export, according to some example embodiments. -
FIG. 4 shows a flow diagram for parsing node data from multiple selected entities, according to some example embodiments. -
FIGS. 5A and 5B show example visualizations of node data, according to some example embodiments. -
FIG. 6 shows a flow diagram for selecting entities, according to some example embodiments. -
FIG. 7 shows example visualizations from node data of different selected entities, according to some example embodiments. -
FIG. 8 shows an example flow diagram for processing entity data, according to some example embodiments. -
FIG. 9 shows example visualizations, according to some example embodiments. -
FIG. 10 shows an example flow diagram for receiving filter instructions, according to some example embodiments. -
FIG. 11 shows example visualization and user interface elements for filtering node data, according to some example embodiments. -
FIG. 12 shows a flow diagram for filtering node data, according to some example embodiments. -
FIGS. 13A and 13B show example visualization and user interface elements for filtering node data, according to some example embodiments. -
FIG. 14 shows an example flow diagram for analysis and export of node data, according to some example embodiments. -
FIGS. 15A and 15B show example user interfaces for processing network graphs using a network graph parser, according to some example embodiments. -
FIG. 16 illustrates a diagrammatic representation of a machine in the form of a computer system within which a set of instructions may be executed for causing the machine to perform any one or more of the methodologies discussed herein, according to an example embodiment. - The description that follows includes systems, methods, techniques, instruction sequences, and computing machine program products that embody illustrative embodiments of the disclosure. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide an understanding of various embodiments of the inventive subject matter. It will be evident, however, to those skilled in the art, that embodiments of the inventive subject matter may be practiced without these specific details. In general, well-known instruction instances, protocols, structures, and techniques are not necessarily shown in detail.
- In various example embodiments, a network graph parser is implemented to parse data from websites (e.g., code repository websites) into human understandable patterns. According to some example embodiments, the code repository websites are websites or network-based publication platforms (e.g., Internet forums) that allow users to publish data viewable by other users of the website or platform. For example, a software developer can create a project page on a code repository site and publish his/her code for the project to the project page. Other uses may navigate to the project page, view, download, or modify the code for the projects.
- According to some example embodiments, the network graph parser is installed as a browser plugin of an Internet browser application. A data analyst may navigate to a given page on a repository website projects, such as a page created or associated with the project or a contributor. The analyst may then trigger the parse operation by selecting a browser plugin button. The parse operation goes through the page and saves data on the page and on related pages. For example, the network graph parser may identify links to projects listed on the repository website. In some embodiments, the network graph parser may navigate to each of the projects.
- The saved data may be used to generate a visual representation (e.g., a network graph) of the collected data. The data analyst may manipulate the visual representation to explore patterns. Further, the data analyst may hone down onto specific subsets by issuing filter instructions. For example, the data analyst may filter out any connections that don't have at least two connections to other nodes. Contributors may have connections to one another by working together on the same coding project, as an example. The various filter instructions expose previously invisible patterns in the network graph. The honed down data containing the pattern can then be exported over a network to a data analysis server for further analysis, according to some example embodiments.
-
FIG. 1 is a block diagram depicting anetworked system 100 comprising anelectronic device 110, and one or more components external to theelectronic device 110. These external components include adatabase system 10,network 120, and a plurality of repository servers 130-1 to 130-n, that host repository websites. According to some example embodiments, theelectronic device 110 is a client device, such as a personal computer, a tablet computer, a personal digital assistant (PDA), a mobile phone, a smart-phone, or any other web-enabled computing device with a processor and a memory. Theelectronic device 110 has installed thereon a web browser application (e.g.,web browser 1632 inFIG. 16 ), on which is installed a network graph parser. According to some example embodiments, the network graph parser is integrated into the web browser application as a plugin or browser extension. Each of the plurality of repository servers 130-1 to 130-n comprises hardware and software. Each of the plurality of repository servers 130-1 to 130-n is able to communicate with theelectronic device 110 via thenetwork 120. - In some embodiments, some of the plurality of repository servers 130-1 to 130-n can be a part of a cloud, which can include, for example, one or more networked servers. Such networked servers may be termed a data center or a server farm. Such data centers currently are maintained by various communication network service providers.
Network 120 can be, for example, the Internet, an intranet, a local area network, a wide area network, a campus area network, a metropolitan area network, an extranet, a private extranet, or a combination of any of these or other appropriate networks. - For the exemplary embodiment of
FIG. 1 , it is understood that theelectronic device 110 is separate from theexternal database system 10 but connected thereto by a link. Alternatively, thedatabase system 10 may be disposed in an air-gapped, high-side environment, where thedatabase system 10 is physically isolated from thenetwork 120 and theelectronic device 110, such that a higher level of classified information can be maintained in thedatabase system 10. - The
electronic device 110 may be implemented by one or more specially configured computing devices. Theelectronic device 110 may be hard-wired to perform the operations, techniques, etc. described herein. Theelectronic device 110 can include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the operations, techniques, etc. described herein. Theelectronic device 110 can include one or more general purpose hardware processors (including processor circuitry) programmed to perform such features of the present disclosure pursuant to program instructions in firmware, memory, other storage, or a combination. Theelectronic device 110 can also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the methods and other features. - The
electronic device 110 can be generally controlled and coordinated by operating system software, such as iOS, Android, Blackberry, Chrome OS, Windows XP, Windows Vista,Windows 7,Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, VxWorks, or a proprietary operating system. The operating system controls and schedules computer processes for execution, perform memory management, provide file system, networking, I/O services, and provides a user interface functionality, such as a graphical user interface (“GUI”), among other things. -
FIG. 2 shows internal functional components of thenetwork graph parser 234, according to some example embodiments. In the illustrated embodiment, the network graph parser is implemented as a plug-in or browser extension for a web browser. As illustrated, thenetwork graph parser 234 comprises aninterface engine 210, a parseengine 220, anode data engine 230, avisualization engine 240, and anexport engine 250. Theinterface engine 210 is configured to interface with thebrowser 1632 as a plugin. Further, theinterface engine 210 is configured to interface with entities outside theelectronic device 110, such as therepository server 130. The parseengine 220 is configured to parse node data from a code project webpage. Node data is data of an object associated with a given project. In some example embodiments, the node data is user data (e.g., data of software developers) associated with a given project, where each user may be involved in several different software projects. In some example embodiments, the node data includes code portions (e.g., classes, functions) shared between different projects. For example, two different projects may share an Optical Character Recognition (OCR) class, and the OCR class code can be used as a node associated with each software project's network graph, as discussed in further detail below. As a further example, according to some example embodiments, metadata describing an object (e.g., code portion) may be used as a node associated with each software project's network graph. - In some example embodiments, where the repository website is configured to provide node data, the parse
engine 220 is configured to send node data requests for the users of the repository website. The repository website can receive the requests and issue responses including the requested node data. Thenode data engine 230 is configured to process the node data received via the code projects webpage (e.g., via spidering) or received from the repository website. Thenode data engine 230 can receive filter instructions from a user and cull (e.g., refine) the node data by removing data of users that do not meet the requirements of the filter instruction, as explained in further detail below. Thevisualization engine 240 is configured to use the initial node data or the refined node data and generate different types of visualizations for display on the display screen of theelectronic device 110. The visualizations may include a network graph, a histogram, graphs such bar charts or data plots, and other visualizations. Theexport engine 250 is configured to export the refined dataset to an analysis server for further analysis. -
FIG. 3 is a flowchart representing anexemplary method 300 performed by an electronic device for collecting and analyzing data from repository website systems, according to some example embodiments. While the flowchart discloses the following operations in a particular order, it will be appreciated that at least some of the operations can be moved, modified, or deleted where appropriate, consistent with the teachings of the present disclosure. In the depicted embodiment ofFIG. 3 , a user can utilize an electronic device (e.g., electronic device 110) that comprises aweb browser 1632, for example, Google™ Chrome™, Mozilla™ Firefox™, Microsoft™ Internet Explorer™, etc. Theweb browser 1632 is usable to access web content (e.g. provided by the repository servers 130-1 to 130-n) via a network (e.g., network 120), such as the Internet or an intranet. - At
operation 310, anetwork graph parser 234 is installed as a plugin in theweb browser 1632 of the electronic device. Thenetwork graph parser 234 may be termed a browser extension, according to some example embodiments. Thenetwork graph parser 234 extends the functionality of theweb browser 1632, as is described in detail below. Thenetwork graph parser 234 may be authored using a web technologies such as HTML, JavaScript, or CSS (Cascading Style Sheets). - Referring again to
FIG. 3 , atoperation 320, theinterface engine 210 accesses communication and node data related content from a repository server 130 (e.g., one of the repository servers 130-1 to 130-n) using theweb browser 1632. In the following description, reference is made generally to accessing content from arepository server 130, and it will be appreciated that, unless the context indicates otherwise, such references are to accessing content from aparticular repository server 130, for instance the first repository server 130-1. - According to some example embodiments, the
repository server 130 is accessed through thebrowser 1632 causing sending of a request (e.g. an HTTP request) to the repository server 130 (in particular to a webserver included as part thereof. - Once the user has accessed the
repository server 130 using thebrowser 1632, they may control thebrowser 1632 to interact with therepository server 130 using user interface controls provided in thebrowser 1632 by thenetwork graph parser 234 or using controls provided by the browser itself. In some example embodiments, the information received from the repository webservice comprises a projects webpage showing different coding or software projects associated with a user of the repository webservice. - At
operation 330, the parseengine 220 parses the data from the projects webpage and stores the data in local memory of theelectronic device 110. According to some example embodiments, node data is a user profile and relates to an entity that is included as part of the repository network service provided by therepository server 130. Further, according to some example embodiments, an entity typically relates to an individual programmer, but may relate to an organization, for instance a business or other group. In some example embodiments, a software developer profile includes at least a unique identifier (the identifier uniquely identifies the entity on the repository service), a name for the entity (typically a string of text, perhaps alphanumeric characters) and a plurality of links between the entity and other entities that form part of the repository webservice. - The links may be bidirectional in nature. For example, two software developers may collaborate on the same code project. Because the two developers work on the same coding project, they may be bidirecitonally linked under the assumption that each knows of the other as a fellow coder (e.g., team member, colleague) on the project. The links may alternatively be unidirectional, e.g., the first software developer receives updates published by the second software developer but the second software developer does not receive updates published by the first software developer. In some embodiments, the data stored on the repository website indicates the type of communication activity between the users. For example, the node data may include an indication that a first user commented on a pending code update on a project page on the repository website. The links may indicate the another entity by including a identifier that is unique to the other entity. Typically, repository webservices provide an identifier that is an alphanumeric string. The string may be known to the entity and other users (e.g. it may be their username) or it may be a system-generated identifier which does not need to be known to the user (e.g. a string such as “exampleidentifier$43*”). The profile may also include a uniform resource locator (URL) that is unique to the entity.
- A user profile of the repository website may also have other information associated with pre-defined fields, for instance ‘high school attended’, ‘place of residence’, ‘place of work’, ‘undergraduate study subject’, etc. The profile may also have other content such as photographs, videos, comments or profile text, etc. Profile content may be associated with particular dates (and as such may appear in a timeline on a user's profile page) or may not be dependent on a date (and so may not generally appear on a timeline). In some embodiments, profile content may be associated with geotagged data.
- In some example embodiments, user profiles are imported in response to user input. For example, a first profile is imported by the
network graph parser 234 in response to the user selecting a first entity in the repository service. This may occur for instance by the user selecting a hyperlink in a code projects webpage provided by therepository server 130. The code projects webpage may be provided by therepository server 130 in response to the user entering text, e.g. the whole or part of a name on an entity, into a search field of a webpage provided by the by therepository server 130. The code projects webpage displays coding projects of the first entity, where each of the coding projects has its own projects webpage, which can be spidered as described above. According to some example embodiments, upon selection of the first entity, thenetwork graph parser 234 sends a request to therepository server 130 identifying the first entity. In response, therepository server 130 provides the code projects webpage of the first entity, which is then parsed by thenetwork graph parser 234. In some example embodiments, thenetwork graph parser 234 extracts node data from the code projects webpage by accessing the source code (e.g., markup language) of the code projects webpage and then extracting the node data listed in the source code. The received node data is stored in volatile memory (e.g. RAM) allocated to thebrowser 1632, but is not stored in permanent memory, e.g. ROM. - After the node data of the first entity is imported by the
network graph parser 234, or at least after importation has begun, the user selects a second entity. This may occur for instance by the user selecting a hyperlink relating to the second entity in a second code projects webpage provided by therepository server 130. Upon selection of the second entity, thenetwork graph parser 234 sends a request to therepository server 130 identifying the second entity. In response, therepository server 130 provides a second code projects webpage that lists all the coding projects for the second entity on the repository server. The parseengine 220 then parses the source code of the second code projects webpage to extract additional node data of the users associated with the second entity (e.g., users that have worked on the same coding project as the second entity). The received profile is stored in volatile memory (e.g. RAM) allocated to thebrowser 1632, but is not stored in permanent memory, e.g. ROM. - According to some example embodiments, the
network graph parser 234 is configured to automatically import node data for entities to which the first and second entities are linked, e.g., for which links from the first and second entities exist. The parseengine 220 is configured to import such node data by sending requests to therepository server 130 identifying the further entities and navigating to the code projects webpages of the entities. - At operation 235, the
node data engine 230 transforms the contributor data parsed from the projects webpage from a first format into a second format. For example, the underlying source code of the projects webpage may be a markup language, such as HTML. The node data parsed from the projects webpage may also be in the markup language format. Thenode data engine 230 is configured to transform the node data from the markup language format to an attribute-value format, such as JSON (JavaScript Object Notation). The node data in the second format can be used for filtering and generation of the visualizations. - At operation 340, the
visualization engine 240 creates a visual representation from the parsed node data (e.g., node data in the attribute-value format). In some example embodiments, the visual representation is generated as a network graph in an additional tab of thebrowser 1632. The network graph includes a collection of nodes connected by edges. Each node corresponds to a user from one of the projects listed on a code projects webpage, and connections between individual nodes may be visually represented as lines, for example straight lines. In some example embodiments, two nodes are connected on the repository server if each of the nodes are associated with the same coding project. The graph may lend itself to be further processed, analyzed and manipulated by an analyst or other user. The details regarding operation 340 are explained in more detail later. - At
operation 350, theexport engine 250 exports the graph formed from the operation 340 to thedatabase system 10. Thedatabase system 10 is connected to the electronic device 110 (as shown inFIG. 1 ), according to some example embodiments. Further, according to some example embodiments, thedatabase system 10 is implemented as a backend system disposed in an air-gapped, high-side environment, separated from thenetwork 120 and theelectronic device 110. Thedatabase system 10 may be dedicated to receive data for further analysis. Therefore,network graph parser 234 of operation 340 can be used to collect and pre-process the data such that it is compatible with thedatabase system 10. -
FIG. 4 is a flowchart representing amethod 400 performed by thenetwork graph parser 234 for importing node data of the first and second entities from therepository server 130 and for creating a graph, according to some example embodiments. Themethod 400 is an example of sub-operations performed to completeoperations 330 and 340 ofFIG. 3 discussed above.FIGS. 5A and 5B show an example of the graph created by theexemplary method 400. - At operation 410, the
interface engine 210 receives selection of first entity through a user input, for instance through a bookmark, favorite, or through selection of an option provided in a list of search results. Atoperation 420, theinterface engine 210 requests the profile of the first entity. This involves thenetwork graph parser 234 accessing therepository server 130 via thenetwork 120 and in particular accessing the first entity (e.g., projects webpage of the first entity) in therepository server 130. In particular, thenetwork graph parser 234 may send an HTTP request to therepository server 130, the request including the unique identifier of the first entity. - At
operation 430, thenetwork graph parser 234 receives the profile or projects webpage of first entity. The profile is for example received as an HTTP response. According to some example embodiments, the profile includes a name for the first entity and details of connections of the first entity. The connections define links to other entities, and include unique identifiers for the other entities. In some example embodiments, one or more webpages of the first entity may be exposed through automatic scrolling of the one or more webpages. For example, a top portion of a first entity's webpage may be initially retrieved, and further portions below the top portions may be auto populated by script as those portions are scrolled to. In some example embodiments, the auto populated scrolled-to portions are received atoperation 430. - At
operation 440, thevisualization engine 240 displays a graph relating to first entity. For example, thenetwork graph parser 234 may display a group or ‘cloud’ of nodes, each node relating to an entity. The node relating to the first entity is displayed with different visible characteristics to nodes for other entities. For instance, it may be a different color or size. All the nodes for entities linked to the first entity are shown as being connected by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the first entity. In some embodiments, connections between nodes other than connections between the first entity and other nodes may not be displayed in the graph. - Further, in some example embodiments, a further entity (e.g., a second entity of operation 460) need not be specified for links between nodes to be created. For example, an entity associated with a given code repository page may be identified (e.g., at operation 410). The code repository page may list other coding projects with which the entity is involved (e.g., develops code). Each coding project may list other further entities associated with the given project. Using the identified entity, the additional projects and additional entities can all automatically be included in a single network graph, according to some example embodiments.
- In the following discussion, the terms ‘connected’ and ‘linked’ in relation to entities included in the electronic repository website can be used interchangeably.
FIG. 5A shows an example of parsing profiles of users. In the left panel ofFIG. 5A , anode 501 is displayed as an empty circle. Thenode 501 corresponds to the first entity. Each of ten nodes displayed as a group around thenode 501 represents a different entity to which the first entity is linked or connected, as identified from the profile of the first entity. Each such node in the group (other than the first node 501) is connected to thenode 501 with a respective straight line, which represents a link between the corresponding two entities. In the following, a group of nodes connected to thefirst node 501 once by a single link in the created graph may be represented as being enclosed within a dotted circle, as shown in the right panel ofFIG. 5A . - At
operation 450, thenetwork graph parser 234 begins requesting profiles of entities linked to by the first entity. In some example embodiments, the profiles are parsed from a code projects webpage of the first entity. For example, users associated with the first entity may be displayed in a projects webpage. The underlying markup language of the code projects webpage can be parsed to extract the username, user profile URL, and other information for each of the users associated with the first entity. - At
operation 455, profiles of the entities are stored as they are received. In one embodiment, the profiles are stored in non-volatile memory that is allocated to thebrowser 1632. Profiles may continue to be requested and saved as a background task whilst thenetwork graph parser 234 performs other tasks. - At
operation 460, theinterface engine 210 receives selection of a second entity. This may occur as described above in relation to receiving selection of the first entity. - At operation 470, the
network graph parser 234 receives the profile of the second entity, after requesting the profile of the second entity. The profile is for example received as part of an http response. The profile includes at least a name for the second entity and details of connections of the second entity. The connections define links to other entities, and include unique identifiers for the other entities. - At
operation 490, thevisualization engine 240 displays a graph relating to the first and second entities. For example, thenetwork graph parser 234 may display three groups (or clouds) ofnodes nodes first group 530 of nodes corresponds to an entity linked to in the profiles of both the first and second entities. Each node of thesecond group 510 of nodes corresponds to an entity linked to by the profile of the first entity but not by the profile of the second entity. Each node of thethird group 520 of nodes corresponds to an entity linked to by the profile of the second entity but not by the profile of the first entity. All the nodes for entities connected to the first entity are shown as being connected to thenode 501 by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the first entity. All the nodes for entities connected to the second entity are shown as being connected to thenode 502 by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the second entity. Connections between nodes other than connections between one of thenode 501 and thenode 502 and other nodes are not displayed in the graph. - At
operation 490, thevisualization engine 240 creates a new graph after removing the graph as shown inFIG. 5A . Alternatively, thenetwork graph parser 234 may augment and rearrange the graph created atoperation 440. An example of the created graph atoperation 490 is shown inFIG. 5B .Nodes Third group 510 comprising seven nodes (represented as filled circles), correspond to the entities linked to by both the first entity and the second entity. Thesecond group 510 comprise only three nodes because seven nodes previously in the sole group now belong to thefirst group 530. The profile of the second entity includes links to fourteen entities. Seven entities linked to in the profile for the second entity belong to thefirst group 530, and the other seven entities belong to thethird group 520. The nodes of the group fromFIG. 5A is now split and rearranged into two groups, namelysecond group 510 andfirst group 530. Therefore, in displaying nodes corresponding to the entities linked to by thefirst entity 501 andsecond entity 502, accessed by thenetwork graph parser 234, they are grouped into three groups: thesecond group 510 linked only to thefirst entity 501, thethird group 520 linked only to thesecond entity 502, and thefirst group 530 linked both to thefirst entity 501 andsecond entity 502. - At
operation 495, theinterface engine 210 begins requesting profiles of entities linked to by the second entity. In some example embodiments, the operations of 450 and 495 (e.g., requests for profiles of related entities) are initiated by a manual user request. For example, after the user (at operation 420) requests profile of first entity, the user (at operation 450) further requests (e.g., using a GUI button) the profiles of entities related to the first entity. Further, according to some example embodiments, the operations of 450 and 495 are performed automatically by the network graph parser. For example, after the user (at operation 420) requests profile information of the first entity, thenetwork graph parser 234 automatically retrieves and sends the profile information of the specified first entity but also retrieves and sends profile information of entities related to the first entity automatically (e.g., without the user manually initiating the request for profile information of the related entities). - At
operation 497, profiles of the entities are stored as they are received. In one embodiment, the profiles are stored in volatile memory, e.g. theRAM 1606, that is allocated to thebrowser 1632. Profiles may continue to be requested and saved as a background task whilst thenetwork graph parser 234 performs other tasks. Further, according to some example embodiments, the display operations of method 400 (e.g.,operations 440 and 490) are bypassed until the some or all of the information collection operations (e.g.,operations -
FIG. 6 is a flowchart showing amethod 600 performed by thenetwork graph parser 234 for importing further seed entities from the electronic repository website system hosted from therepository server 130 and for creating a graph, according to some example embodiments. This may correspond to at least part ofoperations 330 and 340 ofFIG. 3 , as discussed above. -
FIG. 7 shows an example of the graph created by theexemplary method 600. Prior tooperation 610, the plugin is processing two or more entities, for instance as is shown inFIG. 5B and is present at the end of the flowchart ofFIG. 4 . Atoperation 610, theinterface engine 210 receives selection of further entity through a user input, for instance through a bookmark, favorite or through selection of an option provided in a list of search results. - At
operation 620, theinterface engine 210 requests or parses the profile of the further entity. This is similar tooperation 420. This involves theinterface engine 210 accessing therepository server 130 via thenetwork 120 and accessing first entity in one of the electronic repository webservice system in therepository server 130. In particular, theinterface engine 210 may send an HTTP request to therepository server 130, the request including the unique identifier of the first entity. Alternatively, the network connection parser can parse a code projects webpage to extract profile information of user connected to the second entity. - At
operation 630, theinterface engine 210 receives the profile of the further entity. This is similar tooperation 430. The profile is for example received as part of an HTTP response. The profile includes at least a name for the first entity and details of connections of the further entity. The connections define links to other entities, and include unique identifiers for the other entities. - At
operation 640, thevisualization engine 240 displays a graph relating to all the selected entities. Here, thevisualization engine 240 may cause display of multiple groups (or clouds) of nodes, each node relating to an entity. Each group relates to a collection of nodes that have the same connections to the selected entities. Where there are three selected entities, there are seven groups. Each node of the first group of nodes corresponds to an entity linked to in the profiles of both the first and second entities, but not the third entity. Each node of the second group of nodes corresponds to an entity linked to by the profile of the first entity but not by the profile of the second or third entities. Each node of the third group of nodes corresponds to an entity linked to by the profile of the second entity but not by the profile of the first or third entities. Each node of the fourth group of nodes corresponds to an entity linked to in the profiles of both the first and third entities, but not the second entity. Each node of the fifth group of nodes corresponds to an entity linked to by the profile of the second and third entities but not by the profile of the first entity. Each node of the sixth group of nodes corresponds to an entity linked to by the profiles of the second and third entities but not by the profile of the first entity. Each node of the seventh group corresponds to an entity linked to by each of the first, second and third entities. One or more of the groups may not exist, if there are no nodes that meet the criteria for that group (these groups might be said to have zero nodes). - The nodes relating to the selected entities are displayed with different visible characteristics to nodes for other entities. For instance, they may be a different color or size. All the nodes for entities connected to the one of the selected entities are shown as being connected by the inclusion on the graph of a line, e.g. a straight line, connecting the node to the node for the selected entity. Where a non-selected node has links to multiple selected entities, there is a line for each such connection. In some embodiments, connections between two nodes that relate to non-selected entities may be hidden or not displayed in the graph. In some example embodiments, the graph may simplify or de-clutter the graph by hiding links between nodes and/or nodes based upon whether a give node or one of its neighbors is selected. For example, if the user selects a given node, the visualization engine may only display notes that are directly linked to the given node.
- At
operation 650, theinterface engine 210 begins requesting profiles of entities linked to by the further entity. In some example embodiments, the user manually requests the profiles of entities linked to by the further entity. Atoperation 660, profiles of the entities are stored as they are received. In one embodiment, the profiles may be stored in volatile memory that is allocated to thebrowser 1632. Profiles may continue to be requested and saved as a background task whilst thenetwork graph parser 234 performs other tasks. Atoperation 670, the operation may check whether another entity has been selected by the user. If so, the operation returns tooperation 620, where the profile for the further selected entity is requested. Further, in some example embodiments, the selections of additional entities are processed in batches. For example, instead of requesting information of a single further entity and then receiving the information of the single further entity (e.g., method 600), the user can select a plurality of entities, then request their information as a batch process (e.g., as part of a single request). - Further, according to some example embodiments, the display operation of
method 600 may be bypassed or delayed until other operations are complete. For example, operation 650 (an information collection related operation) may be performed before operation 640 (a display related operation). As a further example, the information collected atoperation 650 may be stored to memory andoperation 640 is bypassed and a display is never generated). -
FIG. 7 shows a screenshot of an example of graph generated by thevisualization engine 240, according to some example embodiments. Here, sixnodes nodes - It can be seen from
FIG. 7 that nodes are grouped together depending on which one combination of the six user-selected entities they are linked to. For example, the group ofnodes 711 correspond to non user-selected entities linked to two of the user-selectedentities entities nodes 713 includes nodes relating to entities linked to user-selectedentities -
FIG. 8 is a flowchart representing anexemplary method 800 performed by thenetwork graph parser 234 to generate a list in the form of a histogram from the imported profiles. Themethod 800 is performed when a graph relating to at least one selected entity is provided for display by thenetwork graph parser 234 and when the profiles for all of the selected entities and the entities linked to the selected entities have been received from therepository server 130. The histogram may be provided in response to a user input selecting a histogram option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc. -
FIG. 9 shows an example of agraph 900 generated by themethod 600. It also shows an example of ahistogram 990 created by theexemplary method 800. It further shows aprofile viewer 995 generated by selecting a node displayed in thegraph 900. Thegraph 900, thehistogram 990, and theprofile viewer 995 are displayed at the same time on different parts of the display 212, for instance in the layout shown in the Figure. Thegraph 900 has been generated from the imported profiles of three user-selected entities, corresponding to displayednodes FIG. 5a and the accompanying paragraphs above, thecircles entity nodes nodes entities group 905 of nodes linked to all three of the selectedentities - At
operation 810, thenetwork graph parser 234 selects one of the fields of a profile relating to one of the selectedentities - At
operation 820, thenode data engine 230 then searches in all or selected imported profiles for profiles which have the same information in the same field. In particular, thenode data engine 230 identifies which fields of the profile of the selected entity are populated. For a populated field, the plugin extracts the information (text, numbers or text and numbers) from the profile and searches the corresponding field of all the other profiles for the same information. Since the profiles for the entities are stored in the volatile memory allocated to thebrowser 1632, this searching can be relatively fast. - At
operation 830, thenode data engine 230 generates a record indicating any other entity which has the same information in the same field of the profile. The record is made in the working (volatile) memory 206 allocated to theweb browser 1632. - At
operation 840, thenode data engine 230 determines whether there are other fields in the profile for the selected entity that include information and that have yet to be processed. If there are such other fields, then the method proceeds tooperation 850, where another field is selected, before the method returns tooperation 820. If all the fields have been processed, the method proceeds tooperation 860. - At
operation 860, thenode data engine 230 determines whether all the selected entities have been processed. If not, then the next entity is selected for processing atoperation 870 and the method then returns tooperation 810. If so, then atoperation 880 thevisualization engine 240 generates a histogram from the processed data. According to some embodiments,operation 880 is reached only when all completed fields for the selected entities (the entities which have been selected by a user in themethod 300, themethod 400 or the method 600). - According to some example embodiments,
operation 880 involves identifying counting the number of profiles with the same information in the same field, and forming a list. The list may ordered according to the count of profiles or by a value of the field. Followingoperation 880, the histogram is displayed on a display screen ofelectronic device 110 atoperation 890.Operations 810 to 880 may be performed by thenetwork graph parser 234 without the user having requested a histogram, according to some example embodiments. In this case, however, the histogram may be displayed atoperation 890 only in response to the option having been selected by the user. InFIG. 9 , an example of such ahistogram 990 is shown. In this example, the items in the profile description information shared by more than one entities in the graphs were A university, B high school, C high school, living in D city, living in E city, working at F company, working at G company and self-employment. - Returning to
FIG. 9 , atoperation 893, theinterface engine 210 receives a user input selecting one of the items. In some embodiments, the user input may be in the form of the user clicking on the row of thehistogram 990. In some embodiments, the user input may be in the form of moving cursors to indicate the desired entry in the histogram. In the example shown inFIG. 9 , the user input has been received for ‘Lives in E city,’ which is shared by five entities corresponding to nodes displayed in the graph. Atoperation 896, in response to this user input, five nodes corresponding to the five entities sharing the profile description information ‘Lives in E city’ are highlighted. The five entities are treated as being participants in the “Lives in E city” group”; that is, the user's have the attribute of living in E city. InFIG. 9 , the highlighted entities are represented by the differently colorednodes - At any time, any one of the nodes in the
graph 900 may be selected by the user using the input device 214 and the cursor control 216. Once selected, theprofile 995 of the entity corresponding to the nodes may be displayed near thegraph 900. InFIG. 9 , for example, when the entity corresponding to thenode 904, which is highlighted due to the fact that the profile indicates that the entity ‘lives in E city,’ is selected by the user, theprofile view 995 may be generated and displayed near thegraph 900. The information included in theprofile view 995 is present in the volatile memory allocated to thebrowser 1632 because the profile information was retrieved from therepository server 130 during performance of themethod 400, themethod 600 or themethod 800. -
FIG. 10 is a flowchart showing amethod 1000 performed by thenetwork graph parser 234 of theelectronic device 110 to provide a search facility which can be used to search the profiles of the imported entities, according to some example embodiments. The search facility may be provided in response to a user input selecting a search facility option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc. Atoperation 1010, thenetwork graph parser 234 may generate asearch tool 1150 which can receive a user input for a keyword. In the example ofFIG. 11 , the keyword ‘E city’ is input into a text entry box provided by thesearch tool 1150. The keyword ‘E city’ corresponds to a group of users that live in the city called ‘E city”. -
FIG. 11 shows an example of a graph 1100 generated by themethod 600 and an example of asearch tool 1150 generated and operated by theexemplary method 1000. In the left panel ofFIG. 11 , an example of a graph 1100 generated by themethod 600 is shown. In this example, the graph 1100 is similar to thegraph 900 inFIG. 9 . The graph 1100 is generated from the imported lists of three accessed entities 1101, 1102 and 1103. Thesearch tool 1150 may provide any form of user interface element that can receive the input of the user from the input device 214. For instance, thesearch tool 1150 may provide a text box into which a user can type alphanumeric characters such as a word or words. - At
operation 1020, thenode data engine 230 may search in the profiles of the imported entities in the generated graph 1100 which have an entry that matches with the keyword input in thesearch tool 1150. This is performed by searching the information in the profiles as stored in the working volatile memory allocated to thebrowser 1632. Atoperation 1030, if one or more profiles are found to have the same text as the input text, the method proceeds tooperation 1040. Here, the corresponding nodes in the graph 1100 are highlighted via thevisualization engine 240. If not, the result of search is reported atoperation 1050. In the example ofFIG. 11 , five entities 1104, 1105, 1106, 1107 and 1108, are highlighted as a result of the search for the keyword ‘E City.’ -
FIG. 12 is a flowchart showing amethod 1200 performed by thenetwork graph parser 234 of theelectronic device 110 to filter the data associated with the entities in a plotted graph to produce a reduced graph, according to some example embodiments.FIG. 13 shows examples ofgraphs 1300 generated by themethod 600 and examples of reduced graphs 1310 (e.g., a refined visual representation) and 1320 generated by theexemplary method 1200. - The filter instruction may be provided in response to a user input selecting a filter option, for instance through interaction with a user interface element in a sidebar, dock, pull-down menu etc. If the number of entities displayed in the
graph 1300 is large, the graph may be of limited use to an analyst. Thefiltering method 1200 allows the isolation of the most significant entities and the removal of less significant entities. Such operation of filtering or reducing data may lead to more efficient, focused and targeted approach in repository website user analysis. This applies to analysis using thenetwork graph parser 234 and to subsequent analysis after export to thedatabase system 10. Furthermore, trimming the graph before exporting data to thedatabase system 10 may prevent the personal profile data of only marginally relevant or irrelevant individuals unnecessarily entering into thedatabase system 10 for analysis. It may also provide regulation compliance advantages since information relating to fewer entities is imported into thedatabase system 10. - At
operation 1210, theinterface engine 210 generates auser interface element 1350 configured to receive a user input specifying a connection parameter, such as a minimum number of links that is of interest to the user (e.g., a level of connectedness). Limiting the minimum number of links may assist in selecting the entities with the most meaningful connections in the network represented in thegraph 1300. Theuser interface element 1350 may receive the user input via theinput device 1614 or thecursor control 1616. - At
operation 1220, thenode data engine 230 identifies the entities linked to other entities by the number of connections specified by the user input atoperation 1210. All of the connections inFIG. 13 correspond to links to one of the selectedentities FIG. 13 only corresponds to the number of connections to the user-selectedentities - In the example of
FIG. 13A , the maximum number of links between entities is three. Therefore, the user input may be “2” or both “2” and “3”. The user input of both “2” and “3”, as shown inFIG. 13A , may instruct thenode data engine 230 to identify the entities with two and three links to selected entities. The user input of “3”, as shown inFIG. 13B , may cause thenetwork graph parser 234 to identify only the entities with links to three selected entities. Returning toFIG. 12 , atoperation 1230, thenode data engine 230 searches the nodes (e.g., underlying node data in JSON format) corresponding to the identified entities. InFIG. 13A , as a result of search in this operation, the nodes corresponding to entities having two and three links with the user-selectedentities nodes - In
FIG. 13B , the entities having two links with the user-selectedentities node 1305, have been highlighted as empty circles. Returning toFIG. 12 , atoperation 1240, the entities that are not identified atoperation 1230 and that are not the user-selectedentities graph 1300, according to some example embodiments. This may be achieved by thenetwork graph parser 234 receiving a user input to ‘inverse select’ the other entities that are not highlighted Atoperation 1230, and then receiving an input to delete the selected nodes/entities, the delete input being received via theinput device 1614 or thecursor control 1616. Alternatively, thenetwork graph parser 234 may receive a user input (e.g., a filter instruction) to remove all the entities except the highlighted entities atoperation 1230 and the user-selectedentities -
FIG. 13A shows an example of agraph 1310 reduced from thegraph 1300 according to themethod 1200. In theuser interface element 1350, “2” and “3” links have been specified by the user and thegraph 1310 shows only the user-selectedentities nodes FIG. 13B , thegraph 1320 shows an example of a graph trimmed from thegraph 1300 according to theexemplary method 1200. In theuser interface element 1350, “3” links have been specified and thegraph 1320 shows only the user-selectedentities nodes 1305. - In case the profile description information have been imported along with the entities in the
graph 1300, they may be removed along with the entity atoperation 1240. Afteroperation 1240, the reducedgraphs database system 10 viaexport engine 250. Though visual graphs are depicted inFIGS. 13A and 13B , it is appreciated that the operations may first be performed on the underlying data used to generate the graphs. That is, thegraph 1300 may be generated from initial node data collected from a connections page. A connection parameter may be received from the user that specifies the number of connections required to remain in the node data. Nodes not meeting the attribute specified by the connection parameter are removed. The resulting refined node dataset is then used to generategraph 1310. -
FIG. 14 is a flowchart representing anexemplary method 1400 performed by anelectronic device 110 to export the data associated with the entities in the reducedgraphs operation 350 discussed above in relation toFIG. 3 . Atoperation 1410, thenetwork graph parser 234 may receive a user input which instructs thenetwork graph parser 234 to export the reducedgraphs graphs - At
operation 1420, theinterface engine 210 receives a user input specifying an analysis description. The analysis description may be free text. It may relate to the origin, the history and the description of the data and the details regarding the repository website analysis performed. The analysis description may assist in generating trails such that it can be monitored that the performed analysis complies with any rules or regulations that may be relevant in the specific field of analysis. The analysis description also may be useful in case multiple sets of reduced and processed graphs are generated from different starting accessed entities, for example. If a specific entities appear in multiple sets of graphs, the analysis description of each graph may provide additional information therefore provide compounding value of multiple investigations. - At
operation 1430, thenetwork graph parser 234 may export the data to thedatabase system 10 viaexport engine 250.Operation 1430 may involve exporting data relating to entities corresponding to nodes displayed in the graph to thedatabase system 10 without exporting data relating to entities corresponding to nodes not displayed in the graph. In thedatabase system 10, the reduced graph and the associated data may be transformed according to the specific ontology of the deployment for further analysis. - Various modification and alternatives will be apparent to the person skilled in the art and all such modifications and alternatives are intended to be encompassed with the claims Some such modifications and alternatives will now be described.
- Although in the above, the profiles for the user-selected entities are sourced from the same electronic repository website service provider, the scope is not limited to this. In other embodiments, profiles for an entity may be retrieved from two or more different repository servers 130-1 to 130-n. In this case, the entity would ordinarily have different identities or usernames on the different electronic repository websites. However, the profiles can be determined by the
network graph parser 234 to be related to the same entity by information included in either profile or in both profiles, or may be entered into thenetwork graph parser 234 by the user of thenetwork graph parser 234. Alternatively or in addition, two or more different entities from differentelectronic repository servers 130 may be selected by the user of thenetwork graph parser 234 as seed entities. In this case, information in profiles for linked to entities may be used to connect profiles in one or more of the repository servers (e.g., repository server 130-1) to corresponding profiles for the same entities in another repository server (e.g., repository server 130-2). - In the above, when an entity is selected for analysis, all of the entities linked to by that profile are retrieved from the
electronic repository server 130 and displayed in a graph. Alternatively, a user may specify a limit on the number of entities that are to be retrieved from theelectronic repository server 130 by thenetwork graph parser 234 and displayed in a graph. This may be globally set as a setting by the plugin, or it may be selected or entered by the user at the time of selecting the entity. In the above, the histogram is formed from same information in same fields or profiles. Alternatively or in addition, information such as geotag information from photos, comments, mentions, replies, and/or such like. -
FIG. 15A shows anexample browser 1500 for parsing node data using thenetwork graph parser 234, according to some example embodiments. In the example ofFIG. 15A , an analyst user navigates to the user profile of a user on a code repository website. For example, the analyst user navigates to the URL 1505 (“repository/joan.labrador/”), which is a projects webpage of the software developer “Joan Labrador”. In some example embodiments, the analyst user is a user attempting to identify patterns between software projects and the software developer is a user that uploads the source code to a project webpage of a given software project. - The projects webpage displays the user's uploaded software or
project data 1510 as display elements (e.g., boxes, static text, hyperlinks). The title for each of the projects may contain a hyperlink that links to the project page for the corresponding project. For example, in the first listed project, “Smartwatch Exercise App” may be a hyperlink that links to a project page for that project. The project page for “Smartwatch Exercise App” may display source code uploaded by the software developer “Joan Labrador”. The project page may further contain links to the user profile pages of the seventeen developers that work on that project. - The projects webpage is received as HTTP data from the
repository server 130. The webpage is generated from underlying source code in a format, such as HTML. To initiate parsing, the analyst user selects aplugin button 1515 which, as displayed, is integrated into thebrowser 1500. Responsive to the selection, theinterface engine 210 displays apopup window 1520 having different parse options. According to some example embodiments, the first option “Graph” parses all users associated with the user “Joan Labrador” and creates a visualization from the data as discussed above. The second option “Add to graph” adds Joan Labrador as a second entity. For example, the analyst user may have selected a first user to parse (e.g., collect node data of related developers), and then want to select Joan Labrador as a further entity to parse (e.g., collect node data of developers related to Joan Labrador to add to the graph). - Assuming, to continue the example, the data analyst selects the first option “Graph”, the
network graph parser 234 parses the source code that generates the projects webpage to extract node data from Joan's projects as discussed above. For example, the parseengine 220 can identify each of Joan's projects, including (1) “Smartwatch Exercise App”, (2) Java Note Taking client”, and (3) “Acme Corp. Enterprise CRM System”. The parse engine can navigate to the project page for each of the projects to identify users associated with Joan. For example, the parseengine 220 can user the hyperlink “Smartwatch Exercise App” to navigate to the project page for that project. Further, the parse engine can then identify user profile links on the project page (e.g., the 17 developers working on the “Smartwatch Exercise App” project) and navigate to the user pages to collect node data such as user name, profile page URL, for each of the associated users. The parse engine may perform similar operations to collect node data for the users associated with the other two code projects. The resulting data can then be used to generate visualizations, as shown inFIG. 15B . - In
FIG. 15B , displays auser interface 1550 showing avisualization 1555 generated from the node data of users associated with Joan Labrador through one or more coding or software projects. Each circle or node corresponds to a user associated with Joan through a project. Theuser interface 1550 may open in a second tab of thebrowser 1550. As illustrated, theuser interface 1550 includes a main area in which the visualization is displayed, and aright bar area 1570. For example, selecting one of the buttons may display the user interface element 1350 (FIG. 13B ) which the analyst user can use to specify a connection parameter. Father, as illustrated in the example ofFIG. 15B , theright bar area 1570 can be used to show parsednode data 1557 of the selected entity “Joan Labrador.” The parsednode data 1557 may be parsed or extracted from the underlying source code of the webpage displayed inFIG. 15A (e.g. a user profile page). According to some example embodiments, if a user select a node from thevisualization 1555, the corresponding node data for the node is shown in theright bar area 1570. - Further, according to some example embodiments, the right bar area may be used to show other types of visualizations, such as the
histogram 990, instead of the node data. The analyst can then user the histogram to select groups to modify thevisualization 1555. In some example embodiments, the network graph parser spiders to one or more hyperlink for each users listed in a project page and to collect parsed node data similar to Joan's parsednode data 1557. -
FIG. 16 is a block diagram that illustrates acomputer system 1600, which may constitute theelectronic device 110, according to some example embodiments. As illustrated,computer system 1600 includes abus 1602 or other communication mechanism for communicating information, and one or more hardware processors 1604 (including processor circuitry), coupled withbus 1602 for processing information. One ormore hardware processors 1604 can be, for example, one or more general purpose microprocessors, each including processor circuitry.Computer system 1600 also includes amain memory 1606, such as a random access memory (RAM) or other dynamic storage device, coupled tobus 1602 for storing information and instructions to be executed byprocessor 1604. -
Main memory 1606 also can be used for storing temporary variables or other intermediate information during execution of instructions to be executed byprocessor 1604. Such instructions, when stored in non-transitory storage media accessible to one ormore processors 1604, rendercomputer system 1600 into a special-purpose machine that is customized to perform the operations specified in the instructions.Main memory 1606 may also be used for temporarily storing the whole of part of applications, such as theweb browser 1632, including thenetwork graph parser 234, while they are being executed by theelectronic device 110. As illustrated inFIG. 2 , thenetwork graph parser 234 may be integrated or installed into theweb browser 1632. For example, thenetwork graph parser 234 may be installed as a plugin or extension of theweb browser 1632. - The
main memory 1606 is a volatile memory in that data stored therein is lost when power is no longer provided to thememory 1606. Themain memory 1606 is used to temporarily store information that is being processed by software applications, including theweb browser 1632 and thenetwork graph parser 234. In relation to theweb browser 1632 and thenetwork graph parser 234, information that is temporarily stored includes webpages and ancillary content that is received from the repository servers 130-1 to 130-n. In relation to theweb browser 1632 and thenetwork graph parser 234, information that is temporarily stored also includes information parsed from webpages by thenetwork graph parser 234 and information derived from such received information by the plugin, as is described in detail below. -
Computer system 1600 further includes a read only memory (ROM) 1608 or other static storage device coupled tobus 1602 for storing static information and instructions forprocessor 1604. TheROM 1608 is used for permanent storage of applications such as theweb browser 1632, including thenetwork graph parser 234, when the electronic device is not powered on and/or when the applications are not being executed by theprocessor 1604. The storage is of the computer code or instructions that constitute the applications. A storage device 1610, such as a magnetic disk, optical disk, or USB thumb drive (Flash drive), etc., is provided and coupled tobus 1602 for storing information and instructions. -
Computer system 1600 can be coupled viabus 1602 to adisplay 1612, such as an LCD or plasma display, or a touchscreen or cathode ray tube (CRT), for displaying information to a computer user. Aninput device 1614, for instance a keyboard, including alphanumeric and other keys, is coupled tobus 1602 for communicating information and command selections toprocessor 1604. Another type of user input device iscursor control 1616, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections toprocessor 1604 and for controlling cursor movement ondisplay 1612. In some embodiments, the same direction information and command selections as cursor control may be implemented via receiving touches on a touch screen without a cursor. It will be appreciated that theprocessor 1604, under control of software and/or operating system, causes display of graphics and text, and that thedisplay 1612 displays such. Displaying a graph comprises displaying a graphical representation. - The term “non-transitory media” as used herein refers to any media storing data and/or instructions that cause a machine to operate in a specific fashion. Such non-transitory media can comprise non-volatile media and/or volatile media. Non-volatile media includes, for example, optical or magnetic disks, such as storage device 1610. Volatile media includes dynamic memory, such as
main memory 1606. Common forms of non-transitory media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge, and networked versions of the same. - Non-transitory media is distinct from, but can be used in conjunction with, transmission media. Transmission media participates in transferring information between storage media. For example, transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise
bus 1602. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications. Various forms of media can be involved in carrying one or more sequences of one or more instructions toprocessor 1604 for execution. For example, the instructions can initially be carried on a magnetic disk or solid state drive of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local tocomputer system 1600 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data onbus 1602.Bus 1602 carries the data to main memory 206, from whichprocessor 1604 retrieves and executes the instructions. The instructions received bymain memory 1606 can optionally be stored on storage device 1610 either before or after execution byprocessor 1604. -
Computer system 1600 also includes acommunication interface 1618 coupled tobus 1602.Communication interface 1618 provides a two-way data communication coupling to a network link 1621 that is connected to alocal network 1622. For example,communication interface 1618 can be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line. As another example,communication interface 1618 can be a local area network (LAN) card to provide a data communication connection to a compatible LAN. Wireless links can also be implemented. In any such implementation,communication interface 1618 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information. - Network link 1621 typically provides data communication through one or more networks to other data devices. For example, network link 1621 can provide a connection through
local network 1622 to data equipment operated by an Internet Service Provider (ISP) 1626.ISP 1626 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 1628.Local network 1622 andInternet 1628 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link 1621 and throughcommunication interface 1618, which carry the digital data to and fromcomputer system 1600, are example forms of transmission media. -
Computer system 1600 can send messages and receive data, including program code, through the network(s), network link 1621 andcommunication interface 1618. In the Internet example, a server 1627 might transmit a requested code for an application program throughInternet 1628,ISP 1626,local network 1622 andcommunication interface 1618. The received code can be executed byprocessor 1604 as it is received, and/or stored in storage device 1610, or other non-volatile storage for later execution. - The
network graph parser 234 is integrated into theweb browser 1632 to form part of theweb browser 1632. The user can first download thenetwork graph parser 234 from an appropriate web site or other source (e.g. portable storage such as a thumb drive or a storage device on a local network) and then can proceed to install thenetwork graph parser 234. Since a typicalnetwork graph parser 234 is designed to be compatible to a specific web browser 1632 (e.g., Google™ Chrome™, Mozilla™ Firefox™, Microsoft™ Internet Explorer™, etc.), thenetwork graph parser 234 can become a part of theweb browser 1632 automatically after thenetwork graph parser 234 is installed. - Above, various actions are described as being performed by the
network graph parser 234 and/or theweb browser 1632. It will be appreciated that this is shorthand for computer program instructions that form part of thenetwork graph parser 234 or thebrowser 1632, as the case may be, being executed by theprocessor 1604 and causing theprocessor 1604 to take the action. In doing so, some or all of the computer code/instructions constituting the network graph parser 1634 and thebrowser 1632 are copied from theROM 1608 and stored in the main memory 206, which is a volatile memory, such that the computer code/instructions constituting thenetwork graph parser 234 and thebrowser 1632 can be executed by theprocessor 1604. In executing the computer code/instructions constituting thenetwork graph parser 234 and thebrowser 1632, the processor 204 is controlled to store data (other than the computer code/instructions constituting thenetwork graph parser 234 and the browser 1632) temporarily in themain memory 1606. As mentioned above, themain memory 1606 is volatile memory and as such data stored therein is lost when themain memory 1606 is de-powered. - Certain embodiments are described herein as including logic or a number of components, modules, or engines. Engines can constitute either software engines (e.g., code embodied on a machine-readable medium) or hardware engines. A “hardware module” is a tangible unit capable of performing certain operations and can be configured or arranged in a certain physical manner. In various example embodiments, one or more computer systems (e.g., a standalone computer system, a client computer system, or a server computer system) or one or more hardware engines of a computer system (e.g., a processor or a group of processors) can be configured by software (e.g., an application or application portion) as a hardware module that operates to perform certain operations as described herein.
- In some embodiments, a hardware engines can be implemented mechanically, electronically, or any suitable combination thereof. For example, a hardware engines can include dedicated circuitry or logic that is permanently configured to perform certain operations. For example, a hardware engines can be a special-purpose processor, such as a Field-Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC). A hardware engines may also include programmable logic or circuitry that is temporarily configured by software to perform certain operations. For example, a hardware engines can include software executed by a general-purpose processor or other programmable processor. Once configured by such software, hardware modules become specific machines (or specific components of a machine) uniquely tailored to perform the configured functions and are no longer general-purpose processors. It will be appreciated that the decision to implement a hardware engines mechanically, in dedicated and permanently configured circuitry, or in temporarily configured circuitry (e.g., configured by software) can be driven by cost and time considerations.
- Accordingly, the phrase “hardware engine” should be understood to encompass a tangible entity, be that an entity that is physically constructed, permanently configured (e.g., hardwired), or temporarily configured (e.g., programmed) to operate in a certain manner or to perform certain operations described herein. As used herein, “hardware-implemented engine” refers to a hardware module. Considering embodiments in which hardware engines are temporarily configured (e.g., programmed), each of the hardware modules need not be configured or instantiated at any one instance in time. For example, where a hardware module comprises a general-purpose processor configured by software to become a special-purpose processor, the general-purpose processor may be configured as respectively different special-purpose processors (e.g., comprising different hardware modules) at different times. Software accordingly configures a particular processor or processors, for example, to constitute a particular hardware module at one instance of time and to constitute a different hardware module at a different instance of time.
- Hardware modules can provide information to, and receive information from, other hardware modules. Accordingly, the described hardware modules can be regarded as being communicatively coupled. Where multiple hardware modules exist contemporaneously, communications can be achieved through signal transmission (e.g., over appropriate circuits and buses) between or among two or more of the hardware modules. In embodiments in which multiple hardware modules are configured or instantiated at different times, communications between such hardware modules may be achieved, for example, through the storage and retrieval of information in memory structures to which the multiple hardware modules have access. For example, one hardware module can perform an operation and store the output of that operation in a memory device to which it is communicatively coupled. A further hardware module can then, at a later time, access the memory device to retrieve and process the stored output. Hardware modules can also initiate communications with input or output devices, and can operate on a resource (e.g., a collection of information).
- The various operations of example methods described herein can be performed, at least partially, by one or more processors that are temporarily configured (e.g., by software) or permanently configured to perform the relevant operations. Whether temporarily or permanently configured, such processors constitute processor-implemented modules that operate to perform one or more operations or functions described herein. As used herein, “processor-implemented module” refers to a hardware module implemented using one or more processors.
- Similarly, the methods described herein can be at least partially processor-implemented, with a particular processor or processors being an example of hardware. For example, at least some of the operations of a method can be performed by one or more processors or processor-implemented modules. Moreover, the one or more processors may also operate to support performance of the relevant operations in a “cloud computing” environment or as a “software as a service” (SaaS). For example, at least some of the operations may be performed by a group of computers (as examples of machines including processors), with these operations being accessible via a network (e.g., the Internet) and via one or more appropriate interfaces (e.g., an Application Program Interface (API)).
- The performance of certain of the operations may be distributed among the processors, not only residing within a single machine, but deployed across a number of machines. In some example embodiments, the processors or processor-implemented modules can be located in a single geographic location (e.g., within a home environment, an office environment, or a server farm). In other example embodiments, the processors or processor-implemented modules are distributed across a number of geographic locations.
- The modules, methods, applications and so forth described in conjunction with
FIGS. 1-15 are implemented in some embodiments in the context of a machine and an associated software architecture. The sections below describe representative software architecture and machine (e.g., hardware) architecture that are suitable for use with the disclosed embodiments. - Throughout this specification, plural instances may implement components, operations, or structures described as a single instance. Although individual operations of one or more methods are illustrated and described as separate operations, one or more of the individual operations may be performed concurrently, and nothing requires that the operations be performed in the order illustrated. Structures and functionality presented as separate components in example configurations may be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component may be implemented as separate components. These and other variations, modifications, additions, and improvements fall within the scope of the subject matter herein.
- Although an overview of the inventive subject matter has been described with reference to specific example embodiments, various modifications and changes may be made to these embodiments without departing from the broader scope of embodiments of the present disclosure. Such embodiments of the inventive subject matter may be referred to herein, individually or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single disclosure or inventive concept if more than one is, in fact, disclosed.
- The embodiments illustrated herein are described in sufficient detail to enable those skilled in the art to practice the teachings disclosed. Other embodiments may be used and derived therefrom, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. The Detailed Description, therefore, is not to be taken in a limiting sense, and the scope of various embodiments is defined only by the appended claims, along with the full range of equivalents to which such claims are entitled.
- As used herein, the term “or” may be construed in either an inclusive or exclusive sense. Moreover, plural instances may be provided for resources, operations, or structures described herein as a single instance. Additionally, boundaries between various resources, operations, modules, engines, and data stores are somewhat arbitrary, and particular operations are illustrated in a context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within a scope of various embodiments of the present disclosure. In general, structures and functionality presented as separate resources in the example configurations may be implemented as a combined structure or resource. Similarly, structures and functionality presented as a single resource may be implemented as separate resources. These and other variations, modifications, additions, and improvements fall within a scope of embodiments of the present disclosure as represented by the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
Claims (21)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/654,048 US20200133986A1 (en) | 2017-01-19 | 2019-10-16 | Network graph parser |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762448081P | 2017-01-19 | 2017-01-19 | |
US15/642,820 US10509844B1 (en) | 2017-01-19 | 2017-07-06 | Network graph parser |
US16/654,048 US20200133986A1 (en) | 2017-01-19 | 2019-10-16 | Network graph parser |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/642,820 Continuation US10509844B1 (en) | 2017-01-19 | 2017-07-06 | Network graph parser |
Publications (1)
Publication Number | Publication Date |
---|---|
US20200133986A1 true US20200133986A1 (en) | 2020-04-30 |
Family
ID=68841359
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/642,820 Active 2037-10-04 US10509844B1 (en) | 2017-01-19 | 2017-07-06 | Network graph parser |
US16/654,048 Abandoned US20200133986A1 (en) | 2017-01-19 | 2019-10-16 | Network graph parser |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/642,820 Active 2037-10-04 US10509844B1 (en) | 2017-01-19 | 2017-07-06 | Network graph parser |
Country Status (1)
Country | Link |
---|---|
US (2) | US10509844B1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113221015A (en) * | 2021-07-06 | 2021-08-06 | 深圳市炆石数据有限公司 | Homologous user determination and homologous network construction method, system and storage medium |
WO2022105682A1 (en) * | 2020-11-18 | 2022-05-27 | 北京字节跳动网络技术有限公司 | Paging data acquisition method and apparatus, electronic device, and computer readable storage medium |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB201805067D0 (en) * | 2018-03-28 | 2018-05-09 | Benevolentai Tech Limited | Search tool using a relationship tree |
US11132403B2 (en) | 2019-09-06 | 2021-09-28 | Digital Asset Capital, Inc. | Graph-manipulation based domain-specific execution environment |
US20210073286A1 (en) * | 2019-09-06 | 2021-03-11 | Digital Asset Capital, Inc. | Multigraph verification |
US10990879B2 (en) | 2019-09-06 | 2021-04-27 | Digital Asset Capital, Inc. | Graph expansion and outcome determination for graph-defined program states |
US12118021B1 (en) * | 2024-03-30 | 2024-10-15 | Integer, Llc | Research and investigation systems incorporating graph databases |
CN119065562B (en) * | 2024-11-05 | 2025-01-24 | 南京涵韬信息科技有限公司 | Data export method, system, equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US176609A (en) * | 1876-04-25 | Improvement in stoves for burning oil | ||
US20050076054A1 (en) * | 2003-10-07 | 2005-04-07 | Moon Billy Gayle | Arrangement for autonomous mobile network nodes to organize a wireless mobile network based on detected physical and logical changes |
US20080140655A1 (en) * | 2004-12-15 | 2008-06-12 | Hoos Holger H | Systems and Methods for Storing, Maintaining and Providing Access to Information |
US20090265475A1 (en) * | 2005-08-30 | 2009-10-22 | Matsushita Electric Industrial Co., Ltd. | Content delivery method, content delivery server, communication terminal apparatus, and content delivery system |
US20160210767A1 (en) * | 2012-03-21 | 2016-07-21 | Zymeworks Inc. | Systems and methods for making two dimensional graphs of complex molecules |
US9978162B1 (en) * | 2015-03-31 | 2018-05-22 | Cisco Technology, Inc. | Rules-based causality visualization framework |
Family Cites Families (386)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2104226A1 (en) | 1991-03-12 | 1992-09-13 | Evan M. Rothfield | Database management system graphical query front end |
US5426747A (en) | 1991-03-22 | 1995-06-20 | Object Design, Inc. | Method and apparatus for virtual memory mapping and transaction management in an object-oriented database system |
US5428737A (en) | 1991-10-16 | 1995-06-27 | International Business Machines Corporation | Comprehensive bilateral translation between SQL and graphically depicted queries |
JPH0689307A (en) | 1992-05-04 | 1994-03-29 | Internatl Business Mach Corp <Ibm> | Device and method for displaying information in database |
JP2710548B2 (en) | 1993-03-17 | 1998-02-10 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Method for retrieving data and converting between Boolean algebraic and graphic representations |
US5918225A (en) | 1993-04-16 | 1999-06-29 | Sybase, Inc. | SQL-based database system with improved indexing methodology |
US5794228A (en) | 1993-04-16 | 1998-08-11 | Sybase, Inc. | Database system with buffer manager providing per page native data compression and decompression |
US5794229A (en) | 1993-04-16 | 1998-08-11 | Sybase, Inc. | Database system with methodology for storing a database table by vertically partitioning all columns of the table |
US5911138A (en) | 1993-06-04 | 1999-06-08 | International Business Machines Corporation | Database search facility having improved user interface |
US5608899A (en) | 1993-06-04 | 1997-03-04 | International Business Machines Corporation | Method and apparatus for searching a database by interactively modifying a database query |
US5613105A (en) | 1993-06-30 | 1997-03-18 | Microsoft Corporation | Efficient storage of objects in a file system |
US6877137B1 (en) | 1998-04-09 | 2005-04-05 | Rose Blush Software Llc | System, method and computer program product for mediating notes and note sub-notes linked or otherwise associated with stored or networked web pages |
US5742806A (en) | 1994-01-31 | 1998-04-21 | Sun Microsystems, Inc. | Apparatus and method for decomposing database queries for database management system including multiprocessor digital data processing system |
US5560005A (en) | 1994-02-25 | 1996-09-24 | Actamed Corp. | Methods and systems for object-based relational distributed databases |
US5542089A (en) | 1994-07-26 | 1996-07-30 | International Business Machines Corporation | Method and apparatus for estimating the number of occurrences of frequent values in a data set |
US5515488A (en) | 1994-08-30 | 1996-05-07 | Xerox Corporation | Method and apparatus for concurrent graphical visualization of a database search and its search history |
US6321274B1 (en) | 1996-06-28 | 2001-11-20 | Microsoft Corporation | Multiple procedure calls in a single request |
US5870559A (en) | 1996-10-15 | 1999-02-09 | Mercury Interactive | Software system and associated methods for facilitating the analysis and management of web sites |
US6430305B1 (en) | 1996-12-20 | 2002-08-06 | Synaptics, Incorporated | Identity verification methods |
US5857329A (en) | 1997-03-14 | 1999-01-12 | Deere & Company | One-piece combined muffler exhaust outlet and exhaust gas deflector |
US6026233A (en) | 1997-05-27 | 2000-02-15 | Microsoft Corporation | Method and apparatus for presenting and selecting options to modify a programming language statement |
US6208985B1 (en) | 1997-07-09 | 2001-03-27 | Caseventure Llc | Data refinery: a direct manipulation user interface for data querying with integrated qualitative and quantitative graphical representations of query construction and query result presentation |
US6236994B1 (en) | 1997-10-21 | 2001-05-22 | Xerox Corporation | Method and apparatus for the integration of information and knowledge |
US7168039B2 (en) | 1998-06-02 | 2007-01-23 | International Business Machines Corporation | Method and system for reducing the horizontal space required for displaying a column containing text data |
US6178519B1 (en) | 1998-12-10 | 2001-01-23 | Mci Worldcom, Inc. | Cluster-wide database system |
KR100313198B1 (en) | 1999-03-05 | 2001-11-05 | 윤덕용 | Multi-dimensional Selectivity Estimation Using Compressed Histogram Information |
US7418399B2 (en) | 1999-03-10 | 2008-08-26 | Illinois Institute Of Technology | Methods and kits for managing diagnosis and therapeutics of bacterial infections |
US6560774B1 (en) | 1999-09-01 | 2003-05-06 | Microsoft Corporation | Verifier to check intermediate language |
WO2001022285A2 (en) | 1999-09-21 | 2001-03-29 | Borthwick Andrew E | A probabilistic record linkage model derived from training data |
US6745382B1 (en) | 2000-04-13 | 2004-06-01 | Worldcom, Inc. | CORBA wrappers for rules automation technology |
US8386945B1 (en) | 2000-05-17 | 2013-02-26 | Eastman Kodak Company | System and method for implementing compound documents in a production printing workflow |
GB2366498A (en) | 2000-08-25 | 2002-03-06 | Copyn Ltd | Method of bookmarking a section of a web-page and storing said bookmarks |
US6795868B1 (en) | 2000-08-31 | 2004-09-21 | Data Junction Corp. | System and method for event-driven data transformation |
US20020065708A1 (en) | 2000-09-22 | 2002-05-30 | Hikmet Senay | Method and system for interactive visual analyses of organizational interactions |
US8707185B2 (en) | 2000-10-10 | 2014-04-22 | Addnclick, Inc. | Dynamic information management system and method for content delivery and sharing in content-, metadata- and viewer-based, live social networking among users concurrently engaged in the same and/or similar content |
US8117281B2 (en) | 2006-11-02 | 2012-02-14 | Addnclick, Inc. | Using internet content as a means to establish live social networks by linking internet users to each other who are simultaneously engaged in the same and/or similar content |
US6976024B1 (en) | 2000-10-12 | 2005-12-13 | International Buisness Machines Corporation | Batch submission API |
US6754640B2 (en) | 2000-10-30 | 2004-06-22 | William O. Bozeman | Universal positive pay match, authentication, authorization, settlement and clearing system |
US6857120B1 (en) | 2000-11-01 | 2005-02-15 | International Business Machines Corporation | Method for characterizing program execution by periodic call stack inspection |
US6978419B1 (en) | 2000-11-15 | 2005-12-20 | Justsystem Corporation | Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments |
US7370040B1 (en) | 2000-11-21 | 2008-05-06 | Microsoft Corporation | Searching with adaptively configurable user interface and extensible query language |
US20020103705A1 (en) | 2000-12-06 | 2002-08-01 | Forecourt Communication Group | Method and apparatus for using prior purchases to select activities to present to a customer |
US7529698B2 (en) | 2001-01-16 | 2009-05-05 | Raymond Anthony Joao | Apparatus and method for providing transaction history information, account history information, and/or charge-back information |
US7299202B2 (en) | 2001-02-07 | 2007-11-20 | Exalt Solutions, Inc. | Intelligent multimedia e-catalog |
US20100057622A1 (en) | 2001-02-27 | 2010-03-04 | Faith Patrick L | Distributed Quantum Encrypted Pattern Generation And Scoring |
US7499922B1 (en) | 2001-04-26 | 2009-03-03 | Dakota Software Corp. | Information retrieval system and method |
US6980984B1 (en) | 2001-05-16 | 2005-12-27 | Kanisa, Inc. | Content provider systems and methods using structured data |
US7877421B2 (en) | 2001-05-25 | 2011-01-25 | International Business Machines Corporation | Method and system for mapping enterprise data assets to a semantic information model |
US7155728B1 (en) | 2001-06-28 | 2006-12-26 | Microsoft Corporation | Remoting features |
US7100147B2 (en) | 2001-06-28 | 2006-08-29 | International Business Machines Corporation | Method, system, and program for generating a workflow |
AU2002316479A1 (en) | 2001-07-03 | 2003-01-21 | Altaworks Corporation | System and methods for monitoring performance metrics |
US20030023620A1 (en) | 2001-07-30 | 2003-01-30 | Nicholas Trotta | Creation of media-interaction profiles |
US7461077B1 (en) | 2001-07-31 | 2008-12-02 | Nicholas Greenwood | Representation of data records |
US7028223B1 (en) | 2001-08-13 | 2006-04-11 | Parasoft Corporation | System and method for testing of web services |
US7165101B2 (en) | 2001-12-03 | 2007-01-16 | Sun Microsystems, Inc. | Transparent optimization of network traffic in distributed systems |
US7519589B2 (en) | 2003-02-04 | 2009-04-14 | Cataphora, Inc. | Method and apparatus for sociological data analysis |
WO2003081376A2 (en) | 2002-03-20 | 2003-10-02 | Catalina Marketing International Inc. | Targeted incentives based upon predicted behavior |
US20050021397A1 (en) | 2003-07-22 | 2005-01-27 | Cui Yingwei Claire | Content-targeted advertising using collected user behavior data |
US7533026B2 (en) | 2002-04-12 | 2009-05-12 | International Business Machines Corporation | Facilitating management of service elements usable in providing information technology service offerings |
US20040126840A1 (en) | 2002-12-23 | 2004-07-01 | Affymetrix, Inc. | Method, system and computer software for providing genomic ontological data |
US20040012633A1 (en) | 2002-04-26 | 2004-01-22 | Affymetrix, Inc., A Corporation Organized Under The Laws Of Delaware | System, method, and computer program product for dynamic display, and analysis of biological sequence data |
US7127467B2 (en) | 2002-05-10 | 2006-10-24 | Oracle International Corporation | Managing expressions in a database system |
US8244895B2 (en) | 2002-07-15 | 2012-08-14 | Hewlett-Packard Development Company, L.P. | Method and apparatus for applying receiving attributes using constraints |
GB0221257D0 (en) | 2002-09-13 | 2002-10-23 | Ibm | Automated testing |
US7383513B2 (en) | 2002-09-25 | 2008-06-03 | Oracle International Corporation | Graphical condition builder for facilitating database queries |
GB0224589D0 (en) | 2002-10-22 | 2002-12-04 | British Telecomm | Method and system for processing or searching user records |
US20040088177A1 (en) | 2002-11-04 | 2004-05-06 | Electronic Data Systems Corporation | Employee performance management method and system |
US7546607B2 (en) | 2002-11-19 | 2009-06-09 | Microsoft Corporation | Native code exposing virtual machine managed object |
US7243093B2 (en) | 2002-11-27 | 2007-07-10 | International Business Machines Corporation | Federated query management |
US20040111480A1 (en) | 2002-12-09 | 2004-06-10 | Yue Jonathan Zhanjun | Message screening system and method |
US8589273B2 (en) | 2002-12-23 | 2013-11-19 | Ge Corporate Financial Services, Inc. | Methods and systems for managing risk management information |
US7752117B2 (en) | 2003-01-31 | 2010-07-06 | Trading Technologies International, Inc. | System and method for money management in electronic trading environment |
US20040153418A1 (en) | 2003-02-05 | 2004-08-05 | Hanweck Gerald Alfred | System and method for providing access to data from proprietary tools |
US7099888B2 (en) | 2003-03-26 | 2006-08-29 | Oracle International Corporation | Accessing a remotely located nested object |
US8386377B1 (en) | 2003-05-12 | 2013-02-26 | Id Analytics, Inc. | System and method for credit scoring using an identity network connectivity |
US9607092B2 (en) | 2003-05-20 | 2017-03-28 | Excalibur Ip, Llc | Mapping method and system |
US7369912B2 (en) | 2003-05-29 | 2008-05-06 | Fisher-Rosemount Systems, Inc. | Batch execution engine with independent batch execution processes |
US7620648B2 (en) | 2003-06-20 | 2009-11-17 | International Business Machines Corporation | Universal annotation configuration and deployment |
US8412566B2 (en) | 2003-07-08 | 2013-04-02 | Yt Acquisition Corporation | High-precision customer-based targeting by individual usage statistics |
US7216133B2 (en) | 2003-07-29 | 2007-05-08 | Microsoft Corporation | Synchronizing logical views independent of physical storage representations |
US20060143075A1 (en) | 2003-09-22 | 2006-06-29 | Ryan Carr | Assumed demographics, predicted behaviour, and targeted incentives |
US7584172B2 (en) | 2003-10-16 | 2009-09-01 | Sap Ag | Control for selecting data query and visual configuration |
WO2005050625A2 (en) | 2003-11-14 | 2005-06-02 | Senvid, Inc. | Managed peer-to-peer applications in a secure network |
US7917376B2 (en) | 2003-12-29 | 2011-03-29 | Montefiore Medical Center | System and method for monitoring patient care |
US20050154769A1 (en) | 2004-01-13 | 2005-07-14 | Llumen, Inc. | Systems and methods for benchmarking business performance data against aggregated business performance data |
US20050154628A1 (en) | 2004-01-13 | 2005-07-14 | Illumen, Inc. | Automated management of business performance information |
US7343552B2 (en) | 2004-02-12 | 2008-03-11 | Fuji Xerox Co., Ltd. | Systems and methods for freeform annotations |
US7085890B2 (en) | 2004-02-19 | 2006-08-01 | International Business Machines Corporation | Memory mapping to reduce cache conflicts in multiprocessor systems |
US20060026120A1 (en) | 2004-03-24 | 2006-02-02 | Update Publications Lp | Method and system for collecting, processing, and distributing residential property data |
US20050226473A1 (en) | 2004-04-07 | 2005-10-13 | Subramanyan Ramesh | Electronic Documents Signing and Compliance Monitoring Invention |
EP2487600A1 (en) | 2004-05-04 | 2012-08-15 | Boston Consulting Group, Inc. | Method and apparatus for selecting, analyzing and visualizing related database records as a network |
CN101288060B (en) | 2004-05-25 | 2012-11-07 | 波斯蒂尼公司 | Electronic message source reputation information system |
EP1769435A1 (en) | 2004-05-25 | 2007-04-04 | Arion Human Capital Limited | Data analysis and flow control system |
US8055672B2 (en) | 2004-06-10 | 2011-11-08 | International Business Machines Corporation | Dynamic graphical database query and data mining interface |
US7617232B2 (en) | 2004-09-02 | 2009-11-10 | Microsoft Corporation | Centralized terminology and glossary development |
US7406592B1 (en) | 2004-09-23 | 2008-07-29 | American Megatrends, Inc. | Method, system, and apparatus for efficient evaluation of boolean expressions |
US7512738B2 (en) | 2004-09-30 | 2009-03-31 | Intel Corporation | Allocating call stack frame entries at different memory levels to functions in a program |
US7366723B2 (en) | 2004-10-05 | 2008-04-29 | Sap Ag | Visual query modeling for configurable patterns |
GB0422750D0 (en) | 2004-10-13 | 2004-11-17 | Ciphergrid Ltd | Remote database technique |
US20060080616A1 (en) | 2004-10-13 | 2006-04-13 | Xerox Corporation | Systems, methods and user interfaces for document workflow construction |
CA2484694A1 (en) | 2004-10-14 | 2006-04-14 | Alcatel | Database ram cache |
US20060129992A1 (en) | 2004-11-10 | 2006-06-15 | Oberholtzer Brian K | Software test and performance monitoring system |
US7797197B2 (en) | 2004-11-12 | 2010-09-14 | Amazon Technologies, Inc. | Method and system for analyzing the performance of affiliate sites |
US7899796B1 (en) | 2004-11-23 | 2011-03-01 | Andrew Borthwick | Batch automated blocking and record matching |
US20060143079A1 (en) | 2004-12-29 | 2006-06-29 | Jayanta Basak | Cross-channel customer matching |
US8700414B2 (en) | 2004-12-29 | 2014-04-15 | Sap Ag | System supported optimization of event resolution |
US7783679B2 (en) | 2005-01-12 | 2010-08-24 | Computer Associates Think, Inc. | Efficient processing of time series data |
US8091784B1 (en) | 2005-03-09 | 2012-01-10 | Diebold, Incorporated | Banking system controlled responsive to data bearing records |
US7483028B2 (en) | 2005-03-15 | 2009-01-27 | Microsoft Corporation | Providing 1D and 2D connectors in a connected diagram |
WO2006102270A2 (en) | 2005-03-22 | 2006-09-28 | Cooper Kim A | Performance motivation systems and methods for contact centers |
US7596528B1 (en) | 2005-03-31 | 2009-09-29 | Trading Technologies International, Inc. | System and method for dynamically regulating order entry in an electronic trading environment |
US7672968B2 (en) | 2005-05-12 | 2010-03-02 | Apple Inc. | Displaying a tooltip associated with a concurrently displayed database object |
US8020110B2 (en) | 2005-05-26 | 2011-09-13 | Weisermazars Llp | Methods for defining queries, generating query results and displaying same |
US7962842B2 (en) | 2005-05-30 | 2011-06-14 | International Business Machines Corporation | Method and systems for accessing data by spelling discrimination letters of link names |
US8161122B2 (en) | 2005-06-03 | 2012-04-17 | Messagemind, Inc. | System and method of dynamically prioritized electronic mail graphical user interface, and measuring email productivity and collaboration trends |
EP1732034A1 (en) | 2005-06-06 | 2006-12-13 | First Data Corporation | System and method for authorizing electronic payment transactions |
US8341259B2 (en) | 2005-06-06 | 2012-12-25 | Adobe Systems Incorporated | ASP for web analytics including a real-time segmentation workbench |
US7571192B2 (en) | 2005-06-15 | 2009-08-04 | Oracle International Corporation | Methods and apparatus for maintaining consistency during analysis of large data sets |
US20070005582A1 (en) | 2005-06-17 | 2007-01-04 | Honeywell International Inc. | Building of database queries from graphical operations |
US20100199167A1 (en) | 2005-06-24 | 2010-08-05 | Justsystems Corporation | Document processing apparatus |
US7991764B2 (en) | 2005-07-22 | 2011-08-02 | Yogesh Chunilal Rathod | Method and system for communication, publishing, searching, sharing and dynamically providing a journal feed |
US20070178501A1 (en) | 2005-12-06 | 2007-08-02 | Matthew Rabinowitz | System and method for integrating and validating genotypic, phenotypic and medical information into a database according to a standardized ontology |
US7421429B2 (en) | 2005-08-04 | 2008-09-02 | Microsoft Corporation | Generate blog context ranking using track-back weight, context weight and, cumulative comment weight |
CN1913441A (en) | 2005-08-09 | 2007-02-14 | 张永敏 | Continuous changed data set transmission and updating method |
US8095866B2 (en) | 2005-09-09 | 2012-01-10 | Microsoft Corporation | Filtering user interface for a data summary table |
US20070094248A1 (en) | 2005-09-26 | 2007-04-26 | Bea Systems, Inc. | System and method for managing content by workflows |
US8306986B2 (en) | 2005-09-30 | 2012-11-06 | American Express Travel Related Services Company, Inc. | Method, system, and computer program product for linking customer information |
US7933897B2 (en) | 2005-10-12 | 2011-04-26 | Google Inc. | Entity display priority in a distributed geographic information system |
US20090168163A1 (en) | 2005-11-01 | 2009-07-02 | Global Bionic Optics Pty Ltd. | Optical lens systems |
US8726144B2 (en) | 2005-12-23 | 2014-05-13 | Xerox Corporation | Interactive learning-based document annotation |
US7870512B2 (en) | 2005-12-28 | 2011-01-11 | Sap Ag | User interface (UI) prototype using UI taxonomy |
US7801912B2 (en) | 2005-12-29 | 2010-09-21 | Amazon Technologies, Inc. | Method and apparatus for a searchable data service |
US7788296B2 (en) | 2005-12-29 | 2010-08-31 | Guidewire Software, Inc. | Method and apparatus for managing a computer-based address book for incident-related work |
US7831917B1 (en) | 2005-12-30 | 2010-11-09 | Google Inc. | Method, system, and graphical user interface for identifying and communicating with meeting spots |
US8712828B2 (en) | 2005-12-30 | 2014-04-29 | Accenture Global Services Limited | Churn prediction and management system |
US20070192281A1 (en) | 2006-02-02 | 2007-08-16 | International Business Machines Corporation | Methods and apparatus for displaying real-time search trends in graphical search specification and result interfaces |
US20070185867A1 (en) | 2006-02-03 | 2007-08-09 | Matteo Maga | Statistical modeling methods for determining customer distribution by churn probability within a customer population |
US7853573B2 (en) | 2006-05-03 | 2010-12-14 | Oracle International Corporation | Efficient replication of XML data in a relational database management system |
US20070260582A1 (en) | 2006-05-05 | 2007-11-08 | Inetsoft Technology | Method and System for Visual Query Construction and Representation |
US7756843B1 (en) | 2006-05-25 | 2010-07-13 | Juniper Networks, Inc. | Identifying and processing confidential information on network endpoints |
US9195985B2 (en) | 2006-06-08 | 2015-11-24 | Iii Holdings 1, Llc | Method, system, and computer program product for customer-level data verification |
US8290943B2 (en) | 2006-07-14 | 2012-10-16 | Raytheon Company | Geographical information display system and method |
US8230332B2 (en) | 2006-08-30 | 2012-07-24 | Compsci Resources, Llc | Interactive user interface for converting unstructured documents |
US7899822B2 (en) | 2006-09-08 | 2011-03-01 | International Business Machines Corporation | Automatically linking documents with relevant structured information |
US8054756B2 (en) | 2006-09-18 | 2011-11-08 | Yahoo! Inc. | Path discovery and analytics for network data |
US20080270328A1 (en) | 2006-10-18 | 2008-10-30 | Chad Lafferty | Building and Using Intelligent Software Agents For Optimizing Oil And Gas Wells |
US7792353B2 (en) | 2006-10-31 | 2010-09-07 | Hewlett-Packard Development Company, L.P. | Retraining a machine-learning classifier using re-labeled training samples |
US8229902B2 (en) | 2006-11-01 | 2012-07-24 | Ab Initio Technology Llc | Managing storage of individually accessible data units |
US7853614B2 (en) | 2006-11-27 | 2010-12-14 | Rapleaf, Inc. | Hierarchical, traceable, and association reputation assessment of email domains |
US7680939B2 (en) | 2006-12-20 | 2010-03-16 | Yahoo! Inc. | Graphical user interface to manipulate syndication data feeds |
US8290838B1 (en) | 2006-12-29 | 2012-10-16 | Amazon Technologies, Inc. | Indicating irregularities in online financial transactions |
US8799871B2 (en) | 2007-01-08 | 2014-08-05 | The Mathworks, Inc. | Computation of elementwise expression in parallel |
US8171418B2 (en) | 2007-01-31 | 2012-05-01 | Salesforce.Com, Inc. | Method and system for presenting a visual representation of the portion of the sets of data that a query is expected to return |
CN101246486B (en) | 2007-02-13 | 2012-02-01 | 国际商业机器公司 | Method and apparatus for improved process of expressions |
US20080208735A1 (en) | 2007-02-22 | 2008-08-28 | American Expresstravel Related Services Company, Inc., A New York Corporation | Method, System, and Computer Program Product for Managing Business Customer Contacts |
US7689624B2 (en) | 2007-03-01 | 2010-03-30 | Microsoft Corporation | Graph-based search leveraging sentiment analysis of user comments |
WO2008115519A1 (en) | 2007-03-20 | 2008-09-25 | President And Fellows Of Harvard College | A system for estimating a distribution of message content categories in source data |
US8036971B2 (en) | 2007-03-30 | 2011-10-11 | Palantir Technologies, Inc. | Generating dynamic date sets that represent market conditions |
US20080255973A1 (en) | 2007-04-10 | 2008-10-16 | Robert El Wade | Sales transaction analysis tool and associated method of use |
US20080294663A1 (en) | 2007-05-14 | 2008-11-27 | Heinley Brandon J | Creation and management of visual timelines |
US7930547B2 (en) | 2007-06-15 | 2011-04-19 | Alcatel-Lucent Usa Inc. | High accuracy bloom filter using partitioned hashing |
US20090006150A1 (en) | 2007-06-29 | 2009-01-01 | Sap Ag | Coherent multi-dimensional business process model |
US8386996B2 (en) | 2007-06-29 | 2013-02-26 | Sap Ag | Process extension wizard for coherent multi-dimensional business process models |
US8037046B2 (en) | 2007-06-29 | 2011-10-11 | Microsoft Corporation | Collecting and presenting temporal-based action information |
US20090125359A1 (en) | 2007-07-09 | 2009-05-14 | Robert Knapic | Integrating a methodology management system with project tasks in a project management system |
US7761525B2 (en) | 2007-08-23 | 2010-07-20 | International Business Machines Corporation | System and method for providing improved time references in documents |
US20130066673A1 (en) | 2007-09-06 | 2013-03-14 | Digg, Inc. | Adapting thresholds |
US20090083275A1 (en) | 2007-09-24 | 2009-03-26 | Nokia Corporation | Method, Apparatus and Computer Program Product for Performing a Visual Search Using Grid-Based Feature Organization |
US8849728B2 (en) | 2007-10-01 | 2014-09-30 | Purdue Research Foundation | Visual analytics law enforcement tools |
US8484115B2 (en) | 2007-10-03 | 2013-07-09 | Palantir Technologies, Inc. | Object-oriented time series generator |
US20090094270A1 (en) | 2007-10-08 | 2009-04-09 | Alirez Baldomero J | Method of building a validation database |
US8214308B2 (en) | 2007-10-23 | 2012-07-03 | Sas Institute Inc. | Computer-implemented systems and methods for updating predictive models |
US8397168B2 (en) | 2008-04-05 | 2013-03-12 | Social Communications Company | Interfacing with a spatial virtual communication environment |
US7650310B2 (en) | 2007-10-30 | 2010-01-19 | Intuit Inc. | Technique for reducing phishing |
US20090126020A1 (en) | 2007-11-09 | 2009-05-14 | Norton Richard Elliott | Engine for rule based content filtering |
US9898767B2 (en) | 2007-11-14 | 2018-02-20 | Panjiva, Inc. | Transaction facilitating marketplace platform |
US20090132953A1 (en) | 2007-11-16 | 2009-05-21 | Iac Search & Media, Inc. | User interface and method in local search system with vertical search results and an interactive map |
US8270577B2 (en) | 2007-12-13 | 2012-09-18 | Verizon Patent And Licensing Inc. | Multiple visual voicemail mailboxes |
US8417715B1 (en) | 2007-12-19 | 2013-04-09 | Tilmann Bruckhaus | Platform independent plug-in methods and systems for data mining and analytics |
US20090161147A1 (en) | 2007-12-20 | 2009-06-25 | Sharp Laboratories Of America, Inc. | Personal document container |
US20090172674A1 (en) | 2007-12-28 | 2009-07-02 | International Business Machines Corporation | Managing the computer collection of information in an information technology environment |
US8055633B2 (en) | 2008-01-21 | 2011-11-08 | International Business Machines Corporation | Method, system and computer program product for duplicate detection |
KR100915295B1 (en) | 2008-01-22 | 2009-09-03 | 성균관대학교산학협력단 | System and method for search service having a function of automatic classification of search results |
US7877367B2 (en) | 2008-01-22 | 2011-01-25 | International Business Machines Corporation | Computer method and apparatus for graphical inquiry specification with progressive summary |
US20090193012A1 (en) | 2008-01-29 | 2009-07-30 | James Charles Williams | Inheritance in a Search Index |
US20090199047A1 (en) | 2008-01-31 | 2009-08-06 | Yahoo! Inc. | Executing software performance test jobs in a clustered system |
US9274923B2 (en) | 2008-03-25 | 2016-03-01 | Wind River Systems, Inc. | System and method for stack crawl testing and caching |
US8150829B2 (en) | 2008-04-11 | 2012-04-03 | Fujitsu Limited | Facilitating display of an interactive and dynamic cloud of terms related to one or more input terms |
US8121962B2 (en) | 2008-04-25 | 2012-02-21 | Fair Isaac Corporation | Automated entity identification for efficient profiling in an event probability prediction system |
US20090282068A1 (en) | 2008-05-12 | 2009-11-12 | Shockro John J | Semantic packager |
US20090319515A1 (en) | 2008-06-02 | 2009-12-24 | Steven Minton | System and method for managing entity knowledgebases |
US20090307049A1 (en) | 2008-06-05 | 2009-12-10 | Fair Isaac Corporation | Soft Co-Clustering of Data |
US8860754B2 (en) | 2008-06-22 | 2014-10-14 | Tableau Software, Inc. | Methods and systems of automatically generating marks in a graphical view |
US8499287B2 (en) | 2008-06-23 | 2013-07-30 | Microsoft Corporation | Analysis of thread synchronization events |
US7908521B2 (en) | 2008-06-25 | 2011-03-15 | Microsoft Corporation | Process reflection |
US20100011282A1 (en) | 2008-07-11 | 2010-01-14 | iCyte Pty Ltd. | Annotation system and method |
EP2329357A4 (en) | 2008-08-04 | 2013-09-25 | Quid Inc | PERFORMANCE ANALYZING MACHINES FOR UNITS |
US8429194B2 (en) | 2008-09-15 | 2013-04-23 | Palantir Technologies, Inc. | Document-based workflows |
KR101495132B1 (en) | 2008-09-24 | 2015-02-25 | 삼성전자주식회사 | Method for displaying data on a mobile terminal and its mobile terminal |
CN101685449B (en) | 2008-09-26 | 2012-07-11 | 国际商业机器公司 | Method and system for connecting tables in a plurality of heterogeneous distributed databases |
US20100114887A1 (en) | 2008-10-17 | 2010-05-06 | Google Inc. | Textual Disambiguation Using Social Connections |
US8391584B2 (en) | 2008-10-20 | 2013-03-05 | Jpmorgan Chase Bank, N.A. | Method and system for duplicate check detection |
US9032254B2 (en) | 2008-10-29 | 2015-05-12 | Aternity Information Systems Ltd. | Real time monitoring of computer for determining speed and energy consumption of various processes |
US8103962B2 (en) | 2008-11-04 | 2012-01-24 | Brigham Young University | Form-based ontology creation and information harvesting |
US20100131502A1 (en) | 2008-11-25 | 2010-05-27 | Fordham Bradley S | Cohort group generation and automatic updating |
US8805861B2 (en) | 2008-12-09 | 2014-08-12 | Google Inc. | Methods and systems to train models to extract and integrate information from data sources |
US8312038B2 (en) | 2008-12-18 | 2012-11-13 | Oracle International Corporation | Criteria builder for query builder |
US8719350B2 (en) | 2008-12-23 | 2014-05-06 | International Business Machines Corporation | Email addressee verification |
US20100169376A1 (en) | 2008-12-29 | 2010-07-01 | Yahoo! Inc. | Visual search engine for personal dating |
US20100262688A1 (en) | 2009-01-21 | 2010-10-14 | Daniar Hussain | Systems, methods, and devices for detecting security vulnerabilities in ip networks |
US20100191563A1 (en) | 2009-01-23 | 2010-07-29 | Doctors' Administrative Solutions, Llc | Physician Practice Optimization Tracking |
US20110213655A1 (en) | 2009-01-24 | 2011-09-01 | Kontera Technologies, Inc. | Hybrid contextual advertising and related content analysis and display techniques |
US8073857B2 (en) | 2009-02-17 | 2011-12-06 | International Business Machines Corporation | Semantics-based data transformation over a wire in mashups |
US20100211535A1 (en) | 2009-02-17 | 2010-08-19 | Rosenberger Mark Elliot | Methods and systems for management of data |
US8473454B2 (en) | 2009-03-10 | 2013-06-25 | Xerox Corporation | System and method of on-demand document processing |
US20100235915A1 (en) | 2009-03-12 | 2010-09-16 | Nasir Memon | Using host symptoms, host roles, and/or host reputation for detection of host infection |
US8719249B2 (en) | 2009-05-12 | 2014-05-06 | Microsoft Corporation | Query classification |
US20130218879A1 (en) | 2009-05-15 | 2013-08-22 | Hyundai Motor Company | Update systems of space of interest data and methods thereof |
US9268761B2 (en) | 2009-06-05 | 2016-02-23 | Microsoft Technology Licensing, Llc | In-line dynamic text with variable formatting |
US8495151B2 (en) | 2009-06-05 | 2013-07-23 | Chandra Bodapati | Methods and systems for determining email addresses |
US9104695B1 (en) | 2009-07-27 | 2015-08-11 | Palantir Technologies, Inc. | Geotagging structured data |
US8606804B2 (en) | 2009-08-05 | 2013-12-10 | Microsoft Corporation | Runtime-defined dynamic queries |
US8583673B2 (en) | 2009-08-17 | 2013-11-12 | Microsoft Corporation | Progressive filtering of search results |
US9280777B2 (en) | 2009-09-08 | 2016-03-08 | Target Brands, Inc. | Operations dashboard |
US20110066497A1 (en) | 2009-09-14 | 2011-03-17 | Choicestream, Inc. | Personalized advertising and recommendation |
US8214490B1 (en) | 2009-09-15 | 2012-07-03 | Symantec Corporation | Compact input compensating reputation data tracking mechanism |
US20110074811A1 (en) | 2009-09-25 | 2011-03-31 | Apple Inc. | Map Layout for Print Production |
US20110078173A1 (en) | 2009-09-30 | 2011-03-31 | Avaya Inc. | Social Network User Interface |
US8595058B2 (en) | 2009-10-15 | 2013-11-26 | Visa U.S.A. | Systems and methods to match identifiers |
US9158816B2 (en) | 2009-10-21 | 2015-10-13 | Microsoft Technology Licensing, Llc | Event processing with XML query based on reusable XML query template |
US9165304B2 (en) | 2009-10-23 | 2015-10-20 | Service Management Group, Inc. | Analyzing consumer behavior using electronically-captured consumer location data |
CN102054015B (en) | 2009-10-28 | 2014-05-07 | 财团法人工业技术研究院 | System and method for organizing community intelligence information using an organic object data model |
US20110112995A1 (en) | 2009-10-28 | 2011-05-12 | Industrial Technology Research Institute | Systems and methods for organizing collective social intelligence information using an organic object data model |
US8806355B2 (en) | 2009-11-06 | 2014-08-12 | Cisco Technology, Inc. | Method and apparatus for visualizing and navigating within an immersive collaboration environment |
US20110131547A1 (en) | 2009-12-01 | 2011-06-02 | International Business Machines Corporation | Method and system defining and interchanging diagrams of graphical modeling languages |
US11122009B2 (en) | 2009-12-01 | 2021-09-14 | Apple Inc. | Systems and methods for identifying geographic locations of social media content collected over social networks |
US8645478B2 (en) | 2009-12-10 | 2014-02-04 | Mcafee, Inc. | System and method for monitoring social engineering in a computer network environment |
GB2476121A (en) | 2009-12-14 | 2011-06-15 | Colin Westlake | Linking interactions using a reference for an internet user's web session |
US20110153384A1 (en) | 2009-12-17 | 2011-06-23 | Matthew Donald Horne | Visual comps builder |
US10324598B2 (en) * | 2009-12-18 | 2019-06-18 | Graphika, Inc. | System and method for a search engine content filter |
EP2524299A4 (en) | 2010-01-11 | 2013-11-13 | Panjiva Inc | Evaluating public records of supply transactions for financial investment decisions |
US9026552B2 (en) | 2010-01-18 | 2015-05-05 | Salesforce.Com, Inc. | System and method for linking contact records to company locations |
US20110208822A1 (en) | 2010-02-22 | 2011-08-25 | Yogesh Chunilal Rathod | Method and system for customized, contextual, dynamic and unified communication, zero click advertisement and prospective customers search engine |
US20110208565A1 (en) | 2010-02-23 | 2011-08-25 | Michael Ross | complex process management |
US8478709B2 (en) | 2010-03-08 | 2013-07-02 | Hewlett-Packard Development Company, L.P. | Evaluation of client status for likelihood of churn |
US20110231296A1 (en) | 2010-03-16 | 2011-09-22 | UberMedia, Inc. | Systems and methods for interacting with messages, authors, and followers |
US8577911B1 (en) | 2010-03-23 | 2013-11-05 | Google Inc. | Presenting search term refinements |
US8739118B2 (en) | 2010-04-08 | 2014-05-27 | Microsoft Corporation | Pragmatic mapping specification, compilation and validation |
US8306846B2 (en) | 2010-04-12 | 2012-11-06 | First Data Corporation | Transaction location analytics systems and methods |
US20110258216A1 (en) | 2010-04-20 | 2011-10-20 | International Business Machines Corporation | Usability enhancements for bookmarks of browsers |
US8255399B2 (en) | 2010-04-28 | 2012-08-28 | Microsoft Corporation | Data classifier |
US8874432B2 (en) | 2010-04-28 | 2014-10-28 | Nec Laboratories America, Inc. | Systems and methods for semi-supervised relationship extraction |
US8626770B2 (en) | 2010-05-03 | 2014-01-07 | International Business Machines Corporation | Iceberg query evaluation implementing a compressed bitmap index |
US20110289397A1 (en) | 2010-05-19 | 2011-11-24 | Mauricio Eastmond | Displaying Table Data in a Limited Display Area |
US20110295649A1 (en) | 2010-05-31 | 2011-12-01 | International Business Machines Corporation | Automatic churn prediction |
US8799867B1 (en) | 2010-06-08 | 2014-08-05 | Cadence Design Systems, Inc. | Methods, systems, and articles of manufacture for synchronizing software verification flows |
US8756224B2 (en) | 2010-06-16 | 2014-06-17 | Rallyverse, Inc. | Methods, systems, and media for content ranking using real-time data |
US8380719B2 (en) | 2010-06-18 | 2013-02-19 | Microsoft Corporation | Semantic content searching |
US8352908B2 (en) | 2010-06-28 | 2013-01-08 | International Business Machines Corporation | Multi-modal conversion tool for form-type applications |
KR101196935B1 (en) | 2010-07-05 | 2012-11-05 | 엔에이치엔(주) | Method and system for providing reprsentation words of real-time popular keyword |
US8489641B1 (en) | 2010-07-08 | 2013-07-16 | Google Inc. | Displaying layers of search results on a map |
US8407341B2 (en) | 2010-07-09 | 2013-03-26 | Bank Of America Corporation | Monitoring communications |
CA2707916C (en) | 2010-07-14 | 2015-12-01 | Ibm Canada Limited - Ibm Canada Limitee | Intelligent timesheet assistance |
US8554653B2 (en) | 2010-07-22 | 2013-10-08 | Visa International Service Association | Systems and methods to identify payment accounts having business spending activities |
US8775530B2 (en) | 2010-08-25 | 2014-07-08 | International Business Machines Corporation | Communication management method and system |
US20120066166A1 (en) | 2010-09-10 | 2012-03-15 | International Business Machines Corporation | Predictive Analytics for Semi-Structured Case Oriented Processes |
US20120078595A1 (en) | 2010-09-24 | 2012-03-29 | Nokia Corporation | Method and apparatus for ontology matching |
US9069842B2 (en) | 2010-09-28 | 2015-06-30 | The Mitre Corporation | Accessing documents using predictive word sequences |
US8549004B2 (en) | 2010-09-30 | 2013-10-01 | Hewlett-Packard Development Company, L.P. | Estimation of unique database values |
US8498998B2 (en) | 2010-10-11 | 2013-07-30 | International Business Machines Corporation | Grouping identity records to generate candidate lists to use in an entity and relationship resolution process |
US8719252B2 (en) | 2010-10-22 | 2014-05-06 | Daniel Paul Miranker | Accessing relational databases as resource description framework databases |
US8949158B2 (en) | 2010-10-25 | 2015-02-03 | Intelius Inc. | Cost-sensitive alternating decision trees for record linkage |
JP5706137B2 (en) | 2010-11-22 | 2015-04-22 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | Method and computer program for displaying a plurality of posts (groups of data) on a computer screen in real time along a plurality of axes |
AU2011332881B2 (en) | 2010-11-24 | 2016-06-30 | LogRhythm Inc. | Advanced intelligence engine |
US8839133B2 (en) | 2010-12-02 | 2014-09-16 | Microsoft Corporation | Data visualizations including interactive time line representations |
CN102546446A (en) | 2010-12-13 | 2012-07-04 | 太仓市浏河镇亿网行网络技术服务部 | Email device |
US20120159449A1 (en) | 2010-12-15 | 2012-06-21 | International Business Machines Corporation | Call Stack Inspection For A Thread Of Execution |
US9141405B2 (en) | 2010-12-15 | 2015-09-22 | International Business Machines Corporation | User interface construction |
US20120158527A1 (en) | 2010-12-21 | 2012-06-21 | Class6Ix, Llc | Systems, Methods and/or Computer Readable Storage Media Facilitating Aggregation and/or Personalized Sequencing of News Video Content |
US20120173381A1 (en) | 2011-01-03 | 2012-07-05 | Stanley Benjamin Smith | Process and system for pricing and processing weighted data in a federated or subscription based data source |
IL211163A0 (en) | 2011-02-10 | 2011-04-28 | Univ Ben Gurion | A method for generating a randomized data structure for representing sets, based on bloom filters |
EP2678774A4 (en) | 2011-02-24 | 2015-04-08 | Lexisnexis Division Of Reed Elsevier Inc | Methods for electronic document searching and graphically representing electronic document searches |
US8381120B2 (en) | 2011-04-11 | 2013-02-19 | Credibility Corp. | Visualization tools for reviewing credibility and stateful hierarchical access to credibility |
US8984121B1 (en) * | 2011-04-21 | 2015-03-17 | Intuit Inc. | Dependency visualization and fault diagnosis using multidimensional models for software offerings |
US8966486B2 (en) | 2011-05-03 | 2015-02-24 | Microsoft Corporation | Distributed multi-phase batch job processing |
US9104765B2 (en) | 2011-06-17 | 2015-08-11 | Robert Osann, Jr. | Automatic webpage characterization and search results annotation |
US8726379B1 (en) | 2011-07-15 | 2014-05-13 | Norse Corporation | Systems and methods for dynamic protection from electronic attacks |
US8982130B2 (en) | 2011-07-15 | 2015-03-17 | Green Charge Networks | Cluster mapping to highlight areas of electrical congestion |
US20130024268A1 (en) | 2011-07-22 | 2013-01-24 | Ebay Inc. | Incentivizing the linking of internet content to products for sale |
US9996807B2 (en) | 2011-08-17 | 2018-06-12 | Roundhouse One Llc | Multidimensional digital platform for building integration and analysis |
US20130054551A1 (en) | 2011-08-24 | 2013-02-28 | Sap Ag | Global product database |
US8966392B2 (en) | 2011-08-29 | 2015-02-24 | Novell, Inc. | Event management apparatus, systems, and methods |
US8630892B2 (en) | 2011-08-31 | 2014-01-14 | Accenture Global Services Limited | Churn analysis system |
GB201115083D0 (en) | 2011-08-31 | 2011-10-19 | Data Connection Ltd | Identifying data items |
US8949164B1 (en) | 2011-09-08 | 2015-02-03 | George O. Mohler | Event forecasting system |
CN103765432A (en) | 2011-09-08 | 2014-04-30 | 惠普发展公司,有限责任合伙企业 | Visual component and drill down mapping |
WO2013044141A2 (en) | 2011-09-22 | 2013-03-28 | Capgemini U.S. Llc | Process transformation and transitioning apparatuses, methods and systems |
US8433702B1 (en) | 2011-09-28 | 2013-04-30 | Palantir Technologies, Inc. | Horizon histogram optimizations |
US8560494B1 (en) | 2011-09-30 | 2013-10-15 | Palantir Technologies, Inc. | Visual data importer |
US20130086482A1 (en) | 2011-09-30 | 2013-04-04 | Cbs Interactive, Inc. | Displaying plurality of content items in window |
WO2013052872A2 (en) | 2011-10-05 | 2013-04-11 | Mastercard International Incorporated | Nomination engine |
US8626545B2 (en) | 2011-10-17 | 2014-01-07 | CrowdFlower, Inc. | Predicting future performance of multiple workers on crowdsourcing tasks and selecting repeated crowdsourcing workers |
US8843421B2 (en) | 2011-11-01 | 2014-09-23 | Accenture Global Services Limited | Identification of entities likely to engage in a behavior |
US9159024B2 (en) | 2011-12-07 | 2015-10-13 | Wal-Mart Stores, Inc. | Real-time predictive intelligence platform |
CN103167093A (en) | 2011-12-08 | 2013-06-19 | 青岛海信移动通信技术股份有限公司 | Filling method of mobile phone email address |
US9026480B2 (en) | 2011-12-21 | 2015-05-05 | Telenav, Inc. | Navigation system with point of interest classification mechanism and method of operation thereof |
US8880420B2 (en) | 2011-12-27 | 2014-11-04 | Grubhub, Inc. | Utility for creating heatmaps for the study of competitive advantage in the restaurant marketplace |
US8843431B2 (en) | 2012-01-16 | 2014-09-23 | International Business Machines Corporation | Social network analysis for churn prediction |
US8909648B2 (en) | 2012-01-18 | 2014-12-09 | Technion Research & Development Foundation Limited | Methods and systems of supervised learning of semantic relatedness |
US8812958B2 (en) * | 2012-02-12 | 2014-08-19 | Saba Software, Inc. | Methods and apparatus for analyzing a social network |
US8965422B2 (en) | 2012-02-23 | 2015-02-24 | Blackberry Limited | Tagging instant message content for retrieval using mobile communication devices |
US20130226944A1 (en) | 2012-02-24 | 2013-08-29 | Microsoft Corporation | Format independent data transformation |
US9378526B2 (en) | 2012-03-02 | 2016-06-28 | Palantir Technologies, Inc. | System and method for accessing data objects via remote references |
JP2013191187A (en) | 2012-03-15 | 2013-09-26 | Fujitsu Ltd | Processing device, program and processing system |
US20130263019A1 (en) | 2012-03-30 | 2013-10-03 | Maria G. Castellanos | Analyzing social media |
US8983936B2 (en) | 2012-04-04 | 2015-03-17 | Microsoft Corporation | Incremental visualization for structured data in an enterprise-level data store |
US9298856B2 (en) | 2012-04-23 | 2016-03-29 | Sap Se | Interactive data exploration and visualization tool |
US9043710B2 (en) | 2012-04-26 | 2015-05-26 | Sap Se | Switch control in report generation |
US10304036B2 (en) | 2012-05-07 | 2019-05-28 | Nasdaq, Inc. | Social media profiling for one or more authors using one or more social media platforms |
EP2662782A1 (en) | 2012-05-10 | 2013-11-13 | Siemens Aktiengesellschaft | Method and system for storing data in a database |
CN104820685B (en) | 2012-06-28 | 2019-02-22 | 北京奇虎科技有限公司 | A kind of social network search method and social network search system |
US9563674B2 (en) | 2012-08-20 | 2017-02-07 | Microsoft Technology Licensing, Llc | Data exploration user interface |
US10163158B2 (en) | 2012-08-27 | 2018-12-25 | Yuh-Shen Song | Transactional monitoring system |
US20140068487A1 (en) | 2012-09-05 | 2014-03-06 | Roche Diagnostics Operations, Inc. | Computer Implemented Methods For Visualizing Correlations Between Blood Glucose Data And Events And Apparatuses Thereof |
US9798768B2 (en) | 2012-09-10 | 2017-10-24 | Palantir Technologies, Inc. | Search around visual queries |
US20140095509A1 (en) | 2012-10-02 | 2014-04-03 | Banjo, Inc. | Method of tagging content lacking geotags with a location |
US9513780B2 (en) | 2012-10-08 | 2016-12-06 | Fisher-Rosemount Systems, Inc. | Distribution policy for propagation of changes to process control objects |
US9104786B2 (en) | 2012-10-12 | 2015-08-11 | International Business Machines Corporation | Iterative refinement of cohorts using visual exploration and data analytics |
US8688573B1 (en) | 2012-10-16 | 2014-04-01 | Intuit Inc. | Method and system for identifying a merchant payee associated with a cash transaction |
US9348677B2 (en) | 2012-10-22 | 2016-05-24 | Palantir Technologies Inc. | System and method for batch evaluation programs |
US9471370B2 (en) | 2012-10-22 | 2016-10-18 | Palantir Technologies, Inc. | System and method for stack-based batch evaluation of program instructions |
US8914886B2 (en) | 2012-10-29 | 2014-12-16 | Mcafee, Inc. | Dynamic quarantining for malware detection |
US9378030B2 (en) | 2013-10-01 | 2016-06-28 | Aetherpal, Inc. | Method and apparatus for interactive mobile device guidance |
US10504127B2 (en) | 2012-11-15 | 2019-12-10 | Home Depot Product Authority, Llc | System and method for classifying relevant competitors |
US20140143009A1 (en) | 2012-11-16 | 2014-05-22 | International Business Machines Corporation | Risk reward estimation for company-country pairs |
US20140156527A1 (en) | 2012-11-30 | 2014-06-05 | Bank Of America Corporation | Pre-payment authorization categorization |
US20140157172A1 (en) | 2012-11-30 | 2014-06-05 | Drillmap | Geographic layout of petroleum drilling data and methods for processing data |
US10672008B2 (en) | 2012-12-06 | 2020-06-02 | Jpmorgan Chase Bank, N.A. | System and method for data analytics |
US9497289B2 (en) | 2012-12-07 | 2016-11-15 | Genesys Telecommunications Laboratories, Inc. | System and method for social message classification based on influence |
US10108668B2 (en) | 2012-12-14 | 2018-10-23 | Sap Se | Column smart mechanism for column based database |
US9294576B2 (en) | 2013-01-02 | 2016-03-22 | Microsoft Technology Licensing, Llc | Social media impact assessment |
US20140195515A1 (en) | 2013-01-10 | 2014-07-10 | I3 Analytics | Methods and systems for querying and displaying data using interactive three-dimensional representations |
US8639552B1 (en) | 2013-01-24 | 2014-01-28 | Broadvision, Inc. | Systems and methods for creating and sharing tasks |
US9805407B2 (en) | 2013-01-25 | 2017-10-31 | Illumina, Inc. | Methods and systems for using a cloud computing environment to configure and sell a biological sample preparation cartridge and share related data |
US20140222521A1 (en) | 2013-02-07 | 2014-08-07 | Ibms, Llc | Intelligent management and compliance verification in distributed work flow environments |
US20140222793A1 (en) | 2013-02-07 | 2014-08-07 | Parlance Corporation | System and Method for Automatically Importing, Refreshing, Maintaining, and Merging Contact Sets |
US9264393B2 (en) | 2013-02-13 | 2016-02-16 | International Business Machines Corporation | Mail server-based dynamic workflow management |
US8744890B1 (en) | 2013-02-14 | 2014-06-03 | Aktana, Inc. | System and method for managing system-level workflow strategy and individual workflow activity |
US20140244388A1 (en) | 2013-02-28 | 2014-08-28 | MetroStar Systems, Inc. | Social Content Synchronization |
US9378306B2 (en) | 2013-03-12 | 2016-06-28 | Business Objects Software Ltd. | Binning visual definition for visual intelligence |
US10140664B2 (en) | 2013-03-14 | 2018-11-27 | Palantir Technologies Inc. | Resolving similar entities from a transaction database |
US9501202B2 (en) | 2013-03-15 | 2016-11-22 | Palantir Technologies, Inc. | Computer graphical user interface with genomic workflow |
GB2513720A (en) | 2013-03-15 | 2014-11-05 | Palantir Technologies Inc | Computer-implemented systems and methods for comparing and associating objects |
US8924388B2 (en) | 2013-03-15 | 2014-12-30 | Palantir Technologies Inc. | Computer-implemented systems and methods for comparing and associating objects |
US9898167B2 (en) | 2013-03-15 | 2018-02-20 | Palantir Technologies Inc. | Systems and methods for providing a tagging interface for external content |
GB2513721A (en) | 2013-03-15 | 2014-11-05 | Palantir Technologies Inc | Computer-implemented systems and methods for comparing and associating objects |
US8996511B2 (en) | 2013-03-15 | 2015-03-31 | Envizium, Inc. | System, method, and computer product for providing search results in a hierarchical graphical format |
US9372929B2 (en) | 2013-03-20 | 2016-06-21 | Securboration, Inc. | Methods and systems for node and link identification |
US8799799B1 (en) | 2013-05-07 | 2014-08-05 | Palantir Technologies Inc. | Interactive geospatial map |
US9576248B2 (en) | 2013-06-01 | 2017-02-21 | Adam M. Hurwitz | Record linkage sharing using labeled comparison vectors and a machine learning domain classification trainer |
US9280618B1 (en) | 2013-07-26 | 2016-03-08 | Applied Predictive Technologies, Inc. | Systems and methods for control strategy criteria selection |
US8752178B2 (en) | 2013-07-31 | 2014-06-10 | Splunk Inc. | Blacklisting and whitelisting of security-related events |
US8938686B1 (en) | 2013-10-03 | 2015-01-20 | Palantir Technologies Inc. | Systems and methods for analyzing performance of an entity |
US8812960B1 (en) | 2013-10-07 | 2014-08-19 | Palantir Technologies Inc. | Cohort-based presentation of user interaction data |
US20150112772A1 (en) | 2013-10-11 | 2015-04-23 | Crowdpac, Inc. | Interface and methods for tracking and analyzing political ideology and interests |
US9792194B2 (en) | 2013-10-18 | 2017-10-17 | International Business Machines Corporation | Performance regression manager for large scale systems |
US20150120717A1 (en) * | 2013-10-25 | 2015-04-30 | Marketwire L.P. | Systems and methods for determining influencers in a social data network and ranking data objects based on influencers |
US8832594B1 (en) | 2013-11-04 | 2014-09-09 | Palantir Technologies Inc. | Space-optimized display of multi-column tables with selective text truncation based on a combined text width |
US9396246B2 (en) | 2013-11-08 | 2016-07-19 | International Business Machines Corporation | Reporting and summarizing metrics in sparse relationships on an OLTP database |
US9356937B2 (en) | 2013-11-13 | 2016-05-31 | International Business Machines Corporation | Disambiguating conflicting content filter rules |
US9105000B1 (en) | 2013-12-10 | 2015-08-11 | Palantir Technologies Inc. | Aggregating data from a plurality of data sources |
US10356032B2 (en) | 2013-12-26 | 2019-07-16 | Palantir Technologies Inc. | System and method for detecting confidential information emails |
US8832832B1 (en) | 2014-01-03 | 2014-09-09 | Palantir Technologies Inc. | IP reputation |
US8924429B1 (en) | 2014-03-18 | 2014-12-30 | Palantir Technologies Inc. | Determining and extracting changed data from a data source |
US9129219B1 (en) | 2014-06-30 | 2015-09-08 | Palantir Technologies, Inc. | Crime risk forecasting |
US9256664B2 (en) | 2014-07-03 | 2016-02-09 | Palantir Technologies Inc. | System and method for news events detection and visualization |
US20160026923A1 (en) | 2014-07-22 | 2016-01-28 | Palantir Technologies Inc. | System and method for determining a propensity of entity to take a specified action |
US9390086B2 (en) | 2014-09-11 | 2016-07-12 | Palantir Technologies Inc. | Classification system with methodology for efficient verification |
US9483546B2 (en) | 2014-12-15 | 2016-11-01 | Palantir Technologies Inc. | System and method for associating related records to common entities across multiple lists |
US10552994B2 (en) | 2014-12-22 | 2020-02-04 | Palantir Technologies Inc. | Systems and interactive user interfaces for dynamic retrieval, analysis, and triage of data items |
US9335911B1 (en) | 2014-12-29 | 2016-05-10 | Palantir Technologies Inc. | Interactive user interface for dynamic data analysis exploration and query processing |
US9881226B1 (en) * | 2015-09-24 | 2018-01-30 | Amazon Technologies, Inc. | Object relation builder |
US10565225B2 (en) * | 2016-03-04 | 2020-02-18 | International Business Machines Corporation | Exploration and navigation of a content collection |
US10176609B2 (en) * | 2016-05-11 | 2019-01-08 | Runtime Collective Limited | Analysis and visualization of interaction and influence in a network |
US9836183B1 (en) * | 2016-09-14 | 2017-12-05 | Quid, Inc. | Summarized network graph for semantic similarity graphs of large corpora |
US10417523B2 (en) * | 2016-11-07 | 2019-09-17 | Ayasdi Ai Llc | Dimension grouping and reduction for model generation, testing, and documentation |
-
2017
- 2017-07-06 US US15/642,820 patent/US10509844B1/en active Active
-
2019
- 2019-10-16 US US16/654,048 patent/US20200133986A1/en not_active Abandoned
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US176609A (en) * | 1876-04-25 | Improvement in stoves for burning oil | ||
US20050076054A1 (en) * | 2003-10-07 | 2005-04-07 | Moon Billy Gayle | Arrangement for autonomous mobile network nodes to organize a wireless mobile network based on detected physical and logical changes |
US20080140655A1 (en) * | 2004-12-15 | 2008-06-12 | Hoos Holger H | Systems and Methods for Storing, Maintaining and Providing Access to Information |
US20090265475A1 (en) * | 2005-08-30 | 2009-10-22 | Matsushita Electric Industrial Co., Ltd. | Content delivery method, content delivery server, communication terminal apparatus, and content delivery system |
US20160210767A1 (en) * | 2012-03-21 | 2016-07-21 | Zymeworks Inc. | Systems and methods for making two dimensional graphs of complex molecules |
US9978162B1 (en) * | 2015-03-31 | 2018-05-22 | Cisco Technology, Inc. | Rules-based causality visualization framework |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022105682A1 (en) * | 2020-11-18 | 2022-05-27 | 北京字节跳动网络技术有限公司 | Paging data acquisition method and apparatus, electronic device, and computer readable storage medium |
CN113221015A (en) * | 2021-07-06 | 2021-08-06 | 深圳市炆石数据有限公司 | Homologous user determination and homologous network construction method, system and storage medium |
Also Published As
Publication number | Publication date |
---|---|
US10509844B1 (en) | 2019-12-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10509844B1 (en) | Network graph parser | |
US11921805B2 (en) | Web document enhancement | |
US11798209B1 (en) | Systems and methods for rendering a third party visualization in response to events received from search queries | |
US8407576B1 (en) | Situational web-based dashboard | |
EP2891992A1 (en) | Systems and methods for visual definition of data associations | |
EP2778977A1 (en) | Systems and methods for providing a tagging interface for external content | |
US20130263023A1 (en) | Graphical Overlay Related To Data Mining and Analytics | |
US20120233160A1 (en) | System and method for assisting a user to identify the contexts of search results | |
US20160259630A1 (en) | Systems, apparatus and methods for sharing visual model-based applications | |
US20100070856A1 (en) | Method for Graphical Visualization of Multiple Traversed Breadcrumb Trails | |
US11314757B2 (en) | Search results modulator | |
US9792008B2 (en) | User interface with analytics overlay | |
US20150242536A1 (en) | Advanced Search Page with Dynamic Generation of a Search Query String | |
US8413062B1 (en) | Method and system for accessing interface design elements via a wireframe mock-up | |
KR102713074B1 (en) | Method and apparatus for processing data | |
US10534826B2 (en) | Guided search via content analytics and ontology | |
WO2016092412A1 (en) | Generation of mapping definitions for content management system | |
US12223000B2 (en) | Data shaping system | |
US11151213B2 (en) | Browser search result option to facilitate human spidering | |
US20240054174A1 (en) | Methods and systems for obtaining and storing web pages | |
CN107357796B (en) | Network information acquisition method, equipment and programmable equipment | |
US20090288033A1 (en) | User-Directed Capture of Unstructured Information from Web Pages with Assignment to Data Type | |
US20100268780A1 (en) | Situational Application Creation Based on Observed User Behavior | |
US10922476B1 (en) | Resource-efficient generation of visual layout information associated with network-accessible documents | |
Ni et al. | Visualizing linked data with JavaScript |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: PALANTIR TECHNOLOGIES INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MCINTYRE, THOMAS;ROSEN, CARL;BALL, ELIOT;AND OTHERS;SIGNING DATES FROM 20170719 TO 20180717;REEL/FRAME:051631/0608 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
AS | Assignment |
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:052856/0817 Effective date: 20200604 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
AS | Assignment |
Owner name: WELLS FARGO BANK, N.A., NORTH CAROLINA Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:060572/0506 Effective date: 20220701 |