US20130246422A1 - System and method for clustering host inventories - Google Patents
System and method for clustering host inventories Download PDFInfo
- Publication number
- US20130246422A1 US20130246422A1 US12/880,125 US88012510A US2013246422A1 US 20130246422 A1 US20130246422 A1 US 20130246422A1 US 88012510 A US88012510 A US 88012510A US 2013246422 A1 US2013246422 A1 US 2013246422A1
- Authority
- US
- United States
- Prior art keywords
- file
- keyword
- hosts
- host
- inventories
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0893—Assignment of logical groups to network elements
Definitions
- This disclosure relates in general to the field of computer network administration and support and, more particularly, to identifying similar software inventories on selected hosts.
- executable software inventories may vary greatly among computers across departmental groups. Varied executable software inventories on computers within organizations present numerous difficulties to IT administrators to maintain, to troubleshoot, to service, and to provide uninterrupted access for business or other necessary activities.
- Innovative tools are needed to assist IT administrators to successfully support computer network environments with computers having incongruities between executable software inventories.
- FIG. 1 is a pictorial representation of an exemplary network environment in which various embodiments of a system and method for clustering host inventories may be implemented in accordance with the present disclosure
- FIG. 2 is a simplified block diagram of a computer, which may be utilized in embodiments in accordance with the present disclosure
- FIG. 3 is a simplified flowchart illustrating a series of example steps associated with the system in accordance with one embodiment of the present disclosure
- FIG. 4 illustrates an n ⁇ m vector matrix format used in accordance with an embodiment of the present disclosure
- FIG. 5 is a flowchart illustrating a series of example steps for generating values for an n ⁇ m vector matrix as shown in FIG. 4 in accordance with one embodiment of the present disclosure
- FIG. 6 illustrates an example selected group of hosts in a network environment to which embodiments of the present disclosure may be applied
- FIG. 7 illustrates a vector matrix created by application of the flow of FIG. 5 to the example selected group of hosts of FIG. 6 ;
- FIG. 8 is an example cluster diagram of the hosts of FIG. 6 that could be created from the system in accordance with embodiments of the present disclosure
- FIG. 9 is a simplified flowchart illustrating a series of example steps associated with the system in accordance with another embodiment of the present disclosure.
- FIG. 10 illustrates an n ⁇ n similarity matrix format used in accordance with one embodiment of the present disclosure
- FIG. 11 is a flowchart illustrating a series of example steps for generating values for an n ⁇ n similarity matrix as shown in FIG. 10 in accordance with one embodiment of the present disclosure.
- FIG. 12 illustrates a similarity matrix created by application of the flow of FIG. 11 to the example selected group of hosts of FIG. 6 .
- a method in one example implementation includes obtaining a plurality of host file inventories corresponding respectively to a plurality of hosts, calculating input data using the plurality of host file inventories, and providing the input data to a clustering procedure to group the plurality of hosts into one or more clusters of hosts.
- the method further includes each cluster of hosts being grouped using a predetermined similarity criteria. More specific embodiments include each of the plurality of host file inventories including a set of one or more file identifiers with each file identifier representing a different executable software file on a corresponding one of the plurality of hosts. In another more specific embodiment, the method includes each of the one or more file identifiers including a token sequence of one or more tokens.
- the calculating the input data includes transforming the plurality of host file inventories into a similarity matrix. In another more specific embodiment, the calculating the input data includes transforming the plurality of host file inventories into a matrix of keyword vectors in Euclidean space, where each keyword vector corresponds to one of the plurality of hosts.
- FIG. 1 is a pictorial representation of a computer network environment 100 in which embodiments of a system for clustering host inventories may be implemented in accordance with the present disclosure.
- Computer network environment 100 illustrates a network of computers including a plurality of hosts 110 a , 110 b , and 110 c (referred to collectively herein as hosts 110 ), which may each have, respectively, a set of executable files 112 a , 112 b , and 112 c (referred to collectively herein as sets of executable files 112 ) and a host inventory feed 114 a , 114 b , and 114 c (referred to collectively herein as host inventory feeds 114 ).
- hosts 110 may each have, respectively, a set of executable files 112 a , 112 b , and 112 c (referred to collectively herein as sets of executable files 112 ) and a host inventory feed 114 a , 114 b , and 114 c (referred to collectively
- Hosts 110 may be operably connected to a central server 130 through communication link 120 .
- Central server 130 may include an administrative module 140 , a host inventory preparation module 150 , and a clustering module 160 .
- a management console 170 can also be suitably connected to central server 130 to provide an interface for users such as Information Technology (IT) administrators, network operators, and the like.
- IT Information Technology
- the system for clustering host inventories may be utilized to provide valuable information to users (e.g., IT administrators, network operators, etc.) identifying computers having similar operating systems and installed executable software files.
- users e.g., IT administrators, network operators, etc.
- software inventories from hosts 110 may be transformed by host inventory preparation module 150 into input data for a clustering algorithm or procedure.
- Clustering module 160 may apply the clustering algorithm to the prepared input data to create a clustering diagram or other information identifying logical groupings of hosts 110 having similar operating systems and installed sets of executable files 112 .
- the clustering diagram may also identify any outlier hosts 110 having significant differences in operating systems and/or executable files relative to the other hosts 110 in network environment 100 .
- the IT administrator or other user is provided with valuable information that enables the discovery of trends and exceptions of computers, such as hosts 110 , in the particular network environment.
- common policies may be applied to computers within logical groupings and remedial action may be taken on any identified outlier computers.
- Typical network environments used in organizations and by individuals often include a plurality of computers such as end user desktops, laptops, servers, network appliances, and the like, and each may have an installed set of executable software.
- network environments may include hundreds or thousands of computers, which may span different buildings, cities, and/or geographical areas around the world. IT administrators may be tasked with the extraordinary responsibility of maintaining these computers in a way that minimizes or eliminates disruption to business activities.
- One difficulty IT administrators face includes maintaining multiple computers in a chaotic or heterogeneous network environment.
- congruency between executable software of the computers may be minimal.
- executable files may be stored in different memory locations on different computers, different versions of executable files may be installed in different computers, executable files may be stored on some computers but not on others, and the like.
- Such networks may require additional time and resources to be adequately supported as IT administrators may need to individualize policies, maintenance, upgrades, repairs, and/or any other type of support to suit particular computers having nonstandard executable software and/or operating systems.
- Homogenous network environments in which executable software of computers are congruent or at least similar, may also benefit from a system and method for clustering host inventories.
- particular computers may occasionally deviate from standard computers within the network environment.
- malicious software may break through the various firewalls and other network barriers creating one or more deviant computers.
- end users of computers may install various executable software files from transportable disks or download such software creating deviant computers.
- a system for clustering host inventories could readily identify any outliers having nonstandard and possibly malicious executable software.
- a system and method for clustering host inventories could greatly enhance abilities of IT administrators or other users managing computer networks to effectively support both heterogeneous and homogeneous network environments.
- the system which may be implemented in a computer such as server 130 , enables identification of logical groupings of computers with similar executable file inventories and identification of outliers (e.g., computers with drastically different executable file inventories).
- host file inventories of executable files from hosts 110 are provided for evaluation.
- the host file inventories are transformed into input data for a clustering algorithm.
- the clustering algorithm is applied and one or more diagrams or charts may be created to show logical clusters or groupings of hosts 110 having the same or similar software inventories.
- the diagrams or charts may also show any of the hosts 110 that drastically deviate from other hosts 110 .
- the system provides network or IT administrators with valuable information that may be used to more effectively manage hosts 110 within network environment 100 .
- references to various features e.g., elements, structures, modules, components, steps, etc.
- references to various features are intended to mean that any such features may be included in one or more embodiments of the present disclosure, but may or may not necessarily be included in the same embodiments.
- the example network environment 100 may be configured as one or more networks and may be configured in any form including, but not limited to, local area networks (LANs), wide area networks (WANs) such as the Internet, or any combination thereof.
- communication link 120 may represent any electronic link supporting a LAN environment such as, for example, cable, Ethernet, wireless (e.g., WiFi), ATM, fiber optics, etc. or any combination thereof.
- communication link 120 may represent a remote connection to central server 130 through any appropriate medium (e.g., digital subscriber lines (DSL), telephone lines, T1 lines, T3 lines, wireless, satellite, fiber optics, cable, Ethernet, etc. or any combination thereof) and/or through any additional networks such as a wide area network (e.g., the Internet).
- gateways, routers and the like may be used to facilitate electronic communication between hosts 110 and central server 130 .
- hosts 110 may represent end user computers that could be operated by end users.
- the end user computers may include desktops, laptops, and mobile or handheld computers (e.g., personal digital assistants (PDAs) or mobile phones).
- Hosts 110 can also represent other computers (e.g., servers, appliances, etc.) having executable software, which could be similarly evaluated and clustered by the system, using executable file inventories derived from sets of executable files 112 on such hosts 110 .
- FIG. 1 is intended as an example and should not be construed to imply architectural limitations in the present disclosure.
- Sets of executable files 112 on hosts 110 can include all executable files on respective hosts 110 .
- references to “executable file”, “program file”, “executable software file”, and “executable software” are meant to encompass any software file comprising instructions that can be understood and processed by a computer such as executable files, library modules, object files, other executable modules, script files, interpreter files, and the like.
- the system could be configured to allow the IT administrator to select a particular type of executable file to be clustered. For example, an IT Administrator may choose only dynamic-link library (DLL) modules for clustering.
- DLL dynamic-link library
- the IT administrator may also be permitted to select particular hosts to which clustering is applied. For example, all end user computers in a network or within a particular part of the network may be selected. In another example, all servers within a network or within a particular part of the network may be selected.
- Central server 130 of network environment 100 represents an exemplary server or other computer linked to hosts 110 , which may provide services to hosts 110 .
- the system of clustering host inventories may be implemented in central server 130 using various embodiments of host inventory preparation module 150 and clustering module 160 .
- keyword techniques may be used with vector based clustering in one example embodiment.
- host inventory preparation module 150 creates an (n ⁇ m) vector matrix where columns of the matrix may correspond to a determined number (i.e., “m”) of unique keywords, each of which is associated with one or more executable files in a selected number (i.e., “n”) of hosts.
- the rows of the vector matrix may correspond to the n selected hosts.
- Clustering module 160 can then apply a clustering algorithm to the vector matrix to create logical groupings of the n selected hosts.
- compression techniques may be used with similarity based clustering.
- host inventory preparation module 150 may create an (n ⁇ n) similarity matrix using compression techniques for a selected number (i.e., “n”) of hosts.
- Clustering module 160 may then apply a clustering algorithm to the similarity matrix to create logical groupings of the n selected hosts.
- selected hosts may include all of the hosts 110 in a particular network environment such as network environment 100 .
- selected hosts may include particular hosts selected by a user or predefined by policy, with such hosts existing in one or more network environments.
- Management console 170 linked to central server 130 may provide viewable cluster data for the IT administrators or other authorized users.
- Administrative module 140 may also be incorporated to allow IT administrators or other authorized users to add the logical groupings from a cluster analysis to an enterprise management system and to apply common policies to selected groupings.
- deviant or exceptional groupings or outliers can trigger various remedial actions (e.g., emails, vulnerability scans, etc.).
- management console 170 may also provide a user interface for the IT Administrator to select particular hosts and/or particular types of executable files to be included in the clustering procedure, in addition to other user provided configuration data for the system.
- One exemplary enterprise management system that could be used includes McAfee® electronic Policy Orchestrator (ePO) software manufactured by McAfee, Inc. of Santa Clara, Calif.
- FIG. 2 is a simplified block diagram of a general or special purpose computer 200 , such as hosts 110 , central server 130 , or other computing devices connected to network environment 100 .
- Computer 200 may include various components such as a processor 220 , a main memory 230 , a secondary storage 240 , a network interface 250 , a user interface 260 , and a removable memory interface 270 .
- a bus 210 such as a system bus, may provide electronic communication between processor 210 and the other components, memory, and interfaces of computer 200 .
- Processor 220 which may also be referred to as a central processing unit (CPU), can include any general or special-purpose processor capable of executing machine readable instructions and performing operations on data as instructed by the machine readable instructions.
- Main memory 230 may be directly accessible to processor 220 for accessing machine instructions and can be in the form of random access memory (RAM) or any type of dynamic storage (e.g., dynamic random access memory (DRAM)).
- Secondary storage 240 can be any non-volatile memory such as a hard disk, which is capable of storing electronic data including executable software files. Externally stored electronic data may be provided to computer 200 through removable memory interface 270 .
- Removable memory interface 270 may provide connection to any type of external memory such as compact discs (CDs), digital video discs (DVDs), flash drives, external hard drives, or any other external media.
- Network interface 250 can be any network interface controller (NIC) that provides a suitable network connection between computer 200 and any networks to which computer 200 connects for sending and receiving electronic data.
- network interface 250 could be an Ethernet adapter, a token ring adapter, or a wireless adapter.
- a user interface 260 may be provided to allow a user to interact with the computer 200 via any suitable means, including a graphical user interface display.
- any appropriate input mechanism may also be included such as a keyboard, mouse, voice recognition, touch pad, input screen, etc.
- processor 220 and bus 210 may be suitably coupled to processor 220 and bus 210 in the form of memory management units (MMU), additional symmetric multiprocessing (SMP) elements, read only memory (ROM), peripheral component interconnect (PCI) bus and corresponding bridges, small computer system interface (SCSI)/integrated drive electronics (IDE) elements, etc.
- MMU memory management units
- SMP symmetric multiprocessing
- ROM read only memory
- PCI peripheral component interconnect
- IDE integrated drive electronics
- computer is meant to encompass any personal computers, network appliances, routers, switches, gateways, processors, servers, load balancers, firewalls, or any other suitable device, component, element, or object operable to affect or process electronic information in a network environment.
- FIG. 3 an example system flow 300 of a keyword-based embodiment of the system and method for clustering host inventories is illustrated.
- Each host file inventory can include a set of file identifiers, with each file identifier representing a different executable file in the set of executable files 112 a , 112 b , or 112 c of the corresponding selected host 110 a , 110 b , or 110 c .
- Each file identifier may include a sequence of one or more tokens associated with the executable file represented by the file identifier.
- a simple file identifier could include a single token having a checksum configuration.
- a checksum can be a mathematical value or hash sum (e.g., a fixed string of numerical digits) derived by applying an algorithm to an executable file. If the algorithm is applied to another executable file that is identical to the first executable file, then the checksums should match. However, if the other executable file is different in any way (e.g., different type of software program, same software program but different version, same software program that has been altered in some way, etc.), then the checksums are very unlikely to match. Thus, the same executable file stored in different hosts or stored in different locations on disk of the same host should have identical checksums.
- a file identifier could include a sequence of first and second tokens having a checksum configuration and file path configuration, respectively, where the file path indicates where the executable file is stored on disk in the particular host in which it is installed.
- first token of the file identifier generated for executable file X on host 110 a could be the same as the first token of the file identifier generated for executable file Y on host 110 b .
- second token of the file identifier generated for executable file X on host 110 a could be different than the second token of the file identifier generated for executable file Y on host 110 b.
- Numerous other file identifiers may be configured by using any number of tokens and configuring the tokens to include any combination of available program file attributes, checksums, and/or file paths.
- Program file attributes may include, for example, creation date, modification date, security settings, vendor name, and the like.
- file identifiers may be configured with any number of such tokens, an executable file without a particular program file attribute, which is selected as one of the tokens, may have a file identifier with only the tokens available to that executable file.
- the file identifier is configured to include a first token (e.g., a checksum) and a second token (e.g., a vendor name)
- a first token e.g., a checksum
- a second token e.g., a vendor name
- an executable file without an embedded vendor name would have a file identifier with only a first token corresponding to the file checksum.
- an executable file having an embedded vendor name would have a file identifier with both first and second tokens corresponding to the file checksum and vendor name, respectively.
- the file identifiers and resulting host file inventories I 1 through I n may be provided by various implementations.
- the file identifiers and resulting host file inventories may be generated by host inventory feeds 114 for each host 110 and pushed to central server 130 .
- central server 130 may provide the user selected configuration criteria to each host 110 .
- Host inventory feeds 114 may then generate file identifiers with token sequences having the particular user-selected configuration.
- checksums for each executable file may be generated on hosts 110 by host inventory feeds 114 and then pushed to central server 130 along with other file attributes and file paths such that host inventory preparation module 150 of central server 130 can generate the file identifiers and resulting host file inventories for each of the selected hosts 110 .
- enumeration of executable files from the sets of executable files 112 of selected hosts 110 can be achieved by existing security technology such as, for example, Policy Auditor software or Application Control software, both manufactured by McAfee, Inc. of Santa Clara, Calif.
- step 320 a keyword method is used to transform host file inventories I 1 through I n into a vector matrix, which will be further described herein with reference to FIGS. 4 and 5 .
- step 330 a vector-based clustering analysis is performed on the vector matrix.
- Exemplary types of clustering analysis that may be performed on the vector matrix include agglomerative hierarchical clustering and partitional clustering.
- the results of such clustering techniques may be stored in a memory element of central server (e.g., secondary storage 240 of computer 200 ), or may be stored in a database or other memory element external to central server 130 .
- one or more reports can be generated indicating the clustered groupings determined during the clustering analysis and can be provided to authorized users by various methods (e.g., screen displays, electronic files, hard copies, emails, etc.).
- Exemplary reports may include a textual report and/or a visual representation (e.g., a proximity plot, a dendrogram, heat maps of a permuted keyword matrix, heat maps of a reduced keyword matrix where rows and columns have been merged to illustrate clusters, other cluster plots, etc.) enabling the user to view logical groupings of the selected hosts.
- a graphical user interface of management console 170 may display a proximity plot having physical representations of each host, with identifiable logical groupings (e.g., uniquely colored groupings, circled or otherwise enclosed groupings, representations of groupings with connected lines, etc.).
- identifiable logical groupings e.g., uniquely colored groupings, circled or otherwise enclosed groupings, representations of groupings with connected lines, etc.
- an IT Administrator or other authorized user can apply common policies to hosts within the logical groupings and remedial action may be taken on any identified outlier hosts.
- outlier hosts may be remediated to a standard software configuration as defined by the IT Administrators.
- FIG. 4 illustrates a matrix format 400 used when generating a vector matrix in one embodiment of the system and method of clustering host inventories.
- host file inventories are transformed into a vector matrix in Euclidean space using a keyword method.
- the following variables may be identified when generating a vector matrix:
- the number of keywords associated with an executable file equals the number of tokens in the token sequence of the file identifier representing the executable file. Therefore, one or more keywords can be associated with each executable file in sets of executable files 112 of selected hosts 110 .
- each keyword could be associated with multiple executable files in the same or different hosts.
- a keyword sequence km may be defined as a sequence of unique keywords K 1 through K m , where each keyword is associated with one or more executable files in sets of executable files 112 of all selected hosts 110 .
- Vector matrix format 400 includes n rows 460 and m columns 470 , with n and m defining the dimensions of the resulting n-by-m (i.e., n ⁇ m) vector matrix.
- Each entry 480 is denoted by a variable with subscripts i and j (i.e., a i,j ) where i and j correspond to the respective row and column where the entry is located. For example, entry a 2,1 is found in row 2, column 1 of vector matrix format 400 .
- Each row of entries represents a row vector 410 , 420 , and 430 for its corresponding host H 1 , H 2 and H n .
- a 1,1 , a 1,2 , through a 1,m define row vector 410 for host H 1 .
- row vectors 410 , 420 , through 430 can be provided as input data to a vector-based clustering algorithm to create a cluster graph or plot showing logical groupings of hosts H 1 through H n , having similar inventories of executable files and any host outliers having dissimilar host inventories.
- FIG. 5 illustrates a flow 500 using a keyword method to transform host file inventories into a list of vectors in Euclidean space, represented by vector matrix format 400 .
- Flow 500 corresponds to step 320 in flow 300 of FIG. 3 and may be implemented, at least in part, by host inventory preparation module 150 of central server 130 shown in FIG. 1 .
- km the file identifiers of all host file inventories of selected hosts 110 can be evaluated to find each unique keyword.
- each of the file identifiers of the host file inventories includes a sequence of first and second tokens with the first token having a file checksum configuration and the second token having a file path configuration. If the same version of Microsoft® Word software is installed on each of the selected hosts 110 a , 110 b , and 110 c in the same location on disk, then each file identifier representing the Microsoft® Word software in each of the host file inventories includes a first token containing a checksum of the Microsoft® Word software and a second token containing a file path of the software.
- keyword sequence km could include a first keyword (K 1 ) containing the checksum, which is the same on each of the selected hosts 110 a , 110 b , and 110 c , and a second keyword (K 2 ) containing the file path, which is the same on each of the selected hosts 110 a , 110 b , and 110 c.
- the algorithm of flow 500 computes a list of position vectors for an n ⁇ m vector matrix.
- Variables ‘i’ and ‘j’ are used to construct the vector matrix having n ⁇ m vector matrix format 400 , in m-dimensional keyword space, for each host H i by iterating over j through km and producing appropriate values for the position vectors indicating whether each host file inventory I i contains each keyword K j .
- step 520 variable i is set to 1 and steps 530 , 570 , and 575 form an outer loop iterating through the hosts.
- Variable j is set to 1 in step 530 and steps 540 through 565 form an inner loop iterating over j through km.
- step 540 keyword K j is retrieved.
- decision box 545 a query is made as to whether keyword K j is found in host file inventory I i of host H i .
- host file inventory I i of host H i is searched for a file identifier containing keyword K j . If keyword K j is not found in host file inventory I i then flow moves to step 550 where row i, column j (i.e., a i,j ) in the vector matrix may be updated with an appropriate value indicating keyword K j was not found in host file inventory I i . However, if in step 545 , keyword K j is found in host file inventory I i , then flow moves to step 555 where row i, column j (i.e., a i,j ) in the vector matrix may be updated with an appropriate value indicating keyword K j was found in host file inventory I i .
- entries a i,j in the vector matrix may vary depending upon the particular implementation of the system.
- an entry a i,j is assigned a ‘1’ value in step 555 , indicating keyword K j was found in host file inventory I i , or a ‘0’ value in step 560 , indicating keyword K j was not found in host file inventory I i .
- vector matrix contains only ‘0’ and/or ‘1’ values.
- entry a i,j is assigned a value in step 555 or 550 corresponding to a frequency of occurrence of keyword K j in host file inventory I i .
- file identifiers in a host file inventory I 1 include a first token configured as a checksum and a second token configured as a vendor name, with three executable files on host H 1 having the same embedded vendor name, XYZ, resulting in keyword K 2 of keyword sequence km being assigned the embedded vendor name XYZ.
- entry a 1,2 could be updated with a value of 3 because of the three occurrences of vendor name XYZ in file identifiers of host file inventory I 1 .
- vector matrix may contain ‘0’ values and/or positive integer values.
- decision box 560 If, however, in decision box 560 it is determined that j is not less than m (i.e., j ⁇ m), then host file inventory I i has been searched for all of the keywords K 1 through K m in keyword sequence km, so flow moves to decision box 570 , which is part of the outer loop of flow 500 . A query is made in decision box 570 to determine whether i ⁇ n, and if i ⁇ n, then not all of the hosts have been evaluated to generate corresponding keyword vectors. Therefore, flow moves to step 575 where i is set to i+1, and flow loops back to step 530 .
- decision box 570 if i is not less than n in), then all of the hosts H 1 through H n have been evaluated such that all of the vectors have been created in n ⁇ m vector matrix and, therefore, the flow ends.
- the embodiment of the flow 500 shown in FIG. 5 creates a vector in keyword space successively for each host H 1 through H n .
- Other embodiments could be configured to switch the inner and outer loops in flow 500 such that for each keyword K j , a column of vector matrix entries is produced by iterating over i through hosts H i and producing an appropriate value when keyword K j is found in host file inventory I i of host H i and an appropriate value when K j is not found in host file inventory I i of host H i . This processing could be repeated until all columns are filled, thereby generating the list of vectors in rows 1 through n.
- the clustering analysis performed on the resulting vector matrix may include commonly available clustering techniques such as agglomerative hierarchical clustering or partitional clustering.
- agglomerative hierarchical clustering each element begins as a separate cluster and elements are merged into successively larger clusters, which may be represented in a tree structure called a dendrogram.
- a root of the tree represents a single cluster of all of the elements and the leaves of the tree represent separated clusters of the elements.
- merging schemes in agglomerative hierarchical clustering used to achieve logical groupings may include schemes well-known in the art such as single-link (i.e., the distance between clusters is equal to the shortest distance from any member of one cluster to any member of another cluster), complete-link (i.e., the distance between clusters is equal to the greatest distance from any member of one cluster to any member of another cluster), group-average (i.e., the distance between clusters is equal to the average distance from any member of one cluster to any member of another cluster), and centroid (i.e., the distance between clusters is equal to the distance from the center of any one cluster to the center of another cluster).
- single-link i.e., the distance between clusters is equal to the shortest distance from any member of one cluster to any member of another cluster
- complete-link i.e., the distance between clusters is equal to the greatest distance from any member of one cluster to any member of another cluster
- group-average i.e., the distance between clusters is equal to
- Cut point determination may be made, for example, at a specified level of similarity or when consecutive similarities are the greatest, which is known in the art.
- a tree structure representing clusters could be cut at a predetermined height resulting in more or less clusters depending on the selected height at which the cut is made.
- Cut point determinations may be determined based on a particular network environment or particular hosts being clustered. In one example embodiment, an IT administrator or other authorized user could define the cut point determination used by the clustering procedure by determining a desired threshold for similarity based on the particular network environment.
- partitional clustering typically involves an algorithm that determines all clusters at one time.
- predetermined similarity criteria may provide, for example, a selected number of clusters to be generated or a maximum diameter for the clusters.
- One exemplary software package that implements these various clustering techniques is CLUTO Software for Clustering High-Dimensional Datasets developed by George Karypis, Professor at the Department of Computer Science & Engineering, University of Minnesota, Minneapolis and Saint Paul, Minn., which may be found on the World Wide Web at http://glaros.dtc.umn.edu/gkhome/view/cluto.
- FIGS. 6 , 7 , and 8 an example selected plurality of hosts 600 with executable files, a vector matrix 700 generated using the executable files of selected hosts 600 , and an example resulting cluster plot 800 are illustrated, respectively.
- host 1 H 1
- Host 2 H 2
- Host 3 H 3
- Host 3 is shown with a set of executable files 603 including executable files 610 , 620 , 630 , and 680 .
- Host 4 (H 4 ) is shown with a set of executable files 604 including executable files 610 , 650 , and 660 .
- Host 5 (H 5 ) is shown with a set of executable files 605 including executable files 640 , 670 , and 680 .
- FIG. 7 illustrates the resulting vector matrix 700 after the keyword method of flow 500 has been applied to the sets of executable files 601 through 605 of the selected plurality of hosts 600 of FIG. 6 .
- Vector matrix 700 shows hosts 1 through 5 (H 1 through H 5 ) corresponding to rows 760 containing keyword vectors 710 , 720 , 730 , 740 , and 750 , respectively.
- Columns 770 of vector matrix 700 are designated by keywords K 1 through K 8 .
- entries 780 include a ‘1’ indicating that keyword K j is contained in a file identifier of host file inventory I i , or a ‘0’ indicating that keyword K j is not contained in a file identifier of host file inventory I i .
- step 520 variable i is set to 1 and then the iterative flow begins to create n ⁇ m (5 ⁇ 8) vector matrix 700 shown in FIG. 7 .
- step 530 variable j is set to 1 and keyword K j (K 1 ) is retrieved from km in step 540 .
- step 545 host file inventory I i (I 1 ) of host H i (H 1 ) is searched for keyword K j (K 1 ).
- keyword K 1 is found in host file inventory I 1 of host H 1 , so flow moves to step 555 where a ‘1’ entry is added to row i, column j (row 1, column 1) of vector matrix 700 .
- the inner iterative loop then begins in step 540 to search for all keywords in host file inventory I i (I 2 ) of host H i (H 2 ) beginning with keyword K j (K 1 ).
- rows 760 are successively filled with a ‘1’ or a ‘0’ value for each entry a i,j until each vector row 710 through 750 has been completed.
- each entry a i,j in rows 760 could be filled with a value corresponding to the frequency of occurrence of keyword K j found in host file inventory I i .
- Vector matrix 700 can be provided as input data to a vector based clustering procedure, as previously described herein.
- Information generated from the clustering procedure could be provided in numerous ways such as, for example, reports, screen displays, files, emails, etc.
- the information could be provided in a proximity plot such as example proximity plot 800 illustrated in FIG. 8 .
- Proximity plot 800 is an example graph that could be created by a vector-based clustering procedure applied to vector matrix 700 of FIG. 7 . If agglomerative hierarchical clustering is used, clusters 810 , 820 , and 830 may be determined based on a cut point determination. If partitional clustering is used, clusters 810 , 820 , and 830 may be generated based on a predetermined number of clusters.
- Proximity plot 800 shows two clusters and one outlier.
- Cluster 810 represents hosts H 1 and H 3 and cluster 820 represents hosts H 2 and H 4 .
- Hosts H 1 and H 3 may be clustered together because they have three common executable files 610 , 620 , and 630 .
- Hosts H 2 and H 4 may be clustered together because they also have three common executable files 610 , 650 , and 660 .
- Outlier 830 represents host H 5 , which may be indicated as an outlier, because it has, in this example, none or only one common executable file with each of the other hosts.
- the clustering information is displayed on proximity plot 800 shown in FIG. 8 , other textual reports and/or visual representations, as previously described herein with reference to FIG. 3 , may be used to show clusters 810 and 820 and outlier 830 .
- FIG. 9 illustrates an example system flow 900 of a compression-based embodiment of a system and method for clustering host inventories.
- step 920 a compression technique may be used to transform host file inventories into a similarity matrix, which will be further described herein with reference to FIGS. 10 and 11 .
- step 930 a similarity-based clustering analysis can be performed on the similarity matrix.
- the similarity-based clustering analysis performed on the similarity matrix may include, for example, agglomerative hierarchical clustering or partitional clustering.
- the results of such clustering techniques may be stored in a memory element of central server (e.g., secondary storage 240 of computer 200 ), or may be stored in a database or other memory element external to central server 130 .
- step 940 one or more reports can be generated indicating the clustered groupings determined during the clustering analysis, as previously described herein with reference to FIG. 3 .
- reports for similarity-based clustering may include a textual report and/or a visual representation (e.g., a proximity plot, a dendrogram, heat maps of a similarity matrix where rows and columns have been merged to illustrate clusters, other cluster plots, etc.) enabling the user to view logical groupings of the selected hosts.
- an IT Administrator or other authorized user can apply common policies to computers within the logical groupings and remedial action may be taken on any identified outlier computers.
- outlier computers may be remediated to a standard software configuration as defined by the IT Administrators.
- FIG. 10 illustrates a matrix format 1000 used when generating a similarity matrix in one embodiment of the system and method of clustering host inventories.
- the similarity matrix is generated by applying a compression method to a plurality of host file inventories, each of which includes a set of file identifiers.
- each of the sets of file identifiers may represent one of the sets of executable files 112 a , 112 b , or 112 c on the corresponding selected host 110 a , 110 b , or 110 c .
- the following variables may be identified when generating a similarity matrix:
- Similarity matrix format 1000 includes n rows 1060 and n columns 1070 , with ‘n’ defining the number of dimensions of the resulting n-by-n (i.e., n ⁇ n) similarity matrix.
- Each entry 1080 is denoted by a variable with subscripts i and j (i.e., a i,j ) where i and j correspond to the respective row and column where the entry is located. For example, entry a 2,1 is found in row 2, column 1 of similarity matrix format 1000 .
- each entry a i,j has a numerical value representing the similarity distance between host H i and host H j with 1 representing the highest degree of similarity.
- the similarity distances represented by entries a 1,1 through a n,n can include any numerical value from 0 to 1, inclusively (i.e., 0 ⁇ a i,j ⁇ 1).
- a value of 1 in a i,j may indicate hosts H i and H j have identical host file inventories and therefore, identical sets of executable files
- a value of zero in a i,j may indicate hosts H i and H j have no common file identifiers in their respective host file inventories and therefore, no common executable files in their respective sets of executable files.
- the resulting similarity matrix can be provided as input data into a similarity-based clustering algorithm to create a cluster graph or plot showing logical groupings of hosts H 1 through H n having similar sets of executable files and outlier hosts having dissimilar sets of executable files.
- the clustering analysis performed on the resulting similarity matrix may include commonly available clustering techniques such as agglomerative hierarchical clustering or partitional clustering, as previously described herein with reference to clustering analysis of a vector matrix.
- FIG. 11 illustrates a flow 1100 using a compression method to transform host file inventories I 1 through I n of hosts H 1 through H n , respectively, into a similarity matrix.
- Flow 1100 corresponds to step 920 of FIG. 9 and may be implemented, at least in part, by host inventory preparation module 150 of central server 130 , shown in FIG. 1 .
- i is set to 1 in step 1110
- j is set to 1 in step 1115 .
- Variables ‘i’ and ‘j’ are used to construct the n ⁇ n similarity matrix for the selected plurality of hosts being clustered.
- Steps 1115 , 1175 , and 1180 form an outer loop iterating through the rows of hosts and steps 1120 through 1170 form an inner loop iterating through the columns of hosts.
- a list of file identifiers (e.g., checksums, checksums combined with a file path, checksums combined with one or more file attributes, etc.) representing a set of executable files on host H i are extracted from host file inventory I i and put in a file F i .
- a list of file identifiers representing a set of executable files on host H i are extracted from host file inventory I j and put in a file F j .
- files F i and F j are concatenated and put in file F ij .
- files F i , F j , and F ij are examples implementation of the system, and that memory buffers or any other suitable representation allowing concatenation, compression, and length determination of data may also be used.
- a compression utility such as, for example, gzip, bzip, bzip2, zlib, or zip compression utilities may be used to compress files F i , F j , and F ij .
- the list of file identifiers in files F i , F j , and F ij may be sorted to enable more accurate compression by the compression utility.
- file F i is compressed and the length of the result is represented as C.
- file F j is compressed and the length of the result is represented as C j .
- step 1150 file F ij is compressed and the length of the result is represented as C ij .
- normalized compression distance (NCD i,j ) between H i and H j is computed in step 1155 .
- NCD Normalized compression distance
- NID normalized information distance
- NCD i,j [C ij ⁇ min ⁇ C i ,C j ⁇ ]/max ⁇ C i ,C j ⁇
- step 1170 j is set to j+1.
- Flow then loops back to step 1120 where the inner loop of flow 1100 repeats and the similarity distance is computed between host H i and the next host H j with j j+1.
- j is not less than n (i.e., j ⁇ n)
- decision box 1175 if i is not less than n (i.e., i ⁇ n) then entries for all of the rows i through n have been computed and, therefore, the similarity matrix has been completed and flow ends.
- flow 1100 could be optimized in numerous ways.
- One optimization technique includes caching the lengths of compressed files C i and C j , which are used multiple times during flow 1100 to calculate entries 1080 in the similarity matrix.
- the extracted lists of file identifiers F i and F j may also be cached for use during flow 1100 .
- the matrix should be symmetric along the diagonal a 1,1 through a n,n . This symmetry could be used in the implementation of the system to compute only one-half of the matrix and then reflect the results over the diagonal.
- FIG. 12 shows an example similarity matrix 1200 generated by applying the compression method of flow 1100 of FIG. 11 to host file inventories I 1 through I 5 of the example selected plurality of hosts 600 of FIG. 6 .
- FIG. 12 shows hosts 1 through 5 (H 1 through H 5 ) corresponding to rows 1260 and columns 1270 , forming a 5 ⁇ 5 similarity matrix 1200 .
- Entries 1280 of similarity matrix 1200 include values from 0 to 1, inclusively. The closer the value is to 1, the closer the distance or greater the similarity of the corresponding hosts in row i, column j.
- each entry in matrix 1200 with the same host in the corresponding row and column (e.g, a 1,1 , a 2,2 , a 3,3 , etc.) has a value of 1 because the hosts, and therefore the host file inventories, are identical.
- each entry in similarity matrix 1200 in which the corresponding hosts H i and H j have respective executable file inventories I i and I j with no common executable files, has a value of zero (e.g., a 5,4 , a 4,5 ).
- Each of the host file inventories I 1 through I 5 includes a set of file identifiers representing one of the sets of executable files 601 , 602 , 603 , 604 , and 605 .
- Each executable file in a set of executable files is represented by a separate file identifier in the particular host file inventory.
- each file identifier includes a single token having a checksum configuration
- the following host file inventories of hosts H 1 through H 5 could include file identifiers D 1 through D 8 , which represent executable files 610 through 680 , respectively:
- step 1110 i is set to 1 and then the iterative looping begins to create an n ⁇ n (5 ⁇ 5) similarity matrix 1200 shown in FIG. 12 .
- step 1115 j is set to 1 and flow passes to steps 1120 through 1125 where the following variables can be determined:
- step 1155 After the NCD 1,1 value is computed in step 1155 , flow moves to step 1160 and a i,j is computed:
- the ‘1’ value is added to row i, column j (row 1, column 1) of similarity matrix 1200 .
- NCD i,j NCD 1,2
- NCD 1,2 NCD 1,2
- NCD 1,2 is used to compute a 1,2 as 0.25, which is added to row i, column j (row 1, column 2) of similarity matrix 1200 .
- the inner iterative loop then begins in step 1120 to determine the similarity distance between host file inventory I i (I 2 ) of host H i (H 2 ) and each host file inventory I j (I 1 through I 5 ).
- rows 1160 are successively filled with similarity distance values a i,j until each row has been completed.
- similarity matrix 1200 can be provided as input to a similarity-based clustering procedure, as previously described herein with reference to clustering techniques used with a vector matrix.
- Information generated from the clustering procedure could be provided in numerous ways, as previously described herein with reference to FIG. 9 .
- the information could be provided in a proximity plot such as example proximity plot 800 illustrated in FIG. 8 , which has been previously shown and described herein.
- Software for achieving the operations outlined herein can be provided at various locations (e.g., the corporate IT headquarters, end user computers, distributed servers in the cloud, etc.). In other embodiments, this software could be received or downloaded from a web server (e.g., in the context of purchasing individual end-user licenses for separate networks, devices, servers, etc.) in order to provide this system for clustering host inventories. In one example implementation, this software is resident in one or more computers sought to be protected from a security attack (or protected from unwanted or unauthorized manipulations of data).
- the software of the system for clustering host inventories in a computer network environment could involve a proprietary element (e.g., as part of a network security solution with McAfee® EPO software, McAfee® Application Control software, etc.), which could be provided in (or be proximate to) these identified elements, or be provided in any other device, server, network appliance, console, firewall, switch, information technology (IT) device, distributed server, etc., or be provided as a complementary solution (e.g., in conjunction with a firewall), or provisioned somewhere in the network.
- a proprietary element e.g., as part of a network security solution with McAfee® EPO software, McAfee® Application Control software, etc.
- IT information technology
- the clustering activities outlined herein may be implemented in software. This could be inclusive of software provided in central server 130 (e.g., via administrative module 140 , host inventory preparation module 150 and clustering module 160 ) and hosts 110 (e.g., via host inventory feed 114 ). These elements and/or modules can cooperate with each other in order to perform clustering activities as discussed herein. In other embodiments, these features may be provided external to these elements, included in other devices to achieve these intended functionalities, or consolidated in any appropriate manner. For example, some of the processors associated with the various elements may be removed, or otherwise consolidated such that a single processor and a single memory location are responsible for certain activities. In a general sense, the arrangement depicted in FIG. 1 may be more logical in its representation, whereas a physical architecture may include various permutations/combinations/hybrids of these elements.
- all of these elements include software (or reciprocating software) that can coordinate, manage, or otherwise cooperate in order to achieve the clustering operations, as outlined herein.
- One or all of these elements may include any suitable algorithms, hardware, software, components, modules, interfaces, or objects that facilitate the operations thereof.
- such a configuration may be inclusive of logic encoded in one or more tangible media (e.g., embedded logic provided in an application specific integrated circuit (ASIC), digital signal processor (DSP) instructions, software (potentially inclusive of object code and source code) to be executed by a processor, or other similar machine, etc.), which may be inclusive of non-transitory media.
- ASIC application specific integrated circuit
- DSP digital signal processor
- one or more memory elements can store data used for the operations described herein. This includes the memory element being able to store software, logic, code, or processor instructions that are executed to carry out the activities described in this Specification.
- a processor can execute any type of instructions associated with the data to achieve the operations detailed herein in this Specification.
- the processors could transform an element or an article (e.g., data) from one state or thing to another state or thing.
- the activities outlined herein may be implemented with fixed logic or programmable logic (e.g., software/computer instructions executed by a processor) and the elements identified herein could be some type of a programmable processor, programmable digital logic (e.g., a field programmable gate array (FPGA), an erasable programmable read only memory (EPROM), an electrically erasable programmable read only memory (EEPROM)) or an ASIC that includes digital logic, software, code, electronic instructions, or any suitable combination thereof.
- FPGA field programmable gate array
- EPROM erasable programmable read only memory
- EEPROM electrically erasable programmable read only memory
- any of the memory items discussed herein should be construed as being encompassed within the broad term ‘memory element.’
- any of the potential processing elements, modules, and machines described in this Specification should be construed as being encompassed within the broad term ‘processor.’
- processor any of the computers, servers, and other devices may also include suitable interfaces for receiving, transmitting, and/or otherwise communicating data or information in a network environment.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
- This disclosure relates in general to the field of computer network administration and support and, more particularly, to identifying similar software inventories on selected hosts.
- The field of computer network administration and support has become increasingly important and complicated in today's society. Computer network environments are configured for virtually every organization and usually have multiple interconnected computers (e.g., end user computers, laptops, servers, printing devices, etc.). Typically, each computer has its own set of executable software, each of which can be represented by an executable software inventory. For Information Technology (IT) administrators, congruency among executable software inventories of similar computers (e.g., desktops and laptops) simplifies maintenance and control of the network environment. Differences between executable software inventories, however, can arise in even the most tightly controlled network environments. In addition, each organization may develop its own approach to computer network administration and, consequently, some organizations may have very little congruency and may experience undesirable diversity of executable software on their computers. Particularly in very large organizations, executable software inventories may vary greatly among computers across departmental groups. Varied executable software inventories on computers within organizations present numerous difficulties to IT administrators to maintain, to troubleshoot, to service, and to provide uninterrupted access for business or other necessary activities. Innovative tools are needed to assist IT administrators to successfully support computer network environments with computers having incongruities between executable software inventories.
- To provide a more complete understanding of the present disclosure and features and advantages thereof, reference is made to the following description, taken in conjunction with the accompanying figures, wherein like reference numerals represent like parts, in which:
-
FIG. 1 is a pictorial representation of an exemplary network environment in which various embodiments of a system and method for clustering host inventories may be implemented in accordance with the present disclosure; -
FIG. 2 is a simplified block diagram of a computer, which may be utilized in embodiments in accordance with the present disclosure; -
FIG. 3 is a simplified flowchart illustrating a series of example steps associated with the system in accordance with one embodiment of the present disclosure; -
FIG. 4 illustrates an n×m vector matrix format used in accordance with an embodiment of the present disclosure; -
FIG. 5 is a flowchart illustrating a series of example steps for generating values for an n×m vector matrix as shown inFIG. 4 in accordance with one embodiment of the present disclosure; -
FIG. 6 illustrates an example selected group of hosts in a network environment to which embodiments of the present disclosure may be applied; -
FIG. 7 illustrates a vector matrix created by application of the flow ofFIG. 5 to the example selected group of hosts ofFIG. 6 ; -
FIG. 8 is an example cluster diagram of the hosts ofFIG. 6 that could be created from the system in accordance with embodiments of the present disclosure; -
FIG. 9 is a simplified flowchart illustrating a series of example steps associated with the system in accordance with another embodiment of the present disclosure; -
FIG. 10 illustrates an n×n similarity matrix format used in accordance with one embodiment of the present disclosure; -
FIG. 11 is a flowchart illustrating a series of example steps for generating values for an n×n similarity matrix as shown inFIG. 10 in accordance with one embodiment of the present disclosure; and -
FIG. 12 illustrates a similarity matrix created by application of the flow ofFIG. 11 to the example selected group of hosts ofFIG. 6 . - A method in one example implementation includes obtaining a plurality of host file inventories corresponding respectively to a plurality of hosts, calculating input data using the plurality of host file inventories, and providing the input data to a clustering procedure to group the plurality of hosts into one or more clusters of hosts. The method further includes each cluster of hosts being grouped using a predetermined similarity criteria. More specific embodiments include each of the plurality of host file inventories including a set of one or more file identifiers with each file identifier representing a different executable software file on a corresponding one of the plurality of hosts. In another more specific embodiment, the method includes each of the one or more file identifiers including a token sequence of one or more tokens. In other more specific embodiments, the calculating the input data includes transforming the plurality of host file inventories into a similarity matrix. In another more specific embodiment, the calculating the input data includes transforming the plurality of host file inventories into a matrix of keyword vectors in Euclidean space, where each keyword vector corresponds to one of the plurality of hosts.
-
FIG. 1 is a pictorial representation of acomputer network environment 100 in which embodiments of a system for clustering host inventories may be implemented in accordance with the present disclosure.Computer network environment 100 illustrates a network of computers including a plurality of 110 a, 110 b, and 110 c (referred to collectively herein as hosts 110), which may each have, respectively, a set ofhosts 112 a, 112 b, and 112 c (referred to collectively herein as sets of executable files 112) and aexecutable files 114 a, 114 b, and 114 c (referred to collectively herein as host inventory feeds 114). Hosts 110 may be operably connected to ahost inventory feed central server 130 throughcommunication link 120.Central server 130 may include anadministrative module 140, a hostinventory preparation module 150, and aclustering module 160. Amanagement console 170 can also be suitably connected tocentral server 130 to provide an interface for users such as Information Technology (IT) administrators, network operators, and the like. - In example embodiments, the system for clustering host inventories may be utilized to provide valuable information to users (e.g., IT administrators, network operators, etc.) identifying computers having similar operating systems and installed executable software files. In one example, when the system for clustering host inventories is applied to a computer network environment such as
network environment 100 ofFIG. 1 , software inventories from hosts 110 may be transformed by hostinventory preparation module 150 into input data for a clustering algorithm or procedure.Clustering module 160 may apply the clustering algorithm to the prepared input data to create a clustering diagram or other information identifying logical groupings of hosts 110 having similar operating systems and installed sets of executable files 112. The clustering diagram may also identify any outlier hosts 110 having significant differences in operating systems and/or executable files relative to the other hosts 110 innetwork environment 100. Thus, the IT administrator or other user is provided with valuable information that enables the discovery of trends and exceptions of computers, such as hosts 110, in the particular network environment. As a result, common policies may be applied to computers within logical groupings and remedial action may be taken on any identified outlier computers. - For purposes of illustrating the techniques of the system for clustering host inventories, it is important to understand the activities occurring within a given network. The following foundational information may be viewed as a basis from which the present disclosure may be properly explained. Such information is offered earnestly for purposes of explanation only and, accordingly, should not be construed in any way to limit the broad scope of the present disclosure and its potential applications.
- Typical network environments used in organizations and by individuals often include a plurality of computers such as end user desktops, laptops, servers, network appliances, and the like, and each may have an installed set of executable software. In large organizations, network environments may include hundreds or thousands of computers, which may span different buildings, cities, and/or geographical areas around the world. IT administrators may be tasked with the extraordinary responsibility of maintaining these computers in a way that minimizes or eliminates disruption to business activities.
- One difficulty IT administrators face includes maintaining multiple computers in a chaotic or heterogeneous network environment. In such an environment, congruency between executable software of the computers may be minimal. For example, executable files may be stored in different memory locations on different computers, different versions of executable files may be installed in different computers, executable files may be stored on some computers but not on others, and the like. Such networks may require additional time and resources to be adequately supported as IT administrators may need to individualize policies, maintenance, upgrades, repairs, and/or any other type of support to suit particular computers having nonstandard executable software and/or operating systems.
- Homogenous network environments, in which executable software of computers are congruent or at least similar, may also benefit from a system and method for clustering host inventories. In homogeneous environments or substantially homogeneous environments, particular computers may occasionally deviate from standard computers within the network environment. For example, malicious software may break through the various firewalls and other network barriers creating one or more deviant computers. In addition, end users of computers may install various executable software files from transportable disks or download such software creating deviant computers. In accordance with the present disclosure, a system for clustering host inventories could readily identify any outliers having nonstandard and possibly malicious executable software.
- A system and method for clustering host inventories, as outlined in
FIG. 1 , could greatly enhance abilities of IT administrators or other users managing computer networks to effectively support both heterogeneous and homogeneous network environments. The system, which may be implemented in a computer such asserver 130, enables identification of logical groupings of computers with similar executable file inventories and identification of outliers (e.g., computers with drastically different executable file inventories). In accordance with one example implementation, host file inventories of executable files from hosts 110 are provided for evaluation. The host file inventories are transformed into input data for a clustering algorithm. Once the input data is prepared, the clustering algorithm is applied and one or more diagrams or charts may be created to show logical clusters or groupings of hosts 110 having the same or similar software inventories. In addition, the diagrams or charts may also show any of the hosts 110 that drastically deviate from other hosts 110. Thus, the system provides network or IT administrators with valuable information that may be used to more effectively manage hosts 110 withinnetwork environment 100. - Note that in this Specification, references to various features (e.g., elements, structures, modules, components, steps, etc.) included in “one embodiment”, “example embodiment”, “an embodiment”, “another embodiment”, “some embodiments”, “various embodiments”, “one example”, “other embodiments”, and the like are intended to mean that any such features may be included in one or more embodiments of the present disclosure, but may or may not necessarily be included in the same embodiments.
- Turning to the infrastructure of
FIG. 1 , theexample network environment 100 may be configured as one or more networks and may be configured in any form including, but not limited to, local area networks (LANs), wide area networks (WANs) such as the Internet, or any combination thereof. In some embodiments,communication link 120 may represent any electronic link supporting a LAN environment such as, for example, cable, Ethernet, wireless (e.g., WiFi), ATM, fiber optics, etc. or any combination thereof. In other embodiments,communication link 120 may represent a remote connection tocentral server 130 through any appropriate medium (e.g., digital subscriber lines (DSL), telephone lines, T1 lines, T3 lines, wireless, satellite, fiber optics, cable, Ethernet, etc. or any combination thereof) and/or through any additional networks such as a wide area network (e.g., the Internet). In addition, gateways, routers and the like may be used to facilitate electronic communication between hosts 110 andcentral server 130. - In an example embodiment, hosts 110 may represent end user computers that could be operated by end users. The end user computers may include desktops, laptops, and mobile or handheld computers (e.g., personal digital assistants (PDAs) or mobile phones). Hosts 110 can also represent other computers (e.g., servers, appliances, etc.) having executable software, which could be similarly evaluated and clustered by the system, using executable file inventories derived from sets of executable files 112 on such hosts 110. It should be noted that the network configurations and interconnections shown and described herein are for illustrative purposes only.
FIG. 1 is intended as an example and should not be construed to imply architectural limitations in the present disclosure. - Sets of executable files 112 on hosts 110 can include all executable files on respective hosts 110. In this Specification, references to “executable file”, “program file”, “executable software file”, and “executable software” are meant to encompass any software file comprising instructions that can be understood and processed by a computer such as executable files, library modules, object files, other executable modules, script files, interpreter files, and the like. In one embodiment, the system could be configured to allow the IT administrator to select a particular type of executable file to be clustered. For example, an IT Administrator may choose only dynamic-link library (DLL) modules for clustering. Thus, sets of executable files 112 would include only DLL modules on the respective hosts 110. In addition, the IT administrator may also be permitted to select particular hosts to which clustering is applied. For example, all end user computers in a network or within a particular part of the network may be selected. In another example, all servers within a network or within a particular part of the network may be selected.
-
Central server 130 ofnetwork environment 100 represents an exemplary server or other computer linked to hosts 110, which may provide services to hosts 110. The system of clustering host inventories may be implemented incentral server 130 using various embodiments of hostinventory preparation module 150 andclustering module 160. For example, keyword techniques may be used with vector based clustering in one example embodiment. In this example embodiment, hostinventory preparation module 150 creates an (n×m) vector matrix where columns of the matrix may correspond to a determined number (i.e., “m”) of unique keywords, each of which is associated with one or more executable files in a selected number (i.e., “n”) of hosts. The rows of the vector matrix may correspond to the n selected hosts.Clustering module 160 can then apply a clustering algorithm to the vector matrix to create logical groupings of the n selected hosts. In another example embodiment, compression techniques may be used with similarity based clustering. In this example embodiment, hostinventory preparation module 150 may create an (n×n) similarity matrix using compression techniques for a selected number (i.e., “n”) of hosts.Clustering module 160 may then apply a clustering algorithm to the similarity matrix to create logical groupings of the n selected hosts. In one embodiment, selected hosts may include all of the hosts 110 in a particular network environment such asnetwork environment 100. In other embodiments, selected hosts may include particular hosts selected by a user or predefined by policy, with such hosts existing in one or more network environments. -
Management console 170 linked tocentral server 130 may provide viewable cluster data for the IT administrators or other authorized users.Administrative module 140 may also be incorporated to allow IT administrators or other authorized users to add the logical groupings from a cluster analysis to an enterprise management system and to apply common policies to selected groupings. In addition, deviant or exceptional groupings or outliers can trigger various remedial actions (e.g., emails, vulnerability scans, etc.). In addition,management console 170 may also provide a user interface for the IT Administrator to select particular hosts and/or particular types of executable files to be included in the clustering procedure, in addition to other user provided configuration data for the system. One exemplary enterprise management system that could be used includes McAfee® electronic Policy Orchestrator (ePO) software manufactured by McAfee, Inc. of Santa Clara, Calif. - Turning to
FIG. 2 ,FIG. 2 is a simplified block diagram of a general orspecial purpose computer 200, such as hosts 110,central server 130, or other computing devices connected to networkenvironment 100.Computer 200 may include various components such as aprocessor 220, amain memory 230, asecondary storage 240, anetwork interface 250, auser interface 260, and aremovable memory interface 270. Abus 210, such as a system bus, may provide electronic communication betweenprocessor 210 and the other components, memory, and interfaces ofcomputer 200. -
Processor 220, which may also be referred to as a central processing unit (CPU), can include any general or special-purpose processor capable of executing machine readable instructions and performing operations on data as instructed by the machine readable instructions.Main memory 230 may be directly accessible toprocessor 220 for accessing machine instructions and can be in the form of random access memory (RAM) or any type of dynamic storage (e.g., dynamic random access memory (DRAM)).Secondary storage 240 can be any non-volatile memory such as a hard disk, which is capable of storing electronic data including executable software files. Externally stored electronic data may be provided tocomputer 200 throughremovable memory interface 270.Removable memory interface 270 may provide connection to any type of external memory such as compact discs (CDs), digital video discs (DVDs), flash drives, external hard drives, or any other external media. -
Network interface 250 can be any network interface controller (NIC) that provides a suitable network connection betweencomputer 200 and any networks to whichcomputer 200 connects for sending and receiving electronic data. For example,network interface 250 could be an Ethernet adapter, a token ring adapter, or a wireless adapter. Auser interface 260 may be provided to allow a user to interact with thecomputer 200 via any suitable means, including a graphical user interface display. In addition, any appropriate input mechanism may also be included such as a keyboard, mouse, voice recognition, touch pad, input screen, etc. - Not shown in
FIG. 2 is additional hardware that may be suitably coupled toprocessor 220 andbus 210 in the form of memory management units (MMU), additional symmetric multiprocessing (SMP) elements, read only memory (ROM), peripheral component interconnect (PCI) bus and corresponding bridges, small computer system interface (SCSI)/integrated drive electronics (IDE) elements, etc. Any suitable operating systems will also be configured incomputer 200 to appropriately manage the operation of hardware components therein. These elements, shown and/or described with reference tocomputer 200, are intended for illustrative purposes and are not meant to imply architectural limitations of computers such as hosts 110 andcentral server 130, utilized in accordance with the present disclosure. As used herein in this Specification, the term ‘computer’ is meant to encompass any personal computers, network appliances, routers, switches, gateways, processors, servers, load balancers, firewalls, or any other suitable device, component, element, or object operable to affect or process electronic information in a network environment. - Turning to
FIG. 3 , an example system flow 300 of a keyword-based embodiment of the system and method for clustering host inventories is illustrated. Flow may begin atstep 310 where host file inventories (I1 through In, with n=number of selected hosts) are generated for each of the selected hosts 110. Each host file inventory can include a set of file identifiers, with each file identifier representing a different executable file in the set of 112 a, 112 b, or 112 c of the corresponding selectedexecutable files 110 a, 110 b, or 110 c. Each file identifier may include a sequence of one or more tokens associated with the executable file represented by the file identifier. In one embodiment, the user may be provided with the option of choosing the number of tokens and the configuration of each token. For example, a simple file identifier could include a single token having a checksum configuration. A checksum can be a mathematical value or hash sum (e.g., a fixed string of numerical digits) derived by applying an algorithm to an executable file. If the algorithm is applied to another executable file that is identical to the first executable file, then the checksums should match. However, if the other executable file is different in any way (e.g., different type of software program, same software program but different version, same software program that has been altered in some way, etc.), then the checksums are very unlikely to match. Thus, the same executable file stored in different hosts or stored in different locations on disk of the same host should have identical checksums.host - In other examples, more complex file identifiers could be selected to provide a higher level of distinctiveness of an executable file. In one such example, a file identifier could include a sequence of first and second tokens having a checksum configuration and file path configuration, respectively, where the file path indicates where the executable file is stored on disk in the particular host in which it is installed. Thus, if identical executable files X and Y are installed on
host 110 a andhost 110 b, respectively, but are stored in different locations of memory, then the first token of the file identifier generated for executable file X onhost 110 a could be the same as the first token of the file identifier generated for executable file Y onhost 110 b. However, the second token of the file identifier generated for executable file X onhost 110 a could be different than the second token of the file identifier generated for executable file Y onhost 110 b. - Numerous other file identifiers may be configured by using any number of tokens and configuring the tokens to include any combination of available program file attributes, checksums, and/or file paths. Program file attributes may include, for example, creation date, modification date, security settings, vendor name, and the like. Although file identifiers may be configured with any number of such tokens, an executable file without a particular program file attribute, which is selected as one of the tokens, may have a file identifier with only the tokens available to that executable file. For example, if the file identifier is configured to include a first token (e.g., a checksum) and a second token (e.g., a vendor name), then an executable file without an embedded vendor name would have a file identifier with only a first token corresponding to the file checksum. In contrast, an executable file having an embedded vendor name would have a file identifier with both first and second tokens corresponding to the file checksum and vendor name, respectively.
- The file identifiers and resulting host file inventories I1 through In may be provided by various implementations. In one embodiment, the file identifiers and resulting host file inventories may be generated by host inventory feeds 114 for each host 110 and pushed to
central server 130. For embodiments in which a user configures the file identifier by selecting a number of tokens for the token sequence and by selecting individual token configurations,central server 130 may provide the user selected configuration criteria to each host 110. Host inventory feeds 114 may then generate file identifiers with token sequences having the particular user-selected configuration. In another embodiment, checksums for each executable file may be generated on hosts 110 by host inventory feeds 114 and then pushed tocentral server 130 along with other file attributes and file paths such that hostinventory preparation module 150 ofcentral server 130 can generate the file identifiers and resulting host file inventories for each of the selected hosts 110. In one embodiment, enumeration of executable files from the sets of executable files 112 of selected hosts 110 can be achieved by existing security technology such as, for example, Policy Auditor software or Application Control software, both manufactured by McAfee, Inc. of Santa Clara, Calif. - Referring again to
FIG. 3 , after file identifiers and host file inventories have been determined for all of the selected hosts 110 instep 310, flow then moves to step 320 where a keyword method is used to transform host file inventories I1 through In into a vector matrix, which will be further described herein with reference toFIGS. 4 and 5 . Once a vector matrix is created, flow moves to step 330 where a vector-based clustering analysis is performed on the vector matrix. Exemplary types of clustering analysis that may be performed on the vector matrix include agglomerative hierarchical clustering and partitional clustering. The results of such clustering techniques may be stored in a memory element of central server (e.g.,secondary storage 240 of computer 200), or may be stored in a database or other memory element external tocentral server 130. - After vector-based clustering has been performed on the vector matrix in
step 330, flow moves to step 340 where one or more reports can be generated indicating the clustered groupings determined during the clustering analysis and can be provided to authorized users by various methods (e.g., screen displays, electronic files, hard copies, emails, etc.). Exemplary reports may include a textual report and/or a visual representation (e.g., a proximity plot, a dendrogram, heat maps of a permuted keyword matrix, heat maps of a reduced keyword matrix where rows and columns have been merged to illustrate clusters, other cluster plots, etc.) enabling the user to view logical groupings of the selected hosts. For example, after the clustering analysis has been performed, a graphical user interface ofmanagement console 170 may display a proximity plot having physical representations of each host, with identifiable logical groupings (e.g., uniquely colored groupings, circled or otherwise enclosed groupings, representations of groupings with connected lines, etc.). Once the similar groupings and outlier hosts have been identified, an IT Administrator or other authorized user can apply common policies to hosts within the logical groupings and remedial action may be taken on any identified outlier hosts. For example, outlier hosts may be remediated to a standard software configuration as defined by the IT Administrators. - Turning to
FIG. 4 ,FIG. 4 illustrates amatrix format 400 used when generating a vector matrix in one embodiment of the system and method of clustering host inventories. In this embodiment, host file inventories are transformed into a vector matrix in Euclidean space using a keyword method. The following variables may be identified when generating a vector matrix: -
- n=number of selected hosts
- m=number of keywords
- Hi=host in network, with i=1 to n (e.g., H1=
host 110 a, H2=host 110 b, H3=host 110 c) - Kj=unique keyword, with j=1 to m
- Ii=host file inventory on Hi, with i=1 to n
- The number of keywords associated with an executable file equals the number of tokens in the token sequence of the file identifier representing the executable file. Therefore, one or more keywords can be associated with each executable file in sets of executable files 112 of selected hosts 110. In addition, each keyword could be associated with multiple executable files in the same or different hosts. Thus, a keyword sequence km may be defined as a sequence of unique keywords K1 through Km, where each keyword is associated with one or more executable files in sets of executable files 112 of all selected hosts 110.
-
Vector matrix format 400 includesn rows 460 and mcolumns 470, with n and m defining the dimensions of the resulting n-by-m (i.e., n×m) vector matrix. Each row ofvector matrix format 400 is denoted by a unique host Hi (i=1 to n), and each column is denoted by a unique keyword Kj (j=1 to m) of keyword sequence km. Eachentry 480 is denoted by a variable with subscripts i and j (i.e., ai,j) where i and j correspond to the respective row and column where the entry is located. For example, entry a2,1 is found inrow 2,column 1 ofvector matrix format 400. Each row of entries represents a 410, 420, and 430 for its corresponding host H1, H2 and Hn. For example, a1,1, a1,2, through a1,m definerow vector row vector 410 for host H1. Once each of theentries 480 has been filled with a determined value, 410, 420, through 430 can be provided as input data to a vector-based clustering algorithm to create a cluster graph or plot showing logical groupings of hosts H1 through Hn, having similar inventories of executable files and any host outliers having dissimilar host inventories.row vectors - Turning to
FIG. 5 ,FIG. 5 illustrates aflow 500 using a keyword method to transform host file inventories into a list of vectors in Euclidean space, represented byvector matrix format 400.Flow 500 corresponds to step 320 inflow 300 ofFIG. 3 and may be implemented, at least in part, by hostinventory preparation module 150 ofcentral server 130 shown inFIG. 1 . Flow may begin atstep 510 to determine keyword sequence km, which is a sequence of m unique keywords (km=K1, K2, . . . Km) and is a basis for m-dimensional keyword space. In one embodiment, to determine km the file identifiers of all host file inventories of selected hosts 110 can be evaluated to find each unique keyword. In one example, each of the file identifiers of the host file inventories (I1 through In) includes a sequence of first and second tokens with the first token having a file checksum configuration and the second token having a file path configuration. If the same version of Microsoft® Word software is installed on each of the selected hosts 110 a, 110 b, and 110 c in the same location on disk, then each file identifier representing the Microsoft® Word software in each of the host file inventories includes a first token containing a checksum of the Microsoft® Word software and a second token containing a file path of the software. In this example, keyword sequence km could include a first keyword (K1) containing the checksum, which is the same on each of the selected hosts 110 a, 110 b, and 110 c, and a second keyword (K2) containing the file path, which is the same on each of the selected hosts 110 a, 110 b, and 110 c. - Once keyword sequence km has been determined, the algorithm of
flow 500 computes a list of position vectors for an n×m vector matrix. Variables ‘i’ and ‘j’ are used to construct the vector matrix having n×mvector matrix format 400, in m-dimensional keyword space, for each host Hi by iterating over j through km and producing appropriate values for the position vectors indicating whether each host file inventory Ii contains each keyword Kj. - The iterative flow to find keywords of keyword sequence km in file identifiers of host inventories is illustrated in
steps 520 through 575 ofFIG. 5 . Instep 520, variable i is set to 1 and 530, 570, and 575 form an outer loop iterating through the hosts. Variable j is set to 1 insteps step 530 andsteps 540 through 565 form an inner loop iterating over j through km. After variables i and j are set to 1, flow moves to step 540 where keyword Kj is retrieved. Flow moves todecision box 545 where a query is made as to whether keyword Kj is found in host file inventory Ii of host Hi. Thus, host file inventory Ii of host Hi is searched for a file identifier containing keyword Kj. If keyword Kj is not found in host file inventory Ii then flow moves to step 550 where row i, column j (i.e., ai,j) in the vector matrix may be updated with an appropriate value indicating keyword Kj was not found in host file inventory Ii. However, if instep 545, keyword Kj is found in host file inventory Ii, then flow moves to step 555 where row i, column j (i.e., ai,j) in the vector matrix may be updated with an appropriate value indicating keyword Kj was found in host file inventory Ii. - The values of entries ai,j in the vector matrix, which indicate whether keyword Kj is found in a host file inventory Ii, may vary depending upon the particular implementation of the system. In one embodiment, an entry ai,j is assigned a ‘1’ value in
step 555, indicating keyword Kj was found in host file inventory Ii, or a ‘0’ value instep 560, indicating keyword Kj was not found in host file inventory Ii. Thus, in this embodiment, vector matrix contains only ‘0’ and/or ‘1’ values. In another embodiment, entry ai,j is assigned a value in 555 or 550 corresponding to a frequency of occurrence of keyword Kj in host file inventory Ii. For example, assume file identifiers in a host file inventory I1 include a first token configured as a checksum and a second token configured as a vendor name, with three executable files on host H1 having the same embedded vendor name, XYZ, resulting in keyword K2 of keyword sequence km being assigned the embedded vendor name XYZ. In this embodiment, when host file inventory I1 of host H1 is searched for keyword K2, entry a1,2 could be updated with a value of 3 because of the three occurrences of vendor name XYZ in file identifiers of host file inventory I1. Thus, in this embodiment, vector matrix may contain ‘0’ values and/or positive integer values.step - After row i, column j is filled with an appropriate value in
555 or 550, flow moves tostep decision box 560 where a query is made as to whether j<m. If j<m, then host file inventory Ii of host Hi has not been checked for all of the keywords in km. Therefore, flow moves to step 565 where j is set to j+1, and flow loops back to step 540 to get the next keyword Kj (with j=j+1) in km and search for Kj in host file inventory Ii. If, however, indecision box 560 it is determined that j is not less than m (i.e., j≧m), then host file inventory Ii has been searched for all of the keywords K1 through Km in keyword sequence km, so flow moves todecision box 570, which is part of the outer loop offlow 500. A query is made indecision box 570 to determine whether i<n, and if i<n, then not all of the hosts have been evaluated to generate corresponding keyword vectors. Therefore, flow moves to step 575 where i is set to i+1, and flow loops back tostep 530. Instep 530, j is set to 1 again, so that a vector for the next host Hi (with i=i+1) can be generated by inner loop steps 540 through 565. With reference again todecision box 570, if i is not less than n in), then all of the hosts H1 through Hn have been evaluated such that all of the vectors have been created in n×m vector matrix and, therefore, the flow ends. - The embodiment of the
flow 500 shown inFIG. 5 creates a vector in keyword space successively for each host H1 through Hn. Other embodiments, however, could be configured to switch the inner and outer loops inflow 500 such that for each keyword Kj, a column of vector matrix entries is produced by iterating over i through hosts Hi and producing an appropriate value when keyword Kj is found in host file inventory Ii of host Hi and an appropriate value when Kj is not found in host file inventory Ii of host Hi. This processing could be repeated until all columns are filled, thereby generating the list of vectors inrows 1 through n. - The clustering analysis performed on the resulting vector matrix may include commonly available clustering techniques such as agglomerative hierarchical clustering or partitional clustering. In agglomerative hierarchical clustering, each element begins as a separate cluster and elements are merged into successively larger clusters, which may be represented in a tree structure called a dendrogram. A root of the tree represents a single cluster of all of the elements and the leaves of the tree represent separated clusters of the elements. Generally, merging schemes in agglomerative hierarchical clustering used to achieve logical groupings may include schemes well-known in the art such as single-link (i.e., the distance between clusters is equal to the shortest distance from any member of one cluster to any member of another cluster), complete-link (i.e., the distance between clusters is equal to the greatest distance from any member of one cluster to any member of another cluster), group-average (i.e., the distance between clusters is equal to the average distance from any member of one cluster to any member of another cluster), and centroid (i.e., the distance between clusters is equal to the distance from the center of any one cluster to the center of another cluster).
- Known techniques may be implemented in which predetermined similarity criteria sets the point at which clustering is halted (e.g., cut point determination). Cut point determination may be made, for example, at a specified level of similarity or when consecutive similarities are the greatest, which is known in the art. As an example, a tree structure representing clusters could be cut at a predetermined height resulting in more or less clusters depending on the selected height at which the cut is made. Cut point determinations may be determined based on a particular network environment or particular hosts being clustered. In one example embodiment, an IT administrator or other authorized user could define the cut point determination used by the clustering procedure by determining a desired threshold for similarity based on the particular network environment.
- In other embodiments, partitional clustering may be used. Partitional clustering typically involves an algorithm that determines all clusters at one time. In partitional clustering, predetermined similarity criteria may provide, for example, a selected number of clusters to be generated or a maximum diameter for the clusters. One exemplary software package that implements these various clustering techniques is CLUTO Software for Clustering High-Dimensional Datasets developed by George Karypis, Professor at the Department of Computer Science & Engineering, University of Minnesota, Minneapolis and Saint Paul, Minn., which may be found on the World Wide Web at http://glaros.dtc.umn.edu/gkhome/view/cluto.
- Turning to
FIGS. 6 , 7, and 8, an example selected plurality ofhosts 600 with executable files, avector matrix 700 generated using the executable files of selectedhosts 600, and an example resultingcluster plot 800 are illustrated, respectively. InFIG. 6 , host 1 (H1) is shown with a set ofexecutable files 601 including 610, 620, 630, and 640. Host 2 (H2) is shown with a set ofexecutable files executable files 602 including 610, 650, 660, and 670. Host 3 (H3) is shown with a set ofexecutable files executable files 603 including 610, 620, 630, and 680. Host 4 (H4) is shown with a set ofexecutable files executable files 604 including 610, 650, and 660. Host 5 (H5) is shown with a set ofexecutable files executable files 605 including 640, 670, and 680.executable files -
FIG. 7 illustrates the resultingvector matrix 700 after the keyword method offlow 500 has been applied to the sets ofexecutable files 601 through 605 of the selected plurality ofhosts 600 ofFIG. 6 .Vector matrix 700 shows hosts 1 through 5 (H1 through H5) corresponding torows 760 containing 710, 720, 730, 740, and 750, respectively.keyword vectors Columns 770 ofvector matrix 700 are designated by keywords K1 through K8. In thisexample vector matrix 700,entries 780 include a ‘1’ indicating that keyword Kj is contained in a file identifier of host file inventory Ii, or a ‘0’ indicating that keyword Kj is not contained in a file identifier of host file inventory Ii. - In the example scenario of applying the keyword method of
flow 500 to the sets ofexecutable files 601 through 605 of selected plurality ofhosts 600 in order to createvector matrix 700, the following variables may be identified: -
- n=5 (host computers)
- H1 through H5=hosts in network (e.g., H1=
host 1, H2=host 2, etc.) - I1 through I5=host file inventories representing sets of
executable files 601 through 605, respectively - m=8 (keywords)
- K1 through K8=unique keywords
Each of the host file inventories I1 through I5 includes a set of file identifiers representing one of the sets of 601, 602, 603, 604, and 605, respectively. Each executable file in a set of executable files is represented by a separate file identifier in the particular host file inventory. In this exemplary scenario, file identifiers each include a first token having a checksum configuration. Unique keywords are determined among all sets of executable files of selected hosts H1 through H5. Thus, 8 unique keywords may be determined for selected hosts 600:executable files - K1=checksum for
executable file 610 - K2=checksum for
executable file 620 - K3=checksum for
executable file 630 - K4=checksum for
executable file 640 - K5=checksum for
executable file 650 - K6=checksum for
executable file 660 - K7=checksum for
executable file 670 - K8=checksum for
executable file 680
A keyword sequence km can then be created instep 520 with the 8 unique keywords: - km=K1K2K3K4K5K6K7K8
Thus, in this example scenario, the following host file inventories I1 through I5 could include file identifiers having first tokens equivalent to the following keywords: - I1→K1, K2, K3, K4
- I2→K1, K5, K6, K7
- I3→K1, K2, K3, K8
- I4→K1, K5, K6
- I5→K4, K7, K8
- Once keyword sequence km is determined, flow moves to step 520 where variable i is set to 1 and then the iterative flow begins to create n×m (5×8)
vector matrix 700 shown inFIG. 7 . Instep 530, variable j is set to 1 and keyword Kj (K1) is retrieved from km instep 540. Flow moves todecision box 545 where host file inventory Ii (I1) of host Hi (H1) is searched for keyword Kj (K1). In this example, keyword K1 is found in host file inventory I1 of host H1, so flow moves to step 555 where a ‘1’ entry is added to row i, column j (row 1, column 1) ofvector matrix 700. Aftervector matrix 700 has been updated flow moves todecision box 560 where a query is made as to whether j<m. Since 1 is less than 8, the flow moves to step 565 where j is set to 2 (i.e., j=j+1). Flow then loops back to step 540 to search for the next keyword Kj (K2) in host file inventory Ii (I1) of host Hi (H1). In this case, keyword K2 is found in host file inventory I1 so a ‘1’ entry is added to row i, column j (row 1, column 2) ofvector matrix 700. The variable j is still less than 8, (i.e., 2<8) as determined indecision box 560, so flow moves to step 565 and j is set to 3 (i.e., j=j+1). This iterative processing continues for each value of j until j=8, thereby filling in eachentry 780 ofkeyword vector 710 for host Hi (H1). - After the
last entry 780 ofkeyword vector 710 has been added tovector matrix 700, flow moves todecision box 560 where a query is made as to whether j<m (i.e., Is 8<8?). Because j is not less than 8, flow moves todecision box 570 where a query is made as to whether i<n (i.e., Is 1<5?). Because 1 is less than 5, flow moves to step 575 where i is set to 2 (i.e., i=i+1) and flow loops back to step 530 where j is set to 1. The inner iterative loop then begins instep 540 to search for all keywords in host file inventory Ii (I2) of host Hi (H2) beginning with keyword Kj (K1). Thus, in the embodiment used in this example scenario,rows 760 are successively filled with a ‘1’ or a ‘0’ value for each entry ai,j until eachvector row 710 through 750 has been completed. As previously discussed herein, however, another embodiment provides that each entry ai,j inrows 760 could be filled with a value corresponding to the frequency of occurrence of keyword Kj found in host file inventory Ii. -
Vector matrix 700 can be provided as input data to a vector based clustering procedure, as previously described herein. Information generated from the clustering procedure could be provided in numerous ways such as, for example, reports, screen displays, files, emails, etc. In one example, the information could be provided in a proximity plot such asexample proximity plot 800 illustrated inFIG. 8 .Proximity plot 800 is an example graph that could be created by a vector-based clustering procedure applied tovector matrix 700 ofFIG. 7 . If agglomerative hierarchical clustering is used, 810, 820, and 830 may be determined based on a cut point determination. If partitional clustering is used,clusters 810, 820, and 830 may be generated based on a predetermined number of clusters.clusters Proximity plot 800 shows two clusters and one outlier.Cluster 810 represents hosts H1 and H3 andcluster 820 represents hosts H2 and H4. Hosts H1 and H3 may be clustered together because they have three common 610, 620, and 630. Hosts H2 and H4 may be clustered together because they also have three commonexecutable files 610, 650, and 660.executable files Outlier 830 represents host H5, which may be indicated as an outlier, because it has, in this example, none or only one common executable file with each of the other hosts. Although the clustering information is displayed onproximity plot 800 shown inFIG. 8 , other textual reports and/or visual representations, as previously described herein with reference toFIG. 3 , may be used to show 810 and 820 andclusters outlier 830. - Turning to
FIG. 9 ,FIG. 9 illustrates an example system flow 900 of a compression-based embodiment of a system and method for clustering host inventories. Flow may begin atstep 910 where file identifiers and host file inventories (I1 through In, with n=number of selected hosts) may be generated for each of the selected hosts 110, as previously described herein with reference toFIG. 3 . - After file identifiers and host file inventories have been determined for each of the selected hosts 110, flow then moves to step 920 where a compression technique may be used to transform host file inventories into a similarity matrix, which will be further described herein with reference to
FIGS. 10 and 11 . Once a similarity matrix is created, flow moves to step 930 where a similarity-based clustering analysis can be performed on the similarity matrix. The similarity-based clustering analysis performed on the similarity matrix may include, for example, agglomerative hierarchical clustering or partitional clustering. The results of such clustering techniques may be stored in a memory element of central server (e.g.,secondary storage 240 of computer 200), or may be stored in a database or other memory element external tocentral server 130. - After similarity-based clustering has been performed on the similarity matrix in
step 930, flow moves to step 940 where one or more reports can be generated indicating the clustered groupings determined during the clustering analysis, as previously described herein with reference toFIG. 3 . Such reports for similarity-based clustering may include a textual report and/or a visual representation (e.g., a proximity plot, a dendrogram, heat maps of a similarity matrix where rows and columns have been merged to illustrate clusters, other cluster plots, etc.) enabling the user to view logical groupings of the selected hosts. Once the similar groupings and outlier hosts have been identified, an IT Administrator or other authorized user can apply common policies to computers within the logical groupings and remedial action may be taken on any identified outlier computers. For example, outlier computers may be remediated to a standard software configuration as defined by the IT Administrators. - Turning to
FIG. 10 ,FIG. 10 illustrates amatrix format 1000 used when generating a similarity matrix in one embodiment of the system and method of clustering host inventories. The similarity matrix is generated by applying a compression method to a plurality of host file inventories, each of which includes a set of file identifiers. As an example, each of the sets of file identifiers may represent one of the sets of 112 a, 112 b, or 112 c on the corresponding selectedexecutable files 110 a, 110 b, or 110 c. In addition, the following variables may be identified when generating a similarity matrix:host -
- n=number of selected hosts
- Hi=host in network, with i=1 to n (e.g., H1=
host 110 a, H2=host 110 b, H3=host 110 c) - Hj=host in network, with j=1 to n (e.g., H1=
host 110 a, H2=host 110 b, H3=host 110 c) - Ii=host file inventory of Hi
- Ij=host file inventory of Hj
-
Similarity matrix format 1000 includesn rows 1060 andn columns 1070, with ‘n’ defining the number of dimensions of the resulting n-by-n (i.e., n×n) similarity matrix. Each row ofsimilarity matrix format 1000 is denoted by host Hi (i=1 to n), and each column is denoted by host Hj (j=1 to n). Eachentry 1080 is denoted by a variable with subscripts i and j (i.e., ai,j) where i and j correspond to the respective row and column where the entry is located. For example, entry a2,1 is found inrow 2,column 1 ofsimilarity matrix format 1000. - When a similarity matrix is created in accordance with one embodiment of this disclosure, each entry ai,j has a numerical value representing the similarity distance between host Hi and host Hj with 1 representing the highest degree of similarity. In one embodiment, the similarity distances represented by entries a1,1 through an,n can include any numerical value from 0 to 1, inclusively (i.e., 0≦ai,j≦1). In this embodiment, the closer ai,j is to 1, the greater the similarity is between host file inventories Ii and Ij of hosts Hi and Hj, and the closer ai,j is to zero, the greater the difference is between host file inventories Ii and Ij of hosts Hi and Hj. Thus, a value of 1 in ai,j may indicate hosts Hi and Hj have identical host file inventories and therefore, identical sets of executable files, whereas a value of zero in ai,j may indicate hosts Hi and Hj have no common file identifiers in their respective host file inventories and therefore, no common executable files in their respective sets of executable files. Once each of the
entries 1080 has been filled with a calculated value, the resulting similarity matrix can be provided as input data into a similarity-based clustering algorithm to create a cluster graph or plot showing logical groupings of hosts H1 through Hn having similar sets of executable files and outlier hosts having dissimilar sets of executable files. The clustering analysis performed on the resulting similarity matrix may include commonly available clustering techniques such as agglomerative hierarchical clustering or partitional clustering, as previously described herein with reference to clustering analysis of a vector matrix. - Turning to
FIG. 11 ,FIG. 11 illustrates aflow 1100 using a compression method to transform host file inventories I1 through In of hosts H1 through Hn, respectively, into a similarity matrix.Flow 1100 corresponds to step 920 ofFIG. 9 and may be implemented, at least in part, by hostinventory preparation module 150 ofcentral server 130, shown inFIG. 1 . Whenflow 1100 begins, i is set to 1 instep 1110 and j is set to 1 instep 1115. Variables ‘i’ and ‘j’ are used to construct the n×n similarity matrix for the selected plurality of hosts being clustered. 1115, 1175, and 1180 form an outer loop iterating through the rows of hosts andSteps steps 1120 through 1170 form an inner loop iterating through the columns of hosts. - In
step 1120, a list of file identifiers (e.g., checksums, checksums combined with a file path, checksums combined with one or more file attributes, etc.) representing a set of executable files on host Hi are extracted from host file inventory Ii and put in a file Fi. Instep 1125, a list of file identifiers representing a set of executable files on host Hi are extracted from host file inventory Ij and put in a file Fj. Instep 1130, files Fi and Fj are concatenated and put in file Fij. It will be apparent that the use of files Fi, Fj, and Fij to store file identifiers is an example implementation of the system, and that memory buffers or any other suitable representation allowing concatenation, compression, and length determination of data may also be used. - After files Fi, Fj, and Fij are prepared, compression is applied to each of the files. A compression utility such as, for example, gzip, bzip, bzip2, zlib, or zip compression utilities may be used to compress files Fi, Fj, and Fij. Also, in some embodiments, the list of file identifiers in files Fi, Fj, and Fij may be sorted to enable more accurate compression by the compression utility. In
step 1140, file Fi is compressed and the length of the result is represented as C. Instep 1145, file Fj is compressed and the length of the result is represented as Cj. Instep 1150, file Fij is compressed and the length of the result is represented as Cij. After compressing each of the files, normalized compression distance (NCDi,j) between Hi and Hj is computed instep 1155. - Normalized compression distance (NCD) is used for clustering and is based on an algorithm developed by Kolmogorov called normalized information distance (NID). NCD is discussed in detail in Rudi Cilibrasi's 2007 thesis entitled “Statistical Interference through Data Compression,” which may be found at http://www.illc.uva.nl/Publications/Dissertations/DS-2007-01.text.pdf and can be used to compute the distance between similar data. NCD may be computed using the following equation:
-
NCD i,j =[C ij−min{C i ,C j}]/max{C i ,C j} - Once NCDi,j has been computed, flow moves to step 1160 where ai,j is computed by the following equation: ai,j=1−NCDi,j. The value ai,j is then used to construct the similarity matrix by adding ai,j to row i, column j. After the similarity matrix has been updated in
step 1160, flow moves todecision box 1165 and a query is made as to whether j<n. If j<n, then additional entries in row i of the similarity matrix need to be computed (i.e., similarity distance has not been computed between host Hi and all of the hosts Hj (j=1 to n). In this case, flow moves to step 1170 where j is set to j+1. Flow then loops back to step 1120 where the inner loop offlow 1100 repeats and the similarity distance is computed between host Hi and the next host Hj with j=j+ 1. - With reference again to
decision box 1165, if j is not less than n (i.e., j≧n), then all of the entries in row i have been computed and flow moves todecision box 1175 where a query is made as to whether i<n. If i<n, then not all rows ofsimilarity matrix 1000 have been computed, and therefore, flow moves to step 1180 where i is set to i+1. Flow then loops back to step 1115 where j is set to 1 so that entries ai,j for the next row i (Hi, with i=i+1) can be generated byinner loop steps 1120 through 1170. With reference again todecision box 1175, if i is not less than n (i.e., i≧n) then entries for all of the rows i through n have been computed and, therefore, the similarity matrix has been completed and flow ends. - It will be apparent that
flow 1100 could be optimized in numerous ways. One optimization technique includes caching the lengths of compressed files Ci and Cj, which are used multiple times duringflow 1100 to calculateentries 1080 in the similarity matrix. In addition, the extracted lists of file identifiers Fi and Fj may also be cached for use duringflow 1100. It will also be noted that the matrix should be symmetric along the diagonal a1,1 through an,n. This symmetry could be used in the implementation of the system to compute only one-half of the matrix and then reflect the results over the diagonal. - Turning to 12,
FIG. 12 shows anexample similarity matrix 1200 generated by applying the compression method offlow 1100 ofFIG. 11 to host file inventories I1 through I5 of the example selected plurality ofhosts 600 ofFIG. 6 .FIG. 12 shows hosts 1 through 5 (H1 through H5) corresponding torows 1260 andcolumns 1270, forming a 5×5similarity matrix 1200.Entries 1280 ofsimilarity matrix 1200 include values from 0 to 1, inclusively. The closer the value is to 1, the closer the distance or greater the similarity of the corresponding hosts in row i, column j. For example, each entry inmatrix 1200 with the same host in the corresponding row and column, (e.g, a1,1, a2,2, a3,3, etc.) has a value of 1 because the hosts, and therefore the host file inventories, are identical. In contrast, each entry insimilarity matrix 1200, in which the corresponding hosts Hi and Hj have respective executable file inventories Ii and Ij with no common executable files, has a value of zero (e.g., a5,4, a4,5). - Applying the
compression method flow 1100 ofFIG. 11 to the example selected plurality ofhosts 600 ofFIG. 6 , in order to transform host file inventories intosimilarity matrix 1200, the following variables can be identified: -
- n=5 (hosts)
- H1 through H5=hosts in network (e.g., H1=
host 1, H2=host 2, etc.) - I1 through I5=host file inventories representing sets of
executable files 601 through 605, respectively
- Each of the host file inventories I1 through I5 includes a set of file identifiers representing one of the sets of
601, 602, 603, 604, and 605. Each executable file in a set of executable files is represented by a separate file identifier in the particular host file inventory. In this example scenario in which each file identifier includes a single token having a checksum configuration, the following host file inventories of hosts H1 through H5 could include file identifiers D1 through D8, which representexecutable files executable files 610 through 680, respectively: -
- I1→D1, D2, D3, D4
- I2→D1, D5, D6, D2
- I3→D1, D2, D3, D8
- I4→D1, D5, D6
- I5→D4, D7, D8
- In
step 1110, i is set to 1 and then the iterative looping begins to create an n×n (5×5)similarity matrix 1200 shown inFIG. 12 . In step 1115 j is set to 1 and flow passes tosteps 1120 through 1125 where the following variables can be determined: -
- Fi (F1)=D1D2D3D4 (i.e., list of file identifiers for Ii (I1))
- Fj (F1)=D1D2D3D4 (i.e., list of file identifiers for Ij (I1))
- Fij (F1F1)=D1D2D3D4D1D2D3D4 (i.e., concatenated files Fi (F1) and Fj (F1))
Flow then moves tosteps 1140 through 1150 where compression is applied to these files and the length of the compressed files is represented as follows: - Ci (C1)=length of compressed file Fi (F1)
- Cj (C1)=length of compressed file Fj (F1)
- Cij (C1C1)=length of compressed file Fij (F1F1)
For simplicity of explanation, example arbitrary values are provided in which each file identifier has a defined length of 1, such that C1=4 and C1C1=4. It will be apparent, however, that these values are provided for example purposes only and may not accurately reflect actual values produced by a compression utility. After compression has been applied to the files, NCDi,j is computed using the compressed values Ci, Cj, and Cij. In this example,
-
- After the NCD1,1 value is computed in
step 1155, flow moves to step 1160 and ai,j is computed: -
- The ‘1’ value is added to row i, column j (
row 1, column 1) ofsimilarity matrix 1200. Aftersimilarity matrix 1200 has been updated, flow moves todecision box 1165 where a query is made as to whether j<n. Since 1 is less than 5, flow moves to step 1170 where j is set to 2 (i.e., j=j+1). Flow then loops back to step 1120 to determine the similarity distance between Hi (H1) and the next host Hj (H2). In this case, after extraction and compression are performed, NCDi,j (NCD1,2) is computed as 0.75, because H1 and H2 have only one common file identifier D1 and, therefore, only onecommon executable file 601. Instep 1160, NCD1,2 is used to compute a1,2 as 0.25, which is added to row i, column j (row 1, column 2) ofsimilarity matrix 1200. The variable j is still less than 5, (i.e., 2<5) as determined indecision box 1165, so flow moves to step 1170 and j is set to 3 (i.e., j=j+1). This iterative processing continues for each value of j until j=5, thereby filling in each entry for H1 in row i (row 1) ofsimilarity matrix 1200. - After the last entry of row i (row 1) has been added to
similarity matrix 1200, flow moves todecision box 1165 where a query is made as to whether j<n (i.e., Is 5<5?). Because j is not less than 5, flow moves todecision box 1175 where a query is made as to whether i<n (i.e., Is 1<5?). Because 1 is less than 5, flow moves to step 1180 where i is set to 2 (i.e., i=i+1) and flow loops back to step 1115 where j is set to 1. The inner iterative loop then begins instep 1120 to determine the similarity distance between host file inventory Ii (I2) of host Hi (H2) and each host file inventory Ij (I1 through I5). Thus,rows 1160 are successively filled with similarity distance values ai,j until each row has been completed. - After the compression method of
flow 1100 has finished processing,similarity matrix 1200 can be provided as input to a similarity-based clustering procedure, as previously described herein with reference to clustering techniques used with a vector matrix. Information generated from the clustering procedure could be provided in numerous ways, as previously described herein with reference toFIG. 9 . In one example, the information could be provided in a proximity plot such asexample proximity plot 800 illustrated inFIG. 8 , which has been previously shown and described herein. - Software for achieving the operations outlined herein can be provided at various locations (e.g., the corporate IT headquarters, end user computers, distributed servers in the cloud, etc.). In other embodiments, this software could be received or downloaded from a web server (e.g., in the context of purchasing individual end-user licenses for separate networks, devices, servers, etc.) in order to provide this system for clustering host inventories. In one example implementation, this software is resident in one or more computers sought to be protected from a security attack (or protected from unwanted or unauthorized manipulations of data).
- In other examples, the software of the system for clustering host inventories in a computer network environment could involve a proprietary element (e.g., as part of a network security solution with McAfee® EPO software, McAfee® Application Control software, etc.), which could be provided in (or be proximate to) these identified elements, or be provided in any other device, server, network appliance, console, firewall, switch, information technology (IT) device, distributed server, etc., or be provided as a complementary solution (e.g., in conjunction with a firewall), or provisioned somewhere in the network.
- In certain example implementations, the clustering activities outlined herein may be implemented in software. This could be inclusive of software provided in central server 130 (e.g., via
administrative module 140, hostinventory preparation module 150 and clustering module 160) and hosts 110 (e.g., via host inventory feed 114). These elements and/or modules can cooperate with each other in order to perform clustering activities as discussed herein. In other embodiments, these features may be provided external to these elements, included in other devices to achieve these intended functionalities, or consolidated in any appropriate manner. For example, some of the processors associated with the various elements may be removed, or otherwise consolidated such that a single processor and a single memory location are responsible for certain activities. In a general sense, the arrangement depicted inFIG. 1 may be more logical in its representation, whereas a physical architecture may include various permutations/combinations/hybrids of these elements. - In various embodiments, all of these elements (e.g., hosts 110, central server 130) include software (or reciprocating software) that can coordinate, manage, or otherwise cooperate in order to achieve the clustering operations, as outlined herein. One or all of these elements may include any suitable algorithms, hardware, software, components, modules, interfaces, or objects that facilitate the operations thereof. In the implementation involving software, such a configuration may be inclusive of logic encoded in one or more tangible media (e.g., embedded logic provided in an application specific integrated circuit (ASIC), digital signal processor (DSP) instructions, software (potentially inclusive of object code and source code) to be executed by a processor, or other similar machine, etc.), which may be inclusive of non-transitory media. In some of these instances, one or more memory elements (as shown in
FIG. 2 ) can store data used for the operations described herein. This includes the memory element being able to store software, logic, code, or processor instructions that are executed to carry out the activities described in this Specification. A processor can execute any type of instructions associated with the data to achieve the operations detailed herein in this Specification. In one example, the processors (as shown inFIG. 2 ) could transform an element or an article (e.g., data) from one state or thing to another state or thing. In another example, the activities outlined herein may be implemented with fixed logic or programmable logic (e.g., software/computer instructions executed by a processor) and the elements identified herein could be some type of a programmable processor, programmable digital logic (e.g., a field programmable gate array (FPGA), an erasable programmable read only memory (EPROM), an electrically erasable programmable read only memory (EEPROM)) or an ASIC that includes digital logic, software, code, electronic instructions, or any suitable combination thereof. - Any of the memory items discussed herein should be construed as being encompassed within the broad term ‘memory element.’ Similarly, any of the potential processing elements, modules, and machines described in this Specification should be construed as being encompassed within the broad term ‘processor.’ Each of the computers, servers, and other devices may also include suitable interfaces for receiving, transmitting, and/or otherwise communicating data or information in a network environment.
- Note that with the examples provided herein, interaction may be described in terms of two, three, four, or more network components. However, this has been done for purposes of clarity and example only. It should be appreciated that the system can be consolidated in any suitable manner. Along similar design alternatives, any of the illustrated computers, modules, components, and elements of
FIG. 1 may be combined in various possible configurations, all of which are clearly within the broad scope of this Specification. In certain cases, it may be easier to describe one or more of the functionalities of a given set of flows by only referencing a limited number of components or network elements. Therefore, it should also be appreciated that the system ofFIG. 1 (and its teachings) is readily scalable. The system can accommodate a large number of components, as well as more complicated or sophisticated arrangements and configurations. Accordingly, the examples provided should not limit the scope or inhibit the broad teachings of the system as potentially applied to a myriad of other architectures. - It is also important to note that the operations described with reference to the preceding FIGURES illustrate only some of the possible scenarios that may be executed by, or within, the system. Some of these operations may be deleted or removed where appropriate, or these operations may be modified or changed considerably without departing from the scope of the discussed concepts. In addition, the timing of these operations may be altered considerably and still achieve the results taught in this disclosure. The preceding operational flows have been offered for purposes of example and discussion. Substantial flexibility is provided by the clustering system in that any suitable arrangements, chronologies, configurations, and timing mechanisms may be provided without departing from the teachings of the discussed concepts.
Claims (30)
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/880,125 US8549003B1 (en) | 2010-09-12 | 2010-09-12 | System and method for clustering host inventories |
| US14/016,497 US8843496B2 (en) | 2010-09-12 | 2013-09-03 | System and method for clustering host inventories |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/880,125 US8549003B1 (en) | 2010-09-12 | 2010-09-12 | System and method for clustering host inventories |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/016,497 Continuation US8843496B2 (en) | 2010-09-12 | 2013-09-03 | System and method for clustering host inventories |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20130246422A1 true US20130246422A1 (en) | 2013-09-19 |
| US8549003B1 US8549003B1 (en) | 2013-10-01 |
Family
ID=49158653
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/880,125 Active 2031-05-25 US8549003B1 (en) | 2010-09-12 | 2010-09-12 | System and method for clustering host inventories |
| US14/016,497 Expired - Fee Related US8843496B2 (en) | 2010-09-12 | 2013-09-03 | System and method for clustering host inventories |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US14/016,497 Expired - Fee Related US8843496B2 (en) | 2010-09-12 | 2013-09-03 | System and method for clustering host inventories |
Country Status (1)
| Country | Link |
|---|---|
| US (2) | US8549003B1 (en) |
Cited By (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120166317A1 (en) * | 2010-12-23 | 2012-06-28 | Bladelogic, Inc. | Auto-Suggesting IT Asset Groups Using Clustering Techniques |
| US20130117676A1 (en) * | 2011-11-04 | 2013-05-09 | International Business Machines Corporation | Visually analyzing, clustering, transforming and consolidating real and virtual machine images in a computing environment |
| US9305076B1 (en) * | 2012-06-28 | 2016-04-05 | Google Inc. | Flattening a cluster hierarchy tree to filter documents |
| US9519802B2 (en) * | 2014-05-07 | 2016-12-13 | American Express Travel Related Services Company, Inc. | Systems and methods for document and data protection |
| US20170109186A1 (en) * | 2015-10-20 | 2017-04-20 | International Business Machines Corporation | Server build optimization |
| US20170147602A1 (en) * | 2015-11-24 | 2017-05-25 | Red Hat, Inc. | Allocating file system metadata to storage nodes of distributed file system |
| CN109063198A (en) * | 2018-09-10 | 2018-12-21 | 浙江广播电视集团 | Melt the multidimensional visual search recommender system of media resource |
| US10389641B2 (en) * | 2015-03-31 | 2019-08-20 | British Telecommunications Public Limited Company | Network operation |
| US11240119B2 (en) | 2015-07-31 | 2022-02-01 | British Telecommunications Public Limited Company | Network operation |
| US11336534B2 (en) | 2015-03-31 | 2022-05-17 | British Telecommunications Public Limited Company | Network operation |
| US11973773B2 (en) * | 2020-05-15 | 2024-04-30 | Arbor Networks, Inc. | Detecting and mitigating zero-day attacks |
| US20240406197A1 (en) * | 2023-05-30 | 2024-12-05 | HYAS Infosec Inc. | Tiered remediation of outlier domain name system requests |
| US12549578B2 (en) * | 2023-05-30 | 2026-02-10 | HYAS Infosec Inc. | Tiered remediation of outlier domain name system requests |
Families Citing this family (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7856661B1 (en) | 2005-07-14 | 2010-12-21 | Mcafee, Inc. | Classification of software on networked systems |
| US7757269B1 (en) | 2006-02-02 | 2010-07-13 | Mcafee, Inc. | Enforcing alignment of approved changes and deployed changes in the software change life-cycle |
| US7895573B1 (en) | 2006-03-27 | 2011-02-22 | Mcafee, Inc. | Execution environment file inventory |
| US8332929B1 (en) | 2007-01-10 | 2012-12-11 | Mcafee, Inc. | Method and apparatus for process enforced configuration management |
| US9424154B2 (en) | 2007-01-10 | 2016-08-23 | Mcafee, Inc. | Method of and system for computer system state checks |
| US8381284B2 (en) | 2009-08-21 | 2013-02-19 | Mcafee, Inc. | System and method for enforcing security policies in a virtual environment |
| US8938800B2 (en) | 2010-07-28 | 2015-01-20 | Mcafee, Inc. | System and method for network level protection against malicious software |
| US8925101B2 (en) | 2010-07-28 | 2014-12-30 | Mcafee, Inc. | System and method for local protection against malicious software |
| US8549003B1 (en) | 2010-09-12 | 2013-10-01 | Mcafee, Inc. | System and method for clustering host inventories |
| US9075993B2 (en) | 2011-01-24 | 2015-07-07 | Mcafee, Inc. | System and method for selectively grouping and managing program files |
| US9112830B2 (en) | 2011-02-23 | 2015-08-18 | Mcafee, Inc. | System and method for interlocking a host and a gateway |
| US9594881B2 (en) | 2011-09-09 | 2017-03-14 | Mcafee, Inc. | System and method for passive threat detection using virtual memory inspection |
| US8973144B2 (en) | 2011-10-13 | 2015-03-03 | Mcafee, Inc. | System and method for kernel rootkit protection in a hypervisor environment |
| US9069586B2 (en) | 2011-10-13 | 2015-06-30 | Mcafee, Inc. | System and method for kernel rootkit protection in a hypervisor environment |
| US8713668B2 (en) | 2011-10-17 | 2014-04-29 | Mcafee, Inc. | System and method for redirected firewall discovery in a network environment |
| US8800024B2 (en) | 2011-10-17 | 2014-08-05 | Mcafee, Inc. | System and method for host-initiated firewall discovery in a network environment |
| US8739272B1 (en) | 2012-04-02 | 2014-05-27 | Mcafee, Inc. | System and method for interlocking a host and a gateway |
| US8973146B2 (en) | 2012-12-27 | 2015-03-03 | Mcafee, Inc. | Herd based scan avoidance system in a network environment |
| EP3061030A4 (en) | 2013-10-24 | 2017-04-19 | McAfee, Inc. | Agent assisted malicious application blocking in a network environment |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080010304A1 (en) * | 2006-03-29 | 2008-01-10 | Santosh Vempala | Techniques for clustering a set of objects |
| US7809704B2 (en) * | 2006-06-15 | 2010-10-05 | Microsoft Corporation | Combining spectral and probabilistic clustering |
Family Cites Families (247)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4982430A (en) | 1985-04-24 | 1991-01-01 | General Instrument Corporation | Bootstrap channel security arrangement for communication network |
| US4688169A (en) | 1985-05-30 | 1987-08-18 | Joshi Bhagirath S | Computer software security system |
| US5155847A (en) | 1988-08-03 | 1992-10-13 | Minicom Data Corporation | Method and apparatus for updating software at remote locations |
| US5560008A (en) | 1989-05-15 | 1996-09-24 | International Business Machines Corporation | Remote authentication and authorization in a distributed data processing system |
| CA2010591C (en) | 1989-10-20 | 1999-01-26 | Phillip M. Adams | Kernels, description tables and device drivers |
| US5222134A (en) | 1990-11-07 | 1993-06-22 | Tau Systems Corporation | Secure system for activating personal computer software at remote locations |
| US5390314A (en) | 1992-10-09 | 1995-02-14 | American Airlines, Inc. | Method and apparatus for developing scripts that access mainframe resources that can be executed on various computer systems having different interface languages without modification |
| US5339261A (en) | 1992-10-22 | 1994-08-16 | Base 10 Systems, Inc. | System for operating application software in a safety critical environment |
| US5584009A (en) | 1993-10-18 | 1996-12-10 | Cyrix Corporation | System and method of retiring store data from a write buffer |
| JP3042341B2 (en) | 1994-11-30 | 2000-05-15 | 日本電気株式会社 | Local I/O control method for a cluster-connected multiprocessor system |
| US6282712B1 (en) | 1995-03-10 | 2001-08-28 | Microsoft Corporation | Automatic software installation on heterogeneous networked computer systems |
| US5699513A (en) | 1995-03-31 | 1997-12-16 | Motorola, Inc. | Method for secure network access via message intercept |
| US5787427A (en) | 1996-01-03 | 1998-07-28 | International Business Machines Corporation | Information handling system, method, and article of manufacture for efficient object security processing by grouping objects sharing common control access policies |
| US5842017A (en) | 1996-01-29 | 1998-11-24 | Digital Equipment Corporation | Method and apparatus for forming a translation unit |
| US5907709A (en) | 1996-02-08 | 1999-05-25 | Inprise Corporation | Development system with methods for detecting invalid use and management of resources and memory at runtime |
| US5907708A (en) | 1996-06-03 | 1999-05-25 | Sun Microsystems, Inc. | System and method for facilitating avoidance of an exception of a predetermined type in a digital computer system by providing fix-up code for an instruction in response to detection of an exception condition resulting from execution thereof |
| US5787177A (en) | 1996-08-01 | 1998-07-28 | Harris Corporation | Integrated network security access control system |
| US5926832A (en) | 1996-09-26 | 1999-07-20 | Transmeta Corporation | Method and apparatus for aliasing memory data in an advanced microprocessor |
| US5991881A (en) | 1996-11-08 | 1999-11-23 | Harris Corporation | Network surveillance system |
| US5987611A (en) | 1996-12-31 | 1999-11-16 | Zone Labs, Inc. | System and methodology for managing internet access on a per application basis for client computers connected to the internet |
| US6141698A (en) | 1997-01-29 | 2000-10-31 | Network Commerce Inc. | Method and system for injecting new code into existing application code |
| US6587877B1 (en) | 1997-03-25 | 2003-07-01 | Lucent Technologies Inc. | Management of time and expense when communicating between a host and a communication network |
| US6192475B1 (en) | 1997-03-31 | 2001-02-20 | David R. Wallace | System and method for cloaking software |
| US6167522A (en) | 1997-04-01 | 2000-12-26 | Sun Microsystems, Inc. | Method and apparatus for providing security for servers executing application programs received via a network |
| US6356957B2 (en) | 1997-04-03 | 2002-03-12 | Hewlett-Packard Company | Method for emulating native object oriented foundation classes on a target object oriented programming system using a template library |
| US6073142A (en) | 1997-06-23 | 2000-06-06 | Park City Group | Automated post office based rule analysis of e-mail messages and other data objects for controlled distribution in network environments |
| US6275938B1 (en) | 1997-08-28 | 2001-08-14 | Microsoft Corporation | Security enhancement for untrusted executable code |
| US6192401B1 (en) | 1997-10-21 | 2001-02-20 | Sun Microsystems, Inc. | System and method for determining cluster membership in a heterogeneous distributed system |
| US6393465B2 (en) | 1997-11-25 | 2002-05-21 | Nixmail Corporation | Junk electronic mail detector and eliminator |
| US5987610A (en) | 1998-02-12 | 1999-11-16 | Ameritech Corporation | Computer virus screening methods and systems |
| WO1999057654A1 (en) | 1998-05-06 | 1999-11-11 | Matsushita Electric Industrial Co., Ltd. | Method and system for digital data transmission/reception |
| US6795966B1 (en) | 1998-05-15 | 2004-09-21 | Vmware, Inc. | Mechanism for restoring, porting, replicating and checkpointing computer systems using state extraction |
| US6442686B1 (en) | 1998-07-02 | 2002-08-27 | Networks Associates Technology, Inc. | System and methodology for messaging server-based management and enforcement of crypto policies |
| US6338149B1 (en) | 1998-07-31 | 2002-01-08 | Westinghouse Electric Company Llc | Change monitoring system for a computer system |
| US6546425B1 (en) | 1998-10-09 | 2003-04-08 | Netmotion Wireless, Inc. | Method and apparatus for providing mobile and other intermittent connectivity in a computing environment |
| JP3753873B2 (en) | 1998-11-11 | 2006-03-08 | 株式会社島津製作所 | Spectrophotometer |
| US6969352B2 (en) | 1999-06-22 | 2005-11-29 | Teratech Corporation | Ultrasound probe with integrated electronics |
| US6453468B1 (en) | 1999-06-30 | 2002-09-17 | B-Hub, Inc. | Methods for improving reliability while upgrading software programs in a clustered computer system |
| US6567857B1 (en) | 1999-07-29 | 2003-05-20 | Sun Microsystems, Inc. | Method and apparatus for dynamic proxy insertion in network traffic flow |
| US6256773B1 (en) | 1999-08-31 | 2001-07-03 | Accenture Llp | System, method and article of manufacture for configuration management in a development architecture framework |
| US6990591B1 (en) | 1999-11-18 | 2006-01-24 | Secureworks, Inc. | Method and system for remotely configuring and monitoring a communication device |
| US6321267B1 (en) | 1999-11-23 | 2001-11-20 | Escom Corporation | Method and apparatus for filtering junk email |
| US6662219B1 (en) | 1999-12-15 | 2003-12-09 | Microsoft Corporation | System for determining at subgroup of nodes relative weight to represent cluster by obtaining exclusive possession of quorum resource |
| US6460050B1 (en) | 1999-12-22 | 2002-10-01 | Mark Raymond Pace | Distributed content identification system |
| US6769008B1 (en) | 2000-01-10 | 2004-07-27 | Sun Microsystems, Inc. | Method and apparatus for dynamically altering configurations of clustered computer systems |
| WO2001069439A1 (en) | 2000-03-17 | 2001-09-20 | Filesx Ltd. | Accelerating responses to requests made by users to an internet |
| US6748534B1 (en) | 2000-03-31 | 2004-06-08 | Networks Associates, Inc. | System and method for partitioned distributed scanning of a large dataset for viruses and other malware |
| CA2305078A1 (en) | 2000-04-12 | 2001-10-12 | Cloakware Corporation | Tamper resistant software - mass data encoding |
| US7325127B2 (en) | 2000-04-25 | 2008-01-29 | Secure Data In Motion, Inc. | Security server system |
| WO2001084285A2 (en) | 2000-04-28 | 2001-11-08 | Internet Security Systems, Inc. | Method and system for managing computer security information |
| US6769115B1 (en) | 2000-05-01 | 2004-07-27 | Emc Corporation | Adaptive interface for a software development environment |
| US6847993B1 (en) | 2000-05-31 | 2005-01-25 | International Business Machines Corporation | Method, system and program products for managing cluster configurations |
| US6934755B1 (en) | 2000-06-02 | 2005-08-23 | Sun Microsystems, Inc. | System and method for migrating processes on a network |
| US6611925B1 (en) | 2000-06-13 | 2003-08-26 | Networks Associates Technology, Inc. | Single point of entry/origination item scanning within an enterprise or workgroup |
| US6901519B1 (en) | 2000-06-22 | 2005-05-31 | Infobahn, Inc. | E-mail virus protection system and method |
| US8204999B2 (en) | 2000-07-10 | 2012-06-19 | Oracle International Corporation | Query string processing |
| US7093239B1 (en) | 2000-07-14 | 2006-08-15 | Internet Security Systems, Inc. | Computer immune system and method for detecting unwanted code in a computer system |
| US7350204B2 (en) | 2000-07-24 | 2008-03-25 | Microsoft Corporation | Policies for secure software execution |
| WO2002013437A2 (en) | 2000-08-04 | 2002-02-14 | Xtradyne Technologies Ag | Method and system for session based authorization and access control for networked application objects |
| US7707305B2 (en) | 2000-10-17 | 2010-04-27 | Cisco Technology, Inc. | Methods and apparatus for protecting against overload conditions on nodes of a distributed network |
| US7146305B2 (en) | 2000-10-24 | 2006-12-05 | Vcis, Inc. | Analytical virtual machine |
| US7606898B1 (en) | 2000-10-24 | 2009-10-20 | Microsoft Corporation | System and method for distributed management of shared computers |
| US6930985B1 (en) | 2000-10-26 | 2005-08-16 | Extreme Networks, Inc. | Method and apparatus for management of configuration in a network |
| US6834301B1 (en) | 2000-11-08 | 2004-12-21 | Networks Associates Technology, Inc. | System and method for configuration, management, and monitoring of a computer network using inheritance |
| US6766334B1 (en) | 2000-11-21 | 2004-07-20 | Microsoft Corporation | Project-based configuration management method and apparatus |
| US20020069367A1 (en) | 2000-12-06 | 2002-06-06 | Glen Tindal | Network operating system data directory |
| US6907600B2 (en) | 2000-12-27 | 2005-06-14 | Intel Corporation | Virtual translation lookaside buffer |
| JP2002244898A (en) | 2001-02-19 | 2002-08-30 | Hitachi Ltd | Database management program and database system |
| US6918110B2 (en) | 2001-04-11 | 2005-07-12 | Hewlett-Packard Development Company, L.P. | Dynamic instrumentation of an executable program by means of causing a breakpoint at the entry point of a function and providing instrumentation code |
| US6715050B2 (en) | 2001-05-31 | 2004-03-30 | Oracle International Corporation | Storage access keys |
| US6988101B2 (en) | 2001-05-31 | 2006-01-17 | International Business Machines Corporation | Method, system, and computer program product for providing an extensible file system for accessing a foreign file system from a local data processing system |
| US6988124B2 (en) | 2001-06-06 | 2006-01-17 | Microsoft Corporation | Locating potentially identical objects across multiple computers based on stochastic partitioning of workload |
| US7290266B2 (en) | 2001-06-14 | 2007-10-30 | Cisco Technology, Inc. | Access control by a real-time stateful reference monitor with a state collection training mode and a lockdown mode for detecting predetermined patterns of events indicative of requests for operating system resources resulting in a decision to allow or block activity identified in a sequence of events based on a rule set defining a processing policy |
| US7065767B2 (en) | 2001-06-29 | 2006-06-20 | Intel Corporation | Managed hosting server auditing and change tracking |
| US7069330B1 (en) | 2001-07-05 | 2006-06-27 | Mcafee, Inc. | Control of interaction between client computer applications and network resources |
| US20030023736A1 (en) | 2001-07-12 | 2003-01-30 | Kurt Abkemeier | Method and system for filtering messages |
| US20030014667A1 (en) | 2001-07-16 | 2003-01-16 | Andrei Kolichtchak | Buffer overflow attack detection and suppression |
| US6877088B2 (en) | 2001-08-08 | 2005-04-05 | Sun Microsystems, Inc. | Methods and apparatus for controlling speculative execution of instructions based on a multiaccess memory condition |
| US7007302B1 (en) | 2001-08-31 | 2006-02-28 | Mcafee, Inc. | Efficient management and blocking of malicious code and hacking attempts in a network environment |
| US7010796B1 (en) | 2001-09-28 | 2006-03-07 | Emc Corporation | Methods and apparatus providing remote operation of an application programming interface |
| US7177267B2 (en) | 2001-11-09 | 2007-02-13 | Adc Dsl Systems, Inc. | Hardware monitoring and configuration management |
| US7346781B2 (en) | 2001-12-06 | 2008-03-18 | Mcafee, Inc. | Initiating execution of a computer program from an encrypted version of a computer program |
| US7159036B2 (en) | 2001-12-10 | 2007-01-02 | Mcafee, Inc. | Updating data from a source computer to groups of destination computers |
| US7039949B2 (en) | 2001-12-10 | 2006-05-02 | Brian Ross Cartmell | Method and system for blocking unwanted communications |
| US10033700B2 (en) | 2001-12-12 | 2018-07-24 | Intellectual Ventures I Llc | Dynamic evaluation of access rights |
| CA2469633C (en) | 2001-12-13 | 2011-06-14 | Japan Science And Technology Agency | Software safety execution system |
| US7398389B2 (en) | 2001-12-20 | 2008-07-08 | Coretrace Corporation | Kernel-based network security infrastructure |
| JP3906356B2 (en) | 2001-12-27 | 2007-04-18 | 独立行政法人情報通信研究機構 | Syntax analysis method and apparatus |
| US7743415B2 (en) | 2002-01-31 | 2010-06-22 | Riverbed Technology, Inc. | Denial of service attacks characterization |
| US20030167399A1 (en) | 2002-03-01 | 2003-09-04 | Yves Audebert | Method and system for performing post issuance configuration and data changes to a personal security device using a communications pipe |
| US6941449B2 (en) | 2002-03-04 | 2005-09-06 | Hewlett-Packard Development Company, L.P. | Method and apparatus for performing critical tasks using speculative operations |
| US7600021B2 (en) | 2002-04-03 | 2009-10-06 | Microsoft Corporation | Delta replication of source files and packages across networked resources |
| US20070253430A1 (en) | 2002-04-23 | 2007-11-01 | Minami John S | Gigabit Ethernet Adapter |
| US7370360B2 (en) | 2002-05-13 | 2008-05-06 | International Business Machines Corporation | Computer immune system and method for detecting unwanted code in a P-code or partially compiled native-code program executing within a virtual machine |
| US7823148B2 (en) | 2002-05-22 | 2010-10-26 | Oracle America, Inc. | System and method for performing patch installation via a graphical user interface |
| US20030221190A1 (en) | 2002-05-22 | 2003-11-27 | Sun Microsystems, Inc. | System and method for performing patch installation on multiple devices |
| US7024404B1 (en) | 2002-05-28 | 2006-04-04 | The State University Rutgers | Retrieval and display of data objects using a cross-group ranking metric |
| US7512977B2 (en) | 2003-06-11 | 2009-03-31 | Symantec Corporation | Intrustion protection system utilizing layers |
| US7823203B2 (en) | 2002-06-17 | 2010-10-26 | At&T Intellectual Property Ii, L.P. | Method and device for detecting computer network intrusions |
| US7139916B2 (en) | 2002-06-28 | 2006-11-21 | Ebay, Inc. | Method and system for monitoring user interaction with a computer |
| US8924484B2 (en) | 2002-07-16 | 2014-12-30 | Sonicwall, Inc. | Active e-mail filter with challenge-response |
| US7522906B2 (en) | 2002-08-09 | 2009-04-21 | Wavelink Corporation | Mobile unit configuration management for WLANs |
| US7624347B2 (en) | 2002-09-17 | 2009-11-24 | At&T Intellectual Property I, L.P. | System and method for forwarding full header information in email messages |
| US7546333B2 (en) | 2002-10-23 | 2009-06-09 | Netapp, Inc. | Methods and systems for predictive change management for access paths in networks |
| US7353501B2 (en) | 2002-11-18 | 2008-04-01 | Microsoft Corporation | Generic wrapper scheme |
| US7865931B1 (en) | 2002-11-25 | 2011-01-04 | Accenture Global Services Limited | Universal authorization and access control security measure for applications |
| US20040143749A1 (en) | 2003-01-16 | 2004-07-22 | Platformlogic, Inc. | Behavior-based host-based intrusion prevention system |
| US20040167906A1 (en) | 2003-02-25 | 2004-08-26 | Smith Randolph C. | System consolidation tool and method for patching multiple servers |
| US7024548B1 (en) | 2003-03-10 | 2006-04-04 | Cisco Technology, Inc. | Methods and apparatus for auditing and tracking changes to an existing configuration of a computerized device |
| US7529754B2 (en) | 2003-03-14 | 2009-05-05 | Websense, Inc. | System and method of monitoring and controlling application files |
| US20060133223A1 (en) | 2003-03-28 | 2006-06-22 | Matsusuhita Electric Industrial Co., Ltd. | Recording medium, recording device usint the same, and reproduction device |
| US7607010B2 (en) | 2003-04-12 | 2009-10-20 | Deep Nines, Inc. | System and method for network edge data protection |
| US20050108516A1 (en) | 2003-04-17 | 2005-05-19 | Robert Balzer | By-pass and tampering protection for application wrappers |
| US20040230963A1 (en) | 2003-05-12 | 2004-11-18 | Rothman Michael A. | Method for updating firmware in an operating system agnostic manner |
| DE10324189A1 (en) | 2003-05-28 | 2004-12-16 | Robert Bosch Gmbh | Method for controlling access to a resource of an application in a data processing device |
| US7657599B2 (en) | 2003-05-29 | 2010-02-02 | Mindshare Design, Inc. | Systems and methods for automatically updating electronic mail access lists |
| US20050108562A1 (en) | 2003-06-18 | 2005-05-19 | Khazan Roger I. | Technique for detecting executable malicious code using a combination of static and dynamic analyses |
| US7283517B2 (en) | 2003-07-22 | 2007-10-16 | Innomedia Pte | Stand alone multi-media terminal adapter with network address translation and port partitioning |
| US7886093B1 (en) | 2003-07-31 | 2011-02-08 | Hewlett-Packard Development Company, L.P. | Electronic device network supporting compression and decompression in electronic devices |
| US7464408B1 (en) | 2003-08-29 | 2008-12-09 | Solidcore Systems, Inc. | Damage containment by translation |
| US8539063B1 (en) | 2003-08-29 | 2013-09-17 | Mcafee, Inc. | Method and system for containment of networked application client software by explicit human input |
| US7356703B2 (en) | 2003-09-16 | 2008-04-08 | At&T Delaware Intellectual Property, Inc. | Time-based computer access controls |
| US20050114672A1 (en) | 2003-11-20 | 2005-05-26 | Encryptx Corporation | Data rights management of digital information in a portable software permission wrapper |
| US7600219B2 (en) | 2003-12-10 | 2009-10-06 | Sap Ag | Method and system to monitor software interface updates and assess backward compatibility |
| US7546594B2 (en) | 2003-12-15 | 2009-06-09 | Microsoft Corporation | System and method for updating installation components using an installation component delta patch in a networked environment |
| US7840968B1 (en) | 2003-12-17 | 2010-11-23 | Mcafee, Inc. | Method and system for containment of usage of language interfaces |
| US7272654B1 (en) | 2004-03-04 | 2007-09-18 | Sandbox Networks, Inc. | Virtualizing network-attached-storage (NAS) with a compact table that stores lossy hashes of file names and parent handles rather than full names |
| US7783735B1 (en) | 2004-03-22 | 2010-08-24 | Mcafee, Inc. | Containment of network communication |
| WO2005099340A2 (en) | 2004-04-19 | 2005-10-27 | Securewave S.A. | On-line centralized and local authorization of executable files |
| US20060004875A1 (en) | 2004-05-11 | 2006-01-05 | Microsoft Corporation | CMDB schema |
| US7890946B2 (en) | 2004-05-11 | 2011-02-15 | Microsoft Corporation | Efficient patching |
| EP1762114B1 (en) | 2004-05-24 | 2015-11-04 | Google, Inc. | Location based access control in a wireless network |
| US7818377B2 (en) | 2004-05-24 | 2010-10-19 | Microsoft Corporation | Extended message rule architecture |
| US7506170B2 (en) | 2004-05-28 | 2009-03-17 | Microsoft Corporation | Method for secure access to multiple secure networks |
| US20050273858A1 (en) | 2004-06-07 | 2005-12-08 | Erez Zadok | Stackable file systems and methods thereof |
| JP4341517B2 (en) | 2004-06-21 | 2009-10-07 | 日本電気株式会社 | Security policy management system, security policy management method and program |
| US20050289538A1 (en) | 2004-06-23 | 2005-12-29 | International Business Machines Corporation | Deploying an application software on a virtual deployment target |
| US7203864B2 (en) | 2004-06-25 | 2007-04-10 | Hewlett-Packard Development Company, L.P. | Method and system for clustering computers into peer groups and comparing individual computers to their peers |
| US7908653B2 (en) | 2004-06-29 | 2011-03-15 | Intel Corporation | Method of improving computer security through sandboxing |
| US20060015501A1 (en) | 2004-07-19 | 2006-01-19 | International Business Machines Corporation | System, method and program product to determine a time interval at which to check conditions to permit access to a file |
| US7937455B2 (en) | 2004-07-28 | 2011-05-03 | Oracle International Corporation | Methods and systems for modifying nodes in a cluster environment |
| US7703090B2 (en) | 2004-08-31 | 2010-04-20 | Microsoft Corporation | Patch un-installation |
| US7873955B1 (en) | 2004-09-07 | 2011-01-18 | Mcafee, Inc. | Solidifying the executable software set of a computer |
| US7506364B2 (en) | 2004-10-01 | 2009-03-17 | Microsoft Corporation | Integrated access authorization |
| US20060080656A1 (en) | 2004-10-12 | 2006-04-13 | Microsoft Corporation | Methods and instructions for patch management |
| US9329905B2 (en) | 2004-10-15 | 2016-05-03 | Emc Corporation | Method and apparatus for configuring, monitoring and/or managing resource groups including a virtual machine |
| US7765538B2 (en) | 2004-10-29 | 2010-07-27 | Hewlett-Packard Development Company, L.P. | Method and apparatus for determining which program patches to recommend for installation |
| US20060101277A1 (en) | 2004-11-10 | 2006-05-11 | Meenan Patrick A | Detecting and remedying unauthorized computer programs |
| US7698744B2 (en) | 2004-12-03 | 2010-04-13 | Whitecell Software Inc. | Secure system for allowing the execution of authorized computer program code |
| US8479193B2 (en) | 2004-12-17 | 2013-07-02 | Intel Corporation | Method, apparatus and system for enhancing the usability of virtual machines |
| US7765544B2 (en) | 2004-12-17 | 2010-07-27 | Intel Corporation | Method, apparatus and system for improving security in a virtual machine host |
| US7607170B2 (en) | 2004-12-22 | 2009-10-20 | Radware Ltd. | Stateful attack protection |
| US7302558B2 (en) | 2005-01-25 | 2007-11-27 | Goldman Sachs & Co. | Systems and methods to facilitate the creation and configuration management of computing systems |
| US20130247027A1 (en) | 2005-02-16 | 2013-09-19 | Solidcore Systems, Inc. | Distribution and installation of solidified software on a computer |
| US8056138B2 (en) | 2005-02-26 | 2011-11-08 | International Business Machines Corporation | System, method, and service for detecting improper manipulation of an application |
| US7836504B2 (en) | 2005-03-01 | 2010-11-16 | Microsoft Corporation | On-access scan of memory for malware |
| US7685635B2 (en) | 2005-03-11 | 2010-03-23 | Microsoft Corporation | Systems and methods for multi-level intercept processing in a virtual machine environment |
| TW200707417A (en) | 2005-03-18 | 2007-02-16 | Sony Corp | Reproducing apparatus, reproducing method, program, program storage medium, data delivery system, data structure, and manufacturing method of recording medium |
| US7552479B1 (en) | 2005-03-22 | 2009-06-23 | Symantec Corporation | Detecting shellcode that modifies IAT entries |
| US7770151B2 (en) | 2005-04-07 | 2010-08-03 | International Business Machines Corporation | Automatic generation of solution deployment descriptors |
| US7349931B2 (en) | 2005-04-14 | 2008-03-25 | Webroot Software, Inc. | System and method for scanning obfuscated files for pestware |
| US8590044B2 (en) | 2005-04-14 | 2013-11-19 | International Business Machines Corporation | Selective virus scanning system and method |
| US7603552B1 (en) | 2005-05-04 | 2009-10-13 | Mcafee, Inc. | Piracy prevention using unique module translation |
| US7363463B2 (en) | 2005-05-13 | 2008-04-22 | Microsoft Corporation | Method and system for caching address translations from multiple address spaces in virtual machines |
| WO2006137057A2 (en) | 2005-06-21 | 2006-12-28 | Onigma Ltd. | A method and a system for providing comprehensive protection against leakage of sensitive information assets using host based agents, content- meta-data and rules-based policies |
| US8839450B2 (en) | 2007-08-02 | 2014-09-16 | Intel Corporation | Secure vault service for software components within an execution environment |
| US7739721B2 (en) | 2005-07-11 | 2010-06-15 | Microsoft Corporation | Per-user and system granular audit policy implementation |
| US7856661B1 (en) | 2005-07-14 | 2010-12-21 | Mcafee, Inc. | Classification of software on networked systems |
| US7895651B2 (en) | 2005-07-29 | 2011-02-22 | Bit 9, Inc. | Content tracking in a network security system |
| US7962616B2 (en) | 2005-08-11 | 2011-06-14 | Micro Focus (Us), Inc. | Real-time activity monitoring and reporting |
| US7340574B2 (en) | 2005-08-30 | 2008-03-04 | Rockwell Automation Technologies, Inc. | Method and apparatus for synchronizing an industrial controller with a redundant controller |
| US8327353B2 (en) | 2005-08-30 | 2012-12-04 | Microsoft Corporation | Hierarchical virtualization with a multi-level virtualization mechanism |
| US20070074199A1 (en) | 2005-09-27 | 2007-03-29 | Sebastian Schoenberg | Method and apparatus for delivering microcode updates through virtual machine operations |
| US8131825B2 (en) | 2005-10-07 | 2012-03-06 | Citrix Systems, Inc. | Method and a system for responding locally to requests for file metadata associated with files stored remotely |
| US7725737B2 (en) | 2005-10-14 | 2010-05-25 | Check Point Software Technologies, Inc. | System and methodology providing secure workspace environment |
| US20070169079A1 (en) | 2005-11-08 | 2007-07-19 | Microsoft Corporation | Software update management |
| US7836303B2 (en) | 2005-12-09 | 2010-11-16 | University Of Washington | Web browser operating system |
| US7856538B2 (en) | 2005-12-12 | 2010-12-21 | Systex, Inc. | Methods, systems and computer readable medium for detecting memory overflow conditions |
| US20070143851A1 (en) | 2005-12-21 | 2007-06-21 | Fiberlink | Method and systems for controlling access to computing resources based on known security vulnerabilities |
| US20070174429A1 (en) | 2006-01-24 | 2007-07-26 | Citrix Systems, Inc. | Methods and servers for establishing a connection between a client system and a virtual machine hosting a requested computing environment |
| US7757269B1 (en) | 2006-02-02 | 2010-07-13 | Mcafee, Inc. | Enforcing alignment of approved changes and deployed changes in the software change life-cycle |
| WO2007099273A1 (en) | 2006-03-03 | 2007-09-07 | Arm Limited | Monitoring values of signals within an integrated circuit |
| US8621433B2 (en) | 2006-03-20 | 2013-12-31 | Microsoft Corporation | Managing version information for software components |
| US7895573B1 (en) | 2006-03-27 | 2011-02-22 | Mcafee, Inc. | Execution environment file inventory |
| US7870387B1 (en) | 2006-04-07 | 2011-01-11 | Mcafee, Inc. | Program-based authorization |
| US8015563B2 (en) | 2006-04-14 | 2011-09-06 | Microsoft Corporation | Managing virtual machines with system-wide policies |
| US7966659B1 (en) | 2006-04-18 | 2011-06-21 | Rockwell Automation Technologies, Inc. | Distributed learn mode for configuring a firewall, security authority, intrusion detection/prevention devices, and the like |
| US8352930B1 (en) | 2006-04-24 | 2013-01-08 | Mcafee, Inc. | Software modification by group to minimize breakage |
| US8458673B2 (en) | 2006-04-26 | 2013-06-04 | Flexera Software Llc | Computer-implemented method and system for binding digital rights management executable code to a software application |
| US7849502B1 (en) | 2006-04-29 | 2010-12-07 | Ironport Systems, Inc. | Apparatus for monitoring network traffic |
| US8555404B1 (en) | 2006-05-18 | 2013-10-08 | Mcafee, Inc. | Connectivity-based authorization |
| US8291409B2 (en) | 2006-05-22 | 2012-10-16 | Microsoft Corporation | Updating virtual machine with patch on host that does not have network access |
| US7937334B2 (en) * | 2006-05-31 | 2011-05-03 | Lockheed Martin Corporation | System and method for defining normal operating regions and identifying anomalous behavior of units within a fleet, operating in a complex, dynamic environment |
| US7761912B2 (en) | 2006-06-06 | 2010-07-20 | Microsoft Corporation | Reputation driven firewall |
| US20070300215A1 (en) | 2006-06-26 | 2007-12-27 | Bardsley Jeffrey S | Methods, systems, and computer program products for obtaining and utilizing a score indicative of an overall performance effect of a software update on a software host |
| US8365294B2 (en) | 2006-06-30 | 2013-01-29 | Intel Corporation | Hardware platform authentication and multi-platform validation |
| US8468526B2 (en) | 2006-06-30 | 2013-06-18 | Intel Corporation | Concurrent thread execution using user-level asynchronous signaling |
| US8572721B2 (en) | 2006-08-03 | 2013-10-29 | Citrix Systems, Inc. | Methods and systems for routing packets in a VPN-client-to-VPN-client connection via an SSL/VPN network appliance |
| US8015388B1 (en) | 2006-08-04 | 2011-09-06 | Vmware, Inc. | Bypassing guest page table walk for shadow page table entries not present in guest page table |
| US8161475B2 (en) | 2006-09-29 | 2012-04-17 | Microsoft Corporation | Automatic load and balancing for virtual machines to meet resource requirements |
| US9697019B1 (en) | 2006-10-17 | 2017-07-04 | Manageiq, Inc. | Adapt a virtual machine to comply with system enforced policies and derive an optimized variant of the adapted virtual machine |
| US7689817B2 (en) | 2006-11-16 | 2010-03-30 | Intel Corporation | Methods and apparatus for defeating malware |
| US8091127B2 (en) | 2006-12-11 | 2012-01-03 | International Business Machines Corporation | Heuristic malware detection |
| US8336046B2 (en) | 2006-12-29 | 2012-12-18 | Intel Corporation | Dynamic VM cloning on request from application based on mapping of virtual hardware configuration to the identified physical hardware resources |
| US7996836B1 (en) | 2006-12-29 | 2011-08-09 | Symantec Corporation | Using a hypervisor to provide computer security |
| US8381209B2 (en) | 2007-01-03 | 2013-02-19 | International Business Machines Corporation | Moveable access control list (ACL) mechanisms for hypervisors and virtual machines and virtual port firewalls |
| US8254568B2 (en) | 2007-01-07 | 2012-08-28 | Apple Inc. | Secure booting a computing device |
| US9424154B2 (en) | 2007-01-10 | 2016-08-23 | Mcafee, Inc. | Method of and system for computer system state checks |
| US8332929B1 (en) | 2007-01-10 | 2012-12-11 | Mcafee, Inc. | Method and apparatus for process enforced configuration management |
| US8380987B2 (en) | 2007-01-25 | 2013-02-19 | Microsoft Corporation | Protection agents and privilege modes |
| US8276201B2 (en) | 2007-03-22 | 2012-09-25 | International Business Machines Corporation | Integrity protection in data processing systems |
| US7930327B2 (en) | 2007-05-21 | 2011-04-19 | International Business Machines Corporation | Method and apparatus for obtaining the absolute path name of an open file system object from its file descriptor |
| US20080301770A1 (en) | 2007-05-31 | 2008-12-04 | Kinder Nathan G | Identity based virtual machine selector |
| US20090007100A1 (en) | 2007-06-28 | 2009-01-01 | Microsoft Corporation | Suspending a Running Operating System to Enable Security Scanning |
| US8763115B2 (en) | 2007-08-08 | 2014-06-24 | Vmware, Inc. | Impeding progress of malicious guest software |
| US8332375B2 (en) | 2007-08-29 | 2012-12-11 | Nirvanix, Inc. | Method and system for moving requested files from one storage location to another |
| US8250641B2 (en) | 2007-09-17 | 2012-08-21 | Intel Corporation | Method and apparatus for dynamic switching and real time security control on virtualized systems |
| US20090113111A1 (en) | 2007-10-30 | 2009-04-30 | Vmware, Inc. | Secure identification of execution contexts |
| US8195931B1 (en) | 2007-10-31 | 2012-06-05 | Mcafee, Inc. | Application change control |
| JP5238235B2 (en) | 2007-12-07 | 2013-07-17 | 株式会社日立製作所 | Management apparatus and management method |
| US8515075B1 (en) | 2008-01-31 | 2013-08-20 | Mcafee, Inc. | Method of and system for malicious software detection using critical address space protection |
| US8336094B2 (en) | 2008-03-27 | 2012-12-18 | Juniper Networks, Inc. | Hierarchical firewalls |
| US8321931B2 (en) | 2008-03-31 | 2012-11-27 | Intel Corporation | Method and apparatus for sequential hypervisor invocation |
| US8615502B2 (en) | 2008-04-18 | 2013-12-24 | Mcafee, Inc. | Method of and system for reverse mapping vnode pointers |
| WO2010016904A2 (en) | 2008-08-07 | 2010-02-11 | Serge Nabutovsky | Link exchange system and method |
| US8065714B2 (en) | 2008-09-12 | 2011-11-22 | Hytrust, Inc. | Methods and systems for securely managing virtualization platform |
| US8196203B2 (en) | 2008-09-25 | 2012-06-05 | Symantec Corporation | Method and apparatus for determining software trustworthiness |
| US9141381B2 (en) | 2008-10-27 | 2015-09-22 | Vmware, Inc. | Version control environment for virtual machines |
| US8544003B1 (en) | 2008-12-11 | 2013-09-24 | Mcafee, Inc. | System and method for managing virtual machine configurations |
| US8291497B1 (en) | 2009-03-20 | 2012-10-16 | Symantec Corporation | Systems and methods for byte-level context diversity-based automatic malware signature generation |
| US8060722B2 (en) | 2009-03-27 | 2011-11-15 | Vmware, Inc. | Hardware assistance for shadow page table coherence with guest page mappings |
| US8359422B2 (en) | 2009-06-26 | 2013-01-22 | Vmware, Inc. | System and method to reduce trace faults in software MMU virtualization |
| US8341627B2 (en) | 2009-08-21 | 2012-12-25 | Mcafee, Inc. | Method and system for providing user space address protection from writable memory area in a virtual environment |
| US8381284B2 (en) | 2009-08-21 | 2013-02-19 | Mcafee, Inc. | System and method for enforcing security policies in a virtual environment |
| US8306988B1 (en) | 2009-10-26 | 2012-11-06 | Mcafee, Inc. | System, method, and computer program product for segmenting a database based, at least in part, on a prevalence associated with known objects included in the database |
| US8302194B2 (en) | 2009-10-26 | 2012-10-30 | Symantec Corporation | Using file prevalence to inform aggressiveness of behavioral heuristics |
| US9552497B2 (en) | 2009-11-10 | 2017-01-24 | Mcafee, Inc. | System and method for preventing data loss using virtual machine wrapped applications |
| US8925101B2 (en) | 2010-07-28 | 2014-12-30 | Mcafee, Inc. | System and method for local protection against malicious software |
| US8938800B2 (en) | 2010-07-28 | 2015-01-20 | Mcafee, Inc. | System and method for network level protection against malicious software |
| US8549003B1 (en) | 2010-09-12 | 2013-10-01 | Mcafee, Inc. | System and method for clustering host inventories |
| US8495060B1 (en) | 2010-12-07 | 2013-07-23 | Trend Micro, Inc. | Prioritization of reports using content data change from baseline |
| US9075993B2 (en) | 2011-01-24 | 2015-07-07 | Mcafee, Inc. | System and method for selectively grouping and managing program files |
| US20130247192A1 (en) | 2011-03-01 | 2013-09-19 | Sven Krasser | System and method for botnet detection by comprehensive email behavioral analysis |
| US8694738B2 (en) | 2011-10-11 | 2014-04-08 | Mcafee, Inc. | System and method for critical address space protection in a hypervisor environment |
| US9069586B2 (en) | 2011-10-13 | 2015-06-30 | Mcafee, Inc. | System and method for kernel rootkit protection in a hypervisor environment |
| US8973144B2 (en) | 2011-10-13 | 2015-03-03 | Mcafee, Inc. | System and method for kernel rootkit protection in a hypervisor environment |
| US9215240B2 (en) | 2013-07-25 | 2015-12-15 | Splunk Inc. | Investigative and dynamic detection of potential security-threat indicators from events in big data |
-
2010
- 2010-09-12 US US12/880,125 patent/US8549003B1/en active Active
-
2013
- 2013-09-03 US US14/016,497 patent/US8843496B2/en not_active Expired - Fee Related
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080010304A1 (en) * | 2006-03-29 | 2008-01-10 | Santosh Vempala | Techniques for clustering a set of objects |
| US7809704B2 (en) * | 2006-06-15 | 2010-10-05 | Microsoft Corporation | Combining spectral and probabilistic clustering |
Non-Patent Citations (1)
| Title |
|---|
| Taskar et al., probabilistic classification and clustering in relational data, 2001, Google, 7 pages. * |
Cited By (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120166317A1 (en) * | 2010-12-23 | 2012-06-28 | Bladelogic, Inc. | Auto-Suggesting IT Asset Groups Using Clustering Techniques |
| US10824986B2 (en) * | 2010-12-23 | 2020-11-03 | Bladelogic, Inc. | Auto-suggesting IT asset groups using clustering techniques |
| US20130117676A1 (en) * | 2011-11-04 | 2013-05-09 | International Business Machines Corporation | Visually analyzing, clustering, transforming and consolidating real and virtual machine images in a computing environment |
| US8954859B2 (en) * | 2011-11-04 | 2015-02-10 | International Business Machines Corporation | Visually analyzing, clustering, transforming and consolidating real and virtual machine images in a computing environment |
| US9305076B1 (en) * | 2012-06-28 | 2016-04-05 | Google Inc. | Flattening a cluster hierarchy tree to filter documents |
| US9519802B2 (en) * | 2014-05-07 | 2016-12-13 | American Express Travel Related Services Company, Inc. | Systems and methods for document and data protection |
| US9633224B1 (en) | 2014-05-07 | 2017-04-25 | American Express Travel Related Services Company, Inc. | Protecting sensitive data prior to reaching the cloud |
| US20170185801A1 (en) * | 2014-05-07 | 2017-06-29 | American Express Travel Related Services Company, Inc. | Pre-cloud data protection |
| US9965647B2 (en) * | 2014-05-07 | 2018-05-08 | American Express Travel Related Services Company, Inc. | Pre-cloud data protection |
| US10389641B2 (en) * | 2015-03-31 | 2019-08-20 | British Telecommunications Public Limited Company | Network operation |
| US11336534B2 (en) | 2015-03-31 | 2022-05-17 | British Telecommunications Public Limited Company | Network operation |
| US11240119B2 (en) | 2015-07-31 | 2022-02-01 | British Telecommunications Public Limited Company | Network operation |
| US20170109186A1 (en) * | 2015-10-20 | 2017-04-20 | International Business Machines Corporation | Server build optimization |
| US10025611B2 (en) * | 2015-10-20 | 2018-07-17 | International Business Machines Corporation | Server build optimization |
| US10810168B2 (en) * | 2015-11-24 | 2020-10-20 | Red Hat, Inc. | Allocating file system metadata to storage nodes of distributed file system |
| US20170147602A1 (en) * | 2015-11-24 | 2017-05-25 | Red Hat, Inc. | Allocating file system metadata to storage nodes of distributed file system |
| CN109063198A (en) * | 2018-09-10 | 2018-12-21 | 浙江广播电视集团 | Melt the multidimensional visual search recommender system of media resource |
| US11973773B2 (en) * | 2020-05-15 | 2024-04-30 | Arbor Networks, Inc. | Detecting and mitigating zero-day attacks |
| US20240406197A1 (en) * | 2023-05-30 | 2024-12-05 | HYAS Infosec Inc. | Tiered remediation of outlier domain name system requests |
| US12549578B2 (en) * | 2023-05-30 | 2026-02-10 | HYAS Infosec Inc. | Tiered remediation of outlier domain name system requests |
Also Published As
| Publication number | Publication date |
|---|---|
| US8843496B2 (en) | 2014-09-23 |
| US8549003B1 (en) | 2013-10-01 |
| US20140006405A1 (en) | 2014-01-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8549003B1 (en) | System and method for clustering host inventories | |
| US11924021B1 (en) | Actionable event responder architecture | |
| US12289295B1 (en) | Trusted tunnel bridge | |
| US11614856B2 (en) | Row-based event subset display based on field metrics | |
| US10942960B2 (en) | Automatic triage model execution in machine data driven monitoring automation apparatus with visualization | |
| US10942946B2 (en) | Automatic triage model execution in machine data driven monitoring automation apparatus | |
| US11144528B2 (en) | Event time selection output techniques | |
| US11392590B2 (en) | Triggering alerts from searches on events | |
| US9553771B1 (en) | Bloom filter index for device discovery | |
| US12393860B2 (en) | Systems and methods for optimizing machine learning models by summarizing list characteristics based on multi-dimensional feature vectors | |
| CN110659740A (en) | Ordering and updating machine learning models based on data input at edge nodes | |
| JP2021502625A (en) | Computer implementation methods, computer systems, systems, and computer programs that determine security anomalies | |
| US10042875B2 (en) | Bloom filter index for device discovery | |
| EP3815342B1 (en) | Adaptive user-interface assembling and rendering | |
| US10331441B2 (en) | Source code mapping through context specific key word indexes and fingerprinting | |
| US12189931B1 (en) | Drill down of statistics chart row | |
| EP3218811A1 (en) | Testing insecure computing environments using random data sets generated from characterizations of real data sets | |
| US8621550B1 (en) | Information technology resource compliance templates | |
| US11048760B1 (en) | Techniques for placing content in and applying layers in an extended reality environment | |
| EP2678809A1 (en) | Entity fingerprints | |
| De La Torre-Abaitua et al. | On the application of compression-based metrics to identifying anomalous behaviour in web traffic | |
| US11895237B1 (en) | Scaled authentication of endpoint devices | |
| US10693628B2 (en) | Enabling distance-based operations on data encrypted using a homomorphic encryption scheme with inefficient decryption | |
| US20220383220A1 (en) | Identifying solution packages based on determined relationships | |
| US20130055090A1 (en) | Bundling configuration items into a composite configuration item |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: MCAFEE, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BHARGAVA, RISHI;REESE, DAVID P., JR.;REEL/FRAME:024972/0186 Effective date: 20100910 |
|
| FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| FPAY | Fee payment |
Year of fee payment: 4 |
|
| AS | Assignment |
Owner name: MCAFEE, LLC, CALIFORNIA Free format text: CHANGE OF NAME AND ENTITY CONVERSION;ASSIGNOR:MCAFEE, INC.;REEL/FRAME:043665/0918 Effective date: 20161220 |
|
| AS | Assignment |
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., MARYLAND Free format text: SECURITY INTEREST;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:045056/0676 Effective date: 20170929 Owner name: JPMORGAN CHASE BANK, N.A., NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:045055/0786 Effective date: 20170929 |
|
| AS | Assignment |
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., MARYLAND Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REMOVE PATENT 6336186 PREVIOUSLY RECORDED ON REEL 045056 FRAME 0676. ASSIGNOR(S) HEREBY CONFIRMS THE SECURITY INTEREST;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:054206/0593 Effective date: 20170929 Owner name: JPMORGAN CHASE BANK, N.A., NEW YORK Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REMOVE PATENT 6336186 PREVIOUSLY RECORDED ON REEL 045055 FRAME 786. ASSIGNOR(S) HEREBY CONFIRMS THE SECURITY INTEREST;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:055854/0047 Effective date: 20170929 |
|
| AS | Assignment |
Owner name: MCAFEE, LLC, CALIFORNIA Free format text: RELEASE OF INTELLECTUAL PROPERTY COLLATERAL - REEL/FRAME 045055/0786;ASSIGNOR:JPMORGAN CHASE BANK, N.A., AS COLLATERAL AGENT;REEL/FRAME:054238/0001 Effective date: 20201026 |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |
|
| AS | Assignment |
Owner name: MCAFEE, LLC, CALIFORNIA Free format text: RELEASE OF INTELLECTUAL PROPERTY COLLATERAL - REEL/FRAME 045056/0676;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC., AS COLLATERAL AGENT;REEL/FRAME:059354/0213 Effective date: 20220301 |
|
| AS | Assignment |
Owner name: JPMORGAN CHASE BANK, N.A., AS ADMINISTRATIVE AGENT AND COLLATERAL AGENT, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:059354/0335 Effective date: 20220301 |
|
| AS | Assignment |
Owner name: JPMORGAN CHASE BANK, N.A., AS ADMINISTRATIVE AGENT, NEW YORK Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE THE PATENT TITLES AND REMOVE DUPLICATES IN THE SCHEDULE PREVIOUSLY RECORDED AT REEL: 059354 FRAME: 0335. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT;ASSIGNOR:MCAFEE, LLC;REEL/FRAME:060792/0307 Effective date: 20220301 |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 12 |