WO2010075367A2 - Computerized work chain and method for performing a work chain in a computer - Google Patents

Computerized work chain and method for performing a work chain in a computer

Info

Publication number
WO2010075367A2
WO2010075367A2 PCT/US2009/069165 US2009069165W WO2010075367A2 WO 2010075367 A2 WO2010075367 A2 WO 2010075367A2 US 2009069165 W US2009069165 W US 2009069165W WO 2010075367 A2 WO2010075367 A2 WO 2010075367A2
Authority
WO
WIPO (PCT)
Prior art keywords
work
queue
request
chain
monitor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2009/069165
Other languages
English (en)
Other versions
WO2010075367A3 (fr
Inventor
Johney Tsai
David Strong
Chi Lin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisys Corp
Original Assignee
Unisys Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/340,844 (US20100161715A1)
Priority claimed from US12/502,504 (US20100162244A1)
Priority claimed from US12/502,273 (US20100169408A1)
Application filed by Unisys Corp
Publication of WO2010075367A2
Publication of WO2010075367A3

Links

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/54 Interprogram communication
    • G06F9/546 Message passing systems or structures, e.g. queues

Definitions

  • the instant disclosure relates to a computer work chain comprising work queues that are linkable such that a work result produced by one work queue in the work chain is deliverable to a next work queue in the work chain.
  • Computer work chains are used to perform work functions in a computer processing device, such as a central processing unit (CPU) of a server, for example.
  • Computer work chains are implemented in software running on the computer processing device.
  • the work chain is typically made up of a plurality of work queues, with each work queue being capable of performing one or more work tasks.
  • the work chain executes when a caller makes a call to a method associated with the work chain.
  • Work chains are typically designed to operate asynchronously such that when a call is made to the method, control returns to the caller while the work chain processes the call.
  • When the work chain completes the processing of the call, the work chain notifies the caller that the call has been processed and returns a return value to the caller.
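  • The asynchronous calling pattern described above can be illustrated with a minimal sketch. It assumes the standard `java.util.concurrent.CompletableFuture` class as the notification mechanism; the class name `AsyncCallSketch`, the method `callWorkChain`, and the "processed:" prefix are hypothetical and are not taken from the disclosure.

```java
import java.util.concurrent.CompletableFuture;

/** Hypothetical sketch of an asynchronous call into a work chain. */
final class AsyncCallSketch {

    /** Stand-in for the method associated with the work chain; returns immediately. */
    static CompletableFuture<String> callWorkChain(String request) {
        // The processing runs on a pool thread, not on the caller's thread.
        return CompletableFuture.supplyAsync(() -> "processed:" + request);
    }

    public static void main(String[] args) {
        CompletableFuture<String> pending = callWorkChain("order-42");
        // Control is back with the caller here; register the notification callback.
        CompletableFuture<Void> notified =
                pending.thenAccept(value -> System.out.println("return value: " + value));
        notified.join(); // keep the demo alive until the callback has run
    }
}
```

  • The caller receives the pending result at once and is notified through the callback when the return value becomes available.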
  • the work queues are typically arranged in a list. Each work queue in the list typically has functionality for receiving a value that is provided as input to the work queue, performing at least one process on the received value, and outputting the processed value to the next work queue in the work chain.
  • the work chain has a pool of worker threads from which the work queues select worker threads to perform the functions of the work queues.
  • a work chain monitor determines whether a worker thread in the pool is available to be used by the work queue, and if so, allocates the available worker thread to the work queue.
  • Work chains often include additional functionality, such as exception monitoring and logging.
  • the invention is directed to a computerized work chain and methods for performing a work chain.
  • the work chain comprises at least one processing device, M work queues, where M is a positive integer that is greater than or equal to one, and a work queue handler.
  • Each work queue comprises a queue monitor, an exception monitor, a pool of worker threads, a logger, and a data queue.
  • the processing device is configured to perform the computerized work chain.
  • the work queues are implemented in the processing device.
  • the work queue handler is implemented in the processing device.
  • the work chain has a work chain input and a work chain output.
  • The work queue handler forms the work chain by linking the work queues Q0 through QN together such that respective outputs of work queues Q0 through QN-1 are linked to respective inputs of work queues Q1 through QN, respectively.
  • The input of work queue Q0 is linked to the work chain input and an output of work queue QN is linked to the work chain output.
  • Work requests J0 through JN are saved in the data queues of work queues Q0 through QN, respectively.
  • The J1 through JN work requests correspond to J0 through JN-1 work results, respectively, produced by the work queues Q0 through QN-1 processing the J0 through JN-1 work requests, respectively, with respective worker threads of the Q0 through QN-1 work queues, respectively.
  • A JN work result produced by work queue QN processing work request JN is provided at the output of the work chain.
  • The method comprises the following steps A - F.
  • In step A, a work request at an input to the work chain is received in a work queue handler of the work chain.
  • In step C, the Jth work queue receives the work request at its input and attempts to process the work request. If the Jth work queue is successful at processing the work request, the work queue outputs a work result at its output. In step D, if the Jth work queue was successful at producing the work result, it sends a notification from the Jth work queue to the work queue handler to indicate that the Jth work result has been successfully produced. In step E, if the notification has been received in the work queue handler, the work queue handler determines whether the value of J is equal to N. If the value of J is not equal to N, the handler increments the value of J from a previous J value to a new J value.
  • After J has been incremented, the method returns to step C with the work result produced at the output of the work queue at the Jth position corresponding to the previous J value being provided as a work request at the input of the work queue at the Jth position corresponding to the new J value. If it is determined at step E that the notification has been received and that the value of J is equal to N, the handler causes the Jth work result to be output from an output of the work chain.
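  • The loop over J described in the steps above can be sketched as follows. This is a simplified, single-threaded illustration under stated assumptions, not the claimed implementation: the names `HandlerLoopSketch`, `Notification`, `WorkQueue` and `Handler` are hypothetical, each work queue is reduced to a single function, and a failed queue is modeled as one that throws and therefore never sends its notification.

```java
import java.util.List;
import java.util.Optional;
import java.util.function.Function;

/** Hypothetical, synchronous sketch of the handler loop over work queues 0..N. */
final class HandlerLoopSketch {

    /** Callback the J-th queue uses to report a successfully produced result (step D). */
    interface Notification {
        void resultProduced(int j, Object workResult);
    }

    /** A work queue reduced to one task; it notifies the handler only on success. */
    static final class WorkQueue {
        private final Function<Object, Object> task;

        WorkQueue(Function<Object, Object> task) {
            this.task = task;
        }

        void process(int j, Object workRequest, Notification handler) {
            try {
                handler.resultProduced(j, task.apply(workRequest)); // steps C and D
            } catch (RuntimeException e) {
                // Processing failed: no work result and no notification.
            }
        }
    }

    /** The work queue handler: receives the request and walks J from 0 to N. */
    static final class Handler implements Notification {
        private final List<WorkQueue> queues;
        private boolean notified;
        private Object lastResult;

        Handler(List<WorkQueue> queues) {
            if (queues.isEmpty()) {
                throw new IllegalArgumentException("at least one work queue is required");
            }
            this.queues = queues;
        }

        @Override
        public void resultProduced(int j, Object workResult) {
            notified = true;
            lastResult = workResult;
        }

        /** Returns the chain output, or empty if some queue never sent its notification. */
        Optional<Object> run(Object workRequest) {
            int n = queues.size() - 1;
            int j = 0;
            Object request = workRequest;
            while (true) {
                notified = false;
                queues.get(j).process(j, request, this);
                if (!notified) {
                    return Optional.empty();
                }
                if (j == n) {                       // step E: J equals N
                    return Optional.ofNullable(lastResult);
                }
                j++;                                // increment J
                request = lastResult;               // previous result becomes next request
            }
        }
    }
}
```

  • A usage example under the same assumptions: a handler built over two queues, the first adding 1 and the second multiplying by 10, returns 20 for the work request 1.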
  • the invention also provides a computer-readable medium having a computer program stored thereon comprising computer instructions for performing a work chain in a processing device.
  • the program comprises first, second, third, and fourth sets of instructions.
  • the first set of computer instructions receives a work request at an input to the work chain.
  • Each work queue comprises a respective queue monitor, a respective exception monitor, a respective pool of worker threads, a respective logger, and a respective data queue.
  • The third set of computer instructions performs a Jth work queue algorithm that attempts to process the work request in the Jth work queue. If the Jth work queue algorithm is successful at processing the work request, the Jth work queue algorithm outputs a work result from an output of the Jth work queue and outputs a call back notification. The notification provides an indication that the Jth work result has been successfully produced.
  • The Jth work queue algorithm includes a Jth work queue monitor, a Jth exception monitor, a Jth pool of worker threads, a Jth logger, and a Jth data queue.
  • The fourth set of instructions determines whether the notification has been output by the third set of instructions, and if so, whether the value of J is equal to N. If the value of J is not equal to N, the fourth set of instructions causes the value of J to be incremented from a previous J value to a new J value. After J has been incremented, the third set of instructions uses the work result produced at the output of the work queue at the Jth position in the linked list corresponding to the previous J value as a work request at the input of the work queue at the Jth position in the linked list corresponding to the new J value. If the fourth set of instructions determines that the notification has been output by the third set of instructions and that the value of J is equal to N, the fourth set of instructions causes the work result output from the Jth work queue to be output from an output of the work chain.
  • FIG. 1 illustrates a block diagram of the JERM system in accordance with an embodiment.
  • FIG. 2 illustrates a block diagram of the JERM system in accordance with another illustrative embodiment.
  • Fig. 3 illustrates a block diagram of a work chain comprising a plurality of work queues and a work queue handler in accordance with an illustrative embodiment.
  • Fig. 4 illustrates a block diagram that represents the functional components of one of the work queues shown in Fig. 3 in accordance with an illustrative embodiment.
  • Fig. 5 illustrates a flowchart that represents the method performed by the work chain described above with reference to Fig. 3 in accordance with an illustrative embodiment.
  • Fig. 6 illustrates a flowchart that represents the method performed by the exemplary work queue shown in Fig. 4 in accordance with an illustrative embodiment.
  • FIG. 7 illustrates a flowchart that represents a method in accordance with an illustrative embodiment for performing Java enterprise resource management on the client side of the JERM management system shown in Fig. 1.
  • Fig. 8 illustrates a flowchart that represents a method in accordance with an illustrative embodiment for performing Java enterprise resource management on the server side of the JERM management system shown in Fig. 1.
  • the invention is directed to a work chain and methods performed by the work chain.
  • the work chain is implemented in a combination of hardware and software.
  • the work chain comprises at least one processing device configured to perform the computerized work chain, M work queues implemented in the one or more processing devices, and a work queue handler implemented in the one or more processing devices, where M is a positive integer that is greater than or equal to one.
  • Each work queue comprises a queue monitor, an exception monitor, a pool of worker threads, a logger, and a data queue.
  • The work queue handler forms the work chain by linking the M work queues together such that respective outputs of a first one of the work queues through an (M-1)th one of the work queues are linked to respective inputs of a second one of the work queues through an Mth one of the work queues, respectively.
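  • One possible arrangement of the five components of a single work queue is sketched below. The mapping is an assumption made for illustration only: the data queue is a `LinkedBlockingQueue`, the pool of worker threads is a fixed `ExecutorService`, the logger is a `java.util.logging.Logger`, the queue monitor is a thread that moves jobs from the data queue to the pool, and the exception monitor is reduced to a failure counter. The class name `WorkQueueSketch` and its methods are hypothetical.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;
import java.util.logging.Logger;

/** Hypothetical sketch of one work queue and its five components. */
final class WorkQueueSketch<I, O> implements AutoCloseable {

    /** A work request paired with the future that will carry its work result. */
    private static final class Job<A, B> {
        final A request;
        final CompletableFuture<B> result = new CompletableFuture<>();

        Job(A request) {
            this.request = request;
        }
    }

    private final BlockingQueue<Job<I, O>> dataQueue = new LinkedBlockingQueue<>(); // data queue
    private final AtomicInteger failures = new AtomicInteger();                     // exception monitor
    private final Logger logger = Logger.getLogger("WorkQueueSketch");              // logger
    private final ExecutorService workers;                                          // worker thread pool
    private final Thread queueMonitor;                                              // queue monitor
    private final Function<I, O> task;

    WorkQueueSketch(int poolSize, Function<I, O> task) {
        this.task = task;
        this.workers = Executors.newFixedThreadPool(poolSize);
        this.queueMonitor = new Thread(this::monitorLoop, "queue-monitor");
        this.queueMonitor.setDaemon(true);
        this.queueMonitor.start();
    }

    /** Saves the work request in the data queue and returns a handle to its result. */
    CompletableFuture<O> submit(I request) {
        Job<I, O> job = new Job<>(request);
        dataQueue.add(job);
        return job.result;
    }

    int failureCount() {
        return failures.get();
    }

    /** Queue monitor: waits for queued requests and hands each one to a worker thread. */
    private void monitorLoop() {
        try {
            while (true) {
                Job<I, O> job = dataQueue.take();
                workers.execute(() -> runJob(job));
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // close() was called
        }
    }

    private void runJob(Job<I, O> job) {
        try {
            job.result.complete(task.apply(job.request));
        } catch (RuntimeException e) {
            failures.incrementAndGet();
            logger.warning("work request failed: " + e);
            job.result.completeExceptionally(e);
        }
    }

    @Override
    public void close() {
        queueMonitor.interrupt();
        try {
            queueMonitor.join(); // stop dispatching before the pool is shut down
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        workers.shutdown();
    }

    public static void main(String[] args) {
        try (WorkQueueSketch<String, Integer> q = new WorkQueueSketch<>(2, String::length)) {
            System.out.println(q.submit("work request").join()); // prints 12
        }
    }
}
```

  • Keeping the monitor separate from the workers means a slow task occupies only a worker thread, while newly arriving work requests continue to accumulate in the data queue.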
  • JERM stands for Java enterprise resource management.
  • The JERM system combines attributes of run-time RMSs and call-analysis RMSs to allow both timing metrics and call metrics to be monitored in real time, and to cause appropriate actions to be taken in real time.
  • The work chain is not limited with respect to the environments or industries in which it is suitably employed; persons of ordinary skill in the art will understand, in view of the description provided herein, that the work chain is suitable for use in many different environments and industries.
  • the description herein of the work chain being employed in a JERM system is provided merely for the purpose of giving a real-world example of one suitable use of the work chain.
  • a detailed description of the exemplary JERM system will be provided and then a description of the work chain as employed in the JERM system will be provided.
  • the JERM system with which the work chain may be employed provides a level of granularity with respect to the monitoring of methods that are triggered during a transaction that is equivalent to or better than that which is currently provided in the aforementioned known call-analysis RMSs.
  • The JERM system also provides information associated with the timing of hops that occur between servers, and between and within applications, during a transaction. Because all of this information is obtained in real time, the JERM system is able to respond in real time, or near real time, to cause resources to be allocated or re-allocated in a way that provides improved efficiency and productivity, and in a manner that enables the enterprise to quickly recover from resource failures.
  • the JERM system is a scalable solution that can be widely implemented with relative ease and that can be varied with relative ease in order to meet a wide variety of implementation needs.
  • Fig. 1 is a block diagram illustrating the JERM system 100.
  • the JERM system 100 comprises a client side 110 and a server side 120.
  • a client Production Server 1 runs various computer software programs, including, but not limited to, an application computer software program 2, a metrics gathering computer software program 10, a metrics serializer and socket generator computer software program 20, and a JERM agent computer software program 30.
  • the Production Server 1 is typically one of many servers located on the client side 110.
  • the Production Server 1 and other servers (not shown) are typically located in a data center (not shown) of the enterprise (not shown).
  • the Production Server 1 may be one of several servers of a server farm, or cluster, that perform similar processing operations, or applications.
  • each server is controlled by the application computer software program that is being run on the server.
  • each server of the same farm may run the same application software program and may have the same operating system (OS) and hardware.
  • a data center may have multiple server farms, with each farm being dedicated to a particular purpose.
  • the application program 2 that is run by the Production Server 1 may be virtually any Java Enterprise Edition (Java EE) program that performs one or more methods associated with a transaction, or all methods associated with a transaction.
  • the metrics gathering program 10 monitors the execution of the application program 2 and gathers certain metrics. The metrics that are gathered depend on the manner in which metrics gathering program 10 is configured.
  • a user interface (UI) 90 is capable of accessing the production server 1 to modify the configuration of the metrics gathering program 10 in order to add, modify or remove metrics.
  • Typical system-level metrics that may be gathered include CPU utilization, RAM usage, disk I/O performance, and network I/O performance.
  • Typical application-level metrics that may be gathered include response time metrics, SQL call metrics, and EJB call metrics. It should be noted, however, that the disclosed system and method are not limited with respect to the type or number of metrics that may be gathered by the metrics gathering program 10.
  • metrics that are gathered by the metrics gathering program 10 are provided to the metrics serializer and socket generator (MSSG) software program 20.
  • the MSSG program 20 serializes each metric into a serial byte stream and generates a communications socket that will be used to communicate the serial byte stream to the JERM Management Server 40 located on the server side 120 of the JERM system 100. The serial byte stream is then transmitted over the socket 80 to the JERM Management Server 40.
  • the socket 80 is typically a Transmission Control Protocol/Internet Protocol ("TCP/IP") socket that provides a bidirectional communications link between an I/O port of the Production Server 1 and an I/O port of the JERM Management Server 40.
  • the JERM Management Server 40 runs various computer software programs, including, but not limited to, a metrics deserializer computer software program 50, a rules manager computer software program 60, and an actions manager computer software program 70.
  • the metrics deserializer program 50 receives the serial byte stream communicated via the socket 80 and performs a deserialization algorithm that deserializes the serial byte stream to produce a deserialized metric.
  • the deserialized metric comprises parallel bits or bytes of data that represent the metric gathered on the client side 110 by the metrics gathering program 10.
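  • The serialization and deserialization of a metric can be sketched as a round trip through a byte stream. The wire layout below (a name, a value and a timestamp written with `DataOutputStream`) is an assumption; the disclosure does not specify one. The names `MetricWireSketch`, `Metric`, `write`, `read` and `roundTrip` are hypothetical. An in-memory buffer stands in for the socket so that the sketch runs without a network; in a deployment the same `write` and `read` calls could be given the streams returned by `java.net.Socket.getOutputStream()` and `getInputStream()`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;

/** Hypothetical sketch: a metric serialized to a byte stream and read back. */
final class MetricWireSketch {

    /** Assumed metric shape: a name, a numeric value, and a capture time. */
    static final class Metric {
        final String name;
        final double value;
        final long timestampMillis;

        Metric(String name, double value, long timestampMillis) {
            this.name = name;
            this.value = value;
            this.timestampMillis = timestampMillis;
        }
    }

    /** Client side: writes the metric fields to the stream in a fixed order. */
    static void write(Metric metric, OutputStream out) throws IOException {
        DataOutputStream data = new DataOutputStream(out);
        data.writeUTF(metric.name);
        data.writeDouble(metric.value);
        data.writeLong(metric.timestampMillis);
        data.flush();
    }

    /** Server side: reads the fields back in the same order to rebuild the metric. */
    static Metric read(InputStream in) throws IOException {
        DataInputStream data = new DataInputStream(in);
        String name = data.readUTF();
        double value = data.readDouble();
        long timestampMillis = data.readLong();
        return new Metric(name, value, timestampMillis);
    }

    /** Serializes and immediately deserializes a metric through an in-memory buffer. */
    static Metric roundTrip(Metric metric) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            write(metric, buffer);
            return read(new ByteArrayInputStream(buffer.toByteArray()));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Metric copy = roundTrip(new Metric("cpu.utilization", 0.87, 1000L));
        System.out.println(copy.name + " = " + copy.value);
    }
}
```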
  • the deserialized metric is then received by the rules manager program 60.
  • the rules manager program 60 analyzes the deserialized metric and determines whether a rule exists that is to be applied to the deserialized metric. If a determination is made by the rules manager program 60 that such a rule exists, the rules manager program 60 applies the rule to the deserialized metric and makes a decision based on the application of the rule.
  • the rules manager program 60 then sends the decision to the actions manager program 70.
  • the actions manager program 70 analyzes the decision and decides if one or more actions are to be taken.
  • the actions manager program 70 causes one or more actions to be taken by sending a command to the Production Server 1 on the client side 110, or to some other server (not shown) on the client side 110. As stated above, there may be multiple instances of the Production Server 1 on the client side 110, so the action that is taken may be directed at a different server (not shown) on the client side 110.
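  • The path from a deserialized metric to a command can be sketched as follows. The rule representation, the `Decision` values, the metric name "cpu.load", the 0.85 threshold and the command strings are all assumptions chosen for illustration; the disclosure does not define a rule or command format.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.function.DoublePredicate;

/** Hypothetical sketch of a rules manager feeding an actions manager. */
final class RulesSketch {

    enum Decision { SCALE_OUT, SCALE_IN }

    /** A rule applies to one named metric and yields a decision when its condition holds. */
    static final class Rule {
        final String metricName;
        final DoublePredicate condition;
        final Decision decision;

        Rule(String metricName, DoublePredicate condition, Decision decision) {
            this.metricName = metricName;
            this.condition = condition;
            this.decision = decision;
        }
    }

    static final class RulesManager {
        private final List<Rule> rules = new ArrayList<>();

        void add(Rule rule) {
            rules.add(rule);
        }

        /** Returns the decision of the first matching rule, or empty if no rule applies. */
        Optional<Decision> apply(String metricName, double value) {
            for (Rule rule : rules) {
                if (rule.metricName.equals(metricName) && rule.condition.test(value)) {
                    return Optional.of(rule.decision);
                }
            }
            return Optional.empty();
        }
    }

    static final class ActionsManager {
        /** Maps a decision to the command text that would be sent to an agent. */
        Optional<String> commandFor(Optional<Decision> decision) {
            return decision.map(d -> d == Decision.SCALE_OUT ? "start-instance" : "stop-instance");
        }
    }

    public static void main(String[] args) {
        RulesManager rules = new RulesManager();
        rules.add(new Rule("cpu.load", v -> v > 0.85, Decision.SCALE_OUT));
        ActionsManager actions = new ActionsManager();
        System.out.println(actions.commandFor(rules.apply("cpu.load", 0.93)));
    }
}
```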
  • each Production Server 1 on the client side 110 runs the JERM agent software program 30. For ease of illustration, only a single Production Server 1 is shown in Fig. 1.
  • the JERM agent program 30 is configured to detect if a command has been sent from the actions manager program 70 and to take whatever action is identified by the command.
  • the actions include scaling out one or more physical and/or virtual instances and scaling in one or more physical and/or virtual instances.
  • the commands that are sent from the actions manager program 70 to one or more of the JERM agent programs 30 of one or more of the Production Servers 1 are sent over a communications link 130, which may be an Internet socket connection or some other type of communications link.
  • An example of an action that scales out another physical instance is an action that causes another Production Server 1 to be brought online or to be re-purposed.
  • The rules manager program 60 may process the respective CPU load metrics for the respective accounts receivable servers, which correspond to Production Servers 1, and decide that the CPU loads are above a threshold limit defined by the associated rule. The rules manager program 60 will then send this decision to the actions manager program 70. The actions manager program 70 will then send commands to one or more JERM agent programs 30 running on one or more accounts payable servers, which also correspond to Production Servers 1, instructing the JERM agent programs 30 to cause their respective servers to process a portion of the accounts receivable processing loads.
  • the actions manager program 70 also sends commands to one or more JERM agent programs 30 of one or more of the accounts receivable servers instructing those agents 30 to cause their respective accounts receivable servers to offload a portion of their respective accounts receivable processing loads to the accounts payable servers.
  • An example where the action taken by the actions manager program 70 is the scaling out of one or more virtual instances is as follows. Assuming that the application program 2 running on the Production Server 1 is a particular application program, such as the checkout application program described above, the actions manager program 70 may send a command to the JERM agent program 30 that instructs the JERM agent program 30 to cause the Production Server 1 to invoke another instance of the checkout application program so that there are now two instances of the checkout application program running on the Production Server 1.
  • The actions manager program 70 can reduce the number and types of physical and virtual instances that are scaled out at any given time. For example, if the rules manager program 60 determines that the CPU loads on a farm of accounts payable servers are low (i.e., below a threshold limit), indicating that the servers are being underutilized, the actions manager program 70 may cause the processing loads on one or more of the accounts payable Production Servers 1 of the farm to be offloaded onto one or more of the other accounts payable Production Servers 1 of the farm to enable the Production Servers 1 from which the loads have been offloaded to be turned off or re-purposed.
  • Fig. 2 is a block diagram of the JERM system 200 in accordance with another illustrative embodiment.
  • the JERM system 200 of Fig. 2 includes some of the same components as those of the JERM system 100 shown in Fig. 1, but also includes some additional components and functionality not included in the JERM system 100 of Fig. 1.
  • the JERM system 200 of Fig. 2 has a client side 210 and a server side 220, which have a Production Server 230 and a JERM Management Server 310, respectively.
  • the Production Server 230 runs various computer software programs, including, but not limited to, an application computer software program 240, a metrics gathering computer software program 250, a client Managed Bean (MBean) computer software program 260, and a JERM agent computer software program 270.
  • the Production Server 230 is typically one of many servers located on the client side 210.
  • the Production Server 230 and other servers (not shown) are typically located in a data center (not shown) of the enterprise (not shown).
  • the JERM Management Server 310 typically communicates with and manages multiple servers, some of which are substantially identical to (e.g., additional instances of) the Production Server 230 running application program 240 and some of which are different from the Production Server 230 and perform functions that are different from those performed by the Production Server 230.
  • the application program 240 may be any program that performs one or more methods associated with a transaction, or that performs all methods associated with a transaction.
  • the metrics gathering program 250 monitors the execution of the application program 240 and gathers certain metrics. The metrics that are gathered depend on the manner in which the metrics gathering program 250 is configured.
  • the metrics gathering program 250 gathers metrics by aspecting JBoss interceptors.
  • JBoss is an application server program for use with Java EE and EJBs.
  • An EJB is an architecture for creating program components written in the Java programming language that run on the server in a client/server model.
  • An interceptor as that term is used herein, is a programming construct that is inserted between a method and an invoker of the method, i.e., between the caller and the callee.
  • the metrics gathering program 250 injects, or aspects, JBoss interceptors into the application program 240.
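  • The interceptor concept, a construct placed between the caller and the callee, can be sketched with the JDK's own `java.lang.reflect.Proxy`. This is a stand-in chosen so the example is self-contained; it is not the JBoss interceptor API referred to above. The names `TimingInterceptorSketch`, `CheckoutService`, `Timing` and `intercept` are hypothetical.

```java
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.InvocationTargetException;
import java.lang.reflect.Proxy;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: a timing interceptor built on a JDK dynamic proxy. */
final class TimingInterceptorSketch {

    /** Assumed application interface whose calls are to be measured. */
    interface CheckoutService {
        String checkout(String cartId);
    }

    /** One gathered response-time metric. */
    static final class Timing {
        final String method;
        final long nanos;

        Timing(String method, long nanos) {
            this.method = method;
            this.nanos = nanos;
        }
    }

    /** Wraps target so that every call through the returned proxy is timed into sink. */
    static <T> T intercept(Class<T> iface, T target, List<Timing> sink) {
        InvocationHandler handler = (proxy, method, args) -> {
            long start = System.nanoTime();
            try {
                return method.invoke(target, args); // forward to the real callee
            } catch (InvocationTargetException e) {
                throw e.getCause();                 // rethrow the callee's own exception
            } finally {
                sink.add(new Timing(method.getName(), System.nanoTime() - start));
            }
        };
        Object proxy = Proxy.newProxyInstance(
                iface.getClassLoader(), new Class<?>[] {iface}, handler);
        return iface.cast(proxy);
    }

    public static void main(String[] args) {
        List<Timing> gathered = new ArrayList<>();
        CheckoutService real = cartId -> "paid:" + cartId;
        CheckoutService measured = intercept(CheckoutService.class, real, gathered);
        System.out.println(measured.checkout("cart-7"));
        System.out.println(gathered.get(0).method + " took " + gathered.get(0).nanos + " ns");
    }
}
```

  • The caller and the callee are unchanged; only the reference handed to the caller differs, which is what allows metrics to be added or removed without modifying the application code.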
  • A UI 410, which is typically a graphical UI (GUI), enables a user to interact with the metrics gatherer program 250 to add, modify or remove metrics so that the user can easily change the types of metrics that are being monitored and gathered.
  • Typical system-level metrics that may be gathered include CPU utilization, RAM usage, disk I/O performance, and network I/O performance.
  • Typical application-level metrics that may be gathered include response time metrics, SQL call metrics, and EJB call metrics.
  • the client MBean program 260 receives the metrics gathered by the JBoss interceptors of the metrics gathering program 250 and performs a serialization algorithm that converts the metrics into a serial byte stream.
  • An MBean is an object in the Java programming language that is used to manage applications, services or devices, depending on the class of the MBean that is used.
  • the client MBean program 260 also sets up an Internet socket 280 for the purpose of communicating the serial byte stream from the client side 210 to the server side 220.
  • the metrics are typically sent from the client side 210 to the server side 220 at the end of a transaction that is performed by the application program 240.
  • the MBean program 260 wraps a client-side work chain comprising computer software code that performs the serialization and socket generation algorithms.
  • the server side 220 includes a JERM Management Server 310, which is configured to run a server MBean computer software program 320, a JERM rules manager computer software program 330, and a JERM actions manager computer software program 370.
  • the server MBean program 320 communicates with the client MBean program 260 via the socket 280 to receive the serial byte stream.
  • the server MBean program 320 performs a deserialization algorithm that deserializes the serial byte stream to convert the byte stream into parallel bits or bytes of data representing the metrics.
  • the JERM rules manager program 330 analyzes the deserialized metric and determines whether a rule exists that is to be applied to the deserialized metric.
  • the rules manager program 330 applies the rule to the deserialized metric and makes a decision based on the application of the rule.
  • the rules manager program 330 then sends the decision to a JERM rules manager proxy computer software program 360, which formats the decision into a web service request and sends the web service request to the JERM actions manager program 370.
  • The deserialization algorithm performed by the server MBean program 320 and the JERM rules manager program 330 are preferably implemented as a server-side work chain.
  • the JERM actions manager program 370 is typically implemented as a web service that is requested by the JERM rules manager proxy program 360.
  • The JERM actions manager program 370 includes an actions decider computer program 380 and an instance manager program 390.
  • the actions decider program 380 analyzes the request and decides if one or more actions are to be taken. If so, the actions decider program 380 sends instructions to the instance manager program 390 indicating one or more actions that need to be taken.
  • the instance manager program 390 has knowledge of all of the physical and virtual instances that are currently running on the client side 210, and therefore can make the ultimate decision on the type and number of physical and/or virtual instances that are to be scaled out and/or scaled in on the client side 210.
  • the JERM actions manager program sends instructions via one or more of the communications links 330 to one or more corresponding JERM agent programs 270 of one or more of the Production Servers 230 on the client side 210.
  • Each Production Server 230 on the client side 210 runs a JERM agent program 270.
  • the JERM agent program 270 is configured to detect if a command has been sent from the actions manager 370 and to take whatever action is identified by the command.
  • the actions include scaling out another physical and/or virtual instance and scaling in one or more physical and/or virtual instances.
  • the communications link 330 may be a TCP/IP socket connection or other type of communications link.
  • the types of actions that may be taken include, without limitation, those actions described above with reference to Fig. 1.
  • the UI 410 also connects to the JERM rules manager program 330 and to the JERM actions manager program 370.
  • the JERM rules manager program 330 is actually a combination of multiple programs that operate in conjunction with one another to perform various tasks.
  • One of these programs is a rules builder program 350.
  • a user interacts via the UI 410 with the rules builder program 350 to cause rules to be added, modified or removed from a rules database, which is typically part of the rules builder program 350, but may be external to the rules builder program 350. This feature allows a user to easily modify the rules that are applied by the JBoss rules applier program 340.
  • the connection between the UI 410 and the JERM actions manager program 370 enables a user to add, modify or remove the types of actions that the JERM actions manager 370 will cause to be taken. This feature facilitates the scalability of the JERM system 200. Over time, changes will typically be made to the client side 210. For example, additional resources (e.g., servers, application programs and/or devices) may be added to the client side 210 as the enterprise grows. Also, new resources may be substituted for older resources, for example, as resources wear out or better performing resources become available. Through interaction between the UI 410 and the JERM actions manager program 370, changes can be made to the instance manager program 390 to reflect changes that are made to the client side 210.
  • the instance manager program 390 typically will maintain one or more lists of (1) the total resources by type, network address and purpose that are employed on the client side 210, (2) the types, purposes and addresses of resources that are available at any given time, and (3) the types, purposes and addresses of resources that are in use at any given time. As resource changes are made on the client side 210, a user can update the lists maintained by the instance manager program 390 to reflect these changes.
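  • The three lists described above can be sketched with one map of all known resources and one set of addresses currently in use, with the available list derived from the two. The names `InstanceManagerSketch`, `Resource`, `register`, `unregister`, `acquire` and `release`, and the choice of the network address as the key, are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

/** Hypothetical sketch of the bookkeeping kept by an instance manager. */
final class InstanceManagerSketch {

    static final class Resource {
        final String type;
        final String address;
        final String purpose;

        Resource(String type, String address, String purpose) {
            this.type = type;
            this.address = address;
            this.purpose = purpose;
        }
    }

    private final Map<String, Resource> total = new LinkedHashMap<>(); // list (1): all resources
    private final Set<String> inUse = new LinkedHashSet<>();           // list (3): addresses in use

    /** Called when a resource is added on the client side. */
    void register(Resource resource) {
        total.put(resource.address, resource);
    }

    /** Called when a resource is retired or replaced on the client side. */
    void unregister(String address) {
        total.remove(address);
        inUse.remove(address);
    }

    /** List (2): every known resource that is not currently in use. */
    List<Resource> available() {
        List<Resource> free = new ArrayList<>();
        for (Resource resource : total.values()) {
            if (!inUse.contains(resource.address)) {
                free.add(resource);
            }
        }
        return free;
    }

    /** Scale out: marks the first available resource of the given type as in use. */
    Optional<Resource> acquire(String type) {
        for (Resource resource : available()) {
            if (resource.type.equals(type)) {
                inUse.add(resource.address);
                return Optional.of(resource);
            }
        }
        return Optional.empty();
    }

    /** Scale in: returns a resource to the available list; false if it was not in use. */
    boolean release(String address) {
        return inUse.remove(address);
    }
}
```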
  • Although the work chain and the associated methods are not limited to being used in a JERM system, it is worth mentioning some of the important features that enable the JERM system 200 to provide improved performance over known RMSs of the above-described type.
  • These features include: (1) the use of interceptors by the metrics gatherer program 250 to gather metrics without affecting the performance of a transaction while it is being performed by the application program 240; (2) the use of the client MBean program 260 and client-side work chain to convert the metrics into serial byte streams and send the serial byte streams over a TCP/IP socket 280 to the server side 220; and (3) the use of the server MBean program 320 and the server-side work chain to deserialize the byte stream received over the socket 280 and to apply applicable rules to the deserialized byte stream to produce a decision.
  • the metrics gatherer program 250 can be easily modified by a user, e.g., via the UI 410. Such modifications enable the user to update and/or change the types of metrics that are being monitored by the metrics gatherer program 250. This feature provides great flexibility with respect to the manner in which resources are monitored, which, in turn, provides great flexibility in deciding actions that need to be taken to improve performance on the client side 210 and taking those actions.
  • Certain functionality on the client side 210 and on the server side 220 is implemented with a client-side work chain and with a server-side work chain, respectively.
  • the client-side work chain comprises only the functionality that performs the serialization and socket generation programs that are wrapped in the client MBean 260.
  • the server-side work chain comprises the functionality for performing the socket communication and deserialization algorithms wrapped in the server MBean 320, and the functionality for performing the algorithms of the rules manager program 330.
  • These work chains operate like assembly lines, and parts of the work chains can be removed or altered to change the behavior of the JERM system 200 without affecting the behavior of the application program 240.
  • the work chains are typically configured in XML, and therefore, changes can be made to the work chains in XML, which is an easier task than modifying programs written in other types of languages which are tightly coupled. It should be noted, however, that it is not necessary that the work chains be implemented in any particular programming language. XML is merely an example of a suitable programming language for implementing the work chains. Prior to describing illustrative examples of the manners in which these work chains may be implemented on the client side 210 and server side 220, the general nature of the work chain will be described with reference to Fig. 3.
  • Fig. 3 illustrates a block diagram of a work chain 500 that demonstrates its functional components and the interaction between those components in accordance with an illustrative or exemplary embodiment.
  • the work chain 500 typically comprises XML code configured for execution by a processing device, such as a microprocessor, for example.
  • Each of the functional components of the work chain 500 performs one or more particular functions in the work chain 500.
  • the work chain 500 is made up of M work queues 510 that can be logically arranged into a pipe configuration, where M is a positive integer that is greater than or equal to one, and a work queue handler 520.
  • For ease of illustration, the work chain 500 is shown in Fig. 3 with three work queues 510A, 510B and 510C; that is, M is equal to three in this example.
  • the work chain 500 may comprise virtually any number of work queues 510.
  • the work queue handler 520 interacts with each of the work queues 510, as will be described below in more detail.
  • The work chain 500 implemented on the server side 220 may have the same number of work queues 510 as the work chain 500 implemented on the client side 210, in which case the number of work queues 510 in both the client-side and server-side work chains is equal to M.
  • The number of work queues 510 in the client-side work chain will typically be different from the number of work queues in the server-side work chain. Therefore, the number of work queues in the server-side work chain will be designated herein as being equal to or greater than L, where L is a positive integer that is greater than or equal to one, and where L may be, but need not be, equal to M. It should also be noted that the client side 210 may include a work chain in cases in which the server side 220 does not include a work chain, and vice versa.
  • Each of the work queues 510A, 510B and 510C has an input/output (I/O) interface 512A, 512B and 512C, respectively.
  • the I/O interfaces 512A - 512C communicate with an I/O interface 520A of the work queue handler 520.
  • the work queue handler 520 receives requests to be processed by the work chain 500 from a request originator (not shown) that is external to the work chain 500.
  • the external originator of these requests will vary depending on the scenario in which the work chain 500 is implemented. For example, in the case where the work chain 500 is implemented on the client side 210 shown in Fig. 2, the originator of the requests is typically the client MBean 260, which wraps the serializer and socket generator that comprise the work chain 500.
  • The work queue handler 520 comprises, or has access to, a linked list of all of the work queues 510A - 510C that can be linked into a work chain 500.
  • When a work request from an external originator is sent to the work chain 500, the request is received by the work queue handler 520.
  • the handler 520 selects the first work queue 510 in the linked list and assigns the request to the selected work queue 510.
  • The work queue 510 at a given position in the work chain 500 will be referred to hereinafter as "Q_J", where the subscript "J" represents the position of Q in the work chain 500. Therefore, in the illustrative embodiment of Fig. 3, J ranges from 0 to 2.
  • The output of Q_{J-1} is the input of Q_J, and the output of Q_J is the input of Q_{J+1}.
  • Q_0 is the first work queue 510 in the work chain 500, so its input is the input request received by the work chain 500 and its output is the input of Q_1.
  • The request received by the handler 520 from the external request originator is assigned by the handler 520 to the work queue Q_0 in the list, which is work queue 510A in the illustrative embodiment of Fig. 3.
  • The handler 520 causes the work result to be assigned to work queue Q_1.
  • the work queue 510 sends a call back to the handler 520.
  • the handler 520 assigns the work result produced by the successful work queue 510 to the next work queue 510 in the work chain 500.
  • the handler 520 makes a synchronous call to the selected work queue 510.
  • the result of the synchronous call is a success if the handler 520 is able to successfully assign this request to the selected work queue 510 before a timeout failure occurs.
  • the result of the synchronous call is unsuccessful if the handler 520 is not able to successfully assign the request to the selected work queue 510 before a timeout failure occurs.
  • Assume that the handler 520 successfully assigned a request to work queue 510A and that work queue 510A successfully processed the request and sent a call back to the handler 520.
  • The handler 520 selects the work queue 510B to receive the result produced by work queue 510A.
  • The output of the work queue 510A is used as the input of the work queue 510B.
  • The handler 520 will attempt to synchronously add the result to the work queue 510B using the aforementioned synchronous call. If the synchronous call fails, the handler 520 will assume that work queue 510B did not successfully process the request. This process continues until the work chain 500 has produced its final result. The handler 520 then causes the final result to be output at the work chain output.
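The hand-off from one work queue to the next can be sketched as follows. This is a simplified illustration under stated assumptions, not the implementation described in the figures: `StageSketch` and `HandlerSketch` are hypothetical names, the timed `BlockingQueue.offer` call stands in for the synchronous call with a timeout, and the worker threads and call backs are collapsed into a direct method call so that the example stays short.

```java
import java.util.LinkedList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;
import java.util.function.Function;

// One stage Q_J: a bounded data queue plus the work applied to each request.
final class StageSketch {
    private final BlockingQueue<Object> dataQueue = new ArrayBlockingQueue<>(4);
    private final Function<Object, Object> work;

    StageSketch(Function<Object, Object> work) {
        this.work = work;
    }

    // Synchronous add: succeeds only if there is room before the timeout.
    boolean add(Object request, long timeoutMs) throws InterruptedException {
        return dataQueue.offer(request, timeoutMs, TimeUnit.MILLISECONDS);
    }

    // Stand-in for a worker thread: take one request and produce a result.
    Object processNext() throws InterruptedException {
        return work.apply(dataQueue.take());
    }
}

final class HandlerSketch {
    private final List<StageSketch> chain = new LinkedList<>();
    private final long addTimeoutMs;

    HandlerSketch(long addTimeoutMs) {
        this.addTimeoutMs = addTimeoutMs;
    }

    // Append a stage to the end of the linked list of work queues.
    HandlerSketch link(Function<Object, Object> work) {
        chain.add(new StageSketch(work));
        return this;
    }

    // Assign the request to Q_0, then feed each result to the next queue.
    Object process(Object request) {
        Object current = request;
        try {
            for (StageSketch stage : chain) {
                if (!stage.add(current, addTimeoutMs)) {
                    throw new IllegalStateException("add timed out");
                }
                current = stage.processNext();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted", e);
        }
        return current;
    }

    public static void main(String[] args) {
        Object result = new HandlerSketch(100)
                .link(o -> ((String) o).trim())
                .link(o -> ((String) o).toUpperCase())
                .link(o -> ((String) o).length())
                .process("  abc ");
        System.out.println(result); // prints 3
    }
}
```

Because each stage only sees the previous stage's output, a stage can be added or removed in `link` calls without touching the others, which mirrors the assembly-line behavior described for the work chain.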
  • Fig. 4 illustrates a block diagram that represents the functional components of one of the work queues 510 shown in Fig. 3 in accordance with an illustrative embodiment.
  • The work queues 510 preferably have identical configurations. Therefore, the functional components of only one of the work queues, work queue 510A, are shown in Fig. 4.
  • The work queue 510A includes the I/O interface 512A, a queue monitor 521, an exception monitor 522, one or more worker threads 523, a logger 524, and a data queue 525.
  • The data queue 525 is a data structure that stores an incoming request received at the I/O interface 512A of the work queue 510A.
  • The queue monitor 521 is a programming thread that monitors the data queue 525 to determine if a request is stored therein, and if so, to determine if a worker thread 523 is available to handle the request.
  • The queue monitor 521 maintains a list of available worker threads 523 in the work queue 510A. In essence, the list maintained by the queue monitor 521 constitutes a pool of available worker threads 523 for the corresponding work queue 510A.
  • the worker threads 523 are programming threads configured to perform the tasks of processing the requests and producing a work result for the corresponding work queue 510.
  • When the queue monitor 521 determines that a request is stored in the data queue 525 and that a worker thread from the worker thread pool 523 is available to process the request, the queue monitor 521 reads the request from the data queue 525 and assigns the request to an available worker thread. The available worker thread is then removed from the pool of available worker threads 523 and begins processing the request. If the worker thread that is assigned the request successfully completes the processing of the request, the worker thread sends the aforementioned call back to the handler 520 to inform the handler 520 that it has successfully processed the request. The handler 520 then causes the result produced by the worker thread to be handed off, i.e., assigned, to the next work queue 510 in the work chain 500.
  • each work queue 510 has its own pool of worker threads 523.
  • The number of worker threads that are in the worker thread pool 523 is selected based on the type of task or tasks that are to be performed by the work queue 510. Therefore, work queues 510 that are expected to be longer-running work queues 510 can be defined to have larger pools of worker threads 523 than those which are expected to be shorter-running work queues 510. This feature prevents longer-running work queues 510 from slowing down the work chain 500. This feature also reduces contention between worker threads trying to obtain work.
  • the number of worker threads that are in a pool of worker threads 523 for a given work queue 510 can be easily modified by modifying the code associated with that particular work queue 510 to increase or decrease the number of worker threads that are in its worker thread pool 523. This feature eliminates the need to modify the entire work chain in order to modify a particular work queue 510.
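A per-queue worker pool of this kind can be approximated with the standard `java.util.concurrent` executors. In the sketch below, which is an assumption-laden illustration rather than the described implementation, a fixed-size `ExecutorService` plays the combined role of the data queue, the queue monitor and the worker thread pool; the `PooledQueueSketch` class and the pool sizes are hypothetical.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

final class PooledQueueSketch {
    private final ExecutorService workers;
    private final Function<String, String> task;

    // Each work queue owns its own pool, sized for the work it performs.
    PooledQueueSketch(int poolSize, Function<String, String> task) {
        this.workers = Executors.newFixedThreadPool(poolSize);
        this.task = task;
    }

    // The executor holds the request until one of this queue's threads is free.
    String submitAndWait(String request) {
        Callable<String> job = () -> task.apply(request);
        Future<String> result = workers.submit(job);
        try {
            return result.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("worker failed", e.getCause());
        }
    }

    void shutdown() {
        workers.shutdown();
    }

    public static void main(String[] args) {
        // Assumed sizes: the longer-running queue gets the larger pool.
        PooledQueueSketch fast = new PooledQueueSketch(2, s -> s + ":parsed");
        PooledQueueSketch slow = new PooledQueueSketch(8, s -> s + ":archived");
        System.out.println(slow.submitAndWait(fast.submitAndWait("req")));
        fast.shutdown();
        slow.shutdown();
    }
}
```

Changing the pool size of one queue is a one-argument change in its constructor call, which reflects the point that a single work queue can be tuned without modifying the rest of the chain.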
  • The exception monitor 522 is a programming thread that monitors the worker threads 523 to determine whether an uncaught exception caused a worker thread 523 to fail before it finished processing its request. If a worker thread 523 is processing a request when an exception occurs, and the exception is not caught by the worker thread 523 itself, the exception monitor 522 returns the failed worker thread 523 to the pool of available worker threads 523 for the given work queue 510.
  • the exception monitor 522 is useful in this regard because without it, if an exception occurs that is not caught by the worker thread 523, the Java Virtual Machine (JVM) (not shown) will detect that the uncaught exception has occurred and will then terminate the failed worker thread 523, making it unavailable to process future requests.
  • the exception monitor 522 detects the occurrence of an uncaught exception and returns the failed worker thread 523 to the worker thread pool before the JVM has an opportunity to terminate the failed worker thread 523. Returning failed worker threads 523 to the worker thread pool rather than allowing them to be terminated by the JVM increases the number of worker threads 523 that are available at any given time for processing incoming requests to the work chain 500.
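One way to obtain a similar effect in plain Java is to guard each request inside the worker's loop so that a runtime exception thrown by the request never escapes the thread. The sketch below uses that swapped-in technique; it is not the exception monitor thread described above, and the `ResilientWorkerSketch` class and its counters are hypothetical.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

final class ResilientWorkerSketch implements Runnable {
    // Sentinel request that tells the worker loop to exit.
    private static final Runnable STOP = new Runnable() {
        @Override
        public void run() { }
    };

    private final BlockingQueue<Runnable> dataQueue = new LinkedBlockingQueue<>();
    final AtomicInteger completed = new AtomicInteger();
    final AtomicInteger failed = new AtomicInteger();

    void submit(Runnable request) {
        dataQueue.add(request);
    }

    void stop() {
        dataQueue.add(STOP);
    }

    @Override
    public void run() {
        try {
            while (true) {
                Runnable request = dataQueue.take();
                if (request == STOP) {
                    return;
                }
                try {
                    request.run();
                    completed.incrementAndGet();
                } catch (RuntimeException e) {
                    // Guard standing in for the exception monitor: record the
                    // failure and keep this thread available for more work.
                    failed.incrementAndGet();
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    // Runs one failing and one succeeding request on the same thread.
    static int[] demo() {
        ResilientWorkerSketch worker = new ResilientWorkerSketch();
        Thread thread = new Thread(worker, "worker-0");
        thread.start();
        worker.submit(() -> { throw new IllegalStateException("boom"); });
        worker.submit(() -> { });
        worker.stop();
        try {
            thread.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return new int[] { worker.completed.get(), worker.failed.get() };
    }

    public static void main(String[] args) {
        int[] counts = demo();
        System.out.println("completed=" + counts[0] + " failed=" + counts[1]);
    }
}
```

The second request is processed by the same thread that failed on the first one, which is the property the exception monitor is described as preserving.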
  • the logger 524 is a programming thread that logs certain information relating to the request, such as, for example, whether an exception occurred during the processing of a request that resulted in a worker thread 523 failing before it was able to complete the processing of the request, the type of exception that occurred, the location in the code at which the exception occurred, and the state of the process at the instant in time when the exception occurred.
  • each of the work queues 510 in the work chain 500 is capable of being stopped by the handler 520.
  • the request originator sends a poison command to the work chain 500.
  • the handler 520 receives the poison command and causes an appropriate poison command to be sent to each of the work queues 510.
  • The work queue 510 sends a corresponding poison request to its own data queue 525 that causes all of the worker threads 523 of that work queue 510 to shut down.
  • the work queues 510 are GenericWorkQueue base types, but each work queue 510 may have worker threads 523 that perform functions that are different from the functions performed by the worker threads 523 of the other work queues 510.
  • All of the worker threads 523 of work queue 510A may be configured to perform a particular process, e.g., Process A, while all of the worker threads 523 of work queue 510B may be configured to perform another particular process, e.g., Process B, which is different from Process A.
  • The poison command that is needed to stop work queue 510A will typically be different from the poison command that is needed to stop work queue 510B.
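The poison mechanism resembles the well-known "poison pill" shutdown pattern for blocking queues. The following sketch assumes that pattern: each queue is given its own poison value, and stopping the queue enqueues one poison request per worker thread. The `PoisonQueueSketch` class, the string-typed requests and the poison values are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

final class PoisonQueueSketch {
    private final String poison;
    private final int workerCount;
    private final BlockingQueue<String> dataQueue = new LinkedBlockingQueue<>();
    private final List<Thread> workers = new ArrayList<>();
    final AtomicInteger processed = new AtomicInteger();

    // Each queue type gets its own poison value (an assumption of this sketch).
    PoisonQueueSketch(String poison, int workerCount) {
        this.poison = poison;
        this.workerCount = workerCount;
    }

    void start() {
        for (int i = 0; i < workerCount; i++) {
            Thread thread = new Thread(this::workLoop, poison + "-worker-" + i);
            workers.add(thread);
            thread.start();
        }
    }

    void add(String request) {
        dataQueue.add(request);
    }

    private void workLoop() {
        try {
            while (true) {
                String request = dataQueue.take();
                if (request.equals(poison)) {
                    return; // this worker shuts down
                }
                processed.incrementAndGet();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    // Enqueue one poison request per worker, then wait for all of them to exit.
    void stop() {
        for (int i = 0; i < workers.size(); i++) {
            dataQueue.add(poison);
        }
        for (Thread thread : workers) {
            try {
                thread.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
        }
    }

    public static void main(String[] args) {
        PoisonQueueSketch queueA = new PoisonQueueSketch("POISON_A", 3);
        queueA.start();
        for (int i = 0; i < 5; i++) {
            queueA.add("request-" + i);
        }
        queueA.stop();
        System.out.println("processed before shutdown: " + queueA.processed.get());
    }
}
```

Because the data queue is first-in, first-out, requests added before the poison requests are still processed before the workers exit.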
  • Fig. 5 illustrates a flowchart that represents the method performed by the work chain described above with reference to Fig. 3 in accordance with an illustrative embodiment. The method will be described with reference to Figs. 3 and 5.
  • a work request is received at an input to the work chain 500 and provided to the work queue handler 520, as indicated by block 551.
  • The work queue handler 520 selects the Jth work queue 510 from a linked list of M work queues, where M is a positive integer, to process the work request and assigns the work request to the selected work queue 510, as indicated by block 553.
  • The values of M and J are set to initial values (not shown).
  • The Jth position corresponds to the first position, position 0, in the linked list.
  • the selected work queue 510 produces a work result and notifies the work queue handler 520 that the work request has been successfully processed, as indicated by block 554.
  • Fig. 6 illustrates a flowchart that represents the method performed by the exemplary work queue 510A shown in Fig. 4 in accordance with an illustrative embodiment. The method will be described with reference to Figs. 3, 4 and 6.
  • A work request that has been assigned to the work queue 510A is received at the I/O interface 512A of the work queue 510A and stored in the data queue 525 of the work queue 510A, as indicated by block 571.
  • The queue monitor 521 of the work queue 510A determines whether or not a worker thread of the pool of worker threads 523 is available to process the work request, as indicated by block 573.
  • If a worker thread is available, the queue monitor 521 allocates the request to the available worker thread, as indicated by block 575. The worker thread then attempts to process the work request and produces a work result, as indicated by block 576. When the worker thread stops processing the work request, a determination is made by the queue monitor 521 as to whether or not the worker thread was able to successfully process the work request, as indicated by block 578. If so, the queue monitor 521 causes a notification to be sent over the I/O interface 512A of the work queue 510A to the work queue handler 520, as indicated by block 579. The worker thread is then returned to the pool of available worker threads, as indicated by block 581. The process then proceeds to block 585. If a determination is made at block 573 that a worker thread is not available to process the work request, the process also proceeds to block 585.
  • If the determination at block 578 is that the worker thread did not successfully process the work request, the process proceeds to block 583.
  • The exception monitor 522 determines whether an exception occurred during the processing of the request by the worker thread that was not caught by the worker thread. If so, the exception monitor 522 returns the worker thread to the pool of available worker threads 523, as indicated by block 584.
  • The logger 524 of the work queue 510A logs the aforementioned information relating to the processing of the work request by the work queue 510A, such as, for example, whether an exception occurred during the processing of the request, and if so, the type of exception that occurred, as indicated by block 585.
  • the work chain is typically, but not necessarily, implemented in XML code.
  • The following XML code corresponds to the client-side work chain configuration file in accordance with the embodiment referred to above in which the client-side work chain only includes the functionality corresponding to the serialization and socket generation programs that are wrapped in the client MBean 260 shown in Fig. 2.
  • the client-side work chain can be easily modified to include an audit algorithm work queue that logs information to a remote log identifying any processes that have interacted with the data being processed through the client-side work chain.
  • Such a modification may be made by adding the following audit <work queue> to the XML code listed above:
  • the rules builder program 350 shown in Fig. 2 can also be easily modified by a user by making changes to one or more portions of the server-side work chain comprising the rules builder program 350 by, for example, using the user interface 410.
  • Making the rules builder program 350 easily modifiable makes it easy to modify the JERM rules manager program 330.
  • The entire behavior of the JERM management server 310 can be modified by simply modifying XML code of the server-side work chain. Such ability enhances flexibility, ease of use, and scalability of the JERM management system 200.
  • an archiver computer software program could be added to the JERM management server 310 to perform archiving tasks, i.e., logging of metrics data.
  • a work queue similar to the audit work queue that was added above to the client-side work chain is added to the server-side work chain at a location in the work chain following the rules manager code represented by block 330 in Fig. 2.
  • the archiver work queue will have a namespace, minimum (minThreads) and maximum (MaxThreads) worker thread limits, and a timeout period (addTimeout) limit.
  • the Min and Max thread limits describe how many worker threads are to be allocated to the work queue.
  • the addTimeout limit describes the time period in milliseconds (ms) that the server 310 will wait before it stops trying to add to a full work queue. If for some reason it is later decided that the archiver work queue or another work queue is no longer needed, the work queue can easily be removed by the user via, for example, the user interface 410. For example, if the JERM system 200 is only intended to monitor, gather, and archive metrics data, the work queue of the portion of the server-side work chain corresponding to the JERM rules manager program 330 may be removed. This feature allows the vendor that provides the JERM system 200 to the enterprise customer to add functionality to the JERM system 200 by shipping one or more additional modules that plug into the client-side work chain, the server-side work chain, or both.
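To make the configuration parameters concrete, the sketch below parses a single work queue entry with the JDK's built-in DOM parser. The XML fragment is hypothetical: the element name `workQueue`, the attribute spellings and the sample values are assumptions made for illustration and are not taken from the actual configuration file.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Element;
import org.xml.sax.SAXException;

final class QueueConfigSketch {
    final String namespace;
    final int minThreads;
    final int maxThreads;
    final long addTimeoutMs;

    QueueConfigSketch(String namespace, int minThreads, int maxThreads, long addTimeoutMs) {
        if (minThreads < 1 || maxThreads < minThreads) {
            throw new IllegalArgumentException("thread limits out of order");
        }
        this.namespace = namespace;
        this.minThreads = minThreads;
        this.maxThreads = maxThreads;
        this.addTimeoutMs = addTimeoutMs;
    }

    // Reads the four limits from the attributes of the root element.
    static QueueConfigSketch parse(String xml) {
        try {
            Element root = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
                    .getDocumentElement();
            return new QueueConfigSketch(
                    root.getAttribute("namespace"),
                    Integer.parseInt(root.getAttribute("minThreads")),
                    Integer.parseInt(root.getAttribute("maxThreads")),
                    Long.parseLong(root.getAttribute("addTimeout")));
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalArgumentException("bad work queue configuration", e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical archiver entry; all names and values are made up.
        String xml = "<workQueue namespace=\"com.example.archiver\" "
                + "minThreads=\"2\" maxThreads=\"8\" addTimeout=\"500\"/>";
        QueueConfigSketch config = parse(xml);
        System.out.println(config.namespace + " threads=" + config.minThreads
                + ".." + config.maxThreads + " addTimeout=" + config.addTimeoutMs + "ms");
    }
}
```

Removing a work queue from such a chain would then amount to deleting its entry from the configuration, with no change to compiled code.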
  • The addition of such a module or modules does not affect any of the core code of the JERM system 200, but allows the customer to design and implement its own custom modules for its specific business needs.
  • the combination of all of these features makes the JERM system 200 a superior RMS over known RMSs in that the JERM system 200 has improved scalability, improved flexibility, improved response time, improved metrics monitoring granularity, and improved action taking ability over what is possible with known RMSs.
  • the JERM system 200 is capable of monitoring, gathering, and acting upon both timing metrics and call metrics, which, as described above, is generally not possible with existing RMSs. As described above, existing RMSs tend to only monitor, gather, and act upon either timing metrics or call metrics.
  • Fig. 7 is a flowchart that illustrates a method in accordance with an illustrative embodiment for performing Java enterprise resource management on the client side.
  • a server is configured to run at least one application computer software program, at least one metrics gatherer computer software program, at least one metrics serializer and socket generator computer software program implemented as a work chain 500 (Fig.
  • Fig. 8 is a flowchart that illustrates a method in accordance with an illustrative embodiment for performing Java enterprise resource management on the server side.
  • the server-side work chain performs byte stream deserialization to produce deserialized bits that represent the gathered metric, as indicated by block 621.
  • the portion of the server-side work chain that performs the JERM rules manager program analyzes the deserialized bits to determine whether a rule exists that applies to the corresponding metric, and if so, applies the applicable rule to the deserialized bits, as indicated by block 622. This decision is then output from the server-side work chain, as indicated by block 623.
  • the decision is then received by an actions manager computer software program, as indicated by block 624.
  • the actions manager program determines, based on the decision provided to it, one or more actions that are to be taken, if any, as indicated by block 625.
  • the actions manager program then sends one or more commands to one or more JERM agent programs running on one or more servers on the client side instructing the JERM agent programs to cause their respective servers to perform the corresponding action or actions, as indicated by block 626.
  • the actions may include scaling out one or more physical and/or virtual instances or scaling in one or more physical and/or virtual instances.
  • the actions may also include re-purposing or re-allocation of a physical resource.
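The path from a gathered metric to a decision and then to a command can be sketched with a simple threshold rule. Everything in this example is an assumption made for illustration: the `RuleSketch` class, the two thresholds, the `Decision` values and the command strings are hypothetical, and re-purposing and re-allocation are left out to keep the sketch short.

```java
final class RuleSketch {
    enum Decision { SCALE_OUT, SCALE_IN, NO_ACTION }

    private final double lowWaterMs;
    private final double highWaterMs;

    // Hypothetical rule: keep a timing metric between two thresholds.
    RuleSketch(double lowWaterMs, double highWaterMs) {
        if (lowWaterMs > highWaterMs) {
            throw new IllegalArgumentException("low threshold exceeds high threshold");
        }
        this.lowWaterMs = lowWaterMs;
        this.highWaterMs = highWaterMs;
    }

    // Rules manager step: apply the rule to a deserialized metric value.
    Decision apply(double observedMs) {
        if (observedMs > highWaterMs) {
            return Decision.SCALE_OUT;
        }
        if (observedMs < lowWaterMs) {
            return Decision.SCALE_IN;
        }
        return Decision.NO_ACTION;
    }

    // Actions manager step: map a decision to a command for an agent.
    static String commandFor(Decision decision) {
        switch (decision) {
            case SCALE_OUT:
                return "scale-out";
            case SCALE_IN:
                return "scale-in";
            default:
                return "none";
        }
    }

    public static void main(String[] args) {
        RuleSketch rule = new RuleSketch(50.0, 200.0);
        for (double sample : new double[] { 20.0, 120.0, 350.0 }) {
            System.out.println(sample + " ms -> " + commandFor(rule.apply(sample)));
        }
    }
}
```

Keeping the rule and the command mapping separate mirrors the split between the rules manager, which produces the decision, and the actions manager, which chooses what to do about it.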
  • the disclosed system and method are not limited with respect to the types of physical instances that may be scaled out, scaled in, re-purposed or re-allocated.
  • An example of a physical instance is a server.
  • a virtual instance may include, without limitation, an application computer software program, a JVM, or the like.
  • the disclosed system and method are not limited with respect to the types of virtual instances that may be scaled out or scaled in.
  • Virtual instances generally are not re-purposed or re-allocated, although that does not mean that the JERM system could not re-purpose or re-allocate virtual instances should a need arise to do so.
  • The computer code for implementing these algorithms is stored on some type of computer-readable medium (CRM).
  • CRM may be any type of CRM, including, but not limited to, a random access memory (RAM) device, a read-only memory (ROM) device, a programmable ROM (PROM) device, an erasable PROM (EPROM) device, a flash memory device, or other type of memory device.
  • The computer code that implements the work chain is executed in some type of processing device, such as, for example, one or more microprocessors, microcontrollers, special purpose application specific integrated circuits (ASICs), programmable logic arrays (PLAs), programmable gate arrays (PGAs), or any combination of one or more of such processing devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)
  • General Factory Administration (AREA)

Abstract

The present invention relates to a computer work chain and associated methods. The work chain comprises at least one processing device configured to perform the computer work chain, M work queues implemented in the one or more processing devices, and a work queue handler implemented in the one or more processing devices, where M is a positive integer that is greater than or equal to one. Each work queue comprises a queue monitor, an exception monitor, a pool of worker threads, a logger, and a data queue. The work queue handler forms the work chain by linking the M work queues together such that the respective outputs of the first through the (M-1)th work queues are linked to the respective inputs of the second through the Mth work queues, respectively.
PCT/US2009/069165 2008-12-22 2009-12-22 Computer work chain and a method for performing a work chain in a computer Ceased WO2010075367A2 (fr)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US12/340,844 2008-12-22
US12/340,844 US20100161715A1 (en) 2008-12-22 2008-12-22 Java enterprise resource management system and method
US12/347,032 2008-12-31
US12/347,032 US20100161719A1 (en) 2008-12-22 2008-12-31 JAVA Enterprise Resource Management System and Method
US12/502,273 2009-07-14
US12/502,504 US20100162244A1 (en) 2008-12-22 2009-07-14 Computer work chain and a method for performing a work chain in a computer
US12/502,504 2009-07-14
US12/502,273 US20100169408A1 (en) 2008-12-31 2009-07-14 Method and apparatus for implementing a work chain in a java enterprise resource management system

Publications (2)

Publication Number Publication Date
WO2010075367A2 true WO2010075367A2 (fr) 2010-07-01
WO2010075367A3 WO2010075367A3 (fr) 2010-10-07

Family

ID=42288409

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/US2009/069149 Ceased WO2010075355A2 (fr) 2008-12-22 2009-12-22 Method and apparatus for implementing a work chain in a Java enterprise resource management system
PCT/US2009/069165 Ceased WO2010075367A2 (fr) 2008-12-22 2009-12-22 Computer work chain and a method for performing a work chain in a computer

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/US2009/069149 Ceased WO2010075355A2 (fr) 2008-12-22 2009-12-22 Method and apparatus for implementing a work chain in a Java enterprise resource management system

Country Status (1)

Country Link
WO (2) WO2010075355A2 (fr)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU3356200A (en) * 1999-02-03 2000-08-25 Onesoft Corporation Targeting and profiling participants in a modular system and method for processing transactions
US6377939B1 (en) * 1999-05-04 2002-04-23 Metratech Pipelined method and apparatus for processing communication metering data
US7113942B2 (en) * 2003-06-27 2006-09-26 Microsoft Corporation Scalable storage and processing of hierarchical documents
US20080091806A1 (en) * 2006-10-11 2008-04-17 Jinmei Shen Dynamic On-Demand Clustering

Also Published As

Publication number Publication date
WO2010075367A3 (fr) 2010-10-07
WO2010075355A3 (fr) 2010-10-07
WO2010075355A2 (fr) 2010-07-01

Similar Documents

Publication Publication Date Title
US8112751B2 (en) Executing tasks through multiple processors that process different portions of a replicable task
US8909698B2 (en) Grid-enabled, service-oriented architecture for enabling high-speed computing applications
US6074427A (en) Apparatus and method for simulating multiple nodes on a single machine
US20110252137A1 (en) Systems and Methods for Dynamically Provisioning Cloud Computing Resources
US20100162244A1 (en) Computer work chain and a method for performing a work chain in a computer
EP3087483A1 (fr) System and method for supporting asynchronous invocation in a distributed data grid
US6922796B1 (en) Method and apparatus for performing failure recovery in a Java platform
CN111736809A (zh) Distributed robot cluster network management framework and implementation method thereof
US20100169408A1 (en) Method and apparatus for implementing a work chain in a java enterprise resource management system
JP5056464B2 (ja) Process monitoring method, information processing apparatus, and program
US20100161719A1 (en) JAVA Enterprise Resource Management System and Method
US6480879B1 (en) Framework for providing quality of service requirements in a distributed object-oriented computer system
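CN111274018A (zh) Distributed training method based on a DL framework
CN111031123A (zh) Spark task submission method, system, client and server
WO2010075367A2 (fr) Computer work chain and a method for performing a work chain in a computer
CN115242786A (zh) Multi-mode big data job scheduling system and method based on a container cluster
CN114095514B (zh) Database access method and system
CN114564340A (zh) High-availability method for distributed software of an aerospace ground system
CN118819825A (zh) Concurrent processing method, concurrency control system, electronic device and storage medium
CN113742073B (zh) Cluster control method based on an LSB interface
CN116643874A (zh) Middleware optimization method, apparatus, device and medium under a NUMA architecture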
Rao et al. Analysis and implementation of a parallel computing cluster for solving computational problems in data analytics
CN109525651B (zh) Method and system for dynamically adding application data nodes
US20120054767A1 (en) Recording medium for resource management program, resource management device, and resource management method
Johnson An Implementation Study of the Impact

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09835748

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09835748

Country of ref document: EP

Kind code of ref document: A2