[go: up one dir, main page]

GB2323944A - Parallel data processing system - Google Patents

Parallel data processing system Download PDF

Info

Publication number
GB2323944A
GB2323944A GB9706549A GB9706549A GB2323944A GB 2323944 A GB2323944 A GB 2323944A GB 9706549 A GB9706549 A GB 9706549A GB 9706549 A GB9706549 A GB 9706549A GB 2323944 A GB2323944 A GB 2323944A
Authority
GB
United Kingdom
Prior art keywords
message
node
nodes
message handling
procedure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB9706549A
Other versions
GB9706549D0 (en
Inventor
Colin Leonard Bird
Graham Derek Wallis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to GB9706549A priority Critical patent/GB2323944A/en
Publication of GB9706549D0 publication Critical patent/GB9706549D0/en
Priority to US08/906,685 priority patent/US5905904A/en
Priority to TW086113982A priority patent/TW491979B/en
Priority to KR1019980003047A priority patent/KR100300899B1/en
Priority to SG9800524A priority patent/SG83676A1/en
Priority to DE69840286T priority patent/DE69840286D1/en
Priority to EP98301952A priority patent/EP0871118B1/en
Priority to JP10084705A priority patent/JP3090641B2/en
Publication of GB2323944A publication Critical patent/GB2323944A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/22Microcontrol or microprogram arrangements
    • G06F9/28Enhancement of operational speed, e.g. by using several microcontrol devices operating in parallel
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/546Message passing systems or structures, e.g. queues

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multi Processors (AREA)

Abstract

A parallel processing system having a number of processing nodes (S1...Sn) each of which is provided with a message handling kernel (13...n3) and an associated procedure register (14...n4), the procedure registers being loadable by loading means (4) under the control of an application (3) and the nodes being able to exchange messages over a message interface (2) and to process messages to determine the message handling procedures to be invoked by the associated node in accordance with the contents of the associated procedure register.

Description

2323944 1 PARALLEL DATA PROCESSING SYSTEM AND METHOD OF CONTROLLING SUCH A
SYSTEM
Field of the Invention
This invention relates generally to data processing systems and more particularly to systems wherein a number of processing units are organised to process data simultaneously. Such systems, referred to as parallel processing systems, are typically used to process large quantities of data more rapidly than would be the case using a single processor, usually with the additional advantage of enhanced reliability.
Backclround of the Invention while these advantages can readily be achieved where sets of data to be processed by the processing units or nodes are independent, a problem arises when the result of processing in one or more nodes affects the processing operations taking place in another node or nodes.
If, for example, code is written to perform in parallel what has hitherto been an essentially serial operation, the dependencies between the various parallel operations may become extremely complex, to the point where the analysis of potential conflict can be prohibitively difficult. The code may require a great deal of testing and corrective procedures in order to deal with the synchronising messages which are necessarily exchanged between parallel strands of an application program.
Such a situation can arise in the analysis of the substantial quantities of data, extending to many millions of records, obtained in commercial activities such as insurance and retailing. Speedy analysis is essential if marketing strategy is to be effectively and flexibly matched to customer needs and preferences, and a number of processes have been developed to recognise marketing trends in order to respond effectively thereto.
In one such process, referred to as data mining, records are analysed serially to develop a model which can be used for predicting trends. The model is dynamically updated as the analysis proceeds, and each processing step is required to be taken in accordance with the model that is current. Data mining has accordingly been traditionally performed as a serial, single processor operation. This is necessarily 2 a computationally intensive process, and considerable advantage could be gained by the application of parallel processing techniques to processes such as data mining.
Summary of the Invention
In accordance with the invention there is provided a parallel processing system comprising a plurality of processing nodes each adapted to process data under the control of an application programme, in which each node has associated therewith a message handling kernel function and a procedure register, said system including a message interface interconnecting the message handling kernels associated with said nodes and means for loading message handling procedures selectively into the procedure registers of the various nodes, each message handling kernel being adapted to respond to messages applied to said message interface during execution of said application programme to control its associated node in accordance with message handling procedures selected from the associated procedure register.
There is further provided a method of controlling a parallel processing system having a plurality of processing nodes comprising the steps of providing, for each node, a message handling kernel and a procedure register, loading selectively into the procedure registers of each node message handling procedures to be invoked by the message handling kernel associated with that node and controlling the node in accordance with message handling procedures selected from the associated procedure register under the control of messages received from another node or nodes in the system.
in a preferred embodiment of the invention the system is configured to define a master-slave relationship, one of the nodes being designated as a master node in which the application programme and control thereof resides while the others operate as slave nodes responding to messages originating from the master. The procedure registers associated with each slave node are loaded initially under the control of the master node and may be updated from time to time as the application proceeds.
In another aspect of the invention, there is provided a computer program product comprising a computer program recorded on a recording medium for use in controlling execution of application programs on a 3 parallel processing system having a plurality of processing nodes, said computer program comprising: means for providing, for each of said nodes, a message handling kernel and a procedure register; means for loading selectively into the procedure registers of each node message handling procedures to be invoked by the message handling kernel provided for that node; wherein each message handling kernel is adapted to control its associated node in accordance with message handling procedures selected from the associated procedure register.
A significant advantage of the message handling kernel is its ability correctly to process any message type as and when it arrives without requiring undesirable complexity within the application itself. Since the structure of an application program for a parallel system can be greatly simplified if a message handling solution according to the invention is used, application programs for parallel systems which are developed for use with the present invention are likely to be inherently more reliable and more easily maintainable than would be possible without the present invention.
Brief DescrijDtion of the Drawinqs FIG 1 shows a parallel processing system embodying the invention.
FIGS 2 is a table showing in simplified form a group of records which may be processed using the system of Fig. 1.
FIG 3 is a flow chart showing the steps performed by the message handling kernel.
FIG 4 shows the structure of a typical message and the manner in which it is interpreted at a processing node.
Detailed Description of the Preferred Embodiment
Referring first to Fig. 1 a parallel processing system embodying the invention comprises a number of processing nodes S1,S2... Sn each having an associated processor 11, 21,....nl for operating on records stored in a storage unit 12, 22,....n2. A software implemented message handling kernel function 13, 23,....n3 is defined within each of the processing 4 nodes and has associated with it a procedure register 14,24 n4. A bi-directional message interface, 2, interconnects the message handling kernels in all nodes and an application controller, 3 is arranged to provide procedures over an interface, 4, for initialising the various procedure registers 14.... n4 and if necessary updating these registers as the application proceeds. In the preferred embodiment, the procedure registers in the various nodes are loaded on a selective basis with message handling procedures appropriate to the particular application.
in the preferred embodiment node 1 is designated as a master node while the remaining nodes are designated slaves. Thus the application programme and its control reside in the master node 1, and the slaves respond to messages from the master node. while the system may be implemented using various computer platforms the preferred embodiment employs a parallel array of IBM RISC System/6000 processors.(RISC System/6000 is a trademark of IBM Corporation).
A simplified group of records of the type to which the system is applicable is shown in Fig. 2. which represents a large database of records developed by an insurance company from customer data. The purpose of a data mining operation using such data may be for example to identify groups of people whose records have certain characteristics within a range of similarity so as to develop categories to which common terms and conditions of insurance can be offered. This process is referred to as segmentation.
In operation, the system is loaded with the records in such a way as to effect a reasonably even distribution of records between the various nodes. As shown by the letters associated with the storage units 12.... n2 in Fig. 1, this may be done by allocating records in stripes or subsets, or other approaches may be followed depending on the needs of the application. Running the application then involves first the development of statistics by all nodes with subsequent aggregation by the master node, followed by the construction of an initial model from a sample of records, using those statistics.
The master node starts the development of the initial model by issuing a trigger message, following which all nodes send their contribution of records from their associated storage units. on completion of the development of the initial model, the master node distributes it to the slaves.
There then follows a series of passes using all nodes. Each pass is a complete scan through the data and within a given pass, there are a number of phases. During a phase the models on individual nodes would diverge if permitted to do so, so the changes are accumulated and the master node consolidates them at the end of each phase and distributes the latest model state to all the slaves. The initiation of passes and phases is controlled by trigger messages sent by the master node. At the end of a given pass the master node may determine that the application should be terminated, for example if it detects that the model is sufficiently stable to meet the requirements of the application, and issues a termination message.
AS the application proceeds, trigger messages are used to re-synchronise the nodes. In this connection it will be appreciated that a typical parallel processing system may well be running a number of applications and that at any one time a node may be performing other tasks in other partitions which may make it unavailable to process a request immediately. Information reflecting the status and availability of the node can be determined by the master via the message interface 2. In another embodiment, a message indicating the status and availability of the node is returned to the master via the message interface.
when the master node receives an indication that a slave node is available it sends a message requesting further processing. The type of processing required is defined by message handling procedures loaded into the procedure register associated with the node in question and the message includes an identifier ID which enables the message handling kernel to select the appropriate procedure.
Typically two or three passes are required to achieve the required degree of stability. The master node then issues a termination message. Any slave node recognising such a termination message must respond by performing the appropriate termination procedure at whatever point it has reached in the processing of data.
It should be noted that a significant advantage of the message handling kernel is its ability correctly to process any message type as 6 and when it arrives without requiring undesirable complexity within the application itself.
The system of Fig. 1 provides:
1. A message handling kernel which is independent of the parallel processing application; 2. A method of providing the kernel with a list of the message types that the application requires, but with no knowledge of the content of each type; 3. A message structure which enables the kernel to determine which message handling procedure to invoke, without being aware of the content; and 4. A registration mechanism (the procedure register) for the application to provide message handling procedures for the message types it has told the kernel it requires.
This is illustrated in Fig. 3, which shows the sequence of events for processing messages, and Fig. 4, which depicts a typical message structure. Before any messages are sent the application controller provides to the various procedure registers message handling procedures for each of the message types it intends the associated node to handle.
A typical message is intended to trigger a specific action, such as a message sent from the master to each of the slaves when requesting data. Referring to Fig. 4., the message structure includes an indicator portion 41 and a body portion 42. The indicator portion contains a value for the length of the information in the body portion, a message length of zero indicating that the message is a trigger with no body portion. The message is inspected by the kernel to select the message handling procedure appropriate for the ID included in the message.
A second typical message is a slave's response to such a trigger, asking for data. Having assembled the data for the reply, the slave sends a message to the master which includes a body portion. The kernel associated with the master, on receiving this reply, goes on to read the body portion and passes it on to the message handling procedure which is 7 registered to deal with the message ID contained in the indicator portion.
The function performed by the various message handling kernels and their associated procedure registers can thus be likened to that of a funnel through which all messages on the interface 2 are passed. This is illustrated in flow chart form in Fig. 3.
Referring to Fig. 3. an incoming message on message interface 2 is received at step 31 and a test is made at step 32 to determine whether the message type is in the list of types required by the current application. If it is not, a warning is issued at 33 and the message is purged at 34. At step 35 a further test is made to determine whether a message handling procedure for the particular message received is registered in the associated procedure register 14, 24,-n4. If no message handling procedure is registered a flag is raised at 36 indicating that the message cannot be handled and the message is also purged at step 34. If the appropriate message handling procedure is found a test is made to determine whether the message is a trigger message as discussed above with a message length of zero or whether data is also carried by the message. In the latter case the message body is received at 38 for use in further processing at the node. if the message is a trigger message this step is bypassed and in both cases the message handling procedure is executed at step 39 after which the message is purged at 34 and the message handling kernel returns to the waiting mode for the reception of further messages.
In the preferred system the message handling procedures are implemented as user objects responding to triggers and data messages transmitted by message objects generated in the nodes.
In the present embodiment, a single message handling procedure is registered in the procedure register for each message type permitting the application logic sequence to be set out entirely within the message handling procedure. This permits easy maintainability since the logic resides in one place only. However a sequence of message handling procedures can be registered for each message type representing the steps that an application will pass through in response to that message type. This makes the application code more concise since the complexity of sequencing is built into the message handling kernel, providing a generic re-usable message handling process.
8 while there has been described a parallel processing system organised on a master-slave basis with the application and application controller provided within the master node, it will be appreciated that the techniques described are equally applicable to an arrangement where there is no one node in control. In such an organisation, referred to as a "peer-to-peern organisation, all nodes communicate with each other, each responding to messages according to their current state and to message handling procedures registered registers. The techniques described are arrangement in which one of the nodes is and running of the application.
in the various procedure similarly applicable to an solely dedicated to the control Furthermore, although the embodiment of Figure 1 has been described in the context of the process of data mining and segmentation, it will be appreciated by those skilled in the art that the techniques described are applicable generally in the field of parallel data processing.
9

Claims (11)

1. A parallel data processing system comprising a plurality of processing nodes (S1,_Sn) each adapted to process data under the control of an application program (3), in which each node has associated therewith a message handling kernel (13,_n3) and a procedure register (14---.n4), said system including a message interface (2) interconnecting the message handling kernels associated with said nodes and means (4) for loading message handling procedures selectively into the procedure registers of the various nodes, each message handling kernel being adapted to respond to messages applied to said message interface during execution of said application program to control its associated node in accordance with message handling procedures selected from the associated procedure register.
2. A system as claimed in claim 1 in which each message handling kernel is further adapted to deliver to said message interface messages reporting the progress of message handling procedures selected from said associated procedure register.
3. A system as claimed in claim 1 or claim 2 in which each message includes a type identifier and has one or more message handling procedures associated therewith according to its type.
4. A system as claimed in any preceding claim, configured to define a master-slave relationship, one of the nodes being designated as a master node in which the application programme and control thereof resides while the remaining nodes operate as slave nodes responding to messages originating from the master node.
5. A system as claimed in claim 4 in which the procedure registers associated with each slave node are loaded initially from an application residing within the master node and are updated as the application proceeds.
6. A method of controlling a parallel processing system having a plurality of processing nodes comprising the steps of providing, for each node, a message handling kernel and a procedure register, loading selectively into the procedure registers of each node message handling procedures to be invoked by the message handling kernel associated with that node and controlling the node in accordance with message handling procedures selected from the associated procedure register under the control of messages received from another node or nodes in the system.
7. A method as claimed in claim 6 including the further steps of identifying each message by means of a type identifier, and providing in each procedure register one or more message handling procedures for each message so identified.
8. A method as claimed in claim 6 or claim 7 including the further step of designating one of said processing nodes as a master node and the remaining nodes as slave nodes.
9. A method as claimed in claim 8 including the further steps of loading the procedure registers associated with each slave node initially from an application residing within the master node, and subsequently updating the procedure registers associated with the slave nodes as the application program proceeds.
10. A parallel data processing system, substantially as described with reference to the accompanying drawings.
11. A method of controlling a parallel data processing system, substantially as described with reference to the accompanying drawings.
GB9706549A 1997-01-04 1997-04-01 Parallel data processing system Withdrawn GB2323944A (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
GB9706549A GB2323944A (en) 1997-04-01 1997-04-01 Parallel data processing system
US08/906,685 US5905904A (en) 1997-01-04 1997-08-05 Parallel data processing system and method of controlling such a system
TW086113982A TW491979B (en) 1997-04-01 1997-09-25 Parallel data processing system and method of controlling such a system
KR1019980003047A KR100300899B1 (en) 1997-04-01 1998-02-04 Parallel data processing system and method of controlling such a system
SG9800524A SG83676A1 (en) 1997-04-01 1998-03-10 Parallel data processing system and method of controlling such a system
DE69840286T DE69840286D1 (en) 1997-04-01 1998-03-16 Parallel data processing system and control method for such a system
EP98301952A EP0871118B1 (en) 1997-04-01 1998-03-16 Parallel data processing system and method of controlling such a system
JP10084705A JP3090641B2 (en) 1997-04-01 1998-03-30 Parallel data processing system and control method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB9706549A GB2323944A (en) 1997-04-01 1997-04-01 Parallel data processing system

Publications (2)

Publication Number Publication Date
GB9706549D0 GB9706549D0 (en) 1997-05-21
GB2323944A true GB2323944A (en) 1998-10-07

Family

ID=10810076

Family Applications (1)

Application Number Title Priority Date Filing Date
GB9706549A Withdrawn GB2323944A (en) 1997-01-04 1997-04-01 Parallel data processing system

Country Status (8)

Country Link
US (1) US5905904A (en)
EP (1) EP0871118B1 (en)
JP (1) JP3090641B2 (en)
KR (1) KR100300899B1 (en)
DE (1) DE69840286D1 (en)
GB (1) GB2323944A (en)
SG (1) SG83676A1 (en)
TW (1) TW491979B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6409567B1 (en) 1997-12-15 2002-06-25 E.I. Du Pont De Nemours And Company Past-deposited carbon electron emitters
US7240059B2 (en) * 2002-11-14 2007-07-03 Seisint, Inc. System and method for configuring a parallel-processing database system
US6968335B2 (en) * 2002-11-14 2005-11-22 Sesint, Inc. Method and system for parallel processing of database queries
US7657540B1 (en) 2003-02-04 2010-02-02 Seisint, Inc. Method and system for linking and delinking data records
US20090187928A1 (en) * 2007-10-22 2009-07-23 Rextide Inc. Primary Application Advertising Layer
US8266168B2 (en) * 2008-04-24 2012-09-11 Lexisnexis Risk & Information Analytics Group Inc. Database systems and methods for linking records and entity representations with sufficiently high confidence
US8639691B2 (en) 2008-07-02 2014-01-28 Lexisnexis Risk Solutions Fl Inc. System for and method of partitioning match templates
US9411859B2 (en) 2009-12-14 2016-08-09 Lexisnexis Risk Solutions Fl Inc External linking based on hierarchical level weightings
US9189505B2 (en) 2010-08-09 2015-11-17 Lexisnexis Risk Data Management, Inc. System of and method for entity representation splitting without the need for human interaction
JP5949506B2 (en) * 2012-11-30 2016-07-06 富士通株式会社 Distributed processing method, information processing apparatus, and program

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB1496779A (en) * 1973-12-18 1978-01-05 Honeywell Inf Systems Microprogrammed processor
US4882674A (en) * 1985-03-05 1989-11-21 Wang Laboratories, Inc. Apparatus and method for control of one computer system by another computer system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4654654A (en) * 1983-02-07 1987-03-31 At&T Bell Laboratories Data network acknowledgement arrangement
US4970658A (en) * 1989-02-16 1990-11-13 Tesseract Corporation Knowledge engineering tool
US5339430A (en) * 1992-07-01 1994-08-16 Telefonaktiebolaget L M Ericsson System for dynamic run-time binding of software modules in a computer system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB1496779A (en) * 1973-12-18 1978-01-05 Honeywell Inf Systems Microprogrammed processor
US4882674A (en) * 1985-03-05 1989-11-21 Wang Laboratories, Inc. Apparatus and method for control of one computer system by another computer system

Also Published As

Publication number Publication date
DE69840286D1 (en) 2009-01-15
TW491979B (en) 2002-06-21
KR100300899B1 (en) 2001-09-06
SG83676A1 (en) 2001-10-16
US5905904A (en) 1999-05-18
JP3090641B2 (en) 2000-09-25
GB9706549D0 (en) 1997-05-21
EP0871118A1 (en) 1998-10-14
EP0871118B1 (en) 2008-12-03
KR19980079668A (en) 1998-11-25
JPH10334066A (en) 1998-12-18

Similar Documents

Publication Publication Date Title
US5787019A (en) System and method for handling dynamic changes in device states
US5630049A (en) Method and apparatus for testing software on a computer network
Tsichritzis et al. Operating systems
US4123795A (en) Control system for a stored program multiprocessor computer
US5748959A (en) Method of conducting asynchronous distributed collective operations
US5442785A (en) Method and apparatus for passing messages between application programs on host processors coupled to a record lock processor
US5745679A (en) Method and device for file transfer by cascade release
EP0855056B1 (en) Object-oriented method maintenance mechanism that does not require cessation of the computer system
KR20020079849A (en) System and method for rapid completion of data processing tasks distributed on a network
JPH03103963A (en) Apparatus and method for assigning task to a plurality of processors
AU2001241453A1 (en) System and method for rapid completion of data processing tasks distributed on a network
US5905904A (en) Parallel data processing system and method of controlling such a system
US5485573A (en) Method and apparatus for assisting in the determination of the source of errors in a multi-host data base management system
AU603876B2 (en) Multiple i/o bus virtual broadcast of programmed i/o instructions
US20200310828A1 (en) Method, function manager and arrangement for handling function calls
US5613133A (en) Microcode loading with continued program execution
CN112749062A (en) Server program monitoring method and device, computer equipment and storage medium
US5862340A (en) Method operating in each node of a computer system providing and utilizing special records for collective communication commands to increase work efficiency at each node
US5926635A (en) Method for reducing complexities when communicating between tasks
US7272712B1 (en) Data structure and method for managing modules associated with a kernel
US6012149A (en) Computer system with polymorphic fault processing
CN115495484B (en) Method, device and medium for data return
CN118964370A (en) An operating system, method, terminal device and storage medium for database files
Harvey et al. Comparing inter-tool communication in control-centred tool integration frameworks
US7536434B1 (en) Global dynamic persistent information architecture

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)