[go: up one dir, main page]

CN103929320A - An Integrated Platform for IT System Disaster Recovery - Google Patents

An Integrated Platform for IT System Disaster Recovery Download PDF

Info

Publication number
CN103929320A
CN103929320A CN201310013623.8A CN201310013623A CN103929320A CN 103929320 A CN103929320 A CN 103929320A CN 201310013623 A CN201310013623 A CN 201310013623A CN 103929320 A CN103929320 A CN 103929320A
Authority
CN
China
Prior art keywords
module
business
data
integrated platform
recovery
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310013623.8A
Other languages
Chinese (zh)
Inventor
戚跃民
郝建明
伍福生
简超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd filed Critical China Unionpay Co Ltd
Priority to CN201310013623.8A priority Critical patent/CN103929320A/en
Priority to PCT/CN2014/070331 priority patent/WO2014110994A1/en
Publication of CN103929320A publication Critical patent/CN103929320A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明涉及一种针对IT系统灾难恢复的集成平台,用于对异地分布式IT系统进行集中管理,该集成平台包括:系统管理模块(100)、系统通信模块(200)、数据同步模块(300)、数据比较分析模块(400)、数据存储模块(500)、业务流程模块(600)、业务恢复模块(700)、安全审计模块(800)。根据本发明的针对IT系统灾难恢复的集成平台,能够实现各个业务主机之间的实时通信,能够将数据恢复和业务切换统一到业务流程中,因此,本发明能够提供一种在异地分布式IT系统发生灾难时将硬件恢复、数据恢复、业务恢复三者有效结合的针对IT系统灾难恢复的集成平台。

The invention relates to an integrated platform for IT system disaster recovery, which is used for centralized management of remote distributed IT systems. The integrated platform includes: a system management module (100), a system communication module (200), and a data synchronization module (300 ), a data comparison and analysis module (400), a data storage module (500), a business process module (600), a business recovery module (700), and a security audit module (800). According to the integrated platform for IT system disaster recovery of the present invention, real-time communication between various business hosts can be realized, and data recovery and business switching can be unified into the business process. Therefore, the present invention can provide a remote distributed IT An integrated platform for IT system disaster recovery that effectively combines hardware recovery, data recovery, and business recovery when a system disaster occurs.

Description

A kind of integrated platform for IT system disaster recovery
Technical field
The present invention relates to the disaster tolerance technology in network service, particularly, relate to the integrated platform that can carry out data recovery and keep data consistency in the situation that of IT system generation disaster.
Background technology
Along with the arrival of information age, data become the core of social normal operation more and more highlightedly.For Yi Ge enterprise, data affect the key of its survival and development especially, the user of every profession and trade and enterprise are day by day strong to the dependence of network application and data message, and outburst disaster can cause significant impact to the data of whole enterprise and business production as fire, flood, earthquake or terrorist incident etc.Therefore, how to guarantee that business data is not lost when disaster occurs, assurance system service is resumed operation as early as possible, becomes one of focus of people's concern, and therefore, disaster tolerance technology becomes the focus that industry-by-industry is paid close attention to day by day.
Disaster tolerance is generally divided into three ranks the degree ensureing: data level, system-level, service level.
The focus of data redundancy is data, and disaster can guarantee after occurring that the original data of user can not lose or be destroyed.Data redundancy is different from backup, and it requires the backup of data to be saved in strange land.
System-level disaster tolerance is carrying out application, to process and can (service server district) copy portion again on the basis of data redundancy, that is to say, in a set of support system of the same structure of backup site, system-level disaster tolerance can provide continual application service, and the service request that user applies can be continued pellucidly operation and not be subject to the impact that disaster occurs.
Data redundancy and system-level disaster tolerance are all within IT category, yet for regular traffic, only the guarantee of IT system is also not enough, and some user need to build the service level disaster tolerance of highest level.
In prior art, when large-scale strange land distributed system, in this locality, disaster occurs, focus on that hardware (system) level restoration, data level recover or service level is recovered, but lack a kind of integrated platform that three is effectively combined.
 
Summary of the invention
In view of the above problems, the present invention aims to provide and a kind ofly can effectively control and can keep the integrated platform for IT system disaster recovery of the consistency of data and the continuation of business to the business host implementation in the distributed system of strange land.
Integrated platform for IT system disaster recovery of the present invention can effectively solve the concentration problem of each main frame in the distributed I T system of strange land, and to business, host implementation is effectively controlled, and by operation flow function flexibly, ensures operational sustainability.The integrated platform of one-tenth IT system of the present invention disaster recovery realized and each business main frame between real time communication, data are recovered and business is switched and unified in operation flow.
Integrated platform for IT system disaster recovery of the present invention, for strange land distributed I T system is managed concentratedly, this strange land distributed I T system possesses the Liang Tai administrative center of a plurality of local service main frames, a plurality of cross regional business main frame, management local service main frame and cross regional business main frame, and this integrated platform comprises:
System management module, for monitoring in real time and manage described strange land distributed I T system, so that the information that described administrative center can each business main frame of Real-time Obtaining;
System communication module, be deployed in described each local service main frame, each cross regional business main frame, administrative center, and for realize between communicating by letter between each local service main frame and administrative center, each cross regional business main frame and administrative center communicate by letter and administrative center between communication;
Data simultaneous module, for realizing the real time data synchronization of described strange land distributed I T system;
Data comparative analysis module, for realizing the consistency checking of described strange land distributed I T system data;
Data memory module, for realizing the data storage of described strange land distributed I T system;
Operation flow module, for realizing all kinds of operation flows of described strange land distributed I T system;
Business recovery module, for realizing the adapter of cross regional business flow process the described strange land distributed I T system generation disaster in the situation that;
Security audit module, is encrypted, deciphers for reception and the transmission of the message between each business main frame He Ge administrative center.
Preferably, described system management module, described data simultaneous module, described data comparative analysis module, described data memory module, described operation flow module, described business recovery module, described security audit module are all associated with described system communication module, described data simultaneous module, described data comparative analysis module, described data memory module, described operation flow module all with described business recovery module relation.
Preferably, described system communication module is for realizing information receiving and transmitting, message parse, command execution, the result feedback between each business main frame He Ge administrative center.
Preferably, between described each administrative center and each unit module in described each business main frame, by WTC interface, being connected Tuxedo/Q serves.
Preferably, described security audit module is encrypted by using WSL to insert message the sending and receiving of the message between the unit module in administrative center and business main frame.
Preferably, the generation that described security audit module is inserted use-z in the process of message by WSL is carried out rsa and is encrypted and arrange.
Preferably, described data comparative analysis module can be carried out data location and analysis and carry out accordingly data and remedy according to the otherness of data.
Preferably, described business recovery module comprises: for starting the adapting system application in strange land and the first submodule of database; For obtaining the second submodule of the time of disaster switching; The 3rd submodule switching for carrying out network.
Preferably, described data simultaneous module synchronously copies with the data that realize between local service system for disposing mirrored storage in local service system.
Preferably, described data simultaneous module is for realizing the data asynchronous replication between local mirrored storage and strange land storage.
Preferably, described data simultaneous module also for covering local data base by strange land data after local system recovery business.
Preferably, described Liang Tai administrative center is that function is identical and backup each other.
Preferably, described administrative center adopts the authentication of ldap server mirror image.
The technical problem that the present invention mainly solves is as follows: how (1) is to the centralized management of strange land distributed system and control; (2) how to realize the fastext switching of strange land distributed system, guarantee the continuation of business when disaster occurs; (3) how to realize the automatic processing of business; (4) how to monitor the state of controlled end; (5) how to compare the consistency of two places Service Database.
For above-mentioned technical problem (1), the technological means adopting is: the information receiving and transmitting mechanism by between administrative center and each controlled end (used Tuxedo /queue of Q reliable news), realizes the control to all business main frames.
For above-mentioned technical problem (2), the technological means adopting is: for every suit operation system of operation, set up a corresponding strange land and switch and switchback flow process, and remedying module with data combines, the sustainability of business when the former guarantees disaster generation, the integrality of data when the latter guarantees disaster generation.
For above-mentioned technical problem (3), the technological means adopting is: because administrative center can realize the management of all business main frames and control, so the daily operation of operation system can realize automation, by administrative center, send the fixedly job instruction of flow process and realize; For more specific business demands, also can be realized by a set of arbitrary procedure of business personnel oneself definition in addition, its mode is versatile and flexible.
For above-mentioned technical problem (4), the technological means adopting is: in Liang Tai administrative center, dispose respectively the WSL service of corresponding tuxedo, simultaneously on every controlled end, WSNADDR environmental variance is set, the value of environmental variance is that the WSL of tuxedo service end issue serves (ip address: port numbers), be connected to tuxedo service end for tuxedo client-side program (controlled application end program), corresponding address.If connection failure, interval, after 30 seconds, reconnects.Meanwhile, controlled end can regularly send heartbeat message to administrative center, and administrative center judges that whether the state of controlled end is normal accordingly.
For above-mentioned technical problem (5), the technological means adopting is: pass through data comparison module, can compare any table in the Service Database of two places or a table set (multiple tables), manner of comparison is various, has 1] comparison to table record number; 2] comparison of some field in his-and-hers watches; 3] his-and-hers watches carry out the comparison of MD5 algorithm; By these data manner of comparison, whether unanimously can accurately find out local and remote side Service Database, inconsistently where can tell its otherness of user.
In sum, integrated platform for IT system disaster recovery of the present invention can be realized the real time communication between each business main frame, can data recovery and business be switched unified in operation flow, therefore, the present invention can provide a kind of and when strange land distributed I T system generation disaster, hardware recovery, data are recovered, the integrated platform for IT system disaster recovery of the effective combination of business recovery three.
 
Accompanying drawing explanation
Fig. 1 means that the integrated platform for IT system disaster recovery of the present invention manages the organigram of strange land distributed I T system concentratedly.
Fig. 2 means the organigram of the integrated platform for IT system disaster recovery of the present invention.
Fig. 3 means that the data that the integrated platform for IT system disaster recovery of the present invention carries out are stored, the schematic diagram of data synchronization processing.
Fig. 4 means that the unit module under the management of the integrated platform for IT system disaster recovery of the present invention is the handling process of controlled end.
 
Embodiment
What introduce below is some in a plurality of embodiment of the present invention, aims to provide basic understanding of the present invention.Be not intended to confirm key of the present invention or conclusive key element or limit claimed scope.
Fig. 1 means that the integrated platform for IT system disaster recovery of the present invention manages the organigram of strange land distributed I T system concentratedly.As shown in Figure 1, this strange land distributed I T system possesses a plurality of local service main frames and (in this locality, possesses business main frame 1, business main frame 2, business main frame 3 ... ..), a plurality of cross regional business main frames (possess business main frame 4, business main frame 5, business main frame 6 in strange land ... ..), the Liang Tai administrative center of management local service main frame and cross regional business main frame.Local service main frame, cross regional business main frame and Liang Tai administrative center are associated by communication line.Wherein, Liang Tai administrative center management function identical, sealed each other.This strange land distributed I T system comprises above-mentioned all business main frames and administrative center." controlled end " that will mention is in the present invention the module being deployed on all business main frames.Like this concerning of the present invention for the integrated platform of IT system disaster recovery, the operation that all business main frames can be accepted to be correlated with from the instruction of administrative center (therefore, here also can by the main frame of having disposed this unit module referred to as " controlled end ").
Fig. 2 means the organigram of the integrated platform for IT system disaster recovery of the present invention.
As shown in Figure 2, the integrated platform for IT system disaster recovery of the present invention comprises: for monitoring in real time and manage described strange land distributed I T system so that described administrative center can each business main frame of Real-time Obtaining the system management module 100 of information; Be deployed in described each local service main frame, each cross regional business main frame, administrative center and for realize between communicating by letter between each local service main frame and administrative center, each cross regional business main frame and administrative center communicate by letter and administrative center between the system communication module 200 of communication; For realizing the data simultaneous module 300 of the real time data synchronization of described strange land distributed I T system; For realizing the data comparative analysis module 400 of the consistency checking of described strange land distributed I T system data; For realizing the data memory module 500 of the data storage of described strange land distributed I T system; For realizing the operation flow module 600 of all kinds of operation flows of described strange land distributed I T system; For realize the business recovery module 700 of the adapter of cross regional business flow process in the situation that of described strange land distributed I T system generation disaster; For the security audit module 800 that reception and the transmission of the message between each business main frame He Ge administrative center are encrypted, are deciphered.
System management module 100, data simultaneous module 300, data comparative analysis module 400, data memory module 500, operation flow module 600, business recovery module 700, security audit module 800 are all associated with system communication module 200.Data simultaneous module 300, data comparative analysis module 400, data memory module 500, operation flow module 600 are all associated with business recovery module (700).
This strange land distributed I T system can be monitored and manage to system management module 100 in real time, by the reliable message communication mechanism between each associative cell module and administrative center in system, makes the information that described administrative center can each business main frame of Real-time Obtaining.
System communication module 200 is deployed in intrasystem each unit module of this strange land distributed I T and administrative center, and for real-time messages transmitting-receiving, message parse, command execution, result feedback, be the basis of realizing IT system disaster recovery.Between administrative center and each unit module, by WTC interface, connect Tuxedo/Q service.The WEB of WTCShi BEA company supports the fastening means between product Weblogic and middleware product Tuxedo, full name Weblogic Tuxedo Connector.WTC makes between Weblogic and Tuxedo, to have two-way access ability, the middleware product of Tuxedo Ye Shi BEA company, Tuxedo/Q parts can be realized in reliable mode, it allows message to be stored in lasting medium after queuing up, if disk or non-lasting medium are as in internal memory, so that for later.In the present invention, administrative center has disposed Weblogic platform and java application, each unit module (being each controlled end) deploy Tuxedo /queue of Q reliable news, be used for information order-> execution-> that receiving management center sends to deposit results messages in response queue.And communicating by letter between Tuxedo between the Weblogic of administrative center and each unit module used, it is exactly WTC interface.
Administrative center sends relevant command messages, and receives the message that returns results of carrying out.The command messages at each unit module receiving management center, and send the message that returns results of carrying out.In the time of implementation overlength of order or the situation of network failure, Tuxedo/Q can provide reliable messenger service, has guaranteed the integrality that message is transmitted.A mechanism, provides the more flexible more reliable asynchronous execution method simultaneously than tpacall () like this, has met the needs of strange land distributed system.Therefore,, in the present invention by adopt Tuxedo/Q between administrative center and each controlled end, can strange land distributed system be continued centralized management and be controlled.
Data simultaneous module 300, data comparative analysis module 400, data memory module 500 have built the assurance of data consistency in the distributed system of strange land jointly, data simultaneous module 300 and data module 500 storages are for realizing the real-time synchronization of the operation system data that in system, strange land distributes, the checking of the operation system data consistency that data comparative analysis module 400 distributes for strange land also can be carried out data location and analysis according to otherness, carries out relevant data and remedies.Data simultaneous module 300, data comparative analysis module 400, data memory module 500 are the basis of business recovery module 600.
Operation flow module 600 will be for effectively realizing every operation flow of strange land compartment system, procedure information is based on basic elements such as flow process, step, function, combination functions, adopt the formal definition of orderly functional steps in database, and can pass through script custom-modification.Administrative center's program is read procedure information and is explained and carry out, and completes the execution of traffic flow function, realizes fixing routine work flow process in system, and these operation flows are referred to as fixedly flow process.In addition, for dealing with, process some interim system requirements, as equipment replacement, line upkeep, troubleshooting etc., need the random business function of carrying out a series of necessity, derive thus the function of arbitrary procedure, support is carried out for the selection of defined relevant specific function, is for the supplementing of fixed service flow process, and is a kind of control mode of operation system very flexibly.
The assurance of business recovery module 700 based on data consistency, the module by operation flow represents, and has guaranteed that strange land distributed system is when the abnormal conditions such as disaster occur, and can realize fast cross regional business flow process and take over, and has guaranteed the continuation of data and business.Business recovery module 700 comprises following submodule: for starting the operation system application in strange land and the first submodule of database; For obtaining the second submodule of the time of disaster switching; The 3rd submodule switching for carrying out network.Give an example: in situation about breaking down such as the transfer service system at center, Shanghai, need to be switched to center, Beijing at once, now the first submodule of business recovery module 700 can start adapting system application and the database in strange land, and judge whether to possess switching condition, the second submodule of business recovery module 700 obtains the time point (preparing against follow-up the remedying of data of carrying out) that disaster is switched, and the 3rd submodule of business recovery module 700 is carried out the work such as network switching.
If these flow processs are decomposed, each step in flow process is exactly in fact the control to certain business main frame, automatically complete the operational order that administrative center sends, after this flow performing is complete, concerning user, be all transparent, and actual trading processing place has become Beijing by Shanghai, guaranteed the continuation of business.After center, Shanghai operation system is recovered, also corresponding a set of service switchback flow process, can normally deliver to transaction Shanghai after switchback and process.And data are remedied flow process can remedy center, Shanghai by switching the transaction of processing in Beijing during this period of time.
Security audit module 800 is for avoiding the plaintext transmission of message between administrative center and unit module, and message has been increased and encrypt arranged, and inserts the parameter of uses-z in the process of message carry out rsa encryption setting by WSL.And, in the reception of message with in sending, all with encrypted form, to carry out, message is deciphering automatically again after receiving, can guarantee like this safety that data are transmitted.Meanwhile, administrative center adopts unified LDAP(Lightweight Directory Access Protocol) server carries out authentication.Operator's authority configuration information is taken from ldap server equally, first checks associated authorization when carrying out various functions operation.Only have authorized user could carry out the function of every operation flow.In addition, security audit module 800 also records and audits log-on message, Operation Log, flow performing.
In the situation that integrated platform for IT system disaster recovery of the present invention shown in Fig. 2 is managed concentratedly the strange land distributed I T system shown in Fig. 1, on the business main frame of disposing for local and remote side, there is controlled end, local and remote side is disposed administrative center separately simultaneously, each administrative center all realizes communication with all business main frames of local and remote side, to reach the object of system management, business realizing and recovery.
Fig. 3 means that the data that the integrated platform for IT system disaster recovery of the present invention carries out are stored, the schematic diagram of data synchronization processing.As shown in Figure 3, in the integrated platform local service system of IT system disaster recovery of the present invention, disposed a set of mirrored storage, realized that data between local main business system synchronously copy and data are bidirectional replication.Between the storage of local mirrored storage and strange land, realized data asynchronous replication and data are unidirectional replication.Such data synchronization mechanism has guaranteed, when local service system or data generation disaster, can in strange land, realize business recovery rapidly, and data can not lost.After local system recovery business, strange land data can be covered in local data base again.
About " data covering ", can understand like this, for example, local service system (Shanghai) continues an example of mentioning above when describing " business recovery module 700 ": when need to be switched Beijing, can record the time point T0 switching, after switching, all transaction reality has been transformed into center, Beijing and has processed.After center, Shanghai recovery business, can carry out the switchback flow process of corresponding operation system, also can record the time point T1 of switching simultaneously, follow-uply will remedy flow process by executing data, because time difference of T1-T0 is exactly that transaction is in the time period of Beijing center processing.So remedying flow process, data now can start, Beijing administrative center can send instruction, from the transaction data base at center, Beijing, read data (namely strange land data) during this period of time, fiber optic network by this segment data by Beijing to Shanghai passes to Shanghai administrative center, and then Shanghai administrative center can be inserted into these data in corresponding Service Database.Like this, no matter for operation system or user, transaction data is all complete, just as not switching.
Fig. 4 means that the unit module under the management of the integrated platform for IT system disaster recovery of the present invention is that controlled end is to the handling process of information receiving and transmitting (idiographic flow that namely system communication module 200 carries out).As shown in Figure 4, at a unit module, first, and process initialization, allocation space, and generate one with the chained list of head node.On the server of Liang Tai administrative center, dispose respectively the WSL service of corresponding tuxedo, simultaneously on every client-server, WSNADDR environmental variance is set, the value of environmental variance is that the WSL of tuxedo service end issue serves (ip address: port numbers), be connected to tuxedo service end for tuxedo client-side program (controlled application end program), corresponding address.If connection failure, interval, after 30 seconds, reconnects.
Chained list is mainly used in depositing the state information of the current executive process of carrying out, the content of each node comprises that process number, message function number, message uniqueness mark, set of parameter values, process start time and the whether available sign (0 for available, and 1 is unavailable) of this node carried out.After executive process is finished dealing with, host process can empty the nodal information in chained list corresponding to this executive process, and availability sign is set to 0, for later.
The validity of judgement message is mainly that the value (value) of the checking mark of the application system in command messages (system), function number (func_id), IP address (ip), time (time), type of message (type) etc. is carried out to validity judgement.
Function treatment script carries out feature operation while processing, can be according to different situations, and whether to processing, how this situation such as processes judge and determines, avoids the unnecessary operation of mistake, returns to corresponding value.Return value is 0 presentation function operational processes success, non-zero expression unsuccessfully.
When the message receiving is interrupt message, host process sends interrupt signal to corresponding executive process, and executive process receives after interrupt signal, stops circulation, no longer carries out operation below.
In Fig. 4, the execution flow process that left-hand component is host process.Host process is a cyclic program, mainly completes transmission heartbeat message, accepts message and judges the validity of message, according to message content, produces corresponding executive process, and the operation of the executive process carried out of management.
According to the above-mentioned management of the integrated platform for IT system disaster recovery of the present invention, quick, simple, effective disaster recovery mechanism can be provided, in design object, reach RPO=0, RTO=0, when actual disaster occurs, also can within the shortest time, provide lasting business service.Aspect disaster recovery, at present industry generally acknowledges have three desired values must effort.The one, recovery time, how long enterprise does not have IT if standing, in the state of stopping doing business; The 2nd, how long network can recover; The 3rd, the recovery of service layer.In whole recovery process, the measurement index of most critical has two: one is RTO, and another is RPO.So-called RTO(Recovery Time Objective) after referring to that disaster occurs, from IT system when machine causes in service pause, to IT system, return to and can support all departments' running, recover in operation, the time period between these 2 is called RTO.So-called RPO(Recovery Point Objective) refer to from system and application data, realize returning to and can support all departments' business running, what kind of renewal degree system and creation data should return to.This renewal degree can be the Backup Data of upper a week, can be also the real time data of last transaction.Visible, the management of the integrated platform for IT system disaster recovery of the present invention can provide lasting business service when there is disaster within the shortest time.
And according to the integrated platform for IT system disaster recovery of the present invention, controlled end (being each unit module) can be safeguarded the reliable robustness moving that is connected, keeps abundance keeping with server end automatically.
And, according to the integrated platform for IT system disaster recovery of the present invention, can effectively monitor the running status of the controlled end of all deployment (being each unit module), running status for miscellaneous service flow process provides effective monitoring, meanwhile, can provide management maintenance mode for configurable parameter.
And, according to the integrated platform for IT system disaster recovery of the present invention, for operation flow, can realize flexible configuration and combination, such as supporting that configuring by parametrization the generality of dealing with business function changes; For the mistake occurring in flow performing, the function of abnormality processing is provided on stream, realize for abnormal effective processing.
And in order to guarantee the RTO of strange land distributed system, the performance requirement of RPO, the integrated platform for IT system disaster recovery of the present invention is realized miscellaneous service flow process by design, the flow process completing under daily, inside the plan and disaster scenario is controlled.Fixed service flow process and the arbitrarily realization of functional sequence are the Core Features that calamity provides for application system.For effectively realizing every operation flow, procedure information, based on basic elements such as flow process, step, function, combination functions, adopts the formal definition of orderly functional steps in database, and can pass through script custom-modification.Administrative center's program is read procedure information and is explained and carry out, and complete the execution of traffic flow function, these operation flows are referred to as fixedly flow process.For dealing with, process some interim operation system requirements in addition, as equipment replacement, line upkeep, troubleshooting etc., need the random business function of carrying out a series of necessity.
Above example has mainly illustrated the integrated platform that the present invention is directed to IT system disaster recovery.Although only some of them the specific embodiment of the present invention is described, those of ordinary skills should understand, and the present invention can be within not departing from its purport and scope implements with many other forms.Therefore, the example of showing and execution mode are regarded as illustrative and not restrictive, and in the situation that not departing from spirit of the present invention as defined in appended each claim and scope, the present invention may be contained various modifications and replacement.

Claims (13)

1.一种针对IT系统灾难恢复的集成平台,用于对异地分布式IT系统进行集中管理,该异地分布式IT系统具备多个本地业务主机、多个异地业务主机、管理本地业务主机和异地业务主机的两台管理中心,该集成平台包括: 1. An integrated platform for IT system disaster recovery, which is used for centralized management of remote distributed IT systems. The remote distributed IT system has multiple local business hosts, multiple remote business hosts, management of local business hosts and remote Two management centers of business hosts, the integrated platform includes: 系统管理模块(100),用于实时监控和管理所述异地分布式IT系统,以使得所述管理中心能够实时获取各业务主机的信息; A system management module (100), configured to monitor and manage the remote distributed IT system in real time, so that the management center can obtain information of each business host in real time; 系统通信模块(200),部署于所述各本地业务主机、各异地业务主机、管理中心,并且用于实现各本地业务主机与管理中心间的通信、各异地业务主机与管理中心间的通信、以及管理中心之间的通信; The system communication module (200) is deployed in the local business hosts, business hosts in different places, and management centers, and is used to realize the communication between the local business hosts and the management center, the communication between the business hosts in different places and the management center, and communication between management centers; 数据同步模块(300),用于实现所述异地分布式IT系统中的数据实时同步; A data synchronization module (300), configured to realize real-time data synchronization in the remote distributed IT system; 数据比较分析模块(400),用于实现所述异地分布式IT系统中数据的一致性验证; A data comparison and analysis module (400), configured to implement consistency verification of data in the remote distributed IT system; 数据存储模块(500),用于实现所述异地分布式IT系统中的数据存储; A data storage module (500), configured to implement data storage in the remote distributed IT system; 业务流程模块(600),用于实现所述异地分布式IT系统中的各类业务流程; A business process module (600), configured to realize various business processes in the remote distributed IT system; 业务恢复模块(700),用于在所述异地分布式IT系统发生灾难的情况下实现异地业务流程的接管; A business recovery module (700), configured to take over remote business processes in the event of a disaster in the remote distributed IT system; 安全审计模块(800),用于对各业务主机和各管理中心之间的消息的接收和发送进行加密、解密。 The security audit module (800) is used for encrypting and decrypting the receiving and sending of messages between each business host and each management center. 2.如权利要求1所述的针对IT系统灾难恢复的集成平台,其特征在于, 2. the integrated platform for IT system disaster recovery as claimed in claim 1, is characterized in that, 所述系统管理模块(100)、所述数据同步模块(300)、所述数据比较分析模块(400)、所述数据存储模块(500)、所述业务流程模块(600)、所述业务恢复模块(700)、所述安全审计模块(800)均与所述系统通信模块(200)关联, The system management module (100), the data synchronization module (300), the data comparison and analysis module (400), the data storage module (500), the business process module (600), the business recovery module module (700), the security audit module (800) are associated with the system communication module (200), 所述数据同步模块(300)、所述数据比较分析模块(400)、所述数据存储模块(500)、所述业务流程模块(600)均与所述业务恢复模块(700)关联。 The data synchronization module (300), the data comparison and analysis module (400), the data storage module (500), and the business process module (600) are all associated with the business recovery module (700). 3.如权利要求2所述的针对IT系统灾难恢复的集成平台,其特征在于, 3. the integrated platform for IT system disaster recovery as claimed in claim 2, is characterized in that, 所述系统通信模块用于实现各业务主机和各管理中心之间的消息收发、消息解析、命令执行、结果反馈。 The system communication module is used to realize message sending and receiving, message parsing, command execution and result feedback between each business host and each management center. 4.如权利要求3所述的针对IT系统灾难恢复的集成平台,其特征在于, 4. the integrated platform for IT system disaster recovery as claimed in claim 3, is characterized in that, 所述各管理中心与所述各业务主机中的各单元模块之间通过WTC接口连接Tuxedo/Q服务。 Tuxedo/Q services are connected between each management center and each unit module in each service host through a WTC interface. 5.如权利要求4所述的针对IT系统灾难恢复的集成平台,其特征在于, 5. the integrated platform for IT system disaster recovery as claimed in claim 4, is characterized in that, 所述安全审计模块对管理中心与业务主机中的各个单元模块之间的消息的发送和接收通过使用WSL插入消息进行加密。 The security audit module encrypts the sending and receiving of messages between the management center and each unit module in the service host by using WSL to insert messages. 6.如权利要求5所述的针对IT系统灾难恢复的集成平台,其特征在于, 6. the integrated platform for IT system disaster recovery as claimed in claim 5, is characterized in that, 所述安全审计模块通过WSL插入消息的过程中使用-z的产生进行rsa加密设置。 In the process of inserting the message through WSL, the security audit module uses the generation of -z to perform rsa encryption setting. 7.如权利要求6所述的针对IT系统灾难恢复的集成平台,其特征在于, 7. the integrated platform for IT system disaster recovery as claimed in claim 6, is characterized in that, 所述数据比较分析模块(400)能够对本地和异地的业务系统数据库进行数据一致性比较,能够对差异性进行定位,并且能够针对其差异性进行数据追补。 The data comparison and analysis module (400) can perform data consistency comparison on local and remote business system databases, can locate differences, and can perform data supplement for the differences. 8.如权利要求7中所述的针对IT系统灾难恢复的集成平台,其特征在于, 8. The integrated platform for IT system disaster recovery as claimed in claim 7, characterized in that, 所述业务恢复模块(700)包括: The business recovery module (700) includes: 用于启动异地的业务系统应用和数据库,并判断是否具备切换条件的第一子模块; The first sub-module for starting business system applications and databases in different places and judging whether switching conditions are met; 用于获取灾难切换的时间点的第二子模块; The second submodule used to obtain the time point of disaster switching; 用于执行网络切换的第三子模块。 A third submodule for performing network switching. 9.如权利要求1~8中任意一项所述的针对IT系统灾难恢复的集成平台,其特征在于, 9. The integrated platform for disaster recovery of IT systems according to any one of claims 1 to 8, wherein: 所述数据同步模块(300)用于在本地业务系统中部署镜像存储以实现本地业务系统之间的数据同步复制。 The data synchronization module (300) is used for deploying mirror storage in local business systems to realize synchronous replication of data between local business systems. 10.如权利要求9所述的针对IT系统灾难恢复的集成平台,其特征在于, 10. the integrated platform for IT system disaster recovery as claimed in claim 9, is characterized in that, 所述数据同步模块(300)用于实现本地镜像存储与异地存储之间的数据异步复制。 The data synchronization module (300) is used to realize asynchronous data replication between local mirror storage and remote storage. 11.如权利要求10所述的针对IT系统灾难恢复的集成平台,其特征在于, 11. the integrated platform for IT system disaster recovery as claimed in claim 10, is characterized in that, 所述数据同步模块(300)还用于在本地系统恢复业务后将异地数据回补到本地数据库中。 The data synchronization module (300) is also used for backing up remote data into the local database after the local system resumes business. 12.如权利要求11所述的针对IT系统灾难恢复的集成平台,其特征在于, 12. the integrated platform for IT system disaster recovery as claimed in claim 11, is characterized in that, 所述两台管理中心为功能相同并且互为备份。 The two management centers have the same function and are mutual backups. 13.如权利要求12所述的针对IT系统灾难恢复的集成平台,其特征在于, 13. The integrated platform for IT system disaster recovery as claimed in claim 12, characterized in that, 所述管理中心采用LDAP服务器镜像身份认证。 The management center adopts LDAP server image identity authentication.
CN201310013623.8A 2013-01-15 2013-01-15 An Integrated Platform for IT System Disaster Recovery Pending CN103929320A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201310013623.8A CN103929320A (en) 2013-01-15 2013-01-15 An Integrated Platform for IT System Disaster Recovery
PCT/CN2014/070331 WO2014110994A1 (en) 2013-01-15 2014-01-08 Integrated platform for disaster recovery of it system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310013623.8A CN103929320A (en) 2013-01-15 2013-01-15 An Integrated Platform for IT System Disaster Recovery

Publications (1)

Publication Number Publication Date
CN103929320A true CN103929320A (en) 2014-07-16

Family

ID=51147404

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310013623.8A Pending CN103929320A (en) 2013-01-15 2013-01-15 An Integrated Platform for IT System Disaster Recovery

Country Status (2)

Country Link
CN (1) CN103929320A (en)
WO (1) WO2014110994A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104601678A (en) * 2014-12-31 2015-05-06 江苏中科梦兰电子科技有限公司 High concurrence white board remote real-time synchronization method
WO2016188100A1 (en) * 2015-11-10 2016-12-01 中国建设银行股份有限公司 Information system fault scenario information collection method and system
CN111124696A (en) * 2019-12-30 2020-05-08 北京三快在线科技有限公司 Unit group creation method, unit group creation device, unit group data synchronization method, unit group data synchronization device, unit and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1753373A (en) * 2004-09-23 2006-03-29 华为技术有限公司 Remote disaster allowable system and method
CN101118509A (en) * 2007-09-12 2008-02-06 华为技术有限公司 Method, device and system for remote disaster recovery of memory database
US20100318812A1 (en) * 2009-06-12 2010-12-16 Microsoft Corporation Secure and private backup storage and processing for trusted computing and data services

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1753373A (en) * 2004-09-23 2006-03-29 华为技术有限公司 Remote disaster allowable system and method
CN101118509A (en) * 2007-09-12 2008-02-06 华为技术有限公司 Method, device and system for remote disaster recovery of memory database
US20100318812A1 (en) * 2009-06-12 2010-12-16 Microsoft Corporation Secure and private backup storage and processing for trusted computing and data services

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104601678A (en) * 2014-12-31 2015-05-06 江苏中科梦兰电子科技有限公司 High concurrence white board remote real-time synchronization method
CN104601678B (en) * 2014-12-31 2018-10-09 江苏中科梦兰电子科技有限公司 A kind of big concurrent blank remote real-time synchronous method
WO2016188100A1 (en) * 2015-11-10 2016-12-01 中国建设银行股份有限公司 Information system fault scenario information collection method and system
US10545807B2 (en) 2015-11-10 2020-01-28 China Construction Bank Corporation Method and system for acquiring parameter sets at a preset time interval and matching parameters to obtain a fault scenario type
CN111124696A (en) * 2019-12-30 2020-05-08 北京三快在线科技有限公司 Unit group creation method, unit group creation device, unit group data synchronization method, unit group data synchronization device, unit and storage medium
CN111124696B (en) * 2019-12-30 2023-06-23 北京三快在线科技有限公司 Unit group creation, data synchronization method, device, unit and storage medium

Also Published As

Publication number Publication date
WO2014110994A1 (en) 2014-07-24

Similar Documents

Publication Publication Date Title
US11816003B2 (en) Methods for securely facilitating data protection workflows and devices thereof
US8300831B2 (en) Redundant key server encryption environment
US9141814B1 (en) Methods and computer systems with provisions for high availability of cryptographic keys
US10110667B2 (en) System and method for providing data and application continuity in a computer system
US9037844B2 (en) System and method for securely communicating with electronic meters
US10990605B2 (en) Instance data replication
CN202563493U (en) Unstructured data sharing disaster platform
US9231779B2 (en) Redundant automation system
US11228486B2 (en) Methods for managing storage virtual machine configuration changes in a distributed storage system and devices thereof
CN109995522A (en) A secure data mirroring method with key agreement function
CN112099878A (en) Application software configuration management method, device and system
WO2024120227A1 (en) Container data protection system, method and apparatus, and device and readable storage medium
US10402377B1 (en) Data recovery in a distributed computing environment
CN109842506A (en) Key management system disaster tolerance processing method, device, system and storage medium
CN103929320A (en) An Integrated Platform for IT System Disaster Recovery
CN106250048B (en) Method and apparatus for managing storage array
CN106657390A (en) Cluster file system directory isolation method, cluster file system directory isolation device and cluster file system directory isolation system
JP7346313B2 (en) Database management systems, cloud provision systems, data replication systems, and programs
EP3719599B1 (en) Network-distributed process control system and method for managing redundancy thereof
Liu et al. G-cloud: a highly reliable and secure IaaS platform
JP7567576B2 (en) Control system and control method thereof
EP2739010B1 (en) Method for improving reliability of distributed computer systems based on service-oriented architecture
JP2013003956A (en) Failure recovery management device, failure recovery management method, and failure recovery management program
Mane et al. Building a high availability-OpenStack
CN117240455A (en) An encryption system based on IPsec link encryption method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140716

RJ01 Rejection of invention patent application after publication