WO2016062037A1 - Procédé, appareil, et système de transmission d'informations, et gestion de défaillance de contrôleur via des cartes d'interface - Google Patents
Procédé, appareil, et système de transmission d'informations, et gestion de défaillance de contrôleur via des cartes d'interface Download PDFInfo
- Publication number
- WO2016062037A1 WO2016062037A1 PCT/CN2015/076658 CN2015076658W WO2016062037A1 WO 2016062037 A1 WO2016062037 A1 WO 2016062037A1 CN 2015076658 W CN2015076658 W CN 2015076658W WO 2016062037 A1 WO2016062037 A1 WO 2016062037A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- controller
- interface card
- controllers
- notification message
- fault
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/34—Signalling channels for network management communication
- H04L41/344—Out-of-band transfers
Definitions
- the present invention relates to the field of communications, and in particular, to a method for transmitting information through an interface card, a controller fault handling method, an apparatus, and a system.
- controllers are designed with redundancy. If one controller fails, it will not affect the system business. All services are taken over by redundant controllers. This design is very important for storage reliability. of.
- the interface card for controller control and management belongs to each controller. As shown in Figure 1, interface card 1 is connected to controller A, and interface card 2 is connected to controller B. Each interface card only serves one control. Device. For example, if controller A fails, the service on the link is stopped and the interface card 1 cannot be used any more. Among them, one controller can also be connected with multiple interface cards. In Figure 1, an interface card is taken as an example, but each interface card can only serve one controller.
- Embodiments of the present invention provide a method, a device, and a system for processing information through a interface card, which are used to solve a service interruption in a controller faulty interface card and a device that cannot be used continuously with the interface card. problem.
- a storage system comprising:
- M controllers for controlling the system;
- the M controllers include a main controller and M-1 slave controllers as redundant, and M is a positive integer;
- N interface cards wherein each interface card is coupled to at least two controllers for relaying signals transmitted by the controller, or signals transmitted to the controller, or for processing from the controller Signal; N is an integer less than or equal to M.
- the interface card is connected to the controller through a PCIE bus.
- a serial control bus and/or parallel control is further connected between the interface card and the controller Bus for transmitting control signals.
- the system further includes at least one storage device, wherein Each storage device is coupled to the at least one interface card to enable the controller to interact with the storage device via a corresponding interface card.
- a second aspect of the present invention provides a method for transmitting information through an interface card, including:
- the second controller of the M controllers competes for the new primary controller
- the second controller performs information relaying by using at least one of the N interface cards; wherein the first interface card is respectively connected to the first controller and the second controller.
- the method further includes:
- the second controller receives a first fault notification message sent by the first controller, where the first fault notification message is used to notify the second controller that the first controller has a fault.
- the second controller in the M controllers competes for a new master control After the device, it also includes:
- the second controller receives a second fault notification message sent by a third controller of the M controllers, where the second fault notification message is used to notify the second controller, the third controller A failure has occurred;
- the second controller removes information of the third controller from a list of redundant controllers; wherein the redundant controller list is used to record information that can be used as redundant controllers.
- a third aspect of the present invention provides a controller fault processing method, including:
- the first interface card connected to the first controller and serving the first controller of the N interface cards receives the a second fault notification message sent by the first controller, where the second fault notification message is used to notify the first interface card, where the first controller has a fault;
- the first interface card controls the port connected to the first controller to enter an inactive state according to the second failure notification message to stop communication with the first controller.
- the method further includes:
- the first interface card receives a master control notification message sent by a second controller of the M controllers, where the master control notification message is used to notify the first interface card, and the second controller has been contending For the new primary controller;
- the first interface card controls, according to the master control notification message, that a port connected to the second controller enters an active state to communicate with the second controller through a port connected to the second controller.
- a fourth aspect of the present invention provides a method for transmitting information through an interface card, including:
- the first controller When the first controller that is the primary controller of the M controllers included in the storage system fails, the first controller sends a second failure notification message to the first interface card, where the second failure notification message is used.
- the first interface card is configured to notify the first interface card that the first controller is faulty; wherein the first interface card is connected to the first controller in the N interface cards included in the storage system, and is An interface card served by the first controller;
- the first interface card controls, according to the second failure notification message, that a port connected to the first controller enters an inactive state to stop communication with the first controller;
- the second controller When the second controller of the M controllers contends to be the new master controller, the second controller sends a master control notification message to the first interface card, where the master control notification message is used to notify The first interface card, the second controller has competed for a new primary controller; wherein the first interface card is connected to the second controller;
- the first interface card controls, according to the master control notification message, that a port connected to the second controller enters an active state to communicate with the second controller through a port connected to the second controller.
- a controller comprising:
- An operation module when the first controller as the main controller of the M controllers fails, causing the controller to compete for a new main controller;
- a communication module configured to perform information relaying by using at least one of the N interface cards; wherein the first interface card is respectively connected to the first controller and the controller.
- the controller further includes: a receiving module, configured to: before the operating module causes the controller to compete for a new primary controller, Receiving a first fault notification message sent by the first controller, where the first fault notification message is used to notify the controller that the first controller has a fault.
- the controller further includes a receiving module and a removing module;
- the receiving module is configured to: after the operating module causes the controller to compete for a new primary controller, receive a second fault notification message sent by a third controller of the M controllers, where the second fault notification is sent a message is used to notify the controller that the third controller has failed;
- the removal module is configured to remove information of the third controller from a list of redundant controllers; wherein the redundant controller list is used to record information that can be used as redundant controllers.
- an interface card including:
- a receiving module configured to receive a second fault notification message sent by the first controller when a first controller that is the master controller of the M controllers fails, where the second fault notification message is used to notify the In the interface card, the first controller is faulty; wherein the interface card is an interface card of the N interface cards that is connected to the first controller and serves the first controller;
- control module configured to control, according to the second fault notification message, a port connected to the first controller to enter an inactive state to stop communication with the first controller.
- the receiving module is further configured to: after receiving the second fault notification message sent by the first controller, receive the M controls a master control notification message sent by the second controller in the device, the master control notification message is used to notify the interface card, and the second controller has competed for a new master controller;
- the control module is further configured to: according to the master control notification message, control a port connected to the second controller to enter an activation state, to connect a port connected to the second controller with the second controller Communicate.
- a storage system including:
- a first controller configured to send a second fault message to the first interface card when the first controller that is the master controller in the M controllers included in the storage system fails, the second The fault notification message is used to notify the first interface card that the first controller has a fault; wherein the first interface card is among the N interface cards included in the storage system and the first control An interface card that is connected to and serves the first controller;
- the first interface card is configured to control, according to the second fault notification message, a port connected to the first controller to enter an inactive state to stop communication with the first controller;
- a second controller configured to send a master control notification message to the first interface card when the second controller of the M controllers contends to be a new master controller, the master control notification message For notifying the first interface card, the second controller has competed for a new primary controller; wherein the first interface card is connected to the second controller;
- the first interface card is further configured to control, according to the master control notification message, with the second controller
- the connected port enters an active state to communicate with the second controller through a port connected to the second controller.
- an interface card is connected to at least two controllers. If one of the controllers connected to an interface card fails, the interface card can stop serving the controller, and the interface card is also Connected to other controllers, it can continue to serve other controllers. In this way, even if the controller fails, the interface card can continue to serve other controllers as long as there is no fault, and can continue to be used. Compared with the prior art, the services in the interface card are not interrupted, and the interface card is connected. Other hardware devices can continue to transmit information through the interface card, which ensures the reliability of the system.
- the interface card and the device connected to the interface card can continue to be used, which also saves hardware resources to a certain extent and improves the utilization of the interface card.
- the number of interface cards can be reduced to a certain extent, the system structure tends to be simple, and the volume of the system is reduced.
- FIG. 1 is a structural diagram of a storage system in the prior art
- FIG. 2 is a schematic structural diagram of a storage system according to an embodiment of the present invention.
- FIG. 3 is a detailed structural diagram of an implementation manner of a storage system according to an embodiment of the present invention.
- FIG. 4 is a schematic structural diagram of another implementation manner of a storage system according to an embodiment of the present invention.
- FIG. 5 is a main flowchart of a method for transmitting information by using an interface card according to an embodiment of the present invention
- FIG. 6 is a main flowchart of a controller fault processing method according to an embodiment of the present invention.
- FIG. 7 is a main flowchart of another method for transmitting information by using an interface card according to an embodiment of the present invention.
- FIG. 8 is a main structural block diagram of a controller in an embodiment of the present invention.
- FIG. 9 is a main structural block diagram of an interface card according to an embodiment of the present invention.
- An embodiment of the present invention provides a storage system, including: M controllers for controlling the system; the M controllers include one main controller and M-1 as redundant slave controllers, M Positive An integer number; N interface cards, wherein each interface card is coupled to at least two controllers for relaying signals transmitted by the controller or transmitted to the controller, or for processing from the control The signal of the device; N is an integer less than or equal to M.
- an interface card is connected to at least two controllers. If one of the controllers connected to an interface card fails, the interface card can stop serving the controller, and the interface card is also Connected to other controllers, it can continue to serve other controllers. In this way, even if the controller fails, the interface card can continue to serve other controllers as long as there is no fault, and can continue to be used. Compared with the prior art, the services in the interface card are not interrupted, and the interface card is connected. Other hardware devices can continue to transmit information through the interface card, which ensures the reliability of the system.
- the interface card and the device connected to the interface card can continue to be used, which also saves hardware resources to a certain extent and improves the utilization of the interface card.
- the number of interface cards can be reduced to a certain extent, the system structure tends to be simple, and the volume of the system is reduced.
- system and “network” are used interchangeably herein.
- the term “and/or” in this context is merely an association describing the associated object, indicating that there may be three relationships, for example, A and / or B, which may indicate that A exists separately, and both A and B exist, respectively. B these three situations.
- the character "/" in this article unless otherwise specified, generally indicates that the contextual object is an "or" relationship.
- an embodiment of the present invention provides a storage system, which may include M controllers 201 and N interface cards 202.
- M controllers 201 are used to control the system.
- the M controllers 201 include one main controller 201 and M-1 as redundant slave controllers 201, and M is a positive integer. That is, the M-1 slave controllers 201 are backups of the master controller 201, and when the master controller 201 fails, one of the slave controllers 201 can continue to operate as the master controller 201, allowing the system operation to continue.
- N interface cards 202 each of which is connected to at least two controllers 201 of the M controllers 201 for transmitting or transmitting the controller 201 connected to the interface card 202.
- the signal from the controller 201 connected to the interface card 202 is relayed, or the interface card 202 is also used to process signals from the controller 201 connected to the interface card 202.
- N is an integer less than or equal to M, that is, in the system, the number of interface cards 202 is less than or equal to the number of controllers 201.
- one interface card 202 corresponds to multiple controllers 201 (ie, connected to at least two controllers 201), one controller 201 may correspond to only one interface card 202, or one controller 201 may correspond to multiple Interface card 202.
- the interface card 202 can have multiple types of ports that can be connected to different hardware modules. Then the signal sent by the controller 201 can be relayed through the interface card 202. For example, the format of the signal sent by the controller 201 to the interface card 202 is format 1. If the controller 201 wants to send the signal to the hardware module 1, and the corresponding signal format of the hardware module 1 is format 2, the interface card 202 can The format of the received signal is converted from format 1 to format 2 and then sent to the hardware module 1. Similarly, the signal sent by the hardware module 1 to the controller 201 is also relayed through the interface card 202.
- the controller 201 can also send signals to the interface card 202, such as signals for controlling the interface card 202, or signals to the interface card 202 to inform the status of the controller 201, etc., which the interface card 202 can process.
- the interface card 202 can also send a signal to the controller 201, such as a signal for notifying the controller 201 of the status of the interface card 202, and the like.
- the interface card 202 and the controller 201 can be connected through a PCIE (Peripheral Component Interconnect Express) bus.
- PCIE Peripheral Component Interconnect Express
- PCIE TX for an interface card 202 and a controller 201 connected thereto, There may be two PCIE buses connected, namely PCIE TX and PCIE RX, respectively called PCIE transmit bus and PCIE receive bus, where the transmission and reception are both for the controller 201.
- PCIE TX the controller 201 can transmit signals at high speed, and the controller 201 can receive signals through the PCIE RX.
- FIG 3 there are two controllers 201, a for PCIE TX and b for PCIE RX.
- the PCIE TX and the PCIE RX can be designed according to actual service requirements, and can be bandwidths of different standards such as X4, X8, and X16, and are the main service data channels between the controller 201 and the interface card 202.
- a serial control bus and/or a parallel control bus may be connected for transmitting control signals.
- a serial control bus and a parallel control bus are connected at the same time.
- c denotes a serial control bus
- d denotes a parallel control bus.
- the serial control bus transmits a low-speed signal, and is mainly used to perform some mutual handshake between the controller 201 and the interface card 202, for example, the controller 201 can read the interface card 202 type, status, alarm information, etc., and The interface card 202 acquires status information of the controller 201, such as master-slave status information of the controller 201, that is, whether the controller 201 is the master controller 201 or the slave controller 201, and the like.
- the parallel control bus transmits high-speed signals, and is mainly used for the controller 201 to control the state of the interface card 202 and upgrade the firmware, version, etc. of the interface card 202, including general basic control signals, such as in-position signals, power-on enable, and interrupt. Signal, reset signal, etc.
- serial control bus and the parallel control bus are all buses that can be customized according to the required functions, and are not limited to a fixed form.
- each of the N interface cards 202 can be connected to the server, so that information exchange can be implemented between the server and the system through the interface card 202.
- the storage system may further include at least one storage device, wherein each storage device is connected to at least one interface card 202, so that the controller 201 and the storage device can pass the corresponding interface card. 202 performs information interaction.
- the storage device may for example be a hard disk or may be other types of devices for storing information.
- Figure 3 is an interface card 202, and the connection The port card 202 is connected to two storage devices as an example.
- each interface card 202 can have a control module, and the function of the interface card 202 is implemented by the control module.
- the control module in the interface card 202 is a module that can process, control, convert or switch signals between the controller 201 and the interface card 202, and may be one or more chips or external onboard lines, according to the state of the controller 201. Or command to implement the state change and switching of the port connected to the controller 201 on the interface card 202, and also convert the signal from the PCIE protocol format to other various protocol formats, such as converting from the PCIE protocol format to the FC (Fibre Channel, Fibre Channel protocol format, from PCIE protocol format to GE (Gigabit Ethernet, Gigabit Ethernet interface) protocol format, from PCIE protocol format to SAS (Serial Attached SCSI) protocol format, and so on.
- the control module can also implement a mirroring (NT) function to back up data in real time with the controller 201.
- connection line between the two controllers 201 there is a connection line between the two controllers 201, for example between the two controllers 201, there may be a connection line for transmitting mirrored data, wherein the mirrored data may be Refers to backup data, which may have a connection line for transmitting heartbeat information, and may also have a serial control bus and/or a parallel control bus.
- each of the two controllers 201 may have a connection line, for example, both may have a connection line for transmitting mirror data and a connection line for transmitting heartbeat information, and It may have a serial control bus and/or a parallel control bus.
- the two controllers 201 perform state information transmission through mirroring and heartbeat, and monitor each other's status and service characteristics in real time.
- Both of the above-described Figures 2 and 3 are architectural designs that are more redundant with a more commonly used dual-control memory controller and an interface card 202.
- the service flow is transmitted by the main controller 201 through the PCIE bus and the interface card 202.
- the switch card 202 converts the PCIE format message into a message of another protocol bus format, such as a message.
- the FC card, the GE protocol, the SAS protocol, and the like, the interface card 202 can be connected to a server port of the front end or a disk of the back end.
- the main controller 201 fails, it will pass the heartbeat signal or power down between the controllers 201.
- the interrupt signal is notified to the slave controller 201, and the slave controller 201 can compete for the master controller 201. Because the interface card 202 is connected to both controllers 201, the interface card 202 controls the port state to perform corresponding switching, and the interface card 202 can continue to work to serve the new primary controller 201, so that the service of the interface card 202 can continue.
- the service is maintained with a server at the front end of the interface card 202 or a cascading box at the back end.
- FIG. 4 another possible schematic diagram of the storage system is provided.
- each controller 201 can connect two interface cards 202.
- the controller 201 on the left is first used as the main controller 201, and the main controller 201 uses the interface card 202 on the left.
- the controller 201 notifies the right controller 201.
- the controller 201 also notifies each interface card 202 connected thereto so that each interface card changes the corresponding port state.
- the controller 201 on the right competes for the main controller 201.
- the controller 201 on the right side may continue to select to use the interface card 202 on the left side, or may choose to use the interface card 202 on the right side, or may also select to use the interface card 202 on the left side and the interface card 202 on the right side.
- they may be according to pre-set rules, or may be randomly selected, and the like.
- the interface card 202 and the plurality of controllers 201 implement a redundant design in an intersecting manner, and the redundant manner of the cross-connects makes the entire interface card 202
- the reliability of the service is better.
- the controller 201 and the interface card 202 are both faulty, as long as there are controllers 201 and interface cards 202 that have not failed, the services at the front and rear ends of the interface card 202 can be continued to avoid service interruption.
- the system reliability has been greatly improved.
- an interface card 202 can be connected to a plurality of controllers 201 to be able to serve a plurality of controllers 201.
- the controller 201 on the left is the master controller 201
- the interface card 202 is the controller 201 on the left. If the main controller 201 fails, the right slave controller 201 can compete for the slave controller 201, and the interface card 202 can continue to serve the controller 201 on the right side, without being associated with the controller 201.
- the interface card 202 connected to the controller 201 cannot be used as much as possible to ensure the continuity of the service in the interface card 202, so as to be connected with the interface card. Other devices that are connected can continue to be used, improve the reliability of the system, save hardware resources as much as possible, and improve the utilization of the interface card 202.
- an embodiment of the present invention provides a method for transmitting information through an interface card, which may be applied to the storage system shown in FIG. 2, FIG. 3, and FIG. The main flow of the method is described below.
- Step 501 When the first controller as the main controller of the M controllers fails, the second controller of the M controllers competes for the new main controller.
- the M controllers 201 include a main controller 201 and M-1 as redundant slave controllers 201.
- the M controllers 201 belong to a storage system, and the storage system further includes N An interface card 202, wherein each interface card 202 is coupled to at least two controllers 201 for relaying signals transmitted by the controller 201 or transmitted to the controller 201, or for processing from the The signal of the controller 201; N is an integer less than or equal to M.
- the first controller is the controller 201 on the left side of the figure
- the second controller is the controller 201 on the right side of the figure.
- the storage system When the storage system starts to work, it first needs to be powered on. After the power is turned on, the system performs initialization. After the initialization is completed, the main controller 201 detects the in-position signal of the interface card 202, that is, determines whether the interface card 202 has been inserted into the correct insertion. groove. If the interface card 202 is not in place, the interface card 202 needs to be inserted. If it can be detected that the interface card is pulled low (because it is normally active low), the binary value of the type of the interface card 202 is determined by the driver. That is, it is judged whether the type of the interface card 202 is a type supported by the system. If the type of the interface card 202 is of a non-system supported type, the alarm message is sent by the driver, the red light on the interface card 202 is illuminated, and the user can reinsert the interface card 202 supported by the system.
- the system sends power-on enable to the interface card 202.
- the signal, clock signal and reset signal cause the interface card 202 to begin operation.
- the interface card 202 and the main controller 201 automatically negotiate the port, and the so-called negotiation port refers to the interface card 202.
- information such as a transmission bandwidth, a transmission rate, and the like of a transmission channel between the interface card 202 and the main controller 201 is determined.
- the interface card 202 and the main controller 201 can communicate.
- the main controller 201 at this time is, for example, the controller 201 on the left side in FIG.
- the method may further include:
- the second controller receives a first fault notification message sent by the first controller, where the first fault notification message is used to notify the second controller that the first controller has a fault.
- the main controller 201 When the main controller 201 fails, the main controller 201 notifies the slave controller 201 of the failure information of the main controller 201 by the power-down interrupt signal or the heartbeat signal, that is, transmits the first failure notification message to the slave controller 201, and then The controller 201 competes for the main controller 201, and the new main controller 201 at this time refers to, for example, the controller 201 on the right side of FIG. At the same time, the original main controller 201 also notifies the interface card 202 of the failure information, or the interface card 202 can automatically detect the status of each controller 201 periodically, periodically or randomly.
- the controller 201 on the right side notifies the interface card 202 through the serial control bus or the parallel control bus between the interface card 202, and the controller 201 has already competed for the master controller 201, that is, the master-slave has occurred between the controllers 201.
- the port connected between the controller 201 enters an inactive state, or enters a mirrored state, and the port connected between the interface card 202 and the controller 201 on the right can start working, that is, the interface between the interface card 202 and the controller 201 on the right.
- the port enters the active state.
- the port on the control interface card 202 enters an inactive state, a mirrored state, or an activated state, which may be specifically performed by a control module in the interface card 202.
- control port When the control port enters the mirroring state, it can control the port to transfer mirror data, that is, back up data. At this time, the port that enters the mirroring state is equivalent to the backup port of another port.
- the method may further include:
- the second controller receives the second reason sent by the third controller of the M controllers 201
- the second notification message is used to notify the second controller that the third controller has a fault
- the second controller removes information of the third controller from a list of redundant controllers; wherein the redundant controller list is used to record information of each controller 201 that can be redundant.
- the system includes three controllers 201, which are a controller 1, a controller 2 and a controller 3, respectively.
- the controller 1 is a master controller 201
- the controller 2 and the controller 3 are slave controllers. 201.
- the controller 2 competes for the main controller 201, and then the controller 3 is still the slave controller 201.
- the controller 3 When the controller 2 is working, if the controller 3 also fails, the controller 3 notifies the controller 2 by the power-down interrupt signal or the heartbeat signal, that is, sends the second fault notification message to the controller 2, and the controller 2
- the information of the controller 3 can be removed from the list of redundant controllers.
- the controller 3 also notifies the interface card 202 of the fault information, or the interface card 202 can automatically detect the state of each controller 201 periodically, periodically or randomly, and the interface card 202 controls the port entry with the controller 3. Inactive or mirrored state.
- the slave controller 201 when the slave controller 201 fails, the service between the master controller 201 and the interface card 202 is not affected. If there are other slave controllers 201 in the system, for example, there is also a fourth controller. Then, when the controller 2 as the main controller 201 fails, the controller 2 does not send the fault information to the faulty controller 3, that is, when the master-slave switch is required, the faulty slave controller 201 is not selected. Instead, the slave controller 201 is selected to be faultless. Of course, if there is no other slave controller 201 in the system, if the controller 2 as the master controller 201 also fails, the system may stop running.
- the controller 3 can notify the controller 2 by means of a heartbeat signal, etc., and the controller 2 will re-introducing the controller 3 into a selection range in which master-slave switching can be performed, that is, re-controlling the controller 3.
- the information is added to the list of redundant controllers.
- the slave controller 201 can also notify the interface card 202 when recovering from the failure of the controller 201.
- the controller 201 and the interface card 202 can exchange information.
- the controller 201 can send a notification message to the interface card 202 in real time, timing, or upon stateful transition to inform the interface card 202 of the current state of the controller 201, or the interface card 202 can also be in real time, timed, or randomly.
- a probe message is sent to the controller 201 to ascertain the current state of the controller 201.
- the interface card 202 can send a notification message to the controller 201 in real time, timing, or when there is a state transition to inform the controller 201 of the current state of the interface card 202, or the controller 201 can also be real time, timed, or randomly.
- a probe message is sent to the interface card 202 to ascertain the current state of the interface card 202.
- Step 502 The second controller performs information relaying by using at least one of the N interface cards, where the first interface card is respectively associated with the first controller and the second controller. connection.
- the second controller competes for the main controller 201.
- the first controller and the second controller are connected to the same interface card 202.
- the interface card 202 can continue to be used.
- the second controller can continue to use the interface card 202, which is referred to herein as the first interface card.
- an embodiment of the present invention provides a controller fault processing method, which may be applied to the storage system shown in FIG. 2, FIG. 3, and FIG. The main flow of the method is described below.
- Step 601 When the first controller that is the primary controller of the M controllers fails, the first interface card that is connected to the first controller and serves the first controller among the N interface cards Receiving a second fault notification message sent by the first controller, where the second fault notification message is used to notify the first interface card, and the first controller has a fault.
- the M controllers 201 include a main controller 201 and M-1 as redundant slave controllers 201, and the M controllers 201 and the N interface cards 202 belong to a storage system, wherein Each interface card 202 is coupled to at least two controllers 201 for relaying signals transmitted by the controller 201 or transmitted to the controller 201, or for processing from the controller 201 Signal; N is an integer less than or equal to M.
- the controller 201 on the left side is the main controller 201.
- the first failure notification message may be sent to the slave controller 201, and at the same time, the interface card 202 may also be sent to the interface card 202.
- the second failure notification message is described. The specific process has been described in the introduction of FIG. 2 to FIG. 5, and will not be described here.
- the method may further include:
- the first interface card receives a master control notification message sent by a second controller of the M controllers 201, where the master control notification message is used to notify the first interface card, and the second controller has Competing for a new master controller 201;
- the first interface card controls, according to the master control notification message, that a port connected to the second controller enters an active state to communicate with the second controller through a port connected to the second controller.
- the controller 201 on the left side is the main controller 201.
- the first failure notification message can be sent to the slave controller 201, and the controller 201 competes for the new master.
- the controller 201 sends a new control notification message to the first interface card, and after receiving the main control notification message, the first interface card activates and controls the new main control.
- the port between the devices 201 communicates with the new main controller 201.
- Step 602 The first interface card controls, according to the second failure notification message, that a port connected to the first controller enters an inactive state to stop communication with the first controller.
- the interface card 202 Upon receiving the second failure notification message, the interface card 202 can control the port connected to the controller 201 on the left to enter an inactive state, so that communication with the controller 201 on the left can be stopped.
- the master control notification message may also be sent to the interface card 202, and the interface card 202 may control the port connected to the controller 201 on the right to enter an active state, thereby The controller 201 performs communication.
- an embodiment of the present invention provides another method for transmitting information through an interface card, which may be applied to the storage system shown in FIG. 2, FIG. 3, and FIG. The main flow of the method is described below.
- Step 701 When a first controller that is the primary controller of the M controllers included in the storage system fails, the first controller sends a second fault notification message to the first interface card, where the second fault occurs.
- the notification message is used to notify the first interface card that the first controller is faulty; wherein the first interface card is connected to the first controller in the storage system including N interface cards And an interface card serving the first controller.
- the controller 201 on the left side is the main controller 201.
- the first failure notification message may be sent to the slave controller 201, and at the same time, the interface card 202 may also be sent to the interface card 202.
- the second failure notification message is described. The specific process has been described in the introduction of FIG. 2 to FIG. 5, and will not be described here.
- Step 702 The first interface card controls, according to the second failure notification message, that a port connected to the first controller enters an inactive state to stop communication with the first controller.
- the interface card 202 Upon receiving the second failure notification message, the interface card 202 can control the port connected to the controller 201 on the left to enter an inactive state, so that communication with the controller 201 on the left can be stopped.
- Step 703 When the second controller of the M controllers contends to be the new master controller, the second controller sends a master control notification message to the first interface card, where the master control notification message is sent. And the second controller is used to notify the first interface card that the second controller has competed for a new primary controller; wherein the first interface card is connected to the second controller.
- Step 704 The first interface card controls, according to the master control notification message, that a port connected to the second controller enters an active state to pass a port connected to the second controller and the second control. Communicate.
- the controller 201 on the left is the main controller 201, when it is out When the fault occurs, the first fault notification message may be sent to the slave controller 201, and then the slave controller 201 competes for the new master controller 201, and the new master controller 201 sends the new master controller 201 to the first interface card.
- the master control notification message after the first interface card receives the master control notification message, activates a port with the new master controller 201 to communicate with the new master controller 201, the specific process It has been described in the introduction of Figures 2 to 5, and will not be described here.
- an embodiment of the present invention provides a controller, which may be the controller 201 in the storage system shown in FIG. 2 to FIG. 4, that is, the flow of FIG. 5-7.
- the controller 201 described above, in particular, the controller 201 may be the second controller described in the flow of Figures 5-7.
- the controller 201 can include an operation module 801 and a communication module 802.
- the operation module 801 is configured to cause the controller 201 to compete for a new main controller 201 when a failure occurs in the M controllers 201 as the first controller of the main controller 201.
- the M controllers 201 include a main controller 201 and M-1 as redundant slave controllers 201, and the M controllers 201 belong to the storage system, and the storage system further includes N interface cards 202, wherein each interface card 202 is coupled to at least two controllers 201 for relaying signals transmitted by the controller 201 or transmitted to the controller 201, or for processing from The signal of the controller 201; N is an integer less than or equal to M.
- the communication module 802 is configured to perform information relaying by using at least one of the N interface cards 202; wherein the first interface card is respectively connected to the first controller and the controller.
- the controller 201 may further include a receiving module, configured to: before the operating module 801 causes the controller 201 to compete for the new primary controller 201, receive the first controller to send The first failure notification message is used to notify the controller 201 that the first controller has a failure.
- the controller 201 may further include the receiving module and the removing module.
- the receiving module is configured to receive a second fault notification message sent by a third controller of the M controllers 201 after the operating module 801 causes the controller 201 to compete for the new master controller 201.
- the second failure notification message is used to notify the controller 201 that the third controller has a fault;
- the removal module is configured to remove information of the third controller from a list of redundant controllers; wherein the redundant controller list is used to record information of each controller 201 that can be redundant.
- an embodiment of the present invention provides an interface card, which may be an interface card 202 in the storage system shown in FIG. 2 to FIG. 4, that is, the flow of FIG. 5-7.
- the interface card 202 described above, in particular, the interface card 202 can be the first interface card described in the flow of Figures 5-7.
- the interface card 202 can include a receiving module 901 and a control module 902.
- the receiving module 901 is configured to receive a second fault notification message sent by the first controller when a first controller of the M controllers 201 is faulty as the primary controller 201, where the second fault notification message is used. Informing the interface card 202 that the first controller has a fault; wherein the interface card 202 is connected to the first controller and served by the first controller in the N interface cards 202. Interface card 202;
- the control module 902 is configured to control, according to the second failure notification message, a port connected to the first controller to enter an inactive state to stop communication with the first controller.
- the receiving module 901 is further configured to: after receiving the second fault notification message sent by the first controller, receive the second controller sent by the M controllers 201
- the master control notification message is used to notify the interface card 202 that the second controller has contend for the new master controller 201;
- the control module 902 is further configured to: according to the master control notification message And controlling a port connected to the second controller to enter an active state to communicate with the second controller through a port connected to the second controller.
- an embodiment of the present invention further provides a storage system, which may be the storage system shown in FIG. 2 to FIG. 4, that is, the storage system described in the flowcharts of FIG. 5-7.
- the storage system can include a first controller, a first interface card, and a second controller.
- the storage system may include multiple controllers 201 and multiple interface cards 202, here only two controllers 201 (ie, the first controller and the second controller) and An interface card 202 (ie, the first interface card) is taken as an example.
- the first controller is configured to send a second fault to the first interface card when the first controller that is the main controller 201 in the M controllers 201 included in the storage system fails
- the second failure notification message is used to notify the first interface card that the first controller has a fault; wherein the first interface card is the N interface cards 202 included in the storage system.
- An interface card connected to the first controller and serving the first controller;
- the first interface card is configured to control, according to the second fault notification message, a port connected to the first controller to enter an inactive state to stop communication with the first controller;
- the second controller is configured to send a master control notification message to the first interface card when the second controller of the M controllers 201 competes for a new master controller 201,
- the master control notification message is used to notify the first interface card, and the second controller has competed for the new primary controller 201; wherein the first interface card is connected to the second controller;
- the first interface card is further configured to control, according to the master control notification message, a port connected to the second controller to enter an active state, by using a port connected to the second controller, and the second control Communicate.
- An embodiment of the present invention provides a storage system, including: M controllers 201 for controlling the system; the M controllers 201 include a main controller 201 and M-1 as redundant slave controls.
- the device 201, M is a positive integer; N interface cards 202, wherein each interface card 202 is connected to at least two controllers 201 for transmitting signals transmitted by the controller 201 or transmitted to the controller 201 Performing a relay, or for processing a signal from the controller 201; N is an integer less than or equal to M.
- an interface card 202 is connected to at least two controllers 201. If one of the controllers 201 connected to an interface card 202 fails, the interface card 202 can stop serving the controller 201. At the same time, the interface card 202 is also connected to other controllers 201, and can continue to serve other controllers 201. In this way, even if the controller 201 fails, the interface card 202 can continue to serve other controllers 201 as long as there is no fault, and can continue to be used. Compared with the prior art, the services in the interface card 202 are not interrupted, and Other hardware connected to the interface card 202 The device can also continue to transmit information through the interface card 202, thereby ensuring the reliability of the system.
- the interface card 202 and the device connected to the interface card 202 can continue to be used, which also saves hardware resources to some extent and improves the utilization of the interface card 202.
- the number of the interface cards 202 can be reduced to some extent, the system structure tends to be simple, and the volume of the system is reduced.
- the disclosed system, apparatus, and method may be implemented in other manners.
- the device embodiments described above are merely illustrative.
- the division of the modules or units is only a logical function division.
- there may be another division manner for example, multiple units or components may be used. Combinations can be integrated into another system, or some features can be ignored or not executed.
- the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
- the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
- the integrated unit is implemented in the form of a software functional unit and sold as a standalone product Or when used, it can be stored in a computer readable storage medium.
- the technical solution of the present application in essence or the contribution to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium.
- a number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) or a processor to perform all or part of the steps of the methods described in various embodiments of the present application.
- the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
Landscapes
- Hardware Redundancy (AREA)
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
Abstract
L'invention concerne un système de stockage utilisé pour résoudre les problèmes techniques lié au fait qu'une interruption de service se produit dans une carte d'interface défectueuse d'un contrôleur et qu'il est impossible de continuer à utiliser un appareil connecté à la carte d'interface. Le système comprend : un nombre M de contrôleurs (201) pour commander le système, le nombre M de contrôleurs (201) comprenant un contrôleur maître (201) et un nombre M-1 de contrôleurs asservis (201) en tant que redondance, et M étant un entier positif ; un nombre N de cartes d'interface (202) qui sont connectées chacune à au moins deux contrôleurs (201), de sorte à transférer des signaux transmis par les contrôleurs (201) ou des signaux transmettant aux contrôleurs (201), ou à traiter des signaux provenant des contrôleurs (201), N étant un entier inférieur ou égal à M. L'invention concerne également un procédé et un appareil correspondant de transmission d'informations et de gestion de défaillance des contrôleurs (201) via les cartes d'interface (202).
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410579922.2A CN104410510B (zh) | 2014-10-24 | 2014-10-24 | 通过接口卡传输信息的方法、装置及系统 |
| CN201410579922.2 | 2014-10-24 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2016062037A1 true WO2016062037A1 (fr) | 2016-04-28 |
Family
ID=52648108
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2015/076658 Ceased WO2016062037A1 (fr) | 2014-10-24 | 2015-04-15 | Procédé, appareil, et système de transmission d'informations, et gestion de défaillance de contrôleur via des cartes d'interface |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN104410510B (fr) |
| WO (1) | WO2016062037A1 (fr) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109542198A (zh) * | 2018-11-20 | 2019-03-29 | 郑州云海信息技术有限公司 | 一种控制pcie卡上电的方法及设备 |
| CN111737062A (zh) * | 2020-06-24 | 2020-10-02 | 浙江大华技术股份有限公司 | 一种备份处理方法、装置及系统 |
| CN112000286A (zh) * | 2020-08-13 | 2020-11-27 | 北京浪潮数据技术有限公司 | 一种四控全闪存储系统及其故障处理方法、装置 |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104410510B (zh) * | 2014-10-24 | 2018-07-03 | 华为技术有限公司 | 通过接口卡传输信息的方法、装置及系统 |
| CN105335101B (zh) * | 2015-09-29 | 2018-11-20 | 浪潮(北京)电子信息产业有限公司 | 一种数据处理方法和系统 |
| CN106059791B (zh) * | 2016-05-13 | 2020-04-14 | 华为技术有限公司 | 一种存储系统中业务的链路切换方法和存储设备 |
| CN106302480B (zh) * | 2016-08-19 | 2019-05-10 | 浪潮(北京)电子信息产业有限公司 | 一种基于ntb硬件和scsi通信协议的通信方法 |
| US11909635B2 (en) * | 2021-03-05 | 2024-02-20 | Juniper Networks, Inc. | Hardware-assisted fast data path switchover for a network device with redundant forwarding components |
| CN114880254B (zh) * | 2022-04-02 | 2025-09-16 | 锐捷网络股份有限公司 | 一种表项读取方法、装置及网络设备 |
| CN115391105B (zh) * | 2022-08-31 | 2025-07-18 | 杭州宏杉科技股份有限公司 | 一种应用于存储设备的存储控制方法和装置 |
| CN115657975B (zh) * | 2022-12-29 | 2023-03-31 | 浪潮电子信息产业股份有限公司 | 一种磁盘数据读写控制方法、相关组件及前端共享卡 |
| CN117439971B (zh) * | 2023-10-10 | 2024-12-13 | 深圳市佳合丰新能源科技有限公司 | 一种地址分配方法、系统、计算机设备及存储介质 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1753376A (zh) * | 2005-10-27 | 2006-03-29 | 杭州华为三康技术有限公司 | 双主控网络设备及其主备切换方法 |
| CN1909559A (zh) * | 2006-08-30 | 2007-02-07 | 杭州华为三康技术有限公司 | 基于快速外围组件互连的接口板及其切换主控板的方法 |
| CN102195845A (zh) * | 2010-03-03 | 2011-09-21 | 杭州华三通信技术有限公司 | 一种实现主控板主备切换的方法、装置和设备 |
| CN203482216U (zh) * | 2013-09-24 | 2014-03-12 | 浙江大华系统工程有限公司 | 一种网络设备 |
| CN104410510A (zh) * | 2014-10-24 | 2015-03-11 | 华为技术有限公司 | 通过接口卡传输信息、控制器故障处理方法、装置及系统 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101068140B (zh) * | 2007-06-27 | 2010-06-16 | 中兴通讯股份有限公司 | 一种实现主/备pci设备切换的装置和方法 |
| CN101252531A (zh) * | 2008-04-02 | 2008-08-27 | 杭州华三通信技术有限公司 | 一种实现负载分担和主备倒换的方法、系统及设备 |
-
2014
- 2014-10-24 CN CN201410579922.2A patent/CN104410510B/zh active Active
-
2015
- 2015-04-15 WO PCT/CN2015/076658 patent/WO2016062037A1/fr not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1753376A (zh) * | 2005-10-27 | 2006-03-29 | 杭州华为三康技术有限公司 | 双主控网络设备及其主备切换方法 |
| CN1909559A (zh) * | 2006-08-30 | 2007-02-07 | 杭州华为三康技术有限公司 | 基于快速外围组件互连的接口板及其切换主控板的方法 |
| CN102195845A (zh) * | 2010-03-03 | 2011-09-21 | 杭州华三通信技术有限公司 | 一种实现主控板主备切换的方法、装置和设备 |
| CN203482216U (zh) * | 2013-09-24 | 2014-03-12 | 浙江大华系统工程有限公司 | 一种网络设备 |
| CN104410510A (zh) * | 2014-10-24 | 2015-03-11 | 华为技术有限公司 | 通过接口卡传输信息、控制器故障处理方法、装置及系统 |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109542198A (zh) * | 2018-11-20 | 2019-03-29 | 郑州云海信息技术有限公司 | 一种控制pcie卡上电的方法及设备 |
| CN109542198B (zh) * | 2018-11-20 | 2022-02-18 | 郑州云海信息技术有限公司 | 一种控制pcie卡上电的方法及设备 |
| CN111737062A (zh) * | 2020-06-24 | 2020-10-02 | 浙江大华技术股份有限公司 | 一种备份处理方法、装置及系统 |
| CN112000286A (zh) * | 2020-08-13 | 2020-11-27 | 北京浪潮数据技术有限公司 | 一种四控全闪存储系统及其故障处理方法、装置 |
| CN112000286B (zh) * | 2020-08-13 | 2023-02-28 | 北京浪潮数据技术有限公司 | 一种四控全闪存储系统及其故障处理方法、装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN104410510A (zh) | 2015-03-11 |
| CN104410510B (zh) | 2018-07-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN104410510B (zh) | 通过接口卡传输信息的方法、装置及系统 | |
| US8127059B1 (en) | Apparatus for interconnecting hosts with storage devices | |
| EP2052326B1 (fr) | Système d'extension sas à isolation de défauts | |
| US20190235465A1 (en) | Backplane-based plc system with hot swap function | |
| US10275373B2 (en) | Hot swappable device and method | |
| KR20210094069A (ko) | 대체 프로토콜 선택 | |
| US12519740B2 (en) | Method to reset switch when controller fault is detected | |
| CN101557379B (zh) | 一种pcie接口的链路重组方法和装置 | |
| CN110419035B (zh) | Usb主机到主机自动切换 | |
| CN108737188B (zh) | 一种网卡故障切换系统 | |
| CN100418047C (zh) | 磁盘阵列装置以及其控制方法 | |
| CN101488105B (zh) | 实现存储双控制器高可用性的方法及存储双控制器系统 | |
| EP2137906B1 (fr) | Communication d'information de configuration sur une liaison d'interconnexion standard | |
| US9116881B2 (en) | Routing switch apparatus, network switch system, and routing switching method | |
| US11061462B2 (en) | Remote terminal apparatus enabled to reset a plug-and-play compatible device even fixedly connected without removing the device from the apparatus, control method thereof, computer system, and non-transitory recording medium | |
| CN111181766B (zh) | 一种冗余fc网络系统及其实现交换机动态配置的方法 | |
| CN103970705A (zh) | 一种采用冗余对称的可热插拔io箱的多路服务器架构设计 | |
| JP6134720B2 (ja) | 接続方法 | |
| JP5176914B2 (ja) | 伝送装置及び冗長構成部の系切替え方法 | |
| JP7746581B2 (ja) | 記憶システム、データ処理方法、および装置 | |
| CN118606117A (zh) | 一种四控互联镜像系统、数据传输方法、设备和介质 | |
| CN120803374B (zh) | 存储系统及存储系统集群 | |
| CN111475440A (zh) | 基于异步传输协议的通讯控制方法及装置、电子设备 | |
| GB2489838A (en) | Processor trace circuit, which shares a bus with the processor being monitored | |
| JP2001086146A (ja) | Fc_alシステムの制御方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15851877 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 15851877 Country of ref document: EP Kind code of ref document: A1 |