[go: up one dir, main page]

WO2025077411A1 - Io expansion architecture, io switch and pcie device - Google Patents

Io expansion architecture, io switch and pcie device Download PDF

Info

Publication number
WO2025077411A1
WO2025077411A1 PCT/CN2024/111690 CN2024111690W WO2025077411A1 WO 2025077411 A1 WO2025077411 A1 WO 2025077411A1 CN 2024111690 W CN2024111690 W CN 2024111690W WO 2025077411 A1 WO2025077411 A1 WO 2025077411A1
Authority
WO
WIPO (PCT)
Prior art keywords
pcie
target
management unit
switch
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/111690
Other languages
French (fr)
Chinese (zh)
Inventor
贾会娟
慈潭龙
徐通
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Metabrain Intelligent Technology Co Ltd
Original Assignee
Suzhou Metabrain Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Metabrain Intelligent Technology Co Ltd filed Critical Suzhou Metabrain Intelligent Technology Co Ltd
Publication of WO2025077411A1 publication Critical patent/WO2025077411A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/10Packet switching elements characterised by the switching fabric construction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/10Packet switching elements characterised by the switching fabric construction
    • H04L49/109Integrated on microchip, e.g. switch-on-chip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/10Packet switching elements characterised by the switching fabric construction
    • H04L49/111Switch interfaces, e.g. port details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/40Constructional details, e.g. power supply, mechanical construction or backplane
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q1/00Details of selecting apparatus or arrangements
    • H04Q1/02Constructional details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q1/00Details of selecting apparatus or arrangements
    • H04Q1/02Constructional details
    • H04Q1/028Subscriber network interface devices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q1/00Details of selecting apparatus or arrangements
    • H04Q1/02Constructional details
    • H04Q1/03Power distribution arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q1/00Details of selecting apparatus or arrangements
    • H04Q1/02Constructional details
    • H04Q1/035Cooling of active equipments, e.g. air ducts
    • HELECTRICITY
    • H05ELECTRIC TECHNIQUES NOT OTHERWISE PROVIDED FOR
    • H05KPRINTED CIRCUITS; CASINGS OR CONSTRUCTIONAL DETAILS OF ELECTRIC APPARATUS; MANUFACTURE OF ASSEMBLAGES OF ELECTRICAL COMPONENTS
    • H05K7/00Constructional details common to different types of electric apparatus
    • H05K7/20Modifications to facilitate cooling, ventilating, or heating
    • H05K7/20009Modifications to facilitate cooling, ventilating, or heating using a gaseous coolant in electronic enclosures
    • H05K7/20136Forced ventilation, e.g. by fans
    • H05K7/20172Fan mounting or fan specifications
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • each of the inner connector and the outer connector is respectively connected to an ID definition chip, and the ID definition chip is used to perform ID definition on the corresponding inner connector or the outer connector;
  • the baseboard management controller and the task management unit are specifically used to send corresponding configuration information to each of the PCIe Switch chips respectively when the PCIe interconnection topology is identified;
  • the configuration information includes type information of all connection interfaces of the PCIe Switch chip;
  • the connection interface includes an interface for connecting to the internal connector or the external connector;
  • the type information includes: uplink interface information, downlink interface information and internal interconnection interface information;
  • the uplink interface information indicates that the corresponding connection interface is an uplink interface,
  • the upstream interface is used to transmit PCIe signals to the PCIe device connected upstream;
  • the downstream interface information indicates that the corresponding connection interface is a downstream interface, and the downstream interface is used to transmit the PCIe signal to the PCIe device connected downstream;
  • the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips interconnected with the current PCIe Switch chip;
  • the PCIe device connected upstream
  • the PCIe Switch chip is specifically used to transmit the PCIe signal to a preset message queue when the task processing status fed back by all the target PCIe devices is in the working state; and when any of the target PCIe devices is monitored to be in the idle state, based on the message queue, transmit the PCIe signal to the target PCIe device in the idle state.
  • the logic control unit is further used to send a power switch signal to the power rail unit to control the power rail to start or stop supplying power.
  • the first preset number of internal connectors is distributed on a first side of the corresponding extended baseboard
  • the second preset number of external connectors is distributed on a second side of the corresponding extended baseboard
  • the first side and the second side are opposite sides.
  • the switch connector and the external connector are the same type of connector.
  • the present application also provides a PCIe fusion architecture system, including:
  • the PCIe signal is sent to any of the target PCIe devices whose task processing state is an idle state.
  • the present application also provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the program, the PCIe signal fusion management method as described in any one of the above is implemented.
  • the IO expansion architecture, IO switch and PCIe device provided by the present application are provided with at least one IO expansion unit, the IO expansion unit includes an expansion substrate, the expansion substrate is provided with at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; one of the internal connectors One end of the internal connector is connected to any PCIe Switch chip on the expansion baseboard, and the other end of the internal connector is used for interconnection between PCIe Switch chips; one end of the external connector is connected to any PCIe Switch chip on the expansion baseboard, and the other end of the external connector is used to connect PCIe devices to complete the interconnection between multiple PCIe devices.
  • FIG1 is a schematic diagram of the structure of an embodiment of an IO expansion architecture provided by the present application.
  • FIG2 is a schematic diagram of the structure of an embodiment of an IO switch provided by the present application.
  • FIG3 is a schematic diagram of the structure of another embodiment of an IO switch provided by the present application.
  • FIG4 is a schematic diagram of a structure of an embodiment of a PCIe interconnection topology in a PCIe converged architecture system provided by the present application;
  • FIG5 is a flow chart of an embodiment of a PCIe signal fusion management method provided by the present application.
  • FIG. 6 is a schematic diagram of the structure of an electronic device provided in the present application.
  • 340-PSU power supply unit 350-HVDC power supply unit.
  • the IO expansion unit includes an expansion base plate 110, the expansion base plate has at least Two PCIe Switch chips (simplified as SW in FIG. 1 ), a first preset number of internal connectors 130 , and a second preset number of external connectors 140 .
  • the number of the IO expansion units can be set according to actual conditions, such as 1, 2, 3 or 4.
  • PCIe adopts a point-to-point working mode.
  • the expansion of PCIe resources can be achieved, thereby facilitating communication between multiple PCIe devices.
  • the first preset number can be determined according to the number of PCIe Switch chips 120 in the IO expansion architecture. For example: Assume that there are four IO expansion units in the IO expansion architecture, and each IO expansion unit includes two PCIe Switch chips 120, that is, there are a total of 8 PCIe Switch chips 120. Then, in order to achieve the interconnection between the 8 PCIe Switch chips 120, 32 internal connectors 130 need to be set to achieve the interconnection between the 8 PCIe Switch chips 120. Among them, each PCIe Switch chip 120 corresponds to 4 internal connectors 130.
  • the second preset number can also be set according to actual conditions, such as 8, 10, etc.
  • the preferred value of the second preset number is 10, that is, 10 external connectors 140 are set on each expansion base plate 110, wherein each PCIe Switch chip 120 corresponds to 5 corresponding external connectors 140, so as to expand as many PCIe resources as possible and connect as many PCIe devices as possible within the range of standard size.
  • the IO expansion architecture in the above embodiment can better support the extraction of a large or sufficient amount of PCIe resources or IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system, has low cost and strong feasibility.
  • the CDFP connector is used not only for receiving and sending PCIe signals (i.e., receiving PCIe signals sent by the PCIe device to which it is connected, and sending PCIe signals to the PCIe device to which it is connected), but also for sending Send multiple preset signals to meet different needs.
  • the CDFP connector also sends a VPP (Virtual Pin Port, Virtual Pin Interface)_I2C (Inter-Integrated Circuit, Integrated Circuit Bus) signal and an Alert (reminder) signal to the corresponding PCIe Switch chip 120, and the VPP I2C signal and the Alert signal are used for hot plugging of hard disks in the SSD resource pool.
  • VPP I2C signal can be used for hot plug PCIe management.
  • Each IO expansion unit has an expansion base plate 110, and four expansion base plates 110 are stacked in sequence.
  • Each expansion base plate 110 is provided with two PCIe Switch chips 120, 8 internal connectors 130, and 10 external connectors 140.
  • PCIe Switch chips 120, 8 internal connectors 130, and 10 external connectors 140 By setting two PCIe Switch chips 120, 8 internal connectors 130, and 10 external connectors 140 on each expansion base plate 110, sufficient IO resources can be drawn out, which is helpful for the construction of a converged architecture.
  • one end of each of the four internal connectors 130 is connected to a corresponding PCIe Switch chip 120 on the same expansion base plate 110, and the other end is connected to the internal connectors 130 of other IO expansion units.
  • each of the five external connectors 140 is connected to a corresponding PCIe Switch chip 120 on the same expansion base plate 110, and the other end is used to connect to an external PCIe device to complete the interconnection of multiple PCIe devices and form a PCIe interconnection topology.
  • the IO expansion architecture in this example can support the extraction of 40 IO resources. It can meet the construction requirements of complex topologies in the converged architecture system, is more convenient to implement, and has higher feasibility.
  • the IO expansion unit is also used to connect to a preset ID management unit 210.
  • the internal connector 130 and the external connector 140 in the IO expansion unit are respectively used to connect to the ID management unit 210.
  • the external connector 140 in the IO expansion unit is used to access a PCIe device to obtain a PCIe signal.
  • the PCIe signal is transmitted to the PCIe Switch chip 120.
  • the ID management unit 210 sends the ID signal to a preset control module 220, including: the ID management unit 210 sends the ID signal to a multiplexer 221 (MUX, Multiplex) preset in the control module 220, the multiplexer 221 determines the target management unit based on the target management unit control instruction sent by the logic control unit 222, and sends all the received PCIe signals to the target management unit.
  • a multiplexer 221 MUX, Multiplex
  • the IO expansion architecture in the above embodiment is provided with at least one IO expansion unit, wherein the IO expansion unit includes an expansion base plate 110, wherein the expansion base plate is provided with at least two PCIe Switch chips (simplified as SW in FIG. 1 ), a first preset number of internal connectors 130, and a second preset number of external connectors 140.
  • the IO expansion unit includes an expansion base plate 110, wherein the expansion base plate is provided with at least two PCIe Switch chips (simplified as SW in FIG. 1 ), a first preset number of internal connectors 130, and a second preset number of external connectors 140.
  • this embodiment further provides an IO switch, including:
  • the ID management unit 210 the control module 220, and the IO expansion architecture as described above.
  • the control module 220 is used to manage all the PCIe Switch chips 120 in the IO expansion architecture based on all the ID signals.
  • the IO switch in this embodiment has good IO resource expansion and sharing capabilities, sufficient system robustness and flexibility, and can maintain IO exchanges between many PCIe devices in the entire cabinet. It can better meet the construction requirements of complex topologies in the converged architecture system, and provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system at a low cost.
  • the ID management unit 210 in the IO switch, it is possible to collect the ID signals of all the internal connectors 130 and the external connectors 140 in the IO expansion architecture, and send the collected ID signals to the external connectors 140.
  • the IO switch in this embodiment can not only support the extraction of a large or sufficient amount of IO resources, but also better meet the construction requirements of complex topologies in the converged architecture system, with high flexibility, providing the possibility of engineering implementation for the construction of complex topologies in the converged architecture system, with low cost, strong feasibility and high degree of automation.
  • each of the inner connector 130 and the outer connector 140 is connected to an ID definition chip, and the ID definition chip is used to perform ID definition on the corresponding inner connector 130 or the outer connector 140;
  • the ID definition chip is also used to receive the ID query instruction, and based on the ID query instruction, feed back the ID signal to the corresponding internal connector 130 or the external connector 140;
  • the internal connector 130 or the external connector 140 feeds the ID signal back to the ID management unit 210 .
  • the ID management unit 210 is connected to each of the external connectors 140 , the internal connectors 130 , and the control module 220 , respectively.
  • each of the ID definition chips has the same I2C (Inter-Integrated Circuit) address and is in exactly the same I2C channel. That is, during the ID signal acquisition process, the ID management unit 210 does not need to switch the access address multiple times, which speeds up the recognition speed, reduces the program complexity, has strong feasibility, low cost, and high recognition efficiency.
  • I2C Inter-Integrated Circuit
  • control module 220 includes:
  • the logic control unit 222 is used to monitor the operating status of the baseboard management controller 225 and all the task management units; based on the operating status and the task management priorities corresponding to the baseboard management controller 225 and the task management units, determine that one of the baseboard management controller 225 and all the task management units is a target management unit; based on the determined target management unit, generate a target management unit instruction; and send the target management unit instruction to the multiplexer 221;
  • the multiplexer 221 is further used to determine the target management unit based on the target management unit instruction and send the ID signal to the target management unit;
  • the baseboard management controller 225 and at least one task management unit are used to perform PCIe topology identification based on the ID signal when receiving the ID signal to obtain a PCIe interconnection topology; perform PCIe topology identification on all the PCIe Switch chips 120 based on the PCIe interconnection topology. Bank management;
  • the ID management unit 210 is connected to the multiplexer 221, and the multiplexer 221 is respectively connected to the logic control unit 222, the baseboard management controller 225, and all the task management units.
  • the ID management unit 210 and the multiplexer 221 can perform signal transmission or communication based on any one of USB (Universal Serial Bus), I2C and UART (Universal Asynchronous Receiver/Transmitter).
  • signal transmission or communication can be performed between the multiplexer 221 and the logic control unit 222, between the multiplexer 221 and the baseboard management controller 225, and between the multiplexer 221 and each of the task management units based on any one of USB, I2C and UART.
  • corresponding communication protocols or communication methods can be adopted to meet different communication requirements and support functional diversification.
  • the multiplexer 221 receives the PCIe signal transmitted by the ID management unit 210, and sends the PCIe signal to the target management unit.
  • the determination of the target management unit is determined by the logic control unit 222. That is, the logic control unit 222 monitors the operating status of the baseboard management controller 225 and all the task management units in real time; based on the operating status and the task management priorities corresponding to the baseboard management controller 225 and the task management unit, one of the baseboard management controller 225 and all the task management units is determined as the target management unit; based on the determined target management unit, a target management unit instruction is generated; the target management unit instruction is sent to the multiplexer 221; the multiplexer 221 determines the target management unit based on the target management unit instruction, and sends the PCIe signal to the target management unit.
  • the baseboard management controller 225 and all the task management units in this embodiment can all realize the function of managing all the PCIe Switch chips 120. From this functional point of view, the baseboard management controller 225 and all the task management units are redundant. The above embodiment can maximize the reliability of the IO switch and avoid unnecessary losses caused by the failure of a single signal distribution functional unit by determining one of the baseboard management controller 225 and all the task management units as the target management unit.
  • the management priority of all the PCIe Switch chips 120 lies mainly in the task management unit, that is, when all the task management units fail, that is, when all the task management units are in abnormal operation state, the baseboard management controller 225 is determined as the target management unit, and the baseboard management controller 225 as the target management unit performs subsequent signal allocation and processing.
  • the baseboard management controller 225 and all the task management units communicate with each other to achieve real-time data exchange and backup, thereby greatly improving the stability and reliability of the IO switch.
  • the task management unit refers to a CPU (Central Processing Unit) management unit.
  • CPU Central Processing Unit
  • At least one of the task management units includes: a first task management unit 223 and a second task management unit 224. That is, a first CPU management unit and a second CPU management unit. By setting the first CPU management unit and the second CPU management unit, redundancy of the task management unit is formed, which helps to improve the stability and reliability of the IO switch.
  • the first CPU management unit and the second CPU management unit are both buckle cards that comply with the COMe (COM Express) specification, which is convenient for subsequent processor upgrades.
  • each of the task management units supports a PCIe network card, a PCIe NVMe (Non-Volatile Memory express, non-volatile memory host controller interface specification) hard disk or SATA (Serial Advanced Technology Attachment, Serial ATA) hard disk, a PCIe debug interface (for PCIe Switch chip 120) and other IO resources.
  • PCIe network card a PCIe NVMe (Non-Volatile Memory express, non-volatile memory host controller interface specification) hard disk or SATA (Serial Advanced Technology Attachment, Serial ATA) hard disk
  • SATA Serial Advanced Technology Attachment, Serial ATA
  • PCIe debug interface for PCIe Switch chip 120
  • the logic control unit 222 is specifically used to determine the first task management unit 223 as the target management unit when the first task management unit 223 is in a normal operating state; determine the second task management unit 224 as the target management unit when the first task management unit 223 is in an abnormal operating state and the second task management unit 224 is in a normal operating state; determine the baseboard management controller 225 as the target management unit when both the first task management unit 223 and the second task management unit 224 are in an abnormal operating state and the baseboard management controller 225 is in a normal operating state. That is, the task management priority in this embodiment is the first task management unit 223-the second task management unit 224-the baseboard management controller 225.
  • the first task management unit 223, the second task management unit 224 and the baseboard management controller 225 are task-switched, that is, which unit or controller is responsible for receiving ID signals, performing PCIe interconnection topology identification, and managing all the PCIe Switch chips 120.
  • the stability and reliability of the IO switch can be well guaranteed, and the flexibility is high.
  • the logic control unit 222 can well realize the redundant switching, power-on and power-off timing and reset logic control in the IO switch, and can accurately guarantee the operation of the IO switch.
  • the second task management unit 224 is determined as the target management unit.
  • an attempt is made to restart the first task management unit 223. If the restart is completed, the multiplexer 221 will be notified to switch the control to the first task management unit 223, that is, the first task management unit 223 will be re-determined as the target management unit.
  • the baseboard management controller 225 will be determined as the target management unit.
  • the number of task management units can be increased or decreased according to actual conditions to meet different actual needs.
  • the baseboard management controller 225 and the task management unit are specifically used to send corresponding configuration information to each PCIe Switch chip when identifying the PCIe interconnection topology;
  • the configuration information includes type information of all connection interfaces of the PCIe Switch chip;
  • the connection interface includes an interface for connecting to the internal connector or the external connector;
  • the type information includes: uplink interface information, downlink interface information and internal interconnection interface information;
  • the uplink interface information indicates that the corresponding connection interface is an uplink interface, and the uplink interface is used to transmit PCIe signals to the PCIe device connected upstream;
  • the downlink interface information indicates that the corresponding connection interface is a downlink interface, and the downlink interface is used to transmit the PCIe signal to the PCIe device connected downstream;
  • the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips interconnected with the current PCIe Switch chip;
  • the uplink PCIe device
  • the baseboard management controller 225 receives the ID signal, performs topology identification based on all the ID signals, and obtains the PCIe interconnection topology.
  • the configuration information includes type information of all connection interfaces of the PCIe Switch chip 120;
  • the connection interface includes an interface for connecting to the internal connector or the external connector;
  • the type information includes: uplink interface information, downlink interface information and internal interconnection interface information;
  • the uplink interface information indicates that the corresponding connection interface is an uplink interface, and the uplink interface is used to transmit PCIe signals to the PCIe device connected upstream;
  • the downlink interface information indicates that the corresponding connection interface is a downlink interface, and the downlink interface is used to transmit the PCIe signal to the PCIe device connected downstream;
  • the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PC
  • the task management unit receives the ID signal, performs topology identification based on all the ID signals, and obtains the PCIe interconnection topology.
  • the PCIe interconnection topology is identified, corresponding configuration information is sent to each PCIe Switch chip 120; the configuration information includes type information of all connection interfaces of the PCIe Switch chip 120; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information, and internal interconnection interface information; the uplink interface information refers to the corresponding connection interface information.
  • the PCIe Switch chip 120 is used to obtain the target device type corresponding to the PCIe signal when receiving the PCIe signal sent by the PCIe device; based on the target device type, determine at least one target interface corresponding to the target device type from all the connection interfaces of the current PCIe Switch chip 120, and determine that the PCIe device connected to the target interface is the target PCIe device; send a status query instruction to the target PCIe device through the target interface; receive the task processing status fed back by each of the target PCIe devices, the task processing status being fed back by the target PCIe device based on the status query instruction; and send the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.
  • each PCIe Switch chip 120 of the IO expansion unit is provided with a network interface for realizing network connection.
  • the network management unit 310 is also externally connected to a plurality of network interfaces, which can be used to access any device among the baseboard management controller 225, the task management unit, and the PCIe Switch chip 120.
  • the network interface can be an RJ45 (Registered Jack 45, a type of connector) network interface.
  • the baseboard management controller 225 and the task management unit both transmit configuration information to the IO expansion unit through the network management unit 310 .
  • a power rail unit 330 the input end of the power rail unit 330 is used to connect to the power supply unit, and the output end of the power rail unit 330 is respectively connected to the IO expansion unit, the ID management unit 210, the baseboard management controller 225 and the task management unit, and is used to supply power to the IO expansion unit, the ID management unit 210, the baseboard management controller 225 and the task management unit.
  • the power rail unit 330 is also used to supply power to the heat dissipation unit 320. Specifically, a 12V power supply can be used to supply power to the heat dissipation unit 320.
  • the power supply unit includes: a PSU (Power Supply Unit, power supply, alternating current) power supply unit 340 and an HVDC (High Voltage Direct Current, high voltage direct current) power supply unit 350.
  • a PSU Power Supply Unit, power supply, alternating current
  • HVDC High Voltage Direct Current, high voltage direct current
  • the PSU power supply unit 340 is connected to an external alternating current (AC) power supply
  • the HVDC power supply unit 350 is connected to an external direct current (DC) power supply (DC power supply).
  • DC power supply direct current
  • the external power supply is converted into 12V used inside the IO switch to meet different power supply scenarios.
  • the above-mentioned AC power supply can be 220V, etc.
  • the DC power supply can be 380V, etc.
  • the logic control unit 222 is further configured to send a power switch signal (PWREN) to the power rail unit 330 to control the power rail to start or stop supplying power.
  • PWREN power switch signal
  • the first preset number of the inner connectors 130 are distributed on the first side of the corresponding extension base 110
  • the second preset number of the outer connectors 140 are distributed on the second side of the corresponding extension base 110, and the first side and the second side are opposite sides.
  • the IO switch in the above embodiment can realize the extraction of a large number or sufficient IO resources to connect upstream and downstream PCIe devices by adopting a highly flexible IO expansion architecture.
  • a multiplexer 221, a logic control unit 222, a first task management unit 223 and a second task management unit 224 the stability and reliability of the IO switch operation can be better guaranteed, with low cost, high feasibility, and high density, robustness and flexibility. It has strong system robustness to maintain the stability of the IO switch system and ensure IO data exchange between many devices in the entire cabinet. It has flexible scalability and is compatible with different power supply methods and different forms of PCIe Switch interconnection topologies. flutter.
  • the PCIe device provided in the present application is described below.
  • the PCIe device described below and the IO switch described above can refer to each other.
  • This embodiment further provides a PCIe device, including: a switch connector, the switch connector is used to connect to the external connector 140 in the above-mentioned IO expansion architecture, or to connect to the external connector 140 in any one of the above-mentioned IO switches.
  • the PCIe device in this embodiment can better realize the interaction and intercommunication between different PCIe resources by connecting to the external connector 140 in the above-mentioned IO expansion architecture, or to connect to the external connector 140 in any one of the above-mentioned IO switches, and has high flexibility.
  • the switch connector is the same type of connector as the external connector 140.
  • the switch connector is a CDFP connector.
  • the PCIe device is a host, a graphics processing unit (GPU), a solid state drive (SSD), or a memory.
  • GPU graphics processing unit
  • SSD solid state drive
  • PCIe fusion architecture system provided by the present application is described below.
  • This embodiment also provides a PCIe fusion architecture system, including:
  • the IO switch as described in any one of the above, and the multiple PCIe devices as described in any one of the above, the multiple PCIe devices are respectively connected to the corresponding external connector 140.
  • the PCIe converged architecture system in this embodiment can better support the extraction of a large number or sufficient IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, strong feasibility, and low cost.
  • FIG4 eight PCIe Switch chips 120 are exemplarily divided into four upper and four lower chips, and the external connectors 140 of the four upper PCIe Switch chips 120 are connected to the host (such as Host0...Host4, Host5...Host9, Host10...Host14, Host15...Host19 in FIG4 ). And the external connectors 140 of the four lower PCIe Switch chips 120 are connected to the solid state drive, graphics processor or memory (such as Device 0...Device4, Device5...Device9, Device10...Device14, Device15...Device19 in FIG4 ), that is, the Device in FIG4 can be a solid state drive, a graphics processor or a memory.
  • the host such as Host0...Host4, Host5...Host9, Host10...Host14, Host15...Host19 in FIG4
  • the external connectors 140 of the four lower PCIe Switch chips 120 are connected to the solid state drive, graphics processor or memory (such as Device 0.
  • a PCIe fusion architecture system In the Host-IO switch-Device mode, a PCIe fusion architecture system is built, which has strong real-time performance and better meets the construction requirements of complex topologies. It can schedule and allocate many GPU resources, SSD resources and Memory resources between different hosts. It should be mentioned that in the specific implementation process, other topological forms can also be used for topological construction.
  • each PCIe Switch chip 120 is connected to five external connectors 140 and three or four internal connectors 130.
  • Each internal connector 130 is used to connect to the internal connector 130 of other PCIe Switch chips 120 to form interconnections between the PCIe Switch chips 120.
  • Each external connector 140 is connected to a PCIe device, which may be a host, a solid-state drive, a graphics processor, or a memory.
  • the six PCIe Switch chips 120 may be divided into three upper and three lower ones, and the external connectors 140 of the three upper PCIe Switch chips 120 may be connected to the host.
  • the external connectors 140 of the three lower PCIe Switch chips 120 may be connected to a solid-state drive, a graphics processor, or a memory.
  • This may enable the construction of a converged architecture or a PCIe interconnection topology based on six PCIe Switch chips 120. It may be possible to schedule and allocate a number of GPU resources, SSD resources, and Memory resources between different Hosts to meet different converged architecture construction requirements.
  • each PCIe Switch chip 120 is connected to five external connectors 140 and two, three or four internal connectors 130.
  • Each internal connector 130 is used to connect to the internal connectors 130 of other PCIe Switch chips 120 to form an interconnection between the PCIe Switch chips 120.
  • Each external connector 140 is connected to a PCIe device, which can be a host, a solid state drive, a graphics processor or a memory.
  • the four PCIe Switch chips 120 can be divided into two upper and two lower ones, and the external connectors 140 of the two upper PCIe Switch chips 120 are connected to the host.
  • the external connectors 140 of the two lower PCIe Switch chips 120 are connected to the solid state drive, the graphics processor or the memory. This enables the construction of a fusion architecture or PCIe interconnection topology based on four PCIe Switch chips 120, with high flexibility.
  • Each PCIe Switch chip 120 is connected to five external connectors 140 and one internal connector 130.
  • Each PCIe Switch chip 120 is connected to the internal connector 130 of other PCIe Switch chips 120 through its own internal connector 130 to form an interconnection between the two PCIe Switch chips 120.
  • Each external connector 140 is connected to a PCIe device, which can be a host, a solid state drive, a graphics processor or a memory. Connect the external connector 140 of one PCIe Switch chip 120 to the host. And connect the external connector 140 of another PCIe Switch chip 120 to the solid state drive, the graphics processor or the memory. This enables construction of a fusion architecture or PCIe interconnection topology based on two PCIe Switch chips 120, with high flexibility and low cost.
  • the PCIe converged architecture system in the above embodiment can better support the extraction of a large or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, and provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system. It has high density, robustness and flexibility, and can meet the application requirements of different scenarios.
  • the PCIe signal fusion management method provided in the present application is described below.
  • the PCIe signal fusion management method described below and the PCIe fusion architecture system described above can be referenced to each other.
  • this embodiment further provides a PCIe signal fusion management method based on the PCIe fusion architecture system as described above, including:
  • S510 Receive a PCIe signal sent by a PCIe device.
  • S530 Based on the target device type, determine at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determine that the PCIe device connected to the target interface is the target PCIe device.
  • S540 Send a status query instruction to the target PCIe device through the target interface.
  • S550 Receive a task processing status fed back by each of the target PCIe devices, where the task processing status information is fed back by the target PCIe device based on the status query instruction.
  • S560 Send the PCIe signal to any target PCIe device whose task processing state is idle. This method can improve the transmission efficiency of PCIe signals in a PCIe converged architecture system, has high flexibility, strong feasibility and low cost.
  • the PCIe signal fusion management method further includes: when the task processing status fed back by all the target PCIe devices is in a working state, transmitting the PCIe signal to a preset message queue; when any of the target PCIe devices is monitored to be in an idle state, based on the message queue, transmitting the PCIe signal to the target PCIe device in an idle state.
  • the PCIe signal fusion management method in the above embodiment is based on the PCIe fusion architecture system in the above embodiment.
  • the PCIe fusion architecture system can better support the extraction of a large or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in the fusion architecture system, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in the fusion architecture system, has low cost, and has both high density, robustness and flexibility.
  • the IO expansion architecture, IO switch and PCIe device provided by the present application are provided with at least one IO expansion unit, the IO expansion unit includes an expansion substrate, the expansion substrate is provided with at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; one end of the internal connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the internal connector is used for interconnection between PCIe Switch chips; one end of the external connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the external connector is used to connect PCIe devices to complete the interconnection between multiple PCIe devices.
  • FIG6 illustrates a schematic diagram of the physical structure of an electronic device.
  • the electronic device may include: a processor 610, a communications interface 620, a memory 630, and a communications bus 640, wherein the processor 610, the communications interface 620, and the memory 630 communicate with each other via the communications bus 640.
  • the processor 610 may call the logic instructions in the memory 630 to execute a PCIe signal fusion management method, the method comprising: receiving a PCIe signal sent by a PCIe device; obtaining a target device type corresponding to the PCIe signal; based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining the PCIe device connected to the target interface as a target PCIe device; sending a status query instruction to the target PCIe device through the target interface; receiving the task processing status fed back by each of the target PCIe devices, wherein the task processing status information is fed back by the target PCIe device based on the status query instruction; sending the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.
  • the logic instructions in the above-mentioned memory 630 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product.
  • the technical solution of the present application can be essentially or partly embodied in the form of a software product that contributes to the relevant technology.
  • the computer software product is stored in a storage medium, including several instructions to enable a computer device (which can be a personal computer, server, or network device, etc.) to perform all or part of the steps of the method described in each embodiment of the present application.
  • the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), disk or optical disk and other media that can store program codes.
  • the present application also provides a non-transitory computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, is implemented to execute the PCIe signal fusion management method provided by the above methods, the method comprising: receiving a PCIe signal sent by a PCIe device; obtaining a target device type corresponding to the PCIe signal; based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining that the PCIe device connected to the target interface is a target PCIe device; sending a status query instruction to the target PCIe device through the target interface; receiving a task processing status fed back by each of the target PCIe devices, the task processing status information being fed back by the target PCIe device based on the status query instruction; and sending the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of this embodiment. Ordinary technicians in this field can understand and implement it without paying creative labor.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Power Engineering (AREA)
  • Physics & Mathematics (AREA)
  • Thermal Sciences (AREA)
  • Debugging And Monitoring (AREA)
  • Information Transfer Systems (AREA)
  • Multi Processors (AREA)

Abstract

The present application relates to the technical field of computers, and provides an IO expansion architecture, an IO switch and a PCIe device. The IO expansion architecture comprises: at least one IO expansion unit, wherein the IO expansion unit comprises an expansion substrate, and the expansion substrate is provided with at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; one end of each internal connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the internal connector is used for interconnection between the PCIe Switch chips; and one end of each external connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the external connector is used for being connected to the PCIe device so as to complete interconnection between the plurality of PCIe devices. Extraction of a large amount of or sufficient amount of IO resources can be well supported, and the construction requirements of complex topologies in a fusion architecture system are better met.

Description

IO拓展架构、IO交换机及PCIe设备IO expansion architecture, IO switches and PCIe devices

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本申请要求于2023年10月10日提交中国专利局,申请号为202311304011.4,申请名称为“IO拓展架构、IO交换机及PCIe设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application filed with the China Patent Office on October 10, 2023, with application number 202311304011.4, and application name “IO expansion architecture, IO switch and PCIe device”, all contents of which are incorporated by reference in this application.

技术领域Technical Field

本申请涉及计算机技术领域,尤其涉及一种IO拓展架构、IO交换机及PCIe设备。The present application relates to the field of computer technology, and in particular to an IO expansion architecture, an IO switch and a PCIe device.

背景技术Background Art

随着计算机技术的飞速发展,AI模型越做越大,使得计算资源池化受到了广泛的关注,融合架构系统也因此应运而生。在融合架构系统中,Host(主机)、IO(Input/Output,输入/输出)、GPU(Graphic Processing Unit,图形处理器)、SSD(Solid State Drive,固态硬盘)、Memory(内存)等均被解耦为独立的模块,成为了专门的Host资源池、IO资源池、GPU资源池、SSD资源池、Memory资源池等,以便实现设备的快速拓展和升级。其中,IO资源池即指IO交换机。为了实现对大量PCIe设备资源,如Host资源、GPU资源、SSD资源和Memory资源等的融合管理,IO交换机起着至关重要的作用。With the rapid development of computer technology, AI models are getting bigger and bigger, which has attracted widespread attention to the pooling of computing resources, and the converged architecture system has come into being. In the converged architecture system, Host, IO (Input/Output), GPU (Graphic Processing Unit), SSD (Solid State Drive), Memory, etc. are all decoupled into independent modules, becoming dedicated Host resource pools, IO resource pools, GPU resource pools, SSD resource pools, Memory resource pools, etc., in order to achieve rapid expansion and upgrading of equipment. Among them, the IO resource pool refers to the IO switch. In order to achieve the integrated management of a large number of PCIe device resources, such as Host resources, GPU resources, SSD resources and Memory resources, the IO switch plays a vital role.

现有的IO交换机,通常在交换机基板上设置固定数量(如8个等)的PCIe(Peripheral Component Interconnect Express,高速串行计算机扩展总线标准)插槽,利用标准全高/半高HBA(Host Bus Adapter,主机总线适配器)从PCIe插槽中将IO资源引出,再利用引出的IO资源进行对PCIe设备资源进行接入和接出管理。然而,此种方式无法较好地支持大量的IO资源的引出,无法较好地满足融合架构系统中复杂拓扑的构建需求,且需要利用HBA进行协议转换,实施较麻烦,成本较高。Existing IO switches usually set a fixed number (such as 8) of PCIe (Peripheral Component Interconnect Express, a high-speed serial computer expansion bus standard) slots on the switch substrate, use standard full-height/half-height HBA (Host Bus Adapter) to lead IO resources from the PCIe slot, and then use the led IO resources to access and manage PCIe device resources. However, this method cannot well support the lead-out of a large number of IO resources, cannot well meet the construction requirements of complex topologies in converged architecture systems, and requires the use of HBA for protocol conversion, which is cumbersome to implement and has high costs.

发明内容Summary of the invention

本申请提供一种IO拓展架构、IO交换机及PCIe设备,用以解决相关技术中无法较好地支持大量的IO资源的引出,无法较好地满足融合架构系统中复杂拓扑的构建需求的问题。The present application provides an IO expansion architecture, an IO switch and a PCIe device to solve the problem that the related technology cannot well support the extraction of a large number of IO resources and cannot well meet the construction requirements of complex topologies in a converged architecture system.

本申请提供一种IO拓展架构,包括:This application provides an IO expansion architecture, including:

至少一个IO拓展单元,所述IO拓展单元包括拓展基板,所述拓展基板设有至少两个PCIe Switch芯片、第一预设数量的对内连接器,以及第二预设数量的对外连接器; At least one IO expansion unit, the IO expansion unit comprising an expansion base plate, the expansion base plate having at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors;

所述对内连接器的一端与所述拓展基板上任一个所述PCIe Switch芯片连接,所述对内连接器的另一端用于所述PCIe Switch芯片之间的互联;所述对外连接器的一端与所述拓展基板上任一个所述PCIe Switch芯片连接,所述对外连接器的另一端用于连接PCIe设备,以完成多个所述PCIe设备之间的互联。One end of the internal connector is connected to any one of the PCIe Switch chips on the extension baseboard, and the other end of the internal connector is used for interconnection between the PCIe Switch chips; one end of the external connector is connected to any one of the PCIe Switch chips on the extension baseboard, and the other end of the external connector is used to connect a PCIe device to complete the interconnection between multiple PCIe devices.

本申请还提供一种IO交换机,包括:The present application also provides an IO switch, comprising:

ID管理单元、控制模块、以及如上述所述的IO拓展架构;ID management unit, control module, and IO expansion architecture as described above;

所述ID管理单元用于采集所述IO拓展架构中全部所述对内连接器和所述对外连接器的ID信号,将所述ID信号传输至所述控制模块;The ID management unit is used to collect ID signals of all the internal connectors and the external connectors in the IO expansion architecture, and transmit the ID signals to the control module;

所述控制模块用于基于全部所述ID信号,对所述IO拓展架构中的全部所述PCIe Switch芯片进行管理。The control module is used to manage all the PCIe Switch chips in the IO expansion architecture based on all the ID signals.

根据本申请提供的一种IO交换机,每个所述对内连接器和所述对外连接器均各自连接有一个ID定义芯片,所述ID定义芯片用于对对应的所述对内连接器或所述对外连接器进行ID定义;According to an IO switch provided by the present application, each of the inner connector and the outer connector is respectively connected to an ID definition chip, and the ID definition chip is used to perform ID definition on the corresponding inner connector or the outer connector;

所述ID管理单元具体用于通过所述对外连接器或所述对内连接器,向每个所述ID定义芯片发送ID查询指令;The ID management unit is specifically used to send an ID query instruction to each of the ID definition chips through the external connector or the internal connector;

所述ID定义芯片还用于接收所述ID查询指令,基于所述ID查询指令,向对应的所述对内连接器或所述对外连接器反馈所述ID信号;The ID definition chip is also used to receive the ID query instruction, and based on the ID query instruction, feed back the ID signal to the corresponding internal connector or the external connector;

所述对内连接器或所述对外连接器在接收到所述ID信号的情况下,将所述ID信号反馈至所述ID管理单元;When receiving the ID signal, the internal connector or the external connector feeds back the ID signal to the ID management unit;

所述ID管理单元分别与每个所述对外连接器、所述对内连接器和所述控制模块连接。The ID management unit is connected to each of the external connectors, the internal connectors and the control module respectively.

根据本申请提供的一种IO交换机,所述控制模块包括:According to an IO switch provided by the present application, the control module includes:

多路选择器,用于接收所述ID管理单元传输的所述ID信号;A multiplexer, used for receiving the ID signal transmitted by the ID management unit;

逻辑控制单元,用于监测基板管理控制器和全部任务管理单元的运行状态;基于所述运行状态、以及所述基板管理控制器和所述任务管理单元各自对应的任务管理优先级,确定所述基板管理控制器和全部所述任务管理单元中的其中一个为目标管理单元;基于确定的所述目标管理单元,生成目标管理单元指令;将所述目标管理单元指令发送至所述多路选择器;A logic control unit, configured to monitor the operating status of a baseboard management controller and all task management units; based on the operating status and the task management priorities corresponding to the baseboard management controller and the task management unit, determine one of the baseboard management controller and all the task management units as a target management unit; based on the determined target management unit, generate a target management unit instruction; and send the target management unit instruction to the multiplexer;

所述多路选择器还用于基于所述目标管理单元指令,确定所述目标管理单元,将所述ID信号发送至所述目标管理单元;The multiplexer is further used to determine the target management unit based on the target management unit instruction and send the ID signal to the target management unit;

基板管理控制器和至少一个任务管理单元,所述基板管理控制器和所述任务管理单元均用于在接收到所述ID信号的情况下,基于所述ID信号,进行PCIe拓扑识别,得到PCIe互联拓扑;基于所述PCIe互联拓扑,对全部所述PCIe Switch芯片进行管理;A baseboard management controller and at least one task management unit, wherein the baseboard management controller and the task management unit are used to perform PCIe topology identification based on the ID signal when receiving the ID signal to obtain a PCIe interconnection topology; and manage all the PCIe Switch chips based on the PCIe interconnection topology;

所述ID管理单元和所述多路选择器连接,所述多路选择器分别与所述逻辑控制单元、所述基板管理控制器、以及全部所述任务管理单元连接。The ID management unit is connected to the multiplexer, and the multiplexer is respectively connected to the logic control unit, the baseboard management controller, and all the task management units.

根据本申请提供的一种IO交换机,至少一个所述任务管理单元包括:第一任务管理单元和第二任务管理单元; According to an IO switch provided by the present application, at least one of the task management units includes: a first task management unit and a second task management unit;

所述逻辑控制单元具体用于当所述第一任务管理单元为正常运行状态时,将所述第一任务管理单元确定为所述目标管理单元;当所述第一任务管理单元为异常运行状态,且所述第二任务管理单元为正常运行状态时,将所述第二任务管理单元确定为所述目标管理单元;当所述第一任务管理单元和所述第二任务管理单元均为异常运行状态,且所述基板管理控制器为正常运行状态时,将所述基板管理控制器确定为所述目标管理单元。The logic control unit is specifically used to determine the first task management unit as the target management unit when the first task management unit is in a normal operating state; to determine the second task management unit as the target management unit when the first task management unit is in an abnormal operating state and the second task management unit is in a normal operating state; and to determine the baseboard management controller as the target management unit when both the first task management unit and the second task management unit are in an abnormal operating state and the baseboard management controller is in a normal operating state.

根据本申请提供的一种IO交换机,所述基板管理控制器和所述任务管理单元均具体用于在识别到所述PCIe互联拓扑的情况下,向每个所述PCIe Switch芯片分别发送对应的配置信息;所述配置信息包括所述PCIe Switch芯片的全部连接接口的类型信息;所述连接接口包括用于与所述对内连接器或所述对外连接器连接的接口;所述类型信息包括:上行接口信息、下行接口信息以及对内互联接口信息;所述上行接口信息指对应的所述连接接口为上行接口,所述上行接口用于向上行连接的所述PCIe设备传输PCIe信号;所述下行接口信息指对应的所述连接接口为下行接口,所述下行接口用于向下行连接的所述PCIe设备传输所述PCIe信号;所述互联接口信息指对应的连接接口为对内互联接口,所述对内互联接口用于向与当前所述PCIe Switch芯片互联的其余所述PCIe Switch芯片传输所述PCIe信号;上行连接的所述PCIe设备指主机,下行连接的所述PCIe设备为图形处理器、固态硬盘或内存;According to an IO switch provided by the present application, the baseboard management controller and the task management unit are specifically used to send corresponding configuration information to each of the PCIe Switch chips respectively when the PCIe interconnection topology is identified; the configuration information includes type information of all connection interfaces of the PCIe Switch chip; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information and internal interconnection interface information; the uplink interface information indicates that the corresponding connection interface is an uplink interface, The upstream interface is used to transmit PCIe signals to the PCIe device connected upstream; the downstream interface information indicates that the corresponding connection interface is a downstream interface, and the downstream interface is used to transmit the PCIe signal to the PCIe device connected downstream; the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips interconnected with the current PCIe Switch chip; the PCIe device connected upstream refers to a host, and the PCIe device connected downstream is a graphics processor, a solid-state drive or a memory;

所述PCIe Switch芯片用于基于接收到的所述配置信息,对自身的全部所述连接接口进行配置。The PCIe Switch chip is used to configure all of its own connection interfaces based on the received configuration information.

根据本申请提供的一种IO交换机,所述PCIe Switch芯片用于在接收到所述PCIe设备发送的所述PCIe信号的情况下,获取所述PCIe信号对应的目标设备类型;基于所述目标设备类型,从当前所述PCIe Switch芯片的全部所述连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;通过所述目标接口向所述目标PCIe设备发送状态查询指令;接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态为所述目标PCIe设备基于所述状态查询指令反馈的;将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。According to an IO switch provided by the present application, the PCIe Switch chip is used to obtain the target device type corresponding to the PCIe signal when receiving the PCIe signal sent by the PCIe device; based on the target device type, determine at least one target interface corresponding to the target device type from all the connection interfaces of the current PCIe Switch chip, and determine that the PCIe device connected to the target interface is the target PCIe device; send a status query instruction to the target PCIe device through the target interface; receive the task processing status fed back by each of the target PCIe devices, the task processing status being fed back by the target PCIe device based on the status query instruction; and send the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.

根据本申请提供的一种IO交换机,所述PCIe Switch芯片具体用于在全部所述目标PCIe设备反馈的所述任务处理状态均为工作状态的情况下,将所述PCIe信号传输至预设的消息队列;在监测到任一所述目标PCIe设备为所述空闲状态的情况下,基于所述消息队列,将所述PCIe信号传输至空闲状态的所述目标PCIe设备。According to an IO switch provided by the present application, the PCIe Switch chip is specifically used to transmit the PCIe signal to a preset message queue when the task processing status fed back by all the target PCIe devices is in the working state; and when any of the target PCIe devices is monitored to be in the idle state, based on the message queue, transmit the PCIe signal to the target PCIe device in the idle state.

根据本申请提供的一种IO交换机,还包括:网络管理单元,所述网络管理单元分别与所述基板管理控制器、所述任务管理单元和所述IO拓展单元连接,以完成所述基板管理控制器、所述IO拓展单元、以及全部所述任务管理单元之间的两两互通。According to an IO switch provided by the present application, it also includes: a network management unit, which is connected to the baseboard management controller, the task management unit and the IO expansion unit respectively to complete the two-to-two communication between the baseboard management controller, the IO expansion unit, and all the task management units.

根据本申请提供的一种IO交换机,还包括:散热单元,用于对所述IO交换机进行散热。An IO switch provided according to the present application further includes: a heat dissipation unit, which is used to dissipate heat for the IO switch.

根据本申请提供的一种IO交换机,还包括:电源轨单元,所述电源轨单元的输入端 用于与供电单元连接,所述电源轨单元的输出端分别与所述IO拓展单元、所述ID管理单元、所述基板管理控制器和所述任务管理单元连接,用于为所述IO拓展单元、所述ID管理单元、所述基板管理控制器和所述任务管理单元供电。According to an IO switch provided by the present application, the IO switch further includes: a power rail unit, wherein an input terminal of the power rail unit Used to be connected to the power supply unit, the output end of the power rail unit is respectively connected to the IO expansion unit, the ID management unit, the baseboard management controller and the task management unit, and is used to supply power to the IO expansion unit, the ID management unit, the baseboard management controller and the task management unit.

根据本申请提供的一种IO交换机,所述逻辑控制单元还用于向所述电源轨单元发送电源开关信号,以控制所述电源轨开始供电或停止供电。According to an IO switch provided by the present application, the logic control unit is further used to send a power switch signal to the power rail unit to control the power rail to start or stop supplying power.

根据本申请提供的一种IO交换机,所述第一预设数量的所述对内连接器分布于对应的所述拓展基板的第一侧,所述第二预设数量的所述对外连接器分布于对应的所述拓展基板第二侧,所述第一侧和第二侧为相对侧。According to an IO switch provided by the present application, the first preset number of internal connectors is distributed on a first side of the corresponding extended baseboard, the second preset number of external connectors is distributed on a second side of the corresponding extended baseboard, and the first side and the second side are opposite sides.

本申请还提供一种PCIe设备,包括:交换机连接器,所述交换机连接器用于与上述所述的IO拓展架构中的对外连接器连接,或者用于与上述中任一项所述的IO交换机中的对外连接器连接。The present application also provides a PCIe device, including: a switch connector, wherein the switch connector is used to connect to the external connector in the above-mentioned IO expansion architecture, or to connect to the external connector in any of the above-mentioned IO switches.

根据本申请提供的一种PCIe设备,所述交换机连接器与所述对外连接器为同一种连接器。According to a PCIe device provided by the present application, the switch connector and the external connector are the same type of connector.

根据本申请提供的一种PCIe设备,所述PCIe设备为主机、图形处理器、固态硬盘或内存。According to a PCIe device provided by the present application, the PCIe device is a host, a graphics processor, a solid-state drive or a memory.

本申请还提供一种PCIe融合架构系统,包括:The present application also provides a PCIe fusion architecture system, including:

如上述中任一项所述的IO交换机,以及多个如上述中任一项所述的PCIe设备,多个所述PCIe设备分别与对应的所述对外连接器连接。An IO switch as described in any one of the above, and a plurality of PCIe devices as described in any one of the above, wherein the plurality of PCIe devices are respectively connected to the corresponding external connectors.

本申请还提供一种基于如上述所述的PCIe融合架构系统的PCIe信号融合管理方法,包括:The present application also provides a PCIe signal fusion management method based on the PCIe fusion architecture system as described above, comprising:

接收PCIe设备发送的PCIe信号;Receive PCIe signals sent by PCIe devices;

获取所述PCIe信号对应的目标设备类型;Obtaining the target device type corresponding to the PCIe signal;

基于所述目标设备类型,从当前PCIe Switch芯片的全部连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;Based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining the PCIe device connected to the target interface as the target PCIe device;

通过所述目标接口向所述目标PCIe设备发送状态查询指令;Sending a status query instruction to the target PCIe device through the target interface;

接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态信息为所述目标PCIe设备基于所述状态查询指令反馈的;Receiving a task processing status fed back by each of the target PCIe devices, wherein the task processing status information is fed back by the target PCIe device based on the status query instruction;

将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。The PCIe signal is sent to any of the target PCIe devices whose task processing state is an idle state.

本申请还提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如上述任一种所述的PCIe信号融合管理方法。The present application also provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the program, the PCIe signal fusion management method as described in any one of the above is implemented.

本申请还提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如上述任一种所述的PCIe信号融合管理方法。The present application also provides a non-transitory computer-readable storage medium having a computer program stored thereon, and when the computer program is executed by a processor, the PCIe signal fusion management method as described in any one of the above is implemented.

本申请的有益效果:本申请提供的IO拓展架构、IO交换机及PCIe设备,通过设置至少一个IO拓展单元,IO拓展单元包括拓展基板,拓展基板设有至少两个PCIe Switch芯片、第一预设数量的对内连接器,以及第二预设数量的对外连接器;对内连接器的一 端与拓展基板上任一个PCIe Switch芯片连接,对内连接器的另一端用于PCIe Switch芯片之间的互联;对外连接器的一端与拓展基板上任一个PCIe Switch芯片连接,对外连接器的另一端用于连接PCIe设备,以完成多个PCIe设备之间的互联。本申请中的IO拓展架构能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性。Beneficial effects of the present application: The IO expansion architecture, IO switch and PCIe device provided by the present application are provided with at least one IO expansion unit, the IO expansion unit includes an expansion substrate, the expansion substrate is provided with at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; one of the internal connectors One end of the internal connector is connected to any PCIe Switch chip on the expansion baseboard, and the other end of the internal connector is used for interconnection between PCIe Switch chips; one end of the external connector is connected to any PCIe Switch chip on the expansion baseboard, and the other end of the external connector is used to connect PCIe devices to complete the interconnection between multiple PCIe devices. The IO expansion architecture in this application can better support the extraction of a large or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in converged architecture systems, has high flexibility, and provides the possibility of engineering implementation for the construction of complex topologies in converged architecture systems.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本申请或相关技术中的技术方案,下面将对实施例或相关技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the present application or related technologies, the following is a brief introduction to the drawings required for use in the embodiments or related technical descriptions. Obviously, the drawings described below are some embodiments of the present application. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative work.

图1是本申请提供的IO拓展架构的一个实施例的结构示意图;FIG1 is a schematic diagram of the structure of an embodiment of an IO expansion architecture provided by the present application;

图2是本申请提供的IO交换机的一个实施例的结构示意图;FIG2 is a schematic diagram of the structure of an embodiment of an IO switch provided by the present application;

图3是本申请提供的IO交换机的另一个实施例的结构示意图;FIG3 is a schematic diagram of the structure of another embodiment of an IO switch provided by the present application;

图4是本申请提供的PCIe融合架构系统中PCIe互联拓扑的一个实施例的结构示意图;FIG4 is a schematic diagram of a structure of an embodiment of a PCIe interconnection topology in a PCIe converged architecture system provided by the present application;

图5是本申请提供的PCIe信号融合管理方法的一个实施例的流程示意图;FIG5 is a flow chart of an embodiment of a PCIe signal fusion management method provided by the present application;

图6是本申请提供的电子设备的结构示意图。FIG. 6 is a schematic diagram of the structure of an electronic device provided in the present application.

附图标识:Figure ID:

110-拓展基板;120-PCIe Switch芯片;130-对内连接器;110- expansion baseboard; 120- PCIe Switch chip; 130- internal connector;

140-对外连接器;210-ID管理单元;220-控制模块;140-external connector; 210-ID management unit; 220-control module;

221-多路选择器;222-逻辑控制单元;223-第一任务管理单元;221-multiplexer; 222-logic control unit; 223-first task management unit;

224-第二任务管理单元;225-基板管理控制器;224-a second task management unit; 225-a baseboard management controller;

310-网络管理单元;320-散热单元;330-电源轨单元;310-network management unit; 320-heat dissipation unit; 330-power rail unit;

340-PSU供电单元;350-HVDC供电单元。340-PSU power supply unit; 350-HVDC power supply unit.

具体实施方式DETAILED DESCRIPTION

为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请中的附图,对本申请中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of this application clearer, the technical solutions in this application will be clearly and completely described below in conjunction with the drawings in this application. Obviously, the described embodiments are part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of this application.

下面以实施例的方式,结合图1-图6描述本申请提供的IO拓展架构、IO交换机及PCIe设备。The following describes the IO expansion architecture, IO switch and PCIe device provided by the present application in the form of embodiments in combination with Figures 1 to 6.

请参考图1,本实施例提供的IO拓展架构,包括:Please refer to FIG1 , the IO expansion architecture provided in this embodiment includes:

至少一个IO拓展单元,所述IO拓展单元包括拓展基板110,所述拓展基板设有至少 两个PCIe Switch(PCIe交换)芯片(图1中简化表示为SW)、第一预设数量的对内连接器130,以及第二预设数量的对外连接器140。At least one IO expansion unit, the IO expansion unit includes an expansion base plate 110, the expansion base plate has at least Two PCIe Switch chips (simplified as SW in FIG. 1 ), a first preset number of internal connectors 130 , and a second preset number of external connectors 140 .

所述对内连接器130的一端与所述拓展基板110上任一个所述PCIe Switch芯片120连接,所述对内连接器130的另一端用于所述PCIe Switch芯片120之间的互联;所述对外连接器140的一端与所述拓展基板110上任一个所述PCIe Switch芯片120连接,所述对外连接器140的另一端用于连接PCIe设备,以完成多个所述PCIe设备之间的互联。One end of the internal connector 130 is connected to any one of the PCIe Switch chips 120 on the extension baseboard 110, and the other end of the internal connector 130 is used for interconnection between the PCIe Switch chips 120; one end of the external connector 140 is connected to any one of the PCIe Switch chips 120 on the extension baseboard 110, and the other end of the external connector 140 is used for connecting a PCIe device to complete the interconnection between multiple PCIe devices.

具体地,所述IO拓展单元的数量可以根据实际情况进行设置,如1个、2个、3个或4个等。PCIe采用的是点对点(point to point)的工作模式,上述实施例中通过采用PCIe Switch芯片120,能够实现PCIe资源的扩展,从而便于实现多个PCIe设备之间的通信。Specifically, the number of the IO expansion units can be set according to actual conditions, such as 1, 2, 3 or 4. PCIe adopts a point-to-point working mode. In the above embodiment, by adopting the PCIe Switch chip 120, the expansion of PCIe resources can be achieved, thereby facilitating communication between multiple PCIe devices.

进一步地,通过在拓展基板110上设置至少两个PCIe Switch芯片120,能够扩展出尽可能多的PCIe资源。通过设置第一预设数量的对内连接器130,能够较好地实现IO拓展架构中各个PCIe Switch芯片120之间的互联。通过设置第二预设数量的对外连接器140,能够较好地实现多个PCIe设备的接入,从而实现PCIe互联拓扑的构建。Furthermore, by setting at least two PCIe Switch chips 120 on the expansion base plate 110, as many PCIe resources as possible can be expanded. By setting a first preset number of internal connectors 130, the interconnection between the PCIe Switch chips 120 in the IO expansion architecture can be better realized. By setting a second preset number of external connectors 140, the access of multiple PCIe devices can be better realized, thereby realizing the construction of the PCIe interconnection topology.

需要说明的是,所述第一预设数量可以根据IO拓展架构中PCIe Switch芯片120的数量确定。例如:假设IO拓展架构中共设有四个IO拓展单元,每个IO拓展单元包括2个PCIe Switch芯片120,即总共有8个PCIe Switch芯片120。那么,为了实现8个PCIe Switch芯片120之间的互联,则需要设置32个对内连接器130,用于实现8个PCIe Switch芯片120之间的互联。其中,每个PCIe Switch芯片120对应4个对内连接器130。考虑到服务器(IO拓展架构所在的服务器)通常具有标准尺寸限制,经多次试验得到,所述第一预设数量的较优的取值为8,即每个拓展基板110上设置8个对内连接器130,每个PCIe Switch芯片120对应4个对内连接器130。上述第一预设数量的较优的取值仅供参考,在实际实施过程中,可以根据实际情况进行设置。It should be noted that the first preset number can be determined according to the number of PCIe Switch chips 120 in the IO expansion architecture. For example: Assume that there are four IO expansion units in the IO expansion architecture, and each IO expansion unit includes two PCIe Switch chips 120, that is, there are a total of 8 PCIe Switch chips 120. Then, in order to achieve the interconnection between the 8 PCIe Switch chips 120, 32 internal connectors 130 need to be set to achieve the interconnection between the 8 PCIe Switch chips 120. Among them, each PCIe Switch chip 120 corresponds to 4 internal connectors 130. Considering that the server (the server where the IO expansion architecture is located) usually has a standard size restriction, after multiple experiments, it is found that the preferred value of the first preset number is 8, that is, 8 internal connectors 130 are set on each expansion substrate 110, and each PCIe Switch chip 120 corresponds to 4 internal connectors 130. The above-mentioned preferred value of the first preset number is for reference only, and in actual implementation, it can be set according to actual conditions.

另外,所述第二预设数量也可以根据实际情况进行设置,如8个、10个等。同样地,由于服务器的尺寸具有标准定义或限制,经多次试验得出,所述第二预设数量的较优取值为10个,即每个拓展基板110上设置10个对外连接器140,其中,每个PCIe Switch芯片120均对应5个相应的对外连接器140,实现在标准尺寸的范围内,拓展出尽可能多的PCIe资源,连接尽可能多的PCIe设备。In addition, the second preset number can also be set according to actual conditions, such as 8, 10, etc. Similarly, since the size of the server has a standard definition or limitation, after multiple tests, it is concluded that the preferred value of the second preset number is 10, that is, 10 external connectors 140 are set on each expansion base plate 110, wherein each PCIe Switch chip 120 corresponds to 5 corresponding external connectors 140, so as to expand as many PCIe resources as possible and connect as many PCIe devices as possible within the range of standard size.

上述实施例中的IO拓展架构能够较好地支持大量或足量的PCIe资源或IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低,可实施性较强。The IO expansion architecture in the above embodiment can better support the extraction of a large or sufficient amount of PCIe resources or IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system, has low cost and strong feasibility.

需要提及的是,上述实施例中的对内连接器130可以为MCIO(Mini Cool Edge IO,迷你冷边IO)连接器。上述实施例中的对外连接器140可以为CDFP(Compact Duplex Fiber Pluggable,紧凑型双工光纤可插拔,一种高密度光纤连接器)连接器。通过采用上述MCIO连接器作为对内连接器130,采用CDFP连接器作为对外连接器140,能够满足数据传输高带宽、高传输速率等要求。It should be mentioned that the inner connector 130 in the above embodiment may be an MCIO (Mini Cool Edge IO) connector. The outer connector 140 in the above embodiment may be a CDFP (Compact Duplex Fiber Pluggable, a high-density fiber optic connector) connector. By using the above MCIO connector as the inner connector 130 and the CDFP connector as the outer connector 140, the requirements of high bandwidth and high transmission rate of data transmission can be met.

需要说明的是,CDFP连接器除了用于PCIe信号的接收与发送(即接收其连接的PCIe设备发送的PCIe信号,以及向其连接的PCIe设备发送PCIe信号)以外,还用于发 送多个预设信号,以满足不同的需求。具体地,CDFP连接器还向对应的PCIe Switch芯片120发送VPP(Virtual Pin Port,虚拟引脚接口)_I2C(Inter-Integrated Circuit,集成电路总线)信号及Alert(提醒)信号,所述VPP I2C信号及Alert信号用于SSD资源池的硬盘热插拔。可以理解的,VPP I2C信号可用于热插拔PCIe管理。Alert信号可用于热插拔提醒。另外,为了实现与对应的PCIe Switch芯片120之间的同步,所述CDFP连接器还向对应的PCIe Switch芯片120发送一路PCIe时钟信号,以实现时钟同步。另外,所述CDFP连接器还向基板管理控制器225(BMC,Baseboard Management Controller)和全部任务管理单元中的其中一个,发送Mngt(Management,管理)_I2C信号,用于多个资源池之间的设备信息交互。再有,所述CDFP连接器还向控制模块220中的逻辑控制单元222发送Power enable(能量使能)信号、Box present(IO拓展单元连接状态)信号、以及PERST(全局复位信号)信号,用于整机柜系统的识别、上下电和复位。并且,所述CDFP连接器还向预设的ID(Identity document,身份标识号)管理单元发送Cable(有线传输)_I2C信号,用于读取CDFP线缆信息并进行加解密操作,以及ID_I2C信号,用于识别CDFP连接器的预设ID。通过上述对CDFP连接器发送的信号类型的定义,能够实现对Host资源池、GPU资源池、SSD资源池和Memory资源池的兼容。It should be noted that the CDFP connector is used not only for receiving and sending PCIe signals (i.e., receiving PCIe signals sent by the PCIe device to which it is connected, and sending PCIe signals to the PCIe device to which it is connected), but also for sending Send multiple preset signals to meet different needs. Specifically, the CDFP connector also sends a VPP (Virtual Pin Port, Virtual Pin Interface)_I2C (Inter-Integrated Circuit, Integrated Circuit Bus) signal and an Alert (reminder) signal to the corresponding PCIe Switch chip 120, and the VPP I2C signal and the Alert signal are used for hot plugging of hard disks in the SSD resource pool. It can be understood that the VPP I2C signal can be used for hot plug PCIe management. The Alert signal can be used for hot plug reminders. In addition, in order to achieve synchronization with the corresponding PCIe Switch chip 120, the CDFP connector also sends a PCIe clock signal to the corresponding PCIe Switch chip 120 to achieve clock synchronization. In addition, the CDFP connector also sends a Mngt (Management, Management)_I2C signal to the baseboard management controller 225 (BMC, Baseboard Management Controller) and one of all task management units for device information interaction between multiple resource pools. Furthermore, the CDFP connector also sends a Power enable signal, a Box present signal, and a PERST signal to the logic control unit 222 in the control module 220, which are used for identification, power on and off, and reset of the entire cabinet system. In addition, the CDFP connector also sends a Cable_I2C signal to the preset ID (Identity document) management unit for reading the CDFP cable information and performing encryption and decryption operations, as well as an ID_I2C signal for identifying the preset ID of the CDFP connector. Through the above definition of the signal type sent by the CDFP connector, compatibility with the Host resource pool, GPU resource pool, SSD resource pool, and Memory resource pool can be achieved.

还需要说明的是,MCIO连接器为标准板边连接器,用于PCIe Switch芯片120之间的信号互联。MCIO连接器发送的信号,除了PCIe信号外,还包含一路ID_I2C信号用于不同PCIe Switch芯片120进行MCIO连接器的预设ID识别。It should also be noted that the MCIO connector is a standard board edge connector used for signal interconnection between PCIe Switch chips 120. In addition to the PCIe signal, the signal sent by the MCIO connector also includes an ID_I2C signal for different PCIe Switch chips 120 to identify the preset ID of the MCIO connector.

通过搭建上述IO拓展架构,能够较好地实现大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性。且通过对CDFP连接器的发送信号进行多种定义,能够满足多样化的功能需求。By building the above IO expansion architecture, a large or sufficient amount of IO resources can be better brought out, which can better meet the construction requirements of complex topologies in converged architecture systems. It has high flexibility and provides the possibility of engineering implementation for the construction of complex topologies in converged architecture systems. And by defining the sending signals of the CDFP connector in multiple ways, it can meet diverse functional requirements.

图1中以四个IO拓展单元为例,每个IO拓展单元均有一个拓展基板110,四块拓展基板110依次叠放设置。每块拓展基板110均设有两个PCIe Switch芯片120、8个对内连接器130、以及10个对外连接器140。通过在每块拓展基板110上均设置两个PCIe Switch芯片120、8个对内连接器130、以及10个对外连接器140,能够引出足量的IO资源,从而有助于融合架构的构建。其中,每四个对内连接器130的一端与同一拓展基板110上对应的一个PCIe Switch芯片120连接,另一端则分别与其他IO拓展单元的对内连接器130连接。每5个对外连接器140的一端与同一拓展基板110上对应的一个PCIe Switch芯片120连接,另一端则用于与外部的PCIe设备连接,以完成多个PCIe设备的互联,形成PCIe互联拓扑。以此完成对上述实施例中的IO拓展架构的构建。本示例中的IO拓展架构能够支持40路的IO资源的引出。能够满足融合架构系统中复杂拓扑的构建需求,实施较方便,可行性较高。In FIG1, four IO expansion units are taken as an example. Each IO expansion unit has an expansion base plate 110, and four expansion base plates 110 are stacked in sequence. Each expansion base plate 110 is provided with two PCIe Switch chips 120, 8 internal connectors 130, and 10 external connectors 140. By setting two PCIe Switch chips 120, 8 internal connectors 130, and 10 external connectors 140 on each expansion base plate 110, sufficient IO resources can be drawn out, which is helpful for the construction of a converged architecture. Among them, one end of each of the four internal connectors 130 is connected to a corresponding PCIe Switch chip 120 on the same expansion base plate 110, and the other end is connected to the internal connectors 130 of other IO expansion units. One end of each of the five external connectors 140 is connected to a corresponding PCIe Switch chip 120 on the same expansion base plate 110, and the other end is used to connect to an external PCIe device to complete the interconnection of multiple PCIe devices and form a PCIe interconnection topology. In this way, the construction of the IO expansion architecture in the above embodiment is completed. The IO expansion architecture in this example can support the extraction of 40 IO resources. It can meet the construction requirements of complex topologies in the converged architecture system, is more convenient to implement, and has higher feasibility.

在一些实施例中,所述IO拓展单元还用于与预设的ID管理单元210连接。具体地,所述IO拓展单元中的对内连接器130和对外连接器140分别用于与所述ID管理单元210连接。所述IO拓展单元中的对外连接器140用于接入PCIe设备,以得到PCIe信号。将所述PCIe信号传输至PCIe Switch芯片120。 In some embodiments, the IO expansion unit is also used to connect to a preset ID management unit 210. Specifically, the internal connector 130 and the external connector 140 in the IO expansion unit are respectively used to connect to the ID management unit 210. The external connector 140 in the IO expansion unit is used to access a PCIe device to obtain a PCIe signal. The PCIe signal is transmitted to the PCIe Switch chip 120.

在一些实施例中,所述对外连接器140和对内连接器130还用于将各自的ID信号发送至所述ID管理单元210。所述ID管理单元210将所述ID信号发送至预设的控制模块220,所述控制模块220基于所述ID信号,进行互联拓扑识别,得到PCIe互联拓扑;并且,基于所述PCIe互联拓扑,对所述IO拓展架构中的全部所述PCIe Switch芯片120进行管理。In some embodiments, the external connector 140 and the internal connector 130 are also used to send their respective ID signals to the ID management unit 210. The ID management unit 210 sends the ID signal to a preset control module 220, and the control module 220 performs interconnection topology identification based on the ID signal to obtain a PCIe interconnection topology; and, based on the PCIe interconnection topology, manages all the PCIe Switch chips 120 in the IO expansion architecture.

在一些实施例中,所述PCIe Switch芯片120接收的所述ID信号是所述控制模块220中的目标管理单元发送的,所述目标管理单元为所述控制模块220中的基板管理控制器225和全部任务管理单元中的其中一个。所述目标管理单元的确定方式为:所述控制模块220中的逻辑控制单元222监测基板管理控制器225和全部任务管理单元的运行状态;基于所述运行状态、以及所述基板管理控制器225和所述任务管理单元各自对应的任务管理优先级,确定所述基板管理控制器225和全部所述任务管理单元中的其中一个为目标管理单元。In some embodiments, the ID signal received by the PCIe Switch chip 120 is sent by the target management unit in the control module 220, and the target management unit is one of the baseboard management controller 225 and all the task management units in the control module 220. The target management unit is determined as follows: the logic control unit 222 in the control module 220 monitors the operating status of the baseboard management controller 225 and all the task management units; based on the operating status and the task management priorities corresponding to the baseboard management controller 225 and the task management units, one of the baseboard management controller 225 and all the task management units is determined as the target management unit.

在一些实施例中,所述ID管理单元210将所述ID信号发送至预设的控制模块220包括:所述ID管理单元210将所述ID信号发送至所述控制模块220中预设的多路选择器221(MUX,Multiplex),多路选择器221基于逻辑控制单元222发送的目标管理单元控制指令,确定目标管理单元,将接收到的全部所述PCIe信号发送至目标管理单元。In some embodiments, the ID management unit 210 sends the ID signal to a preset control module 220, including: the ID management unit 210 sends the ID signal to a multiplexer 221 (MUX, Multiplex) preset in the control module 220, the multiplexer 221 determines the target management unit based on the target management unit control instruction sent by the logic control unit 222, and sends all the received PCIe signals to the target management unit.

上述实施例中的IO拓展架构,通过设置至少一个IO拓展单元,所述IO拓展单元包括拓展基板110,所述拓展基板设有至少两个PCIe Switch(PCIe交换)芯片(图1中简化表示为SW)、第一预设数量的对内连接器130,以及第二预设数量的对外连接器140。将所述对内连接器130的一端与所述拓展基板110上任一个所述PCIe Switch芯片120连接,所述对内连接器130的另一端用于所述PCIe Switch芯片120之间的互联;并且,将所述对外连接器140的一端与所述拓展基板110上任一个所述PCIe Switch芯片120连接,所述对外连接器140的另一端用于连接PCIe设备,以完成多个所述PCIe设备之间的互联。能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低。The IO expansion architecture in the above embodiment is provided with at least one IO expansion unit, wherein the IO expansion unit includes an expansion base plate 110, wherein the expansion base plate is provided with at least two PCIe Switch chips (simplified as SW in FIG. 1 ), a first preset number of internal connectors 130, and a second preset number of external connectors 140. One end of the internal connector 130 is connected to any one of the PCIe Switch chips 120 on the expansion base plate 110, and the other end of the internal connector 130 is used for interconnection between the PCIe Switch chips 120; and one end of the external connector 140 is connected to any one of the PCIe Switch chips 120 on the expansion base plate 110, and the other end of the external connector 140 is used for connecting PCIe devices to complete the interconnection between multiple PCIe devices. It can better support the extraction of a large amount or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in a converged architecture system, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in a converged architecture system, and has low cost.

请参考图2,本实施例还提供一种IO交换机,包括:Referring to FIG. 2 , this embodiment further provides an IO switch, including:

ID管理单元210、控制模块220、以及如上述所述的IO拓展架构。The ID management unit 210, the control module 220, and the IO expansion architecture as described above.

所述ID管理单元210用于采集所述IO拓展架构中全部所述对内连接器130和所述对外连接器140的ID信号,将所述ID信号传输至所述控制模块220。The ID management unit 210 is used to collect ID signals of all the internal connectors 130 and the external connectors 140 in the IO expansion architecture, and transmit the ID signals to the control module 220 .

所述控制模块220用于基于全部所述ID信号,对所述IO拓展架构中的全部所述PCIe Switch芯片120进行管理。本实施例中的IO交换机具有较好的IO资源拓展与共享能力,具有足够的系统健壮性、灵活性,能够维持如整机柜内诸多PCIe设备间的IO交换。能够较好地满足融合架构系统中复杂拓扑的构建需求,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低。The control module 220 is used to manage all the PCIe Switch chips 120 in the IO expansion architecture based on all the ID signals. The IO switch in this embodiment has good IO resource expansion and sharing capabilities, sufficient system robustness and flexibility, and can maintain IO exchanges between many PCIe devices in the entire cabinet. It can better meet the construction requirements of complex topologies in the converged architecture system, and provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system at a low cost.

具体地,通过在所述IO交换机中设置ID管理单元210,能够便于对IO拓展架构中全部的对内连接器130和对外连接器140的ID信号进行采集,并将采集到的ID信号发 送至控制模块220,便于控制模块220进行PCIe互联拓扑识别。通过在所述IO交换机中设置控制模块220,能够便于对全部的所述PCIe Switch芯片120进行管理。本实施例中的IO交换机,不仅能够支持大量或足量的IO资源的引出,并且,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低,可实施性较强,自动化程度较高。Specifically, by setting the ID management unit 210 in the IO switch, it is possible to collect the ID signals of all the internal connectors 130 and the external connectors 140 in the IO expansion architecture, and send the collected ID signals to the external connectors 140. The IO switch in this embodiment can not only support the extraction of a large or sufficient amount of IO resources, but also better meet the construction requirements of complex topologies in the converged architecture system, with high flexibility, providing the possibility of engineering implementation for the construction of complex topologies in the converged architecture system, with low cost, strong feasibility and high degree of automation.

在一些实施例中,每个所述对内连接器130和所述对外连接器140均各自连接有一个ID定义芯片,所述ID定义芯片用于对对应的所述对内连接器130或所述对外连接器140进行ID定义;In some embodiments, each of the inner connector 130 and the outer connector 140 is connected to an ID definition chip, and the ID definition chip is used to perform ID definition on the corresponding inner connector 130 or the outer connector 140;

所述ID管理单元具体用于通过所述对外连接器140或所述对内连接器130,向每个所述ID定义芯片发送ID查询指令;The ID management unit is specifically used to send an ID query instruction to each of the ID definition chips through the external connector 140 or the internal connector 130;

所述ID定义芯片还用于接收所述ID查询指令,基于所述ID查询指令,向对应的所述对内连接器130或所述对外连接器140反馈所述ID信号;The ID definition chip is also used to receive the ID query instruction, and based on the ID query instruction, feed back the ID signal to the corresponding internal connector 130 or the external connector 140;

所述对内连接器130或所述对外连接器140在接收到所述ID信号的情况下,将所述ID信号反馈至所述ID管理单元210。When receiving the ID signal, the internal connector 130 or the external connector 140 feeds the ID signal back to the ID management unit 210 .

所述ID管理单元210分别与每个所述对外连接器140、所述对内连接器130和所述控制模块220连接。The ID management unit 210 is connected to each of the external connectors 140 , the internal connectors 130 , and the control module 220 , respectively.

需要说明的是,所述ID管理单元210为所述IO交换机中的核心管理单元,其用于对全部所述对内连接器130或所述对外连接器140进行ID信号采集,并将采集到的ID信号传输至目标管理单元。所述目标管理单元为所述控制模块220中的基板管理控制器225和全部所述任务管理单元中的其中一个。It should be noted that the ID management unit 210 is a core management unit in the IO switch, which is used to collect ID signals from all the internal connectors 130 or the external connectors 140, and transmit the collected ID signals to the target management unit. The target management unit is the baseboard management controller 225 in the control module 220 and one of all the task management units.

还需要说明的是,每个对内连接器130和对外连接器140的ID信号均不相同。另外,为了简化ID识别算法,每个所述ID定义芯片均具有相同的I2C(Inter-Integrated Circuit,集成电路总线)地址,并且处于完全相同的I2C通道下。即ID管理单元210在进行ID信号采集过程中,无需多次切换访问地址等,加快识别速度,降低程序复杂度,可实施性较强,成本较低,识别效率较高。It should also be noted that the ID signals of each internal connector 130 and external connector 140 are different. In addition, in order to simplify the ID recognition algorithm, each of the ID definition chips has the same I2C (Inter-Integrated Circuit) address and is in exactly the same I2C channel. That is, during the ID signal acquisition process, the ID management unit 210 does not need to switch the access address multiple times, which speeds up the recognition speed, reduces the program complexity, has strong feasibility, low cost, and high recognition efficiency.

请参考图3,在一些实施例中,所述控制模块220包括:Please refer to FIG. 3 , in some embodiments, the control module 220 includes:

多路选择器221,用于接收所述ID管理单元210传输的所述ID信号;A multiplexer 221, configured to receive the ID signal transmitted by the ID management unit 210;

逻辑控制单元222,用于监测基板管理控制器225和全部任务管理单元的运行状态;基于所述运行状态、以及所述基板管理控制器225和所述任务管理单元各自对应的任务管理优先级,确定所述基板管理控制器225和全部所述任务管理单元中的其中一个为目标管理单元;基于确定的所述目标管理单元,生成目标管理单元指令;将所述目标管理单元指令发送至所述多路选择器221;The logic control unit 222 is used to monitor the operating status of the baseboard management controller 225 and all the task management units; based on the operating status and the task management priorities corresponding to the baseboard management controller 225 and the task management units, determine that one of the baseboard management controller 225 and all the task management units is a target management unit; based on the determined target management unit, generate a target management unit instruction; and send the target management unit instruction to the multiplexer 221;

所述多路选择器221还用于基于所述目标管理单元指令,确定所述目标管理单元,将所述ID信号发送至所述目标管理单元;The multiplexer 221 is further used to determine the target management unit based on the target management unit instruction and send the ID signal to the target management unit;

基板管理控制器225和至少一个任务管理单元,所述基板管理控制器225和所述任务管理单元均用于在接收到所述ID信号的情况下,基于所述ID信号,进行PCIe拓扑识别,得到PCIe互联拓扑;基于所述PCIe互联拓扑,对全部所述PCIe Switch芯片120进 行管理;The baseboard management controller 225 and at least one task management unit are used to perform PCIe topology identification based on the ID signal when receiving the ID signal to obtain a PCIe interconnection topology; perform PCIe topology identification on all the PCIe Switch chips 120 based on the PCIe interconnection topology. Bank management;

所述ID管理单元210和所述多路选择器221连接,所述多路选择器221分别与所述逻辑控制单元222、所述基板管理控制器225、以及全部所述任务管理单元连接。所述ID管理单元210和所述多路选择器221之间可基于USB(Universal Serial Bus,通用串口总线)、I2C和UART(Universal Asynchronous Receiver/Transmitter,通用异步收发传输器)中的任一种方式进行信号传输或通信。同样地,所述多路选择器221与所述逻辑控制单元222之间、所述多路选择器221与所述基板管理控制器225之间、以及所述多路选择器221与各所述任务管理单元之间,均可基于USB、I2C和UART中的任一种方式进行信号传输或通信。基于信号传输类型的不同,可以采用相应的通信协议或通信方式,满足不同通信需求,支持功能多样化。The ID management unit 210 is connected to the multiplexer 221, and the multiplexer 221 is respectively connected to the logic control unit 222, the baseboard management controller 225, and all the task management units. The ID management unit 210 and the multiplexer 221 can perform signal transmission or communication based on any one of USB (Universal Serial Bus), I2C and UART (Universal Asynchronous Receiver/Transmitter). Similarly, signal transmission or communication can be performed between the multiplexer 221 and the logic control unit 222, between the multiplexer 221 and the baseboard management controller 225, and between the multiplexer 221 and each of the task management units based on any one of USB, I2C and UART. Based on the different types of signal transmission, corresponding communication protocols or communication methods can be adopted to meet different communication requirements and support functional diversification.

具体地,所述多路选择器221接收所述ID管理单元210传输的PCIe信号,将PCIe信号发送至目标管理单元。该目标管理单元的确定由逻辑控制单元222决定。即逻辑控制单元222实时监测基板管理控制器225和全部任务管理单元的运行状态;基于所述运行状态、以及所述基板管理控制器225和所述任务管理单元各自对应的任务管理优先级,确定所述基板管理控制器225和全部所述任务管理单元中的其中一个为目标管理单元;基于确定的所述目标管理单元,生成目标管理单元指令;将所述目标管理单元指令发送至所述多路选择器221;多路选择器221基于目标管理单元指令,确定目标管理单元,并将所述PCIe信号发送至所述目标管理单元。Specifically, the multiplexer 221 receives the PCIe signal transmitted by the ID management unit 210, and sends the PCIe signal to the target management unit. The determination of the target management unit is determined by the logic control unit 222. That is, the logic control unit 222 monitors the operating status of the baseboard management controller 225 and all the task management units in real time; based on the operating status and the task management priorities corresponding to the baseboard management controller 225 and the task management unit, one of the baseboard management controller 225 and all the task management units is determined as the target management unit; based on the determined target management unit, a target management unit instruction is generated; the target management unit instruction is sent to the multiplexer 221; the multiplexer 221 determines the target management unit based on the target management unit instruction, and sends the PCIe signal to the target management unit.

需要说明的是,本实施例中的所述基板管理控制器225和全部所述任务管理单元全部都可以实现对全部所述PCIe Switch芯片120进行管理的功能。从该功能上看,所述基板管理控制器225和全部所述任务管理单元构成冗余。上述实施例通过从所述基板管理控制器225和全部所述任务管理单元中确定其中一个为目标管理单元,能够最大程度地提升IO交换机的可靠性,避免由于单个信号分配的功能单元出现故障所造成的不必要的损失。It should be noted that the baseboard management controller 225 and all the task management units in this embodiment can all realize the function of managing all the PCIe Switch chips 120. From this functional point of view, the baseboard management controller 225 and all the task management units are redundant. The above embodiment can maximize the reliability of the IO switch and avoid unnecessary losses caused by the failure of a single signal distribution functional unit by determining one of the baseboard management controller 225 and all the task management units as the target management unit.

还需要说明的是,对全部所述PCIe Switch芯片120进行管理优先级主要在于任务管理单元,即在全部任务管理单元出现故障的情况下,即全部任务管理单元均为异常运行状态的情况下,将基板管理控制器225确定为目标管理单元,并由作为目标管理单元的基板管理控制器225进行后续的信号分配与处理。It should also be noted that the management priority of all the PCIe Switch chips 120 lies mainly in the task management unit, that is, when all the task management units fail, that is, when all the task management units are in abnormal operation state, the baseboard management controller 225 is determined as the target management unit, and the baseboard management controller 225 as the target management unit performs subsequent signal allocation and processing.

需要提及的是,本实施例中的基板管理控制器225除了具有对全部所述PCIe Switch芯片120进行管理的功能以外,还需要对整个IO交换机进行基础信息监测,如采集IO交换机中预设的温度传感器等的信息,进行基础信息监测,并基于温度等基础信息,进行相应调控,如对IO交换机中预设的散热单元320的散热档位进行上升或下降控制等。It should be mentioned that, in addition to the function of managing all the PCIe Switch chips 120, the baseboard management controller 225 in this embodiment also needs to monitor the basic information of the entire IO switch, such as collecting information from the temperature sensor preset in the IO switch, and performing basic information monitoring, and performing corresponding adjustments based on basic information such as temperature, such as increasing or decreasing the heat dissipation level of the heat dissipation unit 320 preset in the IO switch.

另外,为了实现所述基板管理控制器225和全部所述任务管理单元的信息的同步,也为了降低在基板管理控制器225和全部所述任务管理单元之间进行信号分配的功能切换的难度,基板管理控制器225和全部所述任务管理单元之间两两通信,以实现实时的数据交换与备份,较大程度地提高IO交换机的稳定性与可靠性。In addition, in order to achieve information synchronization between the baseboard management controller 225 and all the task management units, and to reduce the difficulty of function switching of signal distribution between the baseboard management controller 225 and all the task management units, the baseboard management controller 225 and all the task management units communicate with each other to achieve real-time data exchange and backup, thereby greatly improving the stability and reliability of the IO switch.

在一些实施例中,所述基板管理控制器225为遵循RunBMC(一种开源硬件规范) 规范的扣卡,便于后续的BMC芯片升级。并且,BMC管理单元支持一个TF(Trans-flash,闪存)卡,以及相关的机箱管理功能接口。In some embodiments, the baseboard management controller 225 complies with RunBMC (an open source hardware specification) The standardized daughter card facilitates the subsequent BMC chip upgrade. In addition, the BMC management unit supports a TF (Trans-flash, flash memory) card and related chassis management function interfaces.

在一些实施例中,所述任务管理单元指CPU(Central Processing Unit,中央处理器)管理单元。In some embodiments, the task management unit refers to a CPU (Central Processing Unit) management unit.

在一些实施例中,至少一个所述任务管理单元包括:第一任务管理单元223和第二任务管理单元224。即第一CPU管理单元和第二CPU管理单元。通过设置所述第一CPU管理单元和所述第二CPU管理单元,构成任务管理单元的冗余,有助于提高IO交换机的稳定性与可靠性。为了更好地实现工程化,在一些实施例中,所述第一CPU管理单元和所述第二CPU管理单元均为遵循COMe(COM Express)规范的扣卡,便于后续的处理器升级。并且,每个所述任务管理单元均各自支持一个PCIe网卡、一个PCIe NVMe(Non-Volatile Memory express,非易失性内存主机控制器接口规范)硬盘或SATA(Serial Advanced Technology Attachment,串行ATA)硬盘、一路PCIe调试接口(用于PCIe Switch芯片120)等IO资源。In some embodiments, at least one of the task management units includes: a first task management unit 223 and a second task management unit 224. That is, a first CPU management unit and a second CPU management unit. By setting the first CPU management unit and the second CPU management unit, redundancy of the task management unit is formed, which helps to improve the stability and reliability of the IO switch. In order to better realize engineering, in some embodiments, the first CPU management unit and the second CPU management unit are both buckle cards that comply with the COMe (COM Express) specification, which is convenient for subsequent processor upgrades. In addition, each of the task management units supports a PCIe network card, a PCIe NVMe (Non-Volatile Memory express, non-volatile memory host controller interface specification) hard disk or SATA (Serial Advanced Technology Attachment, Serial ATA) hard disk, a PCIe debug interface (for PCIe Switch chip 120) and other IO resources.

需要说明的是,第一任务管理单元223、第二任务管理单元224和基板管理控制器225用于IO交换机的系统管理,三者之间互为冗余。通过这三者之间的冗余设置,能够较好地保证IO交换机的稳定运行。It should be noted that the first task management unit 223, the second task management unit 224 and the baseboard management controller 225 are used for system management of the IO switch, and are mutually redundant. The redundancy setting between the three can better ensure the stable operation of the IO switch.

在一些实施例中,所述逻辑控制单元222具体用于当所述第一任务管理单元223为正常运行状态时,将所述第一任务管理单元223确定为所述目标管理单元;当所述第一任务管理单元223为异常运行状态,且所述第二任务管理单元224为正常运行状态时,将所述第二任务管理单元224确定为所述目标管理单元;当所述第一任务管理单元223和所述第二任务管理单元224均为异常运行状态,且所述基板管理控制器225为正常运行状态时,将所述基板管理控制器225确定为所述目标管理单元。即本实施例中的任务管理优先级为第一任务管理单元223-第二任务管理单元224-基板管理控制器225。通过设置上述任务管理优先级,并采用所述任务管理优先级,对所述第一任务管理单元223、所述第二任务管理单元224和所述基板管理控制器225进行任务切换,即具体由哪个单元或控制器负责接收ID信号、进行PCIe互联拓扑识别,以及对全部所述PCIe Switch芯片120进行管理等任务的处理。能够较好地保证IO交换机的稳定性与可靠性,灵活度较高。所述逻辑控制单元222较好地实现了IO交换机中的冗余切换、上下电时序和复位逻辑控制,能够精准地保障IO交换机的运行。In some embodiments, the logic control unit 222 is specifically used to determine the first task management unit 223 as the target management unit when the first task management unit 223 is in a normal operating state; determine the second task management unit 224 as the target management unit when the first task management unit 223 is in an abnormal operating state and the second task management unit 224 is in a normal operating state; determine the baseboard management controller 225 as the target management unit when both the first task management unit 223 and the second task management unit 224 are in an abnormal operating state and the baseboard management controller 225 is in a normal operating state. That is, the task management priority in this embodiment is the first task management unit 223-the second task management unit 224-the baseboard management controller 225. By setting the above task management priority and using the task management priority, the first task management unit 223, the second task management unit 224 and the baseboard management controller 225 are task-switched, that is, which unit or controller is responsible for receiving ID signals, performing PCIe interconnection topology identification, and managing all the PCIe Switch chips 120. The stability and reliability of the IO switch can be well guaranteed, and the flexibility is high. The logic control unit 222 can well realize the redundant switching, power-on and power-off timing and reset logic control in the IO switch, and can accurately guarantee the operation of the IO switch.

需要说明的是,当所述第一任务管理单元223为异常运行状态,且所述第二任务管理单元224为正常运行状态时,将所述第二任务管理单元224确定为所述目标管理单元。同时,尝试重启第一任务管理单元223,若重启完成,则将通知多路选择器221将控制权切换至第一任务管理单元223,即将第一任务管理单元223重新确定为目标管理单元。同理,当所述第一任务管理单元223和所述第二任务管理单元224均为异常运行状态,且所述基板管理控制器225为正常运行状态时,将所述基板管理控制器225确定为所述目标管理单元。同时,尝试重启所述第一任务管理单元223和所述第二任务管理单元224。在第一任务管理单元223重启成功的情况下,将所述第一任务管理单元223确定 为所述目标管理单元。在所述第二任务管理单元224重启成功,且第一任务管理单元223未重启成功的情况下,将所述第二任务管理单元224确定为所述目标管理单元。从而能够较好地保证IO交换机的稳定运行。It should be noted that, when the first task management unit 223 is in an abnormal operating state, and the second task management unit 224 is in a normal operating state, the second task management unit 224 is determined as the target management unit. At the same time, an attempt is made to restart the first task management unit 223. If the restart is completed, the multiplexer 221 will be notified to switch the control to the first task management unit 223, that is, the first task management unit 223 will be re-determined as the target management unit. Similarly, when the first task management unit 223 and the second task management unit 224 are both in an abnormal operating state, and the baseboard management controller 225 is in a normal operating state, the baseboard management controller 225 will be determined as the target management unit. At the same time, an attempt is made to restart the first task management unit 223 and the second task management unit 224. In the case where the first task management unit 223 is restarted successfully, the first task management unit 223 will be determined as the target management unit. When the second task management unit 224 is successfully restarted and the first task management unit 223 is not successfully restarted, the second task management unit 224 is determined as the target management unit. Thus, the stable operation of the IO switch can be better guaranteed.

在具体实施过程中,还可以根据实际情况对任务管理单元的数量进行增减,以满足不同的实际需求。During the specific implementation process, the number of task management units can be increased or decreased according to actual conditions to meet different actual needs.

在一些实施例中,所述基板管理控制器225和所述任务管理单元均具体用于识别到所述PCIe互联拓扑的情况下,向每个所述PCIe Switch芯片分别发送对应的配置信息;所述配置信息包括所述PCIe Switch芯片的全部连接接口的类型信息;所述连接接口包括用于与所述对内连接器或所述对外连接器连接的接口;所述类型信息包括:上行接口信息、下行接口信息以及对内互联接口信息;所述上行接口信息指对应的所述连接接口为上行接口,所述上行接口用于向上行连接的所述PCIe设备传输PCIe信号;所述下行接口信息指对应的所述连接接口为下行接口,所述下行接口用于向下行连接的所述PCIe设备传输所述PCIe信号;所述互联接口信息指对应的连接接口为对内互联接口,所述对内互联接口用于向与当前所述PCIe Switch芯片互联的其余所述PCIe Switch芯片传输所述PCIe信号;上行连接的所述PCIe设备指主机,下行连接的所述PCIe设备为图形处理器、固态硬盘或内存;In some embodiments, the baseboard management controller 225 and the task management unit are specifically used to send corresponding configuration information to each PCIe Switch chip when identifying the PCIe interconnection topology; the configuration information includes type information of all connection interfaces of the PCIe Switch chip; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information and internal interconnection interface information; the uplink interface information indicates that the corresponding connection interface is an uplink interface, and the uplink interface is used to transmit PCIe signals to the PCIe device connected upstream; the downlink interface information indicates that the corresponding connection interface is a downlink interface, and the downlink interface is used to transmit the PCIe signal to the PCIe device connected downstream; the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips interconnected with the current PCIe Switch chip; the uplink PCIe device refers to the host, and the downlink PCIe device is a graphics processor, a solid state drive or a memory;

所述PCIe Switch芯片用于基于接收到的所述配置信息,对自身的全部所述连接接口进行配置。The PCIe Switch chip is used to configure all of its own connection interfaces based on the received configuration information.

具体地,在所述基板管理控制器225为所述目标管理单元的情况下,所述基板管理控制器225接收所述ID信号,基于全部所述ID信号,进行拓扑识别,得到PCIe互联拓扑。在识别到所述PCIe互联拓扑的情况下,向每个所述PCIe Switch芯片120分别发送对应的配置信息;所述配置信息包括所述PCIe Switch芯片120的全部连接接口的类型信息;所述连接接口包括用于与所述对内连接器或所述对外连接器连接的接口;所述类型信息包括:上行接口信息、下行接口信息以及对内互联接口信息;所述上行接口信息指对应的所述连接接口为上行接口,所述上行接口用于向上行连接的所述PCIe设备传输PCIe信号;所述下行接口信息指对应的所述连接接口为下行接口,所述下行接口用于向下行连接的所述PCIe设备传输所述PCIe信号;所述互联接口信息指对应的连接接口为对内互联接口,所述对内互联接口用于向与当前所述PCIe Switch芯片互联的其余所述PCIe Switch芯片120传输所述PCIe信号;上行连接的所述PCIe设备指主机,下行连接的所述PCIe设备为图形处理器、固态硬盘或内存。所述PCIe Switch芯片120用于基于接收到的所述配置信息,对自身的全部所述连接接口进行配置。Specifically, when the baseboard management controller 225 is the target management unit, the baseboard management controller 225 receives the ID signal, performs topology identification based on all the ID signals, and obtains the PCIe interconnection topology. When the PCIe interconnection topology is identified, corresponding configuration information is sent to each PCIe Switch chip 120 respectively; the configuration information includes type information of all connection interfaces of the PCIe Switch chip 120; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information and internal interconnection interface information; the uplink interface information indicates that the corresponding connection interface is an uplink interface, and the uplink interface is used to transmit PCIe signals to the PCIe device connected upstream; the downlink interface information indicates that the corresponding connection interface is a downlink interface, and the downlink interface is used to transmit the PCIe signal to the PCIe device connected downstream; the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips 120 interconnected with the current PCIe Switch chip; the uplink PCIe device refers to the host, and the downlink PCIe device is a graphics processor, a solid state drive or a memory. The PCIe Switch chip 120 is used to configure all of its own connection interfaces based on the received configuration information.

同样地,在所述任务管理单元为所述目标管理单元的情况下,所述任务管理单元接收所述ID信号,基于全部所述ID信号,进行拓扑识别,得到PCIe互联拓扑。在识别到所述PCIe互联拓扑的情况下,向每个所述PCIe Switch芯片120分别发送对应的配置信息;所述配置信息包括所述PCIe Switch芯片120的全部连接接口的类型信息;所述连接接口包括用于与所述对内连接器或所述对外连接器连接的接口;所述类型信息包括:上行接口信息、下行接口信息以及对内互联接口信息;所述上行接口信息指对应的所述连 接接口为上行接口,所述上行接口用于向上行连接的所述PCIe设备传输PCIe信号;所述下行接口信息指对应的所述连接接口为下行接口,所述下行接口用于向下行连接的所述PCIe设备传输所述PCIe信号;所述互联接口信息指对应的连接接口为对内互联接口,所述对内互联接口用于向与当前所述PCIe Switch芯片互联的其余所述PCIe Switch芯片120传输所述PCIe信号;上行连接的所述PCIe设备指主机,下行连接的所述PCIe设备为图形处理器、固态硬盘或内存。所述PCIe Switch芯片120用于基于接收到的所述配置信息,对自身的全部所述连接接口进行配置。Similarly, when the task management unit is the target management unit, the task management unit receives the ID signal, performs topology identification based on all the ID signals, and obtains the PCIe interconnection topology. When the PCIe interconnection topology is identified, corresponding configuration information is sent to each PCIe Switch chip 120; the configuration information includes type information of all connection interfaces of the PCIe Switch chip 120; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information, and internal interconnection interface information; the uplink interface information refers to the corresponding connection interface information. The connection interface is an upstream interface, and the upstream interface is used to transmit PCIe signals to the upstream connected PCIe device; the downstream interface information indicates that the corresponding connection interface is a downstream interface, and the downstream interface is used to transmit the PCIe signal to the downstream connected PCIe device; the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips 120 interconnected with the current PCIe Switch chip; the upstream connected PCIe device refers to the host, and the downstream connected PCIe device is a graphics processor, a solid state drive or a memory. The PCIe Switch chip 120 is used to configure all of its own connection interfaces based on the received configuration information.

在一些实施例中,所述PCIe Switch芯片120用于在接收到所述PCIe设备发送的所述PCIe信号的情况下,获取所述PCIe信号对应的目标设备类型;基于所述目标设备类型,从当前所述PCIe Switch芯片120的全部所述连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;通过所述目标接口向所述目标PCIe设备发送状态查询指令;接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态为所述目标PCIe设备基于所述状态查询指令反馈的;将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。In some embodiments, the PCIe Switch chip 120 is used to obtain the target device type corresponding to the PCIe signal when receiving the PCIe signal sent by the PCIe device; based on the target device type, determine at least one target interface corresponding to the target device type from all the connection interfaces of the current PCIe Switch chip 120, and determine that the PCIe device connected to the target interface is the target PCIe device; send a status query instruction to the target PCIe device through the target interface; receive the task processing status fed back by each of the target PCIe devices, the task processing status being fed back by the target PCIe device based on the status query instruction; and send the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.

在一些实施例中,所述PCIe Switch芯片120具体用于在全部所述目标PCIe设备反馈的所述任务处理状态均为工作状态的情况下,将所述PCIe信号传输至预设的消息队列;在监测到任一所述目标PCIe设备为所述空闲状态的情况下,基于所述消息队列,将所述PCIe信号传输至空闲状态的所述目标PCIe设备。In some embodiments, the PCIe Switch chip 120 is specifically used to transmit the PCIe signal to a preset message queue when the task processing status fed back by all the target PCIe devices is in the working state; and when any of the target PCIe devices is detected to be in the idle state, based on the message queue, transmit the PCIe signal to the target PCIe device in the idle state.

通过上述方式,能够较好地提高PCIe信号的传输效率,可行性较强。Through the above method, the transmission efficiency of PCIe signals can be improved well, and the feasibility is strong.

需要说明的是,每个类型的目标PCIe设备均对应一个消息队列。例如:当目标PCIe设备的类型为固态硬盘,且全部目标PCIe设备均为工作状态,则将当前的PCIe信号放入与固态硬盘相对应的消息队列中。并且,实时监控全部的目标PCIe设备(固态硬盘)的任务处理状态。在任一所述目标PCIe设备的任务处理状态为所述空闲状态的情况下,将消息队列中的所述PCIe信号发送至该空闲状态的目标PCIe设备。It should be noted that each type of target PCIe device corresponds to a message queue. For example: when the type of the target PCIe device is a solid-state drive, and all target PCIe devices are in working state, the current PCIe signal is placed in the message queue corresponding to the solid-state drive. In addition, the task processing status of all target PCIe devices (solid-state drives) is monitored in real time. When the task processing status of any of the target PCIe devices is the idle state, the PCIe signal in the message queue is sent to the target PCIe device in the idle state.

在一些实施例中,所述基板管理控制器225和所述任务管理单元在作为目标管理单元时,均通过对IO拓展单元中PCIe Switch芯片120进行管理调度,以实现所述PCIe信号的交换,灵活度较高。In some embodiments, when acting as target management units, the baseboard management controller 225 and the task management unit both manage and schedule the PCIe Switch chip 120 in the IO expansion unit to achieve the exchange of the PCIe signals, which has high flexibility.

在一些实施例中,还包括:网络管理单元310,所述网络管理单元310分别与所述基板管理控制器225、所述任务管理单元和所述IO拓展单元连接,以完成所述基板管理控制器225、所述IO拓展单元、以及全部所述任务管理单元之间的两两互通。In some embodiments, it also includes: a network management unit 310, which is connected to the baseboard management controller 225, the task management unit and the IO expansion unit respectively to complete the intercommunication between the baseboard management controller 225, the IO expansion unit, and all the task management units.

具体地,所述基板管理控制器225、所述IO拓展单元、以及全部所述任务管理单元之间,通过所述网络管理单元310,实现两两互通,即实现两两之间的以太网(Ethernet)通信,从而完成实时的数据交换与数据备份,有助于提高所述基板管理控制器225和全部所述任务管理单元之间的任务切换速度(该任务指信号分配或处理任务),并且,有助于提高IO交换机的稳定性。基于上述实施例中的网络管理单元310,能够实现所述基板管理控制器225、所述任务管理单元和所述IO拓展单元之间的星型互联,任两个设备之间均可以互相访问。 Specifically, the baseboard management controller 225, the IO expansion unit, and all the task management units can communicate with each other through the network management unit 310, that is, realize Ethernet communication between each other, so as to complete real-time data exchange and data backup, which helps to improve the task switching speed between the baseboard management controller 225 and all the task management units (the task refers to the signal distribution or processing task), and helps to improve the stability of the IO switch. Based on the network management unit 310 in the above embodiment, the baseboard management controller 225, the task management unit and the IO expansion unit can be connected in a star shape, and any two devices can access each other.

需要说明的是,所述IO拓展单元的每个所述PCIe Switch芯片120均设有网络接口,用于实现网络连接。It should be noted that each PCIe Switch chip 120 of the IO expansion unit is provided with a network interface for realizing network connection.

在一些实施例中,所述网络管理单元310还外接有多个网络接口,所述网络接口可用于访问基板管理控制器225、任务管理单元、PCIe Switch芯片120中的任一个设备。所述网络接口可采用RJ45(Registered Jack 45,一种连接器)网络接口。In some embodiments, the network management unit 310 is also externally connected to a plurality of network interfaces, which can be used to access any device among the baseboard management controller 225, the task management unit, and the PCIe Switch chip 120. The network interface can be an RJ45 (Registered Jack 45, a type of connector) network interface.

所述基板管理控制器225和所述任务管理单元均通过所述网络管理单元310,将配置信息传输至所述IO拓展单元。The baseboard management controller 225 and the task management unit both transmit configuration information to the IO expansion unit through the network management unit 310 .

在一些实施例中,还包括:散热单元320,用于对所述IO交换机进行散热。具体地,所述散热单元320由4个双转子风扇构成,支持正反转,支持风扇冗余设计,保证IO交换机散热,有助于提高IO交换机的稳定性。需要说明的是,所述双转子风扇的数量可以根据实际情况进行增减,以满足不同的场景需求。In some embodiments, it also includes: a heat dissipation unit 320 for dissipating heat for the IO switch. Specifically, the heat dissipation unit 320 is composed of 4 dual-rotor fans, supports forward and reverse rotation, supports fan redundancy design, ensures the heat dissipation of the IO switch, and helps to improve the stability of the IO switch. It should be noted that the number of the dual-rotor fans can be increased or decreased according to actual conditions to meet the needs of different scenarios.

在一些实施例中,还包括:电源轨(Power Rail)单元330,所述电源轨单元330的输入端用于与供电单元连接,所述电源轨单元330的输出端分别与所述IO拓展单元、所述ID管理单元210、所述基板管理控制器225和所述任务管理单元连接,用于为所述IO拓展单元、所述ID管理单元210、所述基板管理控制器225和所述任务管理单元供电。在一些实施例中,所述电源轨单元330还用于为所述散热单元320供电。具体地,可以采用12V的电源为散热单元320供电。通过设置电源轨单元330,能够满足IO交换机中多设备的供电需求。In some embodiments, it further includes: a power rail unit 330, the input end of the power rail unit 330 is used to connect to the power supply unit, and the output end of the power rail unit 330 is respectively connected to the IO expansion unit, the ID management unit 210, the baseboard management controller 225 and the task management unit, and is used to supply power to the IO expansion unit, the ID management unit 210, the baseboard management controller 225 and the task management unit. In some embodiments, the power rail unit 330 is also used to supply power to the heat dissipation unit 320. Specifically, a 12V power supply can be used to supply power to the heat dissipation unit 320. By setting the power rail unit 330, the power supply requirements of multiple devices in the IO switch can be met.

在一些实施例中,所述供电单元包括:PSU(Power Supply Unit,电源供应器,交流电)供电单元340和HVDC(High Voltage Direct Current,高压直流电)供电单元350。通过兼容PSU供电单元340和HVDC供电单元350,满足多种供电需求。PSU供电单元340与外部的交流(AC)电源连接,HVDC供电单元350与外部的直流(DC)电源(直流电源)连接,以兼容交流电源和直流电源的方式,将外部电源转换为IO交换机内部使用的12V,以满足不同的供电场景。上述交流电源可以为220V等,直流电源可以为380V等。In some embodiments, the power supply unit includes: a PSU (Power Supply Unit, power supply, alternating current) power supply unit 340 and an HVDC (High Voltage Direct Current, high voltage direct current) power supply unit 350. By being compatible with the PSU power supply unit 340 and the HVDC power supply unit 350, a variety of power supply requirements can be met. The PSU power supply unit 340 is connected to an external alternating current (AC) power supply, and the HVDC power supply unit 350 is connected to an external direct current (DC) power supply (DC power supply). In a compatible manner between the AC power supply and the DC power supply, the external power supply is converted into 12V used inside the IO switch to meet different power supply scenarios. The above-mentioned AC power supply can be 220V, etc., and the DC power supply can be 380V, etc.

在一些实施例中,所述逻辑控制单元222还用于向所述电源轨单元330发送电源开关信号(PWREN),以控制所述电源轨开始供电或停止供电。In some embodiments, the logic control unit 222 is further configured to send a power switch signal (PWREN) to the power rail unit 330 to control the power rail to start or stop supplying power.

在一些实施例中,所述第一预设数量的所述对内连接器130分布于对应的所述拓展基板110的第一侧,所述第二预设数量的所述对外连接器140分布于对应的所述拓展基板110第二侧,所述第一侧和第二侧为相对侧。通过采用上述方式进行对内连接器130与对外连接器140的布置,能够降低接线难度和复杂度。In some embodiments, the first preset number of the inner connectors 130 are distributed on the first side of the corresponding extension base 110, and the second preset number of the outer connectors 140 are distributed on the second side of the corresponding extension base 110, and the first side and the second side are opposite sides. By adopting the above method to arrange the inner connectors 130 and the outer connectors 140, the wiring difficulty and complexity can be reduced.

上述实施例中的IO交换机,通过采用灵活度较高的IO拓展架构,能够实现大量或足量的IO资源的引出,以连接上下行的PCIe设备。并且,通过采用多路选择器221、逻辑控制单元222、第一任务管理单元223和第二任务管理单元224,能够较好地保证IO交换机运行的稳定性和可靠性,成本较低,可实施性较高,兼具高密性、健壮性和灵活性。具有强大的系统健壮性,以维持IO交换机系统稳定性,保障整机柜内诸多设备间的IO数据交换。其具有灵活的拓展性,兼容不同供电方式、不同形式的PCIe Switch互联拓 扑。The IO switch in the above embodiment can realize the extraction of a large number or sufficient IO resources to connect upstream and downstream PCIe devices by adopting a highly flexible IO expansion architecture. In addition, by adopting a multiplexer 221, a logic control unit 222, a first task management unit 223 and a second task management unit 224, the stability and reliability of the IO switch operation can be better guaranteed, with low cost, high feasibility, and high density, robustness and flexibility. It has strong system robustness to maintain the stability of the IO switch system and ensure IO data exchange between many devices in the entire cabinet. It has flexible scalability and is compatible with different power supply methods and different forms of PCIe Switch interconnection topologies. flutter.

下面对本申请提供的PCIe设备进行描述,下文描述的PCIe设备与上文描述的IO交换机可相互对应参照。The PCIe device provided in the present application is described below. The PCIe device described below and the IO switch described above can refer to each other.

本实施例还提供一种PCIe设备,包括:交换机连接器,所述交换机连接器用于与上述所述的IO拓展架构中的对外连接器140连接,或者用于与上述中任一项所述的IO交换机中的对外连接器140连接。本实施例里中的PCIe设备,通过与上述所述的IO拓展架构中的对外连接器140连接,或者用于与上述中任一项所述的IO交换机中的对外连接器140连接,能够较好地实现不同PCIe资源之间的交互与互通,灵活度较高。This embodiment further provides a PCIe device, including: a switch connector, the switch connector is used to connect to the external connector 140 in the above-mentioned IO expansion architecture, or to connect to the external connector 140 in any one of the above-mentioned IO switches. The PCIe device in this embodiment can better realize the interaction and intercommunication between different PCIe resources by connecting to the external connector 140 in the above-mentioned IO expansion architecture, or to connect to the external connector 140 in any one of the above-mentioned IO switches, and has high flexibility.

在一些实施例中,所述交换机连接器与所述对外连接器140为同一种连接器。具体地,所述交换机连接器为CDFP连接器。In some embodiments, the switch connector is the same type of connector as the external connector 140. Specifically, the switch connector is a CDFP connector.

在一些实施例中,所述PCIe设备为主机(Host)、图形处理器(GPU)、固态硬盘(SSD)或内存(Memory)。In some embodiments, the PCIe device is a host, a graphics processing unit (GPU), a solid state drive (SSD), or a memory.

下面对本申请提供的PCIe融合架构系统进行描述,下文描述的PCIe融合架构系统与上文描述的IO交换机、PCIe设备可相互对应参照。The PCIe fusion architecture system provided by the present application is described below. The PCIe fusion architecture system described below and the IO switch and PCIe device described above can be referenced to each other.

本实施例还提供一种PCIe融合架构系统,包括:This embodiment also provides a PCIe fusion architecture system, including:

如上述中任一项所述的IO交换机,以及多个如上述中任一项所述的PCIe设备,多个所述PCIe设备分别与对应的所述对外连接器140连接。本实施例中的PCIe融合架构系统,能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,可实施性较强,成本较低。The IO switch as described in any one of the above, and the multiple PCIe devices as described in any one of the above, the multiple PCIe devices are respectively connected to the corresponding external connector 140. The PCIe converged architecture system in this embodiment can better support the extraction of a large number or sufficient IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, strong feasibility, and low cost.

请参考图4,以8个PCIe Switch芯片120为例,每个PCIe Switch芯片120(图4中以SW作为简化表示)均对应连有5个对外连接器140(图4中以H/D作为简化表示)、以及4个对内连接器130(图4中以F作为简化表示)。每个对内连接器130均用于与其他PCIe Switch芯片120的对内连接器130连接,以形成PCIe Switch芯片120之间的互联。每个对外连接器140均连接有一个PCIe设备,PCIe设备可以为主机、固态硬盘、图形处理器或内存。图4中示例性地将8个PCIe Switch芯片120分为上面4个和下面4个,将上面4个PCIe Switch芯片120的对外连接器140与主机(如图4中的Host0…Host4、Host5…Host9、Host10…Host14、Host15…Host19)连接。并且将下面4个PCIe Switch芯片120的对外连接器140与固态硬盘、图形处理器或内存(如图4中的Device(设备)0…Device4、Device5…Device9、Device10…Device14、Device15…Device19)连接,即图4中的Device可以为固态硬盘、图形处理器或内存。以Host-IO交换机-Device的方式,构建起PCIe融合架构系统,可实时性较强,较好地满足了复杂拓扑的构建需求,能够将诸多GPU资源、SSD资源和Memory资源在不同Host之间进行资源调度与分配。需要提及的是,在具体实施例过程中,还可以采用其他拓扑形式进行拓扑搭建。Please refer to FIG4 . Taking eight PCIe Switch chips 120 as an example, each PCIe Switch chip 120 (in FIG4 , SW is used as a simplified representation) is connected to five external connectors 140 (in FIG4 , H/D is used as a simplified representation) and four internal connectors 130 (in FIG4 , F is used as a simplified representation). Each internal connector 130 is used to connect to the internal connectors 130 of other PCIe Switch chips 120 to form an interconnection between the PCIe Switch chips 120. Each external connector 140 is connected to a PCIe device, which can be a host, a solid-state drive, a graphics processor, or a memory. In FIG4 , eight PCIe Switch chips 120 are exemplarily divided into four upper and four lower chips, and the external connectors 140 of the four upper PCIe Switch chips 120 are connected to the host (such as Host0...Host4, Host5...Host9, Host10...Host14, Host15...Host19 in FIG4 ). And the external connectors 140 of the four lower PCIe Switch chips 120 are connected to the solid state drive, graphics processor or memory (such as Device 0...Device4, Device5...Device9, Device10...Device14, Device15...Device19 in FIG4 ), that is, the Device in FIG4 can be a solid state drive, a graphics processor or a memory. In the Host-IO switch-Device mode, a PCIe fusion architecture system is built, which has strong real-time performance and better meets the construction requirements of complex topologies. It can schedule and allocate many GPU resources, SSD resources and Memory resources between different hosts. It should be mentioned that in the specific implementation process, other topological forms can also be used for topological construction.

再例如:以6个PCIe Switch芯片120为例,每个PCIe Switch芯片120均对应连有5个对外连接器140、以及3个或4个对内连接器130。每个对内连接器130均用于与其他PCIe Switch芯片120的对内连接器130连接,以形成PCIe Switch芯片120之间的互联。 每个对外连接器140均连接有一个PCIe设备,PCIe设备可以为主机、固态硬盘、图形处理器或内存。可以将6个PCIe Switch芯片120分为上面3个和下面3个,将上面3个PCIe Switch芯片120的对外连接器140与主机连接。并且将下面3个PCIe Switch芯片120的对外连接器140与固态硬盘、图形处理器或内存连接。从而实现基于6个PCIe Switch芯片120的融合架构或PCIe互联拓扑的构建。能够将诸多GPU资源、SSD资源和Memory资源在不同Host之间进行资源调度与分配,满足不同的融合架构构建需求。For another example, taking six PCIe Switch chips 120 as an example, each PCIe Switch chip 120 is connected to five external connectors 140 and three or four internal connectors 130. Each internal connector 130 is used to connect to the internal connector 130 of other PCIe Switch chips 120 to form interconnections between the PCIe Switch chips 120. Each external connector 140 is connected to a PCIe device, which may be a host, a solid-state drive, a graphics processor, or a memory. The six PCIe Switch chips 120 may be divided into three upper and three lower ones, and the external connectors 140 of the three upper PCIe Switch chips 120 may be connected to the host. The external connectors 140 of the three lower PCIe Switch chips 120 may be connected to a solid-state drive, a graphics processor, or a memory. This may enable the construction of a converged architecture or a PCIe interconnection topology based on six PCIe Switch chips 120. It may be possible to schedule and allocate a number of GPU resources, SSD resources, and Memory resources between different Hosts to meet different converged architecture construction requirements.

另外,以4个PCIe Switch芯片120为例,对应地,每两个PCIe Switch芯片120设置于同一拓展基板110。每个PCIe Switch芯片120均对应连有5个对外连接器140、以及2个、3个或4个对内连接器130。每个对内连接器130均用于与其他PCIe Switch芯片120的对内连接器130连接,以形成PCIe Switch芯片120之间的互联。每个对外连接器140均连接有一个PCIe设备,PCIe设备可以为主机、固态硬盘、图形处理器或内存。可以将4个PCIe Switch芯片120分为上面2个和下面2个,将上面2个PCIe Switch芯片120的对外连接器140与主机连接。并且将下面2个PCIe Switch芯片120的对外连接器140与固态硬盘、图形处理器或内存连接。从而实现基于4个PCIe Switch芯片120的融合架构或PCIe互联拓扑的构建,灵活度较高。In addition, taking four PCIe Switch chips 120 as an example, correspondingly, every two PCIe Switch chips 120 are arranged on the same expansion substrate 110. Each PCIe Switch chip 120 is connected to five external connectors 140 and two, three or four internal connectors 130. Each internal connector 130 is used to connect to the internal connectors 130 of other PCIe Switch chips 120 to form an interconnection between the PCIe Switch chips 120. Each external connector 140 is connected to a PCIe device, which can be a host, a solid state drive, a graphics processor or a memory. The four PCIe Switch chips 120 can be divided into two upper and two lower ones, and the external connectors 140 of the two upper PCIe Switch chips 120 are connected to the host. And the external connectors 140 of the two lower PCIe Switch chips 120 are connected to the solid state drive, the graphics processor or the memory. This enables the construction of a fusion architecture or PCIe interconnection topology based on four PCIe Switch chips 120, with high flexibility.

再以2个PCIe Switch芯片120为例,对应地,该两个PCIe Switch芯片120设置于同一拓展基板110。每个PCIe Switch芯片120均对应连有5个对外连接器140、以及1个对内连接器130。每个PCIe Switch芯片120均通过各自的对内连接器130与其他PCIe Switch芯片120的对内连接器130连接,以形成两个PCIe Switch芯片120之间的互联。每个对外连接器140均连接有一个PCIe设备,PCIe设备可以为主机、固态硬盘、图形处理器或内存。将其中一个PCIe Switch芯片120的对外连接器140与主机连接。并且将另一个PCIe Switch芯片120的对外连接器140与固态硬盘、图形处理器或内存连接。从而实现基于2个PCIe Switch芯片120的融合架构或PCIe互联拓扑的构建,灵活度较高,成本较低。Take two PCIe Switch chips 120 as an example. Correspondingly, the two PCIe Switch chips 120 are arranged on the same expansion substrate 110. Each PCIe Switch chip 120 is connected to five external connectors 140 and one internal connector 130. Each PCIe Switch chip 120 is connected to the internal connector 130 of other PCIe Switch chips 120 through its own internal connector 130 to form an interconnection between the two PCIe Switch chips 120. Each external connector 140 is connected to a PCIe device, which can be a host, a solid state drive, a graphics processor or a memory. Connect the external connector 140 of one PCIe Switch chip 120 to the host. And connect the external connector 140 of another PCIe Switch chip 120 to the solid state drive, the graphics processor or the memory. This enables construction of a fusion architecture or PCIe interconnection topology based on two PCIe Switch chips 120, with high flexibility and low cost.

以上以8个、6个、4个和2个PCIe Switch芯片120分别进行举例,可知,上述实施例中的PCIe融合架构系统具有较高的灵活度。在具体实施过程中,可以根据实际需求对上述PCIe互联拓扑进行适应性改动,此处不再赘述。The above examples are respectively given with 8, 6, 4 and 2 PCIe Switch chips 120. It can be seen that the PCIe fusion architecture system in the above embodiment has a high degree of flexibility. In the specific implementation process, the above PCIe interconnection topology can be adaptively modified according to actual needs, which will not be repeated here.

需要说明的是,上述实施例中的PCIe融合架构系统能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,兼具高密性、健壮性和灵活性,能够满足不同场景的应用需求。It should be noted that the PCIe converged architecture system in the above embodiment can better support the extraction of a large or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in the converged architecture system, has high flexibility, and provides the possibility of engineering implementation for the construction of complex topologies in the converged architecture system. It has high density, robustness and flexibility, and can meet the application requirements of different scenarios.

下面对本申请提供的PCIe信号融合管理方法进行描述,下文描述的PCIe信号融合管理方法与上文描述的PCIe融合架构系统可相互对应参照。The PCIe signal fusion management method provided in the present application is described below. The PCIe signal fusion management method described below and the PCIe fusion architecture system described above can be referenced to each other.

请参考图5,本实施例还提供一种基于如上述所述的PCIe融合架构系统的PCIe信号融合管理方法,包括:Please refer to FIG. 5 , this embodiment further provides a PCIe signal fusion management method based on the PCIe fusion architecture system as described above, including:

S510:接收PCIe设备发送的PCIe信号。S510: Receive a PCIe signal sent by a PCIe device.

S520:获取所述PCIe信号对应的目标设备类型。 S520: Obtain a target device type corresponding to the PCIe signal.

S530:基于所述目标设备类型,从当前PCIe Switch芯片的全部连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备。S530: Based on the target device type, determine at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determine that the PCIe device connected to the target interface is the target PCIe device.

S540:通过所述目标接口向所述目标PCIe设备发送状态查询指令。S540: Send a status query instruction to the target PCIe device through the target interface.

S550:接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态信息为所述目标PCIe设备基于所述状态查询指令反馈的。S550: Receive a task processing status fed back by each of the target PCIe devices, where the task processing status information is fed back by the target PCIe device based on the status query instruction.

S560:将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。本方法能够较好地提高PCIe融合架构系统中PCIe信号的传输效率,灵活度较高,可实施性较强,成本较低。S560: Send the PCIe signal to any target PCIe device whose task processing state is idle. This method can improve the transmission efficiency of PCIe signals in a PCIe converged architecture system, has high flexibility, strong feasibility and low cost.

在一些实施例中,所述PCIe信号融合管理方法,还包括:在在全部所述目标PCIe设备反馈的所述任务处理状态均为工作状态的情况下,将所述PCIe信号传输至预设的消息队列;在监测到任一所述目标PCIe设备为所述空闲状态的情况下,基于所述消息队列,将所述PCIe信号传输至空闲状态的所述目标PCIe设备。In some embodiments, the PCIe signal fusion management method further includes: when the task processing status fed back by all the target PCIe devices is in a working state, transmitting the PCIe signal to a preset message queue; when any of the target PCIe devices is monitored to be in an idle state, based on the message queue, transmitting the PCIe signal to the target PCIe device in an idle state.

需要说明的是,上述实施例中的PCIe信号融合管理方法,基于以上实施例中的PCIe融合架构系统进行。所述PCIe融合架构系统能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低,兼具高密性、健壮性和灵活性。It should be noted that the PCIe signal fusion management method in the above embodiment is based on the PCIe fusion architecture system in the above embodiment. The PCIe fusion architecture system can better support the extraction of a large or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in the fusion architecture system, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in the fusion architecture system, has low cost, and has both high density, robustness and flexibility.

综上,本申请提供的IO拓展架构、IO交换机及PCIe设备,通过设置至少一个IO拓展单元,IO拓展单元包括拓展基板,拓展基板设有至少两个PCIe Switch芯片、第一预设数量的对内连接器,以及第二预设数量的对外连接器;对内连接器的一端与拓展基板上任一个PCIe Switch芯片连接,对内连接器的另一端用于PCIe Switch芯片之间的互联;对外连接器的一端与拓展基板上任一个PCIe Switch芯片连接,对外连接器的另一端用于连接PCIe设备,以完成多个PCIe设备之间的互联。能够较好地支持大量或足量的IO资源的引出,能够较好地满足融合架构系统中复杂拓扑的构建需求,灵活度较高,为融合架构系统中复杂拓扑的构建提供了工程实施的可能性,成本较低。并且,通过多次试验,提出了可行性较高的PCIe互联拓扑的搭建方案,能够在支持服务器的尺寸标准定义的情况下,满足不同场景的复杂拓扑的构建需求。兼具高密性、健壮性和灵活性。具有足够多的PCIe拓展能力用以连接上下行的PCIe设备。以及具有强大的系统健壮性,以维持IO交换机系统稳定性,保障整机柜内诸多设备间的IO数据或PCIe信号的交换。In summary, the IO expansion architecture, IO switch and PCIe device provided by the present application are provided with at least one IO expansion unit, the IO expansion unit includes an expansion substrate, the expansion substrate is provided with at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; one end of the internal connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the internal connector is used for interconnection between PCIe Switch chips; one end of the external connector is connected to any PCIe Switch chip on the expansion substrate, and the other end of the external connector is used to connect PCIe devices to complete the interconnection between multiple PCIe devices. It can better support the extraction of a large number or sufficient amount of IO resources, can better meet the construction requirements of complex topologies in converged architecture systems, has high flexibility, provides the possibility of engineering implementation for the construction of complex topologies in converged architecture systems, and has low cost. In addition, through multiple experiments, a construction plan for PCIe interconnection topology with high feasibility is proposed, which can meet the construction requirements of complex topologies in different scenarios while supporting the standard definition of server size. It has both high density, robustness and flexibility. It has enough PCIe expansion capabilities to connect upstream and downstream PCIe devices. It also has strong system robustness to maintain the stability of the IO switch system and ensure the exchange of IO data or PCIe signals between many devices in the entire cabinet.

图6示例了一种电子设备的实体结构示意图,如图6所示,该电子设备可以包括:处理器(processor)610、通信接口(Communications Interface)620、存储器(memory)630和通信总线640,其中,处理器610,通信接口620,存储器630通过通信总线640完成相互间的通信。处理器610可以调用存储器630中的逻辑指令,以执行PCIe信号融合管理方法,该方法包括:接收PCIe设备发送的PCIe信号;获取所述PCIe信号对应的目标设备类型;基于所述目标设备类型,从当前PCIe Switch芯片的全部连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标 PCIe设备;通过所述目标接口向所述目标PCIe设备发送状态查询指令;接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态信息为所述目标PCIe设备基于所述状态查询指令反馈的;将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。FIG6 illustrates a schematic diagram of the physical structure of an electronic device. As shown in FIG6 , the electronic device may include: a processor 610, a communications interface 620, a memory 630, and a communications bus 640, wherein the processor 610, the communications interface 620, and the memory 630 communicate with each other via the communications bus 640. The processor 610 may call the logic instructions in the memory 630 to execute a PCIe signal fusion management method, the method comprising: receiving a PCIe signal sent by a PCIe device; obtaining a target device type corresponding to the PCIe signal; based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining the PCIe device connected to the target interface as a target PCIe device; sending a status query instruction to the target PCIe device through the target interface; receiving the task processing status fed back by each of the target PCIe devices, wherein the task processing status information is fed back by the target PCIe device based on the status query instruction; sending the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.

此外,上述的存储器630中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对相关技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。In addition, the logic instructions in the above-mentioned memory 630 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product. Based on this understanding, the technical solution of the present application can be essentially or partly embodied in the form of a software product that contributes to the relevant technology. The computer software product is stored in a storage medium, including several instructions to enable a computer device (which can be a personal computer, server, or network device, etc.) to perform all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), disk or optical disk and other media that can store program codes.

又一方面,本申请还提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现以执行上述各方法提供的PCIe信号融合管理方法,该方法包括:接收PCIe设备发送的PCIe信号;获取所述PCIe信号对应的目标设备类型;基于所述目标设备类型,从当前PCIe Switch芯片的全部连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;通过所述目标接口向所述目标PCIe设备发送状态查询指令;接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态信息为所述目标PCIe设备基于所述状态查询指令反馈的;将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。On the other hand, the present application also provides a non-transitory computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, is implemented to execute the PCIe signal fusion management method provided by the above methods, the method comprising: receiving a PCIe signal sent by a PCIe device; obtaining a target device type corresponding to the PCIe signal; based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining that the PCIe device connected to the target interface is a target PCIe device; sending a status query instruction to the target PCIe device through the target interface; receiving a task processing status fed back by each of the target PCIe devices, the task processing status information being fed back by the target PCIe device based on the status query instruction; and sending the PCIe signal to any of the target PCIe devices whose task processing status is an idle state.

以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下,即可以理解并实施。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of this embodiment. Ordinary technicians in this field can understand and implement it without paying creative labor.

通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件。基于这样的理解,上述技术方案本质上或者说对相关技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。Through the description of the above implementation methods, those skilled in the art can clearly understand that each implementation method can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware. Based on this understanding, the above technical solution can be essentially or in other words, the part that contributes to the relevant technology can be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as ROM/RAM, a disk, an optical disk, etc., including a number of instructions for a computer device (which can be a personal computer, a server, or a network device, etc.) to execute the methods described in each embodiment or some parts of the embodiment.

最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。 Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, rather than to limit it. Although the present application has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that they can still modify the technical solutions described in the aforementioned embodiments, or make equivalent replacements for some of the technical features therein. However, these modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (20)

一种IO拓展架构,其特征在于,包括:An IO expansion architecture, characterized by comprising: 至少一个IO拓展单元,所述IO拓展单元包括拓展基板,所述拓展基板设有至少两个PCIe Switch芯片、第一预设数量的对内连接器,以及第二预设数量的对外连接器;At least one IO expansion unit, the IO expansion unit comprising an expansion base plate, the expansion base plate having at least two PCIe Switch chips, a first preset number of internal connectors, and a second preset number of external connectors; 所述对内连接器的一端与所述拓展基板上任一个所述PCIe Switch芯片连接,所述对内连接器的另一端用于所述PCIe Switch芯片之间的互联;所述对外连接器的一端与所述拓展基板上任一个所述PCIe Switch芯片连接,所述对外连接器的另一端用于连接PCIe设备,以完成多个所述PCIe设备之间的互联。One end of the internal connector is connected to any one of the PCIe Switch chips on the extension baseboard, and the other end of the internal connector is used for interconnection between the PCIe Switch chips; one end of the external connector is connected to any one of the PCIe Switch chips on the extension baseboard, and the other end of the external connector is used to connect a PCIe device to complete the interconnection between multiple PCIe devices. 一种IO交换机,其特征在于,包括:An IO switch, characterized by comprising: ID管理单元、控制模块、以及如权利要求1所述的IO拓展架构;An ID management unit, a control module, and an IO expansion architecture as claimed in claim 1; 所述ID管理单元用于采集所述IO拓展架构中全部所述对内连接器和所述对外连接器的ID信号,将所述ID信号传输至所述控制模块;The ID management unit is used to collect ID signals of all the internal connectors and the external connectors in the IO expansion architecture, and transmit the ID signals to the control module; 所述控制模块用于基于全部所述ID信号,对所述IO拓展架构中的全部所述PCIe Switch芯片进行管理。The control module is used to manage all the PCIe Switch chips in the IO expansion architecture based on all the ID signals. 根据权利要求2所述的IO交换机,其特征在于,The IO switch according to claim 2, characterized in that: 每个所述对内连接器和所述对外连接器均各自连接有一个ID定义芯片,所述ID定义芯片用于对对应的所述对内连接器或所述对外连接器进行ID定义;Each of the inner connector and the outer connector is connected to an ID definition chip, and the ID definition chip is used to perform ID definition on the corresponding inner connector or the outer connector; 所述ID管理单元具体用于通过所述对外连接器或所述对内连接器,向每个所述ID定义芯片发送ID查询指令;The ID management unit is specifically used to send an ID query instruction to each of the ID definition chips through the external connector or the internal connector; 所述ID定义芯片还用于接收所述ID查询指令,基于所述ID查询指令,向对应的所述对内连接器或所述对外连接器反馈所述ID信号;The ID definition chip is also used to receive the ID query instruction, and based on the ID query instruction, feed back the ID signal to the corresponding internal connector or the external connector; 所述对内连接器或所述对外连接器在接收到所述ID信号的情况下,将所述ID信号反馈至所述ID管理单元;When receiving the ID signal, the internal connector or the external connector feeds back the ID signal to the ID management unit; 所述ID管理单元分别与每个所述对外连接器、所述对内连接器和所述控制模块连接。The ID management unit is connected to each of the external connectors, the internal connectors and the control module respectively. 根据权利要求2所述的IO交换机,其特征在于,所述控制模块包括:The IO switch according to claim 2, wherein the control module comprises: 多路选择器,用于接收所述ID管理单元传输的所述ID信号;A multiplexer, used for receiving the ID signal transmitted by the ID management unit; 逻辑控制单元,用于监测基板管理控制器和全部任务管理单元的运行状态;基于所述运行状态、以及所述基板管理控制器和所述任务管理单元各自对应的任务管理优先级,确定所述基板管理控制器和全部所述任务管理单元中的其中一个为目标管理单元;基于确定的所述目标管理单元,生成目标管理单元指令;将所述目标管理单元指令发送至所述多路选择器;A logic control unit, configured to monitor the operating status of a baseboard management controller and all task management units; based on the operating status and the task management priorities corresponding to the baseboard management controller and the task management unit, determine one of the baseboard management controller and all the task management units as a target management unit; based on the determined target management unit, generate a target management unit instruction; and send the target management unit instruction to the multiplexer; 所述多路选择器还用于基于所述目标管理单元指令,确定所述目标管理单元,将所述ID信号发送至所述目标管理单元;The multiplexer is further used to determine the target management unit based on the target management unit instruction and send the ID signal to the target management unit; 基板管理控制器和至少一个任务管理单元,所述基板管理控制器和所述任务管理单元均用于在接收到所述ID信号的情况下,基于所述ID信号,进行PCIe拓扑识别,得到 PCIe互联拓扑;基于所述PCIe互联拓扑,对全部所述PCIe Switch芯片进行管理;A baseboard management controller and at least one task management unit, wherein the baseboard management controller and the task management unit are both used to perform PCIe topology identification based on the ID signal when receiving the ID signal, and obtain PCIe interconnection topology; based on the PCIe interconnection topology, managing all the PCIe Switch chips; 所述ID管理单元和所述多路选择器连接,所述多路选择器分别与所述逻辑控制单元、所述基板管理控制器、以及全部所述任务管理单元连接。The ID management unit is connected to the multiplexer, and the multiplexer is respectively connected to the logic control unit, the baseboard management controller, and all the task management units. 根据权利要求4所述的IO交换机,其特征在于,至少一个所述任务管理单元包括:第一任务管理单元和第二任务管理单元;The IO switch according to claim 4, characterized in that at least one of the task management units comprises: a first task management unit and a second task management unit; 所述逻辑控制单元具体用于当所述第一任务管理单元为正常运行状态时,将所述第一任务管理单元确定为所述目标管理单元;当所述第一任务管理单元为异常运行状态,且所述第二任务管理单元为正常运行状态时,将所述第二任务管理单元确定为所述目标管理单元;当所述第一任务管理单元和所述第二任务管理单元均为异常运行状态,且所述基板管理控制器为正常运行状态时,将所述基板管理控制器确定为所述目标管理单元。The logic control unit is specifically used to determine the first task management unit as the target management unit when the first task management unit is in a normal operating state; to determine the second task management unit as the target management unit when the first task management unit is in an abnormal operating state and the second task management unit is in a normal operating state; and to determine the baseboard management controller as the target management unit when both the first task management unit and the second task management unit are in an abnormal operating state and the baseboard management controller is in a normal operating state. 根据权利要求4所述的IO交换机,其特征在于,所述基板管理控制器和所述任务管理单元均具体用于在识别到所述PCIe互联拓扑的情况下,向每个所述PCIe Switch芯片分别发送对应的配置信息;所述配置信息包括所述PCIe Switch芯片的全部连接接口的类型信息;所述连接接口包括用于与所述对内连接器或所述对外连接器连接的接口;所述类型信息包括:上行接口信息、下行接口信息以及对内互联接口信息;所述上行接口信息指对应的所述连接接口为上行接口,所述上行接口用于向上行连接的所述PCIe设备传输PCIe信号;所述下行接口信息指对应的所述连接接口为下行接口,所述下行接口用于向下行连接的所述PCIe设备传输所述PCIe信号;所述互联接口信息指对应的连接接口为对内互联接口,所述对内互联接口用于向与当前所述PCIe Switch芯片互联的其余所述PCIe Switch芯片传输所述PCIe信号;上行连接的所述PCIe设备指主机,下行连接的所述PCIe设备为图形处理器、固态硬盘或内存;The IO switch according to claim 4 is characterized in that the baseboard management controller and the task management unit are specifically used to send corresponding configuration information to each of the PCIe Switch chips when the PCIe interconnection topology is identified; the configuration information includes type information of all connection interfaces of the PCIe Switch chip; the connection interface includes an interface for connecting to the internal connector or the external connector; the type information includes: uplink interface information, downlink interface information and internal interconnection interface information; the uplink interface information indicates that the corresponding connection interface is an uplink interface , the upstream interface is used to transmit PCIe signals to the PCIe device connected upstream; the downstream interface information indicates that the corresponding connection interface is a downstream interface, and the downstream interface is used to transmit the PCIe signal to the PCIe device connected downstream; the interconnection interface information indicates that the corresponding connection interface is an internal interconnection interface, and the internal interconnection interface is used to transmit the PCIe signal to the remaining PCIe Switch chips interconnected with the current PCIe Switch chip; the PCIe device connected upstream refers to the host, and the PCIe device connected downstream is a graphics processor, a solid-state drive or a memory; 所述PCIe Switch芯片用于基于接收到的所述配置信息,对自身的全部所述连接接口进行配置。The PCIe Switch chip is used to configure all of its own connection interfaces based on the received configuration information. 根据权利要求6所述的IO交换机,其特征在于,所述PCIe Switch芯片用于在接收到所述PCIe设备发送的所述PCIe信号的情况下,获取所述PCIe信号对应的目标设备类型;基于所述目标设备类型,从当前所述PCIe Switch芯片的全部所述连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;通过所述目标接口向所述目标PCIe设备发送状态查询指令;接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态为所述目标PCIe设备基于所述状态查询指令反馈的;将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。The IO switch according to claim 6 is characterized in that the PCIe Switch chip is used to obtain the target device type corresponding to the PCIe signal when receiving the PCIe signal sent by the PCIe device; based on the target device type, determine at least one target interface corresponding to the target device type from all the connection interfaces of the current PCIe Switch chip, and determine that the PCIe device connected to the target interface is the target PCIe device; send a status query instruction to the target PCIe device through the target interface; receive the task processing status fed back by each of the target PCIe devices, the task processing status being fed back by the target PCIe device based on the status query instruction; and send the PCIe signal to any of the target PCIe devices whose task processing status is an idle state. 根据权利要求7所述的IO交换机,其特征在于,所述PCIe Switch芯片具体用于在全部所述目标PCIe设备反馈的所述任务处理状态均为工作状态的情况下,将所述PCIe信号传输至预设的消息队列;在监测到任一所述目标PCIe设备为所述空闲状态的情况下,基于所述消息队列,将所述PCIe信号传输至空闲状态的所述目标PCIe设备。The IO switch according to claim 7 is characterized in that the PCIe Switch chip is specifically used to transmit the PCIe signal to a preset message queue when the task processing status fed back by all the target PCIe devices is in a working state; and when it is monitored that any of the target PCIe devices is in an idle state, based on the message queue, transmit the PCIe signal to the target PCIe device in an idle state. 根据权利要求6或7所述的IO交换机,其特征在于,还包括:网络管理单元,所 述网络管理单元分别与所述基板管理控制器、所述任务管理单元和所述IO拓展单元连接,以完成所述基板管理控制器、所述IO拓展单元、以及全部所述任务管理单元之间的两两互通。The IO switch according to claim 6 or 7, characterized in that it also includes: a network management unit, The network management unit is connected to the baseboard management controller, the task management unit and the IO expansion unit respectively to achieve two-to-two communication between the baseboard management controller, the IO expansion unit and all the task management units. 根据权利要求2所述的IO交换机,其特征在于,还包括:散热单元,用于对所述IO交换机进行散热。The IO switch according to claim 2, further comprising: a heat dissipation unit for dissipating heat from the IO switch. 根据权利要求4所述的IO交换机,其特征在于,还包括:电源轨单元,所述电源轨单元的输入端用于与供电单元连接,所述电源轨单元的输出端分别与所述IO拓展单元、所述ID管理单元、所述基板管理控制器和所述任务管理单元连接,用于为所述IO拓展单元、所述ID管理单元、所述基板管理控制器和所述任务管理单元供电。The IO switch according to claim 4 is characterized in that it further comprises: a power rail unit, the input end of the power rail unit is used to connect to the power supply unit, and the output end of the power rail unit is respectively connected to the IO expansion unit, the ID management unit, the baseboard management controller and the task management unit, for supplying power to the IO expansion unit, the ID management unit, the baseboard management controller and the task management unit. 根据权利要求11所述的IO交换机,其特征在于,所述逻辑控制单元还用于向所述电源轨单元发送电源开关信号,以控制所述电源轨单元开始供电或停止供电。The IO switch according to claim 11, characterized in that the logic control unit is further used to send a power switch signal to the power rail unit to control the power rail unit to start or stop power supply. 根据权利要求2所述的IO交换机,其特征在于,所述第一预设数量的所述对内连接器分布于对应的所述拓展基板的第一侧,所述第二预设数量的所述对外连接器分布于对应的所述拓展基板第二侧,所述第一侧和第二侧为相对侧。The IO switch according to claim 2, characterized in that the first preset number of the internal connectors are distributed on a first side of the corresponding extension base plate, the second preset number of the external connectors are distributed on a second side of the corresponding extension base plate, and the first side and the second side are opposite sides. 一种PCIe设备,其特征在于,包括:交换机连接器,所述交换机连接器用于与权利要求1所述的IO拓展架构中的对外连接器连接,或者用于与权利要求2至13中任一项所述的IO交换机中的对外连接器连接。A PCIe device, characterized in that it includes: a switch connector, wherein the switch connector is used to connect to the external connector in the IO expansion architecture described in claim 1, or to connect to the external connector in the IO switch described in any one of claims 2 to 13. 根据权利要求14所述的PCIe设备,其特征在于,所述交换机连接器与所述对外连接器为同一种连接器。The PCIe device according to claim 14, wherein the switch connector and the external connector are the same type of connector. 根据权利要求14所述的PCIe设备,其特征在于,所述PCIe设备为主机、图形处理器、固态硬盘或内存。The PCIe device according to claim 14 is characterized in that the PCIe device is a host, a graphics processor, a solid-state drive or a memory. 一种PCIe融合架构系统,其特征在于,包括:A PCIe fusion architecture system, comprising: 如权利要求2至13中任一项所述的IO交换机,以及多个如权利要求14至16中任一项所述的PCIe设备,多个所述PCIe设备分别与对应的所述对外连接器连接。An IO switch according to any one of claims 2 to 13, and a plurality of PCIe devices according to any one of claims 14 to 16, wherein the plurality of PCIe devices are respectively connected to the corresponding external connectors. 一种基于如权利要求17所述的PCIe融合架构系统的PCIe信号融合管理方法,其特征在于,包括:A PCIe signal fusion management method based on the PCIe fusion architecture system according to claim 17, characterized in that it includes: 接收PCIe设备发送的PCIe信号;Receive PCIe signals sent by PCIe devices; 获取所述PCIe信号对应的目标设备类型;Obtaining the target device type corresponding to the PCIe signal; 基于所述目标设备类型,从当前PCIe Switch芯片的全部连接接口中确定与所述目标设备类型相对应的至少一个目标接口,确定所述目标接口连接的所述PCIe设备为目标PCIe设备;Based on the target device type, determining at least one target interface corresponding to the target device type from all connection interfaces of the current PCIe Switch chip, and determining the PCIe device connected to the target interface as the target PCIe device; 通过所述目标接口向所述目标PCIe设备发送状态查询指令;Sending a status query instruction to the target PCIe device through the target interface; 接收每个所述目标PCIe设备反馈的任务处理状态,所述任务处理状态信息为所述目标PCIe设备基于所述状态查询指令反馈的;Receiving a task processing status fed back by each of the target PCIe devices, wherein the task processing status information is fed back by the target PCIe device based on the status query instruction; 将所述PCIe信号发送至所述任务处理状态为空闲状态的任一所述目标PCIe设备。The PCIe signal is sent to any of the target PCIe devices whose task processing state is an idle state. 一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述程序时实现如权利要求18所述的 PCIe信号融合管理方法。An electronic device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the method according to claim 18 when executing the program. PCIe signal fusion management method. 一种非暂态计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求18所述的PCIe信号融合管理方法。 A non-transitory computer-readable storage medium having a computer program stored thereon, wherein when the computer program is executed by a processor, the PCIe signal fusion management method as described in claim 18 is implemented.
PCT/CN2024/111690 2023-10-10 2024-08-13 Io expansion architecture, io switch and pcie device Pending WO2025077411A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202311304011.4A CN117041184B (en) 2023-10-10 2023-10-10 IO expansion device and IO switch
CN202311304011.4 2023-10-10

Publications (1)

Publication Number Publication Date
WO2025077411A1 true WO2025077411A1 (en) 2025-04-17

Family

ID=88624922

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/111690 Pending WO2025077411A1 (en) 2023-10-10 2024-08-13 Io expansion architecture, io switch and pcie device

Country Status (2)

Country Link
CN (1) CN117041184B (en)
WO (1) WO2025077411A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117041184B (en) * 2023-10-10 2024-02-09 苏州元脑智能科技有限公司 IO expansion device and IO switch
CN117472596B (en) * 2023-12-27 2024-03-22 苏州元脑智能科技有限公司 Distributed resource management method, device, system, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115586964A (en) * 2022-09-30 2023-01-10 苏州浪潮智能科技有限公司 Resource sharing device, resource management device, and resource management method
WO2023016379A1 (en) * 2021-08-12 2023-02-16 华为技术有限公司 Computer system, control method based on pcie device, and related device
CN115905094A (en) * 2022-10-13 2023-04-04 超聚变数字技术有限公司 An electronic device and its PCIe topology configuration method and device
CN116032746A (en) * 2023-03-28 2023-04-28 苏州浪潮智能科技有限公司 Resource pool information processing method and device, storage medium and electronic device
CN116185641A (en) * 2023-04-24 2023-05-30 苏州浪潮智能科技有限公司 Fusion architecture system, non-volatile storage system and storage resource acquisition method
CN117041184A (en) * 2023-10-10 2023-11-10 苏州元脑智能科技有限公司 IO expansion architecture, IO switch and PCIe device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780472A (en) * 2022-04-29 2022-07-22 苏州浪潮智能科技有限公司 Host and PCIe device decoupling method, device, equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023016379A1 (en) * 2021-08-12 2023-02-16 华为技术有限公司 Computer system, control method based on pcie device, and related device
CN115586964A (en) * 2022-09-30 2023-01-10 苏州浪潮智能科技有限公司 Resource sharing device, resource management device, and resource management method
CN115905094A (en) * 2022-10-13 2023-04-04 超聚变数字技术有限公司 An electronic device and its PCIe topology configuration method and device
CN116032746A (en) * 2023-03-28 2023-04-28 苏州浪潮智能科技有限公司 Resource pool information processing method and device, storage medium and electronic device
CN116185641A (en) * 2023-04-24 2023-05-30 苏州浪潮智能科技有限公司 Fusion architecture system, non-volatile storage system and storage resource acquisition method
CN117041184A (en) * 2023-10-10 2023-11-10 苏州元脑智能科技有限公司 IO expansion architecture, IO switch and PCIe device

Also Published As

Publication number Publication date
CN117041184A (en) 2023-11-10
CN117041184B (en) 2024-02-09

Similar Documents

Publication Publication Date Title
CN111400217B (en) An OCP adapter card and computer equipment
CN102473157B (en) Virtual thermal in share I/O environment inserts function
EP3035187B1 (en) Hard disk and management method
WO2025077411A1 (en) Io expansion architecture, io switch and pcie device
US20170220506A1 (en) Modular Software Defined Storage Technology
CN103107960A (en) Reducing impact of a switch failure in a switch fabric via switch cards
CN113645047A (en) Out-of-band management system and server based on intelligent network card
CN120406680B (en) Computing system, control method, device, equipment, medium and product thereof
CN115757236A (en) A storage expansion pooling device and system
CN118672367A (en) Computer system, bandwidth configuration method, product, server and medium
WO2025152504A1 (en) Server temperature control method, system and device, and storage medium
CN103793003A (en) Exchange plate and blade server
CN103984390B (en) Blade and blade server
CN117194299A (en) Hot-plug methods, PCIE devices and management controllers
US20250265213A1 (en) Pcie device management method, device, and server
CN120743662A (en) Test board and test system
CN113613347A (en) Single-host multi-disk cluster communication method, device and system
WO2025214078A1 (en) Computing device and control method
CN117666746B (en) Multi-node server, method, device and medium applied to multi-node server
WO2025161317A1 (en) Server
US20100088477A1 (en) Memory share system and memory share apparatus
CN118567954A (en) A whole cabinet server
CN216352292U (en) Server mainboard and server
CN204189089U (en) A kind of server
WO2023160691A1 (en) Connector-based port management method and related device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24876231

Country of ref document: EP

Kind code of ref document: A1