[go: up one dir, main page]

CN1731738A - Automated quick deployment method for large-scale computer cluster node - Google Patents

Automated quick deployment method for large-scale computer cluster node Download PDF

Info

Publication number
CN1731738A
CN1731738A CN 200510043160 CN200510043160A CN1731738A CN 1731738 A CN1731738 A CN 1731738A CN 200510043160 CN200510043160 CN 200510043160 CN 200510043160 A CN200510043160 A CN 200510043160A CN 1731738 A CN1731738 A CN 1731738A
Authority
CN
China
Prior art keywords
node
image
target node
target
management node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200510043160
Other languages
Chinese (zh)
Inventor
董小社
李纪云
孙发龙
伍卫国
胡雷钧
王恩东
王守昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Xian Jiaotong University
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd, Xian Jiaotong University filed Critical Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN 200510043160 priority Critical patent/CN1731738A/en
Publication of CN1731738A publication Critical patent/CN1731738A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Stored Programmes (AREA)

Abstract

本发明涉及一种大规模计算机集群系统节点的自动化快速部署方法,采用基于映像的安装方式和客户机/服务器结构,使用预引导执行环境PXE协议、动态主机配置协议DHCP、简单文件传送协议TFTP和网络唤醒技术Wakeup On LAN等技术来实现大规模集群系统的快速自动化部署。本方法包括获得映像、绑定映像与目标节点、安装目标节点、配置目标节点、测试等五个步骤实现了集群系统节点安装过程的完全自动化。与传统的计算机集群系统部署方式相比,该方法极大的简化了集群系统节点安装过程,能大幅度提高工作效率。

Figure 200510043160

The present invention relates to an automatic rapid deployment method for nodes of a large-scale computer cluster system, which adopts an image-based installation mode and a client/server structure, and uses a pre-boot execution environment PXE protocol, a dynamic host configuration protocol DHCP, a simple file transfer protocol TFTP and Wakeup On LAN technology and other technologies to realize the rapid and automatic deployment of large-scale cluster systems. The method includes five steps of obtaining the image, binding the image and the target node, installing the target node, configuring the target node, and testing, and realizing the complete automation of the cluster system node installation process. Compared with the traditional computer cluster system deployment method, this method greatly simplifies the cluster system node installation process and can greatly improve work efficiency.

Figure 200510043160

Description

大规模计算机集群系统节点的自动化快速部署方法Automated Rapid Deployment Method for Large-Scale Computer Cluster System Nodes

                              技术领域Technical field

本发明属于计算机网络技术领域,涉及一种用于大规模计算机集群系统节点的自动化快速部署方法,特别涉及在集群节点上安装部署操作系统和应用软件的方法。The invention belongs to the technical field of computer networks, and relates to an automatic rapid deployment method for large-scale computer cluster system nodes, in particular to a method for installing and deploying operating systems and application software on the cluster nodes.

                              背景技术 Background technique

高性能集群计算系统目前应用非常广泛,集群的规模也在不断扩大,在集群的构建和运行期间,集群大量节点上操作系统和应用软件包的人工重复安装和升级会成为一项艰巨的任务。High-performance cluster computing systems are currently widely used, and the scale of the cluster is also expanding. During the construction and operation of the cluster, manual repeated installation and upgrade of operating systems and application software packages on a large number of nodes in the cluster will become a difficult task.

在本发明以前的现有技术中,一般以手动或ghost(Symantec公司开发的硬盘复制软件)硬盘刻录或NFS(net file system,网络文件系统)共享网络安装方法来部署操作系统和应用软件。手动方法以光盘方式,安装每个节点会耗费大量的时间和人力,并且节点的配置过程非常繁琐,很容易出错。在构建集群的工程实践中常用的方法是ghost硬盘刻录的方法,这种方法需要对被安装节点的硬盘进行拆卸,并且要求目标磁盘与源磁盘大小相同;或者是NFS共享网络安装法,即把网络中的一个节点作为NFS服务器,把操作系统安装盘中的文件复制到这个节点上,被安装节点通过网卡或软盘启动,从服务器上下载文件完成安装。In the prior art before the present invention, operating system and application software are generally deployed with manual or ghost (hard disk duplication software developed by Symantec) hard disk recording or NFS (net file system, network file system) shared network installation method. The manual method uses the CD-ROM, and it takes a lot of time and manpower to install each node, and the configuration process of the nodes is very cumbersome and error-prone. The commonly used method in the engineering practice of building a cluster is the method of burning the ghost hard disk. This method needs to disassemble the hard disk of the node to be installed, and requires the target disk to be the same size as the source disk; or the NFS shared network installation method, that is, the A node in the network is used as an NFS server, and the files in the operating system installation disk are copied to this node. The installed node starts through a network card or a floppy disk, and downloads files from the server to complete the installation.

现有的一些部署软件如Oscar(Open Source Cluster Application Resource,开源集群应用软件资源,由Open Cluster Group开发的一套集群应用软件包)和Systemimager(系统镜像器)等都是针对特定集群的,应用范围较小,功能比较单一,操作使用不方便。Some existing deployment software such as Oscar (Open Source Cluster Application Resource, an open source cluster application software resource, a set of cluster application software packages developed by the Open Cluster Group) and Systemimager (system mirror) are all aimed at specific clusters. The range is small, the function is relatively single, and the operation is inconvenient.

                              发明内容Contents of Invention

针对上述现有技术状况,本发明的目的在于提供一种新的用于大规模计算机集群系统节点上的自动化部署方法,以简化集群安装工作,提高效率,实现安装过程的完全自动化。In view of the above-mentioned existing technical situation, the purpose of the present invention is to provide a new automatic deployment method for large-scale computer cluster system nodes, so as to simplify the cluster installation work, improve efficiency, and realize the complete automation of the installation process.

实现上述目的的技术解决方案是:本发明方法是基于客户机/服务器体系结构;节点软件安装采用基于映像的方式;映像是一个完整的系统,含有需要在目标节点上安装的操作系统和应用软件包;所有映像都存储在服务器上;本方法所需的硬件架构中设有一个管理节点作为服务器,目标节点通过通用以太网交换机与管理节点连接,目标节点上需要有支持预引导执行环境(Preboot Execution Environment)和网络唤醒(Wakeup OnLAN)的网卡和主板,目标节点作为该结构中的客户机通过通用以太网交换机与管理节点连接;The technical solution for realizing the above object is: the method of the present invention is based on the client/server architecture; the node software installation adopts an image-based mode; the image is a complete system, containing the operating system and application software that need to be installed on the target node All images are stored on the server; the required hardware architecture of this method is provided with a management node as a server, the target node is connected with the management node through a general Ethernet switch, and the target node needs to support the pre-boot execution environment (Preboot Execution Environment) and Wakeup OnLAN (Wakeup OnLAN) network card and motherboard, the target node is connected to the management node as a client in this structure through a general Ethernet switch;

管理节点作为体系结构中的服务器,包括预引导执行环境(Preboot eXecutionEnvironment)协议的服务端和用于映像传输的远程复制工具的服务器端;管理节点负责创建映像、存储目标节点信息、网络唤醒目标节点以及提供部署过程所需要的服务。在目标节点内存中运行体系结构中的客户端,作为部署代理,负责从管理节点上下载映像并安装到本地的硬盘上。部署代理的运行独立于目标节点硬盘,部署代理包括用于映像传输的远程复制工具的客户端;目标节点的启动过程遵守Intel的预引导执行环境PXE协议;根据PXE协议,管理节点上提供动态主机配置协议DHCP服务、简单文件传送协议TFTP(Trivial File Transfer Protocol,)服务。为了进行映像传输,管理节点上还需要运行远程复制工具的服务器端。As a server in the architecture, the management node includes the server side of the Preboot eXecutionEnvironment protocol and the server side of the remote replication tool for image transmission; the management node is responsible for creating images, storing target node information, and waking up the target node from the network And provide the services needed for the deployment process. The client in the architecture runs in the memory of the target node, as a deployment agent, responsible for downloading the image from the management node and installing it on the local hard disk. The operation of the deployment agent is independent of the hard disk of the target node. The deployment agent includes the client of the remote copy tool for image transmission; the startup process of the target node complies with Intel's pre-boot execution environment PXE protocol; according to the PXE protocol, a dynamic host is provided on the management node Configuration protocol DHCP service, simple file transfer protocol TFTP (Trivial File Transfer Protocol,) service. For image transfer, the server side of the remote replication tool also needs to be running on the management node.

本方法的具体实现步骤是:The specific implementation steps of this method are:

步骤1:获得映像:Step 1: Get the image:

可以通过两种方式来获得映像:Images can be obtained in two ways:

一种是通过安装包来创建映像,即把安装光盘中操作系统和应用软件的安装包复制到管理节点的一个目录中,用安装包直接在管理节点上创建虚拟映像;One is to create an image through the installation package, that is, copy the installation package of the operating system and application software in the installation CD to a directory of the management node, and use the installation package to directly create a virtual image on the management node;

另一种是从一个源结点捕捉映像,即首先手动的安装好一个节点,作为映像的源节点,源结点上安装了操作系统和需要的各种应用软件,然后从管理节点捕捉源节点的映像并存储。这种方式适合目标节点与源节点是同构的情况The other is to capture the image from a source node, that is, first manually install a node as the source node of the image, the operating system and various application software required are installed on the source node, and then capture the source node from the management node image and store it. This method is suitable for the case where the target node and the source node are isomorphic

步骤2:绑定映像与目标节点:Step 2: Bind the image with the target node:

把目标节点的网络标识信息与映像绑定,确定映像与目标节点的对应关系。其中一个映像可以对应多个目标节点,但是一个目标节点只能对应唯一的映像。Bind the network identification information of the target node with the image, and determine the corresponding relationship between the image and the target node. One image can correspond to multiple target nodes, but a target node can only correspond to a unique image.

步骤3:安装目标节点:Step 3: Install the target node:

在进行部署的时候,首先在管理节点上启动DHCP(动态主机配置协议)服务,为目标节点提供网络标识信息;启动TFTP(简单文件传送协议)服务,为目标节点下载部署代理程序提供服务;启动在管理节点与目标节点之间传送映像的远程复制工具的的服务器端;When deploying, first start the DHCP (Dynamic Host Configuration Protocol) service on the management node to provide network identification information for the target node; start the TFTP (Trivial File Transfer Protocol) service to provide services for the target node to download the deployment agent; start The server side of the remote replication tool that transfers the image between the management node and the target node;

管理节点通过网络唤醒技术(Wakeup On LAN)启动目标节点;目标节点通过网卡启动后,先根据动态主机配置协议DHCP从管理节点获得网络标识,然后根据简单文件传送协议TFTP下载部署代理并在本机内存中运行;部署代理启动后,首先对本机硬盘进行分区,然后使用远程复制工具从管理节点上下载映像到本机硬盘,在本地完成安装,安装完成后,节点自动重启(见图3);The management node starts the target node through Wakeup On LAN technology; after the target node is started through the network card, it first obtains the network identifier from the management node according to the dynamic host configuration protocol DHCP, and then downloads and deploys the agent according to the simple file transfer protocol TFTP and uploads it on the local machine. Run in memory; after the deployment agent is started, first partition the local hard disk, then use the remote copy tool to download the image from the management node to the local hard disk, and complete the installation locally. After the installation is completed, the node will automatically restart (see Figure 3);

步骤4:配置目标节点:Step 4: Configure the target node:

目标节点安装完毕首次启动时,运行配置程序对目标节点的操作系统和应用软件进行配置;经过配置后,目标节点即成为一个可以立即使用的集群节点;When the target node is installed and started for the first time, run the configuration program to configure the operating system and application software of the target node; after configuration, the target node becomes a cluster node that can be used immediately;

步骤5:测试:Step 5: Test:

集群所有的节点安装完毕,运行测试程序,测试操作系统和应用软件的安装配置是否正确。After all the nodes in the cluster are installed, run the test program to test whether the installation and configuration of the operating system and application software are correct.

本发明方法的优点是:The advantage of the inventive method is:

A.对用户提供了单一控制点,用户只需要通过终端连接到管理节点上,所有的控制操作都在管理节点上完成。目标节点只需要在BIOS中设置为通过网卡启动,其软件安装过程是完全自动化的;目标节点不需要键盘、鼠标、显示器、光驱、软驱等外围设备。A. A single control point is provided for the user. The user only needs to connect to the management node through the terminal, and all control operations are completed on the management node. The target node only needs to be set in the BIOS to start through the network card, and its software installation process is completely automatic; the target node does not need peripheral devices such as keyboard, mouse, monitor, CD-ROM, and floppy drive.

B.可以同时安装多个节点和不同的映像。通过映像和节点之间的绑定,确定了映像和节点之间的对应关系,可以同时安装多个节点,相互之间不会干扰。B. Multiple nodes and different images can be installed at the same time. Through the binding between the image and the node, the corresponding relationship between the image and the node is determined, and multiple nodes can be installed at the same time without interfering with each other.

C.集群系统节点操作系统和应用软件包安装配置完全自动化,目标节点完全不需要人工干预。用本方法可以批量安装节点,适合于安装大规模集群系统。C. The cluster system node operating system and application software package installation and configuration are fully automated, and the target node does not require manual intervention at all. Nodes can be installed in batches with this method, which is suitable for installing large-scale cluster systems.

                              附图说明Description of drawings

图1:本发明方法采用的硬件架构示意图。Fig. 1: a schematic diagram of the hardware architecture adopted by the method of the present invention.

图2:根据本方法进行集群部署的主要步骤流程图。Figure 2: Flowchart of the main steps of cluster deployment according to the method.

图3:用本发明方法安装一个目标节点的详细过程示意图。Fig. 3: A schematic diagram of the detailed process of installing a target node with the method of the present invention.

以下结合附图和实施例对本发明作进一步的详细描述。The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments.

                              具体实施方式 Detailed ways

图1为本方法所采用的硬件架构,“switch”代表通用以太网交换机,“DHCP”代表“动态主机配置协议”,“TFTP”代表“简单文件传送协议”。其中有一个管理节点作为服务器,提供DHCP服务、TFTP服务和映像传输服务,目标节点通过交换机与管理节点相连,从管理节点上下载映像完成安装。Figure 1 shows the hardware architecture used in this method, "switch" represents a general Ethernet switch, "DHCP" represents "Dynamic Host Configuration Protocol", and "TFTP" represents "Trivial File Transfer Protocol". One of the management nodes acts as a server, providing DHCP service, TFTP service and image transmission service. The target node is connected to the management node through a switch, and the image is downloaded from the management node to complete the installation.

图2根据本方法进行集群部署的主要步骤流程图,表示用本方法进行集群部署的主要步骤。Fig. 2 is a flow chart of main steps of cluster deployment according to this method, showing the main steps of cluster deployment by this method.

图3中“bootstrap”的确切含义是引导程序。表示了一个目标节点的详细安装过程,也就是PXE协议的工作过程,图中的映像服务器和DHCP服务器都在管理节点上。The exact meaning of "bootstrap" in Figure 3 is bootstrap. It shows the detailed installation process of a target node, that is, the working process of the PXE protocol. The image server and DHCP server in the figure are both on the management node.

本方法在具体实施的时候,管理节点和目标节点都可以选用普通的商用计算机,管理节点要求必须有键盘、鼠标、光驱、软驱和监视器等输入输出设备,目标节点则不需要这些输入输出设备。管理节点要求有较大的硬盘(大于80G)用来存储映像。要求目标节点的网卡能支持PXE和Wakeup On LAN,只有这样才能从服务器启动目标节点,实现目标节点安装过程的全部自动化。目前一般的商用计算机硬件都能支持这两个功能。When this method is actually implemented, both the management node and the target node can use ordinary commercial computers. The management node requires input and output devices such as keyboard, mouse, optical drive, floppy drive and monitor, and the target node does not need these input and output devices. . The management node requires a larger hard disk (greater than 80G) to store images. The network card of the target node is required to support PXE and Wakeup On LAN, only in this way can the target node be started from the server, and the installation process of the target node can be fully automated. At present, general commercial computer hardware can support these two functions.

目标节点和管理节点之间通过以太网连接,由于映像文件通常比较大,所以网络带宽对本方法的效率影响很大,要求以太网的带宽是至少是100Mbps,网络介质可以是双绞线和光纤。连接方式可以通过100M或1000M通用交换机。采用星型或点对点的网络模型,所有目标节点都可以与管理节点直接连通。The target node and the management node are connected by Ethernet. Because the image file is usually relatively large, the network bandwidth has a great influence on the efficiency of the method. The bandwidth of the Ethernet is required to be at least 100 Mbps, and the network medium can be twisted pair and optical fiber. The connection method can be through 100M or 1000M general switch. Using a star or point-to-point network model, all target nodes can be directly connected to the management node.

在管理节点和目标节点之间传送映像的时候,需要选择一个合适的远程复制工具,这个复制工具可以根据实际情况选择,跟映像的操作系统有关,这个复制工具需要能够对映像的文件系统进行读写控制。例如,在Linux操作系统中,可以选用远程同步的软件rsync作为远程复制的工具。When transferring the image between the management node and the target node, it is necessary to select a suitable remote copy tool. This copy tool can be selected according to the actual situation, and it is related to the operating system of the image. This copy tool needs to be able to read the file system of the image. write control. For example, in the Linux operating system, the remote synchronization software rsync can be selected as a remote replication tool.

在本方法中,部署代理是一个功能完善的微型操作系统,部署代理中包括远程复制工具的客户端、必要的硬盘和网卡驱动程序、对硬盘进行操作的工具。在具体实施的时候,可以通过定制Linux来得到。部署代理需要在目标节点的内存中运行,并且能够识别本机的硬盘和通过网络与服务器通信,所以需要根据目标节点的不同硬件,编译新的Linux内核,加入硬件的驱动程序。制作与新内核对应的根文件系统映像RAMDISK文件,在其中加入必要的对硬盘进行操作的工具、远程复制工具(例如rsync)的客户端、驱动程序模块(例如SCSI驱动模块)等。In this method, the deployment agent is a micro-operating system with complete functions, and the deployment agent includes the client of the remote replication tool, necessary hard disk and network card drivers, and tools for operating the hard disk. In actual implementation, it can be obtained by customizing Linux. The deployment agent needs to run in the memory of the target node, and be able to recognize the local hard disk and communicate with the server through the network, so it is necessary to compile a new Linux kernel and add hardware drivers according to the different hardware of the target node. Make the root file system image RAMDISK file corresponding to the new kernel, and add necessary tools for operating the hard disk, clients of remote copy tools (such as rsync), driver modules (such as SCSI driver modules), etc.

具体步骤如下:Specific steps are as follows:

步骤1:获得映像Step 1: Get the Image

获得映像的方式有两种,一种是在管理节点上创建映像,另一种是从某个源节点上捕捉映像,下面分别介绍这两种方式的具体步骤。There are two ways to obtain an image. One is to create an image on the management node, and the other is to capture an image from a source node. The specific steps of these two methods are introduced below.

创建映像就是使用一些资源在管理节点上生成映像,这些资源包括操作系统内核、应用软件包、磁盘分区文件等,还需要根据对映像做一些配置,也就是说写需要一些配置的脚本。然后把这些资源按照一定的规则进行安装重组和整合,组成一个相对独立的系统,该系统就是一个映像。Creating an image is to use some resources to generate an image on the management node. These resources include the operating system kernel, application software packages, disk partition files, etc. It is also necessary to do some configuration on the image, that is to say, to write scripts that require some configuration. Then install, reorganize and integrate these resources according to certain rules to form a relatively independent system, which is an image.

创建映像的步骤一般如下:The general steps to create an image are as follows:

1)敲入映像桩1) Knock in the image pile

桩是这个映像的基础,这需要创建一个目录结构,然后加载当前系统的一些必备的文件系统。从某种程度上来说,创建映像是对管理结点操作系统的复制,不过只是复制了管理结点操作系统本身的很少的文件,使得以后的创建有了“桩”。The stub is the base of the image, which requires creating a directory structure and then loading some of the necessary filesystems for the current system. To some extent, creating an image is a copy of the management node operating system, but only a few files of the management node operating system itself are copied, so that there are "stubs" for future creation.

2)安装系统包2) Install the system package

创建“桩”后,就能以“桩”为基础,也就是把根目录改变到桩上,安装系统需要的包,成为一个与原文件系统无关,相对独立的文件系统,这就有了映像文件系统的原型。After creating the "pile", you can use the "pile" as the basis, that is, change the root directory to the pile, install the packages required by the system, and become a relatively independent file system that has nothing to do with the original file system. This has an image Prototype of the file system.

3)配置映像3) Configuration image

第二步虽然有了映像,可是该映像还不能使用,还需要对该映像进行一些配置,如磁盘分区等,这样的映像才能使用。当然,这种配置可以更改,基于文件模式的映像提供了更大的灵活性。Although there is an image in the second step, the image cannot be used yet, and some configurations, such as disk partitions, need to be performed on the image before such an image can be used. Of course, this configuration can be changed, and file-mode-based images provide more flexibility.

捕捉映像的过程类似于拍摄照片的工作方式,首先手动的安装一个源节点,在源节点上安装操作系统和必须的应用软件包,然后从管理结点上不失真的获取源结点的一个映像,获取过程实际上就是一个复制的过程,可以使用rsync等远程复制工具来完成。捕捉映像的步骤如下:The process of capturing an image is similar to the way of taking photos. First, manually install a source node, install the operating system and necessary application software packages on the source node, and then obtain an image of the source node from the management node without distortion. , the acquisition process is actually a replication process, which can be completed using remote replication tools such as rsync. The steps to capture an image are as follows:

1)使系统处于静止状态便于获取有效的快照1) Put the system in a quiescent state for valid snapshots

一个正在运行的操作系统需要维护其工作状态,在内存中和硬盘中都保存着工作状态的数据。磁盘快照不能完全克隆硬盘系统的状态。因此,对源节点系统进行映像捕捉时,操作系统必须要作适当的调整便于映像捕捉过程的顺利实施。例如,有时需要临时删除一些识别信息,如日志文件、主机名、IP地址等。这样才能使得获取的映像是一个相对中性的文件,方便移植到其他节点中。A running operating system needs to maintain its working state, and the data of the working state is saved in the memory and the hard disk. Disk snapshots cannot completely clone the state of the hard disk system. Therefore, when performing image capture on the source node system, the operating system must be properly adjusted to facilitate the smooth implementation of the image capture process. For example, sometimes it is necessary to temporarily delete some identifying information, such as log files, hostnames, IP addresses, etc. Only in this way can the obtained image be a relatively neutral file, which is convenient for porting to other nodes.

2)捕捉一个不失真的映像2) Capture an undistorted image

首先要检测源节点是否可以捕获映像,然后启动映像捕捉代理,读取源节点磁盘中的文件,把他们存储到管理节点的硬盘中。First, check whether the source node can capture the image, and then start the image capture agent, read the files in the disk of the source node, and store them in the hard disk of the management node.

3)配置映像3) Configuration image

对得到的映像进行必要的配置,与创建映像方式不同,捕捉映像的同时也把源节点的磁盘分区信息复制过来了,因此不需要配置磁盘分区。这里的配置主要是对映像中应用软件和操作系统的配置。Perform the necessary configuration on the obtained image. Different from the image creation method, the disk partition information of the source node is also copied when capturing the image, so there is no need to configure the disk partition. The configuration here is mainly the configuration of the application software and operating system in the image.

步骤2:绑定映像与目标节点Step 2: Bind the image to the target node

把目标节点的IP地址、主机名、MAC地址等网络信息与映像绑定,确定映像与目标节点的对应关系。需要建立一个小型数据库来存储节点与映像的对应关系。其中,映像与节点是一对多的关系,即一个映像可以对于多个节点,而一个节点只能对应一个确定的映像。Bind network information such as the IP address, host name, and MAC address of the target node with the image, and determine the corresponding relationship between the image and the target node. A small database needs to be established to store the correspondence between nodes and images. Among them, the image and the node have a one-to-many relationship, that is, one image can correspond to multiple nodes, and one node can only correspond to one certain image.

步骤3:安装目标节点Step 3: Install the target node

在进行部署的时候,需要启动的服务包括DHCP服务、TFTP服务、rsync服务等。DHCP服务器为目标节点提供IP地址,主机名等网络信息,并指定TFTP服务器的位置,通知目标节点到指定的位置去下载启动引导文件。During deployment, services that need to be started include DHCP service, TFTP service, rsync service, etc. The DHCP server provides network information such as IP address and host name for the target node, and specifies the location of the TFTP server, and notifies the target node to download the startup boot file from the specified location.

TFTP服务器为目标节点下载部署代理程序提供服务,在TFTP服务器的根目录中存储了启动引导文件、启动配置文件、Linux内核和根文件系统等。目标节点下载这些文件到本机内存中加载执行。The TFTP server provides services for the target node to download and deploy the agent program, and the boot boot file, startup configuration file, Linux kernel and root file system are stored in the root directory of the TFTP server. The target node downloads these files to the local memory for loading and execution.

rsync服务负责管理所有的映像,为管理节点与目标节点间传送映像提供服务。在管理节点上rsync是以守护进程的方式来运行。目标节点启动之后,根据本身的主机名等网络标识信息从服务器上找到本机对应的映像,然后通过rsync从管理节点上把映像复制到本机硬盘。The rsync service is responsible for managing all images and providing services for transferring images between the management node and the target node. rsync runs as a daemon process on the management node. After the target node starts, it finds the image corresponding to the local machine from the server according to its own host name and other network identification information, and then copies the image from the management node to the local hard disk through rsync.

在这一步中首先需要生成DHCP服务的配置文件,配置文件是根据管理节点数据库中存储的目标节点IP地址、主机名和MAC地址等网络信息来生成的。然后在管理节点上启动DHCP服务、TFTP服务和rsync服务。In this step, the configuration file of the DHCP service needs to be generated first, and the configuration file is generated according to the network information such as the target node IP address, host name, and MAC address stored in the management node database. Then start the DHCP service, TFTP service, and rsync service on the management node.

目标节点需要与电源接通,并在BIOS中设置为通过网卡启动。在管理节点上启动各项服务之后,从管理节点发送包含目标节点MAC地址的广播包,通过网络唤醒(Wakeup OnLAN)技术启动目标节点。The target node needs to be connected to the power supply and set to boot through the network card in the BIOS. After starting various services on the management node, send a broadcast packet containing the MAC address of the target node from the management node, and start the target node through Wakeup OnLAN (Wakeup OnLAN) technology.

目标节点收到与本机MAC地址对应的广播包后通过网卡启动,首先从DHCP服务器根据本机的MAC地址获得与本机对应的网络标识信息,然后从TFTP服务器下载网卡启动引导程序和部署代理,部署代理在本机内存中加载启动之后,首先检测并挂载本地的硬盘,然后根据磁盘分区配置文件对本地硬盘进行分区。分区完成之后使用rsync等远程复制工具从服务器上下载与本机对应的映像并完整的复制到本地硬盘,安装完成后,节点自动重启。After receiving the broadcast packet corresponding to the local MAC address, the target node starts through the network card, first obtains the network identification information corresponding to the local machine from the DHCP server according to the local MAC address, and then downloads the network card startup boot program and deployment agent from the TFTP server , after the deployment agent loads and starts in the local memory, it first detects and mounts the local hard disk, and then partitions the local hard disk according to the disk partition configuration file. After the partition is completed, use a remote copy tool such as rsync to download the image corresponding to the local machine from the server and copy it completely to the local hard disk. After the installation is complete, the node will automatically restart.

步骤4:配置目标节点Step 4: Configure the target node

目标节点安装完毕首次启动时,运行配置程序对目标节点的操作系统和应用软件进行配置。因为集群系统是一个松耦和的多机系统,集群中个节点是相对独立的,每个节点都有独立的操作系统和应用软件。为了让集群中的节点能够相互写作对外界提供服务器,就必须对节点进行配置。When the target node is installed and started for the first time, run the configuration program to configure the operating system and application software of the target node. Because the cluster system is a loosely coupled and multi-machine system, each node in the cluster is relatively independent, and each node has an independent operating system and application software. In order for the nodes in the cluster to be able to write to each other and provide servers to the outside world, the nodes must be configured.

在Linux操作系统中,配置过程就是执行一系列的脚本,每个应用软件都有相应的配置脚本,这些脚本在获得映像这一步中就已经写入映像中了。由于每种应用软件的配置方法都不同,所以要根据具体的应用软件来写出配置脚本并存入映像中。在节点首次启动时,调用这些脚本,完成对应用软件的配置。对于Windows操作系统,软件的配置需要调用一些配置工具,用批处理的方式来实现。In the Linux operating system, the configuration process is to execute a series of scripts, and each application software has a corresponding configuration script, and these scripts have been written into the image during the step of obtaining the image. Since the configuration methods of each application software are different, it is necessary to write a configuration script and store it in the image according to the specific application software. When the node starts for the first time, these scripts are invoked to complete the configuration of the application software. For the Windows operating system, the configuration of the software needs to call some configuration tools, which is implemented in a batch manner.

经过配置之后,目标节点就是一个可以立即使用的集群节点了。After configuration, the target node is a cluster node that can be used immediately.

步骤5:测试:集群所有的节点安装完毕,运行测试程序,测试操作系统和应用软件的安装配置是否正确。测试程序是在对映像进行配置的时候写入映像中的,是对应用软件的基本功能的测试,例如Linux系统下的并行计算软件MPICH.在完成对软件的配置之后,可以运行一个用MPI编写的小程序,检验运行结果是否正确。Step 5: Test: After all the nodes in the cluster are installed, run the test program to test whether the installation and configuration of the operating system and application software are correct. The test program is written into the image when configuring the image. It is a test of the basic functions of the application software, such as the parallel computing software MPICH under the Linux system. After completing the configuration of the software, you can run a program written in MPI A small program to check whether the running result is correct.

该测试是与应用密切相关的,需要根据不同的应用程序来设计测试程序。由于映像是可以配置的,测试程序可以预先写入映像中。The test is closely related to the application, and the test program needs to be designed according to different application programs. Since the image is configurable, test programs can be pre-written into the image.

由于本方法采用的是基于客户机/服务器(Client/Server)结构,映像都存储在管理节点上,所有的目标节点并行的从管理节点上下载映像。所以在可靠传输假设和公平共享带宽的前提下,可以根据网络带宽来推算集群部署所需要的时间。根据集群部署过程可以得出下面的计算部署时间的公式:Since the method adopts a client/server (Client/Server) structure, images are all stored on the management node, and all target nodes download images from the management node in parallel. Therefore, under the premise of reliable transmission assumptions and fair bandwidth sharing, the time required for cluster deployment can be estimated based on the network bandwidth. According to the cluster deployment process, the following formula for calculating the deployment time can be obtained:

部署时间=传输映像准备时间+传输映像时间Deployment time = transfer image preparation time + transfer image time

        =传输映像准备时间+节点个数×映像大小/网络带宽= Transmission image preparation time + number of nodes × image size / network bandwidth

用字母表示为:T=C+n×S×8/W          (公式1)Expressed in letters: T=C+n×S×8/W (Formula 1)

其中T为部署时间;C为传输映像准备时间,也就是从目标节点网卡启动到映像部署代理在内存中加载运行这段时间,可以认为是一个常量;n为同时部署的目标节点个数;S为映像大小(MB);W为网络带宽(Mb/s)。Among them, T is the deployment time; C is the transfer image preparation time, that is, the time from the start of the network card of the target node to the loading and running of the image deployment agent in memory, which can be considered as a constant; n is the number of target nodes deployed at the same time; S is the image size (MB); W is the network bandwidth (Mb/s).

在山东省高性能计算中心的集群搭建过程中,使用了根据本发明方法设计的集群部署软件,该集群的硬件环境如下:In the cluster building process of the Shandong Provincial High Performance Computing Center, the cluster deployment software designed according to the method of the present invention is used. The hardware environment of the cluster is as follows:

管理节点是浪潮英信NF260(2个Xeon 2.4GHz CPU,内存4GB,SCSI Ultra320,1000Mb网卡)。目标节点是浪潮NF170,内存为2G,IDE硬盘,100Mb网卡,无光驱、软驱、键盘和显示终端。主干网用千兆交换机通过光纤连接,管理节点接到主干网上,96个目标节点分成3个子网,子网通过双绞线连接到百兆交换机,再通过百兆交换机连接到主干网。The management node is Inspur Yingxin NF260 (2 x Xeon 2.4GHz CPU, memory 4GB, SCSI Ultra320, 1000Mb network card). The target node is Inspur NF170 with 2G memory, IDE hard disk, 100Mb network card, no CD-ROM drive, floppy drive, keyboard and display terminal. The backbone network is connected by a Gigabit switch through optical fiber, the management node is connected to the backbone network, and 96 target nodes are divided into 3 subnets, and the subnet is connected to a 100M switch through a twisted pair, and then connected to the backbone network through a 100M switch.

映像获取采用映像捕捉方式,获得的映像大小为3.7G。通过网络唤醒87个节点同时开机(有9个节点因为设置原因没有启动成功),从网卡启动进行软件部署。最终完成部署所需要的时间是48分钟。部署过程中检测得到网络的平均传输速率为990Mb。映像传输准备时间是150秒,则根据公式1的计算部署时间T=150+3700×8×87/990=2751秒=46分钟。该时间与实验统计时间基本吻合,证明公式1是正确的。The image acquisition adopts the image capture method, and the obtained image size is 3.7G. Wake up 87 nodes through the network and start up at the same time (9 nodes did not start successfully due to setting reasons), and start from the network card for software deployment. The time it took to finally complete the deployment was 48 minutes. The average transmission rate of the network detected during deployment is 990Mb. The preparation time for image transmission is 150 seconds, then the deployment time calculated according to Formula 1 is T=150+3700×8×87/990=2751 seconds=46 minutes. This time basically coincides with the experimental statistical time, which proves that Formula 1 is correct.

从公式1可以看到,部署所需要的时间与同时安装的目标节点数目成线性关系,而该线性关系的系数是由映像的大小和网络带宽决定的。映像越大,系数越大,部署所需要的时间越长;网络带宽越大,系数越小,部署所需要的时间越短。It can be seen from formula 1 that the time required for deployment has a linear relationship with the number of target nodes installed at the same time, and the coefficient of this linear relationship is determined by the size of the image and the network bandwidth. The larger the image, the larger the coefficient, and the longer the deployment time; the larger the network bandwidth, the smaller the coefficient, and the shorter the deployment time.

Claims (4)

1、一种大规模计算机集群系统节点的自动化快速部署方法,其特征在于,该方法基于客户机/服务器体系结构,节点软件安装采用基于映像的方式,映像是一个完整的系统,含有需要在目标节点上安装的操作系统和应用软件包;所有映像都存储在服务器上;本方法所需的硬件架构中设有一个管理节点作为服务器,目标节点通过通用以太网交换机与管理节点连接,目标节点上需要有支持预引导执行环境和网络唤醒的网卡和主板,目标节点作为该结构中的客户机通过通用以太网交换机与管理节点连接;1. An automated rapid deployment method for nodes of a large-scale computer cluster system, characterized in that the method is based on the client/server architecture, and the node software installation adopts an image-based method, and the image is a complete system that contains the The operating system and application software package installed on the node; all images are stored on the server; the hardware architecture required by this method is provided with a management node as a server, and the target node is connected to the management node through a general Ethernet switch. A network card and motherboard supporting pre-boot execution environment and wake-up on LAN are required, and the target node is connected to the management node as a client in the structure through a general-purpose Ethernet switch; 1)、管理节点是体系结构中的服务器端,包括用于映像传输的服务器端和预引导执行环境协议的服务端,负责创建映像、管理映像、存储目标节点信息、网络唤醒目标节点以及提供部署过程所需要的各种服务;1), the management node is the server side in the architecture, including the server side for image transmission and the server side of the pre-boot execution environment protocol, responsible for creating images, managing images, storing target node information, waking up target nodes from the network, and providing deployment Various services required by the process; 2)、目标节点是体系结构中的客户端,从管理节点获得部署代理,部署代理负责从作为服务器的管理节点上下载映像并安装到本地的硬盘上;2), the target node is the client in the architecture, obtains the deployment agent from the management node, and the deployment agent is responsible for downloading the image from the management node as the server and installing it on the local hard disk; 3)、目标节点的启动过程遵守Intel的预引导执行环境协议,在管理节点上提供动态主机配置协议DHCP、简单文件传送协议TFTP服务和用于传输映像的服务;3), the start-up process of the target node complies with Intel's pre-boot execution environment protocol, and provides dynamic host configuration protocol DHCP, simple file transfer protocol TFTP services and services for transferring images on the management node; 4)、在目标节点与管理节点之间进行映像传输的时候,使用不同的基于客户机/服务器体系结构的远程复制工具。4) When image transmission is performed between the target node and the management node, different remote replication tools based on client/server architecture are used. 2、如权利要求1所述的方法,其特征在于,所述的集群系统节点自动化快速部署方法的具体步骤是:2. The method according to claim 1, wherein the specific steps of the automated rapid deployment method for cluster system nodes are: 步骤1:获得映像Step 1: Get the Image 通过下述两种方式来获得映像:There are two ways to obtain an image: 第一种是通过安装包来创建映像,即把安装光盘中操作系统和应用软件的安装包复制到管理节点的一个目录中,用安装包直接在管理节点上创建虚拟映像;The first is to create an image through the installation package, that is, copy the installation package of the operating system and application software in the installation CD to a directory of the management node, and use the installation package to directly create a virtual image on the management node; 第二种是从一个源结点捕捉映像,即首先手动的安装好一个节点,作为映像的源节点,源结点上安装了操作系统和需要的各种应用软件,然后从管理节点捕捉源节点的映像并存储,适合目标节点与源节点是同构的情况;The second is to capture the image from a source node, that is, first manually install a node as the source node of the image, the operating system and various application software required are installed on the source node, and then capture the source node from the management node The image of the image is stored, which is suitable for the case where the target node and the source node are isomorphic; 步骤2:绑定映像与目标节点Step 2: Bind the image to the target node 把目标节点的网络标识信息与映像绑定,确定映像与目标节点的对应关系,其中一个映像对应多个目标节点,但是一个目标节点只能对应唯一的映像;Bind the network identification information of the target node with the image, and determine the corresponding relationship between the image and the target node. One image corresponds to multiple target nodes, but one target node can only correspond to a unique image; 步骤3:安装目标节点Step 3: Install the target node 在进行部署的时候,首先在管理节点上启动DHCP服务,为目标节点提供网络标识信息;启动TFTP服务,为目标节点下载部署代理程序提供服务;启动在管理节点与目标节点之间传送映像的远程复制工具的服务器端;When deploying, first start the DHCP service on the management node to provide network identification information for the target node; start the TFTP service to provide services for the target node to download the deployment agent; start the remote image transmission between the management node and the target node The server side of the replication tool; 管理节点通过Wakeup On LAN启动目标节点;目标节点通过网卡启动后,先根据动态主机配置协议DHCP从管理节点获得网络标识,然后根据简单文件传送协议TFTP下载部署代理并在本机内存中运行;部署代理从管理节点上下载映像,在本地完成安装,安装完成后,节点自动重启;The management node starts the target node through Wakeup On LAN; after the target node is started through the network card, it first obtains the network identifier from the management node according to the dynamic host configuration protocol DHCP, and then downloads the deployment agent according to the simple file transfer protocol TFTP and runs it in the local memory; deployment The agent downloads the image from the management node and completes the installation locally. After the installation is complete, the node restarts automatically; 步骤4:配置目标节点Step 4: Configure the target node 目标节点安装完毕首次启动时,运行配置程序对目标节点的操作系统和应用软件进行配置;经过配置后,目标节点即成为一个立即使用的集群节点;When the target node is installed and started for the first time, run the configuration program to configure the operating system and application software of the target node; after configuration, the target node becomes a cluster node for immediate use; 步骤5:测试Step 5: Test 集群系统所有的节点安装完毕,运行测试程序,测试操作系统和应用软件的安装配置是否正确。After all the nodes of the cluster system are installed, run the test program to test whether the installation and configuration of the operating system and application software are correct. 3、如权利要求1或2所述的方法,其特征在于:所述的目标节点在BIOS中设置为通过网卡启动,其软件安装过程完全自动化;目标节点不需要键盘、鼠标、显示器、光驱、软驱等外围设备;所有的操作都在管理节点上完成。3. The method according to claim 1 or 2, characterized in that: the target node is set in the BIOS to start through the network card, and its software installation process is fully automated; the target node does not need a keyboard, mouse, display, CD-ROM, Peripheral devices such as floppy drives; all operations are completed on the management node. 4、如权利要求1或2所述的方法,其特征在于:所述部署代理采用通过手动定制,它是一个完善的微型操作系统,在目标节点内存中运行,独立于目标节点硬盘;部署代理中包括有远程复制工具的客户端、必要的硬盘和网卡驱动程序、对硬盘进行操作的工具。4. The method according to claim 1 or 2, characterized in that: the deployment agent adopts manual customization, it is a perfect micro-operating system, runs in the memory of the target node, and is independent of the hard disk of the target node; the deployment agent It includes the client of the remote copy tool, the necessary hard disk and network card drivers, and the tools for operating the hard disk.
CN 200510043160 2005-08-30 2005-08-30 Automated quick deployment method for large-scale computer cluster node Pending CN1731738A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200510043160 CN1731738A (en) 2005-08-30 2005-08-30 Automated quick deployment method for large-scale computer cluster node

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200510043160 CN1731738A (en) 2005-08-30 2005-08-30 Automated quick deployment method for large-scale computer cluster node

Publications (1)

Publication Number Publication Date
CN1731738A true CN1731738A (en) 2006-02-08

Family

ID=35964046

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200510043160 Pending CN1731738A (en) 2005-08-30 2005-08-30 Automated quick deployment method for large-scale computer cluster node

Country Status (1)

Country Link
CN (1) CN1731738A (en)

Cited By (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100451981C (en) * 2006-07-21 2009-01-14 华为技术有限公司 How to modify computer configuration information
CN101188526B (en) * 2007-12-18 2010-05-19 华南理工大学 Intelligent resource management method for wireless ultra-broadband dynamic connection cluster computer
CN101207565B (en) * 2006-12-22 2010-05-19 英业达股份有限公司 Network communication method
CN101963909A (en) * 2009-07-24 2011-02-02 株式会社理光 Information processing apparatus, workflow system and workflow management method
CN102033847A (en) * 2010-10-25 2011-04-27 浪潮电子信息产业股份有限公司 A multi-node server system
CN102075358A (en) * 2010-12-31 2011-05-25 网宿科技股份有限公司 System and method for distributing and deploying content of large-scale server cluster
CN102169448A (en) * 2011-03-18 2011-08-31 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN101330514B (en) * 2008-07-15 2011-11-16 西安交通大学 Method for disposing micro-OS with self-adaption transmission model
CN101452390B (en) * 2007-11-28 2012-03-14 无锡江南计算技术研究所 Network boot method and system, and system boot method and apparatus
CN102437961A (en) * 2011-12-28 2012-05-02 成都市华为赛门铁克科技有限公司 Cluster system and management and control method thereof
CN102520989A (en) * 2011-12-01 2012-06-27 浪潮电子信息产业股份有限公司 Intelligent colony system deployment method based on U disk
CN102567044A (en) * 2010-11-19 2012-07-11 微软公司 Code deployment assistance
CN102591679A (en) * 2011-11-30 2012-07-18 中国科学院计算机网络信息中心 Quick application software arrangement method for cluster system
CN102609280A (en) * 2012-01-17 2012-07-25 浪潮(北京)电子信息产业有限公司 Network operation system deploying method and network operation system deploying device
CN102726025A (en) * 2011-11-29 2012-10-10 华为技术有限公司 Service package installation method and related device
CN102722391A (en) * 2012-06-07 2012-10-10 浪潮电子信息产业股份有限公司 Automatic integrated system configuration and software installation method
CN102736946A (en) * 2011-04-11 2012-10-17 阿里巴巴集团控股有限公司 Batch application node deploying method and device
CN102750203A (en) * 2012-06-07 2012-10-24 浪潮电子信息产业股份有限公司 Linux-based rapid deploying method
CN101820387B (en) * 2010-02-08 2012-12-12 北京航空航天大学 Method for rapidly deploying extensible cluster
CN102843397A (en) * 2011-06-23 2012-12-26 神讯电脑(昆山)有限公司 Distributed system and distributed method for automatically copying hard disk
CN102882799A (en) * 2012-09-13 2013-01-16 曙光信息产业(北京)有限公司 Cluster arrangement and collocation system and method capable of controlling flow
CN102959505A (en) * 2010-09-23 2013-03-06 英特尔公司 Cluster computing - NIC based OS provision
CN103019732A (en) * 2012-12-25 2013-04-03 深圳市天维尔通讯技术有限公司 Method and system for installation and deployment of integrated software based on province, city and county
CN103036934A (en) * 2012-09-13 2013-04-10 曙光信息产业(北京)有限公司 Wide area network clustered deployment system and method based on mirror image
CN103037002A (en) * 2012-12-21 2013-04-10 中标软件有限公司 Method and system for arranging server cluster in cloud computing cluster environment
CN103186403A (en) * 2011-12-28 2013-07-03 英业达股份有限公司 Node replacement processing method and server system using the method
CN103279374A (en) * 2013-06-09 2013-09-04 浪潮电子信息产业股份有限公司 Operating system distributing technology based on MAC addresses
CN103384258A (en) * 2012-05-04 2013-11-06 上海盛大网络发展有限公司 Novel host computer generating method and system
CN103458002A (en) * 2013-08-15 2013-12-18 中电长城网际系统应用有限公司 One-key deployment method and device for cloud system
CN103475694A (en) * 2013-08-20 2013-12-25 汉柏科技有限公司 Parallel configuration method of cloud computing system
CN103491131A (en) * 2013-08-09 2014-01-01 浪潮电子信息产业股份有限公司 Network operating system fast batch installing method based on P2P
CN103593322A (en) * 2013-11-11 2014-02-19 曙光信息产业(北京)有限公司 Configuration method and device for drive assemblies
CN103744695A (en) * 2013-12-25 2014-04-23 新浪网技术(中国)有限公司 Method and system for operating system remote installation on basis of PXE
CN104104720A (en) * 2014-07-10 2014-10-15 浪潮(北京)电子信息产业有限公司 Server cluster management method and system
CN104219099A (en) * 2014-09-25 2014-12-17 浪潮(北京)电子信息产业有限公司 Automatic server cluster deploying method and device
CN104468199A (en) * 2014-11-23 2015-03-25 国云科技股份有限公司 Distributed system deployment and management framework and operation method suitable for Linux
CN104618168A (en) * 2015-03-06 2015-05-13 浪潮集团有限公司 Automatic operation and maintenance method based on puppet
CN104780058A (en) * 2014-01-13 2015-07-15 联想(北京)有限公司 Electronic equipment configuration method and system
CN105117293A (en) * 2015-09-01 2015-12-02 茂名市群英网络有限公司 Windows system environment automatic deployment method and automatic deployment system
CN105119732A (en) * 2015-06-30 2015-12-02 中标软件有限公司 Device and method for distributing hosts in cluster environment
CN105159750A (en) * 2015-09-14 2015-12-16 浪潮集团有限公司 Virtual machine creation method and apparatus
CN101699401B (en) * 2009-10-20 2015-12-16 中兴通讯股份有限公司 The installation method of blade server and operating system thereof and system
CN105183520A (en) * 2015-09-21 2015-12-23 赵伟 Automatic remote installing and debugging method and system for computer software
CN105207815A (en) * 2015-09-01 2015-12-30 茂名市群英网络有限公司 System environment deployment method and deployment system based on server
CN105408864A (en) * 2013-07-29 2016-03-16 阿尔卡特朗讯 data processing
CN105487891A (en) * 2015-11-26 2016-04-13 航天恒星科技有限公司 Distributed cluster deploying method and system
CN103716188B (en) * 2013-12-20 2016-09-14 上海网达软件股份有限公司 The management of Linux server cluster management and maintaining method
CN103713924B (en) * 2013-12-24 2017-03-08 汉柏科技有限公司 The upgrade method and system of cloud computing platform service
CN106527982A (en) * 2016-10-25 2017-03-22 西安交通大学 Object distribution algorithm for object storage system consisting of heterogeneous storage devices
CN106547600A (en) * 2016-11-22 2017-03-29 郑州云海信息技术有限公司 A kind of remote auto installs the method and system of whole machine cabinet host operating system
CN106686142A (en) * 2017-03-07 2017-05-17 五道口科技(北京)股份有限公司 Integrated Linux server system and system image push method thereof based on Web
CN107229497A (en) * 2017-05-31 2017-10-03 武汉联影医疗科技有限公司 Server test method and server test device
CN108563454A (en) * 2018-03-30 2018-09-21 天津麒麟信息技术有限公司 A kind of operating system hard disk quick deployment method towards machine of soaring
CN109842642A (en) * 2017-11-24 2019-06-04 中兴通讯股份有限公司 A kind of dispositions method and device, containerization group system of containerization group system
CN109976897A (en) * 2017-12-27 2019-07-05 航天信息股份有限公司 A kind of dispositions method, data interactive method and the corresponding intrument of big data cluster
CN110392080A (en) * 2018-04-20 2019-10-29 杭州海康威视系统技术有限公司 A kind of dispositions method based on distributed system, device and distributed system
CN110532065A (en) * 2019-09-02 2019-12-03 广州市品高软件股份有限公司 A kind of dispositions method and device of bare metal server
CN111752576A (en) * 2020-05-14 2020-10-09 深圳震有科技股份有限公司 A 5G data-based forwarding system deployment method, terminal and storage medium
CN112256289A (en) * 2020-10-27 2021-01-22 北京月新时代科技股份有限公司 Automatic deployment method, device and equipment
CN112988323A (en) * 2019-12-02 2021-06-18 美商安迈国际有限责任公司 Server deployment method and system under virtualization framework
CN113507388A (en) * 2021-07-08 2021-10-15 上海浦东发展银行股份有限公司 Equipment deployment configuration method and device, electronic equipment and storage medium
CN113849487A (en) * 2021-12-02 2021-12-28 中关村科学城城市大脑股份有限公司 Database management method, device, equipment and medium
CN114500522A (en) * 2020-10-23 2022-05-13 慧与发展有限责任合伙企业 shared image install image stream
CN115004151A (en) * 2020-01-23 2022-09-02 日立数据管理有限公司 System and method for updating add-in card firmware and collecting hardware information on any server or bare metal server with any OS installed
CN115408016A (en) * 2022-09-01 2022-11-29 上海承蓝数据科技有限公司 Computing cluster deployment method and device, electronic equipment and storage medium
CN116192622A (en) * 2022-12-30 2023-05-30 深圳海智创科技有限公司 A cluster deployment method, system, device and storage medium
CN118260092A (en) * 2024-05-31 2024-06-28 电科云(北京)科技有限公司 Computing power distribution method and device, seed server and multi-element computing power service cluster

Cited By (85)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100451981C (en) * 2006-07-21 2009-01-14 华为技术有限公司 How to modify computer configuration information
CN101207565B (en) * 2006-12-22 2010-05-19 英业达股份有限公司 Network communication method
CN101452390B (en) * 2007-11-28 2012-03-14 无锡江南计算技术研究所 Network boot method and system, and system boot method and apparatus
CN101188526B (en) * 2007-12-18 2010-05-19 华南理工大学 Intelligent resource management method for wireless ultra-broadband dynamic connection cluster computer
CN101330514B (en) * 2008-07-15 2011-11-16 西安交通大学 Method for disposing micro-OS with self-adaption transmission model
CN101963909A (en) * 2009-07-24 2011-02-02 株式会社理光 Information processing apparatus, workflow system and workflow management method
CN101963909B (en) * 2009-07-24 2013-11-06 株式会社理光 Information processing apparatus, workflow system and workflow management method
CN101699401B (en) * 2009-10-20 2015-12-16 中兴通讯股份有限公司 The installation method of blade server and operating system thereof and system
CN101820387B (en) * 2010-02-08 2012-12-12 北京航空航天大学 Method for rapidly deploying extensible cluster
CN102959505A (en) * 2010-09-23 2013-03-06 英特尔公司 Cluster computing - NIC based OS provision
CN102959505B (en) * 2010-09-23 2017-02-08 英特尔公司 Cluster computing - NIC based OS provision
CN102033847A (en) * 2010-10-25 2011-04-27 浪潮电子信息产业股份有限公司 A multi-node server system
US9959103B2 (en) 2010-11-19 2018-05-01 Microsoft Technology Licensing, Llc Code deployment assistance
CN102567044A (en) * 2010-11-19 2012-07-11 微软公司 Code deployment assistance
CN102075358A (en) * 2010-12-31 2011-05-25 网宿科技股份有限公司 System and method for distributing and deploying content of large-scale server cluster
CN102075358B (en) * 2010-12-31 2013-10-30 网宿科技股份有限公司 System and method for distributing and deploying content of large-scale server cluster
CN102169448B (en) * 2011-03-18 2013-10-23 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN102169448A (en) * 2011-03-18 2011-08-31 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN102736946A (en) * 2011-04-11 2012-10-17 阿里巴巴集团控股有限公司 Batch application node deploying method and device
CN102736946B (en) * 2011-04-11 2015-12-16 阿里巴巴集团控股有限公司 A kind of batch dispositions method of application node and device
CN102843397A (en) * 2011-06-23 2012-12-26 神讯电脑(昆山)有限公司 Distributed system and distributed method for automatically copying hard disk
CN102726025B (en) * 2011-11-29 2015-07-08 华为技术有限公司 Service package installation method and related device
CN102726025A (en) * 2011-11-29 2012-10-10 华为技术有限公司 Service package installation method and related device
CN102591679B (en) * 2011-11-30 2014-09-17 中国科学院计算机网络信息中心 Quick application software arrangement method for cluster system
CN102591679A (en) * 2011-11-30 2012-07-18 中国科学院计算机网络信息中心 Quick application software arrangement method for cluster system
CN102520989A (en) * 2011-12-01 2012-06-27 浪潮电子信息产业股份有限公司 Intelligent colony system deployment method based on U disk
CN103186403A (en) * 2011-12-28 2013-07-03 英业达股份有限公司 Node replacement processing method and server system using the method
CN102437961A (en) * 2011-12-28 2012-05-02 成都市华为赛门铁克科技有限公司 Cluster system and management and control method thereof
CN102437961B (en) * 2011-12-28 2015-04-08 华为数字技术(成都)有限公司 Cluster system and management and control method thereof
CN102609280A (en) * 2012-01-17 2012-07-25 浪潮(北京)电子信息产业有限公司 Network operation system deploying method and network operation system deploying device
CN103384258B (en) * 2012-05-04 2019-05-24 上海盛大网络发展有限公司 Generate the method and system of new host
CN103384258A (en) * 2012-05-04 2013-11-06 上海盛大网络发展有限公司 Novel host computer generating method and system
CN102750203A (en) * 2012-06-07 2012-10-24 浪潮电子信息产业股份有限公司 Linux-based rapid deploying method
CN102722391A (en) * 2012-06-07 2012-10-10 浪潮电子信息产业股份有限公司 Automatic integrated system configuration and software installation method
CN103036934A (en) * 2012-09-13 2013-04-10 曙光信息产业(北京)有限公司 Wide area network clustered deployment system and method based on mirror image
CN102882799A (en) * 2012-09-13 2013-01-16 曙光信息产业(北京)有限公司 Cluster arrangement and collocation system and method capable of controlling flow
CN103037002A (en) * 2012-12-21 2013-04-10 中标软件有限公司 Method and system for arranging server cluster in cloud computing cluster environment
CN103019732A (en) * 2012-12-25 2013-04-03 深圳市天维尔通讯技术有限公司 Method and system for installation and deployment of integrated software based on province, city and county
CN103279374A (en) * 2013-06-09 2013-09-04 浪潮电子信息产业股份有限公司 Operating system distributing technology based on MAC addresses
CN105408864A (en) * 2013-07-29 2016-03-16 阿尔卡特朗讯 data processing
CN103491131A (en) * 2013-08-09 2014-01-01 浪潮电子信息产业股份有限公司 Network operating system fast batch installing method based on P2P
CN103458002A (en) * 2013-08-15 2013-12-18 中电长城网际系统应用有限公司 One-key deployment method and device for cloud system
CN103475694A (en) * 2013-08-20 2013-12-25 汉柏科技有限公司 Parallel configuration method of cloud computing system
CN103593322A (en) * 2013-11-11 2014-02-19 曙光信息产业(北京)有限公司 Configuration method and device for drive assemblies
CN103716188B (en) * 2013-12-20 2016-09-14 上海网达软件股份有限公司 The management of Linux server cluster management and maintaining method
CN103713924B (en) * 2013-12-24 2017-03-08 汉柏科技有限公司 The upgrade method and system of cloud computing platform service
CN103744695B (en) * 2013-12-25 2017-11-03 新浪网技术(中国)有限公司 Operating system remote installation method and system based on PXE
CN103744695A (en) * 2013-12-25 2014-04-23 新浪网技术(中国)有限公司 Method and system for operating system remote installation on basis of PXE
CN104780058A (en) * 2014-01-13 2015-07-15 联想(北京)有限公司 Electronic equipment configuration method and system
CN104104720A (en) * 2014-07-10 2014-10-15 浪潮(北京)电子信息产业有限公司 Server cluster management method and system
CN104219099A (en) * 2014-09-25 2014-12-17 浪潮(北京)电子信息产业有限公司 Automatic server cluster deploying method and device
CN104468199A (en) * 2014-11-23 2015-03-25 国云科技股份有限公司 Distributed system deployment and management framework and operation method suitable for Linux
CN104618168A (en) * 2015-03-06 2015-05-13 浪潮集团有限公司 Automatic operation and maintenance method based on puppet
CN105119732A (en) * 2015-06-30 2015-12-02 中标软件有限公司 Device and method for distributing hosts in cluster environment
CN105117293A (en) * 2015-09-01 2015-12-02 茂名市群英网络有限公司 Windows system environment automatic deployment method and automatic deployment system
CN105117293B (en) * 2015-09-01 2020-03-27 茂名市群英网络有限公司 Automatic deployment method and automatic deployment system for Windows system environment
CN105207815A (en) * 2015-09-01 2015-12-30 茂名市群英网络有限公司 System environment deployment method and deployment system based on server
CN105159750A (en) * 2015-09-14 2015-12-16 浪潮集团有限公司 Virtual machine creation method and apparatus
CN105183520A (en) * 2015-09-21 2015-12-23 赵伟 Automatic remote installing and debugging method and system for computer software
CN105183520B (en) * 2015-09-21 2019-01-15 赵伟 Computer software remote automation Method of Adjustment and system
CN105487891A (en) * 2015-11-26 2016-04-13 航天恒星科技有限公司 Distributed cluster deploying method and system
CN106527982B (en) * 2016-10-25 2019-04-12 西安交通大学 A kind of object distribution algorithm for the object storage system being made of heterogeneous storage devices
CN106527982A (en) * 2016-10-25 2017-03-22 西安交通大学 Object distribution algorithm for object storage system consisting of heterogeneous storage devices
CN106547600A (en) * 2016-11-22 2017-03-29 郑州云海信息技术有限公司 A kind of remote auto installs the method and system of whole machine cabinet host operating system
CN106686142A (en) * 2017-03-07 2017-05-17 五道口科技(北京)股份有限公司 Integrated Linux server system and system image push method thereof based on Web
CN107229497A (en) * 2017-05-31 2017-10-03 武汉联影医疗科技有限公司 Server test method and server test device
CN109842642B (en) * 2017-11-24 2022-11-01 中兴通讯股份有限公司 Deployment method and device of containerized cluster system and containerized cluster system
CN109842642A (en) * 2017-11-24 2019-06-04 中兴通讯股份有限公司 A kind of dispositions method and device, containerization group system of containerization group system
CN109976897A (en) * 2017-12-27 2019-07-05 航天信息股份有限公司 A kind of dispositions method, data interactive method and the corresponding intrument of big data cluster
CN109976897B (en) * 2017-12-27 2021-06-22 航天信息股份有限公司 Deployment method of big data cluster, data interaction method and corresponding device
CN108563454A (en) * 2018-03-30 2018-09-21 天津麒麟信息技术有限公司 A kind of operating system hard disk quick deployment method towards machine of soaring
CN110392080A (en) * 2018-04-20 2019-10-29 杭州海康威视系统技术有限公司 A kind of dispositions method based on distributed system, device and distributed system
CN110532065A (en) * 2019-09-02 2019-12-03 广州市品高软件股份有限公司 A kind of dispositions method and device of bare metal server
CN112988323B (en) * 2019-12-02 2024-02-06 美商安迈国际有限责任公司 Server deployment method and system under virtualization architecture
CN112988323A (en) * 2019-12-02 2021-06-18 美商安迈国际有限责任公司 Server deployment method and system under virtualization framework
CN115004151A (en) * 2020-01-23 2022-09-02 日立数据管理有限公司 System and method for updating add-in card firmware and collecting hardware information on any server or bare metal server with any OS installed
CN111752576A (en) * 2020-05-14 2020-10-09 深圳震有科技股份有限公司 A 5G data-based forwarding system deployment method, terminal and storage medium
CN114500522A (en) * 2020-10-23 2022-05-13 慧与发展有限责任合伙企业 shared image install image stream
CN114500522B (en) * 2020-10-23 2025-08-05 慧与发展有限责任合伙企业 Method for sharing image installation image stream, first computing node and storage medium
CN112256289A (en) * 2020-10-27 2021-01-22 北京月新时代科技股份有限公司 Automatic deployment method, device and equipment
CN113507388A (en) * 2021-07-08 2021-10-15 上海浦东发展银行股份有限公司 Equipment deployment configuration method and device, electronic equipment and storage medium
CN113849487A (en) * 2021-12-02 2021-12-28 中关村科学城城市大脑股份有限公司 Database management method, device, equipment and medium
CN115408016A (en) * 2022-09-01 2022-11-29 上海承蓝数据科技有限公司 Computing cluster deployment method and device, electronic equipment and storage medium
CN116192622A (en) * 2022-12-30 2023-05-30 深圳海智创科技有限公司 A cluster deployment method, system, device and storage medium
CN118260092A (en) * 2024-05-31 2024-06-28 电科云(北京)科技有限公司 Computing power distribution method and device, seed server and multi-element computing power service cluster

Similar Documents

Publication Publication Date Title
CN1731738A (en) Automated quick deployment method for large-scale computer cluster node
CN1848787A (en) Automatic fast dispositioning method for aggregated server system node
CN104394223A (en) Automatic rapid deployment method for large-scale computer cluster system nodes
CN104580519B (en) A kind of method of rapid deployment openstack cloud computing platforms
CN100410880C (en) Automated Deployment Methods for Operating Systems
CN101820387B (en) Method for rapidly deploying extensible cluster
CN101778136B (en) Method for installing operating systems for client through remote real-time monitoring of network
CN102591679B (en) Quick application software arrangement method for cluster system
CN106790467B (en) A method for automatic discovery and automatic deployment of cloud hosts
CN100340981C (en) Apparatus and method for managing and transferring virtual disks to network workstations via network
CN101232400B (en) A method of operating system deployment via network boot
CN106789327A (en) The automatic method and system that host is added for cloud platform
CN101344852A (en) A method, device and system for deploying WINDOWS enterprise edition operating system
JP2003330728A (en) System and method for custom installation of an operating system on a remote client
CN103297504A (en) Method for quickly deploying operating systems in physical bare computers in cloud data center
CN108089913B (en) Virtual machine deployment method of super-fusion system
CN104572372A (en) System and method for building server performance testing environment
CN104360883A (en) Method and system for configuring Raid (Redundant array of independent disks)
CN103200271A (en) Advanced Risc machine (ARM) server and method of automatic installation system thereof
CN101038551A (en) Method for implementing the start of Linux operation system in mobile hard disk
CN103777981A (en) Multi-operation-system remote loading implementation method suitable for X86 framework
CN105119732A (en) Device and method for distributing hosts in cluster environment
CN108989123A (en) A kind of server configuration method and device
CN1916861A (en) Method for modifying configuration information of computer
CN100347672C (en) Long-distance guide chip of transparent computing equipment based on dragon chip rack and panel construction and method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication