[go: up one dir, main page]

CN115904832A - A memory testing method, system, device and readable storage medium - Google Patents

A memory testing method, system, device and readable storage medium Download PDF

Info

Publication number
CN115904832A
CN115904832A CN202211258987.8A CN202211258987A CN115904832A CN 115904832 A CN115904832 A CN 115904832A CN 202211258987 A CN202211258987 A CN 202211258987A CN 115904832 A CN115904832 A CN 115904832A
Authority
CN
China
Prior art keywords
memory
test
test data
restart
round
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211258987.8A
Other languages
Chinese (zh)
Inventor
王芸红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202211258987.8A priority Critical patent/CN115904832A/en
Publication of CN115904832A publication Critical patent/CN115904832A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Techniques For Improving Reliability Of Storages (AREA)

Abstract

本发明属于计算机领域,具体涉及一种内存测试方法、系统、设备及介质,其中方法包括:基于对应内存测试策略配置目标机,并基于内存测试策略对目标机上的内存进行多轮重启测试;实时获取目标机在每一轮重启测试时输出的测试数据,并将测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。通过本发明提出的一种内存测试方法,可以有效提高在服务器领域的内存RTM测试时的测试效率,在无异常的情况下几乎完全无需人工参与,有效节省测试人力,同时避免人力测试时多次重复性数据收集带来的失误,避免手动测试的遗漏。节省了时间成本,使测试更加灵活,可以有效替代传统的RMT测试方法。

Figure 202211258987

The invention belongs to the field of computers, and specifically relates to a memory testing method, system, device and medium, wherein the method includes: configuring a target machine based on a corresponding memory test strategy, and performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy; Obtain the test data output by the target machine during each round of restart test, compare the test data with the predetermined data and/or test data of other rounds, and determine the result of the memory test according to the comparison results. Through a kind of memory test method proposed by the present invention, the test efficiency of the memory RTM test in the server field can be effectively improved, and no manual participation is required in the case of no abnormality, which effectively saves test manpower and avoids multiple times of manpower test. Avoid mistakes caused by repetitive data collection and avoid omissions in manual testing. It saves time and cost, makes the test more flexible, and can effectively replace the traditional RMT test method.

Figure 202211258987

Description

一种内存测试方法、系统、设备及可读存储介质A memory testing method, system, device and readable storage medium

技术领域technical field

本发明属于计算机领域,具体涉及一种内存测试方法、系统、设备及可读存储介质。The invention belongs to the field of computers, and in particular relates to a memory testing method, system, equipment and a readable storage medium.

背景技术Background technique

服务器内存测试是验证服务器平台和内存稳定性的一种必要测试,在传统的内存测试中,以当前Eagle stream(Intel推出的服务器解决方案)平台内存Margin测试标准有2X5X5或3X3X5两种,要收集50组或者45组数据,且当前需手动配置RMT BIOS。而手动收集50组数据,平均需要25小时的人力投入,所需时间长且灵活性差;手动配置RMT BIOS,操作步骤复杂,并可能有遗漏、隐藏项等;所以需要自动化的数据收集、存储的方式,以达到灵活测试,更省事,更能准确交付测试结果的时间的效果。The server memory test is a necessary test to verify the stability of the server platform and memory. In the traditional memory test, the current Eagle stream (server solution launched by Intel) platform memory Margin test standard has two types: 2X5X5 or 3X3X5. To collect 50 or 45 sets of data, and the current RMT BIOS needs to be manually configured. Manually collecting 50 sets of data requires an average of 25 hours of manpower input, which takes a long time and has poor flexibility; manual configuration of RMT BIOS requires complicated operation steps, and there may be omissions and hidden items; therefore, automated data collection and storage are required In order to achieve the effect of flexible testing, less trouble, and more accurate delivery of test results.

发明内容Contents of the invention

为解决以上问题,本发明提出一种内存测试方法,包括:In order to solve the above problems, the present invention proposes a memory testing method, comprising:

基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;Configuring the target machine based on the corresponding memory test strategy, and performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy;

实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。The test data output by the target machine during each round of restart test is obtained in real time, and the test data is compared with predetermined data and/or test data of other rounds, and the result of the memory test is determined according to the comparison result.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy includes:

响应于任一一轮重启测试完成,通过修改BIOS中内存插槽的映射关系的方式更换对应内存的插法。In response to the completion of any one round of restart tests, the insertion method of the corresponding memory is replaced by modifying the mapping relationship of the memory slots in the BIOS.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy further includes:

响应于预定轮次的重启测试完成,更换对应内存在所述目标机内存插槽中的位置以进行下一轮重启测试。In response to the completion of a predetermined round of restart tests, the position of the corresponding memory in the memory slot of the target machine is replaced to perform a next round of restart tests.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy further includes:

响应于所述对比结果出现异常,确定所述异常的内存和对应的内存插槽的编号并记录当前所述异常内存对应的测试数据;In response to an abnormality in the comparison result, determine the number of the abnormal memory and the corresponding memory slot, and record the test data corresponding to the current abnormal memory;

通过更换对应内存在所述目标机内存插槽中的位置的方式进行下一轮重启测试,并收集所述异常内存所对应的测试数据以及所述异常的内存插槽上的对应新的内存的测试数据;Perform the next round of restart test by replacing the position of the corresponding memory in the memory slot of the target machine, and collect the test data corresponding to the abnormal memory and the corresponding new memory on the abnormal memory slot. Test Data;

将所述异常的内存对应的测试数据与下一轮重启测试收集到的所述内存对应的测试数据以及所述异常的内存插槽上对应新的内存的测试数据进行对比;Comparing the test data corresponding to the abnormal memory with the test data corresponding to the memory collected in the next round of restart test and the test data corresponding to the new memory on the abnormal memory slot;

基于三种所述测试数据的对比确定后续重启测试的内存切换方式。Based on the comparison of the three test data, the memory switching mode for the subsequent restart test is determined.

在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式包括:In some implementations of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data includes:

响应于所述异常内存当前对应的测试数据和下一轮重启测试收集到所述异常内存的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据相同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the fact that the current test data corresponding to the abnormal memory is different from the test data of the abnormal memory collected in the next round of restart testing, and the test data of the abnormal memory collected in the next round of restart testing is the same as the predetermined data , then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory, and the test data of different memories are saved by the number of the memory.

在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:In some embodiments of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data also includes:

响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用更换对应内存在所述目标机内存插槽中的位置的方式进行后续重启测试。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is different from the test data corresponding to the new memory in the current round of testing, and the next The test data of the abnormal memory collected by the round restart test is different from the predetermined data, and the subsequent restart test is performed by replacing the position of the corresponding memory in the memory slot of the target machine for the subsequent restart test.

在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:In some embodiments of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data also includes:

响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据相同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is the same as the test data corresponding to the new memory in the current round of testing, and the next If the test data of the abnormal memory collected by the round restart test is different from the predetermined data, then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory and use the number of the memory to compare the memory slots of the different memory. Save the test data.

本发明的另一方面还提出一种内存测试系统,包括:Another aspect of the present invention also proposes a memory testing system, comprising:

测试配置模块,所述测试配置模块配置用于基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;A test configuration module, the test configuration module is configured to configure the target machine based on the corresponding memory test strategy, and perform multiple rounds of restart tests on the memory on the target machine based on the memory test strategy;

测试模块,所述测试模块配置用于实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。A test module, the test module is configured to obtain the test data output by the target machine in each round of restart test in real time, and compare the test data with predetermined data and/or other rounds of test data, and according to the comparison Result determines the result of the memory test.

本发明的又一方面还提出一种计算机设备,包括:Another aspect of the present invention also proposes a computer device, comprising:

至少一个处理器;以及at least one processor; and

存储器,所述存储器存储有可在所述处理器上运行的计算机指令,所述指令由所述处理器执行时实现上述实施方式中任意一项所述方法的步骤。A memory, where the memory stores computer instructions that can run on the processor, and when the instructions are executed by the processor, the steps of any one of the methods described in the foregoing implementation manners are implemented.

本发明的再一方面还提出一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现上述实施方式中任意一项所述方法的步骤。Another aspect of the present invention also proposes a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of any one of the methods described in the above-mentioned implementation modes are implemented .

通过本发明提出的一种内存测试方法,可以有效提高在服务器领域的内存RTM测试时的测试效率,在测试时实时收集测试数据,并在每一轮测试完成后对比测试结果,根据对比结果实现测试方案的灵活配置,在无异常的情况下几乎完全无需人工参与,有效节省测试人力,同时避免人力测试时多次重复性数据收集带来的失误(动辄几十次数据收集),以及避免手动测试的遗漏。节省了时间成本,使测试更加灵活,可以有效替代传统的RMT测试方法。Through a kind of memory test method proposed by the present invention, the test efficiency during the memory RTM test in the server field can be effectively improved, the test data can be collected in real time during the test, and the test results can be compared after each round of test is completed, and realized according to the comparison results. The flexible configuration of the test plan almost completely eliminates the need for manual participation in the absence of abnormalities, effectively saving test manpower, and avoiding mistakes caused by repeated data collection during manual testing (dozens of data collections at every turn), and avoiding manual testing. test omission. It saves time and cost, makes the test more flexible, and can effectively replace the traditional RMT test method.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without creative work.

图1为本发明实施例提供的一种内存测试方法的流程示意图;Fig. 1 is a schematic flow chart of a memory testing method provided by an embodiment of the present invention;

图2为本发明实施例提供的一种内存测试系统的结构示意图;FIG. 2 is a schematic structural diagram of a memory testing system provided by an embodiment of the present invention;

图3为本发明实施例提供的一种计算机设备的结构示意图;FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present invention;

图4为本发明实施例提供的一种计算机可读存储介质的结构示意图。FIG. 4 is a schematic structural diagram of a computer-readable storage medium provided by an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚明白,以下结合具体实施例,并参照附图,对本发明实施例进一步详细说明。In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

本发明旨在解决内存测试领域中的RMT(Rank Margin Test)内存测试效率低的问题,RMT是一种以BIOS为主的内存测试方案,整个测试方案内嵌在BIOS中;将BIOS中的RankMargin Tool选项enable,平台就自动进入Rank Margin测试模式,测试结果由COM口输出,其测试方式类似于ATE上的shmoo测试:逐步调节Vref电压值,同时逐步调节DQ-DQS Skew,在每个点上对内存进行读写,看是否会出错。The present invention aims to solve the problem of low efficiency of RMT (Rank Margin Test) memory test in the field of memory test, RMT is a kind of memory test scheme based on BIOS, and the whole test scheme is embedded in BIOS; RankMargin in BIOS Tool option enable, the platform will automatically enter the Rank Margin test mode, the test results are output by the COM port, the test method is similar to the shmoo test on the ATE: gradually adjust the Vref voltage value, and at the same time gradually adjust the DQ-DQS Skew, at each point Read and write the memory to see if there is an error.

在传统的实现方式上,RMT测试需要对服务器进行多次重启并且在重启一定次数时需要更换服务器主板上内存插槽中的内存次序,最后再在另一个服务器主板上重复一遍上述过程以完成对应的测试。由于服务器支持的内存的规格巨大,常见的服务器内存插槽都是2x8的内存方案,因此在进行内存测试时每一次内存测试均需要大量的时间,而传统的实现方式是由人工开启对应的测试软件,并通过对应的端口连接到被测试的服务器上的端口收集内存测试数据。当一轮测试或多轮测试完成时需要人工去更换服务器上对应内存的插法(将内存重新排列组合后重新插入内存),但由于在服务器上插满内存条进行RMT测试时要耗费大量的时间,每次内存测试都需要人工盯着测试机的执行状态,相应人员难以把时间用在其他工作上,使得测试人员的工作处于“机械”的空闲的状态(很难专注的投入到其他工作中),如果采取放任不管的测试方式,当一轮测试完成后无法及时进行下一轮RMT测试导致服务器的RMT测试时间大大延长。可能会影响服务器的后续开发或验证流程。导致服务器进展较慢。因此在服务器RMT测试期间,服务器的产生的噪音也影响测试人员的工作状态,如果远离测试环境又难以及时发现对应轮次的测试完成。因此传统的测试方式导致的测试问题一直亟待解决。In the traditional implementation method, the RMT test needs to restart the server multiple times and the order of the memory in the memory slot on the server motherboard needs to be changed after a certain number of restarts, and finally repeat the above process on another server motherboard to complete the corresponding test. Due to the large size of the memory supported by the server, the common server memory slots are all 2x8 memory solutions, so each memory test takes a lot of time when performing memory tests, and the traditional implementation method is to manually open the corresponding test software, and connect to the port on the server under test through the corresponding port to collect memory test data. When one round of testing or multiple rounds of testing is completed, it is necessary to manually replace the insertion method of the corresponding memory on the server (rearrange and combine the memory and then reinsert the memory). Time, each memory test needs to manually stare at the execution status of the test machine, and it is difficult for the corresponding personnel to spend time on other work, making the work of the tester in a "mechanical" idle state (it is difficult to concentrate on other work Middle), if the test method is adopted, the RMT test time of the server will be greatly prolonged if the next round of RMT test cannot be performed in time after one round of testing is completed. May affect the subsequent development or verification process of the server. Causes the server to progress slower. Therefore, during the server RMT test, the noise generated by the server also affects the working status of the testers. If they are far away from the test environment, it is difficult to find that the corresponding round of testing is completed in time. Therefore, the testing problems caused by traditional testing methods have been urgently needed to be solved.

如图1所示,为解决以上问题,本发明提出一种内存测试方法,包括:As shown in Figure 1, in order to solve the above problems, the present invention proposes a memory testing method, comprising:

步骤S1、基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;Step S1, configure the target machine based on the corresponding memory test strategy, and perform multiple rounds of restart tests on the memory on the target machine based on the memory test strategy;

步骤S2、实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。Step S2: Obtain in real time the test data output by the target machine during each round of restart test, compare the test data with predetermined data and/or test data of other rounds, and determine the result of the memory test according to the comparison result.

在本实施例中,在步骤S1中,目标机是指用于执行RMT测试的设备,本发明中是以服务器作为测试设备;内存测试配置策略是指RMT测试策略且RMT测试策略是由用户输入或指定的测试策略,用户在输入测试策略时通常有三个参数X、Y、Z,分别对应Borad Copies,DIMM Shifts和Repeats。其中Borad Copies表示测试主板的个数即指在测试过程中切换几个主板进行测试,DIMM Shifts是指更换内存插槽的次数,Repeats表示在主板和内存不发生变更的情况下进行几次重启测试。以2x5x5的测试配置策略为例,2x5x5表示BoradCopies为2,DIMM Shifts为5,Repeats为5。即需要准备两个服务器主板进行测试,先在其中一个主板上按照将内存插槽插满内存进行5轮内存的RMT测试,每轮RMT测试完成后重启服务器,经过5轮RMT测试之后将服务器主板上的内存条变更一次,内存条变更是将服务器主板上的内存插槽的内存进行变更,即下一组5轮重启中,对应的内存插槽和上一组5轮重启测试中的内存均不相同。也即,以2x5x5的测试配置策略则完成RMT测试将会收集到50组的内存测试数据。In the present embodiment, in step S1, the target machine refers to the equipment used to perform the RMT test. In the present invention, the server is used as the test equipment; the memory test configuration strategy refers to the RMT test strategy and the RMT test strategy is input by the user Or the specified test strategy, when the user enters the test strategy, there are usually three parameters X, Y, Z, corresponding to Borad Copies, DIMM Shifts and Repeats respectively. Among them, Borad Copies indicates the number of test motherboards, which means switching several motherboards for testing during the test, DIMM Shifts refers to the number of times the memory slots are replaced, and Repeats indicates several restart tests without changing the motherboard and memory . Take the 2x5x5 test configuration strategy as an example, 2x5x5 means BoradCopies is 2, DIMM Shifts is 5, and Repeats is 5. That is to say, two server motherboards need to be prepared for testing. First, perform 5 rounds of RMT tests on one of the motherboards by filling the memory slots with memory. After each round of RMT testing, restart the server. After 5 rounds of RMT testing, replace the server motherboards. The memory module on the server is changed once, and the memory module change is to change the memory of the memory slot on the mainboard of the server. Are not the same. That is to say, if the RMT test is completed with a 2x5x5 test configuration strategy, 50 sets of memory test data will be collected.

进一步,对于首次用于RMT测试的服务器主板,需要确认该服务器主板的BIOS设置是否满足对应的RMT测试需求。如果不满足RMT测试需求则需要对BIOS进行对应的设置。Further, for the server motherboard that is used for the RMT test for the first time, it is necessary to confirm whether the BIOS settings of the server motherboard meet the corresponding RMT test requirements. If the RMT test requirements are not met, the BIOS needs to be set accordingly.

具体地,首先通过SCE tool将bios选项值导出bios.txt,获取设置中的Advanceddebug Function选项值,修改选项值为Disabled来打开BIOS隐藏选项。进一步通过SCE工具将bios选项值导出bios.txt,获取如下设置的选项值:Specifically, first export the bios option value to bios.txt through the SCE tool, obtain the Advanceddebug Function option value in the settings, and modify the option value to Disabled to open the BIOS hidden option. Further export the bios option value to bios.txt through the SCE tool to obtain the option value set as follows:

Serial Debug Message Level;Serial Debug Message Level;

System Debug Level;System Debug Level;

Sockets in parallel;Sockets in parallel;

Attempt Fast Boot;Attempt Fast Boot;

Attempt Fast Cold Boot;Attempt Fast Cold Boot;

Rank Maigin Tool;Rank Maigin Tool;

RMT on Fast Cold Boot;RMT on Fast Cold Boot;

并检测上述选项(上述选项为BIOS中的对应功能的设置选项)的具体参数值是否为对应地:Mininum,Normal,1,Disable,Disable,Enable,Disable,如果存在上述选项的值与上述参数值不对应则将导出的bios.txt中对应的选项的值进行修改,并将修改后的bios.txt通过SEC tool重新导入到BISO中并使BIOS生效。完成对目标机的相关配置。And detect whether the specific parameter value of the above-mentioned option (the above-mentioned option is the setting option of the corresponding function in the BIOS) is corresponding to: Mininum, Normal, 1, Disable, Disable, Enable, Disable, if there is a value of the above-mentioned option and the above-mentioned parameter value If it does not correspond, modify the value of the corresponding option in the exported bios.txt, and re-import the modified bios.txt into the BIOS through the SEC tool to make the BIOS take effect. Complete the configuration of the target machine.

在步骤S2中,通过BMC的SOL功能收集在BIOS中运行的RMT测试程序所输出的串口日志,并通过BMC对应的命令保存BIOS在RMT测试过程中输出的日志信息,具体的通过ipmitool–I lanplus–H IP–U username–P password sol activate命令保存串口日志信息。In step S2, the serial port log output by the RMT test program running in the BIOS is collected by the SOL function of the BMC, and the log information output by the BIOS during the RMT test process is saved by the corresponding command of the BMC, specifically through ipmitool-Ilanplus –H IP–U username–P password sol activate command saves serial port log information.

进一步,基于保存的串口日志信息,在每一轮测试完成后将对应内存的测试数据进行解析分类,以将每一轮测试信息中的对应的内存的测试结果进行保存,用以与下一轮测试中获取到的日志信息中对应的内存测试结果进行对比或者是预定的标准测试结果进行对比。即采用当前轮次的测试数据和下一轮次的测试数据进行对比,或者是与预定的标准测试数据进行对比,如果多轮重启测试得到的对应的测试结果之间的对比结果均相同且与预定的标准测试结果一样则说明RMT测试通过服务器上的所有内存的状态正常。Further, based on the stored serial port log information, after each round of testing is completed, the test data of the corresponding memory is analyzed and classified, so as to save the test results of the corresponding memory in each round of test information for use in the next round of testing. Compare the corresponding memory test results in the log information obtained during the test or compare the predetermined standard test results. That is, the test data of the current round is compared with the test data of the next round, or compared with the predetermined standard test data. If the predetermined standard test results are the same, it means that the state of all memory on the server is normal after the RMT test is passed.

如果多轮重启测试得到测试结果之间的对比的结果存在不同,即存在对应的内存在测试中存在异常,则测试结果为存在内存测试异常。If there is a difference in the comparison results between the test results obtained through multiple rounds of restart tests, that is, there is an abnormality in the corresponding memory in the test, the test result is that there is an abnormality in the memory test.

具体地,在本发明实施例中,在每一轮次的测试中,需要每隔预定时间检查收集到的串口日志数据,如果串口日志数据中包含对应的完成标志信息,则认为完成一轮次的RMT测试。例如,当检测到“RMT test v3.22.0.complete”,“RMT test v3.22.0.complete”是版本为3.22.0的RMT测试程序在执行测试完成后输出的结束日志信息。在获得该结束日志信息后表示一轮内存RMT测试已经完成,则对收集到的日志信息进行解析,将不同内存(当前测试的目标机上所插入的多个内存)的测试数据进行分类,并将不同内存的测试数据用于与下一轮次的测试数据进行对比。Specifically, in the embodiment of the present invention, in each round of testing, it is necessary to check the collected serial port log data at predetermined intervals, and if the serial port log data contains corresponding completion flag information, it is considered that a round is completed The RMT test. For example, when "RMT test v3.22.0.complete" is detected, "RMT test v3.22.0.complete" is the end log information output by the RMT test program with version 3.22.0 after the execution of the test is completed. After obtaining the end log information, it means that a round of memory RMT test has been completed, then the collected log information is parsed, the test data of different memory (inserted on the target machine of the current test) are classified, and The test data of different memory is used for comparison with the test data of the next round.

另外,根据上述的RMT测试策略,在执行完一轮次的测试数据时将对应的BoradCopies,DIMM Shifts和Repeats测试记数加1。例如根据Borad Copies,DIMM Shifts和Repeats的关系可知,假如用户输入的测试策略是2x5x5,则当完成5轮对内存的RMT测试之后视为完成一轮DIMM Shifts,需要更换服务器上的内存插槽中的内存顺序。当完成5轮内存插槽中内存的更换之后,需要切换服务器主板进行测试。即将原本在测试服务器上的内存迁移到另外一台服务器上进行测试。具体是通过对应的参数的测试记数来实现对应的测试策略的控制,没执行完一轮内存测试,将Repeats对应的记数加1,当Repeats对应的记数为5时,切换内存以及将Repeats对应的记数为5清零重新记数,并将DIMM Shifts对应的记数加1,同样,当DIMM Shifts对应的技术为5时,切换服务器主板。在新主板上继续上述过程。知道满足用户输入的测试策略完成。In addition, according to the above RMT test strategy, the corresponding BoradCopies, DIMM Shifts and Repeats test counts are increased by 1 when a round of test data is executed. For example, according to the relationship between Borad Copies, DIMM Shifts and Repeats, if the test strategy input by the user is 2x5x5, after completing 5 rounds of RMT tests on the memory, it is deemed to have completed a round of DIMM Shifts, and the memory slot on the server needs to be replaced. memory order. After completing the replacement of the memory in the memory slot for 5 rounds, it is necessary to switch the mainboard of the server for testing. That is to migrate the memory originally on the test server to another server for testing. Specifically, the control of the corresponding test strategy is realized through the test count of the corresponding parameter. Before a round of memory test is executed, the count corresponding to Repeats is increased by 1. When the count corresponding to Repeats is 5, the memory is switched and the The count corresponding to Repeats is 5, cleared and counted again, and the count corresponding to DIMM Shifts is increased by 1. Similarly, when the technology corresponding to DIMM Shifts is 5, switch the mainboard of the server. Continue the above process on the new motherboard. The test strategy is complete until the user input is satisfied.

在此需要说明的是,本发明实施例是从第一轮次的测试数据的收集为角度解释本发明的实施方式,在第一轮次的RMT测试完成后得到的多个内存的测试数据先于预定的测试数据进行对比。当后续的第二轮次的RMT测试完成后再与第二轮次的RMT内存测试数据进行对比。It should be noted here that the embodiment of the present invention explains the implementation of the present invention from the perspective of the collection of the first round of test data. After the completion of the first round of RMT test, the test data of a plurality of internal memories are first Compared with the predetermined test data. After the subsequent second round of RMT test is completed, it will be compared with the second round of RMT memory test data.

在本发明的一些实施例中,测试数据的对比是以分类后的对应的内存的测试数据进行对比。In some embodiments of the present invention, the comparison of the test data is performed with the corresponding sorted test data in memory.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy includes:

响应于任一一轮重启测试完成,通过修改BIOS中内存插槽的映射关系的方式更换对应内存的插法。In response to the completion of any one round of restart tests, the insertion method of the corresponding memory is replaced by modifying the mapping relationship of the memory slots in the BIOS.

在本实施例中,对于内存测试策略中切换内存的操作,本发明采用改变BIOS中对应的内存插槽的映射关系的方式实现,内存插槽的映射关系是指服务器中在物理层面上多个插槽在BIOS的软件层面上的编号,以16个内存插槽的主板为例,假设从左到右服务器上的内存插槽以此编号为1~16,每个内存插槽都连接着CPU上对应的总线端口,存在固定的一一对应的连接关系。而BIOS中则存在对应的映射关系,存在于BISO的固件中,例如物理空间上的第一个内存插槽在默认的BIOS中对应的编号为0,第二个内存插槽在默认的BIOS中对应的编号为1。则当RMT测试进行到要切换内存的插入顺序时,无需通过测试人员进行手动更换内存测试的方式,而是直接以修改BIOS中内存插槽的编号与对应物理插槽的空间映射关系的方式实现,则对于RMT内存测试工具在检测到对应的内存插槽上的内存条编号发生改变时便可执行后续的测试。例如,以2x5x5的配置策略为例,需要进行5次重启测试后切换内存的插法时对第6次的内存测试可以采用修改BIOS中内存插槽的映射关系的方式更换对应的内存的插法。In this embodiment, for the operation of switching memory in the memory test strategy, the present invention implements by changing the mapping relationship of the corresponding memory slot in the BIOS. The mapping relationship of the memory slot refers to multiple The number of slots on the BIOS software level, taking a motherboard with 16 memory slots as an example, assuming that the memory slots on the server are numbered from 1 to 16 from left to right, and each memory slot is connected to the CPU There is a fixed one-to-one connection relationship between the corresponding bus ports. In the BIOS, there is a corresponding mapping relationship, which exists in the BIOS firmware. For example, the first memory slot in the physical space is numbered 0 in the default BIOS, and the second memory slot is in the default BIOS. The corresponding number is 1. Then when the RMT test proceeds to switch the insertion sequence of the memory, it is not necessary to manually replace the memory test by the tester, but directly by modifying the memory slot number in the BIOS and the spatial mapping relationship between the corresponding physical slot. , then the RMT memory test tool can perform subsequent tests when it detects that the number of the memory module on the corresponding memory slot has changed. For example, taking the configuration strategy of 2x5x5 as an example, when it is necessary to switch the memory insertion method after 5 restart tests, the corresponding memory insertion method can be replaced by modifying the mapping relationship of the memory slot in the BIOS for the sixth memory test. .

具体地,修改BIOS中内存插槽的映射关系通过修改BIOS固件完成,并将修改后的BIOS固件重新下刷到BIOS中重启生效。可有效降低运维人员在RMT测试过程中的参与度以及工作量。Specifically, modifying the mapping relationship of memory slots in the BIOS is accomplished by modifying the BIOS firmware, and re-downloading the modified BIOS firmware to the BIOS and restarting to take effect. It can effectively reduce the participation and workload of operation and maintenance personnel in the RMT test process.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy further includes:

响应于预定轮次的重启测试完成,更换对应内存在所述目标机内存插槽中的位置以进行下一轮重启测试。In response to the completion of a predetermined round of restart tests, the position of the corresponding memory in the memory slot of the target machine is replaced to perform a next round of restart tests.

在本实施例中,对于内存测试策略中切换内存的操作,也可以采用人工更换对应的内存插法的方式,实现物理上的更换。In this embodiment, for the operation of switching memory in the memory test strategy, the method of manually replacing the corresponding memory interpolation method can also be used to realize physical replacement.

在本发明的一些实施方式中,基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:In some embodiments of the present invention, performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy further includes:

响应于所述对比结果出现异常,确定所述异常的内存和对应的内存插槽的编号并记录当前所述异常内存对应的测试数据;In response to an abnormality in the comparison result, determine the number of the abnormal memory and the corresponding memory slot, and record the test data corresponding to the current abnormal memory;

通过更换对应内存在所述目标机内存插槽中的位置的方式进行下一轮重启测试,并收集所述异常内存所对应的测试数据以及所述异常的内存插槽上的对应新的内存的测试数据;Perform the next round of restart test by replacing the position of the corresponding memory in the memory slot of the target machine, and collect the test data corresponding to the abnormal memory and the corresponding new memory on the abnormal memory slot. Test Data;

将所述异常的内存对应的测试数据与下一轮重启测试收集到的所述内存对应的测试数据以及所述异常的内存插槽上对应新的内存的测试数据进行对比;Comparing the test data corresponding to the abnormal memory with the test data corresponding to the memory collected in the next round of restart test and the test data corresponding to the new memory on the abnormal memory slot;

基于三种所述测试数据的对比确定后续重启测试的内存切换方式。Based on the comparison of the three test data, the memory switching mode for the subsequent restart test is determined.

在本实施例中,如果在测试过程中存在某个内存的当前轮次的测试数据和下一轮重启测试的测试数据的对比结果为异常,或者是某个内存的测试数据和预定的标准测试数据出现异常,则认为RMT内存测试出现异常,在进行下一轮重启测试时通由测试人员更换对应的内存插法的方式对内存与内存插槽的位置进行更换,用以通过后续的测试数据来确认是否是因为内存插槽问题还是内存条本身的问题。In this embodiment, if the comparison result between the current round of test data of a certain memory and the test data of the next round of restart test is abnormal during the test, or the test data of a certain memory and the predetermined standard test If the data is abnormal, it is considered that the RMT memory test is abnormal. During the next round of restart test, the tester replaces the corresponding memory insertion method to replace the position of the memory and the memory slot to pass the subsequent test data. To confirm whether it is due to the problem of the memory slot or the problem of the memory module itself.

进一步,在更换完内存的插法后,重新启动服务器进入BIOS中进行RMT内存测试,将更换内存插法后的测试数据按照内存的编号进行分类,所谓内存编号是指对应的内存条的唯一ID,由内存生产商保存在SPD信息中的唯一标识。根据内存的唯一标识将前后两轮对应的内存的RMT内存测试数据进行对比,验证是否在更换内存插槽后对应的内存条在两次测试中对应的测试数据不同。同时将当前轮次测试过程中出现异常内存测试数据的所属的内存插槽在下一轮的RMT内存测试时对应的内存产生的内存测试数据与当前轮次的内存插槽对应的内存产生的内存测试数据进行对比。Further, after replacing the memory insertion method, restart the server and enter the BIOS to perform the RMT memory test, and classify the test data after replacing the memory insertion method according to the memory number. The so-called memory number refers to the unique ID of the corresponding memory stick , the unique identifier stored in the SPD information by the memory manufacturer. According to the unique identifier of the memory, compare the RMT memory test data of the corresponding memory in the two rounds before and after, and verify whether the corresponding memory module has different test data in the two tests after the memory slot is replaced. At the same time, the memory test data generated by the corresponding memory of the memory slot with abnormal memory test data in the next round of RMT memory test in the current round of testing and the memory test data generated by the memory corresponding to the memory slot of the current round data for comparison.

如果同一根内存在更换内存插槽后产生的内存测试数据不同,则有可能是内存存在问题,也有可能是更换的另外一个内存插槽有问题。因此通过同一跟内存在两次RMT测试中的测试数据和对应的内存插槽在两次RMT测试中对应的测试数据的对比确认后续测试采用什么样的测试策略。If the memory test data of the same memory is different after the memory slot is replaced, there may be a problem with the memory, or there may be a problem with the replaced memory slot. Therefore, by comparing the test data of the same memory in the two RMT tests and the corresponding test data of the corresponding memory slot in the two RMT tests, it is confirmed what kind of test strategy is used for the subsequent test.

在本发明的一些实施例中,采用修改BIOS中内存插槽映射关系的方式更换对应的内存的插法和更换对应内存在所述目标机内存插槽中的位置的方式,在一些情况下是指完成一次内存测试的Repeats的之后进行更换,并且在一些情况下也可以是指完成5次Repeats即完成一次DIMM Shifts的情况下改变内存。即在本发明的一些实施例中根据当前得到的测试数据采用不同的更换内存的策略来验证出现问题的原因是内存本身还是服务器主板上的内存插槽或主板引起。In some embodiments of the present invention, the way of modifying the memory slot mapping relationship in the BIOS is used to replace the insertion method of the corresponding memory and to replace the position of the corresponding memory in the memory slot of the target machine. In some cases, it is It refers to the replacement after completing the Repeats of a memory test, and in some cases, it can also refer to changing the memory after completing 5 Repeats, that is, completing one DIMM Shifts. That is, in some embodiments of the present invention, different memory replacement strategies are used according to the currently obtained test data to verify whether the cause of the problem is caused by the memory itself or the memory slot on the mainboard of the server or the mainboard.

在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式包括:In some implementations of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data includes:

响应于所述异常内存当前对应的测试数据和下一轮重启测试收集到所述异常内存的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据相同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the fact that the current test data corresponding to the abnormal memory is different from the test data of the abnormal memory collected in the next round of restart testing, and the test data of the abnormal memory collected in the next round of restart testing is the same as the predetermined data , then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory, and the test data of different memories are saved by the number of the memory.

在本实施例中,当对应的内存在当前轮次的测试数据存在异常,并且在经过下一轮RMT测试后得到的关于该内存的测试数据与该内存当前轮次的测试数据不同,即两次RMT测试对于同样的一根内存所测试得到的测试数据不同,但下一轮RMT测试得到的该内存的测试数据与预定标准数据相同,则说明该内存在当前轮次的测试数据异常,但在更换内存插槽后的RMT测试中表现正常,此时极有可能是因为对应的内存插槽有问题。则在后续的RMT测试过程中则可以采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法,同时保存不同编号的内存在后续RMT测试过程中对应的测试数据,然后将每一个内存的测试数据进行对比,对比结果不一致的则与预定的标准数据进行对比,以分析对应的内存异常的次数、以及详细的异常信息。In this embodiment, when there is abnormality in the test data of the current round of the corresponding memory, and the test data about the memory obtained after the next round of RMT test is different from the test data of the current round of the memory, that is, two If the test data obtained by the same memory in one RMT test is different, but the test data of the memory obtained in the next RMT test is the same as the predetermined standard data, it means that the test data of the current round of the memory is abnormal, but In the RMT test after the memory slot is replaced, the performance is normal. At this time, it is very likely that there is a problem with the corresponding memory slot. Then in the subsequent RMT test process, you can use the method of modifying the mapping relationship of the memory slots in the BIOS to replace the corresponding memory insertion method, and at the same time save the corresponding test data of the memory with different numbers in the subsequent RMT test process, and then copy each The test data of the memory is compared, and if the comparison results are inconsistent, it is compared with the predetermined standard data to analyze the number of corresponding memory exceptions and detailed exception information.

在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:In some embodiments of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data also includes:

响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用更换对应内存在所述目标机内存插槽中的位置的方式进行后续重启测试。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is different from the test data corresponding to the new memory in the current round of testing, and the next The test data of the abnormal memory collected by the round restart test is different from the predetermined data, and the subsequent restart test is performed by replacing the position of the corresponding memory in the memory slot of the target machine for the subsequent restart test.

在本实施例中,如果在当前轮次的测试过程中,存在对应的内存的RTM测试的测试数据异常(与预定的标准数据不同或与下一轮测试得到的数据不同),则确定该内存对应的内存插槽,并判断在下一轮次测试中对应内存插槽上的内存的测试数据与当前轮次的内存的测试数据是否相同。如果下一轮次测试中对应内存插槽上的内存的测试数据与当前轮次的内存的测试数据是否不相同,则将该内存插槽在下一轮次测试中对应内存插槽上的内存的测试数据与预定标准数据进行比较,如果与预定的标准测试数据仍然不同,说明该内存插槽上在更换内存后仍然出现对应的测试数据异常的情况,但是在两次测试过程中得到的测试数据也不同,相对的第二次(下一轮测试)得到数据也和预定的标准测试数据不同,则说明可能是更换的两个内存本身都有问题;也有可能是内存插槽的电路存在问题,在不同的测试场景下因出现问题的时间不确定导致测试数据出现异常的Rank不同等情况,属于难以确认的情况。则对后续多轮次的重启测试采用更换对应内存的方式进行RMT测试。在本发明的一些实施方式中,基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:In this embodiment, if during the current round of testing, the test data of the RTM test of the corresponding internal memory is abnormal (different from the predetermined standard data or different from the data obtained in the next round of testing), then it is determined that the internal memory the corresponding memory slot, and determine whether the test data of the memory on the corresponding memory slot in the next round of testing is the same as the test data of the memory of the current round. If the test data of the memory on the corresponding memory slot in the next round of testing is different from the test data of the memory of the current round, then the memory slot on the corresponding memory slot in the next round of testing Compare the test data with the predetermined standard data. If it is still different from the predetermined standard test data, it means that the corresponding test data is still abnormal after replacing the memory on the memory slot, but the test data obtained during the two tests It is also different, and the data obtained in the second (next round of testing) is also different from the predetermined standard test data, which means that there may be problems with the two replaced memory itself; there may also be problems with the circuit of the memory slot. In different test scenarios, due to the uncertainty of the time when the problem occurred, the abnormal Rank of the test data is different, which is difficult to confirm. Then, for subsequent multiple rounds of restart tests, the RMT test is performed by replacing the corresponding memory. In some embodiments of the present invention, determining the memory switching mode for the subsequent restart test based on the comparison of the three test data also includes:

响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据相同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is the same as the test data corresponding to the new memory in the current round of testing, and the next If the test data of the abnormal memory collected by the round restart test is different from the predetermined data, then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory and use the number of the memory to compare the memory slots of the different memory. Save the test data.

在本实施例中,如果在当前轮次的测试过结果中同一个内存插槽上的两个内存对应的测试数据不同,但下一轮次的测试数据与预定标准测试数据相同,则说明该内存插槽在更换对应的内存后对应的内存的测试数据出现正常。则说明很有可能是当前轮次的内存异常导致的测试数据异常。可初步排除内存插槽问题。因此在后续的RMT测试中采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法进行多轮重启测试。同时将测试数据根据内存编号分类保存。在本发明的一些实施方式中,本发明中的当前轮次并不是单指每次重启,可以是指测试策略中的每次更换内存的情况,因为RTM测试规定是在内存插法不变和主板不变的情况下执行5次重启测试。以冗余的方式验证在内存插法不变和主板不变的情况下的测试。因此,在一些实施例中后续的多轮重启测试可以是指每完成一次DIMMShifts(完成5次或预定次数的Repeats)的情况下切换对应的内存(采用修改BIOS或更换内存位置),也可以是在完成一次Repeats之后按照上述切换内存机制对后续的多轮次重启的RMT测试。In this embodiment, if the test data corresponding to two memories on the same memory slot are different in the test results of the current round, but the test data of the next round is the same as the predetermined standard test data, it means that the After the memory slot is replaced with the corresponding memory, the test data of the corresponding memory appears normal. It means that the test data exception is likely to be caused by the memory exception of the current round. The memory slot problem can be preliminarily ruled out. Therefore, in the subsequent RMT test, the method of modifying the mapping relationship of the memory slot in the BIOS is used to replace the insertion method of the corresponding memory for multiple rounds of restart tests. At the same time, the test data is classified and saved according to the memory number. In some embodiments of the present invention, the current round in the present invention does not simply refer to each restart, but may refer to the situation of replacing the memory each time in the test strategy, because the RTM test stipulates that the memory interpolation method is constant and Perform 5 restart tests with the motherboard unchanged. Verifies tests with the same memory placement and the same motherboard in a redundant manner. Therefore, in some embodiments, the subsequent multiple rounds of restart tests can refer to switching the corresponding memory (by modifying the BIOS or replacing the memory location) every time DIMMShifts is completed (5 times or a predetermined number of Repeats are completed), or it can be After completing a Repeats, follow the above-mentioned switching memory mechanism to test the RMT of subsequent multiple rounds of restarts.

通过本发明提出的一种内存测试方法,可以有效提高在服务器领域的内存RTM测试时的测试效率,在测试时实时收集测试数据,并在每一轮测试完成后对比测试结果,根据对比结果实现测试方案的灵活配置,在无异常的情况下几乎完全无需人工参与,有效节省测试人力,同时避免人力测试时多次重复性数据收集带来的失误(动辄几十次数据收集),以及避免手动测试的遗漏。节省了时间成本,使测试更加灵活,可以有效替代传统的RMT测试方法。Through a kind of memory test method proposed by the present invention, the test efficiency during the memory RTM test in the server field can be effectively improved, the test data can be collected in real time during the test, and the test results can be compared after each round of test is completed, and realized according to the comparison results. The flexible configuration of the test plan almost completely eliminates the need for manual participation in the absence of abnormalities, effectively saving test manpower, and avoiding mistakes caused by repeated data collection during manual testing (dozens of data collections at every turn), and avoiding manual testing. test omission. It saves time and cost, makes the test more flexible, and can effectively replace the traditional RMT test method.

如图2所示,本发明的另一方面还提出一种内存测试系统,包括:As shown in Figure 2, another aspect of the present invention also proposes a kind of memory testing system, comprises:

测试配置模块1,所述测试配置模块1配置用于基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;Test configuration module 1, the test configuration module 1 is configured to configure the target machine based on the corresponding memory test strategy, and perform multiple rounds of restart tests on the memory on the target machine based on the memory test strategy;

测试模块2,所述测试模块2配置用于实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。Test module 2, the test module 2 is configured to obtain the test data output by the target machine in each round of restart test in real time, and compare the test data with predetermined data and/or other rounds of test data, and Determine the result of the memory test based on the comparison results.

如图3所示,本发明的又一方面还提出一种计算机设备,包括:As shown in Figure 3, another aspect of the present invention also proposes a computer device, including:

至少一个处理器21;以及at least one processor 21; and

存储器22,所述存储器22存储有可在所述处理器21上运行的计算机指令23,所述指令23由所述处理器21执行时实现一种内存测试方法。A memory 22, the memory 22 stores computer instructions 23 that can run on the processor 21, and when the instructions 23 are executed by the processor 21, a memory testing method is implemented.

如图4所示,本发明的再一方面还提出一种计算机可读存储介质401,所述计算机可读存储介质401存储有计算机程序402,所述计算机程序402被处理器执行时实现一种内存测试方法。As shown in FIG. 4 , another aspect of the present invention also proposes a computer-readable storage medium 401, the computer-readable storage medium 401 stores a computer program 402, and when the computer program 402 is executed by a processor, a Memory test method.

以上是本发明公开的示例性实施例,但是应当注意,在不背离权利要求限定的本发明实施例公开的范围的前提下,可以进行多种改变和修改。根据这里描述的公开实施例的方法权利要求的功能、步骤和/或动作不需以任何特定顺序执行。此外,尽管本发明实施例公开的元素可以以个体形式描述或要求,但除非明确限制为单数,也可以理解为多个。The above are the exemplary embodiments disclosed in the present invention, but it should be noted that various changes and modifications can be made without departing from the scope of the disclosed embodiments of the present invention defined in the claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. In addition, although the elements disclosed in the embodiments of the present invention may be described or required in an individual form, they may also be understood as a plurality unless explicitly limited to a singular number.

应当理解的是,在本文中使用的,除非上下文清楚地支持例外情况,单数形式“一个”旨在也包括复数形式。还应当理解的是,在本文中使用的“和/或”是指包括一个或者一个以上相关联地列出的项目的任意和所有可能组合。It should be understood that as used herein, the singular form "a" and "an" are intended to include the plural forms as well, unless the context clearly supports an exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.

上述本发明实施例公开实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments disclosed in the above-mentioned embodiments of the present invention are only for description, and do not represent the advantages and disadvantages of the embodiments.

本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps for implementing the above embodiments can be completed by hardware, and can also be completed by instructing related hardware through a program. The program can be stored in a computer-readable storage medium. The above-mentioned The storage medium mentioned may be a read-only memory, a magnetic disk or an optical disk, and the like.

所属领域的普通技术人员应当理解:以上任何实施例的讨论仅为示例性的,并非旨在暗示本发明实施例公开的范围(包括权利要求)被限于这些例子;在本发明实施例的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,并存在如上所述的本发明实施例的不同方面的许多其它变化,为了简明它们没有在细节中提供。因此,凡在本发明实施例的精神和原则之内,所做的任何省略、修改、等同替换、改进等,均应包含在本发明实施例的保护范围之内。Those of ordinary skill in the art should understand that: the discussion of any of the above embodiments is exemplary only, and is not intended to imply that the scope (including claims) disclosed by the embodiments of the present invention is limited to these examples; under the idea of the embodiments of the present invention , technical features in the above embodiments or in different embodiments can also be combined, and there are many other changes in different aspects of the embodiments of the present invention as described above, which are not provided in details for the sake of brevity. Therefore, within the spirit and principle of the embodiments of the present invention, any omissions, modifications, equivalent replacements, improvements, etc., shall be included in the protection scope of the embodiments of the present invention.

Claims (10)

1.一种内存测试方法,其特征在于,包括:1. A memory testing method, characterized in that, comprising: 基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;Configuring the target machine based on the corresponding memory test strategy, and performing multiple rounds of restart tests on the memory on the target machine based on the memory test strategy; 实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。The test data output by the target machine during each round of restart test is obtained in real time, and the test data is compared with predetermined data and/or test data of other rounds, and the result of the memory test is determined according to the comparison result. 2.根据权利要求1所述的方法,其特征在于,所述基于所述内存测试策略对所述目标机上的内存进行多轮重启测试包括:2. The method according to claim 1, wherein said carrying out multiple rounds of restart tests to the memory on the target machine based on the memory test strategy comprises: 响应于任一一轮重启测试完成,通过修改BIOS中内存插槽的映射关系的方式更换对应内存的插法。In response to the completion of any one round of restart tests, the insertion method of the corresponding memory is replaced by modifying the mapping relationship of the memory slots in the BIOS. 3.根据权利要求1所述的方法,其特征在于,所述基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:3. The method according to claim 1, wherein said carrying out multiple rounds of restart tests to the memory on the target machine based on the memory test strategy further comprises: 响应于预定轮次的重启测试完成,更换对应内存在所述目标机内存插槽中的位置以进行下一轮重启测试。In response to the completion of a predetermined round of restart tests, the position of the corresponding memory in the memory slot of the target machine is replaced to perform a next round of restart tests. 4.根据权利要求1所述的方法,其特征在于,所述基于所述内存测试策略对所述目标机上的内存进行多轮重启测试还包括:4. The method according to claim 1, wherein said carrying out multiple rounds of restart tests to the memory on the target machine based on the memory test strategy further comprises: 响应于所述对比结果出现异常,确定所述异常的内存和对应的内存插槽的编号并记录当前所述异常内存对应的测试数据;In response to an abnormality in the comparison result, determine the number of the abnormal memory and the corresponding memory slot, and record the test data corresponding to the current abnormal memory; 通过更换对应内存在所述目标机内存插槽中的位置的方式进行下一轮重启测试,并收集所述异常内存所对应的测试数据以及所述异常的内存插槽上的对应新的内存的测试数据;Perform the next round of restart test by replacing the position of the corresponding memory in the memory slot of the target machine, and collect the test data corresponding to the abnormal memory and the corresponding new memory on the abnormal memory slot. Test Data; 将所述异常的内存对应的测试数据与下一轮重启测试收集到的所述内存对应的测试数据以及所述异常的内存插槽上对应新的内存的测试数据进行对比;Comparing the test data corresponding to the abnormal memory with the test data corresponding to the memory collected in the next round of restart test and the test data corresponding to the new memory on the abnormal memory slot; 基于三种所述测试数据的对比确定后续重启测试的内存切换方式。Based on the comparison of the three test data, the memory switching mode for the subsequent restart test is determined. 5.根据权利要求4所述的方法,其特征在于,所述基于三种所述测试数据的对比确定后续重启测试的内存切换方式包括:5. The method according to claim 4, wherein the memory switching method for determining subsequent restart tests based on the comparison of three kinds of test data comprises: 响应于所述异常内存当前对应的测试数据和下一轮重启测试收集到所述异常内存的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据相同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the fact that the current test data corresponding to the abnormal memory is different from the test data of the abnormal memory collected in the next round of restart testing, and the test data of the abnormal memory collected in the next round of restart testing is the same as the predetermined data , then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory, and the test data of different memories are saved by the number of the memory. 6.根据权利要求4所述的方法,其特征在于,所述基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:6. The method according to claim 4, wherein the memory switching method for determining subsequent restart tests based on the comparison of three kinds of test data also includes: 响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据不同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用更换对应内存在所述目标机内存插槽中的位置的方式进行后续重启测试。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is different from the test data corresponding to the new memory in the current round of testing, and the next The test data of the abnormal memory collected by the round restart test is different from the predetermined data, and the subsequent restart test is performed by replacing the position of the corresponding memory in the memory slot of the target machine for the subsequent restart test. 7.根据权利要求4所述的方法,其特征在于,所述基于三种所述测试数据的对比确定后续重启测试的内存切换方式还包括:7. The method according to claim 4, wherein the memory switching method for determining subsequent restart tests based on the comparison of three kinds of test data also includes: 响应于所述异常的内存所对应的内存插槽在所述下一轮测试中对应新的内存的测试数据与所述新的内存在当前轮次测试对应的测试数据相同,并且所述下一轮重启测试收集到的所述异常内存的测试数据与预定数据不同,则对后续重启测试采用修改BIOS中内存插槽的映射关系的方式更换对应内存的插法并以内存的编号对不同内存的测试数据进行保存。In response to the memory slot corresponding to the abnormal memory, the test data corresponding to the new memory in the next round of testing is the same as the test data corresponding to the new memory in the current round of testing, and the next If the test data of the abnormal memory collected by the round restart test is different from the predetermined data, then for the subsequent restart test, the method of modifying the mapping relationship of the memory slots in the BIOS is used to replace the insertion method of the corresponding memory and use the number of the memory to compare the memory slots of the different memory. Save the test data. 8.一种内存测试系统,其特征在于,包括:8. A memory testing system, characterized in that, comprising: 测试配置模块,所述测试配置模块配置用于基于对应内存测试策略配置目标机,并基于所述内存测试策略对所述目标机上的内存进行多轮重启测试;A test configuration module, the test configuration module is configured to configure the target machine based on the corresponding memory test strategy, and perform multiple rounds of restart tests on the memory on the target machine based on the memory test strategy; 测试模块,所述测试模块配置用于实时获取目标机在每一轮重启测试时输出的测试数据,并将所述测试数据与预定数据和/或其他轮次的测试数据进行对比,并根据对比结果确定内存测试的结果。A test module, the test module is configured to obtain the test data output by the target machine in each round of restart test in real time, and compare the test data with predetermined data and/or other rounds of test data, and according to the comparison Result determines the result of the memory test. 9.一种计算机设备,其特征在于,包括:9. A computer device, comprising: 至少一个处理器;以及at least one processor; and 存储器,所述存储器存储有可在所述处理器上运行的计算机指令,所述指令由所述处理器执行时实现权利要求1-7任意一项所述方法的步骤。A memory, the memory stores computer instructions operable on the processor, and the steps of the method according to any one of claims 1-7 are implemented when the instructions are executed by the processor. 10.一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现权利要求1-7任意一项所述方法的步骤。10. A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the method according to any one of claims 1-7 are realized.
CN202211258987.8A 2022-10-14 2022-10-14 A memory testing method, system, device and readable storage medium Pending CN115904832A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211258987.8A CN115904832A (en) 2022-10-14 2022-10-14 A memory testing method, system, device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211258987.8A CN115904832A (en) 2022-10-14 2022-10-14 A memory testing method, system, device and readable storage medium

Publications (1)

Publication Number Publication Date
CN115904832A true CN115904832A (en) 2023-04-04

Family

ID=86490418

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211258987.8A Pending CN115904832A (en) 2022-10-14 2022-10-14 A memory testing method, system, device and readable storage medium

Country Status (1)

Country Link
CN (1) CN115904832A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118173157A (en) * 2024-03-14 2024-06-11 皇虎测试科技(深圳)有限公司 Memory comprehensive testing method and device
CN118860671A (en) * 2024-09-25 2024-10-29 苏州元脑智能科技有限公司 Memory management method, device, computer equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106227616A (en) * 2016-08-10 2016-12-14 浪潮电子信息产业股份有限公司 A Method of Realizing RMT Automatic Test in Batches
CN108446195A (en) * 2018-05-30 2018-08-24 郑州云海信息技术有限公司 A kind of server memory performance test methods and device
CN111858198A (en) * 2020-06-19 2020-10-30 苏州浪潮智能科技有限公司 A multi-scheme memory plugging test method, system, terminal and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106227616A (en) * 2016-08-10 2016-12-14 浪潮电子信息产业股份有限公司 A Method of Realizing RMT Automatic Test in Batches
CN108446195A (en) * 2018-05-30 2018-08-24 郑州云海信息技术有限公司 A kind of server memory performance test methods and device
CN111858198A (en) * 2020-06-19 2020-10-30 苏州浪潮智能科技有限公司 A multi-scheme memory plugging test method, system, terminal and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118173157A (en) * 2024-03-14 2024-06-11 皇虎测试科技(深圳)有限公司 Memory comprehensive testing method and device
CN118860671A (en) * 2024-09-25 2024-10-29 苏州元脑智能科技有限公司 Memory management method, device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN113176963B (en) PCIe fault self-repairing method, device, equipment and readable storage medium
US8549522B1 (en) Automated testing environment framework for testing data storage systems
CN111274077A (en) Disk array reliability testing method, system, terminal and storage medium
CN115904832A (en) A memory testing method, system, device and readable storage medium
CN107391333B (en) A kind of OSD disk failure testing method and system
CN110557299A (en) Method, system, terminal and storage medium for batch testing of network transmission function
CN119473744B (en) Link test method, electronic device, storage medium, product and computing device
CN115083510B (en) Solid state disk testing method and device, storage medium, power supply and electronic equipment
CN114203253B (en) Chip memory fault repair device and chip
CN102902613B (en) Computer system and diagnosis method thereof
CN114265733A (en) Automatic testing method and device for abnormal power failure of solid state disk
CN118675603B (en) Flash memory controller prototype firmware, prototype verification method, system and hardware platform
CN105389200A (en) Multi-group stimulation batch simulation method and system applied to super-large scale chip verification
JP2019175424A (en) Method and system for checking error of cable
TWI802951B (en) Method, computer system and computer program product for storing state data of finite state machine
WO2024124862A1 (en) Server-based memory processing method and apparatus, processor and an electronic device
CN116185689A (en) Configuration method, verification method, equipment and medium for error information injection capability
CN114816939A (en) Memory communication method, system, device and medium
CN120803825A (en) Memory slot test method and server
US7475164B2 (en) Apparatus, system, and method for automated device configuration and testing
CN119046051A (en) Fault processing method and product of computer system
CN118612114A (en) A port interconnection configuration method, device, equipment and medium
CN118277154A (en) Hard disk failure recovery method and computing device
CN118113536A (en) Command exception testing method and device, computer equipment and storage medium
CN116893938A (en) Server PCIe slot stress testing methods, devices, equipment and media

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Country or region after: China

Address after: 215000 Building 9, No.1 guanpu Road, Guoxiang street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province

Applicant after: Suzhou Yuannao Intelligent Technology Co.,Ltd.

Address before: 215000 Building 9, No.1 guanpu Road, Guoxiang street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province

Applicant before: SUZHOU LANGCHAO INTELLIGENT TECHNOLOGY Co.,Ltd.

Country or region before: China