[go: up one dir, main page]

CN106873906A - Method and apparatus for managing metamessage - Google Patents

Method and apparatus for managing metamessage Download PDF

Info

Publication number
CN106873906A
CN106873906A CN201710005568.6A CN201710005568A CN106873906A CN 106873906 A CN106873906 A CN 106873906A CN 201710005568 A CN201710005568 A CN 201710005568A CN 106873906 A CN106873906 A CN 106873906A
Authority
CN
China
Prior art keywords
information
meta
data block
disk
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710005568.6A
Other languages
Chinese (zh)
Inventor
付万宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710005568.6A priority Critical patent/CN106873906A/en
Publication of CN106873906A publication Critical patent/CN106873906A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • G06F3/064Management of blocks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/062Securing storage systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0674Disk device
    • G06F3/0676Magnetic disk device

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

It is an object of the invention to provide a kind of method and apparatus for managing metamessage.The method according to the invention is comprised the following steps:Its corresponding metamessage is stored or updated in each data block of goal systems.Compared with prior art, the present invention has advantages below:It is stored in data block by by metamessage, can avoids causing the loss of mass data due to file corruption, improves the security of data storage;Also, scheme of the invention can create multiple threads and load metamessage in a parallel fashion, so as to improve system toggle speed.

Description

用于管理元信息的方法和装置Method and device for managing meta information

技术领域technical field

本发明涉及计算机技术领域,尤其涉及一种用于管理元信息的方法和装置。The present invention relates to the field of computer technology, in particular to a method and device for managing meta information.

背景技术Background technique

基于现有技术的数据存储系统一般将元信息持久地存储到磁盘等介质中。该方式存在以下缺陷:1)系统启动时需要较长的时间来加载和恢复元信息。元信息文件的读取以及信息反序列化到指定的内存数据结构会消耗相当长的时间,从而影响系统的启动速度;2)元信息以单独的文件组织在磁盘上,单独的文件容易因诸如磁盘坏块等问题而损坏,而一旦文件损坏,会导致整个节点上的大量数据的丢失。Data storage systems based on the prior art generally store metadata persistently in media such as disks. This method has the following defects: 1) It takes a long time to load and restore the meta information when the system starts. The reading of meta-information files and the deserialization of information into the specified memory data structure will consume quite a long time, thus affecting the startup speed of the system; Disk bad blocks and other problems are damaged, and once the file is damaged, it will cause a large amount of data loss on the entire node.

发明内容Contents of the invention

本发明的目的是提供一种用于管理元信息的方法和装置。An object of the present invention is to provide a method and apparatus for managing meta information.

根据本发明的一个方面,提供了一种用于管理元信息的方法,其中,所述方法包括以下步骤:According to one aspect of the present invention, a method for managing meta information is provided, wherein the method includes the following steps:

-在目标系统的各个数据块中存储或更新其对应的元信息。-Store or update its corresponding meta-information in each data block of the target system.

根据本发明的一个方面,提供了一种用于管理元信息的信息管理装置,其中,所述信息管理装置包括:According to one aspect of the present invention, an information management device for managing meta information is provided, wherein the information management device includes:

用于在目标系统的各个数据块中存储或更新其对应的元信息的装置。Means for storing or updating the corresponding meta-information in each data block of the target system.

与现有技术相比,本发明具有以下优点:通过将元信息存储于数据块中,可避免由于文件损坏而导致大量数据的丢失,提升了数据存储的安全性;并且,根据本发明的方案可创建多个线程以并行的方式加载元信息,从而提升了系统启动速度。Compared with the prior art, the present invention has the following advantages: by storing the meta information in the data block, the loss of a large amount of data due to file damage can be avoided, and the security of data storage is improved; and, according to the scheme of the present invention Multiple threads can be created to load meta information in parallel, thus improving the system startup speed.

附图说明Description of drawings

通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本发明的其它特征、目的和优点将会变得更明显:Other characteristics, objects and advantages of the present invention will become more apparent by reading the detailed description of non-limiting embodiments made with reference to the following drawings:

图1示意出了根据本发明的一种用于管理元信息的方法流程图;FIG. 1 schematically shows a flow chart of a method for managing meta-information according to the present invention;

图2示意出了根据本发明的一种用于管理元信息的信息管理装置的结构示意图。Fig. 2 shows a schematic structural diagram of an information management device for managing meta-information according to the present invention.

附图中相同或相似的附图标记代表相同或相似的部件。The same or similar reference numerals in the drawings represent the same or similar components.

具体实施方式detailed description

下面结合附图对本发明作进一步详细描述。The present invention will be described in further detail below in conjunction with the accompanying drawings.

图1示意出了根据本发明的一种用于管理元信息的方法流程图。根据本发明的方法包括步骤S1。Fig. 1 schematically shows a flowchart of a method for managing meta-information according to the present invention. The method according to the invention comprises a step S1.

其中,根据本发明的方法通过包含于计算机设备中的信息管理装置来实现。所述计算机设备包括一种能够按照事先设定或存储的指令,自动进行数值计算和/或信息处理的电子设备,其硬件包括但不限于微处理器、专用集成电路(ASIC)、可编程门阵列(FPGA)、数字处理器(DSP)、嵌入式设备等。所述计算机设备包括网络设备和/或用户设备。其中,所述网络设备包括但不限于单个网络服务器、多个网络服务器组成的服务器组或基于云计算(Cloud Computing)的由大量主机或网络服务器构成的云,其中,云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。所述用户设备包括但不限于任何一种可与用户通过键盘、鼠标、遥控器、触摸板、或声控设备等方式进行人机交互的电子产品,例如,个人计算机、平板电脑、智能手机、PDA、游戏机、或IPTV等。其中,所述用户设备及网络设备所处的网络包括但不限于互联网、广域网、城域网、局域网、VPN网络等。Wherein, the method according to the present invention is implemented by an information management device included in the computer equipment. The computer equipment includes an electronic equipment that can automatically perform numerical calculation and/or information processing according to pre-set or stored instructions, and its hardware includes but not limited to microprocessors, application-specific integrated circuits (ASICs), programmable gates Arrays (FPGA), digital processors (DSP), embedded devices, etc. The computer equipment includes network equipment and/or user equipment. Wherein, the network device includes but is not limited to a single network server, a server group composed of multiple network servers, or a cloud based on cloud computing (Cloud Computing) composed of a large number of hosts or network servers, wherein cloud computing is distributed computing A super virtual computer consisting of a group of loosely coupled computer sets. The user equipment includes but is not limited to any electronic product that can interact with the user through a keyboard, mouse, remote control, touch pad, or voice-activated device, such as a personal computer, tablet computer, smart phone, PDA , game console, or IPTV, etc. Wherein, the network where the user equipment and the network equipment are located includes but is not limited to the Internet, a wide area network, a metropolitan area network, a local area network, a VPN network, and the like.

优选地,根据本发明的信息管理装置所在的设备包含于目标系统中。其中,所述目标系统包括各种数据存储系统。Preferably, the device where the information management device according to the present invention is located is included in the target system. Wherein, the target system includes various data storage systems.

需要说明的是,所述用户设备、网络设备以及网络仅为举例,其他现有的或今后可能出现的用户设备、网络设备以及网络如可适用于本发明,也应包含在本发明保护范围以内,并以引用方式包含于此。It should be noted that the user equipment, network equipment, and network described above are only examples, and other existing or future user equipment, network equipment, and networks that are applicable to the present invention should also be included within the protection scope of the present invention , and is incorporated herein by reference.

参照图1,在步骤S1中,信息管理装置在目标系统的各个数据块中存储或更新其对应的元信息。Referring to FIG. 1 , in step S1 , the information management device stores or updates the corresponding meta information in each data block of the target system.

优选地,信息管理装置存储元信息的方式包括但不限于以下至少任一种:Preferably, the way the information management device stores the meta information includes but not limited to at least any of the following:

1)将元信息存储于底层文件系统;优选地,这类元信息包括数据块的长度、所在设备的标识信息等;1) storing the meta information in the underlying file system; preferably, such meta information includes the length of the data block, identification information of the device where it is located, etc.;

2)将元信息存储于数据块名称中;优选地,这类元信息包括数据块的标识信息、数据块的压缩方式信息等等;2) storing the meta information in the data block name; preferably, this type of meta information includes identification information of the data block, compression method information of the data block, etc.;

3)将元信息存储于数据块的数据头部;优选地,这类元信息包括不需要在系统启动时立即构建的元信息,主要包括文件的压缩块索引信息等等。3) Store the meta information in the data header of the data block; preferably, this type of meta information includes meta information that does not need to be constructed immediately when the system is started, mainly including file compression block index information and the like.

根据本发明的一个优选实施例,信息管理装置基于待存储数据的业务类型信息,将与该待存储数据对应的数据块及其元信息存储至特定的存储介质中,以分别对各个存储介质中的元信息进行管理。According to a preferred embodiment of the present invention, based on the service type information of the data to be stored, the information management device stores the data block corresponding to the data to be stored and its meta information in a specific storage medium, so as to separately store the data in each storage medium meta information for management.

其中,所述存储介质包括各种类型的媒体介质,例如,串行高级技术附件(SerialAdvanced Technology Attachment,SATA)、固态硬盘(Solid State Disk,SSD)、内存文件、归档盘和云盘等等。Wherein, the storage medium includes various types of media media, for example, Serial Advanced Technology Attachment (Serial Advanced Technology Attachment, SATA), Solid State Disk (Solid State Disk, SSD), memory file, archive disk, cloud disk and so on.

优选地,信息管理装置分别对各个存储介质中的数据块的元信息进行管理。Preferably, the information management device manages the meta-information of the data blocks in each storage medium respectively.

更优选地,对于同一存储介质,信息管理装置可分别对存储于不同设备的数据块的元信息进行管理。More preferably, for the same storage medium, the information management apparatus can respectively manage the metadata of the data blocks stored in different devices.

根据本优选实施例的方法,可基于数据的业务类型选择不同的存储介质进行存储,对不同存储介质中的元信息进行管理,从而减小不同业务之间的竞争关系,避免低优先级业务对高优先级业务带来性能上的影响;并且可对不同设备的元信息进行独立的管理,从而减小跨设备之间的竞争关系。According to the method of this preferred embodiment, different storage media can be selected for storage based on the service type of the data, and the meta information in different storage media can be managed, thereby reducing the competition between different services and avoiding low-priority services. High-priority services have an impact on performance; and the meta-information of different devices can be managed independently, thereby reducing the competition between devices.

优选地,根据本发明的方法包括步骤S2(图未示)。Preferably, the method according to the present invention includes step S2 (not shown in the figure).

在步骤S2中,在需要加载系统的元信息时,信息管理装置创建多个线程来分别加载多个磁盘中的数据块的元数据信息。In step S2, when the metadata information of the system needs to be loaded, the information management device creates multiple threads to respectively load the metadata information of the data blocks in the multiple disks.

根据本发明的一个优选实施方案,在目标系统的启动阶段,所述方法包括步骤S3(图未示),所述步骤S2包括步骤S201(图未示)。According to a preferred embodiment of the present invention, in the startup phase of the target system, the method includes step S3 (not shown in the figure), and the step S2 includes step S201 (not shown in the figure).

在步骤S3中,信息管理装置根据目标系统的黑名单信息,确定需要过滤掉的磁盘信息,从而得到可使用的一个或多个磁盘信息。In step S3, the information management device determines the disk information to be filtered out according to the blacklist information of the target system, so as to obtain one or more available disk information.

优选地,所述需要过滤掉的磁盘信息包括坏掉的磁盘,或者因其它原因而无法被使用的存储介质。Preferably, the disk information to be filtered out includes broken disks, or storage media that cannot be used due to other reasons.

在步骤S201中,信息管理装置基于所述可使用的一个或多个磁盘信息,创建多个线程来分别加载各个磁盘中的数据块的元数据信息。In step S201, the information management device creates multiple threads to respectively load metadata information of data blocks in each disk based on the available information of one or more disks.

优选地,根据本优选实施方案的方法包括步骤S4(图未示)。Preferably, the method according to this preferred embodiment includes step S4 (not shown).

在步骤S4中,信息管理装置加载每个磁盘的低压缩率的数据块信息,从而在目标系统进行透明压缩时跳过各个低压缩率的数据块。In step S4, the information management device loads the data block information with low compression ratio of each disk, so that each data block with low compression ratio is skipped when the target system performs transparent compression.

其中,所述低压缩率的数据块信息用于在透明压缩时直接跳过。其中,透明压缩是一种后台进行的定期把非压缩类型的数据压缩为指定类型数据的一种定时工作任务,以一种后台进行的定期把非压缩类型的数据压缩为指定类型数据的一种定时工作任务。Wherein, the data block information of the low compression rate is directly skipped during transparent compression. Among them, transparent compression is a scheduled task that periodically compresses non-compressed data into specified types of data in the background, and periodically compresses non-compressed data into specified types of data in the background. Timed work tasks.

优选地,根据本优选实施方案的方法包括步骤S5(图未示)。Preferably, the method according to this preferred embodiment includes step S5 (not shown).

在步骤S5中,信息管理装置根据已扫描的各个磁盘的使用信息和状态信息,对各个磁盘加载的数据块进行去重处理。In step S5, the information management device deduplicates the data blocks loaded on each disk according to the scanned usage information and status information of each disk.

根据本发明的方法,通过将元信息存储于数据块中,可避免由于文件损坏而导致大量数据的丢失,提升了数据存储的安全性;并且,根据本发明的方案可创建多个线程以并行的方式加载元信息,从而提升了系统启动速度。According to the method of the present invention, by storing the meta information in the data block, the loss of a large amount of data due to file damage can be avoided, and the security of data storage is improved; moreover, according to the scheme of the present invention, multiple threads can be created to parallelize The way to load meta information, thus improving the system startup speed.

图2示意出了根据本发明的一种用于管理元信息的信息管理装置的结构示意图。所述信息管理装置包括用于在目标系统的各个数据块中存储或更新其对应的元信息的装置(以下简称“存储装置1”)。Fig. 2 shows a schematic structural diagram of an information management device for managing meta-information according to the present invention. The information management device includes a device for storing or updating corresponding meta information in each data block of the target system (hereinafter referred to as "storage device 1").

存储装置1在目标系统的各个数据块中存储或更新其对应的元信息。The storage device 1 stores or updates corresponding meta information in each data block of the target system.

优选地,存储装置1存储元信息的方式包括但不限于以下至少任一种:Preferably, the storage device 1 stores meta information in a manner including but not limited to at least any of the following:

1)将元信息存储于底层文件系统;优选地,这类元信息包括数据块的长度、所在设备的标识信息等;1) storing the meta information in the underlying file system; preferably, such meta information includes the length of the data block, identification information of the device where it is located, etc.;

2)将元信息存储于数据块名称中;优选地,这类元信息包括数据块的标识信息、数据块的压缩方式信息等等;2) storing the meta information in the data block name; preferably, this type of meta information includes identification information of the data block, compression method information of the data block, etc.;

3)将元信息存储于数据块的数据头部;优选地,这类元信息包括不需要在系统启动时立即构建的元信息,主要包括文件的压缩块索引信息等等。3) Store the meta information in the data header of the data block; preferably, this type of meta information includes meta information that does not need to be constructed immediately when the system is started, mainly including file compression block index information and the like.

根据本发明的一个优选实施例,存储装置1基于待存储数据的业务类型信息,将与该待存储数据对应的数据块及其元信息存储至特定的存储介质中,以分别对各个存储介质中的元信息进行管理。According to a preferred embodiment of the present invention, based on the service type information of the data to be stored, the storage device 1 stores the data block corresponding to the data to be stored and its meta-information in a specific storage medium, so as to separately store data in each storage medium meta information for management.

其中,所述存储介质包括各种类型的媒体介质,例如,串行高级技术附件(SerialAdvanced Technology Attachment,SATA)、固态硬盘(Solid State Disk,SSD)、内存文件、归档盘和云盘等等。Wherein, the storage medium includes various types of media media, for example, Serial Advanced Technology Attachment (Serial Advanced Technology Attachment, SATA), Solid State Disk (Solid State Disk, SSD), memory file, archive disk, cloud disk and so on.

优选地,信息管理装置分别对各个存储介质中的数据块的元信息进行管理。Preferably, the information management device manages the meta-information of the data blocks in each storage medium respectively.

更优选地,对于同一存储介质,信息管理装置可分别对存储于不同设备的数据块的元信息进行管理。More preferably, for the same storage medium, the information management apparatus can respectively manage the metadata of the data blocks stored in different devices.

根据本优选实施例的方案,可基于数据的业务类型选择不同的存储介质进行存储,对不同存储介质中的元信息进行管理,从而减小不同业务之间的竞争关系,避免低优先级业务对高优先级业务带来性能上的影响;并且可对不同设备的元信息进行独立的管理,从而减小跨设备之间的竞争关系。According to the scheme of this preferred embodiment, different storage media can be selected for storage based on the service type of data, and the meta information in different storage media can be managed, thereby reducing the competition between different services and avoiding low priority services High-priority services have an impact on performance; and the meta-information of different devices can be managed independently, thereby reducing the competition between devices.

优选地,根据本发明的信息管理装置包括用于在需要加载系统的元信息时,创建多个线程来分别加载多个磁盘中的数据块的元数据信息(图未示,以下简称“加载装置”)。Preferably, the information management device according to the present invention includes a plurality of threads for respectively loading metadata information of data blocks in multiple disks when the metadata information of the system needs to be loaded (not shown in the figure, hereinafter referred to as "loading device") ").

在需要加载系统的元信息时,加载装置创建多个线程来分别加载多个磁盘中的数据块的元数据信息。When the metadata information of the system needs to be loaded, the loading device creates multiple threads to respectively load the metadata information of the data blocks in the multiple disks.

根据本发明的一个优选实施方案,在目标系统的启动阶段,所述信息管理装置包括用于根据目标系统的黑名单信息,确定需要过滤掉的磁盘信息,从而得到可使用的一个或多个磁盘信息的装置(图未示,以下简称“确定装置”)。According to a preferred embodiment of the present invention, at the start-up stage of the target system, the information management device includes a device for determining the disk information that needs to be filtered out according to the blacklist information of the target system, so as to obtain one or more usable disks Information device (not shown in the figure, hereinafter referred to as "determining device").

确定装置根据目标系统的黑名单信息,确定需要过滤掉的磁盘信息,从而得到可使用的一个或多个磁盘信息。The determining means determines the disk information to be filtered out according to the blacklist information of the target system, so as to obtain one or more available disk information.

优选地,所述需要过滤掉的磁盘信息包括坏掉的磁盘,或者因其它原因而无法被使用的存储介质。Preferably, the disk information to be filtered out includes broken disks, or storage media that cannot be used due to other reasons.

加载装置基于所述可使用的一个或多个磁盘信息,创建多个线程来分别加载各个磁盘中的数据块的元数据信息。The loading device creates multiple threads to respectively load the metadata information of the data blocks in each disk based on the available information of one or more disks.

优选地,根据本优选实施方案的信息管理装置包括用于加载每个磁盘的低压缩率的数据块信息,从而在目标系统进行透明压缩时跳过各个低压缩率的数据块的装置(图未示,以下简称“数据加载装置”)。Preferably, the information management device according to this preferred embodiment includes a device for loading data block information with a low compression ratio of each disk, so that when the target system performs transparent compression, each data block with a low compression ratio is skipped (not shown in the figure) hereinafter referred to as "data loading device").

数据加载装置加载每个磁盘的低压缩率的数据块信息,从而在目标系统进行透明压缩时跳过各个低压缩率的数据块。The data loading device loads the data block information with low compression ratio of each disk, so as to skip each data block with low compression ratio when the target system performs transparent compression.

其中,所述低压缩率的数据块信息用于在透明压缩时直接跳过。其中,透明压缩是一种后台进行的定期把非压缩类型的数据压缩为指定类型数据的一种定时工作任务,以一种后台进行的定期把非压缩类型的数据压缩为指定类型数据的一种定时工作任务。Wherein, the data block information of the low compression rate is directly skipped during transparent compression. Among them, transparent compression is a scheduled task that periodically compresses non-compressed data into specified types of data in the background, and periodically compresses non-compressed data into specified types of data in the background. Timed work tasks.

优选地,根据本优选实施方案的信息管理装置包括用于根据已扫描的各个磁盘的使用信息和状态信息,对各个磁盘加载的数据块进行去重处理的装置(图未示,以下简称“去重处理装置”)。Preferably, the information management device according to this preferred embodiment includes a device for performing deduplication processing on the data blocks loaded on each disk according to the scanned usage information and status information of each disk (not shown in the figure, hereinafter referred to as "deduplication") reprocessing device").

去重处理装置根据已扫描的各个磁盘的使用信息和状态信息,对各个磁盘加载的数据块进行去重处理。The deduplication processing device performs deduplication processing on the data blocks loaded on each disk according to the scanned usage information and status information of each disk.

根据本发明的方案,通过将元信息存储于数据块中,可避免由于文件损坏而导致大量数据的丢失,提升了数据存储的安全性;并且,根据本发明的方案可创建多个线程以并行的方式加载元信息,从而提升了系统启动速度。According to the scheme of the present invention, by storing the meta information in the data block, the loss of a large amount of data due to file damage can be avoided, and the security of data storage is improved; moreover, according to the scheme of the present invention, multiple threads can be created to parallelize The way to load meta information, thus improving the system startup speed.

本发明的软件程序可以通过处理器执行以实现上文所述步骤或功能。同样地,本发明的软件程序(包括相关的数据结构)可以被存储到计算机可读记录介质中,例如,RAM存储器,磁或光驱动器或软磁盘及类似设备。另外,本发明的一些步骤或功能可采用硬件来实现,例如,作为与处理器配合从而执行各个功能或步骤的电路。The software program of the present invention can be executed by a processor to realize the steps or functions described above. Likewise, the software program (including associated data structures) of the present invention can be stored in a computer-readable recording medium such as RAM memory, magnetic or optical drive or floppy disk and the like. In addition, some steps or functions of the present invention may be implemented by hardware, for example, as a circuit that cooperates with a processor to execute each function or step.

另外,本发明的一部分可被应用为计算机程序产品,例如计算机程序指令,当其被计算机执行时,通过该计算机的操作,可以调用或提供根据本发明的方法和/或技术方案。而调用本发明的方法的程序指令,可能被存储在固定的或可移动的记录介质中,和/或通过广播或其他信号承载媒体中的数据流而被传输,和/或被存储在根据所述程序指令运行的计算机设备的工作存储器中。在此,根据本发明的一个实施例包括一个装置,该装置包括用于存储计算机程序指令的存储器和用于执行程序指令的处理器,其中,当该计算机程序指令被该处理器执行时,触发该装置运行基于前述根据本发明的多个实施例的方法和/或技术方案。In addition, a part of the present invention can be applied as a computer program product, such as a computer program instruction. When it is executed by a computer, the method and/or technical solution according to the present invention can be invoked or provided through the operation of the computer. The program instructions for invoking the method of the present invention may be stored in a fixed or removable recording medium, and/or transmitted through broadcasting or data streams in other signal-carrying media, and/or stored in the in the working memory of the computer device on which the program instructions execute. Here, an embodiment according to the present invention comprises an apparatus comprising a memory for storing computer program instructions and a processor for executing the program instructions, wherein when the computer program instructions are executed by the processor, a trigger The operation of the device is based on the foregoing methods and/or technical solutions according to multiple embodiments of the present invention.

对于本领域技术人员而言,显然本发明不限于上述示范性实施例的细节,而且在不背离本发明的精神或基本特征的情况下,能够以其他的具体形式实现本发明。因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本发明的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本发明内。不应将权利要求中的任何附图标记视为限制所涉及的权利要求。此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。系统权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。第一,第二等词语用来表示名称,而并不表示任何特定的顺序。It will be apparent to those skilled in the art that the invention is not limited to the details of the above-described exemplary embodiments, but that the invention can be embodied in other specific forms without departing from the spirit or essential characteristics of the invention. Accordingly, the embodiments should be regarded in all points of view as exemplary and not restrictive, the scope of the invention being defined by the appended claims rather than the foregoing description, and it is therefore intended that the scope of the invention be defined by the appended claims rather than by the foregoing description. All changes within the meaning and range of equivalents of the elements are embraced in the present invention. Any reference sign in a claim should not be construed as limiting the claim concerned. In addition, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or devices stated in the system claims may also be realized by one unit or device through software or hardware. The words first, second, etc. are used to denote names without implying any particular order.

Claims (14)

1.一种用于管理元信息的方法,其中,所述方法包括以下步骤:1. A method for managing meta-information, wherein the method comprises the following steps: -在目标系统的各个数据块中存储或更新其对应的元信息。-Store or update its corresponding meta-information in each data block of the target system. 2.根据权利要求1所述的方法,其中,所述在目标系统的各个数据块中存储或更新其对应的元信息的步骤按照以下至少任一种方式存储元信息:2. The method according to claim 1, wherein the step of storing or updating the corresponding meta-information in each data block of the target system stores the meta-information in at least one of the following ways: -将元信息存储于底层文件系统;- Store meta information in the underlying file system; -将元信息存储于数据块名称中;- store meta information in the data block name; -将元信息存储于数据块的数据头部。- Store meta information in the data header of the data block. 3.根据权利要求1所述的方法,其中,所述步骤a包括以下步骤:3. The method according to claim 1, wherein said step a comprises the steps of: -基于待存储数据的业务类型信息,将与该待存储数据对应的数据块及其元信息存储至特定的存储介质中,以分别对各个存储介质中的元信息进行管理。- Based on the service type information of the data to be stored, storing the data block and its meta information corresponding to the data to be stored in a specific storage medium, so as to manage the meta information in each storage medium respectively. 4.根据权利要求1所述的方法,其中,所述方法包括以下步骤:4. The method according to claim 1, wherein said method comprises the steps of: -在需要加载系统的元信息时,创建多个线程来分别加载多个磁盘中的数据块的元数据信息。- When the metadata information of the system needs to be loaded, multiple threads are created to respectively load the metadata information of the data blocks in multiple disks. 5.根据权利要求4所述的方法,其中,在目标系统的启动阶段,所述方法包括以下步骤:5. The method according to claim 4, wherein, during the startup phase of the target system, the method comprises the steps of: -根据目标系统的黑名单信息,确定需要过滤掉的磁盘信息,从而得到可使用的一个或多个磁盘信息;-According to the blacklist information of the target system, determine the disk information that needs to be filtered out, so as to obtain one or more disk information that can be used; 其中,所述创建多个线程来分别加载多个磁盘中的数据块的元数据信息的步骤包括以下步骤:Wherein, the step of creating multiple threads to respectively load the metadata information of the data blocks in multiple disks includes the following steps: -基于所述可使用的一个或多个磁盘信息,创建多个线程来分别加载各个磁盘中的数据块的元数据信息。- Based on the available one or more disk information, creating multiple threads to respectively load the metadata information of the data blocks in each disk. 6.根据权利要求5所述的方法,其中,所述方法包括以下步骤:6. The method according to claim 5, wherein said method comprises the steps of: -加载每个磁盘的低压缩率的数据块信息,从而在目标系统进行透明压缩时跳过各个低压缩率的数据块。-Load the low compression ratio data block information for each disk, so that each low compression ratio data block is skipped when the target system performs transparent compression. 7.根据权利要求5所述的方法,其中,所述方法包括以下步骤:7. The method according to claim 5, wherein said method comprises the steps of: -根据已扫描的各个磁盘的使用信息和状态信息,对各个磁盘加载的数据块进行去重处理。-According to the usage information and status information of each disk that has been scanned, the data blocks loaded by each disk are deduplicated. 8.一种用于管理元信息的信息管理装置,其中,所述信息管理装置包括:8. An information management device for managing meta-information, wherein the information management device comprises: 用于在目标系统的各个数据块中存储或更新其对应的元信息的装置。Means for storing or updating the corresponding meta-information in each data block of the target system. 9.根据权利要求8所述的信息管理装置,其中,所述用于在目标系统的各个数据块中存储或更新其对应的元信息的装置按照以下至少任一种方式存储元信息:9. The information management device according to claim 8, wherein the means for storing or updating the corresponding meta information in each data block of the target system stores the meta information in at least one of the following ways: -将元信息存储于底层文件系统;- Store meta information in the underlying file system; -将元信息存储于数据块名称中;- store meta information in the data block name; -将元信息存储于数据块的数据头部。- Store meta information in the data header of the data block. 10.根据权利要求8所述的信息管理装置,其中,所述用于在目标系统的各个数据块中存储或更新其对应的元信息的装置用于:10. The information management device according to claim 8, wherein the means for storing or updating the corresponding meta information in each data block of the target system is used for: -基于待存储数据的业务类型信息,将与该待存储数据对应的数据块及其元信息存储至特定的存储介质中,以分别对各个存储介质中的元信息进行管理的装置。- Based on the service type information of the data to be stored, the data block corresponding to the data to be stored and its meta information are stored in a specific storage medium, so as to manage the meta information in each storage medium respectively. 11.根据权利要求8所述的信息管理装置,其中,所述信息管理装置包括:11. The information management device according to claim 8, wherein the information management device comprises: 用于在需要加载系统的元信息时,创建多个线程来分别加载多个磁盘中的数据块的元数据信息。It is used to create multiple threads to separately load metadata information of data blocks in multiple disks when the metadata information of the system needs to be loaded. 12.根据权利要求11所述的信息管理装置,其中,在目标系统的启动阶段,所述信息管理装置包括:12. The information management device according to claim 11, wherein, at the startup phase of the target system, the information management device comprises: 用于根据目标系统的黑名单信息,确定需要过滤掉的磁盘信息,从而得到可使用的一个或多个磁盘信息的装置;A device for determining the disk information that needs to be filtered out according to the blacklist information of the target system, so as to obtain one or more disk information that can be used; 其中,所述用于在目标系统的各个数据块中存储或更新其对应的元信息的装置用于:Wherein, the device for storing or updating the corresponding meta-information in each data block of the target system is used for: -基于所述可使用的一个或多个磁盘信息,创建多个线程来分别加载各个磁盘中的数据块的元数据信息。- Based on the available one or more disk information, creating multiple threads to respectively load the metadata information of the data blocks in each disk. 13.根据权利要求12所述的信息管理装置,其中,所述信息管理装置包括:13. The information management device according to claim 12, wherein the information management device comprises: 用于加载每个磁盘的低压缩率的数据块信息,从而在目标系统进行透明压缩时跳过各个低压缩率的数据块的装置。A device for loading low-compression data block information of each disk, thereby skipping each low-compression data block when the target system performs transparent compression. 14.根据权利要求12所述的信息管理装置,其中,所述信息管理装置包括:14. The information management device according to claim 12, wherein the information management device comprises: 用于根据已扫描的各个磁盘的使用信息和状态信息,对各个磁盘加载的数据块进行去重处理的装置。A device for performing deduplication processing on data blocks loaded on each disk according to the scanned usage information and status information of each disk.
CN201710005568.6A 2017-01-04 2017-01-04 Method and apparatus for managing metamessage Pending CN106873906A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710005568.6A CN106873906A (en) 2017-01-04 2017-01-04 Method and apparatus for managing metamessage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710005568.6A CN106873906A (en) 2017-01-04 2017-01-04 Method and apparatus for managing metamessage

Publications (1)

Publication Number Publication Date
CN106873906A true CN106873906A (en) 2017-06-20

Family

ID=59165558

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710005568.6A Pending CN106873906A (en) 2017-01-04 2017-01-04 Method and apparatus for managing metamessage

Country Status (1)

Country Link
CN (1) CN106873906A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108829646A (en) * 2018-05-03 2018-11-16 百度在线网络技术(北京)有限公司 Data storage and analytic method, device, system and storage medium
CN109814979A (en) * 2018-12-17 2019-05-28 华为技术有限公司 Data load method, device, computer equipment and readable storage medium storing program for executing
CN112148795A (en) * 2020-09-27 2020-12-29 上海依图网络科技有限公司 Data processing method, device, equipment and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1834956A (en) * 2005-03-18 2006-09-20 联想(北京)有限公司 Processing of multiroute processing element data
CN102521269A (en) * 2011-11-22 2012-06-27 清华大学 Index-based computer continuous data protection method
CN104615546A (en) * 2015-02-13 2015-05-13 小米科技有限责任公司 File data management method and device
CN105144160A (en) * 2013-03-15 2015-12-09 甲骨文国际公司 A method to accelerate queries using dynamically generated alternate data formats in flash cache

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1834956A (en) * 2005-03-18 2006-09-20 联想(北京)有限公司 Processing of multiroute processing element data
CN102521269A (en) * 2011-11-22 2012-06-27 清华大学 Index-based computer continuous data protection method
CN105144160A (en) * 2013-03-15 2015-12-09 甲骨文国际公司 A method to accelerate queries using dynamically generated alternate data formats in flash cache
CN104615546A (en) * 2015-02-13 2015-05-13 小米科技有限责任公司 File data management method and device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108829646A (en) * 2018-05-03 2018-11-16 百度在线网络技术(北京)有限公司 Data storage and analytic method, device, system and storage medium
CN109814979A (en) * 2018-12-17 2019-05-28 华为技术有限公司 Data load method, device, computer equipment and readable storage medium storing program for executing
CN109814979B (en) * 2018-12-17 2021-03-30 华为技术有限公司 Data loading method and device, computer equipment and readable storage medium
CN112148795A (en) * 2020-09-27 2020-12-29 上海依图网络科技有限公司 Data processing method, device, equipment and medium
CN112148795B (en) * 2020-09-27 2021-06-15 上海依图网络科技有限公司 Data processing method, device, equipment and medium

Similar Documents

Publication Publication Date Title
JP5732536B2 (en) System, method and non-transitory computer-readable storage medium for scalable reference management in a deduplication-based storage system
US8799238B2 (en) Data deduplication
US9176805B2 (en) Memory dump optimization in a system
CN102474531B (en) Address server
US20200341956A1 (en) Processing time series metrics data
CN103198088B (en) Log segment directory based on shadow pagination
CN107436725A (en) A kind of data are write, read method, apparatus and distributed objects storage cluster
US11836350B1 (en) Method and system for grouping data slices based on data file quantities for data slice backup generation
JP7729904B2 (en) Logical deletion of data in a sharded database
CN114153395B (en) Object storage data life cycle management method, device and equipment
CN108268344A (en) A kind of data processing method and device
CN112579550B (en) Metadata information synchronization method and system of distributed file system
WO2021139431A1 (en) Data synchronization method and apparatus for microservice, electronic device and storage medium
CN110147203A (en) A file management method, device, electronic device and storage medium
CN106156038B (en) Date storage method and device
CN106873906A (en) Method and apparatus for managing metamessage
US11994956B2 (en) Adaptive throttling of metadata requests
CN114579061B (en) Data storage method, device, equipment and medium
WO2018111696A1 (en) Partial storage of large files in distinct storage systems
CN110022348B (en) System and method for dynamic backup sessions
CN108132759A (en) A kind of method and apparatus that data are managed in file system
US11429311B1 (en) Method and system for managing requests in a distributed system
CN111949442B (en) System and method for scalable backup service
US20150212847A1 (en) Apparatus and method for managing cache of virtual machine image file
CN113835613B (en) A file reading method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170620

RJ01 Rejection of invention patent application after publication