WO2013163832A1 - Cloud storage method and device - Google Patents

Cloud storage method and device

Info

Publication number
WO2013163832A1
WO2013163832A1 PCT/CN2012/075841 CN2012075841W WO2013163832A1 WO 2013163832 A1 WO2013163832 A1 WO 2013163832A1 CN 2012075841 W CN2012075841 W CN 2012075841W WO 2013163832 A1 WO2013163832 A1 WO 2013163832A1
Authority
WO
WIPO (PCT)
Prior art keywords
file
storage
module
hash value
storage path
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2012/075841
Other languages
English (en)
Chinese (zh)
Inventor
王东临
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Sursen Investment Co Ltd
Original Assignee
Tianjin Sursen Investment Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • G06F16/137Hash-based

Definitions

  • The present invention relates to the field of the Internet, and in particular to a cloud storage method and apparatus.
  • Embodiments of the present invention provide a cloud storage method and apparatus to provide efficient storage of massive data.
  • The method includes: calculating the hash value of the file and converting it into a string used as the file name; calculating the storage path of the file from the file's hash value according to a predefined rule; finding the actual storage location of that storage path in the index table; and storing the file in the actual storage location.
  • A mapping table of all possible storage paths to actual storage disk locations is pre-stored in the index table.
  • The apparatus includes a first module configured to calculate a hash value of the file according to a predefined hash algorithm;
  • a second module configured to convert the hash value calculated by the first module into a string used as the file name;
  • a third module configured to calculate a storage path of the file, according to a predefined algorithm, from the hash value calculated by the first module;
  • a fourth module configured to store a mapping table of all possible storage paths to actual storage disk locations;
  • a fifth module configured to look up, in the fourth module, the actual storage location of the storage path calculated by the third module; and
  • a sixth module configured to store the file on the storage disk found by the fifth module.
  • With the cloud storage method and device provided by the embodiments of the present invention, massive data can be stored without duplication, because each file is stored under a file name derived from its hash value. Because the storage path of a file is calculated from its hash value, data is distributed evenly across the storage servers, which keeps the storage of the system balanced. Even if the storage servers of the cloud storage system are expanded without limit, the method can still manage the storage of files efficiently.
  • FIG. 1 is a flowchart of a cloud storage method according to an embodiment of the present invention.
  • FIG. 2 is a schematic structural diagram of a cloud storage device according to an embodiment of the present invention.
  • The invention is described in further detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit it.
  • As shown in FIG. 1, when a file is stored in the cloud, the method includes the following steps:
  • Step 101: Calculate the Hash value of the file, and convert the Hash value into a string used as the file name.
  • Different hash algorithms can be selected according to the configuration of the system.
  • A double hash may also be used to derive the file name: two different hash algorithms each calculate a hash value of the file, and the two values are concatenated to form the hash value of the file.
  • The Hash value of the file is converted into a hexadecimal string used as the file name; for example, the converted file name is H1H2H3...HN.
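Step 101 can be sketched as follows. This is a minimal illustration, assuming Python's standard `hashlib` module, with SHA-1 and SHA-256 standing in for the hash algorithms that the text leaves unspecified; the function name `file_name` is hypothetical.

```python
import hashlib

def file_name(data: bytes, algorithms=("sha1",)) -> str:
    """Hex-encode the hash of `data` under each algorithm and concatenate."""
    return "".join(hashlib.new(name, data).hexdigest() for name in algorithms)

# Single hash: the file name is one hexadecimal digest.
single = file_name(b"example file content")

# Double hash: two different algorithms, digests joined end to end.
double = file_name(b"example file content", algorithms=("sha1", "sha256"))

print(single)
print(double)
```

The double-hash name is simply longer; any later step that only reads the leading characters of the name works unchanged with either form.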
  • Step 102: Calculate the storage path of the file from the file's Hash value according to a predefined rule.
  • The predefined rule may be:
  • The storage path of the file consists of two directory levels; for example, the first two characters of the file name may be used directly as the first-level directory for storing the file, and the third and fourth characters of the file name as the second-level directory.
  • That is, H1H2 is used as the first-level directory and H3H4 as the second-level directory, so a file named H1H2H3...HN has the storage path H1H2/H3H4.
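The two-level rule amounts to slicing the file name. The sketch below assumes a `/` separator and uses a hypothetical function name, `storage_path`; neither is prescribed by the text.

```python
def storage_path(file_name: str) -> str:
    """Return 'H1H2/H3H4' for a file name H1H2H3...HN."""
    if len(file_name) < 4:
        raise ValueError("file name must have at least four characters")
    first_level = file_name[0:2]   # characters 1-2
    second_level = file_name[2:4]  # characters 3-4
    return f"{first_level}/{second_level}"

print(storage_path("ab12cdef0123"))
```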
  • With two base-36 characters per directory level, the two-level directory storage provided by the embodiment of the present invention comprises 36^4 = 1,679,616 directories and can theoretically store and manage more than 10 billion documents.
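The capacity figure can be checked with a little arithmetic. The sketch assumes each directory character is drawn from a 36-symbol alphabet (for instance 0-9 and a-z; the text does not spell the alphabet out) and computes how many files each directory would have to hold to reach 10 billion. The constant names are illustrative only.

```python
ALPHABET_SIZE = 36      # assumed: 36 possible symbols per character
CHARS_PER_LEVEL = 2     # two characters per directory level
LEVELS = 2              # first-level and second-level directory

# Number of distinct second-level directories (storage paths).
path_count = ALPHABET_SIZE ** (CHARS_PER_LEVEL * LEVELS)

# Files per directory needed to reach 10 billion files (rounded up).
target_files = 10_000_000_000
files_per_directory = -(-target_files // path_count)

print(path_count, files_per_directory)
```

A few thousand files per directory is a modest load for common file systems, which is consistent with the claim in the text.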
  • Step 103: Find the actual storage location of the file's storage path in an index table, and store the file in that location.
  • A mapping table of all possible storage paths to actual storage disk locations is pre-stored in the index table.
  • The storage disk location corresponding to each of the 1,679,616 storage paths (second-level directories) is already recorded in the index table, for example, which volume of which storage disk holds a given second-level directory.
  • The index table usually needs only a few megabytes of space and can take the form of an array, so that an entry can be obtained directly by subscript.
  • For example, the index table records that the secondary directory CD under the primary directory AB is stored on the third logical volume of the first disk.
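One way to picture an array-based index table is shown below. It is a sketch under several assumptions: directory names are base-36 strings, each entry is a 16-bit volume identifier, and the identifier `VOLUME_DISK1_LV3` for "third logical volume of the first disk" is invented for the example.

```python
from array import array

BASE = 36
PATH_COUNT = BASE ** 4  # one entry per two-level storage path

def slot(primary: str, secondary: str) -> int:
    """Map a path such as AB/CD to its array subscript."""
    return int(primary + secondary, BASE)

# One unsigned 16-bit volume id per path; 0 means "unassigned" here.
index_table = array("H", [0]) * PATH_COUNT

VOLUME_DISK1_LV3 = 103  # hypothetical id: disk 1, logical volume 3
index_table[slot("AB", "CD")] = VOLUME_DISK1_LV3

size_bytes = index_table.itemsize * len(index_table)
print(slot("AB", "CD"), index_table[slot("AB", "CD")], size_bytes)
```

Because the subscript is computed directly from the path, a lookup is a single array access with no searching.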
  • Step 104: When looking up (retrieving) a file, use the file's Hash value to calculate the file name and storage path of the file to be searched, according to the same predefined rules.
  • For example, the first four characters of the file name are extracted to obtain the primary storage directory and the secondary storage directory of the file.
  • Step 105: Find the physical disk corresponding to the file to be searched in the index table, according to the calculated storage path.
  • Step 106: Find the file on the physical disk corresponding to the storage path, according to the file name of the file to be searched.
  • The actual storage locations of all possible storage paths are recorded in the index table in advance in order to speed up storage and lookup.
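The store and lookup steps can be put together in a small end-to-end sketch. It assumes SHA-256 hexadecimal file names (so there are 16^4 = 65,536 two-level paths), uses temporary directories as stand-ins for storage disks, and assigns paths to disks round-robin; all function names are hypothetical.

```python
import hashlib
import itertools
import tempfile
from pathlib import Path

HEX = "0123456789abcdef"

def build_index(disks):
    """Pre-store every possible two-level path with its disk (round-robin)."""
    table = {}
    for i, (a, b, c, d) in enumerate(itertools.product(HEX, repeat=4)):
        table[f"{a}{b}/{c}{d}"] = disks[i % len(disks)]
    return table

def path_of(name: str) -> str:
    return f"{name[:2]}/{name[2:4]}"

def store(data: bytes, table) -> str:
    name = hashlib.sha256(data).hexdigest()        # Step 101
    path = path_of(name)                           # Step 102
    directory = table[path] / path                 # Step 103: index lookup
    directory.mkdir(parents=True, exist_ok=True)
    (directory / name).write_bytes(data)
    return name

def find(name: str, table):
    path = path_of(name)                           # Step 104
    candidate = table[path] / path / name          # Steps 105 and 106
    return candidate if candidate.exists() else None

with tempfile.TemporaryDirectory() as root:
    disks = [Path(root, "disk0"), Path(root, "disk1")]
    table = build_index(disks)
    name = store(b"hello cloud", table)
    content = find(name, table).read_bytes()
    missing = find("0" * 64, table)

print(content, missing)
```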
  • In actual operation, one storage server usually carries multiple storage disks. When a storage disk is added to an existing storage server, some of the directories on the existing storage disks are copied to the new disk. When a new storage server is added, some of the disks on the original storage server can be moved directly into the new storage server. The original storage server then needs new storage disks of its own, and data must be copied from its remaining storage disks to those new disks.
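The text does not say how directories are chosen when a disk is added. The sketch below is one possible approach, not the patent's method: it moves an equal share of paths from each existing disk to the new one and updates the index table, on the assumption that the table must be updated after the directories are copied. The function name `plan_expansion` and the disk labels are invented.

```python
from collections import Counter

def plan_expansion(index_table: dict, new_disk: str) -> list:
    """Reassign an equal share of paths from each disk to `new_disk`.

    Only the index table is updated here; copying the directory
    contents to the new disk is left out of this sketch.
    """
    disks = sorted(set(index_table.values()))
    share = len(index_table) // (len(disks) + 1)  # target size of new disk
    per_disk = share // len(disks)                # paths taken from each disk
    moved = []
    for disk in disks:
        owned = [path for path, d in index_table.items() if d == disk]
        for path in owned[:per_disk]:
            index_table[path] = new_disk
            moved.append(path)
    return moved

# Twelve hypothetical paths split evenly over two disks.
demo_table = {f"p{i}": ("disk0" if i < 6 else "disk1") for i in range(12)}
moved_paths = plan_expansion(demo_table, "disk2")
print(Counter(demo_table.values()), moved_paths)
```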
  • The storage method provided by the embodiment of the present invention can be used for mass data storage.
  • When a large amount of data is stored in this way, non-duplicated storage is ensured because each file is stored under a file name derived from its hash value.
  • Because the storage path of a file is calculated from its hash value, data is distributed evenly across the storage servers, which keeps the storage of the system balanced. Even if the storage servers of the cloud storage system are expanded without limit, the storage of files can still be managed efficiently in this way.
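Both properties can be observed in a short experiment. The sketch assumes SHA-256 as the hash algorithm and uses synthetic inputs; it shows that identical content yields an identical name, and counts how distinct inputs spread over the 256 possible two-character first-level directories of a hexadecimal name.

```python
import hashlib
from collections import Counter

def file_name(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()

# Identical content always produces the same name, so a second copy
# maps to the same path and does not need to be stored again.
same_name = file_name(b"report v1") == file_name(b"report v1")

# Spread of 25,600 distinct inputs over the first-level directories.
counts = Counter(file_name(str(i).encode())[:2] for i in range(25_600))

print(same_name, len(counts), min(counts.values()), max(counts.values()))
```

With a well-behaved hash function the per-directory counts cluster around the average (100 in this run), which is what keeps the load on the disks behind those directories similar.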
  • The index table can be stored on the web server, so that when a user needs to find a file, the file can be located quickly among a large number of servers.
  • Alternatively, the first character of the hash value is used as the primary directory,
  • the second and third characters are used as the secondary directory, and
  • the fourth character is used as the tertiary directory.
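Both directory rules are instances of cutting the leading characters of the file name into fixed-width pieces. The helper below is a hypothetical generalization, with the widths passed as a tuple.

```python
def path_by_widths(file_name: str, widths) -> str:
    """Split the leading characters of `file_name` into directory levels."""
    if sum(widths) > len(file_name):
        raise ValueError("file name is shorter than the requested levels")
    parts, position = [], 0
    for width in widths:
        parts.append(file_name[position:position + width])
        position += width
    return "/".join(parts)

print(path_by_widths("abcdef", (1, 2, 1)))  # three-level rule
print(path_by_widths("abcdef", (2, 2)))     # two-level rule
```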
  • FIG. 2 is a schematic structural diagram of a cloud storage device according to an embodiment of the present invention. As shown in FIG. 2, the cloud storage device runs on a device having a processor and a storage module.
  • The device includes:
  • a Hash value calculation module configured to calculate the Hash value of the file according to a predefined hash algorithm;
  • a file name calculation module configured to convert the Hash value calculated by the Hash value calculation module into a string used as the file name;
  • a storage path calculation module configured to calculate the storage path of the file, according to a predefined algorithm, from the Hash value calculated by the Hash value calculation module;
  • an index table module configured to store a mapping table of all possible storage paths to actual storage disk locations;
  • an index table search module configured to find, in the index table module, the actual storage location of the storage path calculated by the storage path calculation module; and
  • a storage module configured to call the index table search module and store the file on the storage disk that the index table search module returns.
  • When a file is looked up, the Hash value calculation module calculates the Hash value of the file to be searched,
  • the file name calculation module calculates the file name of the file to be searched,
  • the storage path calculation module calculates the storage path of the file to be searched, and
  • the index table search module finds, in the index table module, the physical storage disk corresponding to the file to be searched.
  • The cloud storage device further includes a search module configured to call the index table search module with the storage path calculated by the storage path calculation module and then, using the file name calculated by the file name calculation module, find the file to be searched on the physical disk corresponding to that storage path.
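The module structure can be mirrored in a single class, one method per module. This is a sketch only: SHA-256 is an assumed hash algorithm, in-memory dictionaries stand in for storage disks, and the class and method names are invented for illustration.

```python
import hashlib
from dataclasses import dataclass, field

@dataclass
class CloudStorageDevice:
    index_table: dict                          # index table module: path -> disk id
    disks: dict = field(default_factory=dict)  # disk id -> {(path, name): data}

    def hash_value(self, data: bytes) -> bytes:      # Hash value calculation module
        return hashlib.sha256(data).digest()

    def file_name(self, digest: bytes) -> str:       # file name calculation module
        return digest.hex()

    def storage_path(self, digest: bytes) -> str:    # storage path calculation module
        text = digest.hex()
        return f"{text[:2]}/{text[2:4]}"

    def lookup(self, path: str) -> str:              # index table search module
        return self.index_table[path]

    def store(self, data: bytes) -> str:             # storage module
        digest = self.hash_value(data)
        name, path = self.file_name(digest), self.storage_path(digest)
        self.disks.setdefault(self.lookup(path), {})[(path, name)] = data
        return name

    def search(self, name: str):                     # search module
        path = f"{name[:2]}/{name[2:4]}"
        return self.disks.get(self.lookup(path), {}).get((path, name))

# Pre-store all 65,536 hexadecimal two-level paths across two disks.
table = {f"{a:02x}/{b:02x}": f"disk{a % 2}" for a in range(256) for b in range(256)}
device = CloudStorageDevice(table)
stored_name = device.store(b"hello")
print(stored_name, device.search(stored_name))
```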

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
PCT/CN2012/075841 2011-10-11 2012-05-22 Cloud storage method and device Ceased WO2013163832A1 (fr)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US13/858,489 US20140181116A1 (en) 2011-10-11 2013-04-08 Method and device of cloud storage
US14/943,909 US20160112413A1 (en) 2011-10-11 2015-11-17 Method for controlling security of cloud storage
US15/055,373 US20160182638A1 (en) 2011-10-11 2016-02-26 Cloud serving system and cloud serving method
US15/594,374 US20170249093A1 (en) 2011-10-11 2017-05-12 Storage method and distributed storage system
US16/378,076 US20190235777A1 (en) 2011-10-11 2019-04-08 Redundant storage system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2012101329267A CN103384256A (zh) 2012-05-02 2012-05-02 Cloud storage method and apparatus
CN201210132926.7 2012-05-12

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/076516 Continuation WO2013170504A1 (fr) 2011-10-11 2012-06-06 Mass data storage system

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US13/271,165 Continuation US9176953B2 (en) 2008-06-04 2011-10-11 Method and system of web-based document service
US13/858,489 Continuation US20140181116A1 (en) 2011-10-11 2013-04-08 Method and device of cloud storage

Publications (1)

Publication Number Publication Date
WO2013163832A1 (fr) 2013-11-07

Family

ID=49491950

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/075841 Ceased WO2013163832A1 (fr) 2011-10-11 2012-05-22 Cloud storage method and device

Country Status (2)

Country Link
CN (1) CN103384256A (fr)
WO (1) WO2013163832A1 (fr)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
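CN104182487A (zh) * 2014-08-11 2014-12-03 浪潮软件股份有限公司 Unified storage method supporting multiple storage modes
CN104202428A (zh) * 2014-09-25 2014-12-10 北京网秦天下科技有限公司 Distributed file storage system and file storage method
CN105812427B (zh) * 2014-12-31 2019-08-13 中国电信股份有限公司 File upload and download method and apparatus, and file server
US9684569B2 (en) * 2015-03-30 2017-06-20 Western Digital Technologies, Inc. Data deduplication using chunk files
CN105279258B (zh) * 2015-10-21 2020-01-14 Tcl集团股份有限公司 Evenly distributed file storage method and system
CN105590064B (zh) * 2015-12-14 2018-10-19 中国建设银行股份有限公司 File processing method, file processing apparatus, and file server
CN106250440B (zh) * 2016-07-26 2019-07-09 金蝶软件(中国)有限公司 Document management method and apparatus
CN106446001B (zh) * 2016-07-29 2019-10-18 北京北信源软件股份有限公司 Method and system for storing files on a computer storage medium
CN106776891A (zh) * 2016-11-30 2017-05-31 山东浪潮商用系统有限公司 File storage method and apparatus
CN106649858A (zh) * 2016-12-30 2017-05-10 天津市测绘院 Method and apparatus for operating on personal geographic information
CN106888119B (zh) * 2017-02-15 2020-10-27 金钱猫科技股份有限公司 Method and system for obtaining terminal attributes
CN107562786A (zh) * 2017-07-27 2018-01-09 平安科技(深圳)有限公司 File storage method, terminal, and computer-readable storage medium
CN107612917B (zh) * 2017-09-28 2020-03-13 山东恒云信息科技有限公司 Method for encrypting log storage with the 3DES encryption algorithm in a cloud computing environment
CN107741968B (zh) * 2017-10-09 2021-06-29 郑州云海信息技术有限公司 File retrieval method, system, apparatus, and computer-readable storage medium
CN110019010B (zh) * 2017-11-14 2023-06-13 阿里巴巴集团控股有限公司 Processing method, apparatus, device, and machine-readable medium
CN109582642A (zh) * 2018-11-08 2019-04-05 网宿科技股份有限公司 File storage method, deletion method, server, and storage medium
CN111198882B (zh) * 2019-12-26 2024-07-23 东软集团股份有限公司 Data processing method and apparatus, storage medium, and electronic device
CN111400251A (zh) * 2020-03-17 2020-07-10 深圳市卡牛科技有限公司 Gateway-based file access method and apparatus, server, and storage medium
CN113971162A (zh) * 2020-07-22 2022-01-25 深圳市茁壮网络股份有限公司 Data access method and apparatus
CN113127421A (zh) * 2021-04-01 2021-07-16 山东英信计算机技术有限公司 Method and device for retrieving file content in a storage system
CN113254395A (zh) * 2021-05-14 2021-08-13 杭州圆图网络技术有限公司 File annotation method based on a hash algorithm and a file management system
CN115964754B (zh) * 2023-03-16 2023-05-23 长城数字能源(西安)科技有限公司 Big data secure storage method and apparatus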

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020135801A1 (en) * 2000-12-08 2002-09-26 Gary Tessman Distributed image storage architecture
CN1614591A (zh) * 2004-12-02 2005-05-11 中国科学院计算技术研究所 Method for organizing and accessing distributed file system directories
CN101576854A (zh) * 2009-04-03 2009-11-11 成都市华为赛门铁克科技有限公司 File access method, apparatus, and system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7412449B2 (en) * 2003-05-23 2008-08-12 Sap Aktiengesellschaft File object storage and retrieval using hashing techniques
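CN101072222A (zh) * 2006-05-12 2007-11-14 腾讯科技(深圳)有限公司 Picture follow-browsing system and method based on peer-to-peer technology
CN100486206C (zh) * 2006-08-22 2009-05-06 中国科学院声学研究所 Signaling control method for IMS-based peer-to-peer networking shared services
CN100499463C (zh) * 2007-01-12 2009-06-10 北京时代亮点科技发展有限公司 Peer-to-peer network transmission method for mobile terminals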

Also Published As

Publication number Publication date
CN103384256A (zh) 2013-11-06

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12875880

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12875880

Country of ref document: EP

Kind code of ref document: A1