GB201113720D0 - Index handling method and apparatus for a data de-duplication system - Google Patents

Index handling method and apparatus for a data de-duplication system

Info

Publication number
GB201113720D0
Authority
GB
United Kingdom
Prior art keywords
data
identify
chunk
data block
chunks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
GBGB1113720.5A
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
REPLIFY Ltd
Original Assignee
REPLIFY Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by REPLIFY Ltd
Priority to GBGB1113720.5A
Publication of GB201113720D0
Priority to GB1214289.9A
Ceased

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0602 Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608 Saving storage space on storage systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/13 File access structures, e.g. distributed indices
    • G06F16/137 Hash-based
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/17 Details of further file system functions
    • G06F16/174 Redundancy elimination performed by the file system
    • G06F16/1748 De-duplication implemented within the file system, e.g. based on file segments
    • G06F16/1752 De-duplication implemented within the file system, e.g. based on file segments based on file chunks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0628 Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638 Organizing or formatting or addressing of data
    • G06F3/064 Management of blocks
    • G06F3/0641 De-duplication techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0668 Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671 In-line storage system
    • G06F3/0673 Single storage device
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1402 Saving, restoring, recovering or retrying
    • G06F11/1446 Point-in-time backing up or restoration of persistent data
    • G06F11/1448 Management of the data involved in backup or backup restore
    • G06F11/1453 Management of the data involved in backup or backup restore using de-duplication of the data

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A data de-duplication system includes a data manager (12, fig. 1) for storing data blocks (40) in a memory (14), and an index (16), e.g. a sparse bitmap, for identifying which data blocks are stored in the memory. The index includes chunks (22), e.g. of bits corresponding to hash values of given data blocks, and a chunk allocation record (30) is provided which has record entries (32). Each record entry is associated with a range of data values that are associated with the data blocks, and is configurable to identify a respective chunk. In respect of a received data block, the data manager refers to the record entry associated with the range of values in which the data value for the data block falls. In the event that the record entry does not identify any of the chunks, the data manager selects one of the chunks, configures the record entry to identify the selected chunk, configures the selected chunk to identify the data value for the data block, and stores the data block in the memory. In the event that the record entry does identify one of the chunks, the data manager determines whether the identified chunk is configured to identify the data value. Upon determining that the identified chunk does not identify the data value, the data manager configures the identified chunk to identify the data value for the data block and stores the data block in the memory; upon determining that the identified chunk does identify the data value, it does not store the data block.
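
The decision flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the class name DedupIndex, the method names, the use of a truncated SHA-256 digest as the data value, the fixed-size chunk pool, and all sizes are assumptions made purely for illustration.

```python
import hashlib


class DedupIndex:
    """Illustrative sparse-bitmap index; all names and sizes are assumptions."""

    def __init__(self, hash_bits=24, chunk_bits=1024, pool_size=64):
        self.hash_bits = hash_bits
        self.chunk_bits = chunk_bits
        # Chunk allocation record: one entry per range of data values.
        # None means the entry does not yet identify any chunk.
        self.record = [None] * ((1 << hash_bits) // chunk_bits)
        # Pool of bitmap chunks; only chunks that are handed out get used.
        self.chunks = [bytearray(chunk_bits // 8) for _ in range(pool_size)]
        self.next_free = 0
        # Stand-in for the memory that holds the stored data blocks.
        self.store = {}

    def data_value(self, block):
        # Assumption: the data value is the top hash_bits bits of SHA-256.
        digest = hashlib.sha256(block).digest()
        return int.from_bytes(digest, "big") >> (256 - self.hash_bits)

    def put(self, block):
        """Return True if the block was stored, False if treated as a duplicate."""
        value = self.data_value(block)
        entry, bit = divmod(value, self.chunk_bits)
        if self.record[entry] is None:
            # Record entry identifies no chunk: select one and link it.
            if self.next_free == len(self.chunks):
                raise MemoryError("chunk pool exhausted")
            self.record[entry] = self.next_free
            self.next_free += 1
        chunk = self.chunks[self.record[entry]]
        byte, mask = bit // 8, 1 << (bit % 8)
        if chunk[byte] & mask:
            # Chunk already identifies this data value: do not store.
            # Note: a truncated hash can collide, so real systems need more care.
            return False
        chunk[byte] |= mask  # configure the chunk to identify the data value
        self.store[value] = block
        return True
```

For example, calling put(b"alpha") twice on a fresh DedupIndex stores the block the first time and skips it the second time. Because chunks are taken from the pool only when a value range is first seen, bitmap space is consumed only for ranges that actually occur, which is the point of a sparse index.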
GBGB1113720.5A 2011-08-10 2011-08-10 Index handling method and apparatus for a data de-duplication system Ceased GB201113720D0 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GBGB1113720.5A GB201113720D0 (en) 2011-08-10 2011-08-10 Index handling method and apparatus for a data de-duplication system
GB1214289.9A GB2493832B (en) 2011-08-10 2012-08-10 Index handling method and apparatus for a data de-duplication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB1113720.5A GB201113720D0 (en) 2011-08-10 2011-08-10 Index handling method and apparatus for a data de-duplication system

Publications (1)

Publication Number | Publication Date
GB201113720D0 (en) | 2011-09-21

Family

ID=44735663

Family Applications (2)

Application Number | Status | Publication | Priority Date | Filing Date | Title
GBGB1113720.5A | Ceased | GB201113720D0 (en) | 2011-08-10 | 2011-08-10 | Index handling method and apparatus for a data de-duplication system
GB1214289.9A | Active | GB2493832B (en) | 2011-08-10 | 2012-08-10 | Index handling method and apparatus for a data de-duplication system

Family Applications After (1)

Application Number | Status | Publication | Priority Date | Filing Date | Title
GB1214289.9A | Active | GB2493832B (en) | 2011-08-10 | 2012-08-10 | Index handling method and apparatus for a data de-duplication system

Country Status (1)

Country Link
GB (2) GB201113720D0 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN103246730B (en) * | 2013-05-08 | 2016-08-10 | 网易(杭州)网络有限公司 | File memory method and equipment, document sending method and equipment
US11995060B2 (en) * | 2021-02-02 | 2024-05-28 | Maxlinear, Inc. | Hashing a data set with multiple hash engines

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US7720892B1 (en) * | 2006-06-30 | 2010-05-18 | Emc Corporation | Bulk updates and tape synchronization
WO2010045262A1 (en) * | 2008-10-14 | 2010-04-22 | Wanova Technologies, Ltd. | Storage-network de-duplication

Also Published As

Publication number Publication date
GB201214289D0 (en) 2012-09-26
GB2493832A (en) 2013-02-20
GB2493832B (en) 2020-01-29

Legal Events

Date Code Title Description
AT Applications terminated before publication under section 16(1)