GB201113720D0 - Index handling method and apparatus for a data de-duplication system - Google Patents
Index handling method and apparatus for a data de-duplication system
- Publication number
- GB201113720D0
- Authority
- GB
- United Kingdom
- Prior art keywords
- data
- identify
- chunk
- data block
- chunks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0608—Saving storage space on storage systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/13—File access structures, e.g. distributed indices
- G06F16/137—Hash-based
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1748—De-duplication implemented within the file system, e.g. based on file segments
- G06F16/1752—De-duplication implemented within the file system, e.g. based on file segments based on file chunks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
- G06F3/064—Management of blocks
- G06F3/0641—De-duplication techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G06F11/1446—Point-in-time backing up or restoration of persistent data
- G06F11/1448—Management of the data involved in backup or backup restore
- G06F11/1453—Management of the data involved in backup or backup restore using de-duplication of the data
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A data de-duplication system includes a data manager (12, fig. 1) for storing data blocks (40) in a memory (14) and an index (16), e.g. a sparse bitmap, for identifying which data blocks are stored in the memory. The index includes chunks (22), e.g. of bits, corresponding to hash values of given data blocks, and a chunk allocation record (30) is provided which has a set of record entries (32). Each record entry is associated with a range of data values that are associated with the data blocks, and is configurable to identify a respective chunk. In respect of a received data block, the data manager refers to the record entry associated with the range of values in which the data value for the data block falls. In the event that the record entry does not identify any of the chunks, the data manager selects one of the chunks, configures the record entry to identify the selected chunk, configures the selected chunk to identify the data value for the data block, and stores the data block in the memory. In the event that the record entry does identify one of the chunks, the data manager determines whether the identified chunk is configured to identify the data value. Upon determining that the identified chunk does not identify the data value, it configures the identified chunk to identify the data value for the data block and stores the data block in the memory; upon determining that the identified chunk does identify the data value, it does not store the data block.
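The decision flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class and method names (`DedupStore`, `Chunk`, `put`), the use of SHA-256, the 16-bit data value, and the 8-bit range selector are all assumptions chosen to keep the example small. The sketch also allocates a fresh chunk on demand, whereas the abstract speaks of selecting one of the existing chunks.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """One index chunk: a small bitmap covering a single range of data values."""
    bits: bytearray


@dataclass
class DedupStore:
    # All sizes below are illustrative assumptions, not values from the patent.
    value_bits: int = 16   # width of the data value derived from the block hash
    range_bits: int = 8    # high-order bits that select a record entry
    memory: dict = field(default_factory=dict)  # stands in for the block memory
    record: dict = field(default_factory=dict)  # chunk allocation record: range -> chunk

    def data_value(self, block: bytes) -> int:
        """Derive a short data value from the block's SHA-256 hash."""
        digest = hashlib.sha256(block).digest()
        return int.from_bytes(digest[:4], "big") >> (32 - self.value_bits)

    def put(self, block: bytes) -> bool:
        """Store the block unless the index already identifies its data value.

        Returns True if the block was stored, False if it was treated as a duplicate.
        """
        value = self.data_value(block)
        offset_bits = self.value_bits - self.range_bits
        entry = value >> offset_bits                # which record entry (range)
        offset = value & ((1 << offset_bits) - 1)   # position inside the chunk

        chunk = self.record.get(entry)
        if chunk is None:
            # Record entry identifies no chunk yet: allocate one (a simplification
            # of "selects one of the chunks") and point the entry at it.
            chunk = Chunk(bytearray(((1 << offset_bits) + 7) // 8))
            self.record[entry] = chunk

        byte, mask = offset // 8, 1 << (offset % 8)
        if chunk.bits[byte] & mask:
            # Chunk already identifies this data value: do not store the block.
            # As in the abstract, no byte-for-byte comparison is performed here.
            return False

        chunk.bits[byte] |= mask      # configure the chunk to identify the value
        self.memory[value] = block    # store the block
        return True
```

With these assumed sizes, calling `put` twice with the same bytes stores the block once and returns `False` the second time. Chunks exist only for ranges that have actually been seen, which is one way to read the "sparse bitmap" wording. A data value this short would collide often, so a practical system would presumably use a much wider value.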
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GBGB1113720.5A GB201113720D0 (en) | 2011-08-10 | 2011-08-10 | Index handling method and apparatus for a data de-duplication system |
| GB1214289.9A GB2493832B (en) | 2011-08-10 | 2012-08-10 | Index handling method and apparatus for a data de-duplication system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GBGB1113720.5A GB201113720D0 (en) | 2011-08-10 | 2011-08-10 | Index handling method and apparatus for a data de-duplication system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| GB201113720D0 (en) | 2011-09-21 |
Family
ID=44735663
Family Applications (2)
| Application Number | Legal Status | Priority Date | Filing Date | Title |
|---|---|---|---|---|
| GBGB1113720.5A GB201113720D0 (en) | Ceased | 2011-08-10 | 2011-08-10 | Index handling method and apparatus for a data de-duplication system |
| GB1214289.9A GB2493832B (en) | Active | 2011-08-10 | 2012-08-10 | Index handling method and apparatus for a data de-duplication system |
Family Applications After (1)
| Application Number | Legal Status | Priority Date | Filing Date | Title |
|---|---|---|---|---|
| GB1214289.9A GB2493832B (en) | Active | 2011-08-10 | 2012-08-10 | Index handling method and apparatus for a data de-duplication system |
Country Status (1)
| Country | Link |
|---|---|
| GB (2) | GB201113720D0 (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103246730B (en) * | 2013-05-08 | 2016-08-10 | 网易(杭州)网络有限公司 | File memory method and equipment, document sending method and equipment |
| US11995060B2 (en) * | 2021-02-02 | 2024-05-28 | Maxlinear, Inc. | Hashing a data set with multiple hash engines |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7720892B1 (en) * | 2006-06-30 | 2010-05-18 | Emc Corporation | Bulk updates and tape synchronization |
| WO2010045262A1 (en) * | 2008-10-14 | 2010-04-22 | Wanova Technologies, Ltd. | Storage-network de-duplication |
- 2011-08-10: GB application GBGB1113720.5A (GB201113720D0), ceased
- 2012-08-10: GB application GB1214289.9A (GB2493832B), active
Also Published As
| Publication number | Publication date |
|---|---|
| GB201214289D0 (en) | 2012-09-26 |
| GB2493832A (en) | 2013-02-20 |
| GB2493832B (en) | 2020-01-29 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| GB2522373A (en) | Storage system and method of storing and managing data | |
| GB2522372A (en) | Storage system and method of storing and managing data | |
| WO2014164016A3 (en) | Block-based data storage low-overhead deduplication | |
| GB2550800A (en) | Managing data in storage according to a log structure | |
| CA2902868C (en) | Managing operations on stored data units | |
| PH12017501245A1 (en) | Design of sample entry and operation point signalling in a layered video file format | |
| GB2599061B (en) | Block mode toggling in data storage system | |
| EP4239549A3 (en) | Automated asset management system with multiple sensing technologies | |
| MY182085A (en) | Data management for connected devices | |
| GB2472520A (en) | Data processing apparatus and method of processing data | |
| WO2015066719A3 (en) | Use of solid state storage devices and the like in data deduplication | |
| GB2569060A (en) | Workload optimized data deduplication using ghost fingerprints | |
| WO2013019869A3 (en) | Data fingerpringting for copy accuracy assurance | |
| GB2484019A (en) | An integrated approach for deduplicating data in a distributed environment that involves a source and a target | |
| EP3806021A4 (en) | Commodity recommendation method, apparatus and system, and computer readable storage medium | |
| WO2011116087A3 (en) | Highly scalable and distributed data de-duplication | |
| TN2017000327A1 (en) | Restriction on palette block size in video coding | |
| WO2013186665A3 (en) | Methods and apparatus for storing, suggesting, and/or utilizing lighting settings | |
| EP3376379A4 (en) | Task management methods and system, and computer storage medium | |
| GB201206443D0 (en) | Backup and storage system | |
| EP3654195A4 (en) | Index data storage and retrieval methods and apparatuses, and storage medium | |
| GB201206444D0 (en) | Data cleaning | |
| EP3401798A4 (en) | Push information rough selection sorting method, device and computer storage medium | |
| SG11202012438VA (en) | Methods, apparatuses and devices for magnifying feature map and computer readable storage medium | |
| EP3948630A4 (en) | Validating data stored in memory using cryptographic hashes | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AT | Applications terminated before publication under section 16(1) | |