[go: up one dir, main page]

File: DEDUPE-TODO

package info (click to toggle)
fio 3.33-3
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 10,896 kB
  • sloc: ansic: 78,615; sh: 5,191; python: 4,795; makefile: 832; yacc: 204; lex: 184
file content (16 lines) | stat: -rw-r--r-- 808 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
- Shifted dedup-able data.
  Allow for dedup buffer generation to shift contents by random number
  of sectors (fill the gaps with uncompressible data). Some storage
  subsystems modernized the deduplication detection algorithms to look
  for shifted data as well. For example, some databases push a timestamp
  on the prefix of written blocks, which makes the underlying data
  dedup-able in different alignment. FIO should be able to simulate such
  workload.

- Generation of similar data (but not exact).
  A rising trend in enterprise storage systems.
  Generation of "similar" data means random uncompressible buffers
  that differ by few(configurable number of) bits from each other.
  The storage subsystem usually identifies the similar buffers using
  locality-sensitive hashing or other methods.