Scalable processing and autocovariance computation of big functional data
Brisaboa et al., 2018
- Document ID
- 2035753594892790161
- Author
- Brisaboa N
- Cao R
- Paramá J
- Silva‐Coira F
- Publication year
- 2018
- Publication venue
- Software: Practice and Experience
Snippet
This paper presents two main contributions. The first is a compact representation of huge sets of functional data or trajectories of continuous‐time stochastic processes, which keeps the data compressed at all times, even while it is being processed in main memory. It is …
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
          - G06F17/30067—File systems; File servers
            - G06F17/30129—Details of further file system functionalities
              - G06F17/3015—Redundancy elimination performed by the file system
                - G06F17/30153—Redundancy elimination performed by the file system using compression, e.g. sparse files
                - G06F17/30156—De-duplication implemented within the file system, e.g. based on file segments
          - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
            - G06F17/30587—Details of specialised database models
            - G06F17/30312—Storage and indexing structures; Management thereof
          - G06F17/30861—Retrieval from the Internet, e.g. browsers
          - G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
        - G06F17/20—Handling natural language data
          - G06F17/21—Text processing
            - G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
      - G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- H—ELECTRICITY
  - H03—BASIC ELECTRONIC CIRCUITRY
    - H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
      - H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same information or similar information or a subset of information is represented by a different sequence or number of digits
        - H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
          - H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
            - H03M7/42—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
          - H03M7/3084—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
            - H03M7/3088—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing the use of a dictionary, e.g. LZ78
Similar Documents
Publication | Title |
---|---|
US11151126B2 (en) | Hybrid column store providing both paged and memory-resident configurations |
Lemire et al. | Decoding billions of integers per second through vectorization |
Abadi et al. | Integrating compression and execution in column-oriented database systems |
US8239421B1 (en) | Techniques for compression and processing optimizations by using data transformations |
US20150234899A1 (en) | Data record compression with progressive and/or selective decomposition |
KR100803285B1 (en) | Queryable XML Compression Method Using Inverse Arithmetic Coding and Type Inference Engine |
Yan et al. | Compressing term positions in web indexes |
Barbarioli et al. | Hierarchical residual encoding for multiresolution time series compression |
Petri et al. | Compact inverted index storage using general‐purpose compression libraries |
US10885074B2 (en) | Memory optimization system for inverted indexes |
Deng et al. | Memory deduplication: An effective approach to improve the memory system |
Hassan et al. | Arithmetic N-gram: an efficient data compression technique |
Zhang et al. | High-Ratio Compression for Machine-Generated Data |
Su et al. | Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask |
Brisaboa et al. | Scalable processing and autocovariance computation of big functional data |
Narashiman et al. | AlphaZip: Neural Network-Enhanced Lossless Text Compression |
Chen et al. | CMIC: an efficient quality score compressor with random access functionality |
US9235610B2 (en) | Short string compression |
Qiao et al. | Blitzcrank: Fast Semantic Compression for In-memory Online Transaction Processing |
Mesut et al. | A method to improve full-text search performance of MongoDB |
Jiancheng et al. | Block‐Split Array Coding Algorithm for Long‐Stream Data Compression |
Oswald et al. | An efficient and novel data clustering and run length encoding approach to image compression |
Dong et al. | Content-aware partial compression for big textual data analysis acceleration |
Keskin et al. | Single and Binary Performance Comparison of Data Compression Algorithms for Text Files |
Xiao et al. | Dzip: A data deduplication-compatible enhanced version of gzip |