Conjoined Audio Fingerprint based on Interhash and Intra hash Algorithms
Kim et al., 2015
- Document ID
- 7743193465443989800
- Author
- Kim D
- Choi H
- Publication year
- 2015
- Publication venue
- International Journal of Contents
Snippet
In practice, the most important performance parameters for a music information retrieval (MIR) service are the robustness of the fingerprint in real noise environments and the recognition accuracy when the obtained query clips are matched with an entry in the database. To satisfy …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30755—Query formulation specially adapted for audio data retrieval
- G06F17/30758—Query by example, e.g. query by humming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30749—Audio data retrieval using information manually generated or using information not derived from the audio data, e.g. title and artist information, time and location information, usage information, user ratings
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
Publication | Title | Publication Date |
---|---|---|
TWI222623B (en) | System and method for music identification | |
CN103999150B (en) | Low-complexity duplicate detection in media data | |
US9317561B2 (en) | Scene change detection around a set of seed points in media data | |
Kobayashi et al. | Audio feature extraction based on sub-band signal correlations for music genre classification | |
Park et al. | Frequency‐Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real‐Noise Environments | |
Son et al. | Sub-fingerprint masking for a robust audio fingerprinting system in a real-noise environment for portable consumer devices | |
Kim et al. | Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment | |
Bellettini et al. | A framework for robust audio fingerprinting. | |
Kekre et al. | A review of audio fingerprinting and comparison of algorithms | |
JP2017518715A (en) | Method and apparatus for generating a fingerprint of an information signal | |
Liu et al. | Audio fingerprinting based on multiple hashing in DCT domain | |
Pan et al. | Audio fingerprinting based on local energy centroid | |
Kim et al. | Conjoined Audio Fingerprint based on Interhash and Intra hash Algorithms | |
Kim et al. | Robust audio fingerprinting method using prominent peak pair based on modulated complex lapped transform | |
Li et al. | Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain | |
Li et al. | Robust audio identification for MP3 popular music | |
Park et al. | Frequency filtering for a highly robust audio fingerprinting scheme in a real-noise environment | |
Kim et al. | TV advertisement search based on audio peak-pair hashing in real environments | |
Han et al. | A filtering method for audio fingerprint based on multiple measurements | |
Lee et al. | Audio fingerprinting to identify TV commercial advertisement in real-noisy environment | |
Van Nieuwenhuizen et al. | The study and implementation of Shazam’s audio fingerprinting algorithm for advertisement identification | |
Xiong et al. | Audio fingerprinting based on dynamic subband locating and normalized SSC | |
Son et al. | A Robust Audio Fingerprinting System with Predominant Pitch Extraction in Real-Noise Environment | |
Park et al. | Audio fingerprinting scheme by temporal filtering for audio identification immune to channel-distortion | |
Van Nieuwenhuizen | Comparison of two audio fingerprinting algorithms for advertisement identification |