Burges et al., 2003 - Google Patents
Identifying audio clips with RAREBurges et al., 2003
- Document ID
- 6999744440407771129
- Author
- Burges C
- Platt J
- Goldstein J
- Publication year
- Publication venue
- Proceedings of the eleventh ACM international conference on Multimedia
External Links
Snippet
In this paper, we describe RARE (Robust Audio Recognition Engine): a system for identifying audio streams and files. RARE can be used in a variety of applications: from enhancing the consumer listening experience to cleaning large audio databases. RARE …
- 238000004140 cleaning 0 abstract description 2
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30755—Query formulation specially adapted for audio data retrieval
- G06F17/30758—Query by example, e.g. query by humming
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00711—Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8977067B1 (en) | Audio identification using wavelet-based signatures | |
Cano et al. | A review of algorithms for audio fingerprinting | |
US9208790B2 (en) | Extraction and matching of characteristic fingerprints from audio signals | |
US9286909B2 (en) | Method and system for robust audio hashing | |
US9093120B2 (en) | Audio fingerprint extraction by scaling in time and resampling | |
Gfeller et al. | Now playing: Continuous low-power music recognition | |
US20060155399A1 (en) | Method and system for generating acoustic fingerprints | |
US20150310008A1 (en) | Clustering and synchronizing multimedia contents | |
CN102436806A (en) | Audio copy detection method based on similarity | |
Burges et al. | Identifying audio clips with RARE | |
Kim et al. | Quick audio retrieval using multiple feature vectors | |
Bakker et al. | Semantic video retrieval using audio analysis | |
You et al. | Music Identification System Using MPEG‐7 Audio Signature Descriptors | |
Kimura et al. | A quick search method for audio signals based on a piecewise linear representation of feature trajectories | |
Yadav et al. | Real time audio synchronization using audio fingerprinting techniques | |
Cotton et al. | Finding similar acoustic events using matching pursuit and locality-sensitive hashing | |
Kashino et al. | Robust search methods for music signals based on simple representation | |
Medina et al. | Audio fingerprint parameterization for multimedia advertising identification | |
Herley | Accurate repeat finding and object skipping using fingerprints | |
Van Nieuwenhuizen et al. | The study and implementation of shazam’s audio fingerprinting algorithm for advertisement identification | |
Xiong et al. | An improved audio fingerprinting algorithm with robust and efficient | |
KR20100056430A (en) | Method for extracting feature vector of audio data and method for matching the audio data using the method | |
KR101002731B1 (en) | Feature vector extraction method of audio data, computer readable recording medium recording the method and matching method of audio data using same | |
Liu et al. | Wavelet-based audio fingerprinting algorithm robust to linear speed change | |
Shuyu | Efficient and robust audio fingerprinting |