Burges et al., 2003 - Google Patents

Identifying audio clips with RARE

Burges et al., 2003

Document ID: 6999744440407771129
Author: Burges C; Platt J; Goldstein J
Publication year: 2003
Publication venue: Proceedings of the eleventh ACM international conference on Multimedia

External Links

Cited by

Snippet

In this paper, we describe RARE (Robust Audio Recognition Engine): a system for identifying audio streams and files. RARE can be used in a variety of applications: from enhancing the consumer listening experience to cleaning large audio databases. RARE …

Continue reading at dl.acm.org (other versions)

238000004140 cleaning 0 abstract description 2

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30755—Query formulation specially adapted for audio data retrieval
- G06F17/30758—Query by example, e.g. query by humming
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00711—Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content

Similar Documents

Publication	Publication Date	Title
US8977067B1 (en)	2015-03-10	Audio identification using wavelet-based signatures
Cano et al.	2002	A review of algorithms for audio fingerprinting
US9208790B2 (en)	2015-12-08	Extraction and matching of characteristic fingerprints from audio signals
US9286909B2 (en)	2016-03-15	Method and system for robust audio hashing
US9093120B2 (en)	2015-07-28	Audio fingerprint extraction by scaling in time and resampling
Gfeller et al.	2017	Now playing: Continuous low-power music recognition
US20060155399A1 (en)	2006-07-13	Method and system for generating acoustic fingerprints
US20150310008A1 (en)	2015-10-29	Clustering and synchronizing multimedia contents
CN102436806A (en)	2012-05-02	Audio copy detection method based on similarity
Burges et al.	2003	Identifying audio clips with RARE
Kim et al.	2006	Quick audio retrieval using multiple feature vectors
Bakker et al.	2002	Semantic video retrieval using audio analysis
You et al.	2013	Music Identification System Using MPEG‐7 Audio Signature Descriptors
Kimura et al.	2008	A quick search method for audio signals based on a piecewise linear representation of feature trajectories
Yadav et al.	2022	Real time audio synchronization using audio fingerprinting techniques
Cotton et al.	2009	Finding similar acoustic events using matching pursuit and locality-sensitive hashing
Kashino et al.	2007	Robust search methods for music signals based on simple representation
Medina et al.	2017	Audio fingerprint parameterization for multimedia advertising identification
Herley	2005	Accurate repeat finding and object skipping using fingerprints
Van Nieuwenhuizen et al.	2011	The study and implementation of shazam’s audio fingerprinting algorithm for advertisement identification
Xiong et al.	2013	An improved audio fingerprinting algorithm with robust and efficient
KR20100056430A (en)	2010-05-27	Method for extracting feature vector of audio data and method for matching the audio data using the method
KR101002731B1 (en)	2010-12-21	Feature vector extraction method of audio data, computer readable recording medium recording the method and matching method of audio data using same
Liu et al.	2011	Wavelet-based audio fingerprinting algorithm robust to linear speed change
Shuyu	2007	Efficient and robust audio fingerprinting