Purver, 2011 - Google Patents

Topic segmentation

Document ID: 8515599811490171561
Author: Purver M
Publication year: 2011
Publication venue: Spoken language understanding: systems for extracting semantic information from speech

Snippet

This chapter discusses the task of topic segmentation: automatically dividing single long recordings or transcripts into shorter, topically coherent segments. First, it looks at the task itself, the applications which require it, and some ways to evaluate accuracy. The chapter …
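
The snippet mentions ways to evaluate segmentation accuracy but does not say which metrics the chapter covers. As an illustration only, the sketch below computes Pk, a window-based error measure widely used for topic segmentation. The function name `pk`, the per-unit segment-label representation, and the default window size (half the mean reference segment length) are assumptions made for this example, not details taken from the chapter.

```python
from typing import Optional, Sequence


def pk(reference: Sequence[int], hypothesis: Sequence[int],
       k: Optional[int] = None) -> float:
    """Pk segmentation error: lower is better, 0.0 means perfect agreement.

    Each sequence gives one segment label per unit (e.g. per utterance),
    so [0, 0, 0, 1, 1, 1] is two segments of three units each.
    """
    n = len(reference)
    if len(hypothesis) != n:
        raise ValueError("reference and hypothesis must have the same length")

    if k is None:
        # Assumed convention: half the mean reference segment length.
        num_segments = 1 + sum(a != b for a, b in zip(reference, reference[1:]))
        k = max(1, round(n / num_segments / 2))
    if k < 1 or k >= n:
        raise ValueError("window size k must satisfy 1 <= k < len(reference)")

    errors = 0
    for i in range(n - k):
        # Are units i and i + k in the same segment, according to each side?
        same_in_reference = reference[i] == reference[i + k]
        same_in_hypothesis = hypothesis[i] == hypothesis[i + k]
        if same_in_reference != same_in_hypothesis:
            errors += 1
    return errors / (n - k)


if __name__ == "__main__":
    gold = [0, 0, 0, 1, 1, 1]
    print(pk(gold, gold, k=2))     # identical segmentations
    print(pk(gold, [0] * 6, k=2))  # hypothesis misses the only boundary
```

Because Pk compares pairs of units a fixed distance apart instead of exact boundary positions, a boundary placed one unit off is penalised less than one missed entirely.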

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30796—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using original textual content or text extracted from visual content or transcript of audio data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification

Similar Documents

Publication	Publication Date	Title
Purver	2011	Topic segmentation
Lee et al.	2005	Spoken document understanding and organization
Bhatt et al.	2011	Multimedia data mining: state of the art and challenges
Purver et al.	2006	Unsupervised topic modelling for multi-party spoken discourse
Chua et al.	2004	Story boundary detection in large broadcast news video archives: techniques, experience and trends
Liu et al.	2011	Speech summarization
Purver et al.	2007	Detecting and summarizing action items in multi-party dialogue
Alrumiah et al.	2022	Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement
Liu et al.	2015	Combining relevance language modeling and clarity measure for extractive speech summarization
Bokaei et al.	2016	Extractive summarization of multi-party meetings through discourse segmentation
Chaisorn et al.	2003	A Two-Level Multi-Modal Approach for Story Segmentation of Large News Video Corpus
Andra et al.	2019	Automatic lecture video content summarization with attention-based recurrent neural network
Hsueh et al.	2006	Automatic topic segmentation and labeling in multiparty dialogue
Lease	2007	Natural language processing for information retrieval: the time is ripe (again)
Rouvier et al.	2015	Audio-based video genre identification
Inoue et al.	2022	Infinite SCAN: An infinite model of diachronic semantic change
Lin et al.	2009	A comparative study of probabilistic ranking models for Chinese spoken document summarization
Hsueh et al.	2010	Combining multiple knowledge sources for dialogue segmentation in multimedia archives
Lin et al.	2010	Leveraging Kullback–Leibler divergence measures and information-rich cues for speech summarization
Georgescul et al.	2006	Word distributions for thematic segmentation in a support vector machine approach
Fernández et al.	2008	Identifying relevant phrases to summarize decisions in spoken meetings
Kong et al.	2010	Semantic analysis and organization of spoken documents based on parameters derived from latent topics
Yu et al.	2018	Learning distributed sentence representations for story segmentation
Xie	2005	Unsupervised pattern discovery for multimedia sequences
Soares et al.	2018	A framework for automatic topic segmentation in video lectures