Lane et al., 2021 - Google Patents

A computational model for interactive transcription

Lane et al., 2021

Document ID: 15793213256408539851
Author: Lane W; Bettinson M; Bird S
Publication year: 2021
Publication venue: 2nd Workshop on Data Science with Human-in-the-Loop: Language Advances, DaSH-LA 2021

External Links

Cited by

Snippet

Transcribing low resource languages can be challenging in the absence of a comprehensive lexicon and proficient transcribers. Accordingly, we seek a way to enable interactive transcription, whereby the machine amplifies human efforts. This paper presents …

Continue reading at researchers.cdu.edu.au (PDF) (other versions)

230000035897 transcription 0 title abstract description 69

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/24—Editing, e.g. insert/delete
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances

Similar Documents

Publication	Publication Date	Title
RU2571608C2 (en)	2015-12-20	Creating notes using voice stream
Iancu	2019	Evaluating Google speech-to-text API's performance for Romanian e-learning resources
US20180373702A1 (en)	2018-12-27	Interactive method and apparatus based on test-type application
Nagy et al.	2015	Extending ELAN into variationist sociolinguistics
Yu et al.	2010	Sequential labeling using deep-structured conditional random fields
WO2020210561A1 (en)	2020-10-15	Unsupervised adaptation of sentiment lexicon
Fan et al.	2020	Phonetics and ambiguity comprehension gated attention network for humor recognition
JP2009140466A (en)	2009-06-25	Method and system for providing conversation dictionary services based on user created dialog data
CN114281948A (en)	2022-04-05	A method for determining minutes and related equipment
Moyal et al.	2013	Phonetic search methods for large speech databases
Gustafson	2002	Developing multimodal spoken dialogue systems: Empirical studies of spoken human–computer interaction
Majdik et al.	2023	Building better machine learning models for rhetorical analyses: The use of rhetorical feature sets for training artificial neural network models
Lane et al.	2021	A computational model for interactive transcription
CN110647613A (en)	2020-01-03	Courseware construction method, courseware construction device, courseware construction server and storage medium
JP6743108B2 (en)	2020-08-19	PATTERN RECOGNITION MODEL AND PATTERN LEARNING DEVICE, GENERATION METHOD THEREOF, FAQ EXTRACTION METHOD USING THE SAME, PATTERN RECOGNITION DEVICE, AND PROGRAM
KR100852970B1 (en)	2008-08-19	Language Learning System and Method Using Image Segmentation Techniques, Recording Media and Language Learning Materials
Jong et al.	2008	Access to recorded interviews: A research agenda
Le Ferrand et al.	2022	Fashioning local designs from generic speech technologies in an Australian aboriginal community
Grigorov	2024	Harnessing Python 3.11 and Python Libraries for LLM Development
Gurram et al.	2023	String Kernel-based techniques for native language identification
CN110276001B (en)	2021-10-08	Inventory page identification method, apparatus, computing device and medium
JP2013109738A (en)	2013-06-06	Semantic label application model learning device, semantic label application device, semantic label application model learning method and program
Kamineni et al.	2024	Advancements and challenges of using natural language processing in the healthcare sector
Wu et al.	2019	Generating pseudo-relevant representations for spoken document retrieval
Kumar et al.	2024	Voice to Text Summarization Using NLP