[go: up one dir, main page]

GB2361569A - System and method for automating transcription services - Google Patents

System and method for automating transcription services

Info

Publication number
GB2361569A
GB2361569A GB0118231A GB0118231A GB2361569A GB 2361569 A GB2361569 A GB 2361569A GB 0118231 A GB0118231 A GB 0118231A GB 0118231 A GB0118231 A GB 0118231A GB 2361569 A GB2361569 A GB 2361569A
Authority
GB
United Kingdom
Prior art keywords
training
current user
file
enrollment
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0118231A
Other versions
GB0118231D0 (en
GB2361569B (en
Inventor
Jonathan Kahn
Charles Qin
Thomas P Flynn
Robert J Tippe
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Custom Speech USA Inc
Original Assignee
Custom Speech USA Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Custom Speech USA Inc filed Critical Custom Speech USA Inc
Priority to GB0324945A priority Critical patent/GB2390930B/en
Priority to GB0324946A priority patent/GB2391100B/en
Publication of GB0118231D0 publication Critical patent/GB0118231D0/en
Publication of GB2361569A publication Critical patent/GB2361569A/en
Application granted granted Critical
Publication of GB2361569B publication Critical patent/GB2361569B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Telephonic Communication Services (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A system for substantially automating transcription services for multiple voice users including a manual transcription station, a speech recognition program and a routing program. The system establishes a profile for each of the voice users containing a training status which is selected from the group of enrollment, training, automated and stop automation. The system generates a uniquely identified voice dictation file from a current voice user and - based on the training status the system - routes the uniquely identified voice dictation file to a manual transcription station and the speech recognition program. A human transcriptionist creates transcribed files for each received voice dictation file. The speech recognition program automatically creates a written text for each received voice dictation file if the training status of the current user is training or automated. A verbatim file is manually established if the training status of the current user is enrollment or training and the speech recognition program is trained with an acoustic model for the current user using the verbatim file and the voice dictation file if the training status of the current user is enrollment or training. The transcribed file is returned to the current user if the training status of the current user is enrollment or training or the written text is returned if the training status of the current user is automated. An apparatus and method is also disclosed for testing the skills of a transcriptionist. These apparatuses and methods may also be used to establish new base language model for mass distribution.
GB0118231A 1999-02-05 2000-02-04 System and method for automating transcription services Expired - Fee Related GB2361569B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB0324945A GB2390930B (en) 1999-02-05 2000-02-04 System and method for automating transcription services
GB0324946A GB2391100B (en) 1999-02-05 2000-02-04 System and method for automating transcription services

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11894999P 1999-02-05 1999-02-05
PCT/US2000/002808 WO2000046787A2 (en) 1999-02-05 2000-02-04 System and method for automating transcription services

Publications (3)

Publication Number Publication Date
GB0118231D0 GB0118231D0 (en) 2001-09-19
GB2361569A true GB2361569A (en) 2001-10-24
GB2361569B GB2361569B (en) 2003-12-24

Family

ID=22381731

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0118231A Expired - Fee Related GB2361569B (en) 1999-02-05 2000-02-04 System and method for automating transcription services

Country Status (5)

Country Link
AU (1) AU3588200A (en)
CA (1) CA2362462A1 (en)
GB (1) GB2361569B (en)
HK (1) HK1041086A1 (en)
WO (1) WO2000046787A2 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7383187B2 (en) 2001-01-24 2008-06-03 Bevocal, Inc. System, method and computer program product for a distributed speech recognition tuning platform
ATE300084T1 (en) 2001-03-16 2005-08-15 Koninkl Philips Electronics Nv TRANSCRIPTION SERVICE WITH CANCEL OF AUTOMATIC TRANSCRIPTION
DE10126020A1 (en) * 2001-05-28 2003-01-09 Olaf Berberich Automatic conversion of words spoken by speaker into digitally coded terms for processing by computer involves displaying term rejections in correction window for direct entry correction
GB2388739B (en) 2001-11-03 2004-06-02 Dremedia Ltd Time ordered indexing of an information stream
GB2388738B (en) 2001-11-03 2004-06-02 Dremedia Ltd Time ordered indexing of audio data
WO2008041083A2 (en) * 2006-10-02 2008-04-10 Bighand Ltd. Digital dictation workflow system and method
US8024289B2 (en) 2007-07-31 2011-09-20 Bighand Ltd. System and method for efficiently providing content over a thin client network
CN109285548A (en) * 2017-07-19 2019-01-29 阿里巴巴集团控股有限公司 Information processing method, system, electronic equipment and computer storage medium
CN116074150B (en) * 2023-03-02 2023-06-09 广东浩博特科技股份有限公司 Switch control method and device for intelligent home and intelligent home

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5799273A (en) * 1996-09-24 1998-08-25 Allvoice Computing Plc Automated proofreading using interface linking recognized words to their audio data while text is being changed
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5799273A (en) * 1996-09-24 1998-08-25 Allvoice Computing Plc Automated proofreading using interface linking recognized words to their audio data while text is being changed
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Dragon Dictate for Windows 2.0, User's Guide, British version, First edition, pages 1 to 230 *

Also Published As

Publication number Publication date
GB0118231D0 (en) 2001-09-19
GB2361569B (en) 2003-12-24
HK1041086A1 (en) 2002-06-28
CA2362462A1 (en) 2000-08-10
WO2000046787A2 (en) 2000-08-10
WO2000046787A3 (en) 2000-12-14
AU3588200A (en) 2000-08-25

Similar Documents

Publication Publication Date Title
CA2351705A1 (en) System and method for automating transcription services
AP2001002243A0 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction.
ATE297588T1 (en) ADJUSTING PHONETIC CONTEXT TO IMPROVE SPEECH RECOGNITION
ATE314718T1 (en) SPEAKER ADAPTED VOICE RECOGNITION
KR100321841B1 (en) Automatically updating language models
CN108766441A (en) A kind of sound control method and device based on offline Application on Voiceprint Recognition and speech recognition
CN105304080A (en) Speech synthesis device and speech synthesis method
Stan et al. The SWARA speech corpus: A large parallel Romanian read speech dataset
CA2537503A1 (en) Unsupervised and active learning in automatic speech recognition for call classification
ATE407411T1 (en) METHOD FOR PROVIDING ACCOUNT INFORMATION AND SYSTEM FOR WRITING DICTATE TEXT
CN103915093B (en) A kind of method and apparatus for realizing singing of voice
Komatani et al. User modeling in spoken dialogue systems to generate flexible guidance
GB2361569A (en) System and method for automating transcription services
Matoušek Building of a speech corpus optimised for unit selection TTS synthesis
CN106653002A (en) Literal live broadcasting method and platform
CN111179903A (en) Voice recognition method and device, storage medium and electric appliance
CN110503941A (en) Language competence evaluating method, device, system, computer equipment and storage medium
Hincks Processing the prosody of oral presentations
Mamiya et al. Lightly supervised GMM VAD to use audiobook for speech synthesiser
CN104735461B (en) The replacing options and device of voice AdWords in video
Komatani et al. User modeling in spoken dialogue systems for flexible guidance generation.
Mögele et al. SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus.
Hahn et al. An improved speech detection algorithm for isolated Korean utterances
JP6594273B2 (en) Questioning utterance determination device, method and program thereof
Peiró-Lilja et al. LaFresCat: a Catalan multi-accent speech dataset for text-to-speech

Legal Events

Date Code Title Description
REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1041086

Country of ref document: HK

PCNP Patent ceased through non-payment of renewal fee

Effective date: 20100204