GB2361569A - System and method for automating transcription services - Google Patents
System and method for automating transcription services
Info
- Publication number: GB2361569A
- Authority: GB (United Kingdom)
- Prior art keywords: training, current user, file, enrollment, voice
- Prior art date
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/221—Announcement of recognition results
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Telephonic Communication Services (AREA)
- Document Processing Apparatus (AREA)
Abstract
A system for substantially automating transcription services for multiple voice users includes a manual transcription station, a speech recognition program and a routing program. The system establishes a profile for each voice user containing a training status selected from the group of enrollment, training, automated and stop automation. The system generates a uniquely identified voice dictation file from a current voice user and, based on the training status, routes the uniquely identified voice dictation file to a manual transcription station and the speech recognition program. A human transcriptionist creates a transcribed file for each received voice dictation file. The speech recognition program automatically creates a written text for each received voice dictation file if the training status of the current user is training or automated. If the training status of the current user is enrollment or training, a verbatim file is manually established, and the speech recognition program is trained with an acoustic model for the current user using the verbatim file and the voice dictation file. The transcribed file is returned to the current user if the training status of the current user is enrollment or training, or the written text is returned if the training status of the current user is automated. An apparatus and method are also disclosed for testing the skills of a transcriptionist. These apparatuses and methods may also be used to establish a new base language model for mass distribution.
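The status-dependent behaviour described in the abstract can be summarised as a small decision table. The following Python sketch is illustrative only and is not taken from the patent: the names `TrainingStatus`, `Plan` and `plan_for` are hypothetical. The abstract does not say whether a file is sent to the manual transcription station when the status is automated, nor what happens under stop automation, so the sketch assumes that automated files skip the manual station and that stop automation falls back to manual transcription with no training.

```python
from dataclasses import dataclass
from enum import Enum


class TrainingStatus(Enum):
    """The four profile states named in the abstract."""
    ENROLLMENT = "enrollment"
    TRAINING = "training"
    AUTOMATED = "automated"
    STOP_AUTOMATION = "stop automation"


@dataclass(frozen=True)
class Plan:
    """What happens to one voice dictation file (hypothetical structure)."""
    to_manual_station: bool
    to_speech_recognition: bool
    create_verbatim_file: bool
    train_acoustic_model: bool
    returned: str  # "transcribed_file" or "written_text"


def plan_for(status: TrainingStatus) -> Plan:
    # Stated in the abstract: a verbatim file is made and the acoustic
    # model is trained when the status is enrollment or training.
    learning = status in (TrainingStatus.ENROLLMENT, TrainingStatus.TRAINING)
    # Stated in the abstract: written text is produced automatically
    # when the status is training or automated.
    recognise = status in (TrainingStatus.TRAINING, TrainingStatus.AUTOMATED)
    # Assumption: only AUTOMATED bypasses the human transcriptionist;
    # STOP_AUTOMATION is assumed to be manual-only.
    manual = status is not TrainingStatus.AUTOMATED
    # Stated for enrollment, training and automated; assumed for
    # STOP_AUTOMATION (the transcribed file is returned).
    returned = (
        "written_text" if status is TrainingStatus.AUTOMATED
        else "transcribed_file"
    )
    return Plan(manual, recognise, learning, learning, returned)


if __name__ == "__main__":
    for status in TrainingStatus:
        print(status.value, plan_for(status))
```

Keeping the rules in one pure function makes each assumption visible in a single place, so it can be corrected against the full specification without touching any routing code.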
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB0324945A GB2390930B (en) | 1999-02-05 | 2000-02-04 | System and method for automating transcription services |
| GB0324946A GB2391100B (en) | 1999-02-05 | 2000-02-04 | System and method for automating transcription services |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11894999P | 1999-02-05 | 1999-02-05 | |
| PCT/US2000/002808 WO2000046787A2 (en) | 1999-02-05 | 2000-02-04 | System and method for automating transcription services |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| GB0118231D0 (en) | 2001-09-19 |
| GB2361569A (en) | 2001-10-24 |
| GB2361569B (en) | 2003-12-24 |
Family
ID=22381731
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB0118231A (Expired - Fee Related) GB2361569B (en) | 1999-02-05 | 2000-02-04 | System and method for automating transcription services |
Country Status (5)
| Country | Link |
|---|---|
| AU (1) | AU3588200A (en) |
| CA (1) | CA2362462A1 (en) |
| GB (1) | GB2361569B (en) |
| HK (1) | HK1041086A1 (en) |
| WO (1) | WO2000046787A2 (en) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7383187B2 (en) | 2001-01-24 | 2008-06-03 | Bevocal, Inc. | System, method and computer program product for a distributed speech recognition tuning platform |
| ATE300084T1 (en) | 2001-03-16 | 2005-08-15 | Koninkl Philips Electronics Nv | TRANSCRIPTION SERVICE WITH CANCEL OF AUTOMATIC TRANSCRIPTION |
| DE10126020A1 (en) * | 2001-05-28 | 2003-01-09 | Olaf Berberich | Automatic conversion of words spoken by speaker into digitally coded terms for processing by computer involves displaying term rejections in correction window for direct entry correction |
| GB2388739B (en) | 2001-11-03 | 2004-06-02 | Dremedia Ltd | Time ordered indexing of an information stream |
| GB2388738B (en) | 2001-11-03 | 2004-06-02 | Dremedia Ltd | Time ordered indexing of audio data |
| WO2008041083A2 (en) * | 2006-10-02 | 2008-04-10 | Bighand Ltd. | Digital dictation workflow system and method |
| US8024289B2 (en) | 2007-07-31 | 2011-09-20 | Bighand Ltd. | System and method for efficiently providing content over a thin client network |
| CN109285548A (en) * | 2017-07-19 | 2019-01-29 | 阿里巴巴集团控股有限公司 | Information processing method, system, electronic equipment and computer storage medium |
| CN116074150B (en) * | 2023-03-02 | 2023-06-09 | 广东浩博特科技股份有限公司 | Switch control method and device for intelligent home and intelligent home |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5799273A (en) * | 1996-09-24 | 1998-08-25 | Allvoice Computing Plc | Automated proofreading using interface linking recognized words to their audio data while text is being changed |
| US5875448A (en) * | 1996-10-08 | 1999-02-23 | Boys; Donald R. | Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator |
2000
- 2000-02-04: AU application AU35882/00A, published as AU3588200A (en); status: Abandoned (not active)
- 2000-02-04: GB application GB0118231A, published as GB2361569B (en); status: Expired - Fee Related (not active)
- 2000-02-04: CA application CA002362462A, published as CA2362462A1 (en); status: Abandoned (not active)
- 2000-02-04: WO application PCT/US2000/002808, published as WO2000046787A2 (en); status: Ceased (not active)
- 2000-02-04: HK application HK02101880.9A, published as HK1041086A1 (en); status: unknown
Non-Patent Citations (1)
| Title |
|---|
| Dragon Dictate for Windows 2.0, User's Guide, British version, First edition, pages 1 to 230 * |
Also Published As
| Publication number | Publication date |
|---|---|
| GB0118231D0 (en) | 2001-09-19 |
| GB2361569B (en) | 2003-12-24 |
| HK1041086A1 (en) | 2002-06-28 |
| CA2362462A1 (en) | 2000-08-10 |
| WO2000046787A2 (en) | 2000-08-10 |
| WO2000046787A3 (en) | 2000-12-14 |
| AU3588200A (en) | 2000-08-25 |
Similar Documents
| Publication | Title |
|---|---|
| CA2351705A1 (en) | System and method for automating transcription services |
| AP2001002243A0 (en) | Automated transcription system and method using two speech converting instances and computer-assisted correction. |
| ATE297588T1 (en) | ADJUSTING PHONETIC CONTEXT TO IMPROVE SPEECH RECOGNITION |
| ATE314718T1 (en) | SPEAKER ADAPTED VOICE RECOGNITION |
| KR100321841B1 (en) | Automatically updating language models |
| CN108766441A (en) | A kind of sound control method and device based on offline Application on Voiceprint Recognition and speech recognition |
| CN105304080A (en) | Speech synthesis device and speech synthesis method |
| Stan et al. | The SWARA speech corpus: A large parallel Romanian read speech dataset |
| CA2537503A1 (en) | Unsupervised and active learning in automatic speech recognition for call classification |
| ATE407411T1 (en) | METHOD FOR PROVIDING ACCOUNT INFORMATION AND SYSTEM FOR WRITING DICTATE TEXT |
| CN103915093B (en) | A kind of method and apparatus for realizing singing of voice |
| Komatani et al. | User modeling in spoken dialogue systems to generate flexible guidance |
| GB2361569A (en) | System and method for automating transcription services |
| Matoušek | Building of a speech corpus optimised for unit selection TTS synthesis |
| CN106653002A (en) | Literal live broadcasting method and platform |
| CN111179903A (en) | Voice recognition method and device, storage medium and electric appliance |
| CN110503941A (en) | Language competence evaluating method, device, system, computer equipment and storage medium |
| Hincks | Processing the prosody of oral presentations |
| Mamiya et al. | Lightly supervised GMM VAD to use audiobook for speech synthesiser |
| CN104735461B (en) | The replacing options and device of voice AdWords in video |
| Komatani et al. | User modeling in spoken dialogue systems for flexible guidance generation. |
| Mögele et al. | SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus. |
| Hahn et al. | An improved speech detection algorithm for isolated Korean utterances |
| JP6594273B2 (en) | Questioning utterance determination device, method and program thereof |
| Peiró-Lilja et al. | LaFresCat: a Catalan multi-accent speech dataset for text-to-speech |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | REG | Reference to a national code | Ref country code: HK; Ref legal event code: WD; Ref document number: 1041086; Country of ref document: HK |
| | PCNP | Patent ceased through non-payment of renewal fee | Effective date: 20100204 |