WO2004000472A1 - Procede de traitement d'objets postaux utilisant la synthese vocale - Google Patents
Procede de traitement d'objets postaux utilisant la synthese vocale Download PDFInfo
- Publication number
- WO2004000472A1 WO2004000472A1 PCT/FR2003/001764 FR0301764W WO2004000472A1 WO 2004000472 A1 WO2004000472 A1 WO 2004000472A1 FR 0301764 W FR0301764 W FR 0301764W WO 2004000472 A1 WO2004000472 A1 WO 2004000472A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- operator
- video coding
- postal
- image
- speech synthesis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B07—SEPARATING SOLIDS FROM SOLIDS; SORTING
- B07C—POSTAL SORTING; SORTING INDIVIDUAL ARTICLES, OR BULK MATERIAL FIT TO BE SORTED PIECE-MEAL, e.g. BY PICKING
- B07C3/00—Sorting according to destination
- B07C3/20—Arrangements for facilitating the visual reading of addresses, e.g. display arrangements coding stations
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/98—Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
- G06V10/987—Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns with the intervention of an operator
Definitions
- the invention relates to a method for processing postal objects in which an image of a postal object is presented on a video coding station, and on the basis of this presentation, an operator is asked to provide postal address information. through the video coding station.
- An automatic sorting process for letter, flat or packet-type postal objects generally includes capturing a digital image of each object.
- An optical character recognition processing is then applied to this image to identify the recipient address appearing on the postal object.
- This recognition processing can fail, that is to say providing a solution with a very low confidence rate, or several solutions from which it was not possible to choose. What is called here solution corresponds for example to a part of the unrecognized recipient address: track name, company or person name, number in the track, post office box number.
- the digital image of the object is presented on a screen of the video coding station so that an operator provides address information, that is to say so that it confirms the 'one of the proposed solutions.
- the image and the solutions are displayed simultaneously so that the operator makes his selection by comparing each solution with the address appearing in the image.
- the object of the invention is to propose an improvement to the existing video coding methods to improve the comfort of the operator and reduce the processing time.
- the subject of the invention is a method for processing postal objects in which an image of a postal object is presented on a video coding station, and on the basis of this presentation, an operator is asked to supply postal address information through the video coding station, characterized in that the operator is called upon by voice synthesis. With this process, the operator reads the address appearing in the image at the same time as a solution is stated to him by speech synthesis.
- the solution is offered to the operator through a helmet listening. In the case where several solutions are possible, they are proposed successively stated to the operator.
- the idea underlying the invention is to use speech synthesis so that the operator reads the address appearing in the image which is presented to him at the same time as a solution is stated to him by speech synthesis.
- the single figure shows a video coding station 1 connected to a computerized management system of a postal sorting installation, this post includes a screen 2 for displaying digital images 3 of postal objects to an operator 4.
- This video coding station receives one or more solutions from the computerized management system resulting from an optical character recognition processing applied to the image 3.
- the solutions are offered to the operator by voice synthesis voice, so that by comparing the address presented to it in image 3 to the solution stated to it, the operator 4 provides its address information by confirming or denying the proposed solution.
- the station is arranged so that the operator can confirm the solution stated to him by pressing a single key on the keyboard 5.
- the video coding station may include a headset 6 connected to the central unit 7 to improve the operator's working comfort 4.
- the use of such a headset 6 makes it possible to equip the various video coding stations present in the same video coding room to use voice synthesis on each station without the operators disturbing each other.
- the video coding station is a computer equipped with a voice synthesis program, connected to the headset 6 through a sound card.
- This video coding station which is connected to the management system of the sorting installation is thus able to convert the solutions resulting from the recognition processing which are text messages into audible signals audible by the operator in the helmet 6.
- Such text-to-speech programs are currently available on the market.
- the chosen text-to-speech program will be able to work in several languages.
- the addresses of recipients can be entered in French, or in Dutch. It is therefore essential that the speech synthesis program reads in French or Dutch, depending on the results given by the optical character recognition processing.
- the optical character recognition processing fails, the latter can return a plurality of possible solutions, with a confidence rate associated with each of them.
- the different solutions are stated successively to the operator until he confirms the correct one to resolve the ambiguity resulting from the processing.
- the different solutions are stated in decreasing order of confidence, so that the first solution stated has the greatest probability of being the right one.
- the management system could advantageously be arranged to propose to the operator to manually enter the address which he reads in the image.
- the address or part of the address not recognized by the processing can be framed or extracted from the original image.
- the digital image 3 corresponds to an address block in which a word corresponding to the name of the channel 8 is surrounded by dotted lines to indicate to the operator that it is the part remaining to be identified.
- the invention can also be applied to manual coded entry on a video coding station.
- the coded manual entry is used for example in the case where none of the solutions proposed at the end of the automatic character recognition processing have been confirmed by the operator.
- the operator To reduce the input time, the operator only enters part of the unrecognized address line, also called extract, on his keyboard.
- a management program then assigns a value to this extract, but sometimes several solutions correspond to the same extract.
- the video coding station is arranged to request the operator by synthesis vocal by enunciating successively the different solutions corresponding to the extract he has grasped. More particularly, the different solutions are then listed one after the other until the operator confirms the one he wishes to enter using, for example, the station keyboard.
- the video coding station 1 illustrated in the figure is under the control of a multi-tasking software application using the "Windows NT, 2000" operating system.
- This application is part of a larger set including an image server and a supervision system, which are part of the sorting system consisting of sorting machines (letters, flat objects, packages), automatic recognition systems. OCR address, barcode readers, etc.
- the supervision system is a graphical software application of the "Windows" type having windows and drop-down menus to control and manage the stocks of images and the image database of the image server on the one hand and manage connections and assignments of video coding operators to coding tasks on the other hand.
- the image server receives as input the images that are not completely resolved by the OCR address recognition systems located upstream of the sorting process. In the case of completely unresolved images, the OCR systems transmit the partial results that they have successfully determined to the image server.
- the image server stores in separate image queues the images to be processed and this according to the results obtained (no information, postal code, several hypotheses of streets, specific street but number of lane not determined .). This organization then makes it possible to assign coding consoles to specific image queues in order to make video coding more efficient.
- the image server submits these images to the coding consoles and receives results in return. These allow the image server to make a decision on whether or not to continue processing images.
- the image server stores these results in a result base for transmission to the sorting machines.
- the various elements of the video coding system (supervision software, coding console, image server) communicate with each other by exchanging messages using the "TCP / IP" communication protocol.
- On the video coding station 1 is installed a postal database used by the video coding software in coding tasks for resolving addresses. This postal base is identical to that used on the OCR systems located upstream.
- Text-to-speech is a feature integrated into the video coding software application in the form of a library which allows, among other things, to adjust the sampling frequency, the language used, the communication protocol of the sound card.
- the video coding software After the display of the image on screen 2 of the video coding station, the video coding software extracts information concerning the nature of the task to be performed and uses the coordinates of the address blocks to draw a frame. (shown in the figure in dotted lines) around address information requiring processing by video coding. This information is available in the video coding software in the form of text and is submitted to the voice synthesis library through one of its access functions to be reproduced in sound form through the headset 6.
- the video coding software scans the keys of the keyboard 5 pressed by the operator during the speech synthesis process.
- we. can significantly increase the bit rate of video coding due to the parallelism of the tasks of displaying the image and of speaking in vocal form of the solutions to be confirmed.
- the speed of video coding can be increased by around 10% compared to video coding systems which do not use speech synthesis.
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Sorting Of Articles (AREA)
- Character Discrimination (AREA)
- Document Processing Apparatus (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Machine Translation (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
Description
Claims
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP03760724A EP1526926B1 (fr) | 2002-06-19 | 2003-06-12 | Procede de traitement d'objets postaux utilisant la synthese vocale |
| US10/473,421 US20050119898A1 (en) | 2002-06-19 | 2003-06-12 | Method for processing postal objects using speech synthesis |
| JP2004514920A JP2005529743A (ja) | 2002-06-19 | 2003-06-12 | 音声合成を使用して郵便物を処理する方法 |
| CA002487130A CA2487130A1 (fr) | 2002-06-19 | 2003-06-12 | Procede de traitement d'objets postaux utilisant la synthese vocale |
| DE60318448T DE60318448T2 (de) | 2002-06-19 | 2003-06-12 | Verfahren zur verarbeitung von poststücken unter verwendung von sprachsynthesen |
| AU2003253068A AU2003253068A1 (en) | 2002-06-19 | 2003-06-12 | Method for processing postal objects using speech synthesis |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR0207581A FR2841160B1 (fr) | 2002-06-19 | 2002-06-19 | Procede de traitement d'objets postaux utilisant la synthese vocale |
| FR02/07581 | 2002-06-19 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2004000472A1 true WO2004000472A1 (fr) | 2003-12-31 |
| WO2004000472A8 WO2004000472A8 (fr) | 2005-03-10 |
Family
ID=29719884
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/FR2003/001764 Ceased WO2004000472A1 (fr) | 2002-06-19 | 2003-06-12 | Procede de traitement d'objets postaux utilisant la synthese vocale |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US20050119898A1 (fr) |
| EP (1) | EP1526926B1 (fr) |
| JP (1) | JP2005529743A (fr) |
| AT (1) | ATE382438T1 (fr) |
| AU (1) | AU2003253068A1 (fr) |
| CA (1) | CA2487130A1 (fr) |
| DE (1) | DE60318448T2 (fr) |
| ES (1) | ES2297215T3 (fr) |
| FR (1) | FR2841160B1 (fr) |
| WO (1) | WO2004000472A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2012085003A1 (fr) | 2010-12-22 | 2012-06-28 | Katholieke Universiteit Leuven, K.U. Leuven R&D | 2 -hydroxyisoquinoline- 1, 3 ( 2h, 4h) - diones et composés associés servant d'inhibiteurs de la réplication du vih |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4921107A (en) * | 1988-07-01 | 1990-05-01 | Pitney Bowes Inc. | Mail sortation system |
| US5558232A (en) * | 1994-01-05 | 1996-09-24 | Opex Corporation | Apparatus for sorting documents |
| US5677834A (en) * | 1995-01-26 | 1997-10-14 | Mooneyham; Martin | Method and apparatus for computer assisted sorting of parcels |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2003005289A1 (fr) * | 1997-03-03 | 2003-01-16 | Keith Whited | Systeme de stockage, de recherche et d'affichage d'echantillons marins |
| DE19718805C2 (de) * | 1997-05-03 | 1999-11-04 | Siemens Ag | Verfahren und Anordnung zum Erkennen von Verteilinformationen |
| US6327343B1 (en) * | 1998-01-16 | 2001-12-04 | International Business Machines Corporation | System and methods for automatic call and data transfer processing |
| KR100664452B1 (ko) * | 1998-02-03 | 2007-01-04 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 코딩된 비디오 시퀀스들을 스위칭하는 방법 및 그것에 대응하는 장치 |
| US6976032B1 (en) * | 1999-11-17 | 2005-12-13 | Ricoh Company, Ltd. | Networked peripheral for visitor greeting, identification, biographical lookup and tracking |
| US6867875B1 (en) * | 1999-12-06 | 2005-03-15 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for simplifying fax transmissions using user-circled region detection |
| US6466847B1 (en) * | 2000-09-01 | 2002-10-15 | Canac Inc | Remote control system for a locomotive using voice commands |
| US6823084B2 (en) * | 2000-09-22 | 2004-11-23 | Sri International | Method and apparatus for portably recognizing text in an image sequence of scene imagery |
-
2002
- 2002-06-19 FR FR0207581A patent/FR2841160B1/fr not_active Expired - Fee Related
-
2003
- 2003-06-12 DE DE60318448T patent/DE60318448T2/de not_active Expired - Lifetime
- 2003-06-12 CA CA002487130A patent/CA2487130A1/fr not_active Abandoned
- 2003-06-12 JP JP2004514920A patent/JP2005529743A/ja active Pending
- 2003-06-12 AU AU2003253068A patent/AU2003253068A1/en not_active Abandoned
- 2003-06-12 ES ES03760724T patent/ES2297215T3/es not_active Expired - Lifetime
- 2003-06-12 EP EP03760724A patent/EP1526926B1/fr not_active Expired - Lifetime
- 2003-06-12 US US10/473,421 patent/US20050119898A1/en not_active Abandoned
- 2003-06-12 AT AT03760724T patent/ATE382438T1/de not_active IP Right Cessation
- 2003-06-12 WO PCT/FR2003/001764 patent/WO2004000472A1/fr not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4921107A (en) * | 1988-07-01 | 1990-05-01 | Pitney Bowes Inc. | Mail sortation system |
| US5558232A (en) * | 1994-01-05 | 1996-09-24 | Opex Corporation | Apparatus for sorting documents |
| US5677834A (en) * | 1995-01-26 | 1997-10-14 | Mooneyham; Martin | Method and apparatus for computer assisted sorting of parcels |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2012085003A1 (fr) | 2010-12-22 | 2012-06-28 | Katholieke Universiteit Leuven, K.U. Leuven R&D | 2 -hydroxyisoquinoline- 1, 3 ( 2h, 4h) - diones et composés associés servant d'inhibiteurs de la réplication du vih |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2005529743A (ja) | 2005-10-06 |
| DE60318448T2 (de) | 2009-01-02 |
| EP1526926B1 (fr) | 2008-01-02 |
| FR2841160A1 (fr) | 2003-12-26 |
| CA2487130A1 (fr) | 2003-12-31 |
| ES2297215T3 (es) | 2008-05-01 |
| FR2841160B1 (fr) | 2004-07-23 |
| ATE382438T1 (de) | 2008-01-15 |
| WO2004000472A8 (fr) | 2005-03-10 |
| US20050119898A1 (en) | 2005-06-02 |
| AU2003253068A1 (en) | 2004-01-06 |
| EP1526926A1 (fr) | 2005-05-04 |
| DE60318448D1 (de) | 2008-02-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7792701B2 (en) | Method and computer program product for providing accessibility services on demand | |
| CN1581294B (zh) | 语音识别增强的呼叫者识别 | |
| CN111931666B (zh) | 凭证自动化处理系统及方法 | |
| US20090108057A1 (en) | Using Quick Response Codes to Provide Interactive Services | |
| DE69724893T2 (de) | Datenverarbeitungsgerät mit kommunikationsfunktion | |
| US10999640B2 (en) | Automatic embedding of information associated with video content | |
| US20040034522A1 (en) | Method and apparatus for seamless transition of voice and/or text into sign language | |
| CN1573762A (zh) | 维护与检查系统和方法 | |
| CN1288619A (zh) | 用于交互式通信的方法 | |
| JP7284786B2 (ja) | データをラベリングするための方法、装置、電子機器、コンピュータ可読記憶媒体およびコンピュータプログラム | |
| US11019225B2 (en) | Dynamic image capture device control system | |
| EP0422195A1 (fr) | Procede et systeme de tri d'objets portant des inscriptions, tels que des objets postaux, des cheques, des mandats | |
| CN111343185A (zh) | 一种柜员机交互方法及交互系统 | |
| US7324948B2 (en) | Context-specific contact information | |
| CN115210703A (zh) | 知识信息制作支援装置 | |
| WO2000054252A2 (fr) | Procede avec plusieurs reconnaisseurs vocaux | |
| WO2001058160A1 (fr) | Système et procédé pour la diffusion d'objets images | |
| EP1526926B1 (fr) | Procede de traitement d'objets postaux utilisant la synthese vocale | |
| US11830154B2 (en) | AR-based information displaying method and device, AR apparatus, electronic device and medium | |
| KR102342188B1 (ko) | 화상 회의 서비스 제공 장치 및 방법 | |
| KR20180057990A (ko) | 직원 별 맞춤 학습 영상 데이터 제공 방법 및 이를 실행하는 시스템 | |
| US20060161488A1 (en) | Data confirming system and data confirming method | |
| CN115660874A (zh) | 银行业务的入账方法、装置、存储介质及计算机设备 | |
| DE102018201711B4 (de) | Anordnung und Verfahren zum Bereitstellen von Informationen bei einer kopftragbaren erweiterte-Realität-Vorrichtung | |
| CN118690265B (zh) | 一种事故车消息的管理方法、装置、电子设备及介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| WWE | Wipo information: entry into national phase |
Ref document number: 10473421 Country of ref document: US |
|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2003760724 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2487130 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2004514920 Country of ref document: JP |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) |
Free format text: EXCEPT/SAUF EP (CZ, EE, HU, RO, SI, SK) |
|
| CFP | Corrected version of a pamphlet front page | ||
| CR1 | Correction of entry in section i |
Free format text: IN PCT GAZETTE 01/2004 DUE TO A TECHNICAL PROBLEMAT THE TIME OF INTERNATIONAL PUBLICATION, SOME INFORMATION WAS MISSING UNDER (81). THE MISSING INFORMATION NOW APPEARS IN THE CORRECTED VERSION |
|
| WWP | Wipo information: published in national office |
Ref document number: 2003760724 Country of ref document: EP |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2003760724 Country of ref document: EP |