HK1207199B - Methods and systems for managing adaptation data - Google Patents
Methods and systems for managing adaptation data Download PDFInfo
- Publication number
- HK1207199B HK1207199B HK15107505.6A HK15107505A HK1207199B HK 1207199 B HK1207199 B HK 1207199B HK 15107505 A HK15107505 A HK 15107505A HK 1207199 B HK1207199 B HK 1207199B
- Authority
- HK
- Hong Kong
- Prior art keywords
- speech
- transaction
- adaptation data
- data
- target device
- Prior art date
Links
Description
发明人:Inventor:
Royce A.LevienRoyce A. Levien
Richard T.LordRichard T. Lord
Robert W.LordRobert W. Lord
Mark A.MalamudMark A. Malamud
关联申请的交叉引用Cross-references to related applications
如果在本申请的提交日已提交了申请数据表(ADS),则该申请数据表援引包含于此。在ADS基础上对于根据35USC§§119、120、121或365(c)规定的优先权要求的任何申请、以及这些申请任何和全部父申请、祖父申请、曾祖父申请等也援引包含于此,包括在那些申请中作出的任何优先权要求以及援引包括的任何素材,只要该主题不至与这里不一致。If an Application Data Sheet (ADS) was filed on the filing date of this application, the Application Data Sheet is incorporated herein by reference. Any application claiming priority under 35 U.S.C. §§119, 120, 121, or 365(c) on the basis of the ADS, and any and all parent, grandfather, great-grandfather, etc. applications of such applications, are also incorporated herein by reference, including any priority claims made in those applications and any material incorporated by reference, to the extent such subject matter is not inconsistent herewith.
关联申请: Related application :
本申请关联于和/或要求来自下面列出的申请(“优先权申请”)的最早可得有效提交日的权益,如果有的话,列出如下(例如要求除却临时专利申请的最早可得的优先权日或要求对于临时专利申请根据35USC§§119(e)的权益,对于优先权申请的任何和全部父申请、祖父申请、曾祖父申请权的权益)。另外,本申请涉及“关联申请”,如果有的话,列出如下。This application is related to and/or claims the benefit of the earliest available effective filing date, if any, of the applications listed below (the "priority applications") (e.g., claiming the benefit of any and all parent, grandfather, or great-grandfather applications of the priority applications in addition to the earliest available priority date of the provisional patent application or claiming the benefit of 35 U.S.C. §§119(e) with respect to the provisional patent application). In addition, this application is related to "related applications," if any, which are listed below.
优先权申请: Priority application :
为了USPTO额外法令要求,本申请构成下列申请的部分延续:美国专利申请No.13/538,866,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITIONADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、RichardT.Lord、Robert W.Lord和Mark A.Malamud,2012年6月29日以委托案号0711-003-042-C00001提交,该申请目前共同未决或者是其当前共同未决申请有资格享受其提交日的权益的申请;并且是美国专利申请No.13/538,855的延续,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和Mark A.Malamud,2012年6月29日以委托案号0711-003-042-000000提交。For purposes of additional USPTO statutory requirements, this application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/538,866, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud, filed Jun. 29, 2012, under Attorney Docket No. 0711-003-042-C00001, which is currently co-pending or is an application that is eligible for the benefit of the filing date of a currently co-pending application; and a continuation-in-part of U.S. patent application Ser. No. 13/538,855, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud. A. Malamud, filed on June 29, 2012 under Mandate No. 0711-003-042-000000.
为了USPTO额外法令要求,本申请构成下列申请的部分延续:美国专利申请No.13/564,649,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITIONADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、RichardT.Lord、Robert W.Lord和Mark A.Malamud,2012年8月1日以委托案号0711-003-043-C00001提交,该申请目前共同未决或者是其当前共同未决申请有资格享受其提交日权益的申请;并且是美国专利申请No.13/564,647的延续,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和Mark A.Malamud,2012年8月1日以委托案号0711-003-043-000000提交。For purposes of additional USPTO statutory requirements, this application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/564,649, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud, filed Aug. 1, 2012, under Attorney Docket No. 0711-003-043-C00001, which is currently co-pending or is an application that is eligible for the benefit of the filing date of a currently co-pending application; and a continuation-in-part of U.S. patent application Ser. No. 13/564,647, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud. A. Malamud, filed on August 1, 2012 under Mandate No. 0711-003-043-000000.
为了USPTO额外法令要求,本申请构成下列申请的部分延续:美国专利申请No.13/564,651,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITIONADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、RichardT.Lord、Robert W.Lord和Mark A.Malamud,2012年8月1日以委托案号0711-003-044-C00001提交,该申请目前共同未决或者是其当前共同未决申请有资格享受其提交日权益的申请;并且是美国专利申请No.13/564,650的延续,该申请题为基于适应数据的讲话识别适应系统(SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和Mark A.Malamud,2012年8月1日以委托案号0711-003-044-000000提交。For purposes of additional USPTO statutory requirements, this application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/564,651, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud, filed Aug. 1, 2012, under Attorney Docket No. 0711-003-044-C00001, which is currently co-pending or is an application that is eligible for the benefit of the filing date of a currently co-pending application; and a continuation-in-part of U.S. patent application Ser. No. 13/564,650, entitled SPEECH RECOGNITION ADAPTATION SYSTEMS BASED ON ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud. A. Malamud, filed on August 1, 2012 under Mandate No. 0711-003-044-000000.
为了USPTO额外法令要求,本申请构成下列申请的部分延续:美国专利申请No.13/609142,该申请题为用于讲话适应数据的方法和系统(METHODS AND SYSTEMS FOR SPEECHADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和MarkA.Malamud,2012年9月10日以委托案号0711-003-045-C00001提交,该申请目前共同未决或者是其当前共同未决申请有资格享受其提交日权益的申请;并且是美国专利申请No.13/609,139的延续,该申请题为用于讲话适应数据的方法和系统(METHODS AND SYSTEMS FORSPEECH ADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和Mark A.Malamud,2012年9月10日以委托案号0711-003-045-000000提交。For purposes of additional USPTO statutory requirements, this application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/609,142, entitled METHODS AND SYSTEMS FOR SPEECH ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud, filed Sep. 10, 2012, under Attorney Docket No. 0711-003-045-C00001, which is currently co-pending or is an application that is eligible for the benefit of the filing date of a currently co-pending application; and a continuation-in-part of U.S. patent application Ser. No. 13/609,139, entitled METHODS AND SYSTEMS FOR SPEECH ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud. A. Malamud, filed on September 10, 2012 under Mandate No. 0711-003-045-000000.
为了USPTO额外法令要求,本申请构成下列申请的部分延续:美国专利申请No.13/609145,该申请题为用于讲话适应数据的方法和系统(METHODS AND SYSTEMS FOR SPEECHADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和MarkA.Malamud,2012年9月10日以委托案号0711-003-046-C00001提交,该申请目前共同未决或者是其当前共同未决申请有资格享受其提交日权益的申请;并且是美国专利申请No.13/609143的延续,该申请题为用于讲话适应数据的方法和系统(METHODS AND SYSTEMS FORSPEECH ADAPTATION DATA),发明人为Royce A.Levien、Richard T.Lord、Robert W.Lord和Mark A.Malamud,2012年9月10日以委托案号0711-003-046-000000提交。For purposes of additional USPTO statutory requirements, this application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/609,145, entitled METHODS AND SYSTEMS FOR SPEECH ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud, filed Sep. 10, 2012, under Attorney Docket No. 0711-003-046-C00001, which is currently co-pending or is an application that is eligible for the benefit of the filing date of a currently co-pending application; and a continuation-in-part of U.S. patent application Ser. No. 13/609,143, entitled METHODS AND SYSTEMS FOR SPEECH ADAPTATION DATA, inventors Royce A. Levien, Richard T. Lord, Robert W. Lord, and Mark A. Malamud. A. Malamud, filed on September 10, 2012 under Mandate No. 0711-003-046-000000.
美国专利局(USPTO)已公布一生效通知,即USPTO的计算机程序要求专利申请人既引用序列号又指出申请是父申请的延续、部分延续还是分案。Stephen G.Kunin,在先提交申请的权益,USPTO官方公报2003年3月18日。USPTO进一步已提供申请数据表的格式,这种格式允许自动加载著录项目数据但需要将每件申请标识为父申请的延续、部分延续或分案。当前申请人实体(下文中称其为“申请人”)在前面已将特定引用提供给从中根据法令引述要求优先权的申请。申请人明白,该法令就其特定引用语言来说是明确的并且不需要序列号或任何特征化,例如“延续”或“部分延续”,以要求美国专利申请的优先权。尽管前述内容,申请人明白USPTO的计算机程序具有某些数据录入要求并因此申请人已提供如前所述和本申请提交的任何ADS中的本申请及其父申请之间的关系指定,但明确地指出这些指定无论如何不被解释成对于本申请除其父申请的事项外是否包含任何新事项的任何类型的评论和/或承认。The United States Patent Office (USPTO) has published an effective notice that the USPTO's computer program requires patent applicants to both cite a serial number and indicate whether the application is a continuation, continuation-in-part, or division of a parent application. Stephen G. Kunin, Benefit of Prior-Filed Applications, USPTO Official Gazette, March 18, 2003. The USPTO has further provided a format for an application data sheet that allows for automatic loading of bibliographic data but requires identification of each application as a continuation, continuation-in-part, or division of a parent application. The present applicant entity (hereinafter referred to as "applicant") has previously provided specific citations to the applications from which priority is claimed under the statute. Applicant understands that the statute is unambiguous as to its specific citation language and does not require a serial number or any characterization, such as "continuation" or "continuation-in-part," to claim priority to a U.S. patent application. Notwithstanding the foregoing, Applicant understands that the USPTO's computer programs have certain data entry requirements and, accordingly, Applicant has provided designations of the relationship between this application and its parent applications as previously described and in any ADS filed with this application, but expressly notes that these designations are not to be construed in any way as any kind of comment and/or admission as to whether this application contains any new matter beyond that of its parent applications.
如果前面给出的申请的列表与经由ADS提供的列表不一致,则申请人意图要求出现在ADS的优先权申请章节中的每件申请和出现在本申请的优先权申请章节中的每件申请的优先权。To the extent that the list of applications given above is inconsistent with the list provided via the ADS, applicant intends to claim priority from each application appearing in the Priority Applications section of the ADS and from each application appearing in the Priority Applications section of the present application.
优先权申请和关联申请以及优先权申请和关联申请的任何和全部父申请、祖父申请、曾祖父申请的所有主题(包括任何优先权要求)援引包含于此,只要该主题不至与这里不一致。All subject matter of the priority application and related applications, and any and all parent, grandfather, and great-grandfather applications of the priority application and related applications, including any priority claims, is incorporated herein by reference to the extent such subject matter is not inconsistent herewith.
背景background
本申请涉及适应数据。This application relates to adaptation data.
发明内容Summary of the Invention
在一个或多个各个方面,一种方法包括但不限于管理适应数据,其中适应数据关联于特定方的讲话的至少一个方面,响应与特定方的利于讲话的交易关联的指示,促进将适应数据传输至目标设备,其中适应数据关联于特定方的讲话的至少一个方面,以及确定是否对适应数据进行更新,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。除了前述内容,构成本文阐述的公开内容一部分的权利要求书、附图和正文中描述了其它方法方面。In one or more various aspects, a method includes, but is not limited to, managing adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party, facilitating transmission of the adaptation data to a target device in response to an indication associated with a speech-facilitated transaction with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party, and determining whether to update the adaptation data, the determination being based at least in part on a result of at least a portion of the speech-facilitated transaction. In addition to the foregoing, further method aspects are described in the claims, drawings, and text forming a part of the disclosure set forth herein.
本申请的权利要求书、说明书和附图可以操作性/功能性语言将一个或多个当前技术描述为例如拟由计算机执行的一组操作。在大多数情况下,这种操作性/功能性描述将被本领域内技术人员理解为专门配置的硬件(例如由于通用计算机一旦被编程以遵循来自程序软件的指令执行特定功能则事实上成为专用计算机)。The claims, description, and drawings of this application may describe one or more of the present technologies in operational/functional language, for example, as a set of operations to be performed by a computer. In most cases, such operational/functional descriptions will be understood by those skilled in the art to refer to specially configured hardware (e.g., since a general-purpose computer becomes a special-purpose computer once it is programmed to perform specific functions following instructions from program software).
重要的是,尽管本文描述的操作性/功能性描述可通过人的头脑理解,但它们不是与那些操作/功能的计算实现分离的操作/功能的抽象理念。相反,这些操作/功能代表十分复杂的计算机器或其它装置的规范。如下面详细讨论的,操作性/功能性语言必须在其正确的技术背景下阅读,即作为物理实现的具体规范。Importantly, while the operational/functional descriptions described herein are understandable to the human mind, they are not abstract concepts of operations/functions separate from the computational implementation of those operations/functions. Rather, these operations/functions represent specifications of quite complex computing machines or other devices. As discussed in detail below, operational/functional language must be read in its proper technical context, i.e., as a concrete specification of a physical implementation.
本文描述的逻辑操作/功能是机器规范或由操作/功能规定的其它物理机构的提炼,以使如若不然难以理解的机器规范可由人的头脑领会。这种提炼也允许本领域内技术人员跨许多不同特定卖家的硬件配置或平台适应技术的操作性/功能性描述,而不仅限于特定卖家的硬件配置或平台。The logical operations/functions described herein are abstractions of the machine specifications or other physical mechanisms specified by the operations/functions, so that the otherwise incomprehensible machine specifications can be grasped by the human mind. This abstraction also allows those skilled in the art to adapt the operational/functional description of the technology across many different vendor-specific hardware configurations or platforms, rather than being limited to a particular vendor's hardware configuration or platform.
一些当前技术描述(例如详细描述、附图、权利要求书等)可按照逻辑操作/功能进行阐述。如下面段落中更详细描述的那样,这些逻辑操作/功能不是抽象理念的代表,而是各硬件组件的静态或顺序规范的表征。换种说法,除非上下文指出相反情形,否则逻辑操作/功能将被本领域内技术人员理解为表示各硬件组件的静态或顺序的规范。由于本领域内技术人员可用来实现以操作性/功能性格式阐述的技术公开的工具——以高级编程语言(例如C、Java、visual basic等)形式出现的工具或以非常高速硬件描述语言(VHDL,它是使用文本来描述逻辑电路的语言)形式出现的工具——是各硬件配置的静态或顺序规范的发生器,因此这是真的。这个事实有时被宽泛术语“软件”晦涩化,如下面解释中示出的,本领域内技术人员理解,被冠名为“软件”的东西是对于有序物质要素的十分复杂的相互联系/规范的的速记。术语“有序物质要素”可指物理计算器件,例如电子逻辑门的组件、分子计算逻辑组成、量子计算机构等。Some current technical descriptions (e.g., detailed descriptions, drawings, claims, etc.) may be formulated in terms of logical operations/functions. As described in more detail in the following paragraphs, these logical operations/functions are not representations of abstract concepts, but rather representations of static or sequential specifications for various hardware components. Stated another way, unless the context indicates otherwise, logical operations/functions will be understood by those skilled in the art to represent static or sequential specifications for various hardware components. This is true because the tools available to those skilled in the art for implementing technical disclosures formulated in an operational/functional format—tools in the form of high-level programming languages (e.g., C, Java, Visual Basic, etc.) or tools in the form of Very High Speed Hardware Description Language (VHDL, a language for describing logic circuits using text—are generators of static or sequential specifications for various hardware configurations. This fact is sometimes obscured by the broad term "software." As illustrated in the following explanation, those skilled in the art understand that what is termed "software" is shorthand for quite complex interconnections/specifications of ordered physical elements. The term "ordered elements of matter" may refer to physical computing devices, such as components of electronic logic gates, molecular computing logic components, quantum computing mechanisms, etc.
例如,高级编程语言是具有强抽象性的编程语言,例如来自高级编程语言实际规定的机器的顺序组织、状态、输入、输出等细节的多层抽象性。例如参见Wikipedia,High-level programming language,http://en.wikipedia.org/wiki/High-level_programming_language(2012年6月5日21:00GMT)。为了利于人们领会,在许多情况下,高级编程语言与自然语言相似或甚至共享标记,例如参见Wikipedia,Natural language,http://en.wikipedia.org/wiki/Natural_language(2012年6月5日,21:00GMT)。For example, a high-level programming language is a programming language with strong abstractions, such as multiple layers of abstraction from the details of the machine's sequential organization, state, input, output, etc., that the high-level programming language actually specifies. See, for example, Wikipedia, "High-level programming language," http://en.wikipedia.org/wiki/High-level_programming_language (June 5, 2012, 21:00 GMT). To facilitate understanding, high-level programming languages are often similar to or even share notation with natural languages. See, for example, Wikipedia, "Natural language," http://en.wikipedia.org/wiki/Natural_language (June 5, 2012, 21:00 GMT).
已有争论,由于高级编程语言使用强抽象性(例如它们可能与自然语言或共享标志),因此它们是“纯头脑构筑的”(例如“软件”——计算机程序或计算机编程——某种程度上是不可言喻的头脑构筑,因为在高级抽象性下,它可用人脑构思和理解)。这种争论被用来以功能/操作的形式将技术描述表征为某一程度的“抽象理念”。事实上,在技术领域(例如信息和通信技术)中并不是这样。It has been argued that because high-level programming languages use strong abstractions (e.g., they may share symbols with natural languages), they are therefore "purely mind-constructed" (e.g., "software"—computer programs or computer programming—is in some way an ineffable mind-construct because, at a high level of abstraction, it can be conceived and understood by the human mind). This argument has been used to characterize technical descriptions in the form of functions/operations as having a certain degree of "abstraction." In reality, this is not the case in technical fields (e.g., information and communications technology).
高级编程语言使用强抽象性以利于人们理解的这个事实不应当被视为所表达的内容是抽象理念的指示。事实上,本领域内技术人员理解,只有相反情形才是真的。如果高级编程语言是用来以功能/操作的形式实现技术公开的工具,则本领域内技术人员将理解,远非以任何重大语义定义的抽象的、不准确的、“模糊的”或“头脑的”,这种工具相反是特定计算机器的接近不可理解的准确顺序规范——其一些部分通过随时间(例如时钟时间)从典型更一般的计算机器激活/选择这些部分而建立。这个事实有时因为高级编程语言和自然语言之间的表面相似性而变得晦涩。这些表面相似性也可造成对高级编程语言实现最终通过创建/控制许多不同的计算机器来执行有价值工作的这一事实的粉饰。The fact that high-level programming languages use strong abstractions to facilitate human understanding should not be taken as an indication that the content expressed is an abstract concept. In fact, those skilled in the art understand that only the opposite is true. If a high-level programming language is a tool used to implement technical disclosures in the form of functions/operations, then those skilled in the art will understand that, far from being abstract, imprecise, "fuzzy," or "mental" in any significant semantic definition, such a tool is instead a nearly incomprehensible, precise sequential specification of a specific computing machine—parts of which are established by activating/selecting these parts over time (e.g., clock time) from a typical, more general computing machine. This fact is sometimes obscured by the superficial similarities between high-level programming languages and natural languages. These superficial similarities can also serve to gloss over the fact that high-level programming language implementations ultimately perform valuable work by creating/controlling many different computing machines.
高级编程语言规定的许多不同计算机器几乎是无法想象地复杂。基本上,计算机器中使用的硬件一般由某些类型的有序物质(例如传统电子器件(例如晶体管)、脱氧核糖核酸(DNA)、量子器件、机械开关、光学系统、射流技术、气动装置、光学器件(例如光干涉器件)、分子等)构成,这些有序物质被配置成形成逻辑门。逻辑门一般是可电气地、机械地、化学地或以其它方式被驱动以改变物理状态以形成布尔逻辑的物理现实的物理器件。The many different computing machines specified by high-level programming languages are almost unimaginably complex. Fundamentally, the hardware used in computing machines is typically composed of some type of ordered matter (e.g., traditional electronic devices (e.g., transistors), deoxyribonucleic acid (DNA), quantum devices, mechanical switches, optical systems, fluidics, pneumatics, optical devices (e.g., optical interferometers), molecules, etc.) configured to form logic gates. Logic gates are generally physical devices that can be actuated electrically, mechanically, chemically, or in some other manner to change physical states to form the physical reality of Boolean logic.
逻辑门可被配置成形成逻辑电路,该逻辑电路一般是可电气地、机械地、化学地或以其它方式被驱动以形成某些逻辑功能的物理现实的物理器件。逻辑电路的类型包括诸如多路复用器、寄存器、算术逻辑单元(ALU)、计算机存储器等的这类器件,其每种类型可被组合以形成又一些其它类型的物理器件,例如中央处理单元(CPU)——其中最广为人知的是微处理器。现代微处理器在其许多逻辑电路中经常包含一亿个以上的逻辑门(并经常超过十亿个晶体管),例如参见Wikipedia,Logic gate,http://en.wikipedia.org/wiki/Logic_gates(2012年6月5日21:03GMT)。Logic gates can be configured to form a logic circuit, which is generally a physical device that can be driven electrically, mechanically, chemically, or otherwise to form the physical reality of some logical function. Types of logic circuits include devices such as multiplexers, registers, arithmetic logic units (ALUs), computer memories, etc., each of which can be combined to form yet other types of physical devices, such as central processing units (CPUs), the most widely known of which are microprocessors. Modern microprocessors often contain more than 100 million logic gates (and often more than 1 billion transistors) in their many logic circuits. See, for example, Wikipedia, Logic gate, http://en.wikipedia.org/wiki/Logic_gates (June 5, 2012, 21:03 GMT).
形成微处理器的逻辑电路被配置成提供微架构,该微架构将执行由该微处理器的定义指令集架构定义的指令。指令集架构是与编程关联的微处理器架构的一部分,包括原数据类型、指令、寄存器、寻址模式、存储器架构、中断和异常处理以及外部输入/输出。例如参见Wikipedia,Computer architecture,http://en.wikipedia.org/wiki/Computer_architecture(2012年6月5日21:03GMT)。The logic circuits forming a microprocessor are configured to provide a microarchitecture that will execute instructions defined by the microprocessor's instruction set architecture. The instruction set architecture is the portion of the microprocessor's architecture associated with programming, including primitive data types, instructions, registers, addressing modes, memory architecture, interrupt and exception handling, and external input/output. See, for example, Wikipedia, Computer architecture, http://en.wikipedia.org/wiki/Computer_architecture (June 5, 2012, 21:03 GMT).
指令集架构包括机器语言的规范,所述规范可由编程者使用以使用/控制微处理器。由于机器语言指令使得它们可直接通过微处理器执行,它们通常由多串二进制数或位构成。例如,典型的机器语言指令可以是许多位长(例如32、64或128位字符串是当前通用的)。典型的机器语言指令可以“11110000101011110000111100111111”的形式出现(32位指令)。The instruction set architecture includes a specification for a machine language that a programmer can use to use/control a microprocessor. Because machine language instructions make them directly executable by a microprocessor, they are typically composed of strings of binary numbers or bits. For example, a typical machine language instruction can be many bits long (e.g., 32, 64, or 128 bit strings are currently common). A typical machine language instruction may appear in the form of "11110000101011110000111100111111" (a 32-bit instruction).
此处重要的是,尽管机器语言指令是以二进制数序列写就的,但实际上这些二进制数规定物理现实。例如,如果使用某些半导体来使布尔逻辑操作成为物理现实,则显然机器语言指令中的数学位“1”、“0”实际上构成规定将特定电压施加至特定线的速记。例如,在某些半导体技术中,机器语言指令中的二进制数“1”(例如逻辑“1”)规定被施加至特定“线”(例如印刷电路板上的金属迹线)的+5伏左右的电压,而机器语言指令中的二进制数“0”(例如逻辑“0”)规定被施加至特定“线”的-5伏左右的电压。除了规定机器配置的电压,这些机器语言指令也从更通用机器的上百万个逻辑门中选出并激活特定组的逻辑门。由此,远非抽象数学表达,机器语言指令程序(即便被写成一串0和1)也能规定许多许多构造的物理机或物理机状态。What's important here is that, although machine language instructions are written as sequences of binary numbers, these binary numbers actually specify physical reality. For example, if certain semiconductors are used to physically implement Boolean logic operations, it becomes apparent that the mathematical bits "1" and "0" in a machine language instruction actually constitute shorthand for specifying that a specific voltage be applied to a specific wire. For example, in certain semiconductor technologies, a binary number "1" (e.g., a logical "1") in a machine language instruction specifies a voltage of approximately +5 volts applied to a specific "wire" (e.g., a metal trace on a printed circuit board), while a binary number "0" (e.g., a logical "0") in a machine language instruction specifies a voltage of approximately -5 volts applied to a specific "wire." In addition to specifying the voltages configured for the machine, these machine language instructions also select and activate specific sets of logic gates from among the millions of logic gates in a more general machine. Thus, far from being an abstract mathematical expression, a program of machine language instructions (even if written as a string of 0s and 1s) can specify many, many configured physical machines or states of physical machines.
机器语言一般对多数人是不可领会的(例如前面的例子仅为一条指令,并且某些个人计算机每秒执行20亿条指令),例如参见Wikipedia,Instructions per second,http://en.wikipedia.org/wiki/Instruction_per_second(2012年6月5日21:04GMT)。由此,以机器语言写就的程序——它可以是数千万个机器语言指令长度——是难以领会的。有鉴于此,研发出早期汇编语言,它使用记忆代码以指向机器语言指令,而不是直接使用机器语言指令的数值例如为了执行乘运算,程序员将简称“mult(乘)”代码化,它代表MIPS机器代码中的二进制数“011000”)。尽管汇编语言一开始对人们控制微处理器以执行工作具有很大的帮助,然而经过一段时间,需要由人完成的工作的复杂度超过了人们仅使用汇编语言控制微处理器的能力。Machine language is generally incomprehensible to most people (for example, the previous example is just one instruction, and some personal computers execute 2 billion instructions per second). See, for example, Wikipedia, "Instructions per second," http://en.wikipedia.org/wiki/Instruction_per_second (June 5, 2012, 9:04 PM GMT). Consequently, programs written in machine language—which can be tens of millions of machine language instructions long—are difficult to understand. In response to this, early assembly languages were developed, which used mnemonics to point to machine language instructions rather than directly using their numerical values. For example, to perform a multiplication operation, programmers coded the abbreviation "mult," which represents the binary number "011000" in MIPS machine code. Although assembly language initially helped people control microprocessors to perform tasks, over time, the complexity of the tasks required of people exceeded their ability to control microprocessors using assembly language alone.
在这一点,注意相同的任务需要一遍又一遍地完成,并且完成这些重复任务所需的机器语言是相同的。有鉴于此,形成了编译器。编译器是这样一种器件,其采用比机器或汇编语言更容易被人领会的语句(例如“add2+2and output the result(加2+2并输出结果)”)并将人们易懂的语句转换成复杂的、冗长的和巨大的机器语言代码(例如上百万个32、64或128位长的字符串)。编译器由此将高级编程语言转换成机器语言。At this point, it's important to note that the same tasks need to be done over and over again, and the machine language required to complete these repetitive tasks is the same. With this in mind, compilers were developed. A compiler is a device that takes statements that are easier for humans to understand than machine or assembly language (e.g., "add 2 + 2 and output the result") and converts these human-understandable statements into complex, lengthy, and massive machine language code (e.g., millions of 32-, 64-, or 128-bit strings). Compilers thus convert high-level programming languages into machine language.
这种编译的机器语言,如前所述,则被用作顺序地构造并造成许多不同计算机器的互操作的技术规范,由此完成人类有益的、有形的和具体的工作。例如,如前面指出的,这种机器语言——较高级语言的编译版本——起到技术规范的作用,该技术规范选出硬件逻辑门、规定电压电平、电压转变定时等,以通过硬件完成对人类有益的工作。This compiled machine language, as previously mentioned, is then used as a technical specification to sequentially construct and cause many different computing machines to interoperate, thereby performing useful, tangible, and concrete tasks for humans. For example, as previously noted, this machine language—a compiled version of a higher-level language—serves as a technical specification that selects hardware logic gates, specifies voltage levels, timing of voltage transitions, and so forth, in order to perform useful tasks for humans through hardware.
由此,当由本领域内技术人员审视时,功能性/操作性技术描述远非抽象理念。相反,这种功能性/操作性技术描述,当通过例如刚才描述的业内可用的工具理解时,则可被理解为对硬件规范的人们容易理解的表示,其复杂性和特异性远远超过大多数人的领会范围。有鉴于此,本领域内技术人员将理解,任何这样的操作性/功能性技术描述——鉴于本文公开和本领域内技术人员的知识——可被理解为投入到物理现实中的操作,通过:(a)一个或多个相互联系的物理机;(b)被配置成形成一个或多个物理机的相互联系的逻辑门,其代表序列/组合逻辑;(c)构成逻辑门的相互联系的有序物质(例如相互联系的电子器件(例如晶体管)、DNA、量子器件、机械开关、光学系统、射流技术、气动装置、分子等),它们形成代表逻辑的物理现实;或(d)前述的实际任意组合。事实上,具有稳定、可测和可变状态的任何物理对象可用来基于前面的技术描述构造机器。例如,Charles Babbage由木材构造出第一台计算机并通过使手柄作曲柄转动而供电。Thus, when viewed by one skilled in the art, functional/operational technical descriptions are far from abstract concepts. Rather, such functional/operational technical descriptions, when understood through tools available in the industry, such as those just described, can be understood as human-understandable representations of hardware specifications, the complexity and specificity of which far exceed the comprehension of most people. In light of this, one skilled in the art will understand that any such operational/functional technical description—given the disclosure herein and the knowledge of one skilled in the art—can be understood as operations put into physical reality through: (a) one or more interconnected physical machines; (b) interconnected logic gates configured to form one or more physical machines, representing sequential/combinatorial logic; (c) interconnected ordered matter (e.g., interconnected electronic devices (e.g., transistors), DNA, quantum devices, mechanical switches, optical systems, fluidics, pneumatics, molecules, etc.) constituting logic gates, which form a physical reality representing logic; or (d) any actual combination of the foregoing. In fact, any physical object with a stable, measurable, and mutable state can be used to construct a machine based on the foregoing technical descriptions. For example, Charles Babbage constructed the first computer out of wood and powered it by cranking a handle.
由此,远非抽象理念理解的那样,本领域内技术人员将功能性/操作性技术描述认识为一个或多个几乎无法想象地复杂和时间顺序的硬件实例的人们可理解的表示。功能性/操作性技术描述可能容易将其本身赋予与自然语言共享某些字、结构、词等的高级计算语言(或对于该物质的高级框图)这一事实无法简单地视为这些功能性/操作性技术描述是抽象理念或仅仅为抽象理念表达的指示。事实上,如本文中概述的,在技术领域中这简单地不是真的。当通过本领域内技术人员可用的工具观察时,这些功能性/操作性技术描述被视为规定几乎无法想象复杂的硬件配置。Thus, far from being understood as abstract ideas, those skilled in the art recognize functional/operational technical descriptions as human-understandable representations of one or more hardware instances of almost unimaginable complexity and chronological order. The fact that functional/operational technical descriptions may readily lend themselves to high-level computing languages (or high-level block diagrams for that matter) that share certain words, structures, terms, etc. with natural languages cannot simply be viewed as an indication that these functional/operational technical descriptions are abstract ideas or merely expressions of abstract ideas. In fact, as outlined herein, this is simply not true in the art. When viewed through the tools available to those skilled in the art, these functional/operational technical descriptions are viewed as specifying hardware configurations of almost unimaginable complexity.
如前面概述的那样,使用功能性/操作性技术描述的原因至少为两层。首先,使用功能性/操作性技术描述允许接近无限复杂的机器和起因于相互联系的硬件组件的机器操作以人类头脑能够处理的方式(例如通过模仿自然语言和逻辑叙述流)描述。其次,使用功能性/操作性技术描述通过提供更独立于或较不独立于任何特定卖家的硬件零件的描述来帮助本领域内技术人员理解所描述的主题。As outlined above, the reasons for using functional/operational technical descriptions are at least twofold. First, using functional/operational technical descriptions allows machines of nearly infinite complexity and the machine operations resulting from interconnected hardware components to be described in a manner that the human mind can process (e.g., by mimicking natural language and logical narrative flow). Second, using functional/operational technical descriptions helps those skilled in the art understand the described subject matter by providing a description that is more or less independent of any particular vendor's hardware components.
使用功能性/操作性技术描述帮助本领域内技术人员理解所描述的主题,因为如从上面的讨论显而易见的,人们能容易地(尽管不是快速地)将该文档中阐述的技术描述改写成数万亿个1和0、数十亿个汇编层机器代码的单行、数百万个逻辑门、数千个门阵列或任何数量的中间级别的抽象。然而,如果任何这些低级技术描述要取代当前技术描述,本领域内技术人员在实现本公开时可能遇到过度的困难,因为这种低级技术描述可能增加复杂性而没有相应的益处(例如通过描述主题,该主题利用一个或多个卖家特定的硬件零件的规格)。由此,使用功能性/操作性技术描述通过将技术描述与任何卖家特定的硬件零件的规格分离开来而对本领域内技术人员形成帮助。The use of functional/operational technical descriptions assists those skilled in the art in understanding the described subject matter because, as is apparent from the discussion above, one can readily (though not quickly) rewrite the technical descriptions set forth in this document into trillions of ones and zeros, billions of single lines of assembly-level machine code, millions of logic gates, thousands of gate arrays, or any number of intermediate levels of abstraction. However, if any of these low-level technical descriptions were to replace the current technical description, those skilled in the art might encounter undue difficulty in implementing the present disclosure because such low-level technical descriptions might add complexity without corresponding benefit (e.g., by describing subject matter that utilizes the specifications of one or more vendor-specific hardware parts). Thus, the use of functional/operational technical descriptions assists those skilled in the art by separating the technical description from the specifications of any vendor-specific hardware parts.
有鉴于前述内容,当前技术描述中阐述的逻辑操作/功能是各种有序物质要素的静态或顺序规范的表示,为了使这些规范可由人们的头脑领会并可调整以产生许多硬件配置。本文披露的逻辑操作/功能应当被如此对待,并且不应当仅仅因为它们展示的规范以本领域内技术人员能容易理解的方式表示并以独立于特定卖家的硬件实现的方式应用而将其诋毁地归纳为抽象理念。In view of the foregoing, the logical operations/functions set forth in the present technical description are representations of static or sequential specifications of various ordered physical elements, in order to make these specifications understandable to the human mind and adjustable to produce a variety of hardware configurations. The logical operations/functions disclosed herein should be treated as such and should not be derogatorily reduced to abstract concepts simply because they present specifications that are readily understood by those skilled in the art and applied in a manner independent of a particular vendor's hardware implementation.
在一个或多个不同的方面,一个或多个关联的系统可实现在机器、物质的组合或系统制造中,其仅限于根据35USC 101规定的可授权主题。一个或多个相关系统可包括但不限于,用于实现本文引述的方法方面的电路和/或编程。电路和/或编程实际上可以是被配置成根据系统设计者的设计选择实现本文引述的方法方面的硬件、软件和/或固件的任意组合,并仅限于根据35USC 101规定的可授权主题。In one or more different aspects, one or more related systems may be implemented in a machine, composition of matter, or system of manufacture, which is limited to the subject matter allowable under 35 USC 101. The one or more related systems may include, but are not limited to, circuitry and/or programming for implementing the method aspects recited herein. The circuitry and/or programming may be any combination of hardware, software, and/or firmware configured to implement the method aspects recited herein, depending on the design choices of the system designer, and is limited to the subject matter allowable under 35 USC 101.
在一个或多个不同方面,一种系统包括但不限于:用于管理适应数据的装置,其中所述适应数据关联于特定方讲话的至少一个方面;响应关联于特定方的利于讲话的交易的指示,促进适应数据传输至目标设备的装置,其中适应数据关联于特定方的讲话的至少一个方面;以及确定是否更新适应数据的装置,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。除了前述内容,构成本文阐述的公开内容一部分的权利要求书、附图和正文中描述了其它系统方面。In one or more various aspects, a system includes, but is not limited to: means for managing adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; means for facilitating transmission of the adaptation data to a target device in response to an indication of a speech-facilitating transaction associated with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and means for determining whether to update the adaptation data, the determination being based at least in part on a result of at least a portion of the speech-facilitating transaction. In addition to the foregoing, further system aspects are described in the claims, drawings, and text forming a part of the disclosure set forth herein.
在一个或多个不同方面,一种系统包括但不限于:用于管理适应数据的电路,其中适应数据关联于特定方讲话的至少一个方面;响应关联于特定方的利于讲话的交易的指示,促进适应数据传输至目标设备的电路,其中适应数据关联于特定方的讲话的至少一个方面;以及确定是否更新适应数据的电路,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。除了前述内容,构成本文阐述的公开内容一部分的权利要求书、附图和正文中描述了其它系统方面。In one or more various aspects, a system includes, but is not limited to, circuitry for managing adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; circuitry for facilitating transmission of the adaptation data to a target device in response to an indication of a speech-facilitating transaction associated with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and circuitry for determining whether to update the adaptation data, the determination being based at least in part on a result of at least a portion of the speech-facilitating transaction. In addition to the foregoing, further system aspects are described in the claims, drawings, and text forming a part of the disclosure set forth herein.
在一个或多个不同方面,一种计算机程序产品,包括信号承载介质,其承载一个或多个指令,所述指令包括但不限于:用于管理适应数据的一个或多个指令,其中适应数据关联于特定方讲话的至少一个方面;响应关联于特定方的利于讲话的交易的指示,促进适应数据传输至目标设备的一个或多个指令,其中适应数据关联于特定方的讲话的至少一个方面;以及确定是否更新适应数据的一个或多个指令,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。除了前述内容,构成本文阐述的公开内容一部分的权利要求书、附图和正文中描述了其它计算机程序产品方面。In one or more various aspects, a computer program product includes a signal-bearing medium carrying one or more instructions, including, but not limited to: one or more instructions for managing adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; one or more instructions for facilitating transmission of the adaptation data to a target device in response to an indication of a speech-facilitating transaction associated with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and one or more instructions for determining whether to update the adaptation data, the determination being based at least in part on a result of at least a portion of the speech-facilitating transaction. In addition to the foregoing, further computer program product aspects are described in the claims, drawings, and text forming a part of the disclosure set forth herein.
在一个或多个不同方面,通过计算语言定义一种设备,以使该设备包括:定序以管理适应数据的一个或多个相互联系的物理机,其中适应数据关联于特定方讲话的至少一个方面;定序以响应关联于特定方的利于讲话的交易的指示将适应数据传输至目标设备的一个或多个相互联系的物理机,其中适应数据关联于特定方的讲话的至少一个方面;以及定序以确定是否更新适应数据的一个或多个相互联系的物理机,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。In one or more different aspects, a device is defined by a computing language such that the device includes: one or more interconnected physical machines sequenced to manage adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; one or more interconnected physical machines sequenced to transmit the adaptation data to a target device in response to an indication of a speech-facilitating transaction associated with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and one or more interconnected physical machines sequenced to determine whether to update the adaptation data, the determination being based at least in part on a result of at least a portion of the speech-facilitating transaction.
除了前述内容,在诸如本公开的正文(例如权利要求书和/或详细说明)和/或附图的教示中阐述和描述了多种其它方法和/或系统和/或程序产品方面。In addition to the foregoing, various other method and/or system and/or program product aspects are set forth and described in the teachings, such as in the text of the present disclosure (eg, claims and/or detailed description) and/or figures.
前面的内容是概述并因此可能包含细节的简化、概括、包含和/或省略;结果,本领域内技术人员将理解此概述仅为解说性的并且无论如何不旨在构成限制。本文描述的设备和/或过程和/或其它主题的其它方面、特征和优势通过参照详细说明、相应的附图和/或本文给出的教义将变得显而易见。The foregoing is an overview and therefore may contain simplifications, generalizations, inclusions, and/or omissions of details; consequently, those skilled in the art will appreciate that this overview is illustrative only and is not intended to be limiting in any way. Other aspects, features, and advantages of the devices and/or processes and/or other subject matter described herein will become apparent by reference to the detailed description, the accompanying drawings, and/or the teachings presented herein.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了实施例的更完全理解,现在结合附图参照下面的说明。在不同的附图中使用相同的标记一般表示相似或相同的部件,除非上下文指出相反情形。详细说明、附图和权利要求书中描述的解说性实施例不旨在构成限制。也可利用其它实施例并作出其它改变而不脱离这里给出的主题的精神或范围。For a more complete understanding of the embodiments, reference is now made to the following description in conjunction with the accompanying drawings. The use of the same reference numerals in different figures generally indicates similar or identical components, unless the context indicates otherwise. The illustrative embodiments described in the detailed description, drawings, and claims are not intended to be limiting. Other embodiments may be utilized and other changes may be made without departing from the spirit or scope of the subject matter presented herein.
图1A示出根据一实施例的示例性环境100的高级框图。FIG1A illustrates a high-level block diagram of an exemplary environment 100 , according to one embodiment.
图1B示出根据一实施例的工作在示例性实施例100中的个人设备120的高级框图。FIG. 1B illustrates a high-level block diagram of a personal device 120 operating in the exemplary embodiment 100 , according to one embodiment.
图1C示出根据一实施例的示例性环境100’的高级图,它是具有个人设备120的示例性实施例100的一个例子。FIG1C illustrates a high-level diagram of an exemplary environment 100', which is an example of an exemplary embodiment 100 having a personal device 120, according to an embodiment.
图1D示出根据一实施例的示例性环境100”的高级图,它是具有个人设备120的示例性实施例100的一个例子。FIG. 1D illustrates a high-level diagram of an exemplary environment 100 ″, which is an example of an exemplary embodiment 100 having a personal device 120 , according to an embodiment.
图1E示出根据一实施例的示例性环境100”’的高级图,它是具有个人设备120的示例性实施例100的一个例子。FIGURE 1E illustrates a high-level diagram of an exemplary environment 100'" which is an example of an exemplary embodiment 100 having a personal device 120, according to an embodiment.
图2,包括图2A-2B,示出与图1B的环境100的个人设备120的特定方调节模块152的至少一个方面关联的讲话适应数据的特定方面。FIG. 2 , comprising FIG. 2A-2B , illustrates particular aspects of speech adaptation data associated with at least one aspect of a particular party adaptation module 152 of a personal device 120 of the environment 100 of FIG. 1B .
图3,包括图3A-3D,示出图1B的环境100的个人设备120的响应包括目标设备的利于讲话的交易的指示的讲话适应数据传输促进模块154的特定方面。3 , comprising FIGS. 3A-3D , illustrates certain aspects of the speech-adapted data transfer facilitation module 154 of the personal device 120 of the environment 100 of FIG. 1B in response to an indication of a speech-facilitated transaction including a target device.
图4,包括图4A-4I,示出图1B的环境100的个人设备120的部分基于利于讲话的交易的部分的结果的讲话适应数据潜在修正确定模块156的特定方面。4, including FIGs. 4A-4I, illustrates certain aspects of the speech adaptation data potential correction determination module 156 of the personal device 120 of the environment 100 of FIG. 1B based, in part, on the results of a portion of a speech-facilitated transaction.
图5是根据一实施例的过程(例如操作流程500)的高级逻辑流程图。FIG5 is a high-level logical flow diagram of a process (eg, operational flow 500 ) according to an embodiment.
图6A是根据一个或多个实施例的描绘图5的管理适应数据操作502的替代实现的方法的高级逻辑流程图。FIG6A is a high-level logical flow diagram depicting a method of an alternative implementation of the manage adaptation data operation 502 of FIG5 , according to one or more embodiments.
图6B是根据一个或多个实施例的描绘图5的管理适应数据操作502的替代实现的方法的高级逻辑流程图。FIG6B is a high-level logic flow diagram depicting a method of an alternative implementation of the manage adaptation data operation 502 of FIG5 , according to one or more embodiments.
图7A是根据一个或多个实施例的描绘利于传输图5的适应数据504的替代实现的方法的高级逻辑流程图。FIG7A is a high-level logical flow diagram depicting a method that facilitates an alternative implementation of transmitting the adaptation data 504 of FIG5 , according to one or more embodiments.
图7B是根据一个或多个实施例的描绘利于传输图5的适应数据504的替代实现的方法的高级逻辑流程图。FIG7B is a high-level logic flow diagram depicting a method that facilitates an alternative implementation of transmitting the adaptation data 504 of FIG5 , according to one or more embodiments.
图7C是根据一个或多个实施例的描绘利于传输图5的适应数据504的替代实现的方法的高级逻辑流程图。FIG7C is a high-level logic flow diagram depicting a method that facilitates an alternative implementation of transmitting the adaptation data 504 of FIG5 , according to one or more embodiments.
图7D是根据一个或多个实施例的描绘利于传输图5的适应数据504的替代实现的方法的高级逻辑流程图。FIG7D is a high-level logic flow diagram depicting a method that facilitates an alternative implementation of transmitting the adaptation data 504 of FIG5 , according to one or more embodiments.
图8A是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8A is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 , according to one or more embodiments.
图8B是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8B is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 , according to one or more embodiments.
图8C是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8C is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8D是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8D is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8E是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8E is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8F是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8F is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8G是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8G is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8H是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8H is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
图8I是根据一个或多个实施例的描述判断是否更新图5的适应数据506的方法的高级逻辑流程图。FIG8I is a high-level logic flow diagram illustrating a method for determining whether to update the adaptation data 506 of FIG5 according to one or more embodiments.
具体实施方式DETAILED DESCRIPTION
在下面的详细说明中,参照构成说明书一部分的附图。在附图中,相似的标记一般表示相似或相同的组件或物品,除非上下文指明相反情形。详细说明、附图和权利要求书中描述的解说性实施例不旨在构成限制。可利用其它实施例,并可作出其它改变而不脱离这里给出的主题的精神或范围。In the following detailed description, reference is made to the accompanying drawings, which form a part thereof. In the drawings, like symbols generally indicate similar or identical components or items, unless the context indicates otherwise. The illustrative embodiments described in the detailed description, drawings, and claims are not intended to be limiting. Other embodiments may be utilized, and other changes may be made, without departing from the spirit or scope of the subject matter presented herein.
由此,根据各实施例,计算实现的方法、系统、电路、制品、物质的有序联系以及计算机程序产品被设计成:尤其提供管理适应数据的接口,其中适应数据关联于特定方的讲话的至少一个方面;响应特定方的利于讲话的交易关联的指示,促进将适应数据传输至目标设备,其中适应数据关联于特定方的讲话的至少一个方面;以及确定是否更新适应数据,所述确定至少部分地基于利于讲话交易的至少一部分的结果。Thus, according to various embodiments, computer-implemented methods, systems, circuits, articles, ordered connections of matter, and computer program products are designed to: provide, among other things, an interface for managing adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; facilitate the transmission of adaptation data to a target device in response to an indication of a transaction associated with speech-facilitating transaction by a particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and determine whether to update the adaptation data, the determination being based at least in part on the results of at least a portion of the speech-facilitating transaction.
本申请为清楚表达而使用正式大纲标题。然而要理解,这些大纲标题用于表示目的,并且贯穿申请地可讨论不同类型的主题(例如设备/结构可在过程/操作标题下描述和/或过程/操作可在结构/过程标题下讨论;和/或单个话题的描述可跨两个或更多个标题)。因此,使用正式大纲标题无论如何不旨在构成限制。This application uses formal outline headings for clarity. However, it should be understood that these outline headings are for presentation purposes and that different types of subject matter may be discussed throughout the application (e.g., a device/structure may be described under a process/operation heading and/or a process/operation may be discussed under a structure/process heading; and/or the description of a single topic may span two or more headings). Therefore, the use of formal outline headings is not intended to be limiting in any way.
贯穿本申请地给出具有括号、缩写“例如”或两者的示例和列表。除非明确地另有声明,否则这些示例和列表仅为示例性的而非穷尽性的。在多数情形下,列出每个例子和每种组合是不现实的。由此,使用较小的解说性列表和例子,其侧重于给予对权利要求术语的理解而不是对这些术语的范围构成限制。Throughout this application, examples and lists are given with parentheses, the abbreviation "such as," or both. Unless expressly stated otherwise, these examples and lists are illustrative only and not exhaustive. In most cases, it is impractical to list every example and every combination. Therefore, smaller illustrative lists and examples are used that focus on providing an understanding of the claim terms rather than limiting the scope of these terms.
对于本文中对基本任何复数和/或单数术语的使用,本领域内技术人员可在上下文和/或应用适合时从复数转换成单数和/或从单数转换成复数。为清楚起见在这里不明确地阐述各种单数/复数置换。With respect to the use of substantially any plural and/or singular terms herein, those skilled in the art can translate from the plural to the singular and/or from the singular to the plural as appropriate to the context and/or application. The various singular/plural permutations are not explicitly set forth herein for the sake of clarity.
本领域内技术人员将认识到,本文描述的组成(例如操作)、设备、对象和伴随它们的讨论为了概念清楚作为示例使用并且考虑多种配置修正。结果,如本文中使用的,所阐述的特定例子和伴随的讨论旨在表达其更一般的类别。总地来说,任何特定例子的使用旨在表示其类别,并且特定组成(例如操作)、设备和目的的非包含不应当被认为是限制。Those skilled in the art will recognize that the components (e.g., operations), devices, objects, and the accompanying discussions described herein are used as examples for conceptual clarity and contemplate a variety of configuration modifications. Consequently, as used herein, the specific examples and accompanying discussions set forth are intended to represent their more general categories. In general, the use of any specific example is intended to represent its category, and the non-inclusion of specific components (e.g., operations), devices, and objects should not be considered limiting.
本申请为清楚表示而使用正式大纲标题。然而要理解,大纲标题用于表示目的,并可贯穿本申请地讨论不同类型的主题(例如设备/结构可在过程/操作标题下描述和/或过程/操作可在结构/过程标题下讨论;和/或单个话题的描述可跨两个或更多个标题)。因此,使用正式大纲标题无论如何不旨在构成限制。This application uses formal outline headings for clarity. However, it should be understood that the outline headings are for presentation purposes and that different types of subject matter may be discussed throughout this application (e.g., a device/structure may be described under a process/operation heading and/or a process/operation may be discussed under a structure/process heading; and/or the description of a single topic may span two or more headings). Therefore, the use of formal outline headings is not intended to be limiting in any way.
尽管用户105在本文中图示/描述为的单个示出的人物,然而本领域内技术人员将理解,用户105可代表人类用户、机器人用户(例如计算实体)和/或其基本任意组合(例如用户可由一个或多个机器人代理来协助),除非上下文指出了相反情形。本领域内技术人员将理解,一般来说,同一对象可被说成是“发送方”和/或其它面向实体的术语(当这些术语在本文中使用时),除非上下文指出了相反情形。Although user 105 is illustrated/described herein as a single illustrated person, those skilled in the art will appreciate that user 105 may represent a human user, a robotic user (e.g., a computing entity), and/or substantially any combination thereof (e.g., a user may be assisted by one or more robotic agents), unless the context indicates otherwise. Those skilled in the art will appreciate that, generally speaking, the same entity may be referred to as a "sender" and/or other entity-oriented terms (as such terms are used herein) unless the context indicates otherwise.
本领域内技术人员将认识到,技术发展水平已发展到在系统各个方面的硬件、软件和/或固件实现之间存在极小差别的阶段;硬件、软件和/或固件的使用通常是(但不总是,因为在某些背景下硬件和软件之间的选择可能变得重要)代表成本相对于效率权衡的设计选择。本领域内技术人员将理解,存在多种载体,通过这些载体可实现本文描述的过程和/或系统和/或其它技术(例如硬件、软件和/或固件),并且优选的载体可随着其中部署过程和/或系统和/或其它技术的背景而变化。例如,如果实施者确定速度和精确性是至上的,则实施者可选择主要硬件和/或固件载体;替代地,如果灵活性至上,则实施者可选择主要软件实现;或者再一次,实施者可选择一个或多个机器中的硬件、软件和/或固件的一些组合、物质的组合以及制品,其仅限于根据35USC 101可授权的主题。因此,存在若干种可能的载体,通过这些载体可实现本文描述的过程和/或设备和/或其它技术,没有任何一种天生地优于其它的,因为拟利用的任何载体是依赖于载体将被部署的背景和实施者的特殊考虑(例如速度、灵活性或可预测性)的选择,其任意一种都是可变的。本领域内技术人员将认识到,实现的光学方面将一般采用面向光学的硬件、软件和固件。Those skilled in the art will recognize that the state of the art has advanced to the point where there is minimal differentiation between hardware, software, and/or firmware implementations of various aspects of a system; the use of hardware, software, and/or firmware is often (but not always, as the choice between hardware and software may become important in certain contexts) a design choice that represents a cost versus efficiency tradeoff. Those skilled in the art will understand that there are a variety of vehicles by which the processes and/or systems and/or other techniques described herein can be implemented (e.g., hardware, software, and/or firmware), and that the preferred vehicle may vary depending on the context in which the processes and/or systems and/or other techniques are deployed. For example, if the implementer determines that speed and accuracy are paramount, the implementer may select a primarily hardware and/or firmware vehicle; alternatively, if flexibility is paramount, the implementer may select a primarily software implementation; or again, the implementer may select some combination of hardware, software, and/or firmware in one or more machines, combinations of materials, and articles of manufacture, limited to subject matter patentable under 35 USC §101. Thus, there are several possible carriers by which the processes and/or devices and/or other techniques described herein may be implemented, with no one carrier being inherently superior to the other, as any carrier to be utilized is a choice dependent upon the context in which the carrier will be deployed and the implementer's particular considerations (e.g., speed, flexibility, or predictability), any of which are variable. Those skilled in the art will recognize that the optical aspects of the implementation will generally employ optically oriented hardware, software, and firmware.
在本文描述的一些实现中,逻辑和类似的实现可包括软件或其它控制结构。例如,电子电路可具有被构造和布置成实现如本文描述的各种功能的一个或多个电流路径。在一些实现中,一个或多个介质可被配置成当该介质保持或发送如本文所述那样执行的设备可检测指令时承载设备可检测的实现。在一些变例中,例如实现可包括已有软件或硬件或门阵列或可编程硬件的更新或修正,例如通过关联于本文描述的一个或多个操作执行一个或多个指令的接收或发送。作为替代或附加,在一些变例中,实现可包括专用硬件、软件、固件器件和/或通用器件,它们执行或以其它方式调用专用器件。规范或其它实现可通过如本文描述的有形传输介质的一个或多个实例来传输,可选择地通过分组传输或通过分布式介质多次地传递。In some implementations described herein, logic and similar implementations may include software or other control structures. For example, an electronic circuit may have one or more current paths constructed and arranged to implement various functions as described herein. In some implementations, one or more media may be configured to carry device-detectable implementations when the media holds or sends device-detectable instructions executed as described herein. In some variations, for example, the implementation may include an update or modification of existing software or hardware or gate arrays or programmable hardware, such as the reception or transmission of one or more instructions performed in association with one or more operations described herein. Alternatively or in addition, in some variations, the implementation may include dedicated hardware, software, firmware devices and/or general-purpose devices that execute or otherwise call dedicated devices. The specification or other implementation may be transmitted via one or more instances of a tangible transmission medium as described herein, optionally transmitted multiple times via packet transmission or via distributed media.
作为替代或附加,实现可包括执行专用指令序列或调用电路以启用、触发、协调、请求或以其它方式造成实际上本文描述的任何功能性操作的一次或多次发生。在一些变例中,本文中的操作性或其它逻辑描述可被表达为源代码并被编译或者被调用作为可执行指令序列。在一些背景下,例如实现可整体地或部分地通过诸如C++的源代码或其它代码序列来提供。在其它实现中,源代码或其它代码实现,使用商业可得的和/或业内的技术,可被编译/实现/翻译/转换成高级描述符语言(例如最初实现C或C++编程语言中描述的技术并随后将编程语言实现转化成可逻辑合成的语言实现、硬件描述语言实现、硬件设计模拟实现和/或其它这种类似的表达模式)。例如,一些或全部的逻辑表达(例如计算机编程语言实现)可被显示为Verilog型硬件描述(例如经由硬件描述语言(HDL)和/或非常高速集成电路硬件描述符语言(VHDL))或者其它电路模型,它可随后用来产生具有硬件的物理实现(例如专用集成电路)。受这些教义的启发,本领域内技术人员将理解如何获得、配置和优化合适的传输或计算要素、材料供给、致动器或其它结构。Alternatively or in addition, implementation may include executing a dedicated instruction sequence or calling circuitry to enable, trigger, coordinate, request, or otherwise cause one or more occurrences of virtually any functional operation described herein. In some variations, the operational or other logical descriptions herein may be expressed as source code and compiled or called as an executable instruction sequence. In some contexts, for example, implementation may be provided in whole or in part via source code or other code sequences such as C++. In other implementations, source code or other code implementations, using commercially available and/or industry-proven techniques, may be compiled/implemented/translated/converted into a high-level descriptor language (e.g., initially implementing the techniques described in a C or C++ programming language and subsequently converting the programming language implementation into a logically synthesizable language implementation, a hardware description language implementation, a hardware design simulation implementation, and/or other such similar expression patterns). For example, some or all of the logical expression (e.g., a computer programming language implementation) may be displayed as a Verilog-type hardware description (e.g., via a hardware description language (HDL) and/or a very high speed integrated circuit hardware descriptor language (VHDL)) or other circuit model, which may then be used to generate a physical implementation with hardware (e.g., an application-specific integrated circuit). Given these teachings, one skilled in the art will understand how to obtain, configure, and optimize appropriate transmission or computing elements, material supplies, actuators, or other structures.
本领域内技术人员将认识到,业内常见的是实现设备和/或过程和/或系统并随后使用工程设计和/或其它实践来将这些实现的设备和/或过程和/或系统整合到更广泛的设备和/或过程和/或系统中。也就是说,本文描述的设备和/或过程和/或系统的至少一部分可经由合理数量的试验被整合到其它设备和/或过程和/或系统。本领域内技术人员将认识到,这些其它设备和/或过程和/或系统的例子可包括——只要对背景和应用是适当的——下列设备和/或过程和/或系统的全部或部分:(a)空中运输(例如飞机、火箭、直升飞机等);(b)地面运输(例如轿车、卡车、火车、坦克、带装甲的人员运载车等);(c)建筑(例如住宅、仓库、办公室等);(d)设施(例如冰箱、洗衣机、干衣器等);(e)通信系统(例如联网系统、电话系统、IP上语音系统等);(f)营业实体(例如互联网服务提供商(ISP)实体,比如ComcastCable、Qwest、Southwestern Bell等)或(g)有线/无线服务实体(例如Sprint、Cingular、Nextel等)等等。Those skilled in the art will recognize that it is common in the industry to implement devices and/or processes and/or systems and then use engineering design and/or other practices to integrate these implemented devices and/or processes and/or systems into broader devices and/or processes and/or systems. That is, at least a portion of the devices and/or processes and/or systems described herein can be integrated into other devices and/or processes and/or systems through a reasonable amount of experimentation. Those skilled in the art will recognize that examples of these other devices and/or processes and/or systems may include, as appropriate to the context and application, all or part of the following devices and/or processes and/or systems: (a) air transportation (e.g., airplanes, rockets, helicopters, etc.); (b) ground transportation (e.g., cars, trucks, trains, tanks, armored personnel carriers, etc.); (c) buildings (e.g., homes, warehouses, offices, etc.); (d) facilities (e.g., refrigerators, washers, dryers, etc.); (e.g., networking systems, telephone systems, voice over IP systems, etc.); (f) business entities (e.g., Internet Service Provider (ISP) entities, such as Comcast Cable, Qwest, Southwestern Bell, etc.) or (g) wired/wireless service entities (e.g., Sprint, Cingular, Nextel, etc.), etc.
在某些情形下,系统或方法的使用可能发生在一地域内,即便组件位于该地域之外。例如,在分布式计算背景下,分布式计算系统的使用可发生在一地域内,即便系统的部件可能位于该地域之外(例如中继器、服务器、处理器、信号承载介质、发送计算机、接收计算机等位于该地域之外)。In some cases, use of a system or method may occur within a region even if components are located outside of the region. For example, in the context of distributed computing, use of a distributed computing system may occur within a region even if components of the system may be located outside of the region (e.g., repeaters, servers, processors, signal-carrying media, sending computers, receiving computers, etc. may be located outside of the region).
系统或方法的销售可能同样发生在一地域内,即便系统或方法的组件位于和/或使用在该地域之外。此外,在一个地域内实现用于执行方法的系统的至少一部分不排除在另一地域内使用系统。Sales of a system or method may also occur within a territory even if components of the system or method are located and/or used outside the territory. Furthermore, implementing at least a portion of a system for performing a method in one territory does not preclude use of the system in another territory.
本领域内技术人员将认识到,为概念清楚起见,本文描述的组成(例如操作)、设备、对象和与之伴随的讨论被用作示例,并可考虑各种配置修正。结果,如本文中使用的,陈述的特定例子和附图描述旨在代表其更一般的类别。一般来说,任何特定例子的使用旨在代表其类别,并且特定组成(例如操作)、设备和对象的非包含不应当被认为构成限制。Those skilled in the art will recognize that for the sake of conceptual clarity, the components (e.g., operations), devices, objects, and the accompanying discussions described herein are used as examples, and various configuration modifications are contemplated. Consequently, as used herein, the specific examples and accompanying figure descriptions set forth are intended to represent their more general classes. In general, the use of any specific example is intended to represent its class, and the non-inclusion of specific components (e.g., operations), devices, and objects should not be considered limiting.
本文描述的主题有时说明包含在不同的其它组件中或与不同的其它组件连接的不同组件。要注意,这些描述的架构仅为示例性的并且事实上可采用取得相同功能的许多其它结构。在理念意义上,取得相同功能的任何组成布置是效果上“关联的”以取得要求的功能。因此,本文中组合以取得特定功能的任意两个组成可被视为彼此“关联的”以取得要求的功能,不管架构或中间组件如何。同样,如此关联的任意两个组件也可被视为彼此“可操作地连接的”或“可操作地耦合的”以取得要求的功能,并且能够如此关联的任意两个组件也可被视为彼此“可操作地耦合的”以取得要求的功能。可操作地耦合的特定例子包括但不限于可物理匹配和/或物理相互作用的组件、和/或可无线相互作用、和/或无线相互作用的组件、和/或逻辑相互作用、和/或可逻辑相互作用的组件。The subject matter described herein sometimes illustrates different components that are contained in or connected to different other components. It should be noted that the architectures described are merely exemplary and in fact many other structures that can be used to obtain the same function can be adopted. In a conceptual sense, any composition arrangement that obtains the same function is effectively "associated" to obtain the required function. Therefore, any two compositions that are combined to obtain a specific function herein can be considered to be "associated" to obtain the required function, regardless of the architecture or intermediate components. Similarly, any two components that are so associated can also be considered to be "operably connected" or "operably coupled" to obtain the required function, and any two components that can be so associated can also be considered to be "operably coupled" to obtain the required function. Specific examples of operable coupling include, but are not limited to, components that can physically match and/or physically interact, and/or components that can wirelessly interact, and/or components that can wirelessly interact, and/or logically interact, and/or components that can logically interact.
在一些情形下,一个或多个组件在本文中可被称为“被配置成”、“通过……配置”、“可配置成”、“操作/作用以”、“适于/可适于”、“能够”、“可遵循/遵循”等。本领域内技术人员将理解,这些术语(例如“被配置成”)一般涵盖活动状态组件和/或非活动状态组件和/或待命状态组件,除非上下文要求相反情形。In some cases, one or more components may be referred to herein as being "configured to," "configured by," "configurable to," "operating/acting to," "suitable for/adaptable to," "capable of," "complying/complying with," etc. Those skilled in the art will understand that these terms (e.g., "configured to") generally encompass active state components and/or inactive state components and/or standby state components, unless the context requires otherwise.
从一般意义上说,本领域内技术人员将认识到,本文描述的各实施例可通过各种类型的电-机系统单独地和/或统一地实现,所述电-机系统具有宽范围的电气组件,例如硬件、软件、固件和/或实际上其任意组合,它们仅限于根据35USC 101的可授权主题;以及宽范围的组件,其可向例如刚性本体、弹簧或扭转本体、液压、电磁致动的设备和/或实际上其任意组合给予机械力或运动。结果,如本文中使用的“电-机系统”包括但不限于:与换能器(例如致动器、电动机、压电晶体、微电机系统(MEMS)等)可操作地耦合的电路、具有至少一个分立电路的电路、具有至少一个集成电路的电路、具有至少一个专用集成电路的电路、形成通用计算设备的电路,该通用计算设备通过计算机程序配置(例如通过计算机程序配置的通用计算机,该计算机程序至少部分地执行本文描述的过程和/或设备,或者通过计算机程序配置的微处理器,该计算机程序至少部分地执行本文描述的方法和/或设备)、形成存储设备(例如形成存储器(例如随机存取、闪存、只读等))的电路、形成通信设备(例如调制解调器、通信交换机、光-电设施等)的电路和/或在那里的任何非电类似物,例如光或其它类似物(例如基于石墨烯的电路)。本领域内技术人员也将理解,电-机系统的例子包括但不限于众多消费者电子系统、医疗设备以及诸如机动传输系统、工厂自动化系统、安全系统和/或通信/计算系统的其它系统。本领域内技术人员将认识到,如本文描述的电-机不一定仅限于既电致动又机械致动的系统,除非上下文指出相反情形。In a general sense, those skilled in the art will recognize that the various embodiments described herein may be implemented individually and/or collectively with various types of electro-mechanical systems having a wide range of electrical components, such as hardware, software, firmware, and/or virtually any combination thereof, which are limited to subject matter patentable under 35 USC 101; and a wide range of components that can impart mechanical force or motion to devices such as rigid bodies, spring or torsion bodies, hydraulic, electromagnetic actuation, and/or virtually any combination thereof. Consequently, as used herein, "electromechanical system" includes, but is not limited to, circuits operatively coupled to transducers (e.g., actuators, motors, piezoelectric crystals, microelectromechanical systems (MEMS), etc.), circuits having at least one discrete circuit, circuits having at least one integrated circuit, circuits having at least one application specific integrated circuit, circuits forming a general purpose computing device configured by a computer program (e.g., a general purpose computer configured by a computer program that at least partially performs the processes and/or apparatus described herein, or a microprocessor configured by a computer program that at least partially performs the methods and/or apparatus described herein), circuits forming a storage device (e.g., forming memory (e.g., random access, flash, read-only, etc.)), circuits forming a communication device (e.g., a modem, a communication switch, an opto-electronic device, etc.), and/or any non-electrical analogues therein, such as optical or other analogues (e.g., graphene-based circuits). One skilled in the art will also appreciate that examples of electromechanical systems include, but are not limited to, numerous consumer electronic systems, medical devices, and other systems such as motor transport systems, factory automation systems, security systems, and/or communication/computing systems. Those skilled in the art will recognize that electro-mechanical devices as described herein are not necessarily limited to systems that are both electrically and mechanically actuated, unless the context indicates otherwise.
从一般意义上说,本领域内技术人员将认识到,可通过大范围的硬件、软件、固件和/或其任意组合单独地和/或统一地实现的本文描述的各个方面可被视为由多种类型“电路”构成。因此,本文中使用的“电路”包括但不限于,具有至少一个分立电路的电路、具有至少一个集成电路的电路、具有至少一个专用集成电路的电路、形成通用计算设备的电路,该通用计算设备通过计算机程序配置(例如通过计算机程序配置的通用计算机,该计算机程序至少部分地执行本文描述的过程和/或设备,或者通过计算机程序配置的微处理器,该计算机程序至少部分地执行本文描述的过程和/或设备)、形成存储设备(例如形成存储器(例如随机存取、闪存、只读等))的电路、和/或形成通信设备(例如调制解调器、通信交换机、光-电设施等)的电路。本领域内技术人员将认识到,本文描述的主题可以模拟或数字形式或其某些组合来实现。In a general sense, those skilled in the art will recognize that the various aspects described herein, which may be implemented individually and/or collectively by a wide range of hardware, software, firmware, and/or any combination thereof, may be considered to be comprised of various types of "circuits." Thus, as used herein, "circuitry" includes, but is not limited to, circuitry having at least one discrete circuit, circuitry having at least one integrated circuit, circuitry having at least one application-specific integrated circuit, circuitry forming a general-purpose computing device configured by a computer program (e.g., a general-purpose computer configured by a computer program that at least partially performs the processes and/or devices described herein, or a microprocessor configured by a computer program that at least partially performs the processes and/or devices described herein), circuitry forming a storage device (e.g., circuitry forming memory (e.g., random access, flash memory, read-only, etc.)), and/or circuitry forming a communication device (e.g., a modem, a communication switch, an optical-to-electronic device, etc.). Those skilled in the art will recognize that the subject matter described herein may be implemented in analog or digital form, or some combination thereof.
本领域内技术人员将认识到,本文描述的设备和/或过程的至少一部分可被集成到图像处理系统中。本领域内技术人员将认识到,典型的图像处理系统一般包括下列部件中的一个或多个:系统单元外壳;视频显示设备;诸如易失性存储器或非易失性存储器的存储器;诸如微处理器或数字信号处理器的处理器;诸如操作系统、驱动器、应用程序的计算实体;一个或多个交互设备(例如触摸垫、触摸屏、天线等);包括反馈环和控制电机(例如用于感测透镜位置和/或速度的反馈;用于移动/扭曲透镜以提供要求的焦点的控制电机)的控制系统。图像处理系统可利用合适的可商业获得的组件来实现,例如一般在数字静止系统和/或数字运动系统中找到的那些。Those skilled in the art will recognize that at least a portion of the devices and/or processes described herein can be integrated into an image processing system. Those skilled in the art will recognize that a typical image processing system generally includes one or more of the following components: a system unit housing; a video display device; a memory such as volatile memory or non-volatile memory; a processor such as a microprocessor or a digital signal processor; a computing entity such as an operating system, drivers, and application programs; one or more interactive devices (e.g., a touch pad, a touch screen, an antenna, etc.); a control system including feedback loops and control motors (e.g., feedback for sensing lens position and/or velocity; control motors for moving/distorting the lens to provide the desired focus). The image processing system can be implemented using suitable commercially available components, such as those typically found in digital still systems and/or digital motion systems.
本领域内技术人员将认识到,本文描述的设备和/或过程的至少一部分可被集成到数据处理系统中。本领域内技术人员将认识到,数据处理系统一般包括下列部件中的一个或多个:系统单元外壳;视频显示设备;诸如易失性存储器或非易失性存储器的存储器;诸如微处理器或数字信号处理器的处理器;诸如操作系统、驱动器、图形用户接口和应用程序的计算实体;一个或多个交互设备(例如触摸垫、触摸屏、天线等);和/或包括反馈环和控制电机(例如用于感测位置和/或速度的反馈;用于移动和/或调整器件和/或量的控制电机)的控制系统。数据处理系统可利用合适的可商业获得的组件来实现,例如一般在数据计算/通信和/或网络计算/通信系统中找到的那些。Those skilled in the art will recognize that at least a portion of the devices and/or processes described herein can be integrated into a data processing system. Those skilled in the art will recognize that a data processing system generally includes one or more of the following components: a system unit housing; a video display device; a memory such as volatile memory or non-volatile memory; a processor such as a microprocessor or a digital signal processor; a computing entity such as an operating system, drivers, a graphical user interface, and application programs; one or more interactive devices (e.g., a touch pad, a touch screen, an antenna, etc.); and/or a control system including feedback loops and control motors (e.g., feedback for sensing position and/or velocity; control motors for moving and/or adjusting devices and/or quantities). The data processing system can be implemented using suitable commercially available components, such as those typically found in data computing/communication and/or network computing/communication systems.
为了本申请的目的,“云”计算可被理解为如云计算文献中描述的那样。例如,云计算可以是传递计算容量和/或存储容量作为服务的方法和/或系统。“云”可表示一个或多个硬件和/或软件组成,其传递或帮助传递计算和/或存储容量,包括但不限于下面列出的一个或多个:客户机、应用、平台、基础设施和/或服务器。云可指与客户机、应用、平台、基础设施和/或服务器关联的任意硬件和/或软件。例如,云和云计算可指下面列出的一个或多个:计算机、处理器、存储介质、路由器、交换机、调制解调器、虚拟机(例如虚拟服务器)、数据中心、操作系统、中间件、固件、硬件后端、软件后端、软件后端和/或软件应用。云可指私人云、公众云、混合云和/或社区云。云可以是可配置计算资源的共享池,它也可以是公众的、私人的、半私人的、分布式的、可扩展的、灵活的、临时的、虚拟的和/或物理的。云或云服务可在一种或多种类型的网络(例如移动通信网和互联网)上传递。For the purposes of this application, "cloud" computing may be understood as described in the cloud computing literature. For example, cloud computing may be a method and/or system for delivering computing capacity and/or storage capacity as a service. "Cloud" may refer to one or more hardware and/or software components that deliver or facilitate the delivery of computing and/or storage capacity, including, but not limited to, one or more of the following: clients, applications, platforms, infrastructure, and/or servers. Cloud may refer to any hardware and/or software associated with clients, applications, platforms, infrastructure, and/or servers. For example, cloud and cloud computing may refer to one or more of the following: computers, processors, storage media, routers, switches, modems, virtual machines (e.g., virtual servers), data centers, operating systems, middleware, firmware, hardware backends, software backends, and/or software applications. Cloud may refer to private, public, hybrid, and/or community clouds. A cloud may be a shared pool of configurable computing resources; it may also be public, private, semi-private, distributed, scalable, flexible, temporary, virtual, and/or physical. Clouds or cloud services may be delivered over one or more types of networks, such as mobile communication networks and the Internet.
如本申请中使用的,云或云服务可包括下面列出的一种或多种:作为服务的基础设施(IaaS)、作为服务的平台(PaaS)、作为服务的软件(SaaS)和/或作为服务的桌面(DaaS)。作为一个非排他例子,IaaS可包括例如一个或多个虚拟服务器实例,其可开始、停止、访问和/或配置虚拟服务器和/或存储中心(例如按需地提供一个或多个处理器、存储空间和/或网络资源,比如EMC和Rackspace)。PaaS可包括例如宿主在基础设施(例如计算平台和/或方案栈,客户机可从中产生软件接口和应用(例如Microsoft Azure))的一个或多个软件和/或研发工具。SaaS可包括例如由服务提供商做主并在网络上可访问的软件(例如应用软件和/或与该软件应用关联的数据可被保持在网络上,例如Google App、SalesForce)。DaaS可包括例如在网络上为用户提供桌面、应用、数据和/或服务(例如提供多应用框架、框架中的应用、与应用关联的数据和/或与应用和/或网络上的数据关联的服务(例如Citrix))。前面内容旨在是本申请中提到的多种类型的系统和/或方法作为“云”或“云计算”的示例并且不应当认为是完全的或穷尽的。As used in this application, cloud or cloud services may include one or more of the following: Infrastructure as a Service (IaaS), Platform as a Service (PaaS), Software as a Service (SaaS), and/or Desktop as a Service (DaaS). As a non-exclusive example, IaaS may include, for example, one or more virtual server instances that can start, stop, access, and/or configure virtual servers and/or storage centers (e.g., providing one or more processors, storage space, and/or network resources on demand, such as EMC and Rackspace). PaaS may include, for example, one or more software and/or development tools hosted on an infrastructure (e.g., a computing platform and/or solution stack from which clients can generate software interfaces and applications (e.g., Microsoft Azure)). SaaS may include, for example, software hosted by a service provider and accessible on a network (e.g., application software and/or data associated with the software application can be maintained on the network, such as Google Apps, SalesForce). DaaS may include, for example, providing desktops, applications, data, and/or services to users on a network (e.g., providing a multi-application framework, applications within the framework, data associated with the application, and/or services associated with the application and/or data on the network (e.g., Citrix)). The foregoing is intended to be exemplary of the various types of systems and/or methods mentioned in this application as "clouds" or "cloud computing" and should not be considered complete or exhaustive.
在许多交易中,自动化的增殖是显而易见的。例如,自动取款机(ATM)发钱和接受储蓄。航空公司售票柜台机检查乘客签到、发票并允许乘客改变或更新航班。列车和地铁售票柜台机允许乘客购买至具体目的地的票而根本不需要启用人类交互。许多杂货店和药店具有自助结账机,其允许消费者通过仅与机器交互而对购买的商品进行支付。大公司如今配备电话答复系统,该系统具有与消费者交互的机器,并仅在如果利于机器的交易中存在问题的情况下在交易中有人参与。The proliferation of automation is evident in many transactions. For example, automated teller machines (ATMs) dispense money and accept deposits. Airline ticket counters check passenger check-ins, invoices, and allow passengers to change or renew flights. Train and subway ticket counters allow passengers to purchase tickets to specific destinations without requiring human interaction. Many grocery stores and drug stores have self-service checkout machines that allow consumers to pay for their purchases simply by interacting with the machine. Large companies now have telephone answering systems that have machines that interact with consumers and only involve a human in a transaction if there is a problem with the machine-assisted transaction.
然而,随着这种自动化增加,便利性和可访问性可能降低。在杂货店的自结账机可能难以操作。ATM和售票柜台机对于残疾人或需要特殊访问的人来说多数情况下是不可访问的。在这以前,与人的交互使得残疾人相对容易地完成交易,如果残疾人无法按下ATM上的按钮,那么机器无法利于交易完成。尽管这些公共终端中的一些允许讲话操作,但它们被配置成最一般形式的讲话,对于识别特定讲话者来说用处不大,由此导致尝试对该机器讲话的用户感到受挫。对于残疾人来说,这个问题尤其具有挑战性,这些残疾人已在通过自动化机器完成交易时可能遇到重大的挑战。However, as this automation increases, convenience and accessibility may decrease. Self-checkout machines at grocery stores may be difficult to operate. ATMs and ticket counters are often inaccessible to people with disabilities or those who require special access. Previously, human interaction made it relatively easy for people with disabilities to complete transactions, but if a person with a disability cannot press the button on the ATM, the machine cannot facilitate the transaction. Although some of these public terminals allow speech operation, they are configured for the most general form of speech and are less useful for identifying specific speakers, resulting in frustration for users who try to speak to the machine. This problem is particularly challenging for people with disabilities, who already face significant challenges completing transactions through automated machines.
另外,智能电话和平板设备如今也被配置以接收讲话命令。受讲话和语音控制的汽车系统现在经常出现在机动交通工具中,甚至在经济型、大量生产的交通工具中。家庭娱乐设备(例如盘播放机、电视机、无线电、立体声等)可对讲话命令作出回应。另外,家庭安全系统可对讲话命令作出回应。在办公室设定中,工作者的计算机可对来自工作者的讲话作出回应,以允许更快、更有效的工作流程。这些系统和机器可被训练以针对特定用户工作,或者通过明确的训练或通过重复的交互。然而,当系统更新或更换时(例如购买新电视机时),训练可能随着设备而丢失。因此,在本文描述的一些实施例中,用于讲话识别系统的适应数据可能与识别讲话的设备是分离的,并可更靠近地关联于用户,例如通过由用户携带的设备或通过与用户关联的网络位置。In addition, smartphones and tablet devices are now also configured to receive spoken commands. Automotive systems controlled by speech and voice are now common in motor vehicles, even in economical, mass-produced vehicles. Home entertainment devices (such as disc players, televisions, radios, stereos, etc.) can respond to spoken commands. In addition, home security systems can respond to spoken commands. In office settings, workers' computers can respond to speeches from workers to allow faster and more efficient workflows. These systems and machines can be trained to work for specific users, either through explicit training or through repeated interactions. However, when the system is updated or replaced (such as when buying a new television), the training may be lost along with the equipment. Therefore, in some embodiments described herein, the adaptation data for the speech recognition system may be separated from the equipment that recognizes speech and may be more closely associated with the user, such as through a device carried by the user or through a network location associated with the user.
此外,在一些环境中,可以有一个以上的设备在与用户交互的范围内发送和接收数据。例如,仅坐在沙发上看电视可能涉及五种或多种设备,例如电视机、机顶盒和音频/视频接收机、遥控器以及智能电话设备。这些设备中的一些可发送或接收讲话数据。这些设备中的一些可发送、接收或存储适应数据,如本文中更详细描述的那样。因此,在一些实施例中(将在本文中更详细地予以描述)可存在一些方法、系统和设备,用于确定系统中的哪些设备应当执行动作,所述动作允许用户通过用户讲话有效地与期望的设备形成交互。In addition, in some environments, more than one device may send and receive data within the scope of interaction with the user. For example, simply sitting on a sofa watching TV may involve five or more devices, such as a television, a set-top box and an audio/video receiver, a remote control, and a smart phone device. Some of these devices may send or receive speech data. Some of these devices may send, receive, or store adaptation data, as described in more detail herein. Therefore, in some embodiments (which will be described in more detail herein), there may be some methods, systems, and devices for determining which devices in a system should perform actions that allow the user to effectively interact with the desired device through the user's speech.
现在参见图1(例如图1A),图1A示出一示例环境100,其中根据各实施例的方法、系统、电路、制品和计算机程序产品以及架构可通过下面列出的一个或多个来实现:个人设备20A、个人设备20B、中间设备40、目标设备30A、目标设备30B。在一些实施例中,例如图1B所示,个人设备120,其在一些实施例中可以是个人设备20A、个人设备20B和中间设备40中的一个的例子。个人设备120在各实施例中可被赋予有:被设计成管理适应数据的逻辑,其中适应数据关联于特定方讲话的至少一个方面;被设计成响应与特定方的利于讲话的交易关联的指示,促进适应数据传输至目标设备的逻辑,其中适应数据关联于特定方的讲话的至少一个方面;以及被设计成确定是否更新适应数据的逻辑,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。Referring now to FIG. 1 (e.g., FIG. 1A ), FIG. 1A illustrates an example environment 100 in which methods, systems, circuits, articles of manufacture, computer program products, and architectures according to various embodiments may be implemented by one or more of the following: personal device 20A, personal device 20B, intermediary device 40, target device 30A, and target device 30B. In some embodiments, such as shown in FIG. 1B , personal device 120, which in some embodiments may be an example of one of personal device 20A, personal device 20B, and intermediary device 40, may be provided. In various embodiments, personal device 120 may be provided with: logic configured to manage adaptation data, wherein the adaptation data is associated with at least one aspect of speech by a particular party; logic configured to facilitate transmission of adaptation data to a target device in response to an indication associated with a speech-facilitated transaction with the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party; and logic configured to determine whether to update the adaptation data, the determination being based at least in part on the outcome of at least a portion of the speech-facilitated transaction.
再次参见图1A的示例性实施例,用户105可通过终端设备30A和30B中的一个或多个从事利于讲话的交易。在一些实施例中,利于讲话的交易可针对终端设备30A或终端设备30B中的一个。在一些实施例中,用户可以不使她的讲话特别地朝向终端设备30A或终端设备30B,而是朝向它们两者,对于哪个设备执行利于讲话的交易则是无关紧要的。在一些实施例中,终端设备30A和终端设备30B中的一个在它们之间作出协商以确定哪个设备将执行利于讲话的交易。在一些实施例中,个人设备20A、个人设备20B和中间设备40中的一个或多个可确定终端设备30A和终端设备30B中的哪一个执行利于讲话的交易。在一些实施例中,个人设备20A、个人设备20B和中间设备40中的一个或多个可检测终端设备30A和终端设备30B中的一个或多个,与终端设备30A、30B中的一个或多个建立连接或协商。Referring again to the exemplary embodiment of FIG. 1A , user 105 can engage in a speech-facilitated transaction via one or more of terminal devices 30A and 30B. In some embodiments, the speech-facilitated transaction can be directed to either terminal device 30A or terminal device 30B. In some embodiments, the user can direct her speech not specifically toward terminal device 30A or terminal device 30B, but toward both, with it being irrelevant which device performs the speech-facilitated transaction. In some embodiments, one of terminal device 30A and terminal device 30B negotiates between themselves to determine which device will perform the speech-facilitated transaction. In some embodiments, one or more of personal device 20A, personal device 20B, and intermediary device 40 can determine which of terminal device 30A and terminal device 30B will perform the speech-facilitated transaction. In some embodiments, one or more of personal device 20A, personal device 20B, and intermediary device 40 can detect one or more of terminal device 30A and terminal device 30B and establish a connection or negotiation with one or more of terminal devices 30A and 30B.
图1A的环境100中示出的虚线箭头未被标号,但旨在表示从一个设备至另一设备的数据流。一些数据连接为简化附图而被省去,例如尽管没有箭头,然而个人设备20A可直接与终端设备30A和终端设备30B通信。数据流可包括以任何格式出现的一个或多个适应数据、讲话数据,包括来自用户的原话、适应结果数据、期望的目标数据、目标数据等等。虚线箭头示出用户105与个人设备20A、个人设备20B和中间设备40中的一个或多个之间的联系。The dotted arrows shown in the environment 100 of Figure 1A are not numbered, but are intended to represent the data flow from one device to another. Some data connections are omitted for simplifying the accompanying drawings, and for example, although there are no arrows, personal device 20A can directly communicate with terminal device 30A and terminal device 30B. Data flow can include one or more adaptation data, speech data that occur in any format, including the original words from the user, adaptation result data, desired target data, target data, etc. The dotted arrows illustrate the contact between one or more of user 105 and personal device 20A, personal device 20B and intermediate device 40.
尽管图1A中未示出,然而个人设备20A、20B和40中的任意一个或全部可或者直接地或者间接地与终端设备30A、终端设备30B中的任意一个或全部通信。在一些实施例中,这些设备经由服务器110彼此通信,所述服务器110可以在设备20A、20B、30A、30B和40中的任何一个的本地或远离这些设备中的任何一个。在一些实施例中,这些设备经由一个或多个通信网络140彼此通信,该通信网络140可以在设备20A、20B、30A、30B和40中的任何一个的本地或远离这些设备中的任何一个。尽管服务器110和通信网络40被绘出在图1A和图1C-1E中的每个实施例中,然而服务器110和通信网络140并非必需的,并仅为解说目的而被示出。Although not shown in FIG1A , any or all of the personal devices 20A, 20B, and 40 may communicate, either directly or indirectly, with any or all of the terminal devices 30A and 30B. In some embodiments, these devices communicate with each other via a server 110, which may be local to or remote from any of the devices 20A, 20B, 30A, 30B, and 40. In some embodiments, these devices communicate with each other via one or more communication networks 140, which may be local to or remote from any of the devices 20A, 20B, 30A, 30B, and 40. Although server 110 and communication network 40 are depicted in each of the embodiments in FIG1A and FIG1C-1E , server 110 and communication network 140 are not required and are shown for illustrative purposes only.
再次参见图1A,图1A示出个人设备20A、个人设备20B、中间设备40、终端设备30A、终端设备30B和服务器110。所示出的设备数仅为了说明的目的。然而,在一些实施例中,可能存在不同数量的个人设备、中间设备、终端设备、服务器和通信网络。在一些实施例中,可完全地省去个人设备、中间设备、终端设备、服务器和通信网络中的一个或多个。Referring again to FIG. 1A , FIG. 1A illustrates a personal device 20A, a personal device 20B, an intermediary device 40, a terminal device 30A, a terminal device 30B, and a server 110. The number of devices shown is for illustrative purposes only. However, in some embodiments, there may be a varying number of personal devices, intermediary devices, terminal devices, servers, and communication networks. In some embodiments, one or more of the personal devices, intermediary devices, terminal devices, servers, and communication networks may be completely omitted.
再次参见图1A,个人设备20A、20B被图示为与用户105关联的。这种关联可能衰弱,例如它们可能仅在同一物理接近下。在其它实施例中。关联可以是拥有者、双方合同、信息存储、之前使用或其它因素中的一个。本文进一步描述的例子将提供用户105与个人设备120(例如个人设备20A或个人设备20B)之间的关系的例子的非穷尽列表。在一些实施例中,个人设备120(例如个人设备20A或个人设备20B)可以是任何尺寸并具有任何规范。个人设备120(例如个人设备20A或个人设备20B)可以是任何形状或尺寸的定制设备,其被配置成发送、接收和存储数据。个人设备120(例如个人设备20A或个人设备20B)可包括但不限于智能电话设备、平板设备、个人计算机设备、膝上计算机设备、钥匙链设备、钥匙、个人数字助理设备、改型的记忆棒、通用遥控器或任何其它电子器件。另外,个人设备120(例如个人设备20A或个人设备20B)可以是改型的佩戴对象,例如眼镜、钱包、信用卡、手表、挂链或衣服饰件。被配置成存储、发送和接收数据的任何东西都可以是个人设备120(例如个人设备20A或个人设备20B),并且个人设备120(例如个人设备20A或个人设备20B)的尺寸并非局限于能够由用户携带的设备尺寸。另外,个人设备120(例如个人设备20A或个人设备20B)可以不是直接靠近用户的,例如,个人设备120(例如个人设备20A或个人设备20B)可以是坐落在用户家里或办公室内的桌子上的计算机。Referring again to FIG1A , personal devices 20A, 20B are illustrated as being associated with user 105. This association may be weak, for example, they may only be in the same physical proximity. In other embodiments, the association may be one of an owner, a contract between the two parties, information storage, previous use, or other factors. The examples further described herein will provide a non-exhaustive list of examples of the relationship between user 105 and personal device 120 (e.g., personal device 20A or personal device 20B). In some embodiments, personal device 120 (e.g., personal device 20A or personal device 20B) can be any size and have any specifications. Personal device 120 (e.g., personal device 20A or personal device 20B) can be a custom device of any shape or size that is configured to send, receive, and store data. Personal device 120 (e.g., personal device 20A or personal device 20B) may include, but is not limited to, a smartphone device, a tablet device, a personal computer device, a laptop device, a keychain device, a key, a personal digital assistant device, a modified memory stick, a universal remote control, or any other electronic device. In addition, the personal device 120 (e.g., personal device 20A or personal device 20B) can be a modified wearable object, such as glasses, a wallet, a credit card, a watch, a necklace, or a clothing accessory. Anything configured to store, send, and receive data can be a personal device 120 (e.g., personal device 20A or personal device 20B), and the size of the personal device 120 (e.g., personal device 20A or personal device 20B) is not limited to the size of a device that can be carried by the user. In addition, the personal device 120 (e.g., personal device 20A or personal device 20B) may not be directly close to the user. For example, the personal device 120 (e.g., personal device 20A or personal device 20B) can be a computer located on a desk in the user's home or office.
尽管终端设备30A、30B(在下文中可互换地统称为或个别地称为“终端设备30*”)被描述为“终端设备”,然而这仅仅是为了简化说明。终端设备30*可以是被配置成接收讲话的任何设备。例如,终端设备30*可以是终端、计算机、导航系统、电话、一件家庭电子器件(例如DVD播放机、蓝光播放机、媒体播放机、游戏系统、电视机、接收机、警报钟等等)。在一些实施例中,终端设备30*可以是被配置成接收讲话的家庭安全系统、安全锁、门锁、厨房设施等等。在一些实施例中,终端设备30*可以是机动交通工具,例如轿车、船、飞机、摩托车、高尔夫球车、轮椅车等等。在一些实施例中,终端设备30*可以是一件便携式电子器件,例如膝上计算机、上网本计算机、平板设备、智能电话、蜂窝电话、无线电、便携式导航系统或者能够接收讲话的任何其它件的电子器件。终端设备30*可以是企业解决方案的一部分,例如办公室内的共同工作站、复印机、扫描仪、小间内的个人工作站、办公室目录、交互屏幕以及电话。这些例子和列表不旨在是穷尽性的,而是仅示出终端设备的几个例子。这些例子中的一些相对于图1C、图1D和图1E被更详细地示出。Although the terminal devices 30A, 30B (hereinafter interchangeably referred to collectively or individually as "terminal devices 30*") are described as "terminal devices", this is merely for simplicity of description. The terminal device 30* can be any device configured to receive speech. For example, the terminal device 30* can be a terminal, a computer, a navigation system, a phone, a piece of home electronics (such as a DVD player, a Blu-ray player, a media player, a gaming system, a television, a receiver, an alarm clock, etc.). In some embodiments, the terminal device 30* can be a home security system, a security lock, a door lock, a kitchen appliance, etc. configured to receive speech. In some embodiments, the terminal device 30* can be a motor vehicle, such as a car, a boat, an airplane, a motorcycle, a golf cart, a wheelchair, etc. In some embodiments, the terminal device 30* can be a portable electronic device, such as a laptop computer, a netbook computer, a tablet device, a smart phone, a cellular phone, a radio, a portable navigation system, or any other piece of electronic device capable of receiving speech. The terminal device 30* can be part of an enterprise solution, such as a shared workstation in an office, a copier, a scanner, a personal workstation in a cubicle, an office directory, an interactive screen, and a telephone. These examples and lists are not intended to be exhaustive, but rather to illustrate only a few examples of terminal devices. Some of these examples are shown in greater detail with respect to FIG1C , FIG1D , and FIG1E .
在一些实施例中,目标设备30A和/或目标设备30B在本文将更详细描述的方法中从个人设备120(例如个人设备20A或个人设备20B)接收适应数据。在一些实施例中,适应数据在一个或多个通信网络40上被发送。在各实施例中,通信网络40可包括局域网(LAN)、广域网(WAN)、城域网(MAN)、无线局域网(WLAN)、个人局域网(PAN)、微波接入全球互通(WiMax)、公共交换电话网(PTSN)、通用分组无线服务(GPRS)网、蜂窝网等中的一个或多个。通信网络40可以是有线网络、无线网络或者是有线网络和无线网络的组合。要注意,“通信网络”在这里指可以彼此互通或者不互通的一个或多个通信网络。In some embodiments, the target device 30A and/or the target device 30B receives adaptation data from the personal device 120 (e.g., personal device 20A or personal device 20B) in a method described in more detail herein. In some embodiments, the adaptation data is sent over one or more communication networks 40. In various embodiments, the communication network 40 may include one or more of a local area network (LAN), a wide area network (WAN), a metropolitan area network (MAN), a wireless local area network (WLAN), a personal area network (PAN), a World Interoperability for Microwave Access (WiMax), a public switched telephone network (PTSN), a general packet radio service (GPRS) network, a cellular network, and the like. The communication network 40 may be a wired network, a wireless network, or a combination of a wired network and a wireless network. It should be noted that the term "communication network" herein refers to one or more communication networks that may or may not be interoperable with each other.
在一些实施例中,适应数据不直接来自个人设备120(例如个人设备20A或个人设备20B)。在一些实施例中,个人设备120(例如个人设备20A或个人设备20B)仅例如通过提供地址、凭证、指令、授权和推荐中的一个或多个促进适应数据的通信。例如,在一些实施例中,个人设备120(例如个人设备20A或个人设备20B)提供服务器10上可接收适应数据的位置。在一些实施例中,个人设备120(例如个人设备20A或个人设备20B)一旦有来自终端设备30的请求则从服务器10检索适应数据,并随后将适应数据中继至终端设备30或促进适应数据的中继。In some embodiments, the adaptation data does not come directly from the personal device 120 (e.g., personal device 20A or personal device 20B). In some embodiments, the personal device 120 (e.g., personal device 20A or personal device 20B) merely facilitates the communication of the adaptation data, for example, by providing one or more of an address, credentials, instructions, authorization, and a recommendation. For example, in some embodiments, the personal device 120 (e.g., personal device 20A or personal device 20B) provides a location on the server 10 where the adaptation data can be received. In some embodiments, the personal device 120 (e.g., personal device 20A or personal device 20B) retrieves the adaptation data from the server 10 upon a request from the terminal device 30 and then relays the adaptation data to the terminal device 30 or facilitates the relay of the adaptation data.
在一些实施例中,个人设备120(例如个人设备20A或个人设备20B)例如以预定的、规则的或其它方式定义的间隔广播适应数据,不管终端设备30是否正在侦听。在其它实施例中,个人设备120(例如个人设备20A或个人设备20B)侦听来自终端设备30的请求,并响应该请求发送或广播适应数据。在一些实施例中,用户5确定个人设备120(例如个人设备20A或个人设备20B)何时广播适应数据。在又一些其它实施例中,第三方(未示出)触发适应数据至终端设备30的发送,其中发送是由个人设备120(例如个人设备20A或个人设备20B)促进的。In some embodiments, personal device 120 (e.g., personal device 20A or personal device 20B) broadcasts adaptation data, for example, at predetermined, regular, or otherwise defined intervals, regardless of whether or not end device 30 is listening. In other embodiments, personal device 120 (e.g., personal device 20A or personal device 20B) listens for a request from end device 30 and transmits or broadcasts adaptation data in response to the request. In some embodiments, user 5 determines when personal device 120 (e.g., personal device 20A or personal device 20B) broadcasts adaptation data. In still other embodiments, a third party (not shown) triggers the transmission of adaptation data to end device 30, where the transmission is facilitated by personal device 120 (e.g., personal device 20A or personal device 20B).
图1B示出示例性环境100中的个人设备120的更详细描述。个人设备120可以是图1A的个人设备20A或20B、图1A的中间设备40、图1C的第一个人设备21A、图1D的第二个人设备21B、图1C的设备31的模块之一、图1D的个人设备22A、图1D的个人设备22B、图1D的设备51、52、53和54中的任何一个、图1E的智能键设备26、图1E的GPS导航设备41等的一个例子。前面的内容不旨在是与图1B的个人设备120对应的可能设备的穷尽性说明,而是仅为可具有图1B概括出的结构的设备类型的例子。FIG1B shows a more detailed description of a personal device 120 in the exemplary environment 100. The personal device 120 may be an example of the personal device 20A or 20B of FIG1A , the intermediate device 40 of FIG1A , the first personal device 21A of FIG1C , the second personal device 21B of FIG1D , one of the modules of the device 31 of FIG1C , the personal device 22A of FIG1D , the personal device 22B of FIG1D , any of the devices 51 , 52 , 53 , and 54 of FIG1D , the smart key device 26 of FIG1E , the GPS navigation device 41 of FIG1E , etc. The foregoing is not intended to be an exhaustive description of possible devices corresponding to the personal device 120 of FIG1B , but rather is merely an example of the types of devices that may have the structure outlined in FIG1B .
再次参见图1B,在各实施例中,个人设备120尤其可包括处理器132、存储器134、用户接口135、讲话检测接口138以及数据传输接口137。这些部件中的每一个可以不出现在个人设备120的各个实施例中,例如一些个人设备120可以不具有讲话检测接口138、或存储器134、或用户接口135。1B , in various embodiments, personal device 120 may include, among other things, a processor 132, a memory 134, a user interface 135, a speech detection interface 138, and a data transmission interface 137. Each of these components may not be present in various embodiments of personal device 120; for example, some personal devices 120 may not have speech detection interface 138, memory 134, or user interface 135.
处理器132可包括一个或多个微处理器、中央处理单元(CPU)、图形处理单元(GPU)、物理处理单元、数字信号处理器、网络处理器、浮点处理器等等。在一些实施例中,处理器132可以是服务器。在一些实施例中,处理器132可以是分布式核处理器。尽管处理器132是作为单个个人设备120的部件的单个处理器,然而处理器132可以是分布在一个或多个计算设备30上的多个处理器,这些处理器可配置成一起工作或者不一起工作。处理器132被图示为配置成执行计算机可读指令以执行如前所述和如图5、图6A-6B、图7A-7D和图8A-8I中示出的一个或多个操作。在一些实施例中,处理器132被设计成配置以作为处理模块150工作,该处理模块150可包括下面列出的一个或多个:关联于特定方的至少一个方面的讲话适应数据调节模块152、响应包括目标设备的利于讲话的交易的指示而传输讲话适应数据的促进模块154以及部分地基于利于讲话的一部分的结果确定讲话适应数据可能的修正的模块156。Processor 132 may include one or more microprocessors, central processing units (CPUs), graphics processing units (GPUs), physical processing units, digital signal processors, network processors, floating-point processors, and the like. In some embodiments, processor 132 may be a server. In some embodiments, processor 132 may be a distributed core processor. Although processor 132 is a single processor as part of a single personal device 120, processor 132 may be multiple processors distributed across one or more computing devices 30, which may or may not be configured to work together. Processor 132 is illustrated as being configured to execute computer-readable instructions to perform one or more operations as previously described and illustrated in Figures 5, 6A-6B, 7A-7D, and 8A-8I. In some embodiments, processor 132 is designed and configured to operate as processing module 150, which may include one or more of the following: a speech adaptation data conditioning module 152 associated with at least one aspect of a particular party, a facilitation module 154 for transmitting speech adaptation data in response to an indication of a speech-facilitated transaction involving a target device, and a module 156 for determining possible revisions to the speech adaptation data based in part on the results of the speech-facilitated portion.
再次参见图1B,如前所述,个人设备120可包括存储器134。在一些实施例中,存储器134可包括下面列出的一个或多个:一个或多个海量存储设备;只读存储器(ROM);可编程只读存储器(PROM);可擦除可编程只读存储器(EPROM);例如随机存取存储器(RAM)、闪存、同步随机存取存储器(SRAM)、动态随机存取存储器(DRAM)的高速缓冲存储器;和/或其它类型的存储设备。在一些实施例中,存储器134可位于单个网络地点。在一些实施例中,存储器134可位于多个网络地点,包括彼此远离的地点。Referring again to FIG. 1B , as previously described, personal device 120 may include memory 134. In some embodiments, memory 134 may include one or more of the following: one or more mass storage devices; read-only memory (ROM); programmable read-only memory (PROM); erasable programmable read-only memory (EPROM); cache memory such as random access memory (RAM), flash memory, synchronous random access memory (SRAM), dynamic random access memory (DRAM); and/or other types of storage devices. In some embodiments, memory 134 may be located at a single network location. In some embodiments, memory 134 may be located at multiple network locations, including locations that are remote from one another.
再次参见图1B,如前所述,个人设备120可包括用户接口135。用户接口可以硬件或软件或它们两者来实现,并可包括各种输入和输出设备以允许个人设备120的操作者与个人设备120交互。例如,用户接口135可包括但不限于:音频显示器(例如扬声器108);视频显示器(例如屏幕102);麦克风;相机;键盘(例如键盘103);跟踪球(例如跟踪球104);鼠标(例如鼠标105);一个或多个软键(例如硬/软键106);触摸输入(例如触摸屏107,它也可以是视频显示屏幕);摇杆;游戏控制器;触摸垫;听筒;或允许设备和用户之间交互的任何其它设备。Referring again to FIG1B , as previously described, personal device 120 may include a user interface 135. The user interface may be implemented in hardware, software, or both, and may include various input and output devices to allow an operator of personal device 120 to interact with personal device 120. For example, user interface 135 may include, but is not limited to: an audio display (e.g., speaker 108); a video display (e.g., screen 102); a microphone; a camera; a keyboard (e.g., keyboard 103); a trackball (e.g., trackball 104); a mouse (e.g., mouse 105); one or more soft keys (e.g., hard/soft keys 106); a touch input (e.g., touch screen 107, which may also be a video display screen); a joystick; a game controller; a touch pad; an earpiece; or any other device that allows interaction between the device and a user.
再次参见图1B,如前所述,个人设备120可包括讲话检测接口138。讲话检测接口138可被配置成接收和/或处理作为输入的讲话或观察和/或记录利于讲话的交易的讲话。尽管在一些实施例中不存在,然而在一些实施例中,讲话检测接口138可包括讲话指示接收机112,其可以是任何类型的传感器、或者是接收信号的通信端口、或者是检测按钮按压的传感器、或者是能够检测环境100(不管在设备内部还是外部)中任何类型状态的改变的任何其它模块。讲话检测接口138在一些实施例中可包括麦克风110,它可与讲话指示接收机112通信或不通信。在一些实施例中,麦克风110可或者选择性地或者常开地检测讲话,并可通过讲话指示接收机112和处理器132中的一个或多个受到控制。Referring again to Figure 1B, as previously described, the personal device 120 may include a speech detection interface 138. The speech detection interface 138 may be configured to receive and/or process speech as input or to observe and/or record speech for speech-facilitating transactions. Although not present in some embodiments, in some embodiments, the speech detection interface 138 may include a speech indication receiver 112, which may be any type of sensor, or a communication port that receives a signal, or a sensor that detects a button press, or any other module capable of detecting any type of change of state in the environment 100 (whether internal or external to the device). The speech detection interface 138 may, in some embodiments, include a microphone 110, which may or may not be in communication with the speech indication receiver 112. In some embodiments, the microphone 110 may be enabled to detect speech, either selectively or constantly, and may be controlled by one or more of the speech indication receiver 112 and the processor 132.
再次参见图1B,如前所述,个人设备120可包括数据传输接口137。数据传输接口137在一些实施例中可处理通过设备的数据发送和接收。例如,在一些实施例中,数据传输接口137可包括适应数据发送机/接收机114,其处理在任何类型的网络或内部形式通信(例如内部总线等等)上的适应数据的接收和发送。数据传输接口137在一些实施例中可包括讲话数据发送机/接收机116,其可处理在任何形式移动数据上的讲话数据(包括原话)的接收和发送。Referring again to FIG. 1B , as previously described, the personal device 120 may include a data transmission interface 137. In some embodiments, the data transmission interface 137 may handle the transmission and reception of data through the device. For example, in some embodiments, the data transmission interface 137 may include an adaptive data transmitter/receiver 114 that handles the transmission and reception of adaptive data over any type of network or internal form of communication (e.g., an internal bus, etc.). In some embodiments, the data transmission interface 137 may include a speech data transmitter/receiver 116 that handles the transmission and reception of speech data (including original speech) over any form of mobile data.
再次参见图1B,如前所述,个人设备120可具有一个或多个传感器182。这些传感器包括但不限于:全球定位系统(GPS)传感器;静止相机;视频相机;高度计;空气质量传感器;气压计;加速度计;电荷耦合器件;无线电;温度计;计步器;心脏监视器;湿气传感器;湿度传感器;麦克风;地震仪以及磁场传感器。传感器182可与传感器接口180连接。尽管图1B将传感器182图示为个人设备120的一部分,然而在一些实施例中,传感器182可与个人设备120分离并经由一个或多个通信网络(例如通信网络140)进行通信。Referring again to FIG. 1B , as previously described, personal device 120 may include one or more sensors 182 . These sensors include, but are not limited to, a global positioning system (GPS) sensor; a still camera; a video camera; an altimeter; an air quality sensor; a barometer; an accelerometer; a charge-coupled device (CCD); a radio; a thermometer; a pedometer; a heart rate monitor; a moisture sensor; a humidity sensor; a microphone; a seismometer; and a magnetic field sensor. Sensor 182 may be connected to sensor interface 180 . Although FIG. 1B illustrates sensor 182 as part of personal device 120 , in some embodiments, sensor 182 may be separate from personal device 120 and communicate via one or more communication networks (e.g., communication network 140 ).
现在参见图1C,图1C示出示例性环境100’的示例性实施例,该示例性环境100’是环境100的一个非限定例。如图1C所示,环境100’可包括用户(未示出),该用户可具有第一个人设备21A和第二个人设备21B中的一个或多个。尽管个人设备21A和个人设备21B两者均可以是前面描述的任何形式个人设备120,但第一个人设备21A可以是例如USB驱动器,而第二个人设备21B可以是例如蜂窝电话设备。第一个人设备21A和第二个人设备21B中的一个或多个可与设备31交互,设备31可以是任何类型的计算设备,例如膝上计算机、台式计算机、服务器、上网本、平板设备、智能电话等等。设备31可具有装载在其上的操作系统软件81。操作系统软件81可包括但不限于Microsoft Windows、Google Android、Apple iOS、Apple Mountain Lion、UNIX、Linux、Chrome OS、Symbian等等。Referring now to FIG. 1C , FIG. 1C illustrates an exemplary embodiment of an exemplary environment 100′, which is a non-limiting example of environment 100. As shown in FIG. 1C , environment 100′ may include a user (not shown), who may have one or more of a first personal device 21A and a second personal device 21B. While both personal device 21A and personal device 21B may be any of the personal devices 120 described above, first personal device 21A may be, for example, a USB drive, while second personal device 21B may be, for example, a cellular phone. One or more of first personal device 21A and second personal device 21B may interact with device 31, which may be any type of computing device, such as a laptop, desktop computer, server, netbook, tablet device, smartphone, or the like. Device 31 may have operating system software 81 loaded thereon. Operating system software 81 may include, but is not limited to, Microsoft Windows, Google Android, Apple iOS, Apple Mountain Lion, UNIX, Linux, Chrome OS, Symbian, and the like.
另外,在一些实施例中,设备31可包括板上的企业客户软件82。例如,一些系统(例如在办公室环境中)可具有客户软件(例如Citrix等),所述客户软件被装载在它们的系统上以整合它们的工作者的用户体验。在一些实施例中,该模块可起作用以确定讲话数据(例如讲话数据101)的解释和适应数据的施加的角色。在一些实施例中,设备31也可包括第一应用软件91和第二应用软件92中的一个或多个。第一和第二应用软件91、92可以是任何类型的应用,例如游戏、电子表格、字处理器、web浏览器、聊天客户端、图片观看器、图片操纵器、网络摄像机应用等等。在一些实施例中,这些模块可起作用以确定讲话数据的解释和适应数据的施加的角色。例如,应用的复杂性可起作用以确定在应用层发生了多少次讲话处理。在一些实施例中,设备31可与一个或多个通信网络140和一个或多个服务器110通信。Additionally, in some embodiments, the device 31 may include onboard enterprise client software 82. For example, some systems (e.g., in an office environment) may have client software (e.g., Citrix, etc.) loaded onto their systems to integrate the user experience of their workers. In some embodiments, this module may function to determine the interpretation of speech data (e.g., speech data 101) and the role of the application that adapts the data. In some embodiments, the device 31 may also include one or more of a first application software 91 and a second application software 92. The first and second application software 91, 92 may be any type of application, such as a game, a spreadsheet, a word processor, a web browser, a chat client, a picture viewer, a picture manipulator, a webcam application, and the like. In some embodiments, these modules may function to determine the interpretation of speech data and the role of the application that adapts the data. For example, the complexity of the application may function to determine how much speech processing occurs at the application layer. In some embodiments, the device 31 may communicate with one or more communication networks 140 and one or more servers 110.
现在参见图1D,图1D示出示例性环境100”的一个示例性实施例,它是环境100的非限定例。如图1D所示,环境100”可包括用户105,该用户可具有个人设备22A和个人设备22B中的一个或多个。尽管个人设备22A和个人设备22B两者均可以是如前所述的任何形式个人设备120,但个人设备22A可以是例如通用遥控器而个人设备22B可以是例如蜂窝电话设备。在一些实施例中,个人设备22A、个人设备22B中的一个或两者以及计算设备54可发送、存储和/或接收适应数据。在一些实施例中,个人设备22A、个人设备22B和计算设备54中的一个可确定用户105正在朝向图1D所示的哪个设备讲话。在其它实施例中,接收机设备51、媒体播放机设备52和电视机设备53中的一个或多个可往返地发送讲话数据和适应数据中的一个或多个,而接收机设备51、媒体播放机设备52和电视机设备53中的一个或多个可确定设备22A、22B、51、52、53和54中的哪个设备应当施加适应数据、而哪个设备应当处理讲话数据。Referring now to FIGURE 1D, FIGURE 1D illustrates an exemplary embodiment of exemplary environment 100", which is a non-limiting example of environment 100. As shown in FIGURE 1D, environment 100" may include user 105, who may have one or more of personal device 22A and personal device 22B. Although both personal device 22A and personal device 22B may be any form of personal device 120 as previously described, personal device 22A may be, for example, a universal remote control and personal device 22B may be, for example, a cellular telephone device. In some embodiments, one or both of personal device 22A, personal device 22B, and computing device 54 may send, store, and/or receive adaptation data. In some embodiments, one of personal device 22A, personal device 22B, and computing device 54 may determine which device shown in FIGURE 1D the user 105 is speaking toward. In other embodiments, one or more of the receiver device 51, the media player device 52, and the television device 53 may send one or more of speech data and adaptation data back and forth, and one or more of the receiver device 51, the media player device 52, and the television device 53 may determine which device of the devices 22A, 22B, 51, 52, 53, and 54 should apply the adaptation data and which device should process the speech data.
现在参见图1E,图1E示出示例性环境100”’的一个示例性实施例,该示例性环境100”’是环境100的一个非限定例。如图1E所示,环境100”’可包括驾驶汽车(仅示出车内部)的用户(未示出),其中汽车配有机动交通工具控制系统42,该机动交通工具控制系统42可控制汽车的非驾驶特征,例如音乐、气候、温度、燃料管理、座位位置、媒体播放、照明等等。汽车也可具有智能键设备26,其在一些实施例中可存储、接收和/或发送适应数据,或者以无线方式或者通过汽车的系统进行。在一些实施例中,环境100”’也可包括GPS导航设备41,该GPS导航设备41可以是中间设备40的一个例子,该中间设备40也可以是个人设备120。在一些实施例中,GPS导航设备41可充当终端设备,其接收讲话数据和适应数据以处理用户的请求。在其它实施例中,GPS导航设备41可充当个人设备120,其存储从用户的导航命令中衍生的适应数据并当需要时将适应数据发送至目标设备(例如机动交通工具控制系统42)。例如图1A所示的中间设备40以及可以是中间设备40的一个例子的GPS导航设备41可以是用于第一交易的个人设备120和第二交易中的终端设备。在一些实施例中,GPS导航设备41可基于由GPS导航设备41接收的数据分析改变其角色。Now referring to Figure 1E, Figure 1E shows an exemplary embodiment of an exemplary environment 100'", which is a non-limiting example of environment 100. As shown in Figure 1E, environment 100' may include a user (not shown) driving a car (only the interior of the car is shown), wherein the car is equipped with a motor vehicle control system 42, which can control non-driving features of the car, such as music, climate, temperature, fuel management, seat position, media playback, lighting, etc. The car may also have a smart key device 26, which in some embodiments can store, receive and/or send adaptation data, either wirelessly or through the car's system. In some embodiments, environment 100' may also include a GPS navigation device 41, which can be an example of an intermediate device 40, which can also be a personal device 120. In some embodiments, the GPS navigation device 41 can act as a terminal device that receives speech data and adaptation data to process the user's request. In other embodiments, the GPS navigation device 41 may function as the personal device 120, storing adaptation data derived from the user's navigation commands and transmitting the adaptation data to the target device (e.g., the vehicle control system 42) when necessary. For example, the intermediate device 40 shown in FIG. 1A , and the GPS navigation device 41, which may be an example of the intermediate device 40, may be the personal device 120 used in the first transaction and the terminal device in the second transaction. In some embodiments, the GPS navigation device 41 may change its role based on analysis of data received by the GPS navigation device 41.
再次参见图1E,在一些实施例中,GPS导航设备41、机动交通工具控制系统42、智能键设备26以及用户的个人设备(未示出)可与一个或多个通信网络140以及一个或多个服务器110通信。然而,如同在所有图示的示例性实施例中那样,这些要素是选择性的并且一些实施例可排除它们。1E , in some embodiments, the GPS navigation device 41, the motor vehicle control system 42, the smart key device 26, and the user's personal device (not shown) may communicate with one or more communication networks 140 and one or more servers 110. However, as in all of the illustrated exemplary embodiments, these elements are optional and some embodiments may exclude them.
现在参见图2,图2示出关联于特定方的至少一个方面的讲话适应数据调节模块152的示例性实现。如图2所示,该关联于特定方的至少一个方面的讲话适应数据调节模块152在各替代实现和实施例中可包括一个或多个子逻辑模块。例如,如图2所示(例如图2A),在一些实施例中,模块152可包括下面列出的一个或多个:与特定方和设备之间的至少一次在前交互关联的讲话适应数据调节模块202;存储讲话适应数据的地址的获得模块204;确定在特定时间讲话适应数据是否在获得的地址是可得的模块206、关联于特定方的至少一个方面的讲话适应数据获得模块208;关联于特定方访问的至少一个方面的讲话适应数据监视模块210;将一个或多个音节的发音联系至一个或多个概念的讲话的适应数据调节模块212;适应数据地址发生模块214以及适应数据地址提供模块216。Referring now to FIG2 , FIG2 illustrates an exemplary implementation of a speech adaptation data conditioning module 152 associated with at least one aspect of a particular party. As shown in FIG2 , the speech adaptation data conditioning module 152 associated with at least one aspect of a particular party may include one or more sub-logic modules in various alternative implementations and embodiments. For example, as shown in FIG2 (e.g., FIG2A ), in some embodiments, the module 152 may include one or more of the following: a speech adaptation data conditioning module 202 associated with at least one prior interaction between a particular party and a device; a module 204 for obtaining an address for storing speech adaptation data; a module 206 for determining whether speech adaptation data is available at the obtained address at a particular time; a speech adaptation data obtaining module 208 associated with at least one aspect of a particular party; a speech adaptation data monitoring module 210 associated with at least one aspect of a particular party access; an adaptation data conditioning module 212 for associating pronunciation of one or more syllables with speech of one or more concepts; an adaptation data address generating module 214; and an adaptation data address providing module 216.
再次参见图2,例如图2B,在一些实施例中,模块152可包括下面列出的一个或多个:适应数据地址的发生模块218和一旦请求则提供适应数据的地址的模块220。2, such as FIG. 2B, in some embodiments, the module 152 may include one or more of the following: a module 218 for generating an address of the adaptation data and a module 220 for providing the address of the adaptation data upon request.
现在参见图3,图3示出响应包括目标设备的利于讲话的交易的指示的讲话适应数据传输促进模块154的示例性实现。如图3所示,响应包括目标设备的利于讲话的交易的指示的讲话适应数据传输促进模块154在各替代实现和实施例中可包括一个或多个子逻辑模块。例如,如图3所示(例如图3A),在一些实施例中,模块154可包括下面一个或多个:响应包括目标设备的利于讲话的交易的指示包括一个或多个词的列表以及相应发音传输的讲话适应数据促进模块302;响应包括目标设备的利于讲话的交易的指示包括用于修正词频率表传输的指令的讲话适应数据促进模块304;响应包括目标设备的利于讲话的交易的指示包括词语完整算法传输的讲话适应数据促进模块306;以及响应意图执行利于讲话的交易的指示的讲话适应数据传输促进模块308。在一些实施例中,模块308可包括响应从目标设备接收到意图执行利于讲话的交易的指示而传输适应数据的促进模块310。在一些实施例中,模块310可包括下面列出的一个或多个:响应从目标设备接收到特定方与目标设备的输入部分交互的指示的讲话适应数据传输促进模块312;响应接收到特定方正朝向目标设备说话的指示的讲话适应数据传输促进模块314;以及响应讲话适应数据的请求的讲话适应数据传输促进模块316。Referring now to FIG3 , FIG3 illustrates an exemplary implementation of a speech-adaptive data transmission facilitation module 154 responsive to an indication of a speech-facilitated transaction with a target device. As shown in FIG3 , the speech-adaptive data transmission facilitation module 154 responsive to an indication of a speech-facilitated transaction with a target device may include one or more sub-logic modules in various alternative implementations and embodiments. For example, as shown in FIG3 (e.g., FIG3A ), in some embodiments, module 154 may include one or more of the following: a speech-adaptive data facilitation module 302 responsive to an indication of a speech-facilitated transaction with a target device, including a list of one or more words and corresponding pronunciations; a speech-adaptive data facilitation module 304 responsive to an indication of a speech-facilitated transaction with a target device, including a transmission of instructions for modifying a word frequency table; a speech-adaptive data facilitation module 306 responsive to an indication of a speech-facilitated transaction with a target device, including a transmission of a word completion algorithm; and a speech-adaptive data transmission facilitation module 308 responsive to an indication of an intent to perform a speech-facilitated transaction. In some embodiments, module 308 may include a facilitation module 310 for transmitting adaptation data responsive to receiving an indication of an intent to perform a speech-facilitated transaction from a target device. In some embodiments, module 310 may include one or more of the following: a speech-adapted data transmission facilitation module 312 in response to receiving an indication from a target device that a particular party is interacting with an input portion of the target device; a speech-adapted data transmission facilitation module 314 in response to receiving an indication that a particular party is speaking toward the target device; and a speech-adapted data transmission facilitation module 316 in response to a request for speech adaptation data.
再次参见图3,例如图3B,模块154可包括如前所述的模块308。在一些实施例中,模块308可包括响应从特定设备接收的执行利于讲话的交易的意图的指示的讲话适应数据传输促进模块318。3, such as FIG3B, modules 154 may include module 308 as previously described. In some embodiments, module 308 may include a speech-adapted data transfer facilitation module 318 responsive to an indication of intent to perform a speech-facilitated transaction received from a particular device.
再次参见图3,例如图3C,模块154可包括下面列出的一个或多个:关联于特定方讲话的第一方面的第一讲话适应数据发送至目标设备模块320以及关联于特定方讲话的第二方面的第二讲话适应数据发送至目标设备模块322。在一些实施例中,模块320可包括下面列出的一个或多个:关联于特定方的讲话方式的第一讲话适应数据发送至目标设备模块324以及关联于预期词汇的第一讲话适应数据发送至目标设备模块326。在一些实施例中,模块326可包括关联于估计会在利于讲话的交易过程中使用的词汇的第一讲话适应数据至目标设备模块328。在一些实施例中,模块328可包括下面列出的一个或多个:与关联于估计在利于讲话的交易期间使用的食品术语的一个或多个词关联的第一讲话适应数据至目标设备模块330;以及与关联于估计在利于讲话的交易期间使用的一个或多个电子器件的一个或多个词关联的第一讲话适应数据至目标设备模块332。在一些实施例中,模块322可包括响应目标设备请求关联于特定方的讲话的第二方面的第二讲话适应数据发送模块340。Referring again to FIG. 3 , for example, FIG. 3C , module 154 may include one or more of the following: sending first speech adaptation data associated with a first aspect of the particular party's speech to target device module 320 and sending second speech adaptation data associated with a second aspect of the particular party's speech to target device module 322. In some embodiments, module 320 may include one or more of the following: sending first speech adaptation data associated with the particular party's speech style to target device module 324 and sending first speech adaptation data associated with expected vocabulary to target device module 326. In some embodiments, module 326 may include sending first speech adaptation data associated with vocabulary estimated to be used during a speech-facilitated transaction to target device module 328. In some embodiments, module 328 may include one or more of the following: sending first speech adaptation data associated with one or more food terms estimated to be used during a speech-facilitated transaction to target device module 330; and sending first speech adaptation data associated with one or more electronic devices estimated to be used during a speech-facilitated transaction to target device module 332. In some embodiments, module 322 may include sending second speech adaptation data associated with the second aspect of the particular party's speech to target device module 340 in response to a request from the target device.
现在再次参见图3,例如图3D,模块154可包括如前所述的模块320和模块322中的一个或多个。在一些实施例中,模块320可包括下面列出的一个或多个:关联于所说语言的第一讲话适应数据发送至目标设备模块334以及通过具有特定性质的一组成员关联于之前的利于讲话的交易的第一讲话适应数据发送至目标设备模块336。在一些实施例中,模块336可包括通过来自特定社区的一组成员关联于之前的利于讲话的交易的第一讲话适应数据发送至目标设备模块338。在一些实施例中,模块322可包括下面列出的一个或多个:关联于与第一方面不同的第二方面的第二讲话适应数据发送至目标设备模块342。在一些实施例中,模块342可包括与第二组之前的讲话交互关联的第二讲话适应数据发送至目标设备模块344,该第二组之前的讲话交互不同于与第一讲话适应数据关联的第一组。在一些实施例中,模块344可包括与第二组之前的讲话交互关联的第二讲话适应数据发送至目标设备模块346,该第二组之前的讲话交互比关联于第一讲话适应数据的第一组更小。Referring again to FIG. 3 , for example, FIG. 3D , module 154 may include one or more of modules 320 and 322 as described above. In some embodiments, module 320 may include one or more of the following: first speech adaptation data associated with the spoken language is sent to a target device module 334, and first speech adaptation data associated with previous speech-facilitated transactions by a group of members having a particular property is sent to a target device module 336. In some embodiments, module 336 may include sending first speech adaptation data associated with previous speech-facilitated transactions by a group of members from a particular community to a target device module 338. In some embodiments, module 322 may include one or more of the following: second speech adaptation data associated with a second aspect different from the first aspect is sent to a target device module 342. In some embodiments, module 342 may include sending second speech adaptation data associated with a second set of previous speech interactions to a target device module 344, the second set of previous speech interactions being different from the first set associated with the first speech adaptation data. In some embodiments, module 344 may include sending second speech adaptation data associated with a second set of previous speech interactions that is smaller than the first set associated with the first speech adaptation data to a target device module 346.
现在参见图4,图4示出部分地基于利于讲话的交易的一部分的结果的讲话适应数据可能修正确定的模块156的示例性实现。如图4所示,部分地基于利于讲话的交易的一部分的结果的讲话适应数据可能的修正确定的模块156在各替代实现和实施例中可包括一个或多个子逻辑模块。例如,如图4所示(例如图4A),在一些实施例中,模块156可包括下面列出的一个或多个:部分地基于利于讲话的交易的一部分的结果的讲话适应数据修正的模块402;部分地基于利于讲话的交易的一部分的结果相对于修正的讲话适应数据确定的模块404;部分地基于结束的利于讲话的交易的讲话适应数据可能修正确定的模块406;部分地基于利于讲话的交易的最初部分的结果的讲话适应数据可能的修正确定的模块408;部分地基于在利于讲话的交易的第二句子之前发生的利于讲话的交易的第一句子的讲话适应数据可能的修正确定的模块410;部分地基于利于讲话的交易的一部分的结果并在利于讲话的交易结束之前作出的讲话适应数据可能的修正确定的模块412;以及在由特定方说出至少一个词之后部分地基于利于讲话的交易的一部分的结果的讲话适应数据可能的修正确定的模块414。在一些实施例中,模块414可包括部分地基于利于讲话的交易的一部分的结果并在特定词之前作出的讲话适应数据可能的修正确定的模块416。Referring now to Figure 4, an exemplary implementation of a module 156 for determining possible revisions to speech adaptation data based in part on the results of a portion of a speech-facilitated transaction is shown. As shown in Figure 4, the module 156 for determining possible revisions to speech adaptation data based in part on the results of a portion of a speech-facilitated transaction may include one or more sub-logic modules in various alternative implementations and embodiments. For example, as shown in Figure 4 (e.g., Figure 4A), in some embodiments, module 156 may include one or more of the following: a module 402 for modifying speech adaptation data based in part on the results of a portion of a speech-facilitating transaction; a module 404 for determining speech adaptation data relative to the modified data based in part on the results of a portion of a speech-facilitating transaction; a module 406 for determining possible modifications to speech adaptation data based in part on the results of an initial portion of a speech-facilitating transaction; a module 408 for determining possible modifications to speech adaptation data based in part on the results of an initial portion of a speech-facilitating transaction; a module 410 for determining possible modifications to speech adaptation data based in part on a first sentence of a speech-facilitating transaction that occurs before a second sentence of the speech-facilitating transaction; a module 412 for determining possible modifications to speech adaptation data based in part on the results of a portion of a speech-facilitating transaction and made before the speech-facilitating transaction ends; and a module 414 for determining possible modifications to speech adaptation data based in part on the results of a portion of a speech-facilitating transaction after at least one word is spoken by a particular party. In some embodiments, module 414 may include module 416 for determining possible corrections to the utterance adaptation data based in part on the results of a portion of a transaction that facilitated the utterance and that preceded a particular word.
再次参见图4,例如图4B,模块156可包括下面列出的一个或多个部分:利于讲话的交易结果获取模块418;以及部分地基于所获取的结果的讲话适应数据可能的修正确定的模块420。在一些实施例中,模块418可包括下面列出的一个或多个:从目标设备获取利于讲话的交易结果部分的模块422;从特定方获取利于讲话的交易结果部分的模块424;以及指示没有交易获取完全成功的利于讲话的交易结果部分的模块432(它在一些实施例中可例如包括指示交易获取部分的不完整或低置信等级解释的利于讲话的交易结果部分的模块434)。在一些实施例中,模块424可包括下面列出的一个或多个:从通过特定方操作的特定设备获取利于讲话的交易结果部分的模块426(它在一些实施例中可例如包括至少部分地作为特定设备的按下按钮的结果的利于讲话的交易结果部分获取模块428)以及从通过特定方操作的特定设备获取的利于讲话的交易反馈部分的模块430。Referring again to FIG. 4 , for example, FIG. 4B , module 156 may include one or more of the following: a speech-facilitated transaction result acquisition module 418; and a module 420 for determining possible corrections based in part on speech adaptation data of the acquired results. In some embodiments, module 418 may include one or more of the following: a module 422 for acquiring a speech-facilitated transaction result portion from a target device; a module 424 for acquiring a speech-facilitated transaction result portion from a specific party; and a module 432 for indicating that no transaction acquisition was fully successful (which, in some embodiments, may include, for example, a speech-facilitated transaction result portion indicating an incomplete or low-confidence interpretation of the transaction acquisition portion). In some embodiments, module 424 may include one or more of the following: a module 426 for acquiring a speech-facilitated transaction result portion from a specific device operated by a specific party (which, in some embodiments, may include, for example, a speech-facilitated transaction result portion acquired at least in part as a result of a button press on the specific device) and a module 430 for acquiring a speech-facilitated transaction feedback portion from a specific device operated by a specific party.
再次参见图4,例如图4C,模块156可包括如前所述的模块418和模块420中的一个或多个。在一些实施例中,模块418可包括下面列出的一个或多个:指示目标设备执行至少一部分利于讲话的交易失败的利于讲话的交易结果部分的获取模块436;经由目标设备从特定方获取利于讲话的交易结果部分的模块442;包括一个或多个解释词的利于讲话的交易结果部分的获取模块444;包括不完全解释的一个或多个词的利于讲话的交易结果部分的获取模块446;以及包括特定方讲出利于讲话的交易的一个或多个词的表示的利于讲话的交易结果部分的获取模块448。在一些实施例中,模块436可包括下面列出的一个或多个:指示目标设备确定的误差率高于一阈值的利于讲话的交易结果部分的获取模块438;以及指示目标设备确定的置信率低于一阈值的利于讲话的交易结果部分的获取模块440。Referring again to FIG. 4 , for example, FIG. 4C , module 156 may include one or more of modules 418 and 420 as described above. In some embodiments, module 418 may include one or more of the following: a module 436 for obtaining a speech-facilitated transaction result portion indicating that the target device failed to perform at least a portion of the speech-facilitated transaction; a module 442 for obtaining a speech-facilitated transaction result portion from a particular party via the target device; a module 444 for obtaining a speech-facilitated transaction result portion including one or more interpreted words; a module 446 for obtaining a speech-facilitated transaction result portion including one or more incompletely interpreted words; and a module 448 for obtaining a speech-facilitated transaction result portion including a representation of the particular party speaking one or more words of the speech-facilitated transaction. In some embodiments, module 436 may include one or more of the following: a module 438 for obtaining a speech-facilitated transaction result portion indicating that an error rate determined by the target device is above a threshold; and a module 440 for obtaining a speech-facilitated transaction result portion indicating that a confidence rate determined by the target device is below a threshold.
再次参见图4,例如图4D,模块156可包括如前所述的模块418和模块420中的一个或多个。在一些实施例中,模块418可包括下面列出的一个或多个:包括解释利于讲话的交易的一个或多个词的目标设备的表示的利于讲话的交易结果部分的获取模块450以及包括被配置成存储利于讲话的交易的一个或多个解释词以及一个或多个解释词的发音的一个或多个相应表示的数据结构的利于讲话的交易结果部分的获取模块452。在一些实施例中,模块156可包括下面列出的一个或多个:利于讲话的交易部分至少一部分结果的接收模块454;至少部分地基于所接收的结果的至少一部分的讲话适应数据修正的模块456;在完成利于讲话的交易之前发送更新的讲话适应数据的模块458;利于讲话的交易结果部分接收的模块460;部分地基于结果修正讲话适应数据的模块462;利于讲话的交易部分的至少一部分结果的获得模块464;至少部分地基于所接收的结果的至少一部分调整讲话适应数据的模块466以及指示讲话适应数据的适应的信号的发送模块468。4, for example, FIG4D, modules 156 may include one or more of modules 418 and 420 as described above. In some embodiments, module 418 may include one or more of the following: a module 450 for acquiring a speech-facilitated transaction result portion including a representation of a target device that interprets one or more words of a speech-facilitated transaction, and a module 452 for acquiring a speech-facilitated transaction result portion including a data structure configured to store one or more interpreted words of a speech-facilitated transaction and one or more corresponding representations of pronunciations of the one or more interpreted words. In some embodiments, module 156 may include one or more of the following: a module 454 for receiving at least a portion of the results of the speech-facilitating portion of the transaction; a module 456 for correcting the speech adaptation data based at least in part on at least a portion of the received results; a module 458 for sending updated speech adaptation data before completing the speech-facilitating transaction; a module 460 for receiving the results of the speech-facilitating portion of the transaction; a module 462 for correcting the speech adaptation data based in part on the results; a module 464 for obtaining at least a portion of the results of the speech-facilitating portion of the transaction; a module 466 for adjusting the speech adaptation data based at least in part on at least a portion of the received results; and a module 468 for sending a signal indicating adaptation of the speech adaptation data.
再次参见图4,例如图4E,模块156可包括下面列出的一个或多个:从进一步的设备接收可能的修正数据的模块470;将从进一步的设备接收的可能的修正数据施加至讲话适应数据的模块472;利于讲话的交易结果部分的接收模块474;将利于讲话的交易结果部分发送至设备的模块476(它在一些实施例中可包括将利于讲话的交易结果部分发送至存储讲话适应数据的设备的模块480);指示是否修正从设备接收的讲话适应数据的确定数据的模块478(它在一些实施例中可包括指示如何修正从设备接收的讲话适应数据的确定数据的模块482);利于讲话的交易结果部分接收模块484;部分地基于利于讲话的交易结果部分作出讲话适应数据修正决策的模块486;请求修正发送至设备的讲话适应数据的模块488以及从设备获取修正的适应数据的模块490。Referring again to Figure 4, for example Figure 4E, module 156 may include one or more of the following: a module 470 for receiving possible correction data from a further device; a module 472 for applying possible correction data received from a further device to the speech adaptation data; a module 474 for receiving a speech-facilitating transaction result portion; a module 476 for sending the speech-facilitating transaction result portion to the device (which in some embodiments may include a module 480 for sending the speech-facilitating transaction result portion to a device storing the speech adaptation data); a module 478 for indicating whether to correct the speech adaptation data received from the device (which in some embodiments may include a module 482 for indicating how to correct the speech adaptation data received from the device); a module 484 for receiving a speech-facilitating transaction result portion; a module 486 for making a speech adaptation data correction decision based in part on the speech-facilitating transaction result portion; a module 488 for requesting correction of the speech adaptation data sent to the device; and a module 490 for obtaining the corrected adaptation data from the device.
再次参见图4,例如图4F,模块156可包括下面列出的一个或多个:利于讲话的交易的特定部分的结果的获取模块492;至少部分地基于利于讲话的交易的特定部分的获取结果的讲话适应数据可能的修正施加的模块494;以及被配置成通过目标设备施加至利于讲话的交易的又一部分的修正讲话适应数据的发送模块496。在一些实施例中,模块494可包括至少部分地基于利于讲话的交易的特定部分的获取结果施加讲话适应数据可能的修正的算法的模块498。在一些实施例中,模块498可包括下面列出的一个或多个:至少部分地基于利于讲话的交易的特定部分的获取结果施加讲话适应数据可能的修正的算法的参数的模块401;以及至少部分地基于利于讲话的交易的特定部分的获取结果选择讲话适应数据的不同算法的模块403。Referring again to FIG. 4 , for example, FIG. 4F , module 156 may include one or more of the following: a module 492 for acquiring results for a particular portion of a speech-facilitated transaction; a module 494 for applying possible corrections to speech adaptation data based at least in part on the acquired results for the particular portion of a speech-facilitated transaction; and a module 496 for transmitting the corrected speech adaptation data to another portion of the speech-facilitated transaction via a target device. In some embodiments, module 494 may include a module 498 for applying an algorithm for possible corrections to speech adaptation data based at least in part on the acquired results for the particular portion of a speech-facilitated transaction. In some embodiments, module 498 may include one or more of the following: a module 401 for applying parameters of an algorithm for possible corrections to speech adaptation data based at least in part on the acquired results for the particular portion of a speech-facilitated transaction; and a module 403 for selecting different algorithms for speech adaptation data based at least in part on the acquired results for the particular portion of a speech-facilitated transaction.
再次参见图4,例如图4G,模块156可包括如前所述的模块492、模块494和模块496中的一个或多个。在一些实施例中,模块494可包括至少部分地基于利于讲话的交易的特定部分的获取结果施加发音词典可能的修正的模块405。在一些实施例中,模块405可包括下面列出的一个或多个:至少部分地基于利于讲话的交易的特定部分的获取结果将一个或多个词添加至发音词典的模块407;至少部分地基于利于讲话的交易的特定部分的获取结果将一个或多个词添加至包括一个或多个词的发音词典的模块409;以及至少部分地基于利于讲话的交易的特定部分的获取结果替换发音词典的模块411。Referring again to FIG. 4 , for example, FIG. 4G , module 156 may include one or more of modules 492, 494, and 496 as described above. In some embodiments, module 494 may include module 405 for applying possible revisions to the pronunciation dictionary based at least in part on the results obtained for the specific portion of the transaction that facilitates speech. In some embodiments, module 405 may include one or more of the following: module 407 for adding one or more words to the pronunciation dictionary based at least in part on the results obtained for the specific portion of the transaction that facilitates speech; module 409 for adding one or more words to the pronunciation dictionary that includes the one or more words based at least in part on the results obtained for the specific portion of the transaction that facilitates speech; and module 411 for replacing the pronunciation dictionary based at least in part on the results obtained for the specific portion of the transaction that facilitates speech.
再次参见图4,例如图4H,模块156可包括如前所述的模块492、模块494和模块496中的一个或多个。在一些实施例中,模块494可包括至少部分地基于利于讲话的交易的特定部分的获取结果施加路径选择算法可能的修正的路径的模块413。在一些实施例中,模块496可包括下面列出的一个或多个:被配置成用于解释利于讲话的交易的又一部分的修正讲话适应数据的发送模块415;被配置成就在对特定部分进行解释之后用于直接解释利于讲话的交易的又一部分的修正讲话适应数据的发送模块417;修正的讲话适应数据发送模块419以及接收修正的讲话适应数据的指示的接收模块421。Referring again to FIG. 4 , for example, FIG. 4H , module 156 may include one or more of modules 492, 494, and 496 as previously described. In some embodiments, module 494 may include module 413 for applying a path selection algorithm to a possible modified path based at least in part on the results of obtaining the specific portion of the speech-favored transaction. In some embodiments, module 496 may include one or more of the following: a module 415 configured to transmit modified speech adaptation data for interpreting another portion of the speech-favored transaction; a module 417 configured to transmit modified speech adaptation data for interpreting another portion of the speech-favored transaction directly after interpreting the specific portion; a module 419 for transmitting modified speech adaptation data; and a module 421 for receiving an indication of the modified speech adaptation data.
再次参见图4,例如图4I,模块156可包括如前所述的模块492、模块494和模块496中的一个或多个。在一些实施例中,模块496可包括下面列出的一个或多个:修正的讲话适应数据发送模块423;修正的讲话适应数据施加的指示的接收模块425;以及准备就绪以完成利于讲话的交易表示的又一部分的指示的促进模块427。在一些实施例中,模块427可包括下面列出的一个或多个:指示利于讲话的交易的又一部分准备就绪的消息的显示模块429;以及指示利于讲话的交易的又一部分准备就绪以显示在目标设备上的消息的促进模块431。Referring again to FIG. 4 , for example, FIG. 4I , module 156 may include one or more of modules 492, 494, and 496 as previously described. In some embodiments, module 496 may include one or more of the following: a module 423 for sending modified speech adaptation data; a module 425 for receiving an indication that the modified speech adaptation data applies; and a module 427 for facilitating an indication that a further portion of the speech-facilitated transaction is ready for completion. In some embodiments, module 427 may include one or more of the following: a module 429 for displaying a message indicating that a further portion of the speech-facilitated transaction is ready; and a module 431 for facilitating a message on a target device indicating that a further portion of the speech-facilitated transaction is ready for completion.
下面是描述实现方式的一系列流程图。为了便于理解,组织这些流程图以使最初的流程图表示经由示例性实现方式的实现方式,并且之后的流程图表示替代的实现方式和/或最初流程图的扩展,或者作为子组成操作或者作为构筑在一个或多个之前给出的流程图上的附加组成操作。本领域内技术人员将理解,本文中利用的表现形式(例如一开始给出一示例性实现的流程图表示并随后将附加内容和/或进一步的细节提供给后面的流程图)一般允许对各过程实现的快速和容易的理解。另外,本领域内技术人员将进一步理解,本文中使用的表示方式使其本身能够很好地借用到面向模块和/或对象的程序设计范例中。Below is a series of flow charts describing the implementation. For ease of understanding, these flow charts are organized so that the initial flow chart represents the implementation via an exemplary implementation, and subsequent flow charts represent alternative implementations and/or extensions of the initial flow chart, either as sub-component operations or as additional component operations built on one or more previously given flow charts. It will be understood by those skilled in the art that the representations utilized herein (e.g., initially providing a flow chart representation of an exemplary implementation and subsequently providing additional content and/or further details to subsequent flow charts) generally allow for quick and easy understanding of the implementations of each process. Additionally, it will be further understood by those skilled in the art that the representations used herein lend themselves well to module-oriented and/or object-oriented programming paradigms.
此外,在图2-4和之后的附图中,各种操作可以方框中带方框的方式描绘。这种描绘可指示内方框中的操作可包括一个或多个外方框中示出的操作步骤中的可选示例性实施例。然而应当理解,内方框操作可被视为与任何关联的外方框分离的独立操作并可相对于所有其它示出的操作以任何顺序执行,或可同时地执行。再有,图5-8中示出的这些操作以及本文描述的其它操作可通过机器、制品或物质组合中的至少一者来执行。In addition, in Figures 2-4 and subsequent figures, various operations may be depicted in a box-within-a-box format. This depiction may indicate that the operations within the inner box may include optional exemplary embodiments of the operational steps shown in one or more outer boxes. However, it should be understood that the inner box operations may be considered independent operations separated from any associated outer boxes and may be performed in any order relative to all other illustrated operations, or may be performed simultaneously. Furthermore, these operations shown in Figures 5-8 and other operations described herein may be performed by at least one of a machine, an article, or a combination of materials.
现在参见图5,图5示出操作500,操作500可包括描述管理适应数据的操作502,其中该适应数据关联于特定方的讲话的至少一个方面。例如,图1(例如图1B)示出关联于特定方的至少一个方面的讲话适应数据的调节模块152,其管理(例如存储、跟踪、监视、授权、改变许可、提供访问、分配存储、检索、接收、处理、变更、比较或以其它方式执行对适应数据的一个或多个操作),其中适应数据(例如词组补足算法,其用于帮助基于背景解释所说的词)被关联至特定方的讲话的至少一个方面(例如用户之前通过不同的自动出纳机设备终端执行利于讲话的交易,并且由用户说出的多组词帮助形成语境规则,该语境规则由至少一部分词组补足算法使用)。Referring now to FIG5 , FIG5 illustrates operations 500, which may include operations 502 describing managing adaptation data associated with at least one aspect of speech by a particular party. For example, FIG1 (e.g., FIG1B ) illustrates a conditioning module 152 managing (e.g., storing, tracking, monitoring, authorizing, changing permissions, providing access to, allocating storage for, retrieving, receiving, processing, altering, comparing, or otherwise performing one or more operations on) adaptation data associated with at least one aspect of speech by a particular party, wherein adaptation data (e.g., a phrase completion algorithm used to help interpret spoken words based on context) is associated with at least one aspect of speech by a particular party (e.g., a user previously performing speech-facilitated transactions through different automated teller machine (ATM) terminals, and multiple groups of words spoken by the user help form contextual rules used by at least a portion of the phrase completion algorithm).
再次参见图5,操作500可包括操作504,该操作504描绘了响应与特定方的利于讲话的交易关联的指示促进将适应数据发送至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图1(例如图1B)示出响应包括目标设备的利于讲话的交易的指示发送讲话适应数据的促进模块154,该促进模块154响应特定方(例如自动出纳机设备的用户)的利于讲话的交易(例如对交易的至少一部分通过使用讲话命令命令自动出纳机从自动出纳机设备终端撤回两百美元)的指示(例如用户或者站在自动出纳机设备终端前面、或者插入她的卡、或者按下按钮、或者说出被显示在自动出纳机终端的屏幕上的开始词组)而促进适应数据(例如帮助基于背景解释所说的词的词组补足算法)传输(例如发送、或采用有助于传输的一个或多个步骤,不管起点或终点为何)至目标设备,其中适应数据关联至特定方的讲话的至少一个方面(例如用户之前通过不同的自动出纳机设备终端执行利于讲话的交易,并且由用户说出的多组词帮助形成语境规则,该语境规则由词组补足算法的至少一部分使用)。Referring again to FIG. 5 , operations 500 may include operation 504 depicting facilitating sending adaptation data to a target device in response to an indication associated with the speech-facilitated transaction for the particular party, wherein the adaptation data is associated with at least one aspect of speech by the particular party. For example, Figure 1 (e.g., Figure 1B) shows a facilitation module 154 that transmits speech adaptation data in response to an indication of a speech-enabled transaction involving a target device, the facilitation module 154 facilitating the transmission (e.g., sending, or taking one or more steps that facilitate transmission, regardless of the starting or ending point) of adaptation data (e.g., a phrase completion algorithm that assists in interpreting spoken words based on context) to a target device in response to an indication of a speech-enabled transaction (e.g., at least a portion of the transaction being instructing an ATM to withdraw two hundred dollars from an ATM device terminal using a speech command) by a particular party (e.g., a user of an ATM device), such as the user either standing in front of the ATM device terminal, or inserting her card, or pressing a button, or speaking a starting phrase displayed on a screen of the ATM terminal, wherein the adaptation data is associated with at least one aspect of the speech of the particular party (e.g., the user previously performed speech-enabled transactions through different ATM device terminals, and multiple groups of words spoken by the user helped form contextual rules that were used by at least a portion of the phrase completion algorithm).
再次参见图5,操作500可包括操作506,该操作506描绘了确定是否更新适应数据,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。例如,图1(例如图1B)示出部分地基于利于讲话的交易的一部分的结果的讲话适应数据可能的修正确定的模块156,该模块156确定是否更新(例如改变、修正、改变其时戳、改变其参数、删除、增补、更改、发送至不同的位置、改变其地址、再生或其它操作)适应数据(例如用来帮助基于背景解释所说的词的词组补足算法),所述确定至少部分地基于利于讲话的交易的至少一部分(例如对交易的至少一部分通过使用讲话命令命令自动出纳机从自动出纳机设备终端撤回两百美元)的结果(例如以数值或其它方式表示、通过机器推理提供、明确用户输入或其它来源,具有或不具有人为干预)。Referring again to Figure 5, operation 500 may include operation 506, which depicts determining whether to update adaptation data, the determination being based at least in part on the results of at least a portion of the speech-enabled transaction. For example, Figure 1 (e.g., Figure 1B) illustrates module 156 for determining a possible revision of speech adaptation data based in part on the results of a portion of the speech-enabled transaction, the module 156 determining whether to update (e.g., change, modify, change its timestamp, change its parameters, delete, append, alter, send to a different location, change its address, regenerate, or otherwise manipulate) adaptation data (e.g., a phrase completion algorithm used to assist in interpreting spoken words based on context), the determination being based at least in part on the results (e.g., represented numerically or otherwise, provided by machine reasoning, explicit user input, or other source, with or without human intervention) of at least a portion of the speech-enabled transaction (e.g., instructing an ATM to withdraw two hundred dollars from an ATM device terminal using a speech command for at least a portion of the transaction).
图6A-6B绘出根据实施例的操作502的各种实现。现在参见图6A,操作502可包括描绘管理适应数据的操作602,其中适应数据关联于由特定方作出的至少一次之前的利于讲话的交易。例如,图2(例如图2A)示出与特定方和设备之间的至少一次在先交互关联的讲话适应数据的调节模块202,该模块202管理适应数据(例如不合语法的话语删除算法),其中适应数据关联于由特定方作出的至少一次之前的利于讲话的交易(例如之前交易中用户的不合语法的话语被记录和分析并被存储在适应数据中,以使它们能在未来的利于讲话的交易中更有效地被去除)。Figures 6A-6B depict various implementations of operation 502 according to embodiments. Referring now to Figure 6A, operation 502 may include an operation 602 depicting managing adaptation data, wherein the adaptation data is associated with at least one prior speech-facilitated transaction made by a particular party. For example, Figure 2 (e.g., Figure 2A) illustrates a speech adaptation data conditioning module 202 associated with at least one prior interaction between a particular party and a device, the module 202 managing adaptation data (e.g., an ungrammatical utterance removal algorithm), wherein the adaptation data is associated with at least one prior speech-facilitated transaction made by the particular party (e.g., ungrammatical utterances of a user in a prior transaction were recorded and analyzed and stored in the adaptation data so that they can be more effectively removed in future speech-facilitated transactions).
再次参见图6A,操作502可包括操作604,该操作604描绘了获得对适应数据的位置的基准。例如,图2(例如图2A)示出其中存储讲话适应数据的地址的获得模块204,该获得模块204获得(例如接收、获取、产生、检索、创建或以其它方式得到)适应数据(例如部分图案树模型)存储(适应数据)的计算机位置(例如互联网协议“IP”)地址的基准。6A , operation 502 may include operation 604, which depicts obtaining a reference to the location of the adaptation data. For example, FIG. 2 (e.g., FIG. 2A ) shows an obtaining module 204 where the address of the speech adaptation data is stored, the obtaining module 204 obtaining (e.g., receiving, acquiring, generating, retrieving, creating, or otherwise obtaining) a reference to the computer location (e.g., Internet Protocol “IP”) address where the adaptation data (e.g., a portion of the pattern tree model) is stored (adaptation data).
再次参见图6A,操作502可包括操作606,该操作606描绘了测试适应数据在由存储的基准指定的位置的可得性。例如,图2(例如图2A)示出确定在特定时间讲话适应数据在所获得的地址是否可得的模块206,该模块206测试在由所存储的基准(IP地址)指定的位置(例如在由IP地址指定的设备处)适应数据的可得性(例如确定网络连接是否可用以检索适应数据,或检查适应数据的文件完整性)。6A , operation 502 may include operation 606, which depicts testing the availability of adaptation data at the location specified by the stored reference. For example, FIG. 2 (e.g., FIG. 2A ) illustrates module 206 for determining whether speech adaptation data is available at the obtained address at a particular time, which tests the availability of adaptation data at the location specified by the stored reference (IP address) (e.g., at the device specified by the IP address) (e.g., determining whether a network connection is available to retrieve the adaptation data, or checking the file integrity of the adaptation data).
再次参见图6A,操作502可包括操作608,该操作608绘出了获得适应数据,其中适应数据至少部分地基于特定方的至少一个讲话交互。例如,图2(例如图2A)示出关联于特定方的至少一个方面的讲话适应数据的获得模块208,该获得模块208获得适应数据(例如地区性方言应用算法),其中适应数据至少部分地基于特定方的至少一次讲话交互(例如用户在家用计算机上运行程序,该程序记录用户的讲话并对系统进行训练,系统通过训练程序学习用户的地区性方言,并产生或修正地区性方言应用算法)。Referring again to FIG6A , operation 502 may include operation 608, which depicts obtaining adaptation data, wherein the adaptation data is based at least in part on at least one speech interaction of the particular party. For example, FIG2 (e.g., FIG2A ) illustrates a module 208 for obtaining speech adaptation data associated with at least one aspect of the particular party, wherein the module 208 obtains adaptation data (e.g., a regional dialect application algorithm), wherein the adaptation data is based at least in part on at least one speech interaction of the particular party (e.g., a user running a program on a home computer that records the user's speech and trains the system, the system learning the user's regional dialect through the training program and generating or revising the regional dialect application algorithm).
再次参见图6A,操作502可包括操作610,该操作610描绘了控制对适应数据的访问,其中适应数据至少部分地基于特定方的至少一个讲话交互。例如,图2(例如图2A)示出与特定方访问的至少一个方面关联的讲话适应数据的监视模块210,该监视模块210控制对适应数据(例如随机状态转变网络)进行访问(例如维持许可列表、或者确定允许读、修正、更新和/或删除的一个或多个用户/程序/设备/等等),其中适应数据至少部分地基于特定方的至少一个讲话交互(例如用户向讲话使能的蓝光播放机给予讲话命令)。6A , operation 502 may include operation 610, which depicts controlling access to adaptation data, wherein the adaptation data is based at least in part on at least one speech interaction of a particular party. For example, FIG. 2 (e.g., FIG. 2A ) illustrates a monitoring module 210 for speech adaptation data associated with at least one aspect of access by a particular party, the monitoring module 210 controlling access (e.g., maintaining a permission list, or determining one or more users/programs/devices/etc. that are allowed to read, modify, update, and/or delete) to the adaptation data (e.g., a random state transition network), wherein the adaptation data is based at least in part on at least one speech interaction of a particular party (e.g., a user giving a speech command to a speech-enabled Blu-ray player).
再次参见图6A,操作502可包括操作612,该操作612描述了管理适应数据,该适应数据包括将由特定方作出的一个或多个音素的发音联系至一个或多个概念的数据。例如,图2(例如图2A)示出将一个或多个音节的发音联系至一个或多个概念的讲话适应数据的调节模块212,该调节模块212管理适应数据,该适应数据包括将一个或多个音素(例如由特定方作出的包括词“二十(twenty)”的音素)的发音联系至一个或多个概念(例如具有某些“二十”的概念,例如撤回二十美元,如果与之通信的设备是自动出纳机设备)的数据。6A , operation 502 may include operation 612, which describes managing adaptation data, the adaptation data including data linking the pronunciation of one or more phonemes made by a particular party to one or more concepts. For example, FIG2 (e.g., FIG2A ) illustrates a conditioning module 212 for speech adaptation data linking the pronunciation of one or more syllables to one or more concepts, the conditioning module 212 managing adaptation data including data linking the pronunciation of one or more phonemes (e.g., phonemes comprising the word “twenty” made by a particular party) to one or more concepts (e.g., a concept having some “twenty,” such as withdrawing twenty dollars if the device communicating therewith is an automated teller machine device).
再次参见图6A,操作502可包括操作614,该操作614描绘了获得适应数据位置的基准。例如,图2(例如图2A)示出适应数据地址的生成模块214,该生成模块214获得(例如从存储器中检索)适应数据(例如词组补足算法)的位置(例如云服务(比如Amazon的云服务)中的位置)的基准。6A , operation 502 may include operation 614, which depicts obtaining a reference for the location of the adaptation data. For example, FIG. 2 (e.g., FIG. 2A ) illustrates the adaptation data address generation module 214, which obtains (e.g., retrieves from memory) a reference for the location of the adaptation data (e.g., a phrase completion algorithm) (e.g., a location in a cloud service (e.g., Amazon's cloud service)).
再次参见图6A,操作502可包括操作616,该操作616描绘了以特定间隔提供适应数据的位置。例如,图2(例如图2A)示出适应数据地址的提供模块216,该提供模块216以特定间隔(例如1分钟一次)提供(例如一般广播或发送至特定的一个或多个设备)适应数据(例如词组补足算法)的位置。6A , operation 502 may include operation 616, which depicts providing the location of the adaptation data at specific intervals. For example, FIG. 2 (e.g., FIG. 2A ) illustrates the providing module 216 of the adaptation data address, which provides (e.g., generally broadcasts or sends to specific one or more devices) the location of the adaptation data (e.g., a phrase completion algorithm) at specific intervals (e.g., once per minute).
现在参见图6B,操作502可包括操作618,该操作618描述了获得适应数据的位置的基准。例如,图2(例如图2B)示出适应数据地址的生成模块218,该生成模块218获得(例如从远程源,例如远程服务器接收)适应数据(例如基本发音调节算法)的位置(例如位置web地址或联网驱动器的位置)的基准。6B , operation 502 may include operation 618, which describes obtaining a reference to a location of adaptation data. For example, FIG. 2 (e.g., FIG. 2B ) illustrates an adaptation data address generation module 218 that obtains (e.g., receives from a remote source, such as a remote server) a reference to a location (e.g., a web address or a networked drive) of adaptation data (e.g., a basic pronunciation adjustment algorithm).
再次参见图6B,操作502可包括操作620,该操作620描绘了根据请求提供适应数据的位置。例如,图2(例如图2B)示出根据请求提供适应数据的地址的模块220,该模块220根据请求(例如一旦从设备的用户或从接收自另一设备(例如目标设备,比如讲话使能的数字视频盘(DVD)播放机)的信号接收请求)提供适应数据(例如基本发音调节算法)的位置(例如联网驱动器上的目录和文件名)。6B , operation 502 may include operation 620, which depicts providing the location of adaptation data upon request. For example, FIG2 (e.g., FIG2B ) illustrates module 220 for providing the address of adaptation data upon request, which module 220 provides the location (e.g., a directory and file name on a network drive) of adaptation data (e.g., a base pronunciation adjustment algorithm) upon request (e.g., upon receiving a request from a user of the device or from a signal received from another device (e.g., a target device such as a speech-enabled digital video disc (DVD) player).
再次参见图6B,操作502可包括操作622,该操作622描绘管理口语的排序列表,其中口语的排序列表对应于由特定方领会的语言并通过偏好分级。例如,图2(例如图2B)示出由特定方讲出的语言列表的调节模块222,该调节模块222管理口语的排序列表(例如“英语、法语、西班牙语、意大利语、德语”),其中口语的排序列表对应于由特定方领会的语言(例如用户可能将英语作为第一语言、法语流利、懂西班牙语、并能够认识少量的意大利语和德语词汇)并通过偏好分级(例如用户优选说英语、然后是法语、然后是其它语言)。6B , operation 502 may include operation 622, which depicts managing a ranked list of spoken languages, wherein the ranked list of spoken languages corresponds to languages understood by the particular party and is ranked by preference. For example, FIG2 (e.g., FIG2B ) illustrates a conditioning module 222 for a list of languages spoken by a particular party, the conditioning module 222 managing a ranked list of spoken languages (e.g., “English, French, Spanish, Italian, German”), wherein the ranked list of spoken languages corresponds to languages understood by the particular party (e.g., a user may speak English as a first language, be fluent in French, understand Spanish, and be able to recognize a small vocabulary in Italian and German) and is ranked by preference (e.g., a user prefers to speak English, then French, then other languages).
再次参见图6B,操作502可包括操作624,该操作624描述了管理口语的排序列表,其中口语的排序列表对应于由特定方领会的语言并通过偏好分级,并且口语的排序列表被配置成由一设备接收,该设备被配置成部分地基于口语的排序列表并部分地基于由设备领会的口语的设备列表来生成优选语言。例如,图2(例如图2B)示出由特定方说出并配置成利于交易通信语言的选择的语言列表的调节模块224,该调节模块224管理口语的排序列表(例如英语、西班牙语和法语),其中口语的排序列表对应于由特定方领会并通过偏好分级的语言,并且口语的排序列表被配置成通过设备(例如西班牙机场中的自动售票设备)接收,其被配置成部分地基于口语的排序列表(例如英语、西班牙语和法语)并部分地基于由设备领会的口语的设备列表(例如西班牙语、巴斯克语、意大利语和法语)生成优选语言(例如西班牙语),因此选择西班牙语作为两个列表的共选项并且比法语具有更高的“等级”。Referring again to FIG. 6B , operation 502 may include operation 624 depicting managing a ranked list of spoken languages, wherein the ranked list of spoken languages corresponds to languages understood by a particular party and is ranked by preference, and the ranked list of spoken languages is configured to be received by a device configured to generate a preferred language based in part on the ranked list of spoken languages and in part on a device list of spoken languages understood by the device. For example, Figure 2 (e.g., Figure 2B) shows an adjustment module 224 for a list of languages spoken by a particular party and configured to facilitate the selection of a language for transaction communications, which adjustment module 224 manages a ranked list of spoken languages (e.g., English, Spanish, and French), where the ranked list of spoken languages corresponds to languages understood by the particular party and ranked by preferences, and the ranked list of spoken languages is configured to be received by a device (e.g., an automatic ticket vending machine in a Spanish airport), which is configured to generate a preferred language (e.g., Spanish) based in part on the ranked list of spoken languages (e.g., English, Spanish, and French) and in part on a device list of spoken languages understood by the device (e.g., Spanish, Basque, Italian, and French), thereby selecting Spanish as a common option for both lists and with a higher "rank" than French.
图7A-7B示出根据实施例的操作504的各种实现。现在参见图7A,操作504可包括操作702,该操作702描绘了响应于与特定方的利于讲话的交易关联的指示将发音词典发送至目标设备,其中适应数据关联于特定方讲话的至少一个方面。例如,图3(例如图3A)示出响应包括目标设备的利于讲话的交易的指示包括一个或多个词的列表和相应的发音传输的讲话适应数据的促进模块302,该促进模块302响应与特定方的利于讲话的交易关联的指示(例如用户按下方向盘上的一个按钮,其指示用户想要给予讲话命令)将发音词典发送至目标设备(例如机动交通工具控制系统),其中适应数据(例如话语忽略算法)关联于特定方的讲话的至少一个方面(例如关联于由用户讲出的词的在前发音)。Figures 7A-7B illustrate various implementations of operation 504 according to embodiments. Referring now to Figure 7A , operation 504 may include operation 702, which depicts transmitting a pronunciation dictionary to a target device in response to an indication associated with a speech-facilitated transaction with a particular party, wherein adaptation data is associated with at least one aspect of speech of the particular party. For example, Figure 3 (e.g., Figure 3A ) illustrates a facilitation module 302 transmitting speech adaptation data including a list of one or more words and corresponding pronunciations in response to an indication of a speech-facilitated transaction with a particular party (e.g., a user pressing a button on a steering wheel indicating that the user wishes to give a speech command), transmitting a pronunciation dictionary to a target device (e.g., a motor vehicle control system) in response to an indication associated with a speech-facilitated transaction with a particular party (e.g., a user pressing a button on a steering wheel indicating that the user wishes to give a speech command), wherein adaptation data (e.g., an utterance ignoring algorithm) is associated with at least one aspect of speech of the particular party (e.g., associated with a previous pronunciation of a word spoken by the user).
再次参见图7A,操作504可包括操作704,该操作704描绘了响应关联于特定方的利于讲话的交易的指示发送一组用反映特定方用词的修正的词频率表替换一词频率表的指令至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图3(例如图3A)示出包括响应包括目标设备的利于讲话的指示用于修正词频率表的指令的讲话适应数据的传输促进模块304,该促进模块304响应关联于特定方(例如讲话使能的个人媒体播放机的用户)的利于讲话的交易(例如用户开始说话)的指示发送一组指令至目标设备(例如至讲话使能的个人媒体播放机)以用反映特定方的用词(例如特定乐队名,例如“U2”基于用户选择的歌在词表中较高)的修正的词频率表替换一词频率表,其中适应数据(例如修正的词频率表)被关联至特定方的讲话的至少一个方面(例如用户说一个或多个词的次数或者频率)。Referring again to FIG7A , operation 504 may include operation 704 depicting sending a set of instructions to a target device for replacing a word frequency table with a revised word frequency table reflecting the specific party's usage of words in response to an indication of a speech-facilitated transaction associated with a particular party, wherein the adaptation data is associated with at least one aspect of the specific party's speech. For example, FIG3 (e.g., FIG3A ) illustrates a speech adaptation data transmission facilitation module 304 including instructions for revising the word frequency table in response to an indication of a speech-facilitated transaction associated with a particular party (e.g., a user of a speech-enabled personal media player), sending a set of instructions to a target device (e.g., a speech-enabled personal media player) for replacing a word frequency table with a revised word frequency table reflecting the specific party's usage of words (e.g., a particular band name, such as "U2" being higher in the word frequency table based on a song selected by the user), wherein the adaptation data (e.g., the revised word frequency table) is associated with at least one aspect of the specific party's speech (e.g., the number of times or frequency at which the user speaks one or more words).
再次参见图7A,操作504可包括操作706,操作706描绘了响应与特定方的利于讲话的交易关联的指示而将词组补足算法发送至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图3(例如图3A)示出响应包括目标设备的利于讲话的交易的指示而传输包括词组补足算法的讲话适应数据的促进模块306,该促进模块306响应与特定方(例如排队等候使用讲话使能的自动出纳机设备的用户)的利于讲话的交易(例如讲话使能的自动出纳机设备检测到一个或多个特定词正由用户说出)关联的指示而将词语补足算法发送至目标设备(例如讲话使能的自动出纳机设备),其中适应数据关联于特定方的讲话的至少一个方面(例如用户对词“五十(fifty)”发音的特殊方式)。Referring again to Figure 7A, operation 504 may include operation 706, which depicts transmitting a phrase completion algorithm to a target device in response to an indication associated with a speech-enabled transaction with a particular party, wherein adaptation data is associated with at least one aspect of the speech of the particular party. For example, Figure 3 (e.g., Figure 3A) illustrates facilitating module 306 transmitting speech adaptation data including a phrase completion algorithm in response to an indication of a speech-enabled transaction with a target device. The facilitating module 306 transmits a phrase completion algorithm to a target device (e.g., a speech-enabled ATM device) in response to an indication associated with a speech-enabled transaction with a particular party (e.g., a user waiting in line to use the speech-enabled ATM device) (e.g., the speech-enabled ATM device detects one or more particular words being spoken by the user), wherein the adaptation data is associated with at least one aspect of the speech of the particular party (e.g., the particular way the user pronounces the word "fifty").
再次参见图7A,操作504可包括操作708,该操作708描绘了响应特定方意图执行利于讲话的交易的指示而利于将适应数据传输至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图3(例如图3A)示出响应意图执行利于讲话的交易的指示的讲话适应数据传输的促进模块308,该促进模块308响应特定方(例如车内的用户)意图执行利于讲话的交易(例如用户将她的车开到一特定位置,该位置被系统检测为足够靠近自动通行窗口以下单)而促进将适应数据(例如噪声电平依存的滤除算法)传输至目标设备(例如自动通行窗口),其中适应数据关联至特定方的讲话的至少一个方面(例如关联至用户在其它自动通行窗口的之前讲话)。7A , operation 504 may include operation 708, which depicts facilitating transmission of adaptation data to a target device in response to an indication of a particular party's intent to perform a speech-facilitated transaction, wherein the adaptation data is associated with at least one aspect of the particular party's speech. For example, FIG3 (e.g., FIG3A ) illustrates a module 308 for facilitating transmission of speech-adapted data in response to an indication of an intent to perform a speech-facilitated transaction, wherein the module 308 facilitates transmission of adaptation data (e.g., a noise level-dependent filtering algorithm) to a target device (e.g., an auto-pass window) in response to a particular party (e.g., a user in a vehicle) intending to perform a speech-facilitated transaction (e.g., the user drives her vehicle to a particular location that is detected by the system as being close enough to the auto-pass window to place an order), wherein the adaptation data is associated with at least one aspect of the particular party's speech (e.g., associated with the user's previous speech at other auto-pass windows).
再次参见图7A,操作708可包括操作710,该操作710描述了响应从目标设备接收到的特定方意图执行利于讲话的交易的指示而促进将适应数据传输至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图3(例如图3A)示出响应从目标设备接收到意图执行利于讲话的交易的指示的讲话适应数据传输的促进模块310,该促进模块310响应从目标设备(例如家用安全系统的基站)接收到特定方意图执行利于讲话的交易(例如特定方开始说话(例如可在后来确定或通过另一单元确定用户是否实际上正在对目标设备说话,但在诸如这一个的实施例中,目标设备将讲话解释成特定方意图发起利于讲话的交易))的指示而利于将适应数据(例如基于感情的发音调节算法)传输至目标设备(例如家用安全系统),其中适应数据关联至特定方的讲话的至少一个方面(例如在一时间周期上检测到的用户的讲话模式)。Referring again to FIG7A , operation 708 may include operation 710 depicting facilitating transmission of adaptation data to a target device in response to receiving an indication from the target device that a particular party intends to perform a speech-facilitated transaction, wherein the adaptation data is associated with at least one aspect of the speech of the particular party. For example, FIG3 (e.g., FIG3A ) illustrates a module 310 for facilitating transmission of speech-adapted data in response to receiving an indication from a target device that an intent to perform a speech-facilitated transaction, wherein the facilitating module 310 facilitates transmission of adaptation data (e.g., a sentiment-based voice adjustment algorithm) to a target device (e.g., a home security system) in response to receiving an indication from a target device (e.g., a base station of a home security system) that a particular party intends to perform a speech-facilitated transaction (e.g., the particular party begins speaking (e.g., it may be determined later or by another unit whether the user is actually speaking to the target device, but in embodiments such as this one, the target device interprets the speech as an intent by the particular party to initiate a speech-facilitated transaction)), wherein the adaptation data is associated with at least one aspect of the speech of the particular party (e.g., a user's speech patterns detected over a period of time).
再次参见图7A,操作710可包括操作712,操作712描述响应从目标设备接收到特定方按下目标设备上的一个按钮的指示而促进适应数据至目标设备的传输。例如,图3(例如图3A)示出响应从目标设备接收到特定方与目标设备的一输入部分交互的指示的讲话适应数据传输的促进模块312,该促进模块312利于响应从目标设备接收到特定方按下目标设备上的按钮(例如自动出纳机设备的显示器上表示“开始”的触摸屏按钮)的指示而将适应数据(例如音节发音数据库)传输至目标设备(例如自动出纳机设备)。7A , operation 710 may include operation 712, which depicts facilitating transmission of adaptation data to a target device in response to receiving an indication from the target device that a particular party pressed a button on the target device. For example, FIG3 (e.g., FIG3A ) illustrates facilitating module 312 for facilitating transmission of speech-adapted data in response to receiving an indication from the target device that a particular party interacted with an input portion of the target device. The facilitating module 312 facilitates transmitting adaptation data (e.g., a syllable pronunciation database) to a target device (e.g., an ATM device) in response to receiving an indication from the target device that a particular party pressed a button on the target device (e.g., a touchscreen button representing “start” on a display of an ATM device).
再次参见图7A,操作710可包括操作714,该操作714描绘响应特定方正在向目标设备说话的指示而促进将适应数据传输至目标设备。例如,图3(例如图3A)示出响应特定方正在朝向目标设备讲话的接收指示的讲话适应数据传输的促进模块314,该促进模块314响应特征方正在向目标设备讲话的指示(例如响应接收到指示膝上计算机的麦克风已录制被标识为讲话的声音的信号)促进向目标设备(例如讲话使能的膝上计算机)传输适应数据(例如基于口音的发音修正算法)。7A , operation 710 may include operation 714, which depicts facilitating transmission of adaptation data to a target device in response to an indication that a particular party is speaking to the target device. For example, FIG3 (e.g., FIG3A ) illustrates a facilitating module 314 for facilitating transmission of speech-adapted data in response to receiving an indication that a particular party is speaking toward the target device. The facilitating module 314 facilitates transmission of adaptation data (e.g., an accent-based pronunciation modification algorithm) to a target device (e.g., a speech-enabled laptop) in response to an indication that a particular party is speaking to the target device (e.g., in response to receiving a signal indicating that a microphone of the laptop has recorded sounds identified as speech).
现在参见图7B,操作710可包括操作716,该操作716表述响应从目标设备对与特定方的讲话的至少一个方面关联的适应数据的请求而促进将适应数据传输至目标设备。例如,图3(例如图3A)示出响应对讲话适应数据的请求的讲话适应数据传输的促进模块316,该促进模块316响应从目标设备对关联于特定方的讲话的至少一个方面的适应数据的请求(例如从给予由用户拥有的之前电视机的讲话命令衍生出的适应数据)促进适应数据(例如句子图表路径选择算法)传输至目标设备(例如讲话使能的电视机)。Referring now to Figure 7B, operation 710 may include operation 716 depicting facilitating transmission of adaptation data to a target device in response to a request from the target device for adaptation data associated with at least one aspect of speech by a particular party. For example, Figure 3 (e.g., Figure 3A) illustrates a facilitating module 316 for facilitating transmission of speech adaptation data in response to a request for speech adaptation data, facilitating transmission of adaptation data (e.g., a sentence graph path selection algorithm) to a target device (e.g., a speech-enabled television) in response to a request from the target device for adaptation data associated with at least one aspect of speech by a particular party (e.g., adaptation data derived from a previous speech command given to a television owned by a user).
再次参见图7B,操作708可包括操作718,该操作718描绘响应从与特定方关联的特定设备接收的特定方意图执行利于讲话的交易的指示而促进将适应数据传输至目标设备,其中适应数据关联于特定方的讲话的至少一个方面。例如,图3(例如图3B)示出响应从特定设备接收到意图执行利于讲话的交易的指示的讲话适应数据传输的促进模块318,该促进模块318响应从与特定方(例如特定方携带的)关联的特定设备(例如具有配置成发送、接收和存储适应数据的芯片和存储器的钥匙环)接收的特定方意图执行利于讲话的交易(例如命令微波炉解冻五分钟)的指示而促进适应数据(例如不常见的词发音指南)传输至目标设备(例如讲话使能的微波炉),其中适应数据关联于特定方的讲话的至少一个方面(例如特定方的之前讲话交互)。7B , operation 708 may include operation 718, which depicts facilitating transmission of adaptation data to a target device in response to receiving an indication from a specific device associated with the specific party that the specific party intends to perform a speech-enabled transaction, wherein the adaptation data is associated with at least one aspect of the speech of the specific party. For example, FIG3 (e.g., FIG3B ) illustrates facilitating transmission of speech-adapted data 318 in response to receiving an indication from a specific device that the specific party intends to perform a speech-enabled transaction, wherein the facilitating module 318 facilitates transmission of adaptation data (e.g., an uncommon word pronunciation guide) to a target device (e.g., a speech-enabled microwave oven) in response to receiving an indication from a specific device (e.g., a key fob having a chip and memory configured to send, receive, and store adaptation data) associated with the specific party (e.g., carried by the specific party) that the specific party intends to perform a speech-enabled transaction (e.g., commanding a microwave oven to defrost for five minutes), wherein the adaptation data is associated with at least one aspect of the speech of the specific party (e.g., a previous speech interaction of the specific party).
现在参见图7C,操作504可包括操作720,该操作720描绘将与特定方的讲话的第一方面关联的第一适应数据发送至目标设备。例如,图3(例如图3C)示出将与特定方的讲话的第一方面关联的第一讲话适应数据发送至目标设备的模块320,该模块320将关联于特定方的讲话的第一方面(例如基于特定方来自哪里的口音)的第一适应数据(例如基于口音的发音修正算法)发送至目标设备(例如讲话使能的音频/视频接收机)。7C , operation 504 may include operation 720, which depicts sending first adaptation data associated with a first aspect of the speech of the particular party to a target device. For example, FIG3 (e.g., FIG3C ) illustrates module 320 for sending first speech adaptation data associated with a first aspect of the speech of the particular party to a target device, which module 320 sends first adaptation data (e.g., an accent-based pronunciation modification algorithm) associated with a first aspect of the speech of the particular party (e.g., an accent based on where the particular party is from) to a target device (e.g., a speech-enabled audio/video receiver).
再次参见图7C,操作504可包括操作722,该操作722描绘在完成特定方的利于讲话的交易之前将关联于特定方的讲话的第二方面的第二适应数据发送至目标设备。例如,图3(例如图3C)示出将与特定方的讲话的第二方面关联的第二讲话适应数据发送至目标设备的模块322,该模块322在特定方的利于讲话的交易完成之前(例如在利于讲话的交易结束前)将与特定方的第二讲话方面(例如基于用户对一组词的发音,例如“音量”、“源”、“播放”和“快进”)关联的第二适应数据(例如不同的基于口音的发音修正算法)发送。在一个例子中,如果音频/视频接收机在基于第一适应数据(它可以是基于口音的)解释讲话时遇到麻烦,则接收机可请求额外的适应数据,该适应数据可以是基于之前发音的。Referring again to FIG7C , operation 504 may include operation 722, which depicts sending second adaptation data associated with a second aspect of the speech of the particular party to the target device before completing the speech-facilitated transaction for the particular party. For example, FIG3 (e.g., FIG3C ) shows module 322 sending second speech adaptation data associated with a second aspect of the speech of the particular party to the target device, which module 322 sends second adaptation data (e.g., a different accent-based pronunciation correction algorithm) associated with the second aspect of the speech of the particular party (e.g., based on the user's pronunciation of a set of words, such as "volume," "source," "play," and "fast forward") before the speech-facilitated transaction for the particular party is completed (e.g., before the speech-facilitated transaction ends). In one example, if an audio/video receiver encounters difficulty interpreting speech based on first adaptation data (which may be accent-based), the receiver may request additional adaptation data, which may be based on a previous pronunciation.
再次参见图7C,操作720可包括操作724,该操作724描绘将关联于特定方的口音的第一适应数据发送至目标设备。例如,图3(例如图3C)示出将关联于特定方的讲话方式的第一讲话适应数据发送至目标设备的模块324,该模块将与特定方的口音关联的第一适应数据发送至目标设备(例如机动交通工具辅助/拨号系统,例如OnStar)。7C , operation 720 may include operation 724, which depicts sending first adaptation data associated with the particular party's accent to a target device. For example, FIG. 3 (e.g., FIG. 3C ) illustrates module 324 for sending first speech adaptation data associated with the particular party's speaking style to a target device, which sends first adaptation data associated with the particular party's accent to a target device (e.g., a motor vehicle assistance/dialing system, such as OnStar).
再次参见图7C,操作720可包括操作726,该操作726描绘将关联于特定方的预期词汇的第一适应数据发送至目标设备。例如,图3(例如图3C)示出将关联于预期词汇的第一讲话适应数据发送至目标设备的模块326,该模块326将与特定方的预期词汇(例如可能用于交易中的词(例如“钱”、“结账”和“节省”)的发音指南)关联的第一适应数据发送至目标设备(例如自动出纳机设备)。7C , operation 720 may include operation 726, which depicts sending first adaptation data associated with the expected vocabulary of the particular party to the target device. For example, FIG3 (e.g., FIG3C ) illustrates module 326 sending first speech adaptation data associated with the expected vocabulary to the target device. Module 326 sends first adaptation data associated with the expected vocabulary of the particular party (e.g., pronunciation guides for words that may be used in transactions, such as “money,” “checkout,” and “save”) to the target device (e.g., an automated teller machine device).
再次参见图7C,操作726可包括操作728,该操作728描绘将包括在利于讲话的交易期间预期会使用的特定方的词汇的第一适应数据发送至目标设备。例如,图3(例如图3C)示出将与在利于讲话的交易过程中估计会使用的词汇关联的第一讲话适应数据发送至目标设备的模块328,该模块328将包括在利于讲话的交易(例如至一个地方的导航)期间预期会使用的特定方的词汇(例如街道名、城市名和准确地名的发音)的第一适应数据发送至目标设备(例如安装在汽车挡风玻璃上的个人导航系统)。7C , operation 726 may include operation 728, which depicts sending first adaptation data including a party-specific vocabulary expected to be used during a speech-facilitated transaction to a target device. For example, FIG3 (e.g., FIG3C ) illustrates module 328 sending first speech adaptation data associated with vocabulary estimated to be used during a speech-facilitated transaction to a target device. Module 328 sends first adaptation data including a party-specific vocabulary (e.g., pronunciation of street names, city names, and proper names) expected to be used during a speech-facilitated transaction (e.g., navigation to a place) to a target device (e.g., a personal navigation system mounted on a car windshield).
再次参见图7C,操作728可包括操作730,该操作730绘出将包括与预期会在利于讲话的交易期间使用的食物项目关联的一个或多个词的发音词典的第一适应数据发送至目标设备,该目标设备是自动通行订单采纳设备。例如,图3(例如图3C)示出将与关联于在利于讲话的交易过程中预期会使用的食物项目的一个或多个词关联的第一讲话适应数据发送至目标设备的模块330,该模块330将包括与在利于讲话的交易期间(例如在快餐店订购晚餐)预期会使用的食物项目(例如“汉堡包”、“腊肉”、“牛肉汉堡”)关联的一个或多个词的发音词典的第一适应数据发送至目标设备,该目标设备是自动通行订单采纳设备。7C , operation 728 may include operation 730, which depicts sending first adaptation data for a pronunciation dictionary including one or more words associated with food items expected to be used during a speech-facilitated transaction to a target device, the target device being an automated pass-through order taking device. For example, FIG3 (e.g., FIG3C ) illustrates module 330 sending first speech adaptation data associated with one or more words associated with food items expected to be used during a speech-facilitated transaction to a target device. Module 330 sends first adaptation data for a pronunciation dictionary including one or more words associated with food items expected to be used during a speech-facilitated transaction (e.g., “hamburger,” “bacon,” “beef burger”) to a target device, the target device being an automated pass-through order taking device.
再次参见图7,操作728可包括操作732,该操作732描绘将包括在利于讲话的交易期间预期会使用的一个或多个电子和媒体词的发音词典的第一适应数据发送至目标设备,该目标设备是语音使能的媒体播放机。例如,图3(例如图3C)示出将与关联于在利于讲话的交易过程中预期会使用的一个或多个电子器件的一个或多个词关联的第一讲话适应数据发送至目标设备的模块332,该模块332将包括在利于讲话的交易期间(例如与媒体播放机交互)预期会使用的一个或多个电子和媒体词(例如“播放”、“停止”、“弹出”以及“播放电影‘盗梦空间’”)的发音词典的第一适应数据发送至目标设备,该目标设备是讲话使能的媒体播放机。7 , operation 728 may include operation 732, which depicts sending first adaptation data for a pronunciation lexicon including one or more electronics and media words expected to be used during a speech-facilitated transaction to a target device, the target device being a speech-enabled media player. For example, FIG3 (e.g., FIG3C ) illustrates module 332 sending first speech adaptation data associated with one or more words associated with one or more electronics expected to be used during a speech-facilitated transaction to a target device, the module 332 sending first adaptation data for a pronunciation lexicon including one or more electronics and media words expected to be used during a speech-facilitated transaction (e.g., interacting with a media player) to a target device, the target device being a speech-enabled media player.
现在参见图7D,操作720可包括操作734,该操作734描绘发送与特定方说出的语言关联的第一适应数据。例如,图3(例如图3D)示出将与所说语言关联的第一讲话适应数据发送至目标设备的模块334,该模块334发送与特定方所说的(例如用户是法国人说话者)语言关联的第一适应数据(例如第一适应数据取用户说的法语并将其转换成设备能够理解的英语等效物)。7D , operation 720 may include operation 734, which depicts sending first adaptation data associated with the language spoken by the particular party. For example, FIG3 (e.g., FIG3D ) illustrates module 334 sending first speech adaptation data associated with the spoken language to the target device. Module 334 sends first adaptation data associated with the language spoken by the particular party (e.g., the user is a French speaker) (e.g., the first adaptation data takes the French spoken by the user and converts it into an English equivalent that the device understands).
再次参见图7D,操作720可包括操作736,该操作736描绘了通过具有与特定方至少一个共同性质的组成员基于之前利于讲话的交易发送第一适应数据。例如,图3(例如图3D)示出通过具有特定性质的组的成员将关联于之前的利于讲话的交易的第一讲话适应数据发送至目标设备的模块336,该模块336通过具有与特定方共同的至少一个性质(例如共同的后裔)的组的成员(例如爱尔兰血统)基于之前的利于会话的交易发送第一适应数据(例如音素发音数据库)。7D , operation 720 may include operation 736, which depicts sending first adaptation data based on a previous speech-facilitated transaction by a group member having at least one common property with the particular party. For example, FIG3 (e.g., FIG3D ) illustrates module 336 sending first speech adaptation data associated with a previous speech-facilitated transaction to a target device by a group member having a particular property, wherein module 336 sends first adaptation data (e.g., a phoneme pronunciation database) based on a previous speech-facilitated transaction by a group member having at least one common property (e.g., common ancestry) with the particular party.
再次参见图7D,操作736可包括操作738,该操作738描绘了通过起源于与特定方的同一地理区域的组的成员基于之前的利于讲话的交易发送第一适应数据。例如,图3(例如图3D)示出通过来自特定社区的组的成员将关联于之前利于讲话的交易的第一讲话适应数据发送至目标设备的模块338,该模块338通过起源自与特定方同一地理区域(例如“美国南部”)的组成员基于之前利于讲话的交易发送第一适应数据(例如基本发音调节算法)。7D , operation 736 may include operation 738, which depicts sending, by members of a group originating from the same geographic region as the particular party, first adaptation data based on previous speech-facilitating transactions. For example, FIG3 (e.g., FIG3D ) illustrates module 338 sending, by members of a group from a particular community, first speech adaptation data associated with previous speech-facilitating transactions to a target device, which module 338 sends, by members of a group originating from the same geographic region as the particular party (e.g., “Southern United States”), first adaptation data (e.g., a basic pronunciation adjustment algorithm) based on previous speech-facilitating transactions.
回来参见图7C,操作722可包括操作740,该操作740描绘了在完成特定方的利于讲话的交易之前响应来自目标设备的请求将与特定方的讲话的第二方面关联的第二适应数据发送至目标设备。例如,图3(例如图3C)示出响应目标设备请求发送与特定方的讲话的第二方面关联的第二讲话适应数据的模块340,该模块340在特定方的利于讲话的交易完成之前响应来自目标设备的请求(例如目标设备仅使用第一适应数据可能无法对讲话作出快速解释,因此目标设备请求第二适应数据)将关联于特定方的讲话的第二方面(例如基于用户之前的讲话交互的用户特定口音)的第二适应数据(例如地区性方言应用算法)发送至目标设备(例如自动售票设备)。Referring back to FIG7C , operation 722 may include operation 740, which depicts sending second adaptation data associated with a second aspect of the speech of the particular party to the target device in response to a request from the target device before completing the speech-facilitated transaction for the particular party. For example, FIG3 (e.g., FIG3C ) illustrates module 340 for sending second speech adaptation data associated with a second aspect of the speech of the particular party in response to a request from the target device, which module 340 sends second adaptation data (e.g., a regional dialect application algorithm) associated with a second aspect of the speech of the particular party (e.g., a user-specific accent based on the user's previous speech interactions) to a target device (e.g., a ticket vending machine) in response to a request from the target device (e.g., the target device may not be able to quickly interpret the speech using only the first adaptation data, so the target device requests the second adaptation data) before completing the speech-facilitated transaction for the particular party.
再次参见图7D,操作722可包括操作742,该操作742描绘了在特定方的利于讲话的交易完成前将与特定方的讲话的第一方面不同的讲话的第二方面关联的第二适应数据发送至目标设备。例如,图3(例如图3C)示出将与第一方面不同的第二方面关联的第二讲话适应数据发送至目标设备的模块342,该模块342在特定方的利于讲话的交易完成前将与不同于特定方的讲话的第一方面(例如特定方是其一部分的一组人群的口音)的讲话的第二方面(从来自之前讲话交互观察到的讲话模式)关联的第二适应数据(例如讲话部分标记算法)发送至目标设备(例如机动交通工具控制系统)。7D , operation 722 may include operation 742, which depicts sending second adaptation data associated with a second aspect of speech that is different from the first aspect of speech of the particular party to a target device before the transaction facilitated by the particular party is completed. For example, FIG3 (e.g., FIG3C ) illustrates module 342 sending second speech adaptation data associated with a second aspect that is different from the first aspect to a target device, which module 342 sends second adaptation data (e.g., a speech part labeling algorithm) associated with a second aspect of speech that is different from the first aspect of speech of the particular party (e.g., an accent of a group of people of which the particular party is a part) to a target device (e.g., a motor vehicle control system) before the transaction facilitated by the particular party is completed.
再次参见图7D,操作742可包括操作744,该操作744描绘了在利于讲话的交易完成之前发送第二适应数据,其中第二适应数据基于第二组人群的之前讲话交互,而第一适应数据基于第一组人群的之前讲话交互。例如,图3(图3C)示出关联于与第一讲话适应数据关联的第一组不同的与第二组之前的讲话交互关联的第二讲话适应数据发送至目标设备的模块344,该模块344在利于讲话的交易完成前发送第二适应数据(例如第二发音调节算法,其中第一适应数据是第一、不同的发音调节算法),其中第二适应数据基于第二组人群的之前的讲话交互(例如特定种族背景的人群),而第一适应数据基于第一组人群的之前的讲话交互(例如特定城市里面长大的一组人群)。7D , operation 742 may include operation 744, which depicts sending second adaptation data prior to completion of the speech-facilitated transaction, wherein the second adaptation data is based on prior speech interactions of a second group of people, and the first adaptation data is based on prior speech interactions of a first group of people. For example, FIG3 ( FIG3C ) illustrates sending to a target device second speech adaptation data associated with a first, different set of prior speech interactions associated with the first speech adaptation data, the module 344 sending the second adaptation data (e.g., a second pronunciation adjustment algorithm, wherein the first adaptation data is a first, different pronunciation adjustment algorithm) prior to completion of the speech-facilitated transaction, wherein the second adaptation data is based on prior speech interactions of a second group of people (e.g., people of a particular ethnic background), and the first adaptation data is based on prior speech interactions of a first group of people (e.g., a group of people who grew up in a particular city).
再次参见图7D,操作744可包括操作746,该操作746在利于讲话的交易完成之前发送第二适应数据,其中第二适应数据基于比第一组人群更少的第二组人群的之前的讲话交互。例如,图3(例如图3C)示出关联于比与第一讲话适应数据关联的第一组更少的第二组的之前讲话交互的第二讲话适应数据发送至目标设备的模块346,该模块346在利于讲话的交易完成前发送第二适应数据(例如被设计成修正第一发音调节算法的第二发音调节算法),其中第二适应数据基于第二组人群(例如直接关联于对其检测和分析之前讲话交互的特定方的人群)的之前讲话交互,该第二组人群少于第一组人群(例如在特定城市长大的人群)。7D , operation 744 may include operation 746 of sending second adaptation data prior to completion of the speech-facilitated transaction, wherein the second adaptation data is based on prior speech interactions of a second group of people that is smaller than the first group of people. For example, FIG3 (e.g., FIG3C ) shows second speech adaptation data associated with prior speech interactions of a second group that is smaller than the first group associated with the first speech adaptation data being sent to a module 346 of a target device, the module 346 sending second adaptation data (e.g., a second pronunciation adjustment algorithm designed to modify a first pronunciation adjustment algorithm) prior to completion of the speech-facilitated transaction, wherein the second adaptation data is based on prior speech interactions of a second group of people (e.g., a group of people directly associated with a particular party for whom the prior speech interactions were detected and analyzed) that is smaller than the first group of people (e.g., a group of people who grew up in a particular city).
图8A-8G描绘根据实施例的操作506的各种实现。现参见图8A,操作506包括操作802,其描绘了更新适应数据,所述更新至少部分地基于利于讲话的交易的至少一部分的结果。例如,图4(例如图4A)示出部分地基于利于讲话的交易的一部分的结果的讲话适应数据修正的模块402,该模块402更新(例如修正其至少一部分,即使它就像改变适应数据中保存的“最后使用日期”那样微小)适应数据(例如句子图解路径选择算法),所述更新至少部分地基于利于讲话的交易(例如将讲话命令给予自动出纳机设备)的至少一部分的结果(例如目标设备能够多快地使用句子图解路径选择算法来处理讲话)。Figures 8A-8G depict various implementations of operation 506 according to embodiments. Referring now to Figure 8A , operation 506 includes operation 802, which depicts updating adaptation data, the updating being based at least in part on the results of at least a portion of the speech-favored transaction. For example, Figure 4 (e.g., Figure 4A ) illustrates module 402 modifying speech adaptation data based in part on the results of a portion of the speech-favored transaction, which updates (e.g., modifies at least a portion thereof, even if it is as minor as changing a "last used date" stored in the adaptation data) adaptation data (e.g., a sentence graph routing algorithm), the updating being based at least in part on the results of at least a portion of the speech-favored transaction (e.g., administering a speech command to an automated teller machine device) (e.g., how quickly the target device was able to process speech using the sentence graph routing algorithm).
再次参见图8A,操作506可包括操作804,该操作804描绘了确定不更新适应数据,所述确定至少部分地基于利于讲话的交易的至少一部分的结果。例如,图4(例如图4A)示出讲话适应数据部分地基于利于讲话的交易的一部分的结果确定不修正的模块404,该模块404确定不更新适应数据(例如不常见的词发音指南),所述确定至少部分地基于利于讲话的交易(例如用户从自动饮料配售机设备订购饮料)的至少一部分的结果(例如指示用户对交易“非常乐意”的用户反馈)。8A , operation 506 may include operation 804, which depicts determining not to update adaptation data, the determination being based at least in part on the results of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4A ) illustrates module 404 for determining not to modify speech adaptation data based in part on the results of at least a portion of the speech-facilitated transaction, the module 404 determining not to update adaptation data (e.g., uncommon word pronunciation guides), the determination being based at least in part on the results of at least a portion of the speech-facilitated transaction (e.g., user feedback indicating that the user was “very happy” with the transaction).
再次参见图8A,操作506可包括操作806,该操作806描绘了确定是否更新适应数据,所述确定至少部分地基于完成的利于讲话的交易的结果。例如,图4(例如图4A)示出至少部分地基于结束的利于讲话的交易的结果确定讲话适应数据可能的修正的模块406,该模块406确定是否更新适应数据(例如非词典词去除算法),所述确定至少部分地基于完成的利于讲话的交易的结果(例如与机动交通工具控制系统交互的完成交易的结果被发送至远程服务器,并在以后的日子,分析交易以确定交易在去除非词典词方面有多成功)。Referring again to Figure 8A, operation 506 may include operation 806, which depicts determining whether to update the adaptation data, the determination being based at least in part on the results of the completed speech-facilitating transaction. For example, Figure 4 (e.g., Figure 4A) illustrates module 406 for determining possible revisions to speech adaptation data based at least in part on the results of the completed speech-facilitating transaction, the module 406 determining whether to update the adaptation data (e.g., a non-dictionary word removal algorithm), the determination being based at least in part on the results of the completed speech-facilitating transaction (e.g., the results of the completed transaction interacting with the motor vehicle control system are sent to a remote server, and at a later date, the transaction is analyzed to determine how successful the transaction was in removing non-dictionary words).
再次参见图8A,操作506可包括操作808,该操作808描绘了确定是否更新适应数据,所述确定至少部分地基于利于讲话的交易的开始部分的结果。例如,图4(例如图4A)示出部分地基于利于讲话的交易的最初部分的结果确定讲话适应数据可能的修正的模块408,该模块408确定是否更新适应数据(例如讲话不流利检测算法),所述确定至少部分地基于利于讲话的交易(例如从讲话使能的自动出纳机设备撤回一笔钱)的开始部分(例如交易的头两句话)的结果。8A , operation 506 may include operation 808, which depicts determining whether to update adaptation data, the determination being based at least in part on the results of the initial portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4A ) illustrates module 408 for determining possible revisions to speech adaptation data based in part on the results of the initial portion of the speech-facilitated transaction. The module 408 determines whether to update adaptation data (e.g., a speech disfluency detection algorithm) based at least in part on the results of the initial portion (e.g., the first two sentences of the transaction) of a speech-facilitated transaction (e.g., withdrawing money from a speech-enabled automated teller machine).
再次参见图8A,操作506可包括操作810,该操作810描绘了确定是否更新适应数据,其中利于讲话的交易包括第一句子和第二句子,并且所述确定发生在第一句子之后并在第二句子之前。例如,图4(例如图4A)示出部分地基于发生在利于讲话的交易的第二句子之前的利于讲话的交易的第一句子的讲话适应数据的可能修正的确定模块410,该模块410确定是否更新适应数据(例如基于背景的修复的话语处理矩阵),其中利于讲话的交易(例如对机动交通工具控制系统发命令)包括第一句子(例如“播放Norah Jones”)和第二句子(例如“将音量调高至25”),并且所述确定发生在第一句子之后并在第二句子之前。8A , operation 506 may include operation 810, which depicts determining whether to update adaptation data, wherein a speech-facilitating transaction includes a first sentence and a second sentence, and the determining occurs after the first sentence and before the second sentence. For example, FIG4 (e.g., FIG4A ) illustrates a determination module 410 for determining whether to update adaptation data (e.g., a context-based repaired speech processing matrix) based in part on a possible revision of speech adaptation data for a first sentence of a speech-facilitating transaction that occurs before a second sentence of the speech-facilitating transaction, wherein the speech-facilitating transaction (e.g., issuing a command to a motor vehicle control system) includes a first sentence (e.g., “Play Norah Jones”) and a second sentence (e.g., “Turn the volume up to 25”), and the determining occurs after the first sentence and before the second sentence.
再次参见图8A,操作506可包括操作812,该操作812描绘了确定如何更新适应数据,所述确定在利于讲话的交易完成之前作出。例如,图4(例如图4A)示出部分地基于利于讲话的交易的一部分的结果并在利于讲话的交易结束之前作出的讲话适应数据可能的修正的模块412,该模块412确定是否更新适应数据(例如论述者检测模块),所述确定在利于讲话的交易完成之前作出(例如在用户在自动通行窗口完成他的订单)。8A , operation 506 may include operation 812, which depicts determining how to update adaptation data, said determination being made before the speech-facilitated transaction is completed. For example, FIG4 (e.g., FIG4A ) illustrates module 412 for determining whether to update adaptation data (e.g., discussant detection module) based in part on the results of a portion of the speech-facilitated transaction and made before the speech-facilitated transaction is concluded, said module 412 determining whether to update adaptation data (e.g., discussant detection module), said determination being made before the speech-facilitated transaction is completed (e.g., after the user completes his order at the auto-pass window).
再次参见图8A,操作506可包括操作814,其描绘了确定是否更新适应数据,所述确定在利于讲话的交易中说出至少一个词之后作出。例如,图4(例如图4A)示出在由特定方说出至少一个词之后部分地基于利于讲话的交易的一部分的结果确定讲话适应数据可能的修正的模块414,该模块414确定是否更新适应数据(例如讲话部分标记算法),所述在至少一个词之后作出的确定是在利于讲话的交易中说出的(例如向被配置成接收讲话作为输入的计算机口述记录)。Referring again to Figure 8A, operation 506 may include operation 814, which depicts determining whether to update adaptation data after at least one word is spoken in a speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4A) illustrates module 414 for determining possible revisions to speech adaptation data based in part on the results of a portion of a speech-facilitated transaction after at least one word is spoken by a particular party, the module 414 determining whether to update adaptation data (e.g., a speech portion tagging algorithm), the determination made after at least one word is spoken in a speech-facilitated transaction (e.g., dictating a transcript to a computer configured to receive speech as input).
再次参见图8A,操作814可包括操作816,该操作816描绘了确定是否更新适应数据,所述确定发生在利于讲话的交易中说出一特定词之前。例如,图4(例如图4A)示出部分地基于利于讲话的交易的一部分的结果并在特定词之前作出的确定讲话适应数据可能的修正的模块416,该模块416确定是否更新适应数据(例如部分图案树模型),所述确定发生在利于讲话的交易中说出特定词之前(例如在可能以词“再见”结束的交易中说出词“再见”之前)。Referring again to Figure 8A, operation 814 may include operation 816, which depicts determining whether to update adaptation data, the determination occurring before a particular word is spoken in a speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4A) illustrates module 416 for determining a possible revision of speech adaptation data based in part on the results of a portion of a speech-facilitated transaction and made before a particular word. The module 416 determines whether to update adaptation data (e.g., a portion of a pattern tree model) the determination occurring before a particular word is spoken in a speech-facilitated transaction (e.g., before the word "bye" is spoken in a transaction that may end with the word "bye").
现在参见图8B,操作506可包括操作818,该操作818描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4B)示出利于讲话的交易结果的一部分的获取模块418,该获取模块418接收利于讲话的交易(例如在机场使用讲话使能的自动售票设备打印机票)的至少一部分(例如由用户说出并由自动售票设备解释的第一句子)的结果(例如交易部分的解释是否成功或多成功或者估计设备有多少可能确定交易是成功的指示)。8B , operation 506 may include operation 818, which depicts receiving a result of at least a portion of the speech-enabled transaction. For example, FIG4 (e.g., FIG4B ) illustrates a portion of a speech-enabled transaction result acquisition module 418, which receives a result (e.g., an indication of whether or how successful the interpretation of the portion of the transaction was successful, or an estimate of how likely the device was to determine that the transaction was successful) of at least a portion (e.g., a first sentence spoken by a user and interpreted by the speech-enabled ticketing device) of a speech-enabled transaction (e.g., printing a ticket at an airport using a speech-enabled ticketing device).
再次参见图8B,操作506可包括操作820,该操作820描绘了确定是否更新适应数据,所述确定至少部分地基于所接收的结果。例如,图4(例如图4B)示出部分地基于所获得的结果确定讲话适应数据可能的修正的模块420,该模块420确定是否更新适应数据(例如潜在对话作用矩阵),所述确定至少部分地基于所接收的结果(例如潜在对话作用矩阵藉此将被修正的量是基于所接收的交易部分有多成功的指示)。Referring again to FIG8B , operation 506 may include operation 820 , which depicts determining whether to update the adaptation data, the determination being based at least in part on the received results. For example, FIG4 (e.g., FIG4B ) illustrates module 420 for determining possible revisions to speech adaptation data based in part on the obtained results. The module 420 determines whether to update the adaptation data (e.g., a potential conversation interaction matrix), the determination being based at least in part on the received results (e.g., the amount by which the potential conversation interaction matrix is to be revised is based on the received indication of how successful the transaction portion was).
再次参见图8B,操作818可包括操作822,该操作822描绘了从目标设备接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4B)示出从目标设备获取利于讲话的交易结果的一部分的模块422,该模块422从目标设备(例如便携式导航系统发送估计)接收利于讲话的交易的至少一部分(例如将命令给予便携式导航系统)的结果(例如估计讲话结束相比利于讲话的交易的平均部分花费72%的更长时间)。8B , operation 818 may include operation 822, which depicts receiving a result of at least a portion of the speech-facilitated transaction from the target device. For example, FIG4 (e.g., FIG4B ) illustrates module 422 for obtaining a portion of the speech-facilitated transaction result from the target device, which module 422 receives a result of at least a portion of the speech-facilitated transaction (e.g., a command given to the portable navigation system) from the target device (e.g., the portable navigation system sending an estimate) (e.g., an estimate that speech completion takes 72% longer than the average portion of the speech-facilitated transaction).
再次参见图8B,操作818可包括操作824,该操作824描绘了从特定方接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4B)示出从特定方获取的利于讲话的交易的一部分的模块424,该模块424从特定方(例如用户指示交易解释是“糟糕的”)接收利于讲话的交易的至少一部分(例如从自动饮料制备机设备订购山泉汽水)的结果(例如交易解释被评级为“糟糕的”)。8B , operation 818 may include operation 824, which depicts receiving a result of at least a portion of the speech-favored transaction from a particular party. For example, FIG4 (e.g., FIG4B ) illustrates module 424 for obtaining a portion of the speech-favored transaction from a particular party, which module 424 receives a result (e.g., the transaction interpretation being rated as "poor") of at least a portion of the speech-favored transaction (e.g., ordering spring soda from an automated beverage preparation machine device) from a particular party (e.g., a user indicating that the transaction interpretation is "poor").
再次参见图8B,操作824可包括操作826,该操作826描绘了从与特定设备交互的特定方接收利于讲话的交易的至少一部分的结果,该特定设备被配置成管理适应数据。例如,图4(例如图4B)示出从通过特定方操作的特定设备获取利于讲话的交易结果的一部分的模块426,该模块426从与被配置成管理适应数据(例如词和/或音节依赖的分析器)的特定设备(例如插入到车内并被配置成存储、发送和接收适应数据并关联于特定驾驶员的智能钥匙)交互的特定方接收利于讲话的交易的至少一部分的结果(例如命令机动交通工具控制系统下降后车窗并打开遮阳蓬)。Referring again to Figure 8B, operation 824 may include operation 826, which depicts receiving a result of at least a portion of a speech-facilitated transaction from a specific party interacting with a specific device configured to manage adaptation data. For example, Figure 4 (e.g., Figure 4B) illustrates module 426 for obtaining a portion of a speech-facilitated transaction result from a specific device operated by a specific party, the module 426 receiving a result of at least a portion of a speech-facilitated transaction (e.g., commanding a motor vehicle control system to lower a rear window and open a sunshade) from a specific party interacting with a specific device configured to manage adaptation data (e.g., a word and/or syllable dependency analyzer) (e.g., a key fob inserted into a vehicle and configured to store, send, and receive adaptation data and associated with a specific driver).
再次参见图8B,操作826可包括操作828,该操作828描绘了从特定方按下特定设备上被配置成管理适应数据的按钮而接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4B)示出至少部分地作为按下特定设备的按钮的结果获取利于讲话的交易结果的一部分的模块428,该模块428从特定方(例如用户)按下特定设备(例如用户的智能电话)上的按钮(例如写明“如果你想要与之通信的设备对处理你的讲话存在麻烦,就按这里”的按钮)而接收利于讲话的交易的至少一部分的结果(例如利于讲话的交易失败的指示),所述按钮被配置成管理适应数据(例如智能电话管理用户的适应数据,比如不合语法的话语删除算法)。Referring again to Figure 8B, operation 826 may include operation 828, which depicts receiving a result of at least a portion of the speech-facilitated transaction from a particular party pressing a button on a particular device configured to manage adaptation data. For example, Figure 4 (e.g., Figure 4B) illustrates module 428 for obtaining a portion of the speech-facilitated transaction result at least in part as a result of pressing a button on a particular device, which module 428 receives a result of at least a portion of the speech-facilitated transaction (e.g., an indication that the speech-facilitated transaction failed) from a particular party (e.g., a user) pressing a button on a particular device (e.g., a user's smartphone) (e.g., a button that reads "Press here if the device you wish to communicate with is having trouble processing your speech") that is configured to manage adaptation data (e.g., a smartphone manages the user's adaptation data, such as an ungrammatical utterance removal algorithm).
再次参见图8B,操作826可包括操作830,该操作830描绘了在利于讲话的交易结束之后从特定方接收利于讲话的交易的至少一部分的结果,该特定方向特定设备提供反馈。例如,图4(例如图4B)示出从通过特定方操作的特定设备获取的利于讲话的交易反馈的一部分的模块430,该模块430在利于讲话的交易结束后(例如在利于讲话的交易已结束后概览被发送至蜂窝电话并被用户采用)从向特定设备提供反馈(例如填写显示在蜂窝电话的屏幕上的概览)的特定方(例如买票的用户)接收利于讲话的交易的至少一部分(例如在火车站购买火车票)的结果(例如其中利于讲话的交易被评级为5级中的1级的概览结果)。8B , operation 826 may include operation 830, which depicts receiving, after the speech-facilitated transaction is concluded, a result of at least a portion of the speech-facilitated transaction from a particular party that provided feedback to a particular device. For example, FIG4 (e.g., FIG4B ) illustrates module 430 for obtaining a portion of speech-facilitated transaction feedback from a particular device operated by the particular party, wherein module 430 receives, after the speech-facilitated transaction is concluded (e.g., after the speech-facilitated transaction is concluded and the summary is sent to the cell phone and adopted by the user), a result of at least a portion of the speech-facilitated transaction (e.g., purchasing a train ticket at a train station) (e.g., a summary result in which the speech-facilitated transaction was rated 1 out of 5) from a particular party (e.g., a user purchasing a ticket) that provided feedback to the particular device (e.g., filling out a summary displayed on a screen of the cell phone).
再次参见图8B,操作818可包括操作832,该操作832描绘了接收利于讲话的交易的至少一部分是不成功的指示。例如,图4(例如图4B)示出指示低于交易完全成功的利于讲话的交易结果的一部分的获取模块432,该获取模块432接收利于讲话的交易的至少一部分是不成功的指示(例如接收交易中用户所说的前面三句话无法被解析成设备理解的命令的指示)。8B , operation 818 may include operation 832, which depicts receiving an indication that at least a portion of the speech-enabled transaction was unsuccessful. For example, FIG. 4 (e.g., FIG. 4B ) illustrates acquisition module 432 indicating a portion of a speech-enabled transaction outcome that was less than a complete success, where acquisition module 432 receives an indication that at least a portion of the speech-enabled transaction was unsuccessful (e.g., receives an indication that the first three sentences spoken by the user in the transaction could not be parsed into commands understood by the device).
再次参见图8B,操作832可包括操作834,该操作834描绘了接收到利于讲话的交易的至少一部分未成功地由目标设备解释的指示。例如,图4(例如图4B)示出指示交易的一部分的不完全或低置信水平的解释的利于讲话的交易结果的一部分的获取模块434,该获取模块434接收利于讲话的交易的至少一部分(例如在房间控制系统中给予降低环境温度的命令)未由目标设备(例如自动出纳机设备)成功解释(例如命令失败)的指示。8B , operation 832 may include operation 834, which depicts receiving an indication that at least a portion of the speech-facilitated transaction was not successfully interpreted by the target device. For example, FIG4 (e.g., FIG4B ) illustrates capture module 434 of a portion of a speech-facilitated transaction result indicating an incomplete or low-confidence interpretation of a portion of the transaction, where capture module 434 receives an indication that at least a portion of the speech-facilitated transaction (e.g., a command to lower the ambient temperature in a room control system) was not successfully interpreted (e.g., the command failed) by a target device (e.g., an automated teller machine device).
现在参见图8C,操作818可包括操作836,该操作836描绘了接收到设备已确定利于讲话的交易的至少一部分未成功的指示。例如,图4(例如图4C)示出指示目标设备执行利于讲话的交易的至少一部分失败的利于讲话的交易结果的一部分的获取模块436,该获取模块436接收目标设备(例如自动售票设备)已确定利于讲话的交易的至少一部分(例如交易的第一部分,其中用户尝试识别他们想要前往的城市)是不成功的(例如设备无法定位由用户说出的城市)指示。8C , operation 818 may include operation 836, which depicts receiving an indication that the device has determined that at least a portion of the speech-enabled transaction was unsuccessful. For example, FIG4 (e.g., FIG4C ) illustrates an acquisition module 436 of a portion of a speech-enabled transaction result indicating that at least a portion of the speech-enabled transaction failed to be performed by a target device, where the acquisition module 436 receives an indication that a target device (e.g., an automated ticketing device) has determined that at least a portion of the speech-enabled transaction (e.g., a first portion of the transaction in which a user attempted to identify a city to which they wish to travel) was unsuccessful (e.g., the device was unable to locate a city spoken by the user).
再次参见图8C,操作836可包括操作838,该操作838描绘了接收目标设备解释利于讲话的交易的至少一部分具有高于特定阈值的出错率的指示。例如,图4(例如图4C)示出指示目标设备确定的差错率高于一阈值的利于讲话的交易结果的一部分的获取模块436,该获取模块436接收目标设备(例如工作自办公室企业网络上的讲话使能的复印机设备)解释利于讲话的交易的至少一部分(例如命令从“黑白”改变至“彩色”)具有高于特定阈值的出错率(例如尝试了四次才使用户的命令被理解)的指示。8C , operation 836 may include operation 838, which depicts receiving an indication that the target device interpreted at least a portion of the speech-enabled transaction with an error rate above a particular threshold. For example, FIG4 (e.g., FIG4C ) illustrates acquisition module 436 indicating that the target device determined a portion of the speech-enabled transaction result with an error rate above a threshold. Acquisition module 436 receives an indication that the target device (e.g., a speech-enabled copier device operating on an office enterprise network) interpreted at least a portion of the speech-enabled transaction (e.g., a command to change from "black and white" to "color") with an error rate above a particular threshold (e.g., it took four attempts for the user's command to be understood).
再次参见图8C,操作836可包括操作840,该操作840描绘了接收目标设备解释利于讲话的交易的至少一部分具有低于特定阈值的置信率的指示。例如,图4(例如图4C)示出指示目标设备确定置信率低于阈值的利于讲话的交易结果的一部分的获取模块440,该获取模块440接收目标设备(例如个人导航系统)解释利于讲话的交易的至少一部分(例如用户尝试寻找的餐馆名)具有低于特定阈值的置信率(例如设备标识正确地点的置信度低于25%)的指示。8C , operation 836 may include operation 840, which depicts receiving an indication that the target device interpreted at least a portion of the speech-enabled transaction with a confidence rate below a particular threshold. For example, FIG. 4 (e.g., FIG. 4C ) illustrates an acquisition module 440 instructing the target device to determine a portion of the speech-enabled transaction result with a confidence rate below a threshold. The acquisition module 440 receives an indication that the target device (e.g., a personal navigation system) interpreted at least a portion of the speech-enabled transaction (e.g., the name of a restaurant that a user was trying to find) with a confidence rate below a particular threshold (e.g., the device has a confidence rate of less than 25% that it identified the correct location).
再次参见图8C,操作818可包括操作842,该操作842描绘了接收特定方通知目标设备利于讲话的交易的至少一部分是不成功的指示。例如,图4(例如图4C)示出经由目标设备从特定方获取利于讲话的交易结果的一部分的模块442,该模块442接收特定方通知目标设备(例如讲话使能的自动出纳机设备)利于讲话的交易的至少一部分(例如选择拟撤回钱的数量的部分)是不成功的(例如用户按下指示在屏幕上显示的钱的数量中没有任何一笔是用户声称的钱的数量的按钮)的指示。8C , operation 818 may include operation 842, which depicts receiving an indication from a particular party that notifies a target device that at least a portion of the speech-enabled transaction was unsuccessful. For example, FIG4 (e.g., FIG4C ) illustrates module 442 for obtaining a portion of the speech-enabled transaction result from a particular party via a target device, wherein module 442 receives an indication from a particular party that notifies a target device (e.g., a speech-enabled automated teller machine device) that at least a portion of the speech-enabled transaction (e.g., a portion selecting an amount of money to be withdrawn) was unsuccessful (e.g., a user pressing a button indicating that none of the amounts displayed on the screen are the amounts claimed by the user).
再次参见图8C,操作818可包括操作844,该操作844描绘了接收利于讲话的交易的至少一部分的结果,所述结果包括被成功地解释的一个或多个词的列表。例如,图4(例如图4C)示出包括一个或多个解释的词的利于讲话的交易的一部分的获取模块444,该获取模块444接收利于讲话的交易的至少一部分(例如在自动通行窗口订单的一部分)的结果(例如一个或多个词的列表,所述词是交易的一部分并被成功地解释)。8C , operation 818 may include operation 844, which depicts receiving a result of at least a portion of a speech-facilitated transaction, the result including a list of one or more words that were successfully interpreted. For example, FIG. 4 (e.g., FIG. 4C ) illustrates a portion of a speech-facilitated transaction including one or more interpreted words, where the capture module 444 receives a result (e.g., a list of one or more words that were part of the transaction and were successfully interpreted) of at least a portion of a speech-facilitated transaction (e.g., a portion of an auto-pass window order).
再次参见图8C,操作818可包括操作846,该操作846描绘了接收利于讲话的交易的至少一部分的结果,所述结果包括未被成功解释的一个或多个词的列表。例如,图4(例如图4C)示出包括未被完全解释的一个或多个词的利于讲话的交易结果的一部分的获取模块446,该获取模块446接收利于讲话的交易(例如用户对运行在家庭网络上的讲话使能的计算机使用讲话的法律记录)的至少一部分(例如其一个段落)的结果(例如包含一个或多个未成功解释的词(例如据初步印象)的数据结构)。8C , operation 818 may include operation 846, which depicts receiving a result of at least a portion of a speech-facilitated transaction, the result including a list of one or more words that were not successfully interpreted. For example, FIG4 (e.g., FIG4C ) illustrates an acquisition module 446 of a portion of a speech-facilitated transaction result including one or more words that were not fully interpreted, the acquisition module 446 receiving a result (e.g., a data structure containing one or more unsuccessfully interpreted words (e.g., based on a preliminary impression)) of at least a portion (e.g., a paragraph thereof) of a speech-facilitated transaction (e.g., a legal transcript of a user's use of speech to a speech-enabled computer operating on a home network).
再次参见图8C,操作818包括操作848,该操作848描绘了接收利于讲话的交易的至少一部分的结果,所述结果包括在利于讲话的交易期间由特定方说出的一个或多个词的发音。例如,图4(例如图4C)示出包括特定方说出利于讲话的交易的一个或多个词的表示的利于讲话交易结果的一部分的获取模块448,该获取模块448接收在利于讲话的交易(例如交易包括若干步骤以设定家庭影院设置,例如将声音调节成环绕声、降低亮度和对比度以及将调谐对蓝光播放机的视频输入)期间包括由特定方说出的一个或多个词(例如词“播放电影‘盗梦空间’”)的发音的利于讲话的交易的至少一部分(例如对家庭影院系统给予命令)的结果(例如设备获得不能解释或者花费一次以上的尝试以解释的一个或多个词的用户发音的波形音频文件(WAV文件))。8C , operation 818 includes operation 848, which depicts receiving a result of at least a portion of a speech-facilitated transaction, the result including a pronunciation of one or more words spoken by a particular party during the speech-facilitated transaction. For example, FIG4 (e.g., FIG4C ) illustrates a capture module 448 of a portion of a speech-facilitated transaction result including a representation of one or more words spoken by a particular party during the speech-facilitated transaction. The capture module 448 receives a result (e.g., a device obtains a waveform audio file (WAV file) of a user's pronunciation of one or more words that the device cannot interpret or takes more than one attempt to interpret) of at least a portion of a speech-facilitated transaction (e.g., a transaction including several steps to set up a home theater setup, such as adjusting the sound to surround sound, reducing brightness and contrast, and tuning the video input to a Blu-ray player) including a pronunciation of one or more words spoken by the particular party (e.g., the words "play the movie 'Inception'").
现在参见图8D,操作818可包括操作850,该操作850描绘了接收利于讲话的交易的至少一部分的结果,所述结果包括在利于讲话的交易期间由目标设备解释的一个或多个词的发音。例如,图4(图4D)示出包括目标设备解释利于讲话的交易的一个或多个词的表示的利于讲话的交易结果的一部分的获取模块450,该获取模块450接收利于讲话的交易的至少一部分的结果(例如对讲话使能的电视机给予语音命令),所述结果包括在利于讲话的交易期间由目标设备解释的一个或多个词(例如“断电”)的发音。8D , operation 818 may include operation 850, which depicts receiving a result of at least a portion of the speech-enabled transaction, the result including a pronunciation of one or more words interpreted by the target device during the speech-enabled transaction. For example, FIG. 4 ( FIG. 4D ) illustrates an acquisition module 450 of a portion of a speech-enabled transaction result including a representation of one or more words interpreted by the target device for the speech-enabled transaction, the acquisition module 450 receiving a result of at least a portion of the speech-enabled transaction (e.g., a voice command given to a speech-enabled television), the result including a pronunciation of one or more words interpreted by the target device during the speech-enabled transaction (e.g., “power off”).
再次参见图8D,操作818可包括操作852,该操作852描绘了接收利于讲话的交易的至少一部分的结果,所述结果包括在利于讲话的交易期间解释的一个或多个词的表格以及由特定方作出的一个或多个解释的词的发音。例如,图4(例如图4D)示出包括被配置成存储利于讲话的交易的一个或多个解释的词和所述一个或多个解释的词的发音的一个或多个相应表示的数据结构的利于讲话的交易结果的部分的获取模块452,该获取模块452接收利于讲话的交易的至少一部分的结果,所述结果包括在利于讲话的交易期间解释的一个或多个词的表格以及通过特定方(例如用户)作出的一个或多个解释的词的发音。8D , operation 818 may include operation 852, which depicts receiving results of at least a portion of a speech-facilitated transaction, the results comprising a table of one or more words interpreted during the speech-facilitated transaction and pronunciations of the one or more interpreted words by a particular party. For example, FIG4 (e.g., FIG4D ) illustrates a portion of a speech-facilitated transaction results acquisition module 452 comprising a data structure configured to store one or more interpreted words of the speech-facilitated transaction and one or more corresponding representations of the pronunciations of the one or more interpreted words, the acquisition module 452 receiving results of at least a portion of a speech-facilitated transaction, the results comprising a table of one or more words interpreted during the speech-facilitated transaction and pronunciations of the one or more interpreted words by a particular party (e.g., a user).
再次参见图8D,操作506可包括操作854,该操作854描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4D)示出利于讲话的交易的一部分的结果的至少一部分的接收模块454,该接收模块454接收利于讲话的交易的至少一部分(例如对在讲话使能的视频游戏系统上播放的视频游戏通过麦克风(例如头戴耳机)给予命令)的结果(例如对讲话的解释成功的指示)。8D , operation 506 may include operation 854, which depicts receiving a result of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4D ) illustrates module 454 receiving a result of at least a portion of the speech-facilitated transaction (e.g., an indication that interpretation of speech was successful) for at least a portion of the speech-facilitated transaction (e.g., commands given to a video game played on a speech-enabled video game system through a microphone (e.g., a headset)).
再次参见图8D,操作506可包括操作856,该操作856描绘了基于利于讲话的交易的至少一部分的结果更新适应数据。例如,图4(例如图4D)示出至少部分地基于接收的结果的至少一部分修正讲话适应数据的模块456,该模块456基于利于讲话的交易的至少一部分(例如通过麦克风(例如头戴耳机)向在讲话使能的视频游戏系统上播放的视频游戏给予命令)的结果更新适应数据(例如随机状态转变网络)。8D , operation 506 may include operation 856, which depicts updating the adaptation data based on the results of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4D ) illustrates module 456 for modifying speech adaptation data based at least in part on at least a portion of the received results, where module 456 updates adaptation data (e.g., a random state transition network) based on the results of at least a portion of the speech-facilitated transaction (e.g., giving commands to a video game played on a speech-enabled video game system through a microphone (e.g., a headset).
再次参见图8D,操作506可包括操作858,该操作858描绘了在利于讲话的交易完成之前将更新的适应数据发送至目标设备。例如,图4(例如图4D)示出在利于讲话的交易完成之前发送更新的讲话适应数据的模块458,该模块458在利于讲话的交易结束之前(例如在用户给予对该游戏的一串命令中的最后一个命令之前)将更新的适应数据(例如具有至少一个更新的或改变的状态转变路径的随机状态转变网络)发送至目标设备。8D , operation 506 may include operation 858, which depicts sending updated adaptation data to the target device before the speech-facilitated transaction is completed. For example, FIG4 (e.g., FIG4D ) illustrates module 458 for sending updated speech adaptation data before the speech-facilitated transaction is completed, which sends updated adaptation data (e.g., a random state transition network having at least one updated or changed state transition path) to the target device before the speech-facilitated transaction ends (e.g., before the user issues the last command in a series of commands to the game).
再次参见图8D,操作506可包括操作860,该操作860描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4D)示出利于讲话的交易结果的一部分的接收模块456,该接收模块456接收利于讲话的交易的至少一部分(例如一部分用户通过自动通行窗口订购双份芝士汉堡包、薯条和冰茶)的结果(例如讲话的解释是否成功的指示)。8D , operation 506 may include operation 860, which depicts receiving a result of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4D ) illustrates module 456 receiving a result of at least a portion of the speech-facilitated transaction (e.g., an indication of whether the interpretation of the speech was successful) of at least a portion of the speech-facilitated transaction (e.g., a portion of a user ordering a double cheeseburger, fries, and iced tea through a drive-through window).
再次参见图8D,操作506可包括操作862,该操作862描绘了基于利于讲话的交易的至少一部分的结果更新适应数据。例如,图4(例如图4D)示出部分地基于结果修正讲话适应数据的模块458,该模块458基于利于讲话的交易的至少一部分(例如发出订单的第一部分,其中用户订购“双份芝士汉堡包”)的结果更新适应数据(例如改变发音词典中的一个或多个词的发音,例如词“双份(double)”)。8D , operation 506 may include operation 862, which depicts updating the adaptation data based on the results of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4D ) illustrates module 458 for modifying speech adaptation data based in part on the results of updating the adaptation data (e.g., changing the pronunciation of one or more words in the pronunciation dictionary, such as the word "double") based on the results of at least a portion of the speech-facilitated transaction (e.g., placing the first portion of an order in which the user ordered a "double cheeseburger").
再次参见图8D,操作506可包括操作864,该操作864描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4D)示出利于讲话的交易的一部分的结果的至少一部分的获得模块464,该获得模块464接收利于讲话的交易的至少一部分(例如用户向自动出纳机设备给予命令)的结果(例如关于解释是否成功的目标设备推断的指示)。8D , operation 506 may include operation 864, which depicts receiving a result of at least a portion of the speech-enabled transaction. For example, FIG4 (e.g., FIG4D ) illustrates an obtaining module 464 for obtaining at least a portion of the result of a portion of the speech-enabled transaction, which receives a result (e.g., an indication of a target device inference regarding whether the interpretation was successful) of at least a portion of the speech-enabled transaction (e.g., a user giving a command to an ATM device).
再次参见图8D,操作506可包括操作866,该操作866描绘了基于利于讲话的交易的至少一部分的结果更新适应数据。例如,图4(例如图4D)示出至少部分地基于接收的结果的至少一部分更新讲话适应数据的模块466,该模块466基于利于讲话的交易的至少一部分(例如用户向自动出纳机设备给予个人标识号(PIN))的结果(例如由自动出纳机设备作出的关于解释是否成功的推断)更新适应数据(例如音素发音数据库)。8D , operation 506 may include operation 866, which depicts updating the adaptation data based on the outcome of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4D ) illustrates module 466 for updating speech adaptation data based at least in part on at least a portion of the received outcome, which updates adaptation data (e.g., a phoneme pronunciation database) based on the outcome (e.g., an inference made by the ATM device as to whether the interpretation was successful) of at least a portion of the speech-facilitated transaction (e.g., a user providing a personal identification number (PIN) to the ATM device).
再次参见图8D,操作506可包括操作868,该操作868描绘了发送指示适应数据已被更新的信号。例如,图4(例如图4E)示出指示讲话适应数据的适应的信号的发送模块468,该发送模块468发送指示适应数据(例如话语忽略算法)已被更新(例如要忽略的额外话语已被添加至该算法)的信号。8D , operation 506 may include operation 868, which depicts sending a signal indicating that the adaptation data has been updated. For example, FIG4 (e.g., FIG4E ) illustrates the sending module 468 of a signal indicating adaptation of speech adaptation data, which sends a signal indicating that the adaptation data (e.g., an utterance ignoring algorithm) has been updated (e.g., additional utterances to be ignored have been added to the algorithm).
现在参见图8E,操作506可包括操作870,该操作870描绘了从又一设备接收有关更新适应数据的更新数据。例如,图4(例如图4E)示出从又一设备接收可能的修正数据的模块470,该模块470从又一设备(例如远程服务器)接收关于更新适应数据(例如关于修正发音词典中的词的发音)的更新数据(例如指示如何修正发音词典的数据)。8E , operation 506 may include operation 870, which depicts receiving update data regarding updated adaptation data from another device. For example, FIG. 4 (e.g., FIG. 4E ) illustrates module 470 for receiving possible correction data from another device, which receives update data (e.g., data indicating how to correct the pronunciation dictionary) regarding updated adaptation data (e.g., regarding correcting the pronunciation of a word in the pronunciation dictionary) from another device (e.g., a remote server).
再次参见图8E,操作506可包括操作872,该操作872描绘了基于所接收的更新数据更新适应数据。例如,图4(例如图4D)示出将从又一设备接收的可能修正数据施加至讲话适应数据的模块472,该模块472基于接收的更新数据(例如指示如何修正发音词典的数据)更新适应数据(例如修正发音词典)。8E , operation 506 may include operation 872, which depicts updating the adaptation data based on the received update data. For example, FIG. 4 (e.g., FIG. 4D ) illustrates module 472 for applying possible correction data received from another device to speech adaptation data, and module 472 updates the adaptation data (e.g., a modified pronunciation dictionary) based on the received update data (e.g., data indicating how to modify the pronunciation dictionary).
再次参见图8E,操作506可包括操作874,该操作874描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4E)示出利于讲话的交易结果部分的接收模块474,该接收模块474接收利于讲话的交易(例如将一系列命令给予家庭影院系统)的至少一部分的结果(例如给予家庭影院系统的一个命令)。8E , operation 506 may include operation 874, which depicts receiving a result of at least a portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4E ) illustrates speech-facilitated transaction result portion receiving module 474, which receives a result of at least a portion of the speech-facilitated transaction (e.g., a command to the home theater system) (e.g., a series of commands to the home theater system).
再次参见图8E,操作506可包括操作876,该操作876描绘了将利于讲话的交易的至少一部分的结果发送至又一位置。例如,图4(例如图4E)示出将利于讲话的交易结果部分发送至设备的模块476,该模块476将利于讲话的交易的至少一部分(例如给予家庭影院系统的一个命令)的结果(例如利于讲话的交易的解释是否成功的指示)发送至又一位置(例如发送至家中的计算机,该计算机被联网至家庭影院系统的一个或多个设备)。8E , operation 506 may include operation 876, which depicts sending the result of at least a portion of the speech-facilitated transaction to another location. For example, FIG4 (e.g., FIG4E ) illustrates module 476 for sending a portion of the speech-facilitated transaction result to a device, which sends the result of at least a portion of the speech-facilitated transaction (e.g., a command given to the home theater system) (e.g., an indication of whether the interpretation of the speech-facilitated transaction was successful) to another location (e.g., to a computer at home that is networked to one or more devices of the home theater system).
再次参见图8E,操作506可包括操作878,该操作878描绘了从又一位置接收更新数据,该更新数据包括是否更新适应数据的指示。例如,图4(例如图4E)示出指示是否修正从设备接收的讲话适应数据的确定数据的模块478,该模块478从又一位置(例如与一个或多个设备联网的家用计算机)接收更新数据,该更新数据包括是否更新(例如是否修正)适应数据(例如词组补足算法)的指示。Referring again to Figure 8E, operation 506 may include operation 878, which depicts receiving update data from another location, the update data including an indication of whether to update the adaptation data. For example, Figure 4 (e.g., Figure 4E) illustrates module 478 for indicating whether to modify the determination data of speech adaptation data received from the device, the module 478 receiving update data from another location (e.g., a home computer networked with one or more devices), the update data including an indication of whether to update (e.g., whether to modify) the adaptation data (e.g., a phrase completion algorithm).
再次参见图8E,操作876可包括操作880,该操作880描绘了将利于讲话的交易的至少一部分的结果发送至也存储适应数据的又一位置。例如图4(例如图4E)示出将利于讲话的交易结果部分发送至存储讲话适应数据的设备的模块480,该模块480将利于讲话的交易的至少一部分(例如给予家庭影院系统的一个命令)的结果(例如指示被给予家庭影院系统的一个命令的讲话被多高效地处理的分数)发送至也存储适应数据的又一位置(例如远程云服务器)。Referring again to Figure 8E, operation 876 may include operation 880, which depicts sending the results of at least a portion of the speech-facilitated transaction to another location that also stores adaptation data. For example, Figure 4 (e.g., Figure 4E) shows module 480 sending a portion of the speech-facilitated transaction results to a device that stores speech adaptation data. Module 480 sends the results of at least a portion of the speech-facilitated transaction (e.g., a command given to a home theater system) (e.g., a score indicating how efficiently speech given to a command given to the home theater system was processed) to another location that also stores adaptation data (e.g., a remote cloud server).
再次参见图8E,操作878可包括操作882,该操作882描绘了从又一位置接收更新数据,该更新数据进一步包括用于更新适应数据的指令。例如,图4(例如图4E)示出指令如何修正从设备接收的讲话适应数据的确定数据的模块482,该模块482从又一位置(例如用于处理企业网络设定中的适应数据的潜在变化的专用计算机)接收更新数据(例如用于修正音素数据库的指令),该更新数据包括用于更新适应数据(例如音素数据库)的指令。Referring again to Figure 8E, operation 878 may include operation 882, which depicts receiving update data from another location, the update data further including instructions for updating the adaptation data. For example, Figure 4 (e.g., Figure 4E) shows module 482 that instructs how to modify the determination data of speech adaptation data received from the device, the module 482 receiving update data (e.g., instructions for modifying a phoneme database) from another location (e.g., a dedicated computer for processing potential changes to adaptation data in an enterprise network setting), the update data including instructions for updating the adaptation data (e.g., phoneme database).
再次参见图8E,操作506可包括操作,该操作884描绘了接收利于讲话的交易的至少一部分的结果。例如,图4(例如图4E)示出利于讲话的交易结果部分的接收模块848,该接收模块848接收利于讲话的交易的至少一部分(例如执行一组命令以定制讲话使能的计算机上的互联网浏览器)的结果(例如运作在计算机上并确定由计算机完成的讲话处理的效率的独立程序)。Referring again to Figure 8E, operation 506 may include an operation depicting receiving a result of at least a portion of the speech-enabled transaction, operation 884. For example, Figure 4 (e.g., Figure 4E) illustrates a speech-enabled transaction result portion receiving module 848 that receives a result (e.g., a separate program running on a computer and determining the efficiency of speech processing performed by the computer) of at least a portion of the speech-enabled transaction (e.g., executing a set of commands to customize an Internet browser on a speech-enabled computer).
再次参见图8E,操作506可包括操作886,该操作886描绘了至少部分地基于利于讲话的交易的至少一部分的结果确定以更新适应数据。例如,图4(例如图4E)示出部分地基于利于讲话的交易结果部分作出讲话适应数据修正决策的模块486,该模块486至少部分地基于利于讲话的交易的至少一部分的结果(例如交易中的第一命令以定制浏览器,例如加载定制的主页或首页)确定以更新适应数据(例如句子图解路径选择算法)。Referring again to Figure 8E, operation 506 may include operation 886, which depicts determining to update adaptation data based at least in part on the results of at least a portion of the speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4E) illustrates module 486 for making speech adaptation data modification decisions based in part on the results of the speech-facilitated transaction, which module 486 determines to update adaptation data (e.g., a sentence graph path selection algorithm) based at least in part on the results of at least a portion of the speech-facilitated transaction (e.g., the first command in the transaction to customize the browser, e.g., to load a customized homepage or front page).
再次参见图8E,操作506可包括操作888,该操作888描绘了将更新适应数据的请求发送至又一位置。例如,图4(例如图4E)示出请求将修正讲话适应数据发送至设备的模块488,该模块488将更新适应数据(例如句子图解路径选择算法)的请求发送至又一位置(例如发送至由其上运行浏览器的操作系统的开发者运作的远程服务器)。8E , operation 506 may include operation 888, which depicts sending a request to update the adaptation data to another location. For example, FIG4 (e.g., FIG4E ) illustrates module 488 requesting that the modified speech adaptation data be sent to the device, which module 488 sends a request to update the adaptation data (e.g., the sentence graph path selection algorithm) to another location (e.g., to a remote server operated by the developer of the operating system on which the browser is running).
再次参见图8E,操作506可包括操作890,该操作890描绘了从又一位置接收更新的适应数据。例如,图4(例如图4E)示出从设备获取修正的适应数据的模块490,该模块490从又一位置(例如从远程服务器,其基于用户的许多讲话交互修正适应数据)接收更新的适应数据(例如修正的或改变的句子图解路径选择算法)。Referring again to Figure 8E, operation 506 may include operation 890, which depicts receiving updated adaptation data from another location. For example, Figure 4 (e.g., Figure 4E) shows module 490 for obtaining revised adaptation data from a device, which receives updated adaptation data (e.g., a revised or changed sentence graph path selection algorithm) from another location (e.g., from a remote server that revised the adaptation data based on a number of speech interactions of the user).
再次参见图8E,操作506可包括操作891,该操作891描绘了接收利于讲话的交易的至少一部分的结果,其中该结果指示利于讲话的交易移动地太快而使讲话无法被完全处理。例如,图4(例如图4E)示出来自设备的减慢请求的可能的修正数据的模块491,该模块491接收利于讲话的交易的至少一部分的结果,其中结果指示利于讲话的交易移动地太快而使讲话无法被完全处理(例如说话者讲话过快并且目标设备无法跟上处理)。8E , operation 506 may include operation 891 depicting receiving a result of at least a portion of a speech-facilitating transaction, wherein the result indicates that the speech-facilitating transaction is moving too fast for the speech to be fully processed. For example, FIG4 (e.g., FIG4E ) illustrates module 491 of possible correction data for a slowdown request from a device, which receives a result of at least a portion of a speech-facilitating transaction, wherein the result indicates that the speech-facilitating transaction is moving too fast for the speech to be fully processed (e.g., the speaker is speaking too fast and the target device is unable to keep up with the processing).
再次参见图8E,操作506可包括操作893,该操作893描绘了促进向特定方呈现指示特定方减慢利于讲话的交易的视频或音频指示。例如,图4(例如图4E)示出减慢指示的呈现的促进模块493,该促进模块493利于将视频或音频指示(例如嘟嘟声或者黄光或红光)呈现(例如或者在与用户关联的特定设备(例如智能电话)或者通过它用户具有利于讲话的交易的设备(例如自动通行窗口))给特定方,其指示特定方减慢(例如或者停止讲话一定时间或减慢用户语音的语调)利于讲话的交易。8E , operation 506 may include operation 893, which depicts facilitating presentation of a video or audio indication to the particular party instructing the particular party to slow down to facilitate the transaction. For example, FIG4 (e.g., FIG4E ) illustrates facilitating presentation of a slow down indication module 493, which facilitates presenting (e.g., either on a particular device associated with the user (e.g., a smartphone) or through a device through which the user facilitates the transaction (e.g., an auto-pass window)) a video or audio indication (e.g., a beep or a yellow or red light) to the particular party instructing the particular party to slow down (e.g., either stop speaking for a certain amount of time or slow down the tone of the user's voice) to facilitate the transaction.
现在参见图8F,操作506可包括操作892,该操作892描绘了接收利于讲话的交易的第一部分的结果。例如,图4(例如图4F)示出利于讲话的交易的特定部分的结果的获取模块492,该获取模块492接收利于讲话的交易的第一部分的结果(例如与自动售票机交互(例如为安全目的认证你自己)的第一句子)。Referring now to Figure 8F, operation 506 may include operation 892, which depicts receiving the results of the first portion of the speech-enabled transaction. For example, Figure 4 (e.g., Figure 4F) illustrates the acquisition module 492 of the results of a particular portion of the speech-enabled transaction, which receives the results of the first portion of the speech-enabled transaction (e.g., the first sentence of interacting with the ticket vending machine (e.g., authenticating yourself for security purposes)).
再次参见图8F,操作506可包括操作894,该操作894描绘了至少部分地基于利于讲话的交易的第一部分的结果更新适应数据。例如,图4(图4F)示出至少部分地基于利于讲话的交易的特定部分的获取结果施加讲话适应数据可能的修正494,其至少部分地基于利于讲话的交易的第一部分的结果(例如设备是否能够有效地识别用户并发现给予设备的讲话)更新(例如改变其一个参数)适应数据(例如地方性方言应用算法)。Referring again to Figure 8F, operation 506 may include operation 894, which depicts updating the adaptation data based at least in part on the results of the first portion of the speech-facilitating transaction. For example, Figure 4 (Figure 4F) illustrates applying a possible modification 494 of speech adaptation data based at least in part on the results of obtaining a particular portion of the speech-facilitating transaction, which updates (e.g., changes a parameter of) adaptation data (e.g., a regional dialect application algorithm) based at least in part on the results of the first portion of the speech-facilitating transaction (e.g., whether the device was able to effectively identify the user and detect speech presented to the device).
再次参见图8F,操作506可包括操作896,该操作896描绘了将更新的适应数据发送至目标设备,所述更新的适应数据被配置成在利于讲话的交易的第二部分结束之前通过目标设备施加。例如,图4(例如图4F)示出被配置成通过目标设备施加至利于讲话的交易的又一部分的修正的讲话适应数据的发送模块496,该发送模块496将更新的适应数据(例如具有修正的参数的地区性方言应用算法)发送至目标设备(例如自动售票机),所述更新的适应数据被配置成在利于讲话的交易的第二部分(例如用户给予她的目的地城市信息)结束前通过目标设备(例如自动售票机)施加。8F , operation 506 may include operation 896, which depicts sending updated adaptation data to the target device, the updated adaptation data being configured to be applied by the target device before the second portion of the speech-facilitated transaction concludes. For example, FIG4 (e.g., FIG4F ) illustrates a sending module 496 configured to apply revised speech adaptation data to another portion of the speech-facilitated transaction by the target device, the sending module 496 sending updated adaptation data (e.g., a regional dialect application algorithm with revised parameters) to the target device (e.g., a ticket vending machine), the updated adaptation data being configured to be applied by the target device (e.g., a ticket vending machine) before the second portion of the speech-facilitated transaction (e.g., the user giving her destination city information) concludes.
再次参见图8F,操作894可包括操作898,该操作898描绘了至少部分地基于利于讲话的交易的第一部分的结果改变适应数据的算法。例如,图4(例如图4F)示出至少部分地基于利于讲话的交易的特定部分的获取结果施加讲话适应数据可能的修正的算法的模块496,该模块496至少部分地基于利于讲话的交易的第一部分的结果(例如交易的第一部分的出错率高于25%,这触发算法的切换)至少部分地改变适应数据的算法(例如从地区性方言应用算法切换至基于口音的发音修正算法)。Referring again to Figure 8F, operation 894 may include operation 898, which depicts changing the algorithm for adapting the data based at least in part on the results of the first portion of the speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4F) illustrates module 496 for applying an algorithm for potentially modifying speech adaptation data based at least in part on the results of obtaining a particular portion of the speech-facilitated transaction, wherein module 496 at least in part modifies the algorithm for adapting the data (e.g., switching from a regional dialect application algorithm to an accent-based pronunciation modification algorithm) based at least in part on the results of the first portion of the speech-facilitated transaction (e.g., an error rate greater than 25% for the first portion of the transaction, which triggers the algorithm to switch).
再次参见图8F,操作898可包括操作801,该操作801描绘了至少部分地基于利于讲话的交易的第一部分的结果修正适应数据的至少一个算法的至少一个参数。例如,图4(例如图4F)示出至少部分地基于利于讲话的交易的特定部分的获取结果施加讲话适应数据可能的修正的算法的参数的模块401,该模块401至少部分地基于利于讲话的交易的第一部分的结果(例如表示有太多词段的误报,因此要调节参数)修正适应数据的至少一个算法的至少一个参数(例如改变噪声水平依赖的过滤算法的衰减参数的量)。Referring again to Figure 8F, operation 898 may include operation 801, which depicts modifying at least one parameter of at least one algorithm for adapting the data based at least in part on the results of the first portion of the speech-facilitating transaction. For example, Figure 4 (e.g., Figure 4F) illustrates module 401 for applying a parameter of an algorithm for potentially modifying speech adaptation data based at least in part on the results of acquiring a particular portion of the speech-facilitating transaction. Module 401 modifies at least one parameter of at least one algorithm for adapting the data (e.g., changing the amount of an attenuation parameter of a noise-level-dependent filtering algorithm) based at least in part on the results of the first portion of the speech-facilitating transaction (e.g., indicating that there were too many false positives for a word segment, and therefore the parameter should be adjusted).
再次参见图8F,操作896可包括操作803,该操作803描绘了至少部分地基于利于讲话的交易的第一部分的结果使用适应数据的不同算法。例如,图4(例如图4F)示出至少部分地基于利于讲话的交易的特定部分的获取结果选择讲话适应数据的不同算法的模块403,该模块403至少部分地基于利于讲话的交易的第一部分的结果(例如表示由于没有检测到非词典词因此算法不做任何有用的事的结果)使用适应数据的不同算法(例如从非词典词去除算法切换至讲话不流利检测算法)。8F , operation 896 may include operation 803, which depicts using a different algorithm for adapting the data based at least in part on the results of the first portion of the speech-facilitated transaction. For example, FIG4 (e.g., FIG4F ) illustrates module 403 for selecting a different algorithm for speech adaptation data based at least in part on the results of acquiring a particular portion of the speech-facilitated transaction, the module 403 using a different algorithm for adapting the data (e.g., switching from a non-lexicon word removal algorithm to a speech disfluency detection algorithm) based at least in part on the results of the first portion of the speech-facilitated transaction (e.g., a result indicating that the algorithm did not do anything useful because no non-lexicon words were detected).
现在参见图8G,操作894可包括操作805,该操作805描绘了修正作为适应数据的一部分的发音词典,所述修正至少部分地基于利于讲话的交易的第一部分的结果。例如,图4(例如图4G)示出至少部分地基于利于讲话的交易的特定部分的获取结果施加发音词典可能的修正的模块405,该模块405修正作为适应数据的一部分的发音词典,所述修正至少部分的基于利于讲话的交易的第一部分的结果(例如该结果包括用户对词“二百美元(twohundred dollars)”发音的新方式并因此发音词典被更新以包括该新发音)。Referring now to Figure 8G , operation 894 may include operation 805, which depicts modifying the pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of the first portion of the transaction facilitating speech. For example, Figure 4 (e.g., Figure 4G ) illustrates module 405 for applying possible modifications to the pronunciation dictionary based at least in part on the results of obtaining a particular portion of the transaction facilitating speech, the module 405 modifying the pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of the first portion of the transaction facilitating speech (e.g., the results including a new way for the user to pronounce the word "two hundred dollars" and therefore the pronunciation dictionary being updated to include the new pronunciation).
再次参见图8G,操作805可包括操作807,该操作807描绘了将一个或多个词添加至作为适应数据一部分的发音词典,所述修正至少部分地基于利于讲话的交易的第一部分的结果。例如,图4(例如图4G)示出至少部分地基于利于讲话的交易的特定部分的获取结果将一个或多个词添加至发音词典的模块807,该模块807将一个或多个词(例如词“亚特兰大,乔治亚(Atlanta,Georgia)”)添加至作为适应数据一部分的发音词典,所述修正至少部分地基于利于讲话的交易的第一部分的结果(例如结果指示目标设备(例如个人导航系统)当用户说出“亚特兰大,乔治亚”时一开始可能无法识别,因此这些词和用户的发音被添加至适应数据)。Referring again to Figure 8G, operation 805 may include operation 807, which depicts adding one or more words to a pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of the first portion of the speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4G) illustrates module 807 for adding one or more words to a pronunciation dictionary based at least in part on the results of obtaining a particular portion of the speech-facilitated transaction, the module 807 adding one or more words (e.g., the words "Atlanta, Georgia") to a pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of the first portion of the speech-facilitated transaction (e.g., the results indicating that the target device (e.g., a personal navigation system) may not initially recognize the user's utterance of "Atlanta, Georgia," and therefore, the words and the user's pronunciation are added to the adaptation data).
再次参见图8G,操作805可包括操作809,该操作809描绘了将一个或多个词添加至作为适应数据的一部分的发音词典,所述修正至少部分地基于利于讲话的交易的第一部分的结果,所述第一部分包括由特定方发音的一个或多个词以及由特定方发音的一个或多个词的相应发音,它们被添加至发音词典。例如,图4(例如图4G)示出至少部分地基于包括一个或多个词的利于讲话的交易的特定部分的获取结果将一个或多个词添加至发音词典的模块809,该模块809将一个或多个词添加至作为适应数据的一部分的发音词典,所述修正至少部分地基于利于讲话的交易的第一部分的结果(例如第一部分正在对“从支票账户撤回两百美元”的交易“输入PIN”),所述第一部分包括由特定方发音的一个或多个词(例如“四”、“七”和“三”)以及由特定方发音的一个或多个词的相应发音(例如用户说出词“四”、“七”和“三”的方式),它们被添加至发音词典。Referring again to FIG8G , operation 805 may include operation 809 depicting adding one or more words to a pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of a first portion of the speech-facilitated transaction, the first portion comprising one or more words pronounced by a particular party and corresponding pronunciations of the one or more words pronounced by the particular party, which are added to the pronunciation dictionary. For example, FIG4 (e.g., FIG4G ) illustrates module 809 for adding one or more words to a pronunciation dictionary based at least in part on the results of obtaining a particular portion of the speech-facilitated transaction comprising one or more words, the module 809 adding one or more words to a pronunciation dictionary as part of the adaptation data, the modification being based at least in part on the results of a first portion of the speech-facilitated transaction (e.g., the first portion being “enter PIN” for a transaction “withdraw two hundred dollars from a checking account”), the first portion comprising one or more words pronounced by a particular party (e.g., “four,” “seven,” and “three”), and corresponding pronunciations of the one or more words pronounced by the particular party (e.g., how a user pronounces the words “four,” “seven,” and “three”), which are added to the pronunciation dictionary.
再次参见图8G,操作805可包括操作811,该操作811描述了在适应数据中用新的发音词典替换作为适应数据的一部分的之前发音词典以及之前采用以解释利于讲话的交易的第一部分。例如,图4(例如图4G)示出至少部分地基于利于讲话的交易的特定部分的获取结果的发音词典替换的模块811,该模块811在适应数据中用新的发音词典替换作为适应数据的一部分的之前发音词典以及之前采用以解释利于讲话的交易的第一部分(例如在自动通行窗口的订单的第一部分)。Referring again to FIG8G , operation 805 may include operation 811, which describes replacing, in the adaptation data, a previous pronunciation dictionary that was part of the adaptation data and that was previously used to interpret the first portion of the speech-facilitated transaction with a new pronunciation dictionary. For example, FIG4 (e.g., FIG4G ) illustrates module 811 for replacing, based at least in part on the acquisition results of a particular portion of the speech-facilitated transaction, a previous pronunciation dictionary that was part of the adaptation data and that was previously used to interpret the first portion of the speech-facilitated transaction (e.g., the first portion of an order at an auto-pass window) with a new pronunciation dictionary.
现在参见图8H,操作894可包括操作813,该操作813描绘了修正适应数据的路径选择算法的路径,所述修正至少部分地基于利于讲话的交易的第一部分的结果。例如,图4(例如图4H)示出至少部分地基于利于讲话的交易的特定部分的获取结果施加路径选择算法可能修正的路径的模块413,该模块413修正适应数据的路径选择算法的路径,所述修正至少部分地基于利于讲话的交易的第一部分的结果(例如指示路径选择算法成功施加至与机动交通工具控制系统交互的用户的第一部分的结果)。Referring now to Figure 8H, operation 894 may include operation 813, which depicts modifying a path of the routing algorithm adapted to the data, the modification being based at least in part on the results of the first portion of the speech-facilitating transaction. For example, Figure 4 (e.g., Figure 4H) illustrates module 413 for applying a path of a routing algorithm that may be modified based at least in part on the results of obtaining a particular portion of the speech-facilitating transaction, wherein module 413 modifies a path of the routing algorithm adapted to the data, the modification being based at least in part on the results of the first portion of the speech-facilitating transaction (e.g., results indicating that the routing algorithm was successfully applied to the first portion of the user interacting with the motor vehicle control system).
再次参见图8H,操作896可包括操作815,该操作815描绘了将更新的适应数据发送至目标设备,所述更新的适应数据被配置成由目标设备施加以解释利于讲话的交易的第二部分。例如,图4(例如图4H)示出被配置成用于解释利于讲话的交易的又一部分的修正的讲话适应数据的发送模块415,该发送模块415将更新的适应数据发送至目标设备(例如自动出纳机设备),所述更新的适应数据被配置成由目标设备(例如自动出纳机设备)施加以解释利于讲话的交易(例如从储蓄账户中撤回二百美元)的第二部分(例如指示撤回的钱的数量)。Referring again to Figure 8H, operation 896 may include operation 815, which depicts sending updated adaptation data to the target device, the updated adaptation data being configured to be applied by the target device to interpret the second portion of the speech-facilitated transaction. For example, Figure 4 (e.g., Figure 4H) illustrates sending module 415 of revised speech adaptation data configured to interpret another portion of the speech-facilitated transaction, sending module 415 to a target device (e.g., an automated teller machine device) the updated adaptation data being configured to be applied by the target device (e.g., an automated teller machine device) to interpret the second portion (e.g., indicating the amount of money withdrawn) of the speech-facilitated transaction (e.g., withdrawing two hundred dollars from a savings account).
再次参见图8H,操作896可包括操作817,其描绘了将更新的适应数据发送至目标设备,所述更新的适应数据被配置成由目标设备施加以解释利于讲话的交易的第二部分,其中在解释利于讲话的交易的第一部分和解释利于讲话的交易的第二部分之间没有讲话被解释。例如,图4(例如图4H)示出被配置成在解释特定部分之后立即用来解释利于讲话的交易的又一部分的修正的讲话适应数据的发送模块417,该发送模块417将更新的适应数据发送至目标设备(例如被配置成接收讲话输入的计算机),所述更新的适应数据被配置成由目标设备施加以解释利于讲话的交易的第二部分(例如口述记录的第二段),其中在解释利于讲话的交易的第一部分(例如口述记录的第一段)和解释利于讲话的交易的第二部分之间没有讲话被解释。Referring again to Figure 8H, operation 896 may include operation 817, which depicts sending updated adaptation data to the target device, the updated adaptation data being configured to be applied by the target device to interpret the second portion of the speech-favored transaction, wherein no speech was interpreted between the first portion of the speech-favored transaction and the second portion of the speech-favored transaction. For example, Figure 4 (e.g., Figure 4H) shows a sending module 417 of modified speech adaptation data configured to be used to interpret another portion of the speech-favored transaction immediately after interpreting the particular portion, the sending module 417 sending updated adaptation data to the target device (e.g., a computer configured to receive speech input), the updated adaptation data being configured to be applied by the target device to interpret the second portion of the speech-favored transaction (e.g., the second segment of the dictation), wherein no speech was interpreted between the first portion of the speech-favored transaction (e.g., the first segment of the dictation) and the second portion of the speech-favored transaction.
再次参见图8H,操作896可包括操作819,该操作819描绘了将更新的适应数据发送至目标设备。例如,图4(例如图4H)示出修正的讲话适应数据的发送模块419,该发送模块419将更新的适应数据(例如更新的讲话部分标记算法)发送至目标设备(例如讲话使能的媒体播放机)。8H , operation 896 may include operation 819, which depicts sending the updated adaptation data to the target device. For example, FIG4 (e.g., FIG4H ) illustrates a revised speech adaptation data sending module 419 that sends the updated adaptation data (e.g., an updated speech portion labeling algorithm) to the target device (e.g., a speech-enabled media player).
再次参见图8H,操作896可包括操作821,该操作821描绘了从目标设备接收更新的适应数据已被施加的指示。例如,图4(例如图4H)示出接收修正的讲话适应数据的指示的接收模块421,该接收模块421从目标设备(例如讲话使能的媒体播放机)接收更新的适应数据已被施加(例如更新的适应数据已替代适应数据,或者现存的适应数据已如由更新的适应数据指令的那样被修正)的指示(例如接收从目标设备发送的信号)。8H , operation 896 may include operation 821, which depicts receiving an indication from the target device that the updated adaptation data has been applied. For example, FIG. 4 (e.g., FIG. 4H ) illustrates receiving module 421 receiving an indication of revised speech adaptation data, the receiving module 421 receiving an indication (e.g., receiving a signal sent from the target device) from the target device (e.g., a speech-enabled media player) that the updated adaptation data has been applied (e.g., the updated adaptation data has replaced the adaptation data, or the existing adaptation data has been revised as instructed by the updated adaptation data).
现在参见图8I,操作896可包括操作823,该操作823描绘了将更新的适应数据发送至目标设备。例如,图4(例如图4I)示出修正的讲话适应数据的发送模块423,该发送模块423将更新的适应数据(例如一组专有名词发音(例如城市名),这些专有名词发音已基于新用户发音被修正)发送至目标设备(例如基于交通工具的导航系统)。Referring now to Figure 8I, operation 896 may include operation 823, which depicts sending the updated adaptation data to the target device. For example, Figure 4 (e.g., Figure 4I) illustrates a modified speech adaptation data sending module 423 that sends updated adaptation data (e.g., a set of proper noun pronunciations (e.g., city names) that have been modified based on the new user pronunciation) to a target device (e.g., a vehicle-based navigation system).
再次参见图8I,操作896可包括操作825,该操作825描绘了从目标设备接收更新的适应数据已被施加的指示。例如,图4(例如图4I)示出施加修正的讲话适应数据的指示的接收模块425,该接收模块425从目标设备(例如基于交通工具的导航系统)接收更新的适应数据(例如一组专有名词发音(例如城市名),这些专有名词发音已基于新用户发音被修正)已被施加的指示(接收指示成功施加的数据)。8I , operation 896 may include operation 825, which depicts receiving an indication from the target device that the updated adaptation data has been applied. For example, FIG4 (e.g., FIG4I ) illustrates receiving module 425 of an indication of applying modified speech adaptation data, which receives an indication from a target device (e.g., a vehicle-based navigation system) that updated adaptation data (e.g., a set of proper noun pronunciations (e.g., city names) that have been modified based on a new user pronunciation) has been applied (receiving data indicating successful application).
再次参见图8I,操作896可包括操作827,该操作827描绘了促进呈现一指示,该指示表示利于讲话的交易的第二部分已准备发生。例如,图4(例如图4I)示出指示完成利于讲话的交易的又一部分就绪的呈现模块427,该呈现模块427促进一指示的呈现(例如视觉指示(例如设备上的灯被激活),或者发出可闻的嘟嘟声),该指示表示利于讲话的交易的第二部分准备发生(例如当基于交通工具的导航系统接收到更新的适应数据时用户的蜂窝设备发出嘟嘟声)。8I , operation 896 may include operation 827, which depicts facilitating presentation of an indication that the second portion of the speech-facilitated transaction is ready to occur. For example, FIG4 (e.g., FIG4I ) illustrates presentation module 427 indicating that another portion of the speech-facilitated transaction is ready to be completed, which facilitates presentation of an indication (e.g., a visual indication (e.g., a light on the device is activated) or an audible beep) indicating that the second portion of the speech-facilitated transaction is ready to occur (e.g., a user's cellular device beeps when the vehicle-based navigation system receives updated adaptation data).
再次参见图8I,操作827可包括操作829,该操作829描绘了在特定设备的屏幕上显示一消息,该消息指示利于讲话的交易的第二部分准备发生。例如,图4(例如图4I)示出指示利于讲话的交易的又一部分准备就绪的消息的显示模块429,该显示模块429在特定设备(例如用户的通用遥控器)的屏幕上显示一消息(例如“现在继续讲话”),该消息指示利于讲话的交易的第二部分(例如命令家庭影院系统播放电影“盗梦空间”)准备发生。8I , operation 827 may include operation 829, which depicts displaying a message on the screen of the particular device indicating that the second part of the speech-facilitated transaction is ready to occur. For example, FIG4 (e.g., FIG4I ) illustrates display module 429 of a message indicating that another part of the speech-facilitated transaction is ready to occur, which displays a message (e.g., “Now continue speaking”) on the screen of the particular device (e.g., a user's universal remote control) indicating that the second part of the speech-facilitated transaction (e.g., commanding a home theater system to play the movie “Inception”) is ready to occur.
现在参见图8I,操作827可包括操作831,该操作831描绘了促进在目标设备的屏幕上显示一消息,该消息指示利于讲话的交易的第二部分准备发生。例如,图4(例如图4I)示出在目标设备上显示指示利于讲话的交易的又一部分准备就绪的消息的促进模块431,该促进模块431促进在目标设备的屏幕上显示消息(例如将信号发送至自动通行窗口,该信号指令自动通行窗口显示词“请继续你的订单”),该消息指示利于讲话的交易的第二部分(例如双份培根芝士汉堡包和薯条的第二订单)准备发生。Referring now to Figure 8I, operation 827 may include operation 831, which depicts facilitating display of a message on the screen of the target device indicating that the second portion of the speech-facilitated transaction is ready to occur. For example, Figure 4 (e.g., Figure 4I) illustrates facilitating module 431 displaying a message on the target device indicating that another portion of the speech-facilitated transaction is ready to occur. The facilitating module 431 facilitates displaying a message on the screen of the target device (e.g., sending a signal to the auto-pass window instructing the auto-pass window to display the words "Please Continue with Your Order") indicating that the second portion of the speech-facilitated transaction (e.g., a second order for a double bacon cheeseburger and fries) is ready to occur.
本说明书中引用的和/或任何申请数据表中列出的所有前述美国专利、美国专利申请公布、美国专利申请、外国专利、外国专利申请和非专利公布援引包含于此,只要不与本申请不一致即可。All of the foregoing U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications, and non-patent publications cited in this specification and/or listed in any Application Data Sheet are incorporated herein by reference to the extent not inconsistent with this application.
尽管已示出和描述了本文描述的当前主题的特定方面,然而显然对于本领域内技术人员而言,基于本文的教义,可不脱离本文描述的主题及其更宽方面地作出多种改变和修正,并因此,所附权利要求书将所有这些改变和修正涵盖在其范围内,只要它们落在本文描述的主题的真实精神和范围内。本领域内技术人员将理解,一般来说,本文(尤其是所附权利要求书中(例如所附权利要求书的正文中))使用的术语一般意图解释成“开放性”术语(例如术语“包括”应当被解释成“包括但不限于”,术语“具有”应当被解释成“具有至少……”,术语“包含”应当被解释成“包含但不限于”等等)。While particular aspects of the present subject matter described herein have been shown and described, it will be apparent to those skilled in the art that, based on the teachings herein, various changes and modifications may be made without departing from the subject matter described herein and its broader aspects, and, therefore, the appended claims are intended to cover within their scope all such changes and modifications as fall within the true spirit and scope of the subject matter described herein. Those skilled in the art will understand that, in general, the terms used herein (and particularly in the appended claims (e.g., in the body of the appended claims)) are generally intended to be interpreted as "open-ended" terms (e.g., the term "including" should be interpreted as "including but not limited to," the term "having" should be interpreted as "having at least...", the term "comprising" should be interpreted as "including but not limited to," etc.).
本领域内技术人员将进一步理解,如果引入的权利要求意图引用特定数,这种意图将明确地在权利要求中阐明,并且在没有这种阐明的情况下就没有这种意图。例如,为了帮助立即,下面所附的权利要求书可包含使用前导性词语“至少一个”以及“一个或多个”以介绍权利要求引用。然而,这些词语的使用不应当被解释成暗示权利要求通过不定冠词“一”或“一个”引述的前导将包含这些介绍的权利要求引述的任何特定权利要求限定至仅包含一个这样的引述的权利要求,即使当相同的权利要求包括前导性词语“一个或多个”或“至少一个”以及诸如“一”或“一个”的不定冠词(例如“一”和/或“一个”一般应当被解释成表示“至少一个”或者“一个或多个”);对于使用定冠词以介绍权利要求引述时也是如此。另外,即使明确地引述了介绍性权利要求引述的一个特定数,本领域内技术人员将发现,这些引述一般应当被解释成表示至少引述的数(例如单纯引述“两个引述”而没有其它修饰语,一般表示至少两个引述,或者两个或更多个引述)。Those skilled in the art will further understand that if an introduced claim intends to refer to a particular number, such intent will be expressly set forth in the claim, and in the absence of such clarification, there is no such intent. For example, to aid in understanding, the claims appended below may contain the use of the introductory words "at least one" and "one or more" to introduce claim recitations. However, the use of these words should not be construed to imply that any particular claim recited by the indefinite article "a" or "an" will be limited to claims containing only one such recitation, even when the same claim includes the introductory words "one or more" or "at least one" and an indefinite article such as "a" or "an" (e.g., "a" and/or "an" should generally be construed to mean "at least one" or "one or more"); this is also true when definite articles are used to introduce claim recitations. In addition, even if a particular number of introductory claim references is explicitly recited, those skilled in the art will recognize that these references should generally be interpreted to mean at least the number of references (e.g., a simple reference to "two references" without other modifiers generally means at least two references, or two or more references).
此外,在使用“A、B和C中的至少一个等”的惯例类似物的那些情况下,一般这种解释旨在表达本领域内技术人员将理解该惯例(例如“具有A、B和C中的至少一个的系统”将包括但不限于单独具有A、单独具有B、单独具有C、A和B一起、A和C一起、B和C一起和/或A、B、C一起的系统等等)。在使用“A、B或C中的至少一个”的惯例类似物的那些情况下,一般这种解释旨在表达本领域内技术人员将理解该惯例(例如“具有A、B或C中的至少一个的系统”将包括但不限于单独具有A、单独具有B、单独具有C、A和B一起、A和C一起、B和C一起和/或A、B、C一起的系统等等)。本领域内技术人员还将理解,一般表示两个或更多个可选项的分隔词和/或词语(不管是在说明书中、权利要求书中还是附图中)应当被理解为考虑包括其中一个项、包括任意一个项或包括两个项的可能性,除非上下文指出了相反情形。例如,词语“A或B”一般将被理解为包括“A”或“B”或者“A和B”的可能性。In addition, in those cases where the conventional analogy of "at least one of A, B, and C, etc." is used, generally such interpretation is intended to convey that one skilled in the art would understand the convention (e.g., "a system having at least one of A, B, and C" would include but is not limited to systems having A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, C together, etc.). In those cases where the conventional analogy of "at least one of A, B, or C" is used, generally such interpretation is intended to convey that one skilled in the art would understand the convention (e.g., "a system having at least one of A, B, or C" would include but is not limited to systems having A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, C together, etc.). One skilled in the art would also understand that separators and/or words generally indicating two or more optional items (whether in the specification, claims, or drawings) should be understood to contemplate the possibility of including one, either, or both items, unless the context indicates otherwise. For example, the phrase "A or B" will generally be understood to include the possibilities of "A" or "B" or "A and B."
对于所附权利要求书,本领域内技术人员将理解,本文引述的操作一般可以任何顺序执行。另外,尽管按顺序给出的各种操作流程,然而应当理解,各操作可以所示出的顺序不同的顺序执行,或者可同时执行。这些替代顺序的例子可包括重叠的、交织的、中断的、重排序的、递增的、预备的、补充的、同时的、颠倒的或其它变例顺序,除非上下文指出了其它情形。此外,类似“响应于”、“关联于”的术语或其它过去式形容词一般不旨在排除这些变例,除非上下文指出了其它情形。With respect to the appended claims, it will be understood by those skilled in the art that the operations recited herein may generally be performed in any order. In addition, although the various operational flows are presented in order, it will be understood that the operations may be performed in an order different from that shown, or may be performed simultaneously. Examples of these alternative sequences may include overlapping, interleaved, interrupted, reordered, incremental, preliminary, supplementary, simultaneous, reversed, or other variant sequences, unless the context indicates otherwise. Furthermore, terms such as "responsive to," "associated with," or other past tense adjectives are generally not intended to exclude such variations, unless the context indicates otherwise.
本申请可参照一个或多个商标,例如由一个制造商或商家采用并用来相对于其它产品标识和/或区分他或她的产品的字、字母、字符或设备。本文中使用的商标名以这样的语言来阐述:即,使它们的身份清楚、将它们与常见的描述性名词区别开、具有固定和明确的含义或者在许多(如果不是全部)情形下伴随有使用不由商标涵盖的术语的其它专门标识。另外,本文描述的商标名具有文献中公知和定义的含义,但不涉及对其需要知晓一个或多个商业秘密以赋予其含义的产品或化合物。本申请中引用的所有商标是其相应所有者的财产,并且一个或多个商标在本申请中的出现不减弱或以其它方式不利地影响一个或多个商标的有效性。假设出现在本申请中的所有商标(不管是注册的还是未注册的)包括正确的商标标记,例如或带括号的大写(例如[商标名]),即使这些商标标记未明确地紧挨着商标出现。只要以描述性方式使用一商标以指向一产品或过程,该商标就应当被解释成表示如本申请提交日的相应的产品或过程。This application may refer to one or more trademarks, such as words, letters, characters, or devices adopted by a manufacturer or merchant to identify and/or distinguish his or her products from other products. Trademark names used herein are described in language that makes their identity clear, distinguishes them from common descriptive nouns, has a fixed and unambiguous meaning, or, in many (if not all) cases, is accompanied by other specialized designations using terms not covered by trademarks. In addition, trademark names described herein have meanings that are well known and defined in the literature, but do not refer to products or compounds to which knowledge of one or more trade secrets is required to give their meaning. All trademarks cited in this application are the property of their respective owners, and the appearance of one or more trademarks in this application does not diminish or otherwise adversely affect the validity of one or more trademarks. It is assumed that all trademarks appearing in this application (whether registered or unregistered) include the correct trademark marking, such as or bracketed capitalization (e.g., [trademark name]), even if such trademark marking does not explicitly appear immediately adjacent to the trademark. Whenever a trademark is used in a descriptive manner to refer to a product or process, the trademark should be interpreted as referring to the corresponding product or process as of the filing date of this application.
本领域内技术人员将理解,前面的特定示例性过程和/或设备和/或技术是本文其它地方教示的更一般过程和/或设备和/或技术的表征,例如与本申请一起提交的权利要求书和/或本申请其它部分教示的那样。Those skilled in the art will understand that the foregoing specific exemplary processes and/or apparatus and/or techniques are representative of more general processes and/or apparatus and/or techniques taught elsewhere herein, such as in the claims filed with this application and/or in other parts of this application.
Claims (47)
Applications Claiming Priority (25)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/538,866 US20130325447A1 (en) | 2012-05-31 | 2012-06-29 | Speech recognition adaptation systems based on adaptation data |
| US13/538,855 | 2012-06-29 | ||
| US13/538,855 US9495966B2 (en) | 2012-05-31 | 2012-06-29 | Speech recognition adaptation systems based on adaptation data |
| US13/538,866 | 2012-06-29 | ||
| US13/564,647 US9620128B2 (en) | 2012-05-31 | 2012-08-01 | Speech recognition adaptation systems based on adaptation data |
| US13/564,647 | 2012-08-01 | ||
| US13/564,649 US8843371B2 (en) | 2012-05-31 | 2012-08-01 | Speech recognition adaptation systems based on adaptation data |
| US13/564,650 | 2012-08-01 | ||
| US13/564,649 | 2012-08-01 | ||
| US13/564,651 | 2012-08-01 | ||
| US13/564,651 US9899026B2 (en) | 2012-05-31 | 2012-08-01 | Speech recognition adaptation systems based on adaptation data |
| US13/564,650 US20130325449A1 (en) | 2012-05-31 | 2012-08-01 | Speech recognition adaptation systems based on adaptation data |
| US13/609,145 | 2012-09-10 | ||
| US13/609,143 | 2012-09-10 | ||
| US13/609,145 US20130325453A1 (en) | 2012-05-31 | 2012-09-10 | Methods and systems for speech adaptation data |
| US13/609,139 | 2012-09-10 | ||
| US13/609,142 | 2012-09-10 | ||
| US13/609,139 US10431235B2 (en) | 2012-05-31 | 2012-09-10 | Methods and systems for speech adaptation data |
| US13/609,142 US20130325451A1 (en) | 2012-05-31 | 2012-09-10 | Methods and systems for speech adaptation data |
| US13/609,143 US9305565B2 (en) | 2012-05-31 | 2012-09-10 | Methods and systems for speech adaptation data |
| US13/662,228 | 2012-10-26 | ||
| US13/662,125 | 2012-10-26 | ||
| US13/662,125 US9899040B2 (en) | 2012-05-31 | 2012-10-26 | Methods and systems for managing adaptation data |
| US13/662,228 US10395672B2 (en) | 2012-05-31 | 2012-10-26 | Methods and systems for managing adaptation data |
| PCT/US2013/048640 WO2014005055A2 (en) | 2012-06-29 | 2013-06-28 | Methods and systems for managing adaptation data |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1207199A1 HK1207199A1 (en) | 2016-01-22 |
| HK1207199B true HK1207199B (en) | 2020-02-21 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9305565B2 (en) | Methods and systems for speech adaptation data | |
| US20130325447A1 (en) | Speech recognition adaptation systems based on adaptation data | |
| US9899026B2 (en) | Speech recognition adaptation systems based on adaptation data | |
| US9899040B2 (en) | Methods and systems for managing adaptation data | |
| US10431235B2 (en) | Methods and systems for speech adaptation data | |
| JP7418526B2 (en) | Dynamic and/or context-specific hotwords to trigger automated assistants | |
| US20130325446A1 (en) | Speech recognition adaptation systems based on adaptation data | |
| US9620128B2 (en) | Speech recognition adaptation systems based on adaptation data | |
| US11823659B2 (en) | Speech recognition through disambiguation feedback | |
| US20130325451A1 (en) | Methods and systems for speech adaptation data | |
| US10755713B2 (en) | Generic virtual personal assistant platform | |
| KR102731835B1 (en) | Electronic device and method for providing artificial intelligence services based on pre-gathered conversations | |
| CN108829235B (en) | Voice data processing method and electronic device supporting the same | |
| CN107430855B (en) | Context-sensitive dynamic updates of speech-to-text models in speech-enabled electronic devices | |
| CN108701454B (en) | Parameter Collection and Automatic Dialogue Generation in Dialogue Systems | |
| US10943591B2 (en) | Voice to text conversion based on third-party agent content | |
| US20180285595A1 (en) | Virtual agent for the retrieval and analysis of information | |
| US20130325459A1 (en) | Speech recognition adaptation systems based on adaptation data | |
| AU2020447125B2 (en) | Hot-word free pre-emption of automated assistant response presentation | |
| CN104412322A (en) | Methods and systems for managing adaptation data | |
| HK1207199B (en) | Methods and systems for managing adaptation data |