[go: up one dir, main page]

EP0012767B1 - Speech analyser - Google Patents

Speech analyser Download PDF

Info

Publication number
EP0012767B1
EP0012767B1 EP19790900422 EP79900422A EP0012767B1 EP 0012767 B1 EP0012767 B1 EP 0012767B1 EP 19790900422 EP19790900422 EP 19790900422 EP 79900422 A EP79900422 A EP 79900422A EP 0012767 B1 EP0012767 B1 EP 0012767B1
Authority
EP
European Patent Office
Prior art keywords
speech
output
demodulated signal
frequency
analyser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
EP19790900422
Other languages
German (de)
French (fr)
Other versions
EP0012767A1 (en
Inventor
John Decatur Williamson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
OMNITRONICS RESEARCH CORPORATION
Original Assignee
OMNITRONICS RESEARCH Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US05/895,375 external-priority patent/US4142067A/en
Application filed by OMNITRONICS RESEARCH Corp filed Critical OMNITRONICS RESEARCH Corp
Publication of EP0012767A1 publication Critical patent/EP0012767A1/en
Application granted granted Critical
Publication of EP0012767B1 publication Critical patent/EP0012767B1/en
Expired legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals

Definitions

  • This invention is related to an apparatus for analysing an individual's speech and more particularly, to an apparatus for analysing pitch perturbations to determine the individual emotional state such as stress, depression, anxiety, fear, happiness, etc., which can be indicative of subjective attitudes, character, mental state, physical state, gross behavioral patterns, veracity, etc.
  • the apparatus has commercial applications as a criminal investigative tool, a medical and/or psychiatric diagnostic aid, a public opinion polling aid, etc.
  • each voice sample must be calibrated or normalized against each individual for reliable results. Analysis is also limited to the occurrence of stress, and other characteristics of an individual's emotional state cannot be detected.
  • a technique for voice analysis is disclosed in Coulter U.S. Patent No. 3,268,661 which determines the frequency loci of voice consonants by extrapolating the slope of the consonant sound to its initial extrapolation.
  • the apparatus disclosed in the Coulter patent comprises a microphone whose output is fed in parallel to a formant analyser and a - voice/unvoice silence detector. The detector serves simply to detect whether a voice signal i.e. one having a particular pitch frequency is present from the microphone.
  • a voice signal i.e. one having a particular pitch frequency is present from the microphone.
  • the Coulter patent makes use of the record formant.
  • the present invention is directed to an apparatus for analysing a person's speech to determine their emotional state.
  • the analyser operates on the real time frequency or pitch components within the first formant band of human speech.
  • the apparatus analyses certain value occurrence patterns in terms of differential first formant pitch, rate of change of pitch, duration and time distribution patterns. These factors relate in a complex but very fundamental way to both transient and long term emotional states.
  • Human speech is initiated by two basic sound generating mechanisms.
  • the vocal cords thin stretched membranes under muscle control, oscillate when expelled air from the lugs passes through them. They produce a characteristic "buzz" sound at a fundamental frequency between 80 Hz and 240 Hz. This frequency is varied over a moderate range by both conscious and unconscious muscle contraction and relaxation.
  • the wave form of the fundamental "buzz” contains many harmonics, some of which excite resonance in various fixed and variable cavities associated with the vocal tract.
  • the second basic sound generated during speech is a pseudo-random noise having a fairly broad and uniform frequency distribution. It is caused by turbulence as expelled air moves through the vocal tract and is called a "hiss" sound. It is modulated, for the most part, by tongue movements and also excites the fixed and variable cavities. It is this complex mixture of "buzz” and "hiss” sounds, shaped and articulated by the resonant cavities, which produces speech.
  • formants In an energy distribution analysis of speech sounds, it will be found that the energy falls into distinct frequency bands called formants. There are three significant formants.
  • the system described here utilizes the first formant band which extends from the fundamental "buzz" frequency to approximately 1000 Hz. This band has not only the highest energy content but reflects a high degree of frequency modulation as a function of various vocal tract and facial muscle tension variations.
  • the present invention analyses an FM demodulated first formant speech signal and produces an output indicative of nulls thereof.
  • the frequency or number of nulls or "flat" spots in the FM demodulated signal, the length of the nulls and the ratio of the total time that nulls exist during a word period to the overall time of the word period are all indicative of the emotional state of the individual.
  • the user By looking at the output of the device, the user can seen or feel the occurrence of the nulls and thus can determine by observing the output the number or frequency of nulls, the length of the nulls and the ratio of the total time nulls exist during a word period to the length of the word period, the emotional state of the individual.
  • a speech analyser for determining the emotional state of a person, the speech analyser having FM demodulator means for detecting the first formant of a person's speech and producing an FM demodulated signal therefrom, the speech analyser characterized by
  • the user of the device thus monitors the nulls and can thereby determine the emotional state of the individual whose speech is being analysed.
  • An advantage of the speech analyser is that it can be made small and portable.
  • Figs. 1 and 2A-2K speech, for the purposes of convenience, is introduced into the speech analyser by means of a built-in microphone 2.
  • the low level signal from the microphone 2 shown in Fig. 2A is amplified by the preamplifier 4 which also removes the low frequency components of the signal by means of a high pass filter section.
  • the amplified speech signal is then passed through the low pass filter 6 which removes the high frequency components above the first formant band.
  • the resultant signal, illustrated in Fig. 2B represents the frequency components to be found in the first formant band of speech, the first formant band being 250 Hz-800 Hz.
  • the signal from low pass filter 6 is then passed through the zero axis limiter circuit 8 which removes all amplitude variations and produces a uniform square wave output illustrated in Fig. 2C which contains only the period or instantaneous frequency component of the first formant speech signal.
  • This signal is then applied to the pulse generator circuit 10 which produces an output pulse of constant amplitude and width, hence constant energy, upon each positive going transition of the input signal.
  • the output of pulse generator circuit 10 is illustrated in Fig. 2D.
  • the pulse signal in Fig. 2D is integrated by the low pass filter circuit 12 whose output is shown in Figs. 2E1 and 2E2.
  • the D.C. level or amplitude of the output of the filter as shown in Fig. 2E thus represents the instantaneous frequency of the first formant speech signal.
  • the output of the low pass filter 12 will thus vary as a function of the frequency modulation of the first formant speech signal by various vocal cord and other vocal tract muscle systems.
  • the overall combination of the zero axis limiter 8, the pulse generator 10, and the low pass filter 12 comprise a conventional FM demodulator designed to operate over the first formant speech frequency band.
  • the FM democulated output signal from the low pass filter 12 is applied to word detector circuit 14 which is a voltage comparator with a reference voltage set to a level representative of a first formant frequency of 250 Hz.
  • word detector circuit 14 which is a voltage comparator with a reference voltage set to a level representative of a first formant frequency of 250 Hz.
  • the FM demodulated output signal from the pass filter 12 is also applied to differentiator circuit 16 which produces an output signal proportional to the instantaneous rate of change of frequency of the first formant speech signal.
  • the output of differentiator 16, which is shown in Fig. 2G, corresponds to the degree of frequency modulation of the first formant spooch signal.
  • the signal from differentiator 16 is applied to a full wave rectifier circuit 18. This circuit passes the positive portion of the signal unchanged. The negative portion is inverted and added to the positive portion.
  • the composite signal is then applied to pulse stretching circuit 19 which comprises a parallel circuit of a resistor and capacitor in series with a diode.
  • the pulse stretching circuit 19 provides a fast rise, slow delay function which eliminates false null information as the differentiated signal passes through zero.
  • the output of pulse stretching circuit 19 is illustrated in Fig. 2H.
  • the output signal of the pulse stretching circuit 19 is applied to an output circuit 17 including a comparator circuit 20 and a display 21.
  • Comparator circuit 20 comprises a three level voltage comparator gated ON or OFF by the ouput of word detector circuit 14.
  • the comparator circuit 20 evaluates, in terms of amplitude level, the output of the pulse stretching circuit 19.
  • Reference levels of the comparator circuit 20 are set so that when normal levels of frequency modulation are present in the first formant speech signal an output as shown in Fig. 21 is produced and a display 21 having an appropriate visual indicator, such as a green LED 22 is turned ON.
  • an output such as shown in Fig. 2J is produced and the comparator circuit 20 turns on the yellow LED 24.
  • an output such as shown in Fig. 2K is produced and the comparator circuit turns on the red LED 26.
  • comparator circuit 20 can have an output coupled to a tactile device 28 for producing a tactile output so that the user can place the device close to his body and sense the occurrence of nulls through a physical stimulation to his body rather than through a visual display.
  • the user can maintain eye contact with the individual whose speech is being analysed which could in turn reduce the anxiety of the individual whose speed is being analysed, which is caused by the user constantly looking to the speech analyser.
  • the word detector 14 and the pulse stretching circuit 19 are connected to a voltage meter circuit 30 which is substituted for the comparator circuit 20.
  • the meter circuit 30 is turned on when word detector 14 in ON and meter 32 provides an indication of the voltage output of pulse stretching circuit 19.
  • the pitch or frequency null perturbations contained within the first formant speech signal define, by their pattern of occurrence, certain emotional states of the individual whose speech is being analysed, a visual integration and interpretation of the displayed output provides adequate information to the user of the imm for môkino certain decisions WiIn regard to the emotional state, in real time, of the person speaking.
  • the speech analyser of the present invention can be constructed using integrated circuits and therefore can be constructed in a very small size which allows it to be portable and capable of being carried in one's pocket, for example.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)

Abstract

A speech analyser is provided for determining the emotional state of a person by analysing pitch or frequency perturbations in the speech pattern. The analyser determines null points or "flat" spots in a FM demodulated speech signal and it produces an output indicative of the nulls. The output can be analysed by the operator of the device to determine the emotional state of the person whose speech pattern is being monitored.

Description

  • This invention is related to an apparatus for analysing an individual's speech and more particularly, to an apparatus for analysing pitch perturbations to determine the individual emotional state such as stress, depression, anxiety, fear, happiness, etc., which can be indicative of subjective attitudes, character, mental state, physical state, gross behavioral patterns, veracity, etc. In this regard, the apparatus has commercial applications as a criminal investigative tool, a medical and/or psychiatric diagnostic aid, a public opinion polling aid, etc.
  • One type of technique for speech analysis to determine emotional stress is disclosed in Bell, Jr., et al., U.S. Patent 3,971,034. In the technique disclosed in this patent a speech signal is processed to produce an FM demodulated speech signal. This FM demodulated signal is recorded on a chart recorder and then is manually analysed by an operator. This technique has several disadvantages. First, the output is not a real time analysis of the speech signal. Another disadvantage is that the operator must be very highly trained in order to perform a manual analysis of the FM demodulated speech signal and the analysis is a very time consuming endeavor. Still another disadvantage of the technique disclosed in Bell, Jr., et al. is that it operates on the fundamental frequencies of the vocal cords and, in the Bell, Jr., et al. technique tedious re-recording and special time expansion of the voice signal are required. In practise, all these factors result in an unnecessarily low sensitivity to the parameter of interest, specifically stress.
  • Another technique for voice analysing to determine emotional states is disclosed in Fuller, U.S. Patents 3,855,416, 3,855,417 and 3,855,418. The technique disclosed in the Fuller patents analyses amplitude characteristics of a speech signal and operates on distortion products of the fundamental frequency commonly called vibrato and on proportional relationships between various harmonic overtone or higher order formant frequencies.
  • Although this technique appears to operate in real time, in practise, each voice sample must be calibrated or normalized against each individual for reliable results. Analysis is also limited to the occurrence of stress, and other characteristics of an individual's emotional state cannot be detected.
  • A technique for voice analysis is disclosed in Coulter U.S. Patent No. 3,268,661 which determines the frequency loci of voice consonants by extrapolating the slope of the consonant sound to its initial extrapolation. The apparatus disclosed in the Coulter patent comprises a microphone whose output is fed in parallel to a formant analyser and a - voice/unvoice silence detector. The detector serves simply to detect whether a voice signal i.e. one having a particular pitch frequency is present from the microphone. There is no disclosure in the Coulter patent that indicates that it would be used for determining the emotional state of a person. Further, the Coulter patent makes use of the record formant.
  • The present invention is directed to an apparatus for analysing a person's speech to determine their emotional state. The analyser operates on the real time frequency or pitch components within the first formant band of human speech. In analysing the speech, the apparatus analyses certain value occurrence patterns in terms of differential first formant pitch, rate of change of pitch, duration and time distribution patterns. These factors relate in a complex but very fundamental way to both transient and long term emotional states.
  • Human speech is initiated by two basic sound generating mechanisms. The vocal cords; thin stretched membranes under muscle control, oscillate when expelled air from the lugs passes through them. They produce a characteristic "buzz" sound at a fundamental frequency between 80 Hz and 240 Hz. This frequency is varied over a moderate range by both conscious and unconscious muscle contraction and relaxation. The wave form of the fundamental "buzz" contains many harmonics, some of which excite resonance in various fixed and variable cavities associated with the vocal tract. The second basic sound generated during speech is a pseudo-random noise having a fairly broad and uniform frequency distribution. It is caused by turbulence as expelled air moves through the vocal tract and is called a "hiss" sound. It is modulated, for the most part, by tongue movements and also excites the fixed and variable cavities. It is this complex mixture of "buzz" and "hiss" sounds, shaped and articulated by the resonant cavities, which produces speech.
  • In an energy distribution analysis of speech sounds, it will be found that the energy falls into distinct frequency bands called formants. There are three significant formants. The system described here utilizes the first formant band which extends from the fundamental "buzz" frequency to approximately 1000 Hz. This band has not only the highest energy content but reflects a high degree of frequency modulation as a function of various vocal tract and facial muscle tension variations.
  • In effect, by analysing certain first formant frequency distribution patterns, a qualitative measure of speech-related muscle tension variations and interactions is performed. Since these muscles are predominantly biased and articulated through secondary unconscious processes which are in turn influenced by emotional state, a relative measure of emotional activity can be determined independent of a person's awareness or lack of awareness of that state. Research also bears out a general supposition that since the mechanisms of speech are exceedingly complex and largely autonomous, very few people are able to consciously "project" a fictitious emotional state. In fact, an attempt to do so usually generates its own unique psychological stress "fingerprint" in the voice pattern.
  • Because of the characteristics of the first formant speech sounds, the present invention analyses an FM demodulated first formant speech signal and produces an output indicative of nulls thereof.
  • The frequency or number of nulls or "flat" spots in the FM demodulated signal, the length of the nulls and the ratio of the total time that nulls exist during a word period to the overall time of the word period are all indicative of the emotional state of the individual. By looking at the output of the device, the user can seen or feel the occurrence of the nulls and thus can determine by observing the output the number or frequency of nulls, the length of the nulls and the ratio of the total time nulls exist during a word period to the length of the word period, the emotional state of the individual.
  • In the present invention, there is provided a speech analyser for determining the emotional state of a person, the speech analyser having FM demodulator means for detecting the first formant of a person's speech and producing an FM demodulated signal therefrom, the speech analyser characterized by
    • (a) word detector means for detecting the presence of an FM demodulated signal;
    • (b) null detector means for detecting the absence of a change in frequency of the speech utilized to produce the FM demodulated signal and for producing an output indicative thereof; and,
    • (c) output means coupled to said word detector means and said null detector means, wherein said output means is enabled by said word detector means when said word detector means detects the presence of an FM demodulated signal and wherein said output means produces an output indicative of the presence or absence of a change in frequency of the person's speech utilized to produce the FM demodulated signal.
  • The user of the device thus monitors the nulls and can thereby determine the emotional state of the individual whose speech is being analysed.
  • An advantage of the speech analyser is that it can be made small and portable.
  • In order that the present invention be more readily understood, an embodiment thereof will now be described by way of example with reference to the accompanying drawings, in which:-
    • Figure 1 is a block diagram of the system of the present invention;
    • Figures 2A-2K illustrate the electrical signals produced by the system shown in Figure 1;
    • Figure 3 illustrates an alternative embodiment of the output of the present invention; and
    • Figure 4 illustrates still another alternative embodiment of the output of the present invention.
    Detailed Description of the Preferred Embodiment
  • Referring to Figs. 1 and 2A-2K, speech, for the purposes of convenience, is introduced into the speech analyser by means of a built-in microphone 2. The low level signal from the microphone 2 shown in Fig. 2A is amplified by the preamplifier 4 which also removes the low frequency components of the signal by means of a high pass filter section. The amplified speech signal is then passed through the low pass filter 6 which removes the high frequency components above the first formant band. The resultant signal, illustrated in Fig. 2B represents the frequency components to be found in the first formant band of speech, the first formant band being 250 Hz-800 Hz. The signal from low pass filter 6 is then passed through the zero axis limiter circuit 8 which removes all amplitude variations and produces a uniform square wave output illustrated in Fig. 2C which contains only the period or instantaneous frequency component of the first formant speech signal. This signal is then applied to the pulse generator circuit 10 which produces an output pulse of constant amplitude and width, hence constant energy, upon each positive going transition of the input signal. The output of pulse generator circuit 10 is illustrated in Fig. 2D. The pulse signal in Fig. 2D is integrated by the low pass filter circuit 12 whose output is shown in Figs. 2E1 and 2E2. The D.C. level or amplitude of the output of the filter as shown in Fig. 2E thus represents the instantaneous frequency of the first formant speech signal. The output of the low pass filter 12 will thus vary as a function of the frequency modulation of the first formant speech signal by various vocal cord and other vocal tract muscle systems. The overall combination of the zero axis limiter 8, the pulse generator 10, and the low pass filter 12 comprise a conventional FM demodulator designed to operate over the first formant speech frequency band.
  • The FM democulated output signal from the low pass filter 12 is applied to word detector circuit 14 which is a voltage comparator with a reference voltage set to a level representative of a first formant frequency of 250 Hz. When this reference level is exceeded by the FM demodulated signal, the comparator output switches from OFF to ON as illustrated in Fig. 2F.
  • The FM demodulated output signal from the pass filter 12 is also applied to differentiator circuit 16 which produces an output signal proportional to the instantaneous rate of change of frequency of the first formant speech signal. The output of differentiator 16, which is shown in Fig. 2G, corresponds to the degree of frequency modulation of the first formant spooch signal.
  • The signal from differentiator 16 is applied to a full wave rectifier circuit 18. This circuit passes the positive portion of the signal unchanged. The negative portion is inverted and added to the positive portion. The composite signal is then applied to pulse stretching circuit 19 which comprises a parallel circuit of a resistor and capacitor in series with a diode. The pulse stretching circuit 19 provides a fast rise, slow delay function which eliminates false null information as the differentiated signal passes through zero. The output of pulse stretching circuit 19 is illustrated in Fig. 2H.
  • The output signal of the pulse stretching circuit 19 is applied to an output circuit 17 including a comparator circuit 20 and a display 21. Comparator circuit 20 comprises a three level voltage comparator gated ON or OFF by the ouput of word detector circuit 14. Thus, when speech is present, the comparator circuit 20 evaluates, in terms of amplitude level, the output of the pulse stretching circuit 19. Reference levels of the comparator circuit 20 are set so that when normal levels of frequency modulation are present in the first formant speech signal an output as shown in Fig. 21 is produced and a display 21 having an appropriate visual indicator, such as a green LED 22 is turned ON. When there is only a small amount of frequency modulation present, such as under mild stress conditions, an output such as shown in Fig. 2J is produced and the comparator circuit 20 turns on the yellow LED 24. When there is a full null, such as produced by more intense stress conditions, an output such as shown in Fig. 2K is produced and the comparator circuit turns on the red LED 26.
  • Referring to Fig. 3, comparator circuit 20 can have an output coupled to a tactile device 28 for producing a tactile output so that the user can place the device close to his body and sense the occurrence of nulls through a physical stimulation to his body rather than through a visual display. In this embodiment the user can maintain eye contact with the individual whose speech is being analysed which could in turn reduce the anxiety of the individual whose speed is being analysed, which is caused by the user constantly looking to the speech analyser.
  • In the embodiment shown in Fig. 4 the word detector 14 and the pulse stretching circuit 19 are connected to a voltage meter circuit 30 which is substituted for the comparator circuit 20. The meter circuit 30 is turned on when word detector 14 in ON and meter 32 provides an indication of the voltage output of pulse stretching circuit 19.
  • Since the pitch or frequency null perturbations contained within the first formant speech signal define, by their pattern of occurrence, certain emotional states of the individual whose speech is being analysed, a visual integration and interpretation of the displayed output provides adequate information to the user of the imm for môkino certain decisions WiIn regard to the emotional state, in real time, of the person speaking.
  • The speech analyser of the present invention can be constructed using integrated circuits and therefore can be constructed in a very small size which allows it to be portable and capable of being carried in one's pocket, for example.

Claims (9)

1. A speech analyser for determining the emotional state of a person, the speech analyser having FM demodulator means (8, 10, 12) for detecting the first formant of a person's speech and producing an FM demodulated signal therefrom, the speech analyser characterized by
(a) word detector means (14) for detecting the presence of an FM demodulated signal;
(b) null detector means (16, 18, 19) for detecting the absence of a change in frequency of the speech utilized to produce the FM demodulated signal and for producing an output indicative thereof; and,
(c) output means (17) coupled to said word detector means (14) and said null detector means (16, 18, 19), wherein said output means (17) is enabled by said word detector means (14) when said word detector means (14) detects the presence of an FM demodulated signal and wherein said output means (17) produces an output indicative of the presence or absence of a change in frequency of the person's speech utilized to produce the FM demodulated signal.
2. A speech analyser, as set forth in claim 1, characterized by said word detector means (14) and said null detector means (16, 18, 19) each coupled to the output of said FM demodulator means (8, 10, 12).
3. A speech analyser, as set forth in claim 1, wherein said null detector means (16, 18, 19) is characterized by
(a) a differentiator means (16) for differentiating the FM demodulated signal;
(b) a full wave rectifier means (18) for rectifying the FM demodulated signal; and
(c) pulse stretching circuit means (19) for eliminating the detection of the absence of a change in frequency of the speech utilized to produce the FM demodulated signal when the differentiated FM demodulated signal passes through zero.
4. A speech analyser, as set forth in claim 1, wherein said output means (17) is characterized by a meter means (30, 32) for depicting voltage magnitude.
5. A speech analyser, as set forth in claim 1, wherein said output means (17) is characterized by
(a) comparator means (20) for detecting the level of the output of the null detector means (16, 18, 19) and comparing the level with predetermined voltage levels wherein when said level is below a first predetermined level there exists an absence of a change in frequency of the speech utilized to produce the FM demodulated signal and when said level is above a second predetermined level a change in the frequency of the speech utilized to produce the FM demodulated signal is present; and
(b) display means (21) for displaying the output of said comparator means (20).
6. A speech analyser, as set forth in claim 5, wherein said display means (21) is characterized by a tactile display (28).
7. A speech analyser, as set forth in claim 5, wherein said display means (21) is characterized by at least two lights (22, 26) one of said lights (26) being turned on when the output of the comparator means (20) is indicative of an absence of a change in frequency of the speech utilized to produce the FM demodulated signal and the other light (22) being turned on when the output of the comparator means (20) is indicative of a change in frequency of the speech utilized to produce the FM demodulated signal.
8. A speech analyser as set forth in claim 7, wherein said display means (21) is further characterized by a third light (24), said third light (24) being turned on when the level of the output of the comparator means (20) is indicative of a transition between the presence and absence of a change in frequency of the speech utilized to produce the FM demodulated signal.
9. A speech analyser, as set forth in claim 1, further characterized by filter means (4, 6) receiving the person's speech and passing signals only in the range of 250 Hz to 800 Hz to the FM demodulator means (8, 10, 12).
EP19790900422 1978-04-11 1979-11-19 Speech analyser Expired EP0012767B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US05/895,375 US4142067A (en) 1977-06-14 1978-04-11 Speech analyzer for analyzing frequency perturbations in a speech pattern to determine the emotional state of a person
US895375 1978-04-11

Publications (2)

Publication Number Publication Date
EP0012767A1 EP0012767A1 (en) 1980-07-09
EP0012767B1 true EP0012767B1 (en) 1983-07-27

Family

ID=25404422

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19790900422 Expired EP0012767B1 (en) 1978-04-11 1979-11-19 Speech analyser

Country Status (5)

Country Link
EP (1) EP0012767B1 (en)
JP (1) JPS55500275A (en)
DE (1) DE2965975D1 (en)
DK (1) DK525679A (en)
WO (1) WO1979000913A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4543957A (en) * 1983-06-13 1985-10-01 Ernest H. Friedman Human response apparatus and method
DE102010064357A1 (en) * 2010-12-29 2012-07-05 Sms Siemag Ag Process for the pyrometallurgical treatment of metals, molten metals and / or slags

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3971034A (en) * 1971-02-09 1976-07-20 Dektor Counterintelligence And Security, Inc. Physiological response analysis method and apparatus
US3855417A (en) * 1972-12-01 1974-12-17 F Fuller Method and apparatus for phonation analysis lending to valid truth/lie decisions by spectral energy region comparison

Also Published As

Publication number Publication date
JPS55500275A (en) 1980-05-08
DK525679A (en) 1979-12-11
EP0012767A1 (en) 1980-07-09
WO1979000913A1 (en) 1979-11-15
DE2965975D1 (en) 1983-09-01

Similar Documents

Publication Publication Date Title
US4142067A (en) Speech analyzer for analyzing frequency perturbations in a speech pattern to determine the emotional state of a person
US6353810B1 (en) System, method and article of manufacture for an emotion detection system improving emotion recognition
US3971034A (en) Physiological response analysis method and apparatus
US6697457B2 (en) Voice messaging system that organizes voice messages based on detected emotion
US6427137B2 (en) System, method and article of manufacture for a voice analysis system that detects nervousness for preventing fraud
US6480826B2 (en) System and method for a telephonic emotion detection that provides operator feedback
EP1222448B1 (en) System, method, and article of manufacture for detecting emotion in voice signals by utilizing statistics for voice signal parameters
Bregman Auditory streaming: Competition among alternative organizations
US3855416A (en) Method and apparatus for phonation analysis leading to valid truth/lie decisions by fundamental speech-energy weighted vibratto component assessment
US6036653A (en) Pulsimeter
Pabon et al. Automatic phonetogram recording supplemented with acoustical voice-quality parameters
KR101248353B1 (en) Speech analyzer detecting pitch frequency, speech analyzing method, and speech analyzing program
US7606701B2 (en) Method and apparatus for determining emotional arousal by speech analysis
US4809332A (en) Speech processing apparatus and methods for processing burst-friction sounds
US3855418A (en) Method and apparatus for phonation analysis leading to valid truth/lie decisions by vibratto component assessment
CN111195132B (en) Non-contact lie detection and emotion recognition method, device and system
US6416485B1 (en) Instrumental measurement of the neuro-psycho-physical state of a person
US3855417A (en) Method and apparatus for phonation analysis lending to valid truth/lie decisions by spectral energy region comparison
JPH0121519B2 (en)
EP0012767B1 (en) Speech analyser
JP2005198828A (en) Biological information analysis apparatus, biological information analysis method, control program, and recording medium
US4319081A (en) Sound level monitoring apparatus
Rao Audio signal processing
JPH0998964A (en) Coughing measurement apparatus
US4276445A (en) Speech analysis apparatus

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Designated state(s): CH DE FR GB SE

17P Request for examination filed
RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: OMNITRONICS RESEARCH CORPORATION

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Designated state(s): CH DE FR GB SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Effective date: 19830727

REF Corresponds to:

Ref document number: 2965975

Country of ref document: DE

Date of ref document: 19830901

ET Fr: translation filed
PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: CH

Payment date: 19840224

Year of fee payment: 6

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 19840301

Year of fee payment: 6

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 19850228

Year of fee payment: 7

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Effective date: 19860228

GBPC Gb: european patent ceased through non-payment of renewal fee
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 19861031

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Effective date: 19861101

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Effective date: 19881118