
US20140148219A1 - Emotional illumination, and related arrangements - Google Patents

Info

Publication number
US20140148219A1
Authority
US
United States
Prior art keywords
smartphone
user
data
instructions
data capture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/058,595
Inventor
Yang Bai
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Digimarc Corp
Original Assignee
Digimarc Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Digimarc Corp
Priority to US14/058,595
Assigned to DIGIMARC CORPORATION. Assignment of assignors interest (see document for details). Assignor: BAI, Yang
Publication of US20140148219A1

Classifications

    • H04N5/243
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174 Facial expression recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/70 Circuitry for compensating brightness variation in the scene
    • H04N23/76 Circuitry for compensating brightness variation in the scene by influencing the image signals
    • G06K9/00302
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/02 Constructional features of telephone sets
    • H04M1/0202 Portable telephone sets, e.g. cordless phones, mobile phones or bar type handsets
    • H04M1/026 Details of the structure or mounting of specific components
    • H04M1/0264 Details of the structure or mounting of specific components for a camera module assembly
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H04N23/61 Control of cameras or camera modules based on recognised objects
    • H04N23/611 Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/70 Circuitry for compensating brightness variation in the scene
    • H04N23/74 Circuitry for compensating brightness variation in the scene by influencing the scene brightness using illuminating means

Definitions

  • The detailed embodiment may be regarded as employing a first, front-facing camera as a user-feedback sensor device, and employing a second camera as an environment sensor device.
  • A related embodiment is a variation on the “smile shutter” concept.
  • A user positions a smartphone so that the second (e.g., rear-facing) camera points towards a desired scene (which is displayed on the phone screen).
  • Rather than waiting for the user to touch the screen, this variant embodiment triggers image capture by analyzing imagery from the front-facing camera, looking for a particular facial signal, such as a smile.
  • When the smartphone operator smiles, the second camera takes a picture. It will be recognized that this arrangement avoids the shake problem inherent in the prior art (in which image capture is triggered by the user touching the screen).
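  • The smile-triggered capture described above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the per-frame smile scores are assumed to come from some facial-expression analyzer running on front-camera imagery, and the threshold value and the function name frames_to_capture are hypothetical. The sketch fires once per smile (on the rising edge) so that a sustained smile does not trigger repeated captures, which is a design choice of this example.

```python
# Assumed score (0.0 to 1.0) at or above which a frame counts as a smile.
SMILE_THRESHOLD = 0.7


def frames_to_capture(smile_scores, threshold=SMILE_THRESHOLD):
    """Return the indices of front-camera frames at which the rear camera fires.

    smile_scores holds one hypothetical smile score per front-camera frame.
    A capture is triggered only when the score first crosses the threshold,
    not on every frame of a sustained smile.
    """
    triggers = []
    smiling = False
    for index, score in enumerate(smile_scores):
        now_smiling = score >= threshold
        if now_smiling and not smiling:
            triggers.append(index)
        smiling = now_smiling
    return triggers


# Two separate smiles in a short sequence of frames.
capture_indices = frames_to_capture([0.1, 0.2, 0.8, 0.9, 0.3, 0.75])
```

  • Because the trigger comes from the front-facing camera, the user never touches the device at the moment of capture.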

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Studio Devices (AREA)
  • Telephone Function (AREA)

Abstract

A smartphone senses a user's emotional reaction to certain output (e.g., an output from a smartphone's attempt to read a barcode printed in a newspaper). The phone then tailors its operation based on the sensed reaction (e.g., it may turn on a torch to better illuminate the newspaper, or vary image processing or decoding parameters).

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a Continuation of prior U.S. application Ser. No. 13/212,119, filed Aug. 17, 2011, which is incorporated herein by reference.
  • TECHNICAL FIELD
  • The present technology concerns smartphones and other processor-equipped devices.
  • BACKGROUND AND INTRODUCTION OF THE TECHNOLOGY
  • Frown/smile detection is used by some consumer cameras to automatically identify good images. (The technology can be used to trigger image capture when a favorable facial expression is sensed, or to select from among a series of images, to pick a favorable image therefrom. It is sometimes termed a “smile shutter.”) See, e.g., US patent publications US20070201725, US20080309796, US20090002512, and US20100110265.
  • Related technology has also been proposed for games, in which a user's facial expression is sensed, and mimicked on an avatar that corresponds to the user in a game. See, e.g., Microsoft's US2011007142. Neven et al. have done related work, shown in U.S. Pat. Nos. 6,580,811 and 6,714,661.
  • Facial expressions can also be used in conjunction with commercial methods, to sense which ads or products are pleasing (or not) to viewers. See, e.g., US20090118593, US2009112616 and US20040001616.
  • Motorola has proposed a phone that senses and communicates the user's emotional state, as indicated by facial expressions. See U.S. Pat. No. 7,874,983.
  • Verizon has suggested tailoring behavior of a user interface based on a user's sensed emotional state. For example, if the user's voice sounds stressed, a phone UI may address the user more slowly. See US20100037187. Related “affective computing” technology is detailed in Microsoft's U.S. Pat. No. 6,212,502, in which the user's emotional state is sensed, and a “help system” user interface responds accordingly. The Microsoft system relies on a Bayesian network to recognize the user's emotion. Additional mood-detecting technology is detailed in Microsoft's US20090002178.
  • A recent survey of affective computing techniques is provided in Robinson, The Emotional Computer, Ninth Intl Conference on Pervasive Computing, June 2011.
  • Separately, smartphones are used to sense machine-readable data from physical media. For example, consumers increasingly use smartphones to read QR codes and encoded digital watermarks from posters, magazines and newspapers, in order to link to related content. Such technology is detailed, e.g., in the assignee's patent documents U.S. Pat. No. 6,947,571, U.S. Pat. No. 6,590,996, 20110161076 and 20100150434, and in applications Ser. No. 13/079,327, filed Apr. 4, 2011, and Ser. No. 13/011,618, filed Jan. 21, 2011.
  • In accordance with one aspect of the present technology, the LED “torch” (illuminator) of a smartphone is activated when a user seems to be having difficulty using the smartphone to sense machine-readable data. With additional illumination on the object being imaged, the smartphone processor may be better able to decode the encoded information from the captured imagery.
  • The foregoing and additional features and advantages of the present technology will be more readily apparent from the following description, which proceeds with reference to the accompanying drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram of an illustrative smartphone.
  • FIG. 2 is a flow chart of a process according to one particular embodiment of the present technology.
  • DETAILED DESCRIPTION
  • Referring to FIG. 1, an illustrative smartphone 10 includes a processor 12, a display 14, a touchscreen 16 and other physical user interface (UI) elements 18 (e.g., buttons, etc.). Also included are one or more microphones 20, a variety of other sensors 22 (e.g., motion sensors such as 3D accelerometers, gyroscopes and magnetometers), a network adapter 24, a location-determining module 26 (e.g., GPS), and an RF transceiver 28.
  • The depicted phone 10 also includes two cameras 30, 32. Camera 30 is front-facing, i.e., with a lens mounted on the side of the smartphone that also includes the screen. The second camera 32 has a lens on a different side of the smartphone, commonly on the back side. The front-facing camera is lower in resolution than the back-facing camera (e.g., 640×480 pixels for the front-facing camera, vs. 1280×720 pixels for the back-facing camera). Accordingly, imagery from the front-facing camera can be processed more simply than imagery from the back-facing camera, with less power consumption and less computational complexity.
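  • The resolution comparison above can be made concrete with a short calculation. The sketch assumes that processing cost grows roughly in proportion to pixel count, which is an assumption of this example rather than a statement from the text; the names FRONT, REAR and pixel_count are illustrative.

```python
# Example resolutions quoted in the text for the two cameras.
FRONT = (640, 480)    # front-facing camera 30
REAR = (1280, 720)    # back-facing camera 32


def pixel_count(resolution):
    """Return the number of pixels in a (width, height) frame."""
    width, height = resolution
    return width * height


front_pixels = pixel_count(FRONT)    # 307200
rear_pixels = pixel_count(REAR)      # 921600
ratio = rear_pixels / front_pixels   # 3.0
```

  • Under that assumption, a per-pixel operation on a front-camera frame touches one third as many pixels as the same operation on a back-camera frame.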
  • Associated with the second camera 32 is an LED “torch” 34 that is mounted so as to illuminate the second camera's field of view. Commonly, this torch is positioned on the same side of the smartphone as the lens of the second camera, although this is not essential.
  • Smartphone 10 also includes a memory 36 that stores software and data. The software includes both operating system software and application software. The former includes software that controls the user interface. The latter includes content processing software—such as a QR code reader and/or a digital watermark decoder. It similarly may include music recognition software.
  • In operation, the smartphone captures first image data from a physical object (e.g., a newspaper) using the second (e.g., rear-facing) camera 32. The smartphone then attempts to decode encoded information from the captured imagery (e.g., a QR code or digital watermark). An associated result is presented to the user, e.g., on the smartphone screen 14.
  • Meanwhile, the smartphone captures imagery of the user's face, from the front-facing camera 30, both before and after the decoding attempt. This facial expression information is analyzed to discern whether an emotion indicated by the user changes negatively. For example, the user's facial expression may change from a neutral expression to a slight frown or grimace. If the smartphone thereby discerns that the user is becoming frustrated with the smartphone, the smartphone processor 12 issues a signal that turns on the torch 34. This torch illuminates the field of view of the camera 32, including the newspaper being imaged.
  • The increased illumination will often allow the smartphone to extract the encoded information from the imagery captured from the newspaper, when the smartphone was previously unable to do so.
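  • The before/after comparison that turns on the torch can be sketched as below. This is a minimal illustration under stated assumptions: the Expression type, its valence scale (-1.0 for a strong frown, 0.0 for neutral, +1.0 for a smile) and the NEGATIVE_SHIFT threshold are hypothetical, standing in for whatever facial-expression analysis a real device would use.

```python
from dataclasses import dataclass


@dataclass
class Expression:
    """Hypothetical output of a facial-expression analyzer.

    valence runs from -1.0 (strong frown) through 0.0 (neutral) to +1.0 (smile).
    """
    valence: float


# Assumed drop in valence that counts as a negative change of emotion.
NEGATIVE_SHIFT = 0.2


def should_enable_torch(before: Expression, after: Expression) -> bool:
    """Return True when the expression worsened across the decoding attempt."""
    return (before.valence - after.valence) >= NEGATIVE_SHIFT


# Neutral before the decoding attempt, slight frown afterwards.
torch_on = should_enable_torch(Expression(0.0), Expression(-0.3))
```

  • The decision uses only front-camera data; the rear camera's imagery plays no part in it.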
  • The torch 34 can be extinguished when the processor 12 indicates that a decoding operation has been performed successfully. Alternatively, the torch can be turned off if imagery captured by the camera 30 reveals a change in the user's facial expression, e.g., from a frown to a neutral expression, or a smile. Still further, the torch can be turned off based on a time interval, such as 3, 5 or 10 seconds following its enablement. The torch can also be extinguished if the processor senses (e.g., by reference to one of the motion sensors) that the phone has been moved from the pose in which the user was holding it when a negative emotion was sensed, to a different pose, indicating that the user has ceased the attempt to extract information from the object.
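  • The four torch-off conditions listed above can be sketched together as follows. The text presents them as alternatives; combining them with a logical OR is a choice of this example. The TorchState fields, the valence scale and the POSE_LIMIT_DEG threshold are hypothetical, while the 5-second timeout is one of the intervals the text mentions.

```python
from dataclasses import dataclass


@dataclass
class TorchState:
    """Hypothetical snapshot of the inputs relevant to turning the torch off."""
    decode_succeeded: bool    # the decoder reported success
    valence: float            # current expression: -1.0 frown .. +1.0 smile
    seconds_on: float         # time elapsed since the torch was enabled
    pose_change_deg: float    # device rotation since the frown was sensed


TIMEOUT_S = 5.0          # the text suggests 3, 5 or 10 seconds
POSE_LIMIT_DEG = 30.0    # assumed threshold, not taken from the text


def should_extinguish(state: TorchState) -> bool:
    """Return True if any one of the torch-off conditions is met."""
    return (
        state.decode_succeeded                     # decoding completed
        or state.valence >= 0.0                    # frown became neutral or a smile
        or state.seconds_on >= TIMEOUT_S           # time interval expired
        or state.pose_change_deg >= POSE_LIMIT_DEG # user moved the phone away
    )


# Still frowning, 1 second in, phone held steady: the torch stays on.
keep_on = not should_extinguish(TorchState(False, -0.4, 1.0, 2.0))
```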
  • Enabling the torch is one action the smartphone can take based on the user's sensed emotion. Alternatively, or additionally, the smartphone can change one or more other parameters. For example, the smartphone may change the focus or zoom of the second camera 32, trying to capture information depicted in a different focal plane. (Such change can be achieved by conventional mechanical arrangements, or by computational photography techniques.) Or a different lens aperture or a different exposure interval can be tried. Likewise, different image processing operations may be triggered, such as spatial-domain or frequency-domain filtering, averaging, or analysis in different color planes (or greyscale). Still further, several captured image frames can be combined, such as by averaging, or using high dynamic range combination techniques, in an attempt to obtain imagery from which better recognition results can be obtained.
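  • Of the alternatives listed above, frame combination by averaging is the simplest to illustrate. The sketch below assumes greyscale frames of identical size that are already spatially aligned, represented as nested lists of pixel values; the function name average_frames and the sample data are illustrative only.

```python
def average_frames(frames):
    """Average equally sized greyscale frames pixel by pixel.

    Each frame is a list of rows, and each row is a list of pixel values.
    The frames are assumed to be spatially aligned with one another.
    """
    if not frames:
        raise ValueError("at least one frame is required")
    height = len(frames[0])
    width = len(frames[0][0])
    count = len(frames)
    return [
        [sum(frame[y][x] for frame in frames) / count for x in range(width)]
        for y in range(height)
    ]


# Three noisy 2x2 captures of the same scene.
frames = [
    [[100, 200], [50, 60]],
    [[110, 190], [50, 70]],
    [[90, 210], [50, 50]],
]
combined = average_frames(frames)
```

  • Averaging tends to suppress noise that varies independently between frames while preserving the stable content (here, the encoded pattern).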
  • In a variant embodiment, other facial expressions control other aspects of image processing. For example, the zoom function of camera 32 can be controlled in accordance with eyelid gestures sensed by camera 30 (e.g., with zoom increasing as the user's eyes are opened further). Similarly, changes to the user's lip posture can vary a parameter of operation (e.g., with zoom increasing as the user's lips move apart).
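  • The eyelid-controlled zoom of this variant can be sketched as a simple mapping. The linear relationship, the zoom limits and the normalized openness measure (0.0 for closed eyes, 1.0 for fully open) are all assumptions of this example; the text says only that zoom increases as the eyes open further.

```python
MIN_ZOOM = 1.0   # assumed zoom factor when the eyes are closed
MAX_ZOOM = 4.0   # assumed zoom factor when the eyes are fully open


def zoom_from_eye_openness(openness: float) -> float:
    """Map a normalized eye-openness value onto a zoom factor for camera 32.

    Values outside the range 0.0 to 1.0 are clamped before mapping.
    """
    clamped = max(0.0, min(1.0, openness))
    return MIN_ZOOM + clamped * (MAX_ZOOM - MIN_ZOOM)


half_open_zoom = zoom_from_eye_openness(0.5)   # 2.5
```

  • The same mapping could take lip separation as its input instead of eye openness.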
  • In the detailed arrangement, it will be recognized that the smartphone analyzes camera data to turn on a torch. However, non-obviously, the analyzed camera data is not from the camera 32 with which the torch is associated, but rather is from a camera 30 facing a different direction (towards the user).
  • The detailed arrangement benefits the user by responding automatically to the user's reflexive reaction to disappointment—without requiring any deliberate action on the user's part. It also conserves battery power, by not energizing the LED unnecessarily.
  • While described in the context of reading barcode or digital watermark data from a printed object, the technology finds other applications as well. One is in performing OCR-based text recognition. Another is in connection with a pattern-matching operation (e.g., based on extracting characteristic feature data from imagery, such as by SURF). A great variety of other smartphone operations can likewise be altered based on the user's sensed emotional state.
  • Other Comments
  • Having described and illustrated the principles of my inventive work with reference to an illustrative example, it will be recognized that the technology is not so limited.
  • For example, while the detailed embodiment senses mood/emotion by reference to facial image data, other embodiments can use other techniques, e.g., based on voice parameters, heart rate, skin conductivity, and/or other biometrics. (Apple's patent publication 20100113950 details technology for capturing and analyzing EKG data from a user, using a smartphone.) A user's gestures with the phone can also be sensed and analyzed to discern likely emotion (e.g., hard shaking of the device can indicate frustration).
  • Particular arrangements for recognizing emotions (e.g., joy, sadness, anticipation, surprise, trust, disgust, anger, fear, etc.) from facial imagery are detailed in US20070066916. Other particular arrangements for facial expression analysis are familiar to artisans in the field from publications including Cohen, et al, “Facial Expression Recognition from Video Sequences: Temporal and Static Modeling,” Computer Vision and Image Understanding 91 (2003), pp. 160-187, and from Chapter 11 (Facial Expression Analysis) in the book Handbook of Face Recognition, Li and Jain, eds., Springer Verlag 2005.
  • Analysis of the user's emotion typically is based on a “before” and “after” comparison of sampled information (e.g., facial expression data). However, this is not essential. The smartphone can decide to change a parameter of operation (e.g., turn on the torch) based on detection of a frown after the smartphone presents an original processing result (e.g., OCR extraction), regardless of the user's expression before presentation of that result. In some embodiments, a negative emotion may be inferred from the lack of a positive facial expression—or a change from positive facial expression to a neutral facial expression.
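  • The three inference modes in the preceding paragraph can be sketched in one function. The valence scale, the two thresholds, and the names label and infer_negative are hypothetical; the expect_positive flag models the case in which the mere lack of a positive expression is treated as negative.

```python
from typing import Optional

POSITIVE = 0.3    # assumed valence at or above which the expression is positive
NEGATIVE = -0.3   # assumed valence at or below which the expression is negative


def label(valence: float) -> str:
    """Classify a valence value as 'positive', 'neutral' or 'negative'."""
    if valence >= POSITIVE:
        return "positive"
    if valence <= NEGATIVE:
        return "negative"
    return "neutral"


def infer_negative(after: float,
                   before: Optional[float] = None,
                   expect_positive: bool = False) -> bool:
    """Return True if a negative reaction is inferred.

    A frown after the result counts regardless of the earlier expression.
    A change from positive to neutral also counts. With expect_positive set,
    any result that is not met with a positive expression counts as negative.
    """
    after_label = label(after)
    if after_label == "negative":
        return True
    if expect_positive and after_label != "positive":
        return True
    if before is not None and label(before) == "positive" and after_label == "neutral":
        return True
    return False


# Frown after the result; no "before" sample is needed.
frown_only = infer_negative(-0.5)
```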
  • Upcoming smartphones will doubtless have stereo cameras for 3D image capture—perhaps both front-facing and back-facing. The availability of stereo imagery of the user's facial expressions allows for more accurate, and nuanced, inferencing of user emotion.
  • In an illustrative embodiment, a classifier arrangement is used to recognize different emotional states. (A classifier is a function that maps an input attribute vector, x = (x1, x2, x3, x4, ..., xn), to a confidence that the input belongs to a class, that is, f(x) = confidence(class). Such classification can employ a probabilistic and/or statistical-based analysis to infer an action or state that corresponds to the user. A support vector machine (SVM) is an example of a classifier that can be employed.)
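  • The mapping f(x) = confidence(class) can be illustrated with a small linear classifier. This is a sketch, not the classifier of the application: the weights and bias are invented for illustration (a trained linear SVM would supply its own), the three input attributes are hypothetical facial measurements, and squashing the linear score through a logistic function to obtain a confidence between 0 and 1 is an assumption of this example.

```python
import math

# Hypothetical weights and bias, standing in for values a trained linear
# classifier (such as a linear SVM) would provide.
WEIGHTS = [1.5, -2.0, 0.5]
BIAS = -0.25


def confidence(x):
    """Map an attribute vector x to a confidence that it belongs to the class.

    The class here is taken to be "frustrated". The attributes are assumed to
    be (brow lowering, lip-corner raise, eye narrowing), each in 0.0 to 1.0.
    """
    if len(x) != len(WEIGHTS):
        raise ValueError("attribute vector has the wrong length")
    score = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return 1.0 / (1.0 + math.exp(-score))


lowered_brow = confidence([1.0, 0.0, 0.0])   # linear score of 1.25
smiling = confidence([0.0, 1.0, 0.0])        # linear score of -2.25
```

  • A confidence above 0.5 corresponds to a positive linear score, i.e., to the side of the decision boundary associated with the class.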
  • While reference has been made to a smartphone-based embodiment, it will be recognized that this technology finds utility with all manner of devices. Game consoles, desktop computers, laptop computers, tablet computers, set-top boxes, televisions, netbooks, wearable computers, etc., can all make use of the principles detailed herein. The term “smartphone” should be construed to encompass all such devices, even those that are not strictly-speaking telephones.
  • Exemplary smartphones include the Apple iPhone 4, and smartphones following Google's Android specification (e.g., the Verizon Droid Eris phone, manufactured by HTC Corp., and the Motorola Droid 3 phone). (Details of the iPhone, including its touch interface, are provided in Apple's published patent application 20080174570.)
  • As is familiar to artisans, the processes and arrangements detailed in this specification can be implemented as instructions for computing devices, including general purpose processor instructions for a variety of programmable processors, including microprocessors (e.g., the Atom and A4), graphics processing units (GPUs, such as the nVidia Tegra APX 2600), and digital signal processors (e.g., the Texas Instruments TMS320 series devices), etc. These instructions can be implemented as software, firmware, etc. These instructions can also be implemented in various forms of processor circuitry, including programmable logic devices, field programmable gate arrays (e.g., the Xilinx Virtex series devices), field programmable object arrays, and application specific circuits—including digital, analog and mixed analog/digital circuitry. Execution of the instructions can be distributed among processors and/or made parallel across processors within a device or across a network of devices. Processing of data can also be distributed among different processor and memory devices. “Cloud” computing resources can be used as well. References to “processors,” “modules” or “components” should be understood to refer to functionality, rather than requiring a particular form of implementation.
  • Software instructions for implementing the detailed functionality can be authored by artisans without undue experimentation from the description provided herein, e.g., written in C, C++, Visual Basic, Java, Python, Tcl, Perl, Scheme, Ruby, etc. Smartphones according to certain implementations of the present technology can include software modules for performing the different functions and acts.
  • Different portions of the functionality can be implemented on different devices. For example, image processing or music recognition operations can involve one or more remote devices, between which execution can be distributed. Extraction of watermark data from image content is one example of a process that can be distributed in such fashion. Another example is image analysis to discern emotion. Thus, it should be understood that description of an operation as being performed by a particular device (e.g., a smartphone) is not limiting but exemplary; performance of the operation by another device (e.g., a remote server), or shared between devices, is also expressly contemplated.
  • While this disclosure has detailed particular ordering of acts and particular combinations of elements, it will be recognized that other contemplated methods may re-order acts (possibly omitting some and adding others), and other contemplated combinations may omit some elements and add others, etc.
  • Although disclosed as complete systems, sub-combinations of the detailed arrangements are also separately contemplated.
  • While detailed in the context of a smartphone that extracts information from imagery, corresponding arrangements are equally applicable to systems that extract information from audio, or from combinations of media.
  • For example, in connection with a music-recognition app or a speech-to-text app, a user's facial response to the app can be captured by a front-facing camera and—if it turns negative—the device can employ alternate strategies to try and obtain a result that is more user-pleasing. For a music app, one strategy is for the smartphone to attempt to characterize non-music audio captured by the microphone, and then apply a corresponding filter to reduce interference from such audio. Another strategy is to involve nearby smartphones in the detection task, e.g., requesting (such as by Bluetooth) that they sample audio from their locations, and forward captured audio—perhaps after initial processing—to the original smartphone. The original smartphone can then combine such audio with its own captured audio to perhaps increase the signal-to-noise ratio of the music, to which a recognition process can be applied—hopefully with a more pleasing result.
  • (Music recognition is taught in Shazam's U.S. Pat. Nos. 6,990,453 and 7,359,889.)
  • More generally, the detailed embodiment may be regarded as employing a first, front-facing camera as a user-feedback sensor device, and employing a second camera as an environment sensor device.
  • A related embodiment is a variation on the “smile shutter” concept. In this embodiment, a user positions a smartphone so that the second (e.g., rear-facing) camera points towards a desired scene (which is displayed on the phone screen). While prior art smartphone cameras normally require the user to touch the screen to capture an image of the scene, this variant embodiment instead triggers image capture by analyzing imagery from the front-facing camera—looking for a particular facial signal, such as a smile. When the smartphone operator smiles, the second camera takes a picture. It will be recognized that this arrangement avoids the shake problem inherent in the prior art (in which image capture is triggered by the user touching the screen).
  • To provide a comprehensive disclosure, while complying with the statutory requirement of conciseness, applicant incorporates-by-reference the patents, patent applications and other documents referenced herein. (Such materials are incorporated in their entireties, even if cited above in connection with specific of their teachings.) These references disclose technologies, teachings and systems that can be incorporated into the arrangements detailed herein, and into which the technologies, teachings and systems detailed herein can be incorporated. The reader is presumed to be familiar with such prior work.
  • In view of the wide variety of embodiments to which the principles and features discussed above can be applied, it should be apparent that the detailed embodiments are illustrative only, and should not be taken as limiting the scope of the invention. Rather, I claim as my invention all such modifications as may come within the scope and spirit of the following claims and equivalents thereof.
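
The feedback arrangement described above (capture environment data, present a result, sense the user's reaction, then change a capture or processing parameter) might be sketched roughly as follows. This is only an illustrative sketch, not the applicant's implementation: `CaptureParams`, `adjust`, `feedback_loop`, the string label "negative", and the four callables are all hypothetical names introduced here, and the policy of trying illumination before focus is an assumption.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class CaptureParams:
    """Hypothetical capture settings (not taken from the specification)."""
    torch_on: bool = False  # illumination state of the light source
    focus_step: int = 0     # stand-in for a focal-plane position


def adjust(params: CaptureParams) -> CaptureParams:
    """Assumed policy: switch the light on first, then step the focus."""
    if not params.torch_on:
        return CaptureParams(True, params.focus_step)
    return CaptureParams(params.torch_on, params.focus_step + 1)


def feedback_loop(
    capture_scene: Callable[[CaptureParams], Any],  # first sensor
    process: Callable[[Any], Any],                  # e.g., decoding or recognition
    present: Callable[[Any], None],                 # output device
    read_emotion: Callable[[], str],                # second sensor plus analysis
    max_attempts: int = 3,
) -> CaptureParams:
    """Repeat capture and processing until the user no longer looks displeased.

    Returns the parameters that were in effect for the last capture.
    """
    params = CaptureParams()
    for attempt in range(max_attempts):
        frame = capture_scene(params)
        present(process(frame))
        # Sense the user's reaction only after the result has been presented.
        if read_emotion() != "negative" or attempt == max_attempts - 1:
            break
        params = adjust(params)
    return params
```

In a real device the four callables would wrap the rear camera, the recognition process, the display, and the front-camera emotion analysis; here they are left abstract so the control flow can be exercised with simple stand-ins.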
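
The strategy of combining audio from nearby smartphones to raise the signal-to-noise ratio can be illustrated with a minimal sketch. It assumes, purely for illustration, that the recordings have already been time-aligned and have equal length (the specification does not say how alignment would be done), and it uses plain sample-wise averaging, which reduces noise only to the extent that the noise differs between devices. The function names `combine_recordings` and `snr_db` are hypothetical.

```python
import math
from typing import List, Sequence


def combine_recordings(recordings: Sequence[Sequence[float]]) -> List[float]:
    """Average several recordings sample by sample.

    Assumes the recordings are already time-aligned and equal in length.
    """
    if not recordings:
        raise ValueError("need at least one recording")
    length = len(recordings[0])
    if any(len(r) != length for r in recordings):
        raise ValueError("recordings must have equal length (assumed pre-aligned)")
    count = len(recordings)
    return [sum(r[i] for r in recordings) / count for i in range(length)]


def snr_db(clean: Sequence[float], noisy: Sequence[float]) -> float:
    """Signal-to-noise ratio in decibels, given a known clean reference."""
    signal_power = sum(s * s for s in clean)
    noise_power = sum((x - s) ** 2 for s, x in zip(clean, noisy))
    if noise_power == 0:
        return math.inf
    return 10 * math.log10(signal_power / noise_power)
```

The clean reference passed to `snr_db` is available only in a test setting; a deployed recognizer would simply hand the combined audio to the recognition process.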

Claims (20)

I claim:
1. A smartphone comprising:
a first sensor capable of capturing first data indicative of at least one condition in an environment surrounding the smartphone;
a second sensor capable of capturing second data indicative of an emotional state of a user of the smartphone;
an output device;
a processor; and
a memory containing stored instructions;
wherein the instructions are executable by the processor to cause the smartphone to:
perform a first data capture operation using the first sensor;
perform a processing operation on the first data captured during the first data capture operation and present an associated result to the user via the output device;
perform a second data capture operation using the second sensor after the associated result is presented;
analyze the second data captured during the second data capture operation to discern an emotion indicated by the user; and
change a parameter associated with at least one of the first data capture operation and the processing operation data based upon the discerned emotion indicated by the user.
2. The smartphone of claim 1, wherein the first sensor includes a first camera having a first field of view relative to the smartphone.
3. The smartphone of claim 2, wherein a parameter associated with at least one of the first data capture operation and the processing operation includes a location of a focal plane associated with the first camera, and wherein the instructions are executable by the processor to cause the smartphone to change the focal plane of the first camera based upon the discerned emotion indicated by the user.
4. The smartphone of claim 2, wherein a parameter associated with at least one of the first data capture operation and the processing operation includes the first field of view, and wherein the instructions are executable by the processor to cause the smartphone to change the first field of view based upon the discerned emotion indicated by the user.
5. The smartphone of claim 2, further comprising a light source capable of illuminating the first field of view, wherein a parameter associated with the first data capture operation includes an illumination state of the light source, and wherein the instructions are executable by the processor to cause the smartphone to change the illumination state of the light source based upon the discerned emotion indicated by the user.
6. The smartphone of claim 2, wherein the second sensor includes a second camera having a second field of view relative to the smartphone, the second field of view being different from the first field of view.
7. The smartphone of claim 2, wherein the instructions are executable by the processor to cause the smartphone to perform at least one operation selected from the group consisting of an information decoding operation and a pattern recognition operation.
8. The smartphone of claim 1, wherein the first sensor includes a microphone.
9. The smartphone of claim 8, wherein a parameter associated with the processing operation includes applying a filter based on audio data captured by the microphone during the first data capture operation.
10. The smartphone of claim 1, wherein the second sensor includes a camera.
11. The smartphone of claim 1, wherein the second sensor includes an accelerometer.
12. The smartphone of claim 1, wherein the output device includes a display.
13. The smartphone of claim 12, wherein the instructions are executable by the processor to cause the smartphone to perform a processing operation including displaying the captured first data on the display as an image.
14. The smartphone of claim 13, wherein the instructions are executable by the processor to cause the smartphone to capture the displayed image based upon the discerned emotion indicated by the user.
15. The smartphone of claim 14, wherein the instructions are executable by the processor to cause the smartphone to capture the displayed image when the analysis discerns that the user is smiling.
16. The smartphone of claim 1, wherein the instructions are executable by the processor to cause the smartphone to change a parameter associated with at least one of the first data capture operation and the processing operation data when the analysis discerns a negative emotion indicated by the user.
17. The smartphone of claim 1, wherein the instructions are further executable by the processor to cause the smartphone to:
perform a third data capture operation using the second sensor before a result associated with the processing operation is presented;
analyze the second data captured during the third data capture operation to discern an emotion indicated by the user; and
change the parameter associated with at least one of the first data capture operation and the processing operation data based upon a change in the discerned emotion indicated by the user from the third data capture operation to the second data capture operation.
18. The smartphone of claim 17, wherein the instructions are executable by the processor to cause the smartphone to change a parameter associated with at least one of the first data capture operation and the processing operation data when the analysis discerns that the emotion of the user changed negatively.
19. A non-transitory computer readable medium containing instructions for use with a device having a first sensor capable of capturing first data indicative of at least one condition in an environment surrounding the device, a second sensor capable of capturing second data indicative of an emotional state of a user of the device, and an output device, wherein said instructions—if executed by a processor in said device—cause the device to perform acts including:
perform a first data capture operation using the first sensor;
perform a processing operation on the first data captured during the first data capture operation and present an associated result to the user via the output device;
perform a second data capture operation using the second sensor after the associated result is presented;
analyze the second data captured during the second data capture operation to discern an emotion indicated by the user; and
change a parameter associated with at least one of the first data capture operation and the processing operation data based upon the discerned emotion indicated by the user.
20. A method comprising:
capturing first data indicative of at least one condition in an environment surrounding a user of a device;
processing the captured first data and presenting an associated result to the user;
capturing second data indicative of an emotional state of the user;
analyzing the captured second data to discern an emotion indicated by the user; and
changing a parameter associated with at least one of the capturing of the first data and the processing of the captured first data based upon the discerned emotion indicated by the user.
US14/058,595 2011-08-17 2013-10-21 Emotional illumination, and related arrangements Abandoned US20140148219A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/058,595 US20140148219A1 (en) 2011-08-17 2013-10-21 Emotional illumination, and related arrangements

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/212,119 US8564684B2 (en) 2011-08-17 2011-08-17 Emotional illumination, and related arrangements
US14/058,595 US20140148219A1 (en) 2011-08-17 2013-10-21 Emotional illumination, and related arrangements

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/212,119 Continuation US8564684B2 (en) 2011-08-17 2011-08-17 Emotional illumination, and related arrangements

Publications (1)

Publication Number Publication Date
US20140148219A1 true US20140148219A1 (en) 2014-05-29

Family

ID=47712399

Family Applications (2)

Application Number Title Priority Date Filing Date
US13/212,119 Expired - Fee Related US8564684B2 (en) 2011-08-17 2011-08-17 Emotional illumination, and related arrangements
US14/058,595 Abandoned US20140148219A1 (en) 2011-08-17 2013-10-21 Emotional illumination, and related arrangements

Country Status (1)

Country Link
US (2) US8564684B2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160127641A1 (en) * 2014-11-03 2016-05-05 Robert John Gove Autonomous media capturing
US20190207992A1 (en) * 2017-12-29 2019-07-04 Facebook, Inc. Systems and methods for sharing content
US20220329678A1 (en) * 2021-03-02 2022-10-13 Apple Inc. Handheld electronic device
US12238401B2 (en) 2020-03-06 2025-02-25 Apple Inc. Housing structure for handheld electronic device

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2758956B1 (en) 2011-09-23 2021-03-10 Digimarc Corporation Context-based smartphone sensor logic
US9348479B2 (en) * 2011-12-08 2016-05-24 Microsoft Technology Licensing, Llc Sentiment aware user interface customization
US9378290B2 (en) 2011-12-20 2016-06-28 Microsoft Technology Licensing, Llc Scenario-adaptive input method editor
EP2864856A4 (en) 2012-06-25 2015-10-14 Microsoft Technology Licensing Llc Input method editor application platform
WO2014032244A1 (en) 2012-08-30 2014-03-06 Microsoft Corporation Feature-based candidate selection
US9104467B2 (en) 2012-10-14 2015-08-11 Ari M Frank Utilizing eye tracking to reduce power consumption involved in measuring affective response
US9477993B2 (en) 2012-10-14 2016-10-25 Ari M Frank Training a predictor of emotional response based on explicit voting on content and eye tracking to verify attention
US20150035952A1 (en) * 2013-08-05 2015-02-05 Samsung Electronics Co., Ltd. Photographing apparatus, display apparatus, photographing method, and computer readable recording medium
EP3030982A4 (en) 2013-08-09 2016-08-03 Microsoft Technology Licensing Llc Input method editor providing language assistance
KR102063102B1 (en) * 2013-08-19 2020-01-07 엘지전자 주식회사 Mobile terminal and control method for the mobile terminal
IL229115A0 (en) * 2013-10-28 2014-03-31 Safe Code Systems Ltd Real - time presence verification
US20150215514A1 (en) * 2014-01-24 2015-07-30 Voxx International Corporation Device for wirelessly controlling a camera
US9311639B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods, apparatus and arrangements for device to device communication
US9269009B1 (en) * 2014-05-20 2016-02-23 Amazon Technologies, Inc. Using a front-facing camera to improve OCR with a rear-facing camera
DE102014222426B4 (en) * 2014-11-04 2025-06-26 Bayerische Motoren Werke Aktiengesellschaft Radio key for adjusting the configuration of a means of transport
US10180339B1 (en) 2015-05-08 2019-01-15 Digimarc Corporation Sensing systems
US10885915B2 (en) 2016-07-12 2021-01-05 Apple Inc. Intelligent software agent
US11816678B2 (en) 2020-06-26 2023-11-14 Capital One Services, Llc Systems and methods for providing user emotion information to a customer service provider

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5839000A (en) * 1997-11-10 1998-11-17 Sharp Laboratories Of America, Inc. Automatic zoom magnification control using detection of eyelid condition
US6614466B2 (en) * 2001-02-22 2003-09-02 Texas Instruments Incorporated Telescopic reconstruction of facial features from a speech pattern
US20080174570A1 (en) * 2006-09-06 2008-07-24 Apple Inc. Touch Screen Device, Method, and Graphical User Interface for Determining Commands by Applying Heuristics
US20080212831A1 (en) * 2007-03-02 2008-09-04 Sony Ericsson Mobile Communications Ab Remote control of an image capturing unit in a portable electronic device
US20090041428A1 (en) * 2007-08-07 2009-02-12 Jacoby Keith A Recording audio metadata for captured images
US20100110265A1 (en) * 2008-11-05 2010-05-06 Sony Corporation Imaging apparatus and display control method thereof

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6947571B1 (en) * 1999-05-19 2005-09-20 Digimarc Corporation Cell phones with optical capabilities, and related applications
US6590996B1 (en) * 2000-02-14 2003-07-08 Digimarc Corporation Color adaptive watermarking
US6185534B1 (en) * 1998-03-23 2001-02-06 Microsoft Corporation Modeling emotion and personality in a computer user interface
JP3970520B2 (en) * 1998-04-13 2007-09-05 アイマティック・インターフェイシズ・インコーポレイテッド Capturing facial movements based on wavelets to animate a human figure
US6714661B2 (en) * 1998-11-06 2004-03-30 Nevengineering, Inc. Method and system for customizing facial feature tracking using precise landmark finding on a neutral face image
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
US7359889B2 (en) * 2001-03-02 2008-04-15 Landmark Digital Services Llc Method and apparatus for automatically creating database for use in automated media recognition system
US20040001616A1 (en) * 2002-06-27 2004-01-01 Srinivas Gutta Measurement of content ratings through vision and speech recognition
US7665024B1 (en) * 2002-07-22 2010-02-16 Verizon Services Corp. Methods and apparatus for controlling a user interface based on the emotional state of a user
US7874983B2 (en) * 2003-01-27 2011-01-25 Motorola Mobility, Inc. Determination of emotional and physiological states of a recipient of a communication
CA2622365A1 (en) * 2005-09-16 2007-09-13 Imotions-Emotion Technology A/S System and method for determining human emotion by analyzing eye properties
US7804983B2 (en) * 2006-02-24 2010-09-28 Fotonation Vision Limited Digital image acquisition control and correction method and apparatus
JP2008234401A (en) * 2007-03-22 2008-10-02 Fujifilm Corp User interface device and operation control method thereof
US20090112616A1 (en) * 2007-10-30 2009-04-30 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Polling for interest in computational user-health test output
US20090118593A1 (en) * 2007-11-07 2009-05-07 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Determining a demographic characteristic based on computational user-health testing of a user interaction with advertiser-specified content
JP4600435B2 (en) * 2007-06-13 2010-12-15 ソニー株式会社 Image photographing apparatus, image photographing method, and computer program
JP2009010776A (en) * 2007-06-28 2009-01-15 Sony Corp Imaging apparatus, imaging control method, and program
US20090002178A1 (en) * 2007-06-29 2009-01-01 Microsoft Corporation Dynamic mood sensing
US8805110B2 (en) * 2008-08-19 2014-08-12 Digimarc Corporation Methods and systems for content processing
US8615290B2 (en) * 2008-11-05 2013-12-24 Apple Inc. Seamlessly embedded heart rate monitor
US9117268B2 (en) * 2008-12-17 2015-08-25 Digimarc Corporation Out of phase digital watermarking in two chrominance directions
US8886206B2 (en) * 2009-05-01 2014-11-11 Digimarc Corporation Methods and systems for content processing
US8390680B2 (en) * 2009-07-09 2013-03-05 Microsoft Corporation Visual representation expression based on player expression
US20110013034A1 (en) * 2009-07-15 2011-01-20 Mediatek Inc. Method for operating digital camera and digital camera using the same
KR101078057B1 (en) * 2009-09-08 2011-10-31 주식회사 팬택 Mobile terminal had a function of photographing control and photographing control system used image recognition technicque
US20120154633A1 (en) * 2009-12-04 2012-06-21 Rodriguez Tony F Linked Data Methods and Systems
US20120004575A1 (en) * 2010-06-30 2012-01-05 Sony Ericsson Mobile Communications Ab System and method for indexing content viewed on an electronic device
US20120046071A1 (en) * 2010-08-20 2012-02-23 Robert Craig Brandis Smartphone-based user interfaces, such as for browsing print media

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160127641A1 (en) * 2014-11-03 2016-05-05 Robert John Gove Autonomous media capturing
US10334158B2 (en) * 2014-11-03 2019-06-25 Robert John Gove Autonomous media capturing
US11509817B2 (en) * 2014-11-03 2022-11-22 Robert John Gove Autonomous media capturing
US20230156319A1 (en) * 2014-11-03 2023-05-18 Robert John Gove Autonomous media capturing
US12149819B2 (en) * 2014-11-03 2024-11-19 Robert John Gove Autonomous media capturing
US20190207992A1 (en) * 2017-12-29 2019-07-04 Facebook, Inc. Systems and methods for sharing content
US10805367B2 (en) * 2017-12-29 2020-10-13 Facebook, Inc. Systems and methods for sharing content
US12238401B2 (en) 2020-03-06 2025-02-25 Apple Inc. Housing structure for handheld electronic device
US20220329678A1 (en) * 2021-03-02 2022-10-13 Apple Inc. Handheld electronic device
US12088748B2 (en) * 2021-03-02 2024-09-10 Apple Inc. Handheld electronic device

Also Published As

Publication number Publication date
US8564684B2 (en) 2013-10-22
US20130044233A1 (en) 2013-02-21

Legal Events

Date Code Title Description
AS Assignment

Owner name: DIGIMARC CORPORATION, OREGON

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAI, YANG;REEL/FRAME:032155/0685

Effective date: 20140124

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION