US20140148219A1 - Emotional illumination, and related arrangements - Google Patents
Emotional illumination, and related arrangements Download PDFInfo
- Publication number
- US20140148219A1 US20140148219A1 US14/058,595 US201314058595A US2014148219A1 US 20140148219 A1 US20140148219 A1 US 20140148219A1 US 201314058595 A US201314058595 A US 201314058595A US 2014148219 A1 US2014148219 A1 US 2014148219A1
- Authority
- US
- United States
- Prior art keywords
- smartphone
- user
- data
- instructions
- data capture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- H04N5/243—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/174—Facial expression recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/70—Circuitry for compensating brightness variation in the scene
- H04N23/76—Circuitry for compensating brightness variation in the scene by influencing the image signals
-
- G06K9/00302—
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/02—Constructional features of telephone sets
- H04M1/0202—Portable telephone sets, e.g. cordless phones, mobile phones or bar type handsets
- H04M1/026—Details of the structure or mounting of specific components
- H04M1/0264—Details of the structure or mounting of specific components for a camera module assembly
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/61—Control of cameras or camera modules based on recognised objects
- H04N23/611—Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/70—Circuitry for compensating brightness variation in the scene
- H04N23/74—Circuitry for compensating brightness variation in the scene by influencing the scene brightness using illuminating means
Definitions
- Frown/smile detection is used by some consumer cameras to automatically identify good images.
- the technology can be used to trigger image capture when a favorable facial expression is sensed, or to select from among a series of images, to pick a favorable image therefrom. It is sometimes termed a “smile shutter.”) See, e.g., US patent publications US20070201725, US20080309796, US20090002512, and US20100110265.
- Facial expressions can also be used in conjunction with commercial methods, to sense which ads or products are pleasing (or not) to viewers. See, e.g., US20090118593, US2009112616 and US20040001616.
- Verizon has suggested tailoring behavior of a user interface based on a user's sensed emotional state. For example, if the user's voice sounds stressed, a phone UI may address the user more slowly. See US20100037187.
- Related “affective computing” technology is detailed in Microsoft's U.S. Pat. No. 6,212,502, in which the user's emotional state is sensed, and a “help system” user interface responds accordingly.
- the Microsoft system relies on a Bayesian network to recognize the user's emotion. Additional mood-detecting technology is detailed in Microsoft's US20090002178.
- smartphones are used to sense machine readable data from physical media.
- consumers increasingly use smartphones to read QR codes and encoded digital watermarks from posters, magazines and newspapers, in order to link to related content.
- Such technology is detailed, e.g., in the assignee's patent documents U.S. Pat. No. 6,947,571, U.S. Pat. No. 6,590,996, 20110161076 and 20100150434, and in applications Ser. No. 13/079,327, filed Apr. 4, 2011, and Ser. No. 13/011,618, filed Jan. 21, 2011.
- the LED “torch” (illuminator) of a smartphone is activated when a user seems to be having difficulty using the smartphone to sense machine-readable data.
- the smartphone processor may be better able to decode the encoded information from the captured imagery.
- FIG. 1 is a block diagram of an illustrative smartphone.
- FIG. 2 is a flow chart of a process according to one particular embodiment of the present technology.
- an illustrative smartphone 10 includes a processor 12 , a display 14 , a touchscreen 16 and other physical user interface (UI) elements 18 (e.g., buttons, etc.). Also included are one or more microphones 20 , a variety of other sensors 22 (e.g., motions sensors such 3D accelerometers, gyroscopes and magnetometers), a network adapter 24 , a location-determining module 26 (e.g., GPS), and an RF transceiver 28 .
- UI physical user interface
- the depicted phone 10 also includes two cameras 30 , 32 .
- Camera 30 is front-facing, i.e., with a lens mounted on the side of the smartphone that also includes the screen.
- the second camera 32 has a lens on a different side of the smartphone, commonly on the back side.
- the front-facing camera is lower in resolution than the back-facing camera (e.g., 640 ⁇ 480 pixels for the front-facing camera, vs. 1280 ⁇ 720 pixels for the back-facing camera). Accordingly, imagery from the front-facing camera can be processed more simply than imagery from the back-facing camera, with less power consumption and less computational complexity.
- an LED “torch” 34 Associated with the second camera 32 is an LED “torch” 34 that is mounted so as to illuminate the second camera's field of view. Commonly, this torch is positioned on the same side of the smartphone as the lens of the second camera, although this is not essential.
- Smartphone 10 also includes a memory 36 that stores software and data.
- the software includes both operating system software and application software.
- the former includes software that controls the user interface.
- the latter includes content processing software—such as a QR code reader and/or a digital watermark decoder. It similarly may include music recognition software.
- the smartphone captures first image data from a physical object (e.g., a newspaper) using the second (e.g., rear-facing) camera 32 .
- the smartphone attempts to decode encoded information from the captured imagery (e.g., a QR code or digital watermark).
- An associated result is presented to the user, e.g., on the smartphone screen 14 .
- the smartphone captures imagery of the user's face, from the front-facing camera 30 —both before and after the decoding attempt.
- This facial expression information is analyzed to discern whether an emotion indicated by the user changes negatively. For example, the user's facial expression may change from a neutral expression to a slight frown or grimace. If the smartphone thereby discerns that the user is becoming frustrated with the smartphone, the smartphone processor 36 issues a signal that turns on the torch 34 . This torch illuminates the field of view of the camera 32 , including the newspaper being imaged.
- the increased illumination will often allow the smartphone to extract the encoded information from the imagery captured from the newspaper, when the smartphone was previously unable to do so.
- the torch 34 can be extinguished when the processor 36 indicates that a decoding operation has been performed successfully.
- the torch can be turned-off if imagery captured by the camera 30 reveals a change in the users' facial expression, e.g., from a frown to a neutral expression, or a smile.
- the torch can be turned-off based on a time interval—such as 3, 5 or 10 seconds following its enablement.
- the torch can also be extinguished if the processor senses (e.g., by reference to one of the motion sensors) that the phone has been moved from the pose in which the user was holding it when a negative emotion was sensed, to a different pose—indicating that the user has ceased the attempt to extract information from the object.
- Enabling the torch is one action the smartphone can take based on the user's sensed emotion.
- the smartphone can change one or more other parameters.
- the smartphone may change the focus or zoom of the second camera 32 —trying to capture information depicted in a different focal plane. (Such change can be achieved by conventional mechanical arrangements, or by computational photography techniques). Or a different lens aperture or a different exposure interval can be tried.
- different image processing operations may be triggered, such as spatial-domain or frequency-domain filtering, averaging, or analysis in different color planes (or greyscale).
- several captured image frames can be combined, such as by averaging, or using high dynamic range combination techniques, in an attempt to obtain imagery from which better recognition results can be obtained.
- other facial expressions control other aspects of image processing.
- the zoom function of camera 32 can be controlled in accordance with eyelid gestures sensed by camera 30 (e.g., with zoom increasing as the user's eyes are opened further).
- changes to the user's lip posture can vary a parameter of operation (e.g., with zoom increasing as the user's lips move apart).
- the smartphone analyzes camera data to turn on a torch.
- the analyzed camera data is not from the camera 32 with which the torch is associated, but rather is from a camera 30 facing a different direction (towards the user).
- the detailed arrangement benefits the user by responding automatically to the user's reflexive reaction to disappointment—without requiring any deliberate action on the user's part. It also conserves battery power, by not energizing the LED unnecessarily.
- the detailed embodiment senses mood/emotion by reference to facial image data
- other embodiments can use other techniques, e.g., based on voice parameters, heart rate, skin conductivity, and/or other biometrics.
- Apple's patent publication 20100113950 details technology for capturing and analyzing EKG data from a user, using a smartphone.
- a user's gestures with the phone can also be sensed and analyzed to discern likely emotion (e.g., hard shaking of the device can indicate frustration).
- Analysis of the user's emotion typically is based on a “before” and “after” comparison of sampled information (e.g., facial expression data). However, this is not essential.
- the smartphone can decide to change a parameter of operation (e.g., turn on the torch) based on detection of a frown after the smartphone presents an original processing result (e.g., OCR extraction), regardless of the user's expression before presentation of that result.
- a negative emotion may be inferred from the lack of a positive facial expression—or a change from positive facial expression to a neutral facial expression.
- a classifier arrangement is used to recognize different emotional states.
- Such classification can employ a probabilistic and/or statistical-based analysis to infer an action or state that corresponds to user.
- a support vector machine (SVM) is an example of a classifier that can be employed.
- Exemplary smartphones include the Apple iPhone 4, and smartphones following Google's Android specification (e.g., the Verizon Droid Eris phone, manufactured by HTC Corp., and the Motorola Droid 3 phone). (Details of the iPhone, including its touch interface, are provided in Apple's published patent application 20080174570.)
- Google's Android specification e.g., the Verizon Droid Eris phone, manufactured by HTC Corp., and the Motorola Droid 3 phone.
- processors for a variety of programmable processors, including microprocessors (e.g., the Atom and A4), graphics processing units (GPUs, such as the nVidia Tegra APX 2600), and digital signal processors (e.g., the Texas Instruments TMS320 series devices), etc.
- microprocessors e.g., the Atom and A4
- GPUs graphics processing units
- digital signal processors e.g., the Texas Instruments TMS320 series devices
- These instructions can be implemented as software, firmware, etc.
- processor circuitry including programmable logic devices, field programmable gate arrays (e.g., the Xilinx Virtex series devices), field programmable object arrays, and application specific circuits—including digital, analog and mixed analog/digital circuitry.
- Execution of the instructions can be distributed among processors and/or made parallel across processors within a device or across a network of devices. Processing of data can also be distributed among different processor and memory devices. “Cloud” computing resources can be used as well. References to “processors,” “modules” or “components” should be understood to refer to functionality, rather than requiring a particular form of implementation.
- Smartphones can include software modules for performing the different functions and acts.
- image processing or music recognition operations can involve one or more remote devices, between which execution can be distributed. Extraction of watermark data from image content is one example of a process that can be distributed in such fashion. Another example is image analysis to discern emotion.
- description of an operation as being performed by a particular device e.g., a smartphone
- performance of the operation by another device e.g., a remote server
- another device e.g., a remote server
- a user's facial response to the app can be captured by a front-facing camera and—if it turns negative—the device can employ alternate strategies to try and obtain a result that is more user-pleasing.
- a music app one strategy is for the smartphone to attempt to characterize non-music audio captured by the microphone, and then apply a corresponding filter to reduce interference from such audio.
- Another strategy is to involve nearby smartphones in the detection task, e.g., requesting (such as by Bluetooth) that they sample audio from their locations, and forward captured audio—perhaps after initial processing—to the original smartphone.
- the original smartphone can then combine such audio with its own captured audio to perhaps increase the signal-to-noise ratio of the music, to which a recognition process can be applied—hopefully with a more pleasing result.
- the detailed embodiment may be regarded as employing a first, front-facing camera as a user-feedback sensor device, and employing a second camera as an environment sensor device.
- a related embodiment is a variation on the “smile shutter” concept.
- a user positions a smartphone so that the second (e.g., rear-facing) camera points towards a desired scene (which is displayed on the phone screen).
- the second camera e.g., rear-facing
- this variant embodiment instead triggers image capture by analyzing imagery from the front-facing camera—looking for a particular facial signal, such as a smile.
- a smile When the smartphone operator smiles, the second camera takes a picture. It will be recognized that this arrangement avoids the shake problem inherent in the prior art (in which image capture is triggered by the user touching the screen).
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Studio Devices (AREA)
- Telephone Function (AREA)
Abstract
A smartphone senses a user's emotional reaction to certain output (e.g., an output from a smartphone's attempt to read a barcode printed in a newspaper). The phone then tailors its operation based on the sensed reaction (e.g., it may turn on a torch to better illuminate the newspaper, or vary image processing or decoding parameters).
Description
- This application is a Continuation of prior U.S. application Ser. No. 13/212,119, filed Aug. 17, 2011, which is incorporated herein by reference.
- Frown/smile detection is used by some consumer cameras to automatically identify good images. (The technology can be used to trigger image capture when a favorable facial expression is sensed, or to select from among a series of images, to pick a favorable image therefrom. It is sometimes termed a “smile shutter.”) See, e.g., US patent publications US20070201725, US20080309796, US20090002512, and US20100110265.
- Related technology has also been proposed for games, in which a user's facial expression is sensed, and mimicked on an avatar that corresponds to the user in a game. See, e.g., Microsoft's US2011007142. Neven et al has done related work, shown in U.S. Pat. Nos. 6,580,811 and 6,714,661.
- Facial expressions can also be used in conjunction with commercial methods, to sense which ads or products are pleasing (or not) to viewers. See, e.g., US20090118593, US2009112616 and US20040001616.
- Motorola has proposed a phone that senses and communicates the user's emotional state, as indicated by facial expressions. See U.S. Pat. No. 7,874,983.
- Verizon has suggested tailoring behavior of a user interface based on a user's sensed emotional state. For example, if the user's voice sounds stressed, a phone UI may address the user more slowly. See US20100037187. Related “affective computing” technology is detailed in Microsoft's U.S. Pat. No. 6,212,502, in which the user's emotional state is sensed, and a “help system” user interface responds accordingly. The Microsoft system relies on a Bayesian network to recognize the user's emotion. Additional mood-detecting technology is detailed in Microsoft's US20090002178.
- A recent survey of affective computing techniques is provided in Robinson, The Emotional Computer, Ninth Intl Conference on Pervasive Computing, Jun., 2011.
- Separately, smartphones are used to sense machine readable data from physical media. For example, consumers increasingly use smartphones to read QR codes and encoded digital watermarks from posters, magazines and newspapers, in order to link to related content. Such technology is detailed, e.g., in the assignee's patent documents U.S. Pat. No. 6,947,571, U.S. Pat. No. 6,590,996, 20110161076 and 20100150434, and in applications Ser. No. 13/079,327, filed Apr. 4, 2011, and Ser. No. 13/011,618, filed Jan. 21, 2011.
- In accordance with one aspect of the present technology, the LED “torch” (illuminator) of a smartphone is activated when a user seems to be having difficulty using the smartphone to sense machine-readable data. With additional illumination on the object being imaged, the smartphone processor may be better able to decode the encoded information from the captured imagery.
- The foregoing and additional features and advantages of the present technology will be more readily apparent from the following description, which proceeds with reference to the accompanying drawings.
-
FIG. 1 is a block diagram of an illustrative smartphone. -
FIG. 2 is a flow chart of a process according to one particular embodiment of the present technology. - Referring to
FIG. 1 , anillustrative smartphone 10 includes aprocessor 12, adisplay 14, atouchscreen 16 and other physical user interface (UI) elements 18 (e.g., buttons, etc.). Also included are one ormore microphones 20, a variety of other sensors 22 (e.g., motions sensors such 3D accelerometers, gyroscopes and magnetometers), anetwork adapter 24, a location-determining module 26 (e.g., GPS), and anRF transceiver 28. - The depicted
phone 10 also includes two 30, 32. Camera 30 is front-facing, i.e., with a lens mounted on the side of the smartphone that also includes the screen. Thecameras second camera 32 has a lens on a different side of the smartphone, commonly on the back side. The front-facing camera is lower in resolution than the back-facing camera (e.g., 640×480 pixels for the front-facing camera, vs. 1280×720 pixels for the back-facing camera). Accordingly, imagery from the front-facing camera can be processed more simply than imagery from the back-facing camera, with less power consumption and less computational complexity. - Associated with the
second camera 32 is an LED “torch” 34 that is mounted so as to illuminate the second camera's field of view. Commonly, this torch is positioned on the same side of the smartphone as the lens of the second camera, although this is not essential. - Smartphone 10 also includes a
memory 36 that stores software and data. The software includes both operating system software and application software. The former includes software that controls the user interface. The latter includes content processing software—such as a QR code reader and/or a digital watermark decoder. It similarly may include music recognition software. - In operation, the smartphone captures first image data from a physical object (e.g., a newspaper) using the second (e.g., rear-facing)
camera 32. The smartphone then attempts to decode encoded information from the captured imagery (e.g., a QR code or digital watermark). An associated result is presented to the user, e.g., on thesmartphone screen 14. - Meanwhile, the smartphone captures imagery of the user's face, from the front-facing
camera 30—both before and after the decoding attempt. This facial expression information is analyzed to discern whether an emotion indicated by the user changes negatively. For example, the user's facial expression may change from a neutral expression to a slight frown or grimace. If the smartphone thereby discerns that the user is becoming frustrated with the smartphone, thesmartphone processor 36 issues a signal that turns on thetorch 34. This torch illuminates the field of view of thecamera 32, including the newspaper being imaged. - The increased illumination will often allow the smartphone to extract the encoded information from the imagery captured from the newspaper, when the smartphone was previously unable to do so.
- The
torch 34 can be extinguished when theprocessor 36 indicates that a decoding operation has been performed successfully. Alternatively, the torch can be turned-off if imagery captured by thecamera 30 reveals a change in the users' facial expression, e.g., from a frown to a neutral expression, or a smile. Still further, the torch can be turned-off based on a time interval—such as 3, 5 or 10 seconds following its enablement. The torch can also be extinguished if the processor senses (e.g., by reference to one of the motion sensors) that the phone has been moved from the pose in which the user was holding it when a negative emotion was sensed, to a different pose—indicating that the user has ceased the attempt to extract information from the object. - Enabling the torch is one action the smartphone can take based on the user's sensed emotion. Alternatively, or additionally, the smartphone can change one or more other parameters. For example, the smartphone may change the focus or zoom of the
second camera 32—trying to capture information depicted in a different focal plane. (Such change can be achieved by conventional mechanical arrangements, or by computational photography techniques). Or a different lens aperture or a different exposure interval can be tried. Likewise, different image processing operations may be triggered, such as spatial-domain or frequency-domain filtering, averaging, or analysis in different color planes (or greyscale). Still further, several captured image frames can be combined, such as by averaging, or using high dynamic range combination techniques, in an attempt to obtain imagery from which better recognition results can be obtained. - In a variant embodiment, other facial expressions control other aspects of image processing. For example, the zoom function of
camera 32 can be controlled in accordance with eyelid gestures sensed by camera 30 (e.g., with zoom increasing as the user's eyes are opened further). Similarly, changes to the user's lip posture can vary a parameter of operation (e.g., with zoom increasing as the user's lips move apart). - In the detailed arrangement, it will be recognized that the smartphone analyzes camera data to turn on a torch. However, non-obviously, the analyzed camera data is not from the
camera 32 with which the torch is associated, but rather is from acamera 30 facing a different direction (towards the user). - The detailed arrangement benefits the user by responding automatically to the user's reflexive reaction to disappointment—without requiring any deliberate action on the user's part. It also conserves battery power, by not energizing the LED unnecessarily.
- While described in the context of reading barcode or digital watermark data from a printed object, the technology finds other applications as well. One is in performing OCR-based text recognition. Another is in connection with a pattern-matching operation (e.g., based on extracting characteristic feature data from imagery, such as by SURF). A great variety of other smartphone operations can likewise be altered based on the user's sensed emotional state.
- Having described and illustrated the principles of my inventive work with reference to an illustrative example, it will be recognized that the technology is not so limited.
- For example, while the detailed embodiment senses mood/emotion by reference to facial image data, other embodiments can use other techniques, e.g., based on voice parameters, heart rate, skin conductivity, and/or other biometrics. (Apple's patent publication 20100113950 details technology for capturing and analyzing EKG data from a user, using a smartphone.) A user's gestures with the phone can also be sensed and analyzed to discern likely emotion (e.g., hard shaking of the device can indicate frustration).
- Particular arrangements for recognizing emotions (e.g., joy, sadness, anticipation, surprise, trust, disgust, anger, fear, etc.) from facial imagery are detailed in US20070066916. Other particular arrangements for facial expression analysis are familiar to artisans in the field from publications including Cohen, et al, “Facial Expression Recognition from Video Sequences: Temporal and Static Modeling,” Computer Vision and Understanding 91 (2003), pp. 160-187, and from Chapter 11 (Facial Expression Analysis) in the book Handbook of Facial Recognition, Li and Jain, eds., Springer Verlag 2005.
- Analysis of the user's emotion typically is based on a “before” and “after” comparison of sampled information (e.g., facial expression data). However, this is not essential. The smartphone can decide to change a parameter of operation (e.g., turn on the torch) based on detection of a frown after the smartphone presents an original processing result (e.g., OCR extraction), regardless of the user's expression before presentation of that result. In some embodiments, a negative emotion may be inferred from the lack of a positive facial expression—or a change from positive facial expression to a neutral facial expression.
- Upcoming smartphones will doubtless have stereo cameras for 3D image capture—perhaps both front-facing and back-facing. The availability of stereo imagery of the user's facial expressions allows for more accurate, and nuanced, inferencing of user emotion.
- In an illustrative embodiment, a classifier arrangement is used to recognize different emotional states. (A classifier is a function that maps an input attribute vector, x=(x1, x2, x3, x4,xn), to a confidence that the input belongs to a class, that is, f(x)=confidence(class). Such classification can employ a probabilistic and/or statistical-based analysis to infer an action or state that corresponds to user. A support vector machine (SVM) is an example of a classifier that can be employed.)
- While reference has been made to a smartphone-based embodiment, it will be recognized that this technology finds utility with all manner of devices. Game consoles, desktop computers, laptop computers, tablet computers, set-top boxes, televisions, netbooks, wearable computers, etc., can all make use of the principles detailed herein. The term “smartphone” should be construed to encompass all such devices, even those that are not strictly-speaking telephones.
- Exemplary smartphones include the Apple iPhone 4, and smartphones following Google's Android specification (e.g., the Verizon Droid Eris phone, manufactured by HTC Corp., and the Motorola Droid 3 phone). (Details of the iPhone, including its touch interface, are provided in Apple's published patent application 20080174570.)
- As is familiar to artisans, the processes and arrangements detailed in this specification can be implemented as instructions for computing devices, including general purpose processor instructions for a variety of programmable processors, including microprocessors (e.g., the Atom and A4), graphics processing units (GPUs, such as the nVidia Tegra APX 2600), and digital signal processors (e.g., the Texas Instruments TMS320 series devices), etc. These instructions can be implemented as software, firmware, etc. These instructions can also be implemented in various forms of processor circuitry, including programmable logic devices, field programmable gate arrays (e.g., the Xilinx Virtex series devices), field programmable object arrays, and application specific circuits—including digital, analog and mixed analog/digital circuitry. Execution of the instructions can be distributed among processors and/or made parallel across processors within a device or across a network of devices. Processing of data can also be distributed among different processor and memory devices. “Cloud” computing resources can be used as well. References to “processors,” “modules” or “components” should be understood to refer to functionality, rather than requiring a particular form of implementation.
- Software instructions for implementing the detailed functionality can be authored by artisans without undue experimentation from the description provided herein, e.g., written in C, C++, Visual Basic, Java, Python, Tcl, Perl, Scheme, Ruby, etc. Smartphones according to certain implementations of the present technology can include software modules for performing the different functions and acts.
- Different of the functionality can be implemented on different devices. For example, image processing or music recognition operations can involve one or more remote devices, between which execution can be distributed. Extraction of watermark data from image content is one example of a process that can be distributed in such fashion. Another example is image analysis to discern emotion. Thus, it should be understood that description of an operation as being performed by a particular device (e.g., a smartphone) is not limiting but exemplary; performance of the operation by another device (e.g., a remote server), or shared between devices, is also expressly contemplated.
- While this disclosure has detailed particular ordering of acts and particular combinations of elements, it will be recognized that other contemplated methods may re-order acts (possibly omitting some and adding others), and other contemplated combinations may omit some elements and add others, etc.
- Although disclosed as complete systems, sub-combinations of the detailed arrangements are also separately contemplated.
- While detailed in the context of a smartphone that extracts information from imagery, corresponding arrangements are equally applicable to systems that extract information from audio, or from combinations of media.
- For example, in connection with a music-recognition app or a speech-to-text app, a user's facial response to the app can be captured by a front-facing camera and—if it turns negative—the device can employ alternate strategies to try and obtain a result that is more user-pleasing. For a music app, one strategy is for the smartphone to attempt to characterize non-music audio captured by the microphone, and then apply a corresponding filter to reduce interference from such audio. Another strategy is to involve nearby smartphones in the detection task, e.g., requesting (such as by Bluetooth) that they sample audio from their locations, and forward captured audio—perhaps after initial processing—to the original smartphone. The original smartphone can then combine such audio with its own captured audio to perhaps increase the signal-to-noise ratio of the music, to which a recognition process can be applied—hopefully with a more pleasing result.
- (Music recognition is taught in Shazam's U.S. Pat. Nos. 6,990,453 and 7,359,889.)
- More generally, the detailed embodiment may be regarded as employing a first, front-facing camera as a user-feedback sensor device, and employing a second camera as an environment sensor device.
- A related embodiment is a variation on the “smile shutter” concept. In this embodiment, a user positions a smartphone so that the second (e.g., rear-facing) camera points towards a desired scene (which is displayed on the phone screen). While prior art smartphone cameras normally require the user to touch the screen to capture an image of the scene, this variant embodiment instead triggers image capture by analyzing imagery from the front-facing camera—looking for a particular facial signal, such as a smile. When the smartphone operator smiles, the second camera takes a picture. It will be recognized that this arrangement avoids the shake problem inherent in the prior art (in which image capture is triggered by the user touching the screen).
- To provide a comprehensive disclosure, while complying with the statutory requirement of conciseness, applicant incorporates-by-reference the patents, patent applications and other documents referenced herein. (Such materials are incorporated in their entireties, even if cited above in connection with specific of their teachings.) These references disclose technologies, teachings and systems that can be incorporated into the arrangements detailed herein, and into which the technologies, teachings and systems detailed herein can be incorporated. The reader is presumed to be familiar with such prior work.
- In view of the wide variety of embodiments to which the principles and features discussed above can be applied, it should be apparent that the detailed embodiments are illustrative only, and should not be taken as limiting the scope of the invention. Rather, I claim as my invention all such modifications as may come within the scope and spirit of the following claims and equivalents thereof.
Claims (20)
1. A smartphone comprising:
a first sensor capable of capturing first data indicative of at least one condition in an environment surrounding the smartphone;
a second sensor capable of capturing second data indicative of an emotional state of a user of the smartphone;
an output device;
a processor; and
a memory containing stored instructions;
wherein the instructions are executable by the processor to cause the smartphone to:
perform a first data capture operation using the first sensor;
perform a processing operation on the first data captured during the first data capture operation and present an associated result to the user via the output device;
perform a second data capture operation using the second sensor after the associated result is presented;
analyze the second data captured during the first data capture operation to discern an emotion indicated by the user; and
change a parameter associated with at least one of the first data capture operation and the processing operation data based upon the discerned emotion indicated by the user.
2. The smartphone of claim 1 , wherein the first sensor includes a first camera having a first field of view relative to the smartphone.
3. The smartphone of claim 2 , wherein a parameter associated with at least one of the first data capture operation and the processing operation includes a location of a focal plane associated with the first camera, and wherein the instructions are executable by the processor to cause the smartphone to change the focal plane of the first camera based upon the discerned emotion indicated by the user.
4. The smartphone of claim 2 , wherein a parameter associated with at least one of the first data capture operation and the processing operation includes the first field of view, and wherein the instructions are executable by the processor to cause the smartphone to change the first field of view based upon the discerned emotion indicated by the user.
5. The smartphone of claim 2 , further comprising a light source capable of illuminating the first field of view, wherein a parameter associated with the first data capture operation includes an illumination state of the light source, and wherein the instructions are executable by the processor to cause the smartphone to change the illumination state of the light source based upon the discerned emotion indicated by the user.
6. The smartphone of claim 2 , wherein the second sensor includes a second camera having a second field of view relative to the smartphone, the second field of view being different from the first field of view.
7. The smartphone of claim 2 , wherein the instructions are executable by the processor to cause the smartphone to perform at least one operation selected from the group consisting of an information decoding operation and a pattern recognition operation.
8. The smartphone of claim 1 , wherein the first sensor includes a microphone.
9. The smartphone of claim 8 , wherein a parameter associated with the processing operation includes applying a filter based on audio data captured by the microphone during the first data capture operation.
10. The smartphone of claim 1 , wherein the second sensor includes a camera.
11. The smartphone of claim 1 , wherein the second sensor includes an accelerometer.
12. The smartphone of claim 1 , wherein the output device includes a display.
13. The smartphone of claim 12 , wherein the instructions are executable by the processor to cause the smartphone to perform a processing operation including displaying the captured first data on the display as an image.
14. The smartphone of claim 13 , wherein the instructions are executable by the processor to cause the smartphone to capture the displayed image based upon the discerned emotion indicated by the user.
15. The smartphone of claim 14 , wherein the instructions are executable by the processor to cause the smartphone to capture the displayed image when the analysis discerns that the user is smiling.
16. The smartphone of claim 1 , wherein the instructions are executable by the processor to cause the smartphone to change a parameter associated with at least one of the first data capture operation and the processing operation data when the analysis discerns a negative emotion indicated by the user.
17. The smartphone of claim 1 , wherein the instructions are further executable by the processor to cause the smartphone to:
perform a third data capture operation using the second sensor before a result associated with the processing operation is presented;
analyze the second data captured during the third data capture operation to discern an emotion indicated by the user; and
change the parameter associated with at least one of the first data capture operation and the processing operation data based upon a change in the discerned emotion indicated by the user from the third data capture operation to the second data capture operation.
18. The smartphone of claim 17 , wherein the instructions are executable by the processor to cause the smartphone to change a parameter associated with at least one of the first data capture operation and the processing operation data when the analysis discerns that the emotion of the user changed negatively.
19. A non-transitory computer readable medium containing instructions for use with a device having a first sensor capable of capturing first data indicative of at least one condition in an environment surrounding the device, a second sensor capable of capturing second data indicative of an emotional state of a user of the device, and an output device, wherein said instructions—if executed by a processor in said device—cause the device to perform acts including:
perform a first data capture operation using the first sensor;
perform a processing operation on the first data captured during the first data capture operation and present an associated result to the user via the output device;
perform a second data capture operation using the second sensor after the associated result is presented;
analyze the second data captured during the first data capture operation to discern an emotion indicated by the user; and
change a parameter associated with at least one of the first data capture operation and the processing operation data based upon the discerned emotion indicated by the user.
20. A method comprising:
capturing first data indicative of at least one conditions in an environment surrounding a user of a device;
processing the captured first data and presenting an associated result to the user;
capturing second data indicative of an emotional state of the user;
analyzing the captured second data to discern an emotion indicated by the user; and
changing a parameter associated with at least one of the capturing of the first data and the processing of the captured first data based upon the discerned emotion indicated by the user.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/058,595 US20140148219A1 (en) | 2011-08-17 | 2013-10-21 | Emotional illumination, and related arrangements |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/212,119 US8564684B2 (en) | 2011-08-17 | 2011-08-17 | Emotional illumination, and related arrangements |
| US14/058,595 US20140148219A1 (en) | 2011-08-17 | 2013-10-21 | Emotional illumination, and related arrangements |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/212,119 Continuation US8564684B2 (en) | 2011-08-17 | 2011-08-17 | Emotional illumination, and related arrangements |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20140148219A1 true US20140148219A1 (en) | 2014-05-29 |
Family
ID=47712399
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/212,119 Expired - Fee Related US8564684B2 (en) | 2011-08-17 | 2011-08-17 | Emotional illumination, and related arrangements |
| US14/058,595 Abandoned US20140148219A1 (en) | 2011-08-17 | 2013-10-21 | Emotional illumination, and related arrangements |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/212,119 Expired - Fee Related US8564684B2 (en) | 2011-08-17 | 2011-08-17 | Emotional illumination, and related arrangements |
Country Status (1)
| Country | Link |
|---|---|
| US (2) | US8564684B2 (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160127641A1 (en) * | 2014-11-03 | 2016-05-05 | Robert John Gove | Autonomous media capturing |
| US20190207992A1 (en) * | 2017-12-29 | 2019-07-04 | Facebook, Inc. | Systems and methods for sharing content |
| US20220329678A1 (en) * | 2021-03-02 | 2022-10-13 | Apple Inc. | Handheld electronic device |
| US12238401B2 (en) | 2020-03-06 | 2025-02-25 | Apple Inc. | Housing structure for handheld electronic device |
Families Citing this family (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2758956B1 (en) | 2011-09-23 | 2021-03-10 | Digimarc Corporation | Context-based smartphone sensor logic |
| US9348479B2 (en) * | 2011-12-08 | 2016-05-24 | Microsoft Technology Licensing, Llc | Sentiment aware user interface customization |
| US9378290B2 (en) | 2011-12-20 | 2016-06-28 | Microsoft Technology Licensing, Llc | Scenario-adaptive input method editor |
| EP2864856A4 (en) | 2012-06-25 | 2015-10-14 | Microsoft Technology Licensing Llc | SEIZURE METHOD EDITOR APPLICATION PLATFORM |
| WO2014032244A1 (en) | 2012-08-30 | 2014-03-06 | Microsoft Corporation | Feature-based candidate selection |
| US9104467B2 (en) | 2012-10-14 | 2015-08-11 | Ari M Frank | Utilizing eye tracking to reduce power consumption involved in measuring affective response |
| US9477993B2 (en) | 2012-10-14 | 2016-10-25 | Ari M Frank | Training a predictor of emotional response based on explicit voting on content and eye tracking to verify attention |
| US20150035952A1 (en) * | 2013-08-05 | 2015-02-05 | Samsung Electronics Co., Ltd. | Photographing apparatus, display apparatus, photographing method, and computer readable recording medium |
| EP3030982A4 (en) | 2013-08-09 | 2016-08-03 | Microsoft Technology Licensing Llc | Input method editor providing language assistance |
| KR102063102B1 (en) * | 2013-08-19 | 2020-01-07 | 엘지전자 주식회사 | Mobile terminal and control method for the mobile terminal |
| IL229115A0 (en) * | 2013-10-28 | 2014-03-31 | Safe Code Systems Ltd | Real - time presence verification |
| US20150215514A1 (en) * | 2014-01-24 | 2015-07-30 | Voxx International Corporation | Device for wirelessly controlling a camera |
| US9311639B2 (en) | 2014-02-11 | 2016-04-12 | Digimarc Corporation | Methods, apparatus and arrangements for device to device communication |
| US9269009B1 (en) * | 2014-05-20 | 2016-02-23 | Amazon Technologies, Inc. | Using a front-facing camera to improve OCR with a rear-facing camera |
| DE102014222426B4 (en) * | 2014-11-04 | 2025-06-26 | Bayerische Motoren Werke Aktiengesellschaft | Radio key for adjusting the configuration of a means of transport |
| US10180339B1 (en) | 2015-05-08 | 2019-01-15 | Digimarc Corporation | Sensing systems |
| US10885915B2 (en) | 2016-07-12 | 2021-01-05 | Apple Inc. | Intelligent software agent |
| US11816678B2 (en) | 2020-06-26 | 2023-11-14 | Capital One Services, Llc | Systems and methods for providing user emotion information to a customer service provider |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5839000A (en) * | 1997-11-10 | 1998-11-17 | Sharp Laboratories Of America, Inc. | Automatic zoom magnification control using detection of eyelid condition |
| US6614466B2 (en) * | 2001-02-22 | 2003-09-02 | Texas Instruments Incorporated | Telescopic reconstruction of facial features from a speech pattern |
| US20080174570A1 (en) * | 2006-09-06 | 2008-07-24 | Apple Inc. | Touch Screen Device, Method, and Graphical User Interface for Determining Commands by Applying Heuristics |
| US20080212831A1 (en) * | 2007-03-02 | 2008-09-04 | Sony Ericsson Mobile Communications Ab | Remote control of an image capturing unit in a portable electronic device |
| US20090041428A1 (en) * | 2007-08-07 | 2009-02-12 | Jacoby Keith A | Recording audio metadata for captured images |
| US20100110265A1 (en) * | 2008-11-05 | 2010-05-06 | Sony Corporation | Imaging apparatus and display control method thereof |
Family Cites Families (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6947571B1 (en) * | 1999-05-19 | 2005-09-20 | Digimarc Corporation | Cell phones with optical capabilities, and related applications |
| US6590996B1 (en) * | 2000-02-14 | 2003-07-08 | Digimarc Corporation | Color adaptive watermarking |
| US6185534B1 (en) * | 1998-03-23 | 2001-02-06 | Microsoft Corporation | Modeling emotion and personality in a computer user interface |
| JP3970520B2 (en) * | 1998-04-13 | 2007-09-05 | アイマティック・インターフェイシズ・インコーポレイテッド | Capturing facial movements based on wavelets to animate a human figure |
| US6714661B2 (en) * | 1998-11-06 | 2004-03-30 | Nevengineering, Inc. | Method and system for customizing facial feature tracking using precise landmark finding on a neutral face image |
| US6990453B2 (en) * | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
| US7359889B2 (en) * | 2001-03-02 | 2008-04-15 | Landmark Digital Services Llc | Method and apparatus for automatically creating database for use in automated media recognition system |
| US20040001616A1 (en) * | 2002-06-27 | 2004-01-01 | Srinivas Gutta | Measurement of content ratings through vision and speech recognition |
| US7665024B1 (en) * | 2002-07-22 | 2010-02-16 | Verizon Services Corp. | Methods and apparatus for controlling a user interface based on the emotional state of a user |
| US7874983B2 (en) * | 2003-01-27 | 2011-01-25 | Motorola Mobility, Inc. | Determination of emotional and physiological states of a recipient of a communication |
| CA2622365A1 (en) * | 2005-09-16 | 2007-09-13 | Imotions-Emotion Technology A/S | System and method for determining human emotion by analyzing eye properties |
| US7804983B2 (en) * | 2006-02-24 | 2010-09-28 | Fotonation Vision Limited | Digital image acquisition control and correction method and apparatus |
| JP2008234401A (en) * | 2007-03-22 | 2008-10-02 | Fujifilm Corp | User interface device and operation control method thereof |
| US20090112616A1 (en) * | 2007-10-30 | 2009-04-30 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Polling for interest in computational user-health test output |
| US20090118593A1 (en) * | 2007-11-07 | 2009-05-07 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Determining a demographic characteristic based on computational user-health testing of a user interaction with advertiser-specified content |
| JP4600435B2 (en) * | 2007-06-13 | 2010-12-15 | ソニー株式会社 | Image photographing apparatus, image photographing method, and computer program |
| JP2009010776A (en) * | 2007-06-28 | 2009-01-15 | Sony Corp | Imaging apparatus, imaging control method, and program |
| US20090002178A1 (en) * | 2007-06-29 | 2009-01-01 | Microsoft Corporation | Dynamic mood sensing |
| US8805110B2 (en) * | 2008-08-19 | 2014-08-12 | Digimarc Corporation | Methods and systems for content processing |
| US8615290B2 (en) * | 2008-11-05 | 2013-12-24 | Apple Inc. | Seamlessly embedded heart rate monitor |
| US9117268B2 (en) * | 2008-12-17 | 2015-08-25 | Digimarc Corporation | Out of phase digital watermarking in two chrominance directions |
| US8886206B2 (en) * | 2009-05-01 | 2014-11-11 | Digimarc Corporation | Methods and systems for content processing |
| US8390680B2 (en) * | 2009-07-09 | 2013-03-05 | Microsoft Corporation | Visual representation expression based on player expression |
| US20110013034A1 (en) * | 2009-07-15 | 2011-01-20 | Mediatek Inc. | Method for operating digital camera and digital camera using the same |
| KR101078057B1 (en) * | 2009-09-08 | 2011-10-31 | 주식회사 팬택 | Mobile terminal had a function of photographing control and photographing control system used image recognition technicque |
| US20120154633A1 (en) * | 2009-12-04 | 2012-06-21 | Rodriguez Tony F | Linked Data Methods and Systems |
| US20120004575A1 (en) * | 2010-06-30 | 2012-01-05 | Sony Ericsson Mobile Communications Ab | System and method for indexing content viewed on an electronic device |
| US20120046071A1 (en) * | 2010-08-20 | 2012-02-23 | Robert Craig Brandis | Smartphone-based user interfaces, such as for browsing print media |
-
2011
- 2011-08-17 US US13/212,119 patent/US8564684B2/en not_active Expired - Fee Related
-
2013
- 2013-10-21 US US14/058,595 patent/US20140148219A1/en not_active Abandoned
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5839000A (en) * | 1997-11-10 | 1998-11-17 | Sharp Laboratories Of America, Inc. | Automatic zoom magnification control using detection of eyelid condition |
| US6614466B2 (en) * | 2001-02-22 | 2003-09-02 | Texas Instruments Incorporated | Telescopic reconstruction of facial features from a speech pattern |
| US20080174570A1 (en) * | 2006-09-06 | 2008-07-24 | Apple Inc. | Touch Screen Device, Method, and Graphical User Interface for Determining Commands by Applying Heuristics |
| US20080212831A1 (en) * | 2007-03-02 | 2008-09-04 | Sony Ericsson Mobile Communications Ab | Remote control of an image capturing unit in a portable electronic device |
| US20090041428A1 (en) * | 2007-08-07 | 2009-02-12 | Jacoby Keith A | Recording audio metadata for captured images |
| US20100110265A1 (en) * | 2008-11-05 | 2010-05-06 | Sony Corporation | Imaging apparatus and display control method thereof |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160127641A1 (en) * | 2014-11-03 | 2016-05-05 | Robert John Gove | Autonomous media capturing |
| US10334158B2 (en) * | 2014-11-03 | 2019-06-25 | Robert John Gove | Autonomous media capturing |
| US11509817B2 (en) * | 2014-11-03 | 2022-11-22 | Robert John Gove | Autonomous media capturing |
| US20230156319A1 (en) * | 2014-11-03 | 2023-05-18 | Robert John Gove | Autonomous media capturing |
| US12149819B2 (en) * | 2014-11-03 | 2024-11-19 | Robert John Gove | Autonomous media capturing |
| US20190207992A1 (en) * | 2017-12-29 | 2019-07-04 | Facebook, Inc. | Systems and methods for sharing content |
| US10805367B2 (en) * | 2017-12-29 | 2020-10-13 | Facebook, Inc. | Systems and methods for sharing content |
| US12238401B2 (en) | 2020-03-06 | 2025-02-25 | Apple Inc. | Housing structure for handheld electronic device |
| US20220329678A1 (en) * | 2021-03-02 | 2022-10-13 | Apple Inc. | Handheld electronic device |
| US12088748B2 (en) * | 2021-03-02 | 2024-09-10 | Apple Inc. | Handheld electronic device |
Also Published As
| Publication number | Publication date |
|---|---|
| US8564684B2 (en) | 2013-10-22 |
| US20130044233A1 (en) | 2013-02-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8564684B2 (en) | Emotional illumination, and related arrangements | |
| US11102398B2 (en) | Distributing processing for imaging processing | |
| US11743571B2 (en) | Electronic device and operating method thereof | |
| KR102664688B1 (en) | Method for providing shoot mode based on virtual character and electronic device performing thereof | |
| KR102598109B1 (en) | Electronic device and method for providing notification relative to image displayed via display and image stored in memory based on image analysis | |
| RU2649773C2 (en) | Controlling camera with face detection | |
| US9131150B1 (en) | Automatic exposure control and illumination for head tracking | |
| KR102560689B1 (en) | Method and apparatus for displaying an ar object | |
| US9436870B1 (en) | Automatic camera selection for head tracking using exposure control | |
| KR102707773B1 (en) | Apparatus and method for displaying graphic elements according to object | |
| CN103916591A (en) | Device with camera and method for capturing images | |
| US9846956B2 (en) | Methods, systems and computer-readable mediums for efficient creation of image collages | |
| KR101434533B1 (en) | System for filming camera using appreciate gesture of finger and method therefor | |
| CN113536866B (en) | A person tracking display method and electronic device | |
| CN107395957B (en) | Photographing method, device, storage medium and electronic device | |
| CN108156376A (en) | Image acquisition method, device, terminal and storage medium | |
| CN108038431A (en) | Image processing method, image processing device, computer equipment and computer readable storage medium | |
| US20230224574A1 (en) | Photographing method and apparatus | |
| CN113744172A (en) | Document image processing method and device and training sample generation method and device | |
| EP4258649A1 (en) | Method for determining tracking target, and electronic device | |
| CN107360371B (en) | Automatic photographing method | |
| CN116709013A (en) | Terminal equipment control method, terminal equipment control device and storage medium | |
| CN113749614A (en) | Skin detection method and device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: DIGIMARC CORPORATION, OREGON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAI, YANG;REEL/FRAME:032155/0685 Effective date: 20140124 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |