[go: up one dir, main page]

WO2018107566A1 - Processing method and mobile device - Google Patents

Processing method and mobile device Download PDF

Info

Publication number
WO2018107566A1
WO2018107566A1 PCT/CN2017/072496 CN2017072496W WO2018107566A1 WO 2018107566 A1 WO2018107566 A1 WO 2018107566A1 CN 2017072496 W CN2017072496 W CN 2017072496W WO 2018107566 A1 WO2018107566 A1 WO 2018107566A1
Authority
WO
WIPO (PCT)
Prior art keywords
quadrilateral
image
text
preset
mobile device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2017/072496
Other languages
French (fr)
Chinese (zh)
Inventor
郜文美
张运超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201780009378.XA priority Critical patent/CN108604128B/en
Publication of WO2018107566A1 publication Critical patent/WO2018107566A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language

Definitions

  • the embodiments of the present invention relate to the field of communications technologies, and in particular, to a processing method and a mobile device.
  • the document correction function can quickly locate the target content (for example, business cards, books, billboards, etc.) in the captured image, and the document translation function can be used for the user. Translate the text on the target content in the shooting screen.
  • the smart terminal When implementing the above functions such as document correction and document translation, it is necessary to locate the target content in the shooting screen.
  • the smart terminal first determines one or more candidate quadrilaterals from the captured image, and further, in the candidate quadrilateral, a candidate quadrangle that is close to the center of the photographing screen and has a larger area and a shape closer to a rectangle is used as the target quadrilateral, thereby The content of the picture in the quadrilateral is used as the target content.
  • the positioning method of the above target content requires the user to manually adjust the shooting screen, especially when the user includes multiple candidate quadrilaterals in the shooting screen, and if the user does not perform left and right or backward movement, zooming the shooting screen, etc., the user calibrates the shooting screen.
  • the target content of the smart terminal positioning is likely not the content that the user wants, that is, the target content content in the captured image cannot be accurately and conveniently located.
  • Embodiments of the present invention provide a processing method and a mobile device, which can accurately and conveniently locate a target content content in a captured picture.
  • the first aspect provides a processing method, including: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; determining that the user's line of sight focus stays on the first quadrilateral (the first four sides) The duration in the shape of the at least two quadrilaterals is greater than a preset duration; the contour of the first quadrilateral is highlighted in the preview frame; after the first photographing instruction is detected, the first four sides are After performing the preset processing on the first image surrounded by the contour line, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or, the first image The text in the text performs OCR.
  • the mobile device determines, by detecting the focus of the user's line of sight, which quadrant in the preview frame the user's line of sight stays for a longer time, because the line of sight stays longer in the quadrilateral.
  • the image is usually the target content that the user needs, then the mobile device highlights the edge line of the quadrilateral, thereby prompting the user that the user has already
  • the first image in the quadrilateral is determined as the target content.
  • the mobile device may directly output the first image after the preset processing. In this way, the mobile device can accurately and conveniently locate the target content in the preview frame for the user without manually adjusting the preview frame.
  • the method further includes: determining that the duration of the user's line of sight staying in the second quadrilateral is greater than the preset duration,
  • the second quadrilateral is any quadrilateral except the first quadrilateral of the at least two quadrilaterals; the outline of the first quadrilateral is normally displayed in the preview frame; the second is highlighted in the preview frame
  • the outline of the quadrilateral that is to say, when the user's line of sight changes, the mobile device can further display the contour of the second quadrilateral where the changed line of sight focus is displayed for the user.
  • the method further includes: after detecting the second photographing instruction, enclosing the outline of the second quadrilateral After the second image performs the preset processing, the second image subjected to the preset processing is output.
  • the method further includes: displaying a contour of a default quadrilateral in the preview frame, wherein the default quadrilateral is a quadrilateral having the largest area among the at least two quadrilaterals. That is to say, before detecting the user's line of sight focus, the mobile device may first use the quadrilateral with the largest area as the default quadrilateral, and present the image surrounded by the outline of the default quadrilateral as the target content to the user.
  • the method before displaying the preview frame in the preview frame of the mobile device, the method further includes: performing quadrilateral detection on the screen content in the preview frame, and storing the detected vertex coordinates of the at least two quadrilaterals.
  • determining that a user's line of sight focus stays within the first quadrilateral is greater than a preset duration includes: detecting a focus position of the line of sight focus within the preview frame; determining that the focus position is in a preview The duration in the first quadrilateral of the frame is greater than the preset duration.
  • performing OCR on the text in the first image includes: translating the text in the first image into a text in a preset language, and replacing the text in the first image with a translation And text of the preset language; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image, thereby implementing the document translation function.
  • the preset processing further includes: performing a cropping operation on the first image.
  • the second aspect provides a processing method, including: performing quadrilateral detection on a preview frame to be displayed, saving the detected vertex coordinates of at least two quadrilaterals; displaying the preview frame in a preview box of the mobile device; determining that the line of sight focus is located in the The first quadrilateral of the at least two quadrilaterals exceeds a preset duration, the outline of the first quadrilateral is presented in the preview frame; an instruction for photographing is detected; the first four in the preview frame After performing the preset processing on the first image surrounded by the outline of the edge, outputting the first image subjected to the preset processing, and the first image passing through the preset processing includes the contour line of the first quadrilateral
  • the preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the first image, translating the recognized text into a text of a preset language, and the recognized text Replaced with the text of the preset language; in the first image
  • the text performs optical character recognition
  • the method further includes: determining that a line of sight focus is within a second quadrilateral of the at least two quadriles beyond a preset duration, and presenting the contour of the second quadrilateral in the preview frame a line; detecting an instruction for photographing; after performing the preset processing on the second image surrounded by the outline of the second quadrilateral in the preview frame, outputting the second image subjected to the preset processing, the passing The second image of the preset process includes a second content surrounded by the outline of the second quadrilateral.
  • a processing method including: performing quadrilateral detection on a preview frame to be displayed, and storing the detected vertex coordinates of at least two quadrilaterals; displaying the preview frame in a preview frame of the mobile device, where the preview frame is presented a contour line of the first quadrilateral of the at least two quadrilaterals; determining that the line of sight focus is within the second quadrilateral of the at least two quadrilaterals for more than a preset duration, and presenting the second quadrilateral in the preview frame a contour that hides a contour of the first quadrilateral, the second quadrilateral being any one of the at least two quadrilaterals except the first quadrilateral; an instruction for photographing is detected; After performing a preset process on the second image surrounded by the outline of the second quadrilateral in the preview frame, outputting the second image that is subjected to the preset process, and the second image that is subjected to the preset process includes the first image
  • a fourth aspect provides a processing method, comprising: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; detecting an operation acting on the first quadrilateral, the first four sides Forming one of the at least two quadrilaterals; in response to the operation, after performing a preset process on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset processing;
  • the preset process includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. It should be understood that the operation includes, but is not limited to, clicking, double clicking, tapping, long pressing or repressing.
  • performing optical character recognition on the text in the first image includes: translating the text in the first image into a text in a preset language, and replacing the text in the first image The translated text of the preset language; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image.
  • the preset processing further includes: performing a cropping operation on the first image.
  • a fifth aspect provides a processing method, including: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; detecting a first operation acting on the first quadrilateral, the first a quadrilateral is one of the at least two quadrilaterals; in response to the first operation, highlighting the contour of the first quadrilateral; detecting a second operation acting on the camera key; in response to the second operation, After performing a preset process on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset process; the preset process includes: performing quadrilateral correction on the first image; Alternatively, optical character recognition is performed on the text in the first image.
  • the first operation includes but is not limited to: clicking, double clicking, tapping, long pressing or re-pressing.
  • the second operation includes but is not limited to: clicking, tapping, long pressing or re-pressing.
  • the preset processing further includes: performing a cropping operation on the first image.
  • a sixth aspect provides a mobile device, comprising: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a determining unit, configured to determine that a user's line of sight focus stays at The duration of the first quadrilateral is greater than a preset duration, the first quadrilateral being one of the at least two quadrilaterals; the display unit is further configured to highlight the first quadrilateral in the preview frame a contour processing unit, configured to: after detecting the first photographing instruction, performing a preset process on the first image surrounded by the outline of the first quadrilateral, and outputting the first image subjected to the preset processing;
  • the preset process includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image.
  • the method further includes determining that a duration in which the user's line of sight focus stays in the second quadrilateral is greater than the preset duration, and the second quadrilateral is the first one of the at least two quadrilaterals Any one of the quadrilaterals outside the quadrilateral; the display unit is further configured to normally display the outline of the first quadrilateral in the preview frame; and highlight the outline of the second quadrilateral in the preview frame.
  • the processing unit is further configured to: after detecting the second photographing instruction, perform the preset processing on the second image surrounded by the outline of the second quadrilateral, and output the passing through The second image of the preset process.
  • the display unit is further configured to highlight a contour of a default quadrilateral in the preview frame, where the default quadrilateral is the largest quadrilateral of the at least two quadrilaterals.
  • the mobile device further includes: a detecting unit, configured to perform quadrilateral detection on the preview frame, and save the detected vertex coordinates of the at least two quadrilaterals.
  • the determining unit is specifically configured to detect a focus position of the line of sight focus in the preview frame, and determine that the focus is located in the first quadrilateral for a longer duration than the preset duration.
  • the processing unit is specifically configured to translate the text in the first image into a text in a preset language, and replace the text in the first image with the translated language in the preset language. Text; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image.
  • a seventh aspect provides a mobile device, comprising: a detecting unit, configured to perform quadrilateral detection on a preview frame to be displayed, save coordinates of at least two quadrilateral vertices detected; and display unit, in a preview box of the mobile device And displaying, in the preview frame, the first quadrilateral a detection unit configured to detect an instruction for photographing; a processing unit configured to perform a first image surrounded by the outline of the first quadrilateral in the preview frame After the line preset processing, the first image subjected to the preset processing is output, and the first image that is subjected to the preset processing includes the first content surrounded by the outline of the first quadrilateral,
  • the preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the first image, translating the recognized text into a text of a preset language, and replacing the recognized text with the preset language a text; performing optical character recognition on the text in the first image, translating the recognized text into a text of
  • An eighth aspect provides a mobile device, comprising: a detecting unit, configured to perform quadrilateral detection on a preview frame to be displayed, save coordinates of the detected at least two quadrilateral vertices; and display unit, in a preview box of the mobile device
  • the preview frame is displayed, wherein the preview frame presents an outline of the first quadrilateral of the at least two quadrilaterals; and the determining unit is configured to determine that the line of sight focus is located in the second four of the at least two quadrilaterals
  • the contour of the second quadrilateral is presented in the preview frame to hide the contour line of the first quadrilateral, and the second quadrilateral is the at least two Any one of the quadrilaterals except the first quadrilateral;
  • the detecting unit is further configured to detect an instruction for photographing; and the processing unit is configured to: contour the second quadrilateral in the preview frame After performing the preset processing on the second image surrounded by the line, the second image subjected to the preset processing is
  • a ninth aspect provides a mobile device, including: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a detecting unit configured to detect a role a quadrilateral operation, the first quadrilateral being one of the at least two quadrilaterals; a processing unit responsive to the operation, surrounding the outline of the first quadrilateral After the first image performs the preset processing, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or performing execution on the text in the first image Optical character recognition.
  • a tenth aspect provides a mobile device, comprising: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a detecting unit, configured to detect a role a first operation of a quadrilateral, the first quadrilateral being one of the at least two quadrilaterals; the display unit further configured to highlight the first quadrilateral in response to the first operation a contouring unit; the detecting unit is further configured to detect a second operation acting on the camera key; the processing unit, configured to, in response to the second operation, the first surrounded by the contour line of the first quadrilateral After the image performs the preset processing, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or performing optical characters on the text in the first image Identification.
  • the eleventh aspect provides a mobile device comprising: a processor, a memory, a bus, and a communication An interface for storing a computer execution instruction, the processor being coupled to the memory via the bus, the processor executing the computer-executable instruction stored by the memory when the mobile device is running, to cause the mobile device to perform the arbitrary A treatment method.
  • a twelfth aspect provides a computer readable storage medium for storing computer software instructions for use in the mobile device described above, comprising a program designed to perform the above aspects for the mobile device.
  • a thirteenth aspect provides a computer program comprising instructions which, when executed by a computer, cause a computer to perform the processing method of any of the above.
  • FIG. 1 is a schematic diagram of an application scenario of a preview frame according to an embodiment of the present disclosure
  • FIG. 2 is a schematic diagram of an application scenario of a built-in camera according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram 1 of a mobile device according to an embodiment of the present disclosure.
  • FIG. 4 is a schematic diagram 1 of an application scenario of a processing method according to an embodiment of the present disclosure
  • FIG. 5 is a schematic flowchart 1 of a processing method according to an embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram 2 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram 3 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram 4 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 9 is a schematic diagram 5 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 10 is a schematic diagram 6 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 11 is a schematic diagram 7 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 12 is a schematic diagram 8 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 13 is a second schematic flowchart of a processing method according to an embodiment of the present disclosure.
  • FIG. 14 is a schematic diagram 9 of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 15 is a schematic flowchart 3 of a processing method according to an embodiment of the present disclosure.
  • FIG. 16 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 17 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention.
  • FIG. 18 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention.
  • FIG. 19 is a schematic flowchart 4 of a processing method according to an embodiment of the present disclosure.
  • FIG. 20 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present disclosure.
  • FIG. 21 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention.
  • FIG. 22 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention.
  • FIG. 23 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention.
  • FIG. 24 is a schematic structural diagram 2 of a mobile device according to an embodiment of the present disclosure.
  • FIG. 25 is a schematic structural diagram 3 of a mobile device according to an embodiment of the present invention.
  • first and second are used for descriptive purposes only, and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated.
  • features defining “first” and “second” may include one or more of the features either explicitly or implicitly.
  • the meaning of "a plurality" is two or more unless otherwise specified.
  • the preview frame refers to a single-frame shooting picture of a minimum unit captured by the mobile device during the shooting process, and the shooting picture can be displayed in the display interface of the mobile device, for example, using the mobile phone as the mobile device as an example.
  • a preview frame 12 can be displayed in the display interface of the mobile phone, and then the mobile phone captures the captured image captured by the camera in the preview frame 12 in real time.
  • Each shot in the preview frame 12 can be used as a preview frame 11.
  • the above-mentioned mobile device may also be an Augmented Reality (AR) device or a portable device (for example, a mobile phone) such as a wearable eyeglass, a wearable helmet, and the like.
  • AR Augmented Reality
  • portable device for example, a mobile phone
  • the built-in camera 13 of the mobile device refers to a camera that is opposite to the user's line of sight when the user views the display screen of the mobile device.
  • the built-in camera 13 can generally capture the state of the user's eyeball during work, for example, the front camera of the mobile phone (as shown in the figure) 2 (a)), or, in a wearable eyeglass (shown as (b) in FIG. 2) or a camera provided in a wearable helmet, the built-in camera 13 may specifically be an infrared camera, R (red, The red (green) green (green) B (blue, blue) camera or the like is not limited in this embodiment of the present invention.
  • the external camera of the mobile device refers to the camera set on the side facing away from the display when the user views the display screen of the mobile device, for example, the rear camera of the mobile phone or the camera set outside the wearable glasses or the wearable helmet.
  • the external camera may specifically be an RGB camera, a Mono (black and white) camera, or a depth camera, etc., which is not limited in this embodiment of the present invention.
  • Embodiments of the present invention provide a processing method, which can be applied to any head mounted electronic device, for example, Augmented Reality (AR) glasses, Virtual Reality (VR) helmets, etc., and can also be applied to Any handheld electronic device, such as a mobile phone, a tablet computer, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a personal digital assistant (PDA), etc., in the embodiment of the present invention There are no restrictions on this.
  • AR Augmented Reality
  • VR Virtual Reality
  • Any handheld electronic device such as a mobile phone, a tablet computer, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a personal digital assistant (PDA), etc.
  • the structure of the mobile device is shown in Figure 3.
  • the mobile device 300 includes components such as a communication interface 320, a memory 330, an input unit 340, a display unit 350, a built-in camera 13, an external camera 14, an audio circuit 370, and a processor 380. It should be understood that the structure of the mobile device 300 illustrated in FIG. 3 does not constitute a limitation to the mobile device, which may include more or fewer components than those illustrated, or some components, or other different components.
  • the components of the mobile device 300 will be specifically described below with reference to FIG. 3:
  • the communication interface 320 is configured to receive and transmit signals during and after receiving or transmitting information or during a call.
  • communication interfaces include, but are not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, and the like.
  • LNA low noise amplifier
  • the memory 330 can be used to store software programs and modules, and the processor 380 executes various functional applications and data processing of the mobile device 300 by running software programs and modules stored in the memory 330.
  • the memory 330 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a camera function, an image playing function, etc.), and the like; the storage data area may be stored according to the movement.
  • the data created by the use of the device 300 (such as audio data, image data, phone book, etc.) and the like.
  • the input unit 340 can be configured to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the mobile device 300.
  • the input unit 340 can include a touch screen 341 as well as other input devices 342.
  • the touch screen 341 also referred to as a touch panel, can detect a touch operation on or near the user and drive the corresponding connection device according to a preset program.
  • the input unit 340 may also include other input devices 342.
  • other input devices 342 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, power switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • the display unit 350 can be used to display information input by the user or information provided to the user and various menus of the mobile device 300.
  • the display unit 350 may include a display panel 351. Further, when the touch screen 341 detects a touch operation on or near it, it can be transmitted to the processor 380 to determine the type of the touch event, and then the processor 380 provides a corresponding visual output on the display panel 351 according to the type of the touch event. .
  • Audio circuitry 370, speaker 371, microphone 372 can provide an audio interface between the user and mobile device 300.
  • the audio circuit 370 can transmit the converted electrical data of the received audio data to the speaker 371 for conversion to the sound signal output by the speaker 371; on the other hand, the microphone 372 converts the detected sound signal into an electrical signal by the audio circuit.
  • the 370 is converted to audio data after reception, and the audio data is output to the communication interface 320 for transmission to, for example, another mobile device, or the audio data is output to the memory 330 for further processing.
  • Processor 380 is the control center of mobile device 300, which connects various portions of the entire mobile device using various interfaces and lines, by running or executing software programs and/or modules stored in memory 330, and by calling stored in memory 330. Data, performing various functions and processing data of the mobile device 300, thereby performing overall monitoring of the mobile device 300.
  • processor 380 can include one or more processing units.
  • the mobile device 300 can also include at least one built-in camera 13 and at least one external camera 14.
  • the built-in camera 13 or the external camera 14 can transmit the captured preview frame to the processor 380. After the image processing is performed on the preview frame by the processor 380, the processed image is sent to the display unit 350 for display.
  • the processor 380 can call the external camera 14 to acquire the current preview frame 11 and display the preview frame 11 in the display unit 350. Further, the processor 380 can call the preset in the memory 330.
  • the quadrilateral detection algorithm acquires vertex coordinates of at least two quadrilaterals in the preview frame 11, for example, the vertex coordinates of the quadrilateral A in FIG. 4, the vertex coordinates of the quadrilateral B, the vertex coordinates of the quadrilateral C, and the vertex coordinates of the quadrilateral D.
  • the processor 380 can call the built-in camera 13 to detect the user's line of sight, determine the duration of the user's line of sight focus staying in the preview frame 11 and the position of the stay. If the user's line of sight focus stays in the quadrilateral B for longer than the preset duration, Then, as shown in FIG. 4, the processor 380 highlights the outline of the quadrilateral B in the preview frame 11 through the display unit 350, that is, the processor 380 uses the first image surrounded by the outline of the quadrilateral B as the preview image. Target content.
  • the processor 380 detects the first photographing instruction, the processor 380 performs preset processing on the first image, and outputs the preset processed first image in the display unit 350.
  • the preset processing may specifically be: performing quadrilateral correction on the first image, for example, performing tilt correction on the first image;
  • the instruction is triggered when the user initiates the document translation function, and the preset processing may be: performing optical character recognition (OCR) on the text in the first image, for example, translating the Chinese characters in the first image into English.
  • OCR optical character recognition
  • the mobile device determines, by detecting the focus of the user's line of sight, which quadrant in the preview frame the user's line of sight stays for a longer time, because the line of sight stays longer in the quadrilateral.
  • the image is usually the target content that the user needs.
  • the mobile device highlights the edge line of the quadrilateral, thereby prompting the user that the first image in the quadrilateral has been determined as the target content for the user, and subsequently, the mobile device detects the photo.
  • the first image subjected to the preset processing may be directly output.
  • the mobile device can accurately and conveniently locate the target content in the preview frame without the user manually adjusting the preview frame, and the mobile device only needs to perform the preset processing on the first image in the preview frame, thereby avoiding Processing of invalid data outside the first image, thereby improving image processing efficiency.
  • processing method can be applied to various application scenarios such as document correction and document translation, and can also be applied to various application scenarios based on target content positioning, for example, business card recognition, bank card identification, ID card recognition, and the like.
  • target content positioning for example, business card recognition, bank card identification, ID card recognition, and the like.
  • the embodiment of the present invention does not impose any limitation on this.
  • the mobile device 300 may further include a power source, a gravity sensor, a light sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, and a wireless fidelity ( Wireless fidelity, Wi-Fi module or Bluetooth module, etc., will not be described here.
  • the method includes:
  • the mobile device performs a quadrilateral detection operation on the content in the preview frame.
  • the mobile device saves vertex coordinates of each of the detected at least two quadrilaterals.
  • the mobile device can call the external camera to enter the preview state.
  • the mobile device displays the image captured by the external camera in the form of a preview frame on the mobile device.
  • step 401 the mobile device performs a quadrilateral detection operation on the picture content in the preview frame, thereby determining one or more quadrilaterals included in the preview frame.
  • the edge of the preview frame may be edge-detected, and then the straight line included in the preview frame may be extracted, and finally, according to a preset quadrilateral detection algorithm, for example, based on an open source computer vision library (Open Source Computer Vision) Library, OpenCV) Rectangular recognition algorithm, etc., determines one or more quadrilaterals composed of these straight lines.
  • a preset quadrilateral detection algorithm for example, based on an open source computer vision library (Open Source Computer Vision) Library, OpenCV) Rectangular recognition algorithm, etc., determines one or more quadrilaterals composed of these straight lines.
  • the quadrilateral detection operation may also be performed according to the size of the quadrilateral. Screening, discarding quadrilaterals with an area or perimeter less than the preset value, avoiding these quadrilaterals with smaller areas or perimeters from interfering with subsequent image processing. In addition, it is also possible to perform noise filtering and the like on the screen content in the preview frame to improve the accuracy of the quadrilateral detection operation.
  • step 402 the mobile device saves the vertex coordinates of each of the detected at least two quadrilaterals to determine the position of each quadrilateral detected within the preview frame.
  • the mobile device displays the preview frame in the preview frame 11 of the display unit. If the captured image in the preview frame 11 is an A4 paper, it can be determined that the preview frame is included by the quadrilateral detection operation. Five quadrangles, that is, a quadrilateral 1 formed by the edge of the A4 paper, and a quadrilateral 2-quadrant 5 included in the A4 paper, then the mobile device can detect the vertices of each of the five quadrilaterals by performing quadrilateral detection. Coordinates to determine the position of the five quadrilaterals in the preview frame 11.
  • the mobile device highlights a contour of the default quadrilateral in the preview frame; the default quadrilateral is the quadrilateral having the largest area among the at least two quadrilaterals.
  • the mobile device may determine the quadrilateral with the largest area from the at least two quadrilaterals detected above, and use the quadrilateral as the default quadrilateral to highlight the outline of the default quadrilateral in the preview frame.
  • highlighting refers to the function of highlighting text or image using lines of different colors or different thicknesses; highlighting can be clearly distinguished by visual observation compared with normal display.
  • the mobile device can highlight the outline of the quadrilateral 1 (in FIG. 7).
  • the bold display is indicated as a highlight, so that the user is prompted to display the quadrilateral with the largest area in the current preview frame as the quadrilateral 1, and the content surrounded by the outline of the quadrilateral 1 is the current shift.
  • the default device waiting for the preset processing image.
  • the method for displaying the outline of the default quadrilateral may be various.
  • the highlighting is taken as an example, and the display is also performed by using a dotted line or the like, which is not limited in this embodiment of the present invention.
  • the mobile device detects, according to vertex coordinates of each quadrilateral, a focus position of the user's line of sight focus within the preview frame.
  • the mobile device highlights the contour of the first quadrilateral in the preview frame, where the first quadrilateral is at least two quadrilaterals. one of.
  • the mobile device can capture the motion state of the user's eyeball by calling the built-in camera, and then use the eyeball tracking technology to detect the position of the user's line of sight focus within the preview frame, that is, the focus position.
  • the user and the eyeball and the image around the eyeball can be captured by the built-in camera.
  • the mobile device can track the eyeball and the feature changes around the eyeball to determine the user's line of sight focus position; or, the mobile device can be based on the iris The angle change is tracked to determine the user's line of sight focus position, or the mobile device can actively project a beam of infrared light to the iris to extract feature information for tracking to determine the user's line of sight focus position.
  • two threads can be started in parallel, and one thread is used to determine which current focus position is located according to the vertex coordinates of each quadrature described above. Within the quadrilateral, another thread can be used to record how long the user's line of sight focus stays at that focus position.
  • the user Since the user's line of sight focus may change slightly within a certain period of time, the user may be considered to be the user when the line of sight focus falls within the area centered on the focus position and the radius is R (R is the preset value).
  • the line of sight focus stays at the above focus position.
  • the mobile device determines that the focus position is in the first quadrilateral (the first quadrilateral is one of the at least two quadrilaterals), and the user's line of sight focus stays at the focus position for longer than the preset duration (for example, 2 seconds), indicating that the user's current attention is concentrated in the first quadrilateral.
  • the triggering mobile device displays the outline of the first quadrilateral in the preview frame to prompt the user to move.
  • the current document determined by the device for the user is the first image surrounded by the outline of the first quadrilateral.
  • the mobile device may first determine, according to the vertex coordinates of each quadrilateral, which quadrilateral the focus position is located in, for example, located in the first quadrilateral, and further, determine Whether the duration of the user's line of sight focus staying within the first quadrilateral is greater than a preset duration.
  • the mobile device may first determine whether the duration of the user's line of sight focus staying at the focus position is greater than a preset duration. If the duration is greater than the preset duration, the focus position is further determined according to the coordinates of the vertices of each of the quadrilaterals. Which quadrilateral is located.
  • the mobile device detects that the focus position of the user's line of sight focus in the preview frame is point A by calling the built-in camera. Then, the mobile device can determine the point A of each quadrilateral as described above, and determine that the point A is located in the quadrilateral 2, at this time, if the user's line of sight focus is at the focus position If the duration of the stay is greater than 2 seconds, the mobile device highlights the outline of the quadrilateral 2 in the preview frame (indicated by a bold display in FIG. 8 as a highlight), that is, prompting the user that the current mobile device is the current document determined by the user. An image enclosed by the outline of the quadrilateral 2.
  • the mobile device can also normally display the outline of the default quadrilateral (ie, quadrilateral 1) that has been displayed in step 403.
  • the mobile device continues.
  • the outline of the quadrilateral 1 that has been highlighted in step 403 is displayed.
  • the mobile device After detecting the first photographing instruction, the mobile device performs a preset process on the first image surrounded by the outline of the first quadrilateral, and then outputs the first image subjected to the preset processing.
  • the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile device.
  • the first image subjected to the preset processing is as shown in FIG. 11, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 9;
  • the first image subjected to the preset processing may include a part of the content in the quadrilateral 1 as shown in FIG. 9 in addition to all or part of the content in the quadrilateral 2 in FIG. 9; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.
  • the mobile device may be triggered to generate a first photographing instruction. For example, as shown in FIG. 10, after the user clicks the photographing button in the mobile device display interface, the mobile device A first photographing instruction is generated. Further, after the first photographing instruction is generated, the mobile device photographs the first image surrounded by the outline of the first quadrilateral, and performs preset processing on the first image obtained by photographing.
  • the preview frame 11 shown in FIG. 10 is used as an example. If the user activates the document correction function and then triggers the camera button of the mobile device, the mobile device generates a first camera instruction in response to the operation, and further, according to the first camera instruction. Taking a picture of the first image enclosed by the outline of the quadrilateral 2, and performing quadrilateral correction on the obtained first image, for example, performing tilt correction on the first image, as shown in FIG. 11, obtaining the first image after tilt correction, Then, for the user to display the tilt-corrected first image in the mobile device, the document correction function is completed.
  • the mobile device If the user initiates the document translation function and triggers the camera button of the mobile device, at this time, the mobile device generates a first camera instruction in response to the operation, and further, according to the first camera instruction, the outline surrounded by the outline of the quadrilateral 2 An image is taken.
  • the preset processing at this time is: performing OCR on the text in the first image.
  • the text in the first image may be first recognized by a preset character recognition algorithm to obtain computer characters recognizable by the mobile device, and further, the computer text may be obtained through a preset translation software or program.
  • Translating into the text of the default language for example, translating the Chinese in the first image into English, and subsequently, as shown in (a) of FIG. 12, the translated English can be directly used in the first image to replace the pre-translation.
  • Chinese, or, as shown in (b) of FIG. 12 the translated English may also be added to the first image in the first image, for example, the translated image English is added next to the pre-translation Chinese to complete the document translation function.
  • the foregoing preset processing may further include performing a cropping operation or the like on the first image, which is not limited in this embodiment of the present invention.
  • quadrilateral correction can also be performed on the first image, thereby improving the accuracy of subsequent character recognition and translation.
  • the mobile device may perform the following steps 501-504.
  • the mobile device may periodically determine the position of the user's line of sight focus.
  • the user's line of sight may be determined.
  • the focus position changes.
  • the mobile device may perform the following steps 501-504.
  • the mobile device may perform the following steps 501-504.
  • the distance between the focus positions determined twice consecutive times is less than or equal to the preset distance, it may be determined that the user's line of sight focus position has not changed.
  • the mobile device can continue to perform steps 405-406 above.
  • steps 501-504 include:
  • the mobile device determines that a duration in which the user's line of sight stays in the second quadrilateral is greater than a preset duration, and the second quadrilateral is any one of the at least two quadrilaterals except the first quadrilateral.
  • the mobile device can determine that the new focus position is within the second quadrilateral according to the vertex coordinates of each of the above quadrilaterals. For example, as shown in (a) of FIG. 14, the new focus position C is located in the quadrilateral 5, and further, the mobile device can determine whether the duration of the user's line of sight focus staying within the quadrilateral 5 is greater than a preset duration, if greater than the pre- If the duration is set, the following steps 502-204 are continued.
  • the mobile device displays the outline of the first quadrilateral normally in the preview frame.
  • the mobile device highlights a contour of the second quadrilateral in the preview frame.
  • step 405 since the mobile device has highlighted the contour of the first quadrilateral in the preview frame in step 405, the user's line of sight focus position has changed, new The focus position is located within the second quadrilateral, so the mobile device can normally display the outline of the first quadrilateral within the preview frame, ie, cancel or hide the outline of the first quadrilateral that has been highlighted in step 405, Restores the original display mode of the first quadrilateral in the preview frame; and highlights the outline of the second quadrilateral in the preview frame, thereby prompting the user that the current mobile device determines that the current document is the second quadrilateral The image enclosed by the outline.
  • the mobile device may also perform step 503 and then perform step 502.
  • the mobile device may perform step 502 and step 503 at the same time.
  • the mobile device determines that the C point is within the quadrilateral 5, and the line of sight focus stays within the quadrilateral 5 for longer than the preset duration. Then, the mobile device can cancel or hide the outline of the highlighted quadrilateral 2 in the preview frame 11 (ie, the outline of the normal display quadrilateral 2), and at the same time, highlight the outline of the quadrilateral 5 (in FIG. 14 (a) The bold display is indicated as a highlight), that is, the image that the current mobile device determines for the user that the current document is surrounded by the outline of the quadrilateral 5.
  • the mobile device when the mobile device determines that the C point is within the quadrilateral 5, and the line of sight focus stays within the quadrilateral 5 for longer than the preset duration, the mobile device can also display only the content of the quadrilateral 5 in the preview frame 11.
  • the current document identified by the user for the current mobile device is an image surrounded by the outline of the quadrilateral 5.
  • the image surrounded by the outline of the quadrilateral 5 may occupy the entire preview frame 11 or may be located in a certain area in the preview frame 11; the outline of the quadrilateral 5 may be highlighted or displayed normally, in the embodiment of the present invention. There are no restrictions on this.
  • the mobile device After detecting the second photographing instruction, the mobile device performs a preset process on the second image surrounded by the outline of the second quadrilateral, and outputs a second image that passes the preset processing.
  • the “outputting the second image subjected to the preset processing” as referred to herein may be understood as an image obtained after the user clicks the camera key, and the first image processed through the preset process is saved in the memory of the mobile phone.
  • the second image subjected to the preset processing may include all content or partial content in the quadrilateral 5 as shown in FIG. 14; or the second image subjected to the preset processing may be included in FIG. 14 except In addition to all or part of the content in the quadrilateral 5, part of the content in the quadrilateral 1 in FIG. 14 is included; or the second image subjected to the preset processing may include all the contents in the quadrilateral 5 as shown in FIG. In addition to or in part, it may include other content.
  • the mobile device can be triggered to generate a second photographing instruction. Further, after the second photographing instruction is generated, the mobile device is triggered to take a photo of the second image surrounded by the outline of the second quadrilateral, and perform preset processing on the photographed second image.
  • the embodiment of the present invention further provides a processing method, including:
  • the mobile device displays a preview frame in its preview frame, the preview frame including at least two quadrilaterals.
  • the mobile device can call the external camera to enter the preview state. At this time, the mobile device captures the image captured by the external camera as a preview frame. The form is displayed in the display interface of the mobile device.
  • the mobile device may perform quadrilateral detection on the content of the picture in the preview frame, and save the coordinates of the vertices of each of the detected at least two quadrilaterals, thereby determining one or more quadrilaterals included in the preview frame.
  • the mobile device can determine that the preview frame 11 includes a quadrilateral 1-quadrant 5 by performing quadrilateral detection. At this time, the mobile device highlights the quadrilateral 1-quadrant 5 in the preview frame. Each quad, that is, reminds the user that the user detects a quadrilateral 1-quadrant 5 in the current preview frame. It should be understood that, as an alternative or alternative, the mobile device determines that the preview frame 11 includes a quadrilateral 1-quadrant 5 by performing quadrilateral detection, and the mobile device can also normally display the quadrilateral 1-quadrant 5 in the preview frame.
  • the mobile device detects an operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals.
  • the mobile device In response to the operation, after performing a preset process on the first image surrounded by the contour line of the first quadrilateral, the mobile device outputs the first image that passes the preset process.
  • the user can click on any quadrilateral displayed in the preview frame.
  • the user clicks on any position within the quadrilateral 2, and at this time, the mobile device can detect that the user acts on the quadrilateral.
  • the click operation of 2 triggers the mobile device to perform a preset process on the first image surrounded by the outline of the first quadrilateral, and outputs the first image subjected to the preset process.
  • the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile phone.
  • the first image subjected to the preset processing is as shown in FIG. 18, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 17;
  • the first image subjected to the preset processing may include, in addition to all contents or partial contents in the quadrilateral 2 in FIG. 17, a partial content in the quadrilateral 1 in FIG. 17; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.
  • the foregoing preset processing includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image.
  • performing quadrilateral correction on the first image or performing optical character recognition on the text in the first image.
  • an embodiment of the present invention further provides a processing method, including:
  • the mobile device displays a preview frame in its preview frame, where the preview frame includes at least two quadrilaterals.
  • the mobile device can call the external camera to enter the preview state. At this time, the mobile device can perform quadrilateral detection on the screen content in the preview frame. Determines one or more quads contained within the preview frame and displays the preview frame in its preview frame.
  • the mobile device detects a first operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals.
  • the mobile device highlights the outline of the first quadrilateral.
  • the user can click on any quadrilateral displayed in the preview frame. For example, as shown in FIG. 20, the user clicks on any position within the quadrilateral 2.
  • the mobile device can detect The user acts on the click operation of the quadrilateral 2, triggering the mobile device to highlight the outline of the quadrilateral 2 (indicated by the bold display as a highlight in FIG. 21), thereby prompting the user that the current mobile device is the current document determined by the user. An image enclosed by the outline of the quadrilateral 2.
  • the mobile device detects a second operation that acts on the camera button.
  • the camera button may be a virtual button for triggering the photographing in the display interface, or may be a physical button for triggering the photographing on the mobile device, which is not limited in this embodiment of the present invention.
  • steps 704-705 if the user determines that the first quadrilateral is the target content, as shown in FIG. Then, the camera button on the mobile device can be triggered, that is, the second operation, and further, the mobile device takes a photo of the first image surrounded by the outline of the first quadrilateral in response to the second operation, and the first image obtained by photographing After the preset process is performed, the first image subjected to the preset process is output.
  • the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile phone.
  • the first image subjected to the preset processing is as shown in FIG. 23, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 22;
  • the first image subjected to the preset processing may include a part of the content in the quadrilateral 1 as shown in FIG. 22 in addition to all or part of the content in the quadrilateral 2 in FIG. 22; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.
  • the foregoing preset processing includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image.
  • performing quadrilateral correction on the first image or performing optical character recognition on the text in the first image.
  • the above mobile device or the like includes a hardware structure and/or a software module corresponding to each function.
  • Those skilled in the art will readily appreciate that the embodiments of the present invention can be implemented in a combination of hardware or hardware and computer software in combination with the elements and algorithm steps of the various examples described in the embodiments disclosed herein. Whether a function is implemented in hardware or computer software to drive hardware depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods to implement the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the embodiments of the invention.
  • the embodiment of the present invention may divide a function module into a mobile device or the like according to the foregoing method example.
  • each function module may be divided according to each function, or two or more functions may be integrated into one processing module.
  • the above integrated modules can be implemented in the form of hardware or in the form of software functional modules. It should be noted that the division of the module in the embodiment of the present invention is schematic, and is only a logical function division, and the actual implementation may have another division manner.
  • FIG. 24 shows a possible structural diagram of the mobile device involved in the above embodiment.
  • the mobile device includes a detecting unit 1101, a display unit 1102, a determining unit 1103, and a processing unit 1104.
  • the detecting unit 1101 is configured to support the mobile device to perform the processes 401-402 in FIG. 4, or the process 602 in FIG. 15, or the processes 702, 704 in FIG. 18;
  • the display unit 1102 is configured to support the mobile device to perform the process in FIG. Process 403, 405, or process 502-503 in FIG. 10, or process 601 in FIG. 15, or process 701, 703 in FIG. 18;
  • determination unit 1103 is used to support the mobile device to perform the process in FIG. Process 404, or process 501 in FIG. 10;
  • processing unit 1104 is configured to support the mobile device to perform 406 in FIG. 4, or process 504 in FIG. 10, or process 603 in FIG. 15, or, in FIG. Process 705. All the related content of the steps involved in the foregoing method embodiments may be referred to the functional descriptions of the corresponding functional modules, and details are not described herein again.
  • FIG. 25 shows a possible structural diagram of the mobile device involved in the above embodiment.
  • the mobile device includes a processing module 1302 and a communication module 1303.
  • the processing module 1302 is configured to control and manage the actions of the mobile device.
  • the processing module 1302 is configured to support the mobile device to perform the processes 401-406 in FIG. 5, or the processes 501-504 in FIG. 13, or, in FIG. Processes 601-603, or processes 701-705 in Figure 19, and/or other processes for the techniques described herein.
  • the communication module 1303 is configured to support communication of the mobile device with other network entities.
  • the mobile device can also include a storage module 1301 for storing program code and data of the mobile device.
  • the processing module 1302 may be a processor or a controller, for example, may be a central processing unit (CPU), a general-purpose processor, a digital signal processor (DSP), and an application specific integrated circuit (Application-Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA) or other programmable logic device, transistor logic device, hardware component, or any combination thereof. It is possible to implement or carry out various exemplary logical blocks, modules and circuits described in connection with the disclosure of the embodiments of the invention.
  • the processor may also be a combination of computing functions, for example, including one or more microprocessor combinations, a combination of a DSP and a microprocessor, and the like.
  • the communication module 1303 may be a transceiver, a transceiver circuit (eg, an RF circuit), a communication interface, or the like.
  • the storage module 1301 may be a memory.
  • the processing module 1302 is a processor
  • the communication module 1303 is a communication interface
  • the storage module 1301 is a memory
  • the specific hardware structure of the mobile device according to the embodiment of the present invention may be the mobile device 300 shown in FIG.
  • an embodiment of the present invention further provides a computer program, where the computer program includes instructions, when the computer program is executed by a computer, may cause the computer to perform the above steps 401-406, or steps 501-504, or steps 601-603, or the related processing methods in steps 701-705.
  • an embodiment of the present invention further provides a computer readable storage medium for storing computer software instructions used by the mobile device, including any program designed to execute the mobile device.
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the modules or units is only a logical function division.
  • there may be another division manner for example, multiple units or components may be used.
  • Combinations can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be through some interface, device or unit.
  • the coupling or communication connection can be in electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • a computer readable storage medium A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) or processor to perform all or part of the steps of the methods described in various embodiments of the present application.
  • the foregoing storage medium includes: a flash memory, a mobile hard disk, a read only memory, a random access memory, a magnetic disk, or an optical disk, and the like, which can store program codes.
  • the computer program product therefore includes one or more computer instructions.
  • the computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device.
  • the computer instructions can be stored in a computer readable storage medium or transferred from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions can be from a website site, computer, server or data center Transfer to another website site, computer, server, or data center by wire (eg, coaxial cable, fiber optic, digital subscriber line (DSL), or wireless (eg, infrared, wireless, microwave, etc.).
  • the computer readable storage medium can be any available media that can be accessed by a computer or a data storage device such as a server, data center, or the like that includes one or more available media.
  • the usable medium can be a magnetic medium (eg, a floppy disk, a hard disk, a magnetic tape), an optical medium (eg, a DVD), or a semiconductor medium (eg, a Solid State Disk (SSD)).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • User Interface Of Digital Computer (AREA)
  • Studio Devices (AREA)

Abstract

The present application relates to the technical field of communications and provides a processing method and a mobile device, for use in accurately and quickly locating target content in a photographed image. The method comprises: displaying a preview frame in a preview box of a mobile device, the preview frame comprising at least two quadrilaterals; determining that the time when a gaze focus of a user on a first quadrilateral exceeds a preset duration; prominently displaying an outline of the first quadrilateral in the preview frame; and after detecting a first photographing instruction, performing preset processing on a first image enclosed by the outline of the first quadrilateral, and outputting the first image subjected to the preset processing, the preset processing comprising performing quadrilateral correction on the first image, or performing OCR on text in the first image.

Description

一种处理方法及移动设备Processing method and mobile device

本申请要求于2016年12月16日提交中国专利局、申请号为201611169528.7、发明名称为“一种处理的方法和设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. 2016-1169528.7, entitled "A Method and Apparatus for Processing", filed on Dec. 16, 2016, the entire contents of .

技术领域Technical field

本发明实施例涉及通信技术领域,尤其涉及一种处理方法及移动设备。The embodiments of the present invention relate to the field of communications technologies, and in particular, to a processing method and a mobile device.

背景技术Background technique

目前,许多智能终端都具有文档校正和文档翻译等功能,其中,文档校正功能可以为用户快速定位拍摄画面中的目标内容(例如,名片、书籍以及广告牌等),而文档翻译功能可以为用户翻译拍摄画面中目标内容上的文字。At present, many smart terminals have functions such as document correction and document translation. Among them, the document correction function can quickly locate the target content (for example, business cards, books, billboards, etc.) in the captured image, and the document translation function can be used for the user. Translate the text on the target content in the shooting screen.

在实现上述文档校正和文档翻译等功能时,均需要对拍摄画面中的目标内容进行定位。一般,智能终端首先会从拍摄画面中确定一个或多个候选四边形,进而,在上述候选四边形中将靠近拍摄画面中心、面积较大、且形状更接近矩形的候选四边形作为目标四边形,从而将目标四边形中的画面内容作为目标内容。When implementing the above functions such as document correction and document translation, it is necessary to locate the target content in the shooting screen. Generally, the smart terminal first determines one or more candidate quadrilaterals from the captured image, and further, in the candidate quadrilateral, a candidate quadrangle that is close to the center of the photographing screen and has a larger area and a shape closer to a rectangle is used as the target quadrilateral, thereby The content of the picture in the quadrilateral is used as the target content.

但是,上述目标内容的定位方法需要用户手动调整拍摄画面,尤其是在拍摄画面中包含多个候选四边形时,如果用户对智能终端不进行左右或前后移动、缩放拍摄画面等操作对拍摄画面进行校准,则智能终端定位的目标内容很可能不是用户想要的内容,即无法准确便捷的定位拍摄画面中的目标内容内容。However, the positioning method of the above target content requires the user to manually adjust the shooting screen, especially when the user includes multiple candidate quadrilaterals in the shooting screen, and if the user does not perform left and right or backward movement, zooming the shooting screen, etc., the user calibrates the shooting screen. The target content of the smart terminal positioning is likely not the content that the user wants, that is, the target content content in the captured image cannot be accurately and conveniently located.

发明内容Summary of the invention

本发明的实施例提供一种处理方法及移动设备,可准确便捷的定位拍摄画面中的目标内容内容。Embodiments of the present invention provide a processing method and a mobile device, which can accurately and conveniently locate a target content content in a captured picture.

为达到上述目的,本发明的实施例采用如下技术方案:In order to achieve the above object, embodiments of the present invention adopt the following technical solutions:

第一方面提供了一种处理方法,包括:在移动设备的预览框内显示预览帧,该预览帧包括至少两个四边形;确定用户的视线焦点停留在第一四边形(该第一四边形为该至少两个四边形中的一个)内的时长大于预设时长;在该预览帧内突出显示该第一四边形的轮廓线;检测到第一拍照指令后,对该第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;该预设处理包括:对该第一图像执行四边形校正;或者,对该第一图像中的文字执行OCR。The first aspect provides a processing method, including: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; determining that the user's line of sight focus stays on the first quadrilateral (the first four sides) The duration in the shape of the at least two quadrilaterals is greater than a preset duration; the contour of the first quadrilateral is highlighted in the preview frame; after the first photographing instruction is detected, the first four sides are After performing the preset processing on the first image surrounded by the contour line, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or, the first image The text in the text performs OCR.

可以看出,在本发明实施例提供的处理方法中,移动设备通过检测用户的视线焦点确定出用户视线在预览帧内的哪个四边形内停留的时间较长,由于视线停留时长较长的四边形内的图像通常为用户需要的目标内容,那么,移动设备将该四边形的边缘线突出显示出来,从而向用户提示已经为用户确 定该四边形内的第一图像为目标内容,后续,移动设备在检测到拍照指令后,可直接输出经过预设处理后的上述第一图像。这样,无需用户手动调整预览帧的画面,移动设备便可以准确便捷的为用户定位预览帧中的目标内容。It can be seen that, in the processing method provided by the embodiment of the present invention, the mobile device determines, by detecting the focus of the user's line of sight, which quadrant in the preview frame the user's line of sight stays for a longer time, because the line of sight stays longer in the quadrilateral. The image is usually the target content that the user needs, then the mobile device highlights the edge line of the quadrilateral, thereby prompting the user that the user has already The first image in the quadrilateral is determined as the target content. Subsequently, after detecting the photographing instruction, the mobile device may directly output the first image after the preset processing. In this way, the mobile device can accurately and conveniently locate the target content in the preview frame for the user without manually adjusting the preview frame.

在一种可能的实现方式中,在预览框内突出显示第一四边形的轮廓线之后,还包括:确定用户的视线焦点停留在第二四边形内的时长大于该预设时长,该第二四边形为该至少两个四边形中除该第一四边形外的任一个四边形;在预览框内正常显示第一四边形的轮廓线;在该预览框内突出显示该第二四边形的轮廓线。也就是说,当用户视线发生变化时,移动设备可进一步为用户显示变化后的视线焦点所在的第二四边形的轮廓线。In a possible implementation, after the outline of the first quadrilateral is highlighted in the preview frame, the method further includes: determining that the duration of the user's line of sight staying in the second quadrilateral is greater than the preset duration, The second quadrilateral is any quadrilateral except the first quadrilateral of the at least two quadrilaterals; the outline of the first quadrilateral is normally displayed in the preview frame; the second is highlighted in the preview frame The outline of the quadrilateral. That is to say, when the user's line of sight changes, the mobile device can further display the contour of the second quadrilateral where the changed line of sight focus is displayed for the user.

在一种可能的实现方式中,在该预览框内突出显示该第二四边形的轮廓线之后,还包括:检测到第二拍照指令后,对该第二四边形的轮廓线所包围的第二图像执行该预设处理后,输出经过所述预设处理的第二图像。In a possible implementation, after the outline of the second quadrilateral is highlighted in the preview frame, the method further includes: after detecting the second photographing instruction, enclosing the outline of the second quadrilateral After the second image performs the preset processing, the second image subjected to the preset processing is output.

在一种可能的实现方式中,在移动设备的预览框内显示预览之后,还包括:在该预览帧中显示默认四边形的轮廓线,该默认四边形为该至少两个四边形中面积最大的四边形。也就是说,在检测用户的视线焦点之前,移动设备可以先将面积最大的四边形作为默认四边形,并将该默认四边形的轮廓线所包围的图像作为目标内容提示给用户。In a possible implementation, after displaying the preview in the preview box of the mobile device, the method further includes: displaying a contour of a default quadrilateral in the preview frame, wherein the default quadrilateral is a quadrilateral having the largest area among the at least two quadrilaterals. That is to say, before detecting the user's line of sight focus, the mobile device may first use the quadrilateral with the largest area as the default quadrilateral, and present the image surrounded by the outline of the default quadrilateral as the target content to the user.

在一种可能的实现方式中,在移动设备的预览框内显示预览帧之前,还包括:对该预览帧内的画面内容执行四边形检测,保存检测到的至少两个四边形的顶点坐标。In a possible implementation manner, before displaying the preview frame in the preview frame of the mobile device, the method further includes: performing quadrilateral detection on the screen content in the preview frame, and storing the detected vertex coordinates of the at least two quadrilaterals.

在一种可能的实现方式中,确定用户的视线焦点停留在第一四边形内的时长大于预设时长,包括:检测该视线焦点在该预览帧内的焦点位置;确定该焦点位置位于预览帧的第一四边形内的时长大于该预设时长。In a possible implementation manner, determining that a user's line of sight focus stays within the first quadrilateral is greater than a preset duration includes: detecting a focus position of the line of sight focus within the preview frame; determining that the focus position is in a preview The duration in the first quadrilateral of the frame is greater than the preset duration.

在一种可能的实现方式中,对该第一图像中的文字执行OCR,包括:将该第一图像中的文字翻译为预设语言的文字,并将该第一图像中的文字替换为翻译后该预设语言的文字;或者,将该第一图像中的文字翻译为预设语言的文字,并将翻译后该预设语言的文字添加到该第一图像中,从而实现文档翻译功能。In a possible implementation manner, performing OCR on the text in the first image includes: translating the text in the first image into a text in a preset language, and replacing the text in the first image with a translation And text of the preset language; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image, thereby implementing the document translation function.

在一种可能的实现方式中,该预设处理还包括:对该第一图像执行裁剪操作。In a possible implementation manner, the preset processing further includes: performing a cropping operation on the first image.

第二方面提供了一种处理方法,包括:对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;在该移动设备的预览框中显示该预览帧;确定视线焦点位于该至少两个四边形中的第一四边形内超过预设时长,在该预览框中呈现该第一四边形的轮廓线;检测到用于拍照的指令;对该预览帧中该第一四边形的轮廓线包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像,该经过所述预设处理的第一图像中包括该第一四边形的轮廓线包围的第一内容,该预设处理包括以下至少一种:四边形校正;对第一图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为该预设语言的文字;对该第一图像中的 文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将该预设语言的文字添加到识别出的文字的旁边。The second aspect provides a processing method, including: performing quadrilateral detection on a preview frame to be displayed, saving the detected vertex coordinates of at least two quadrilaterals; displaying the preview frame in a preview box of the mobile device; determining that the line of sight focus is located in the The first quadrilateral of the at least two quadrilaterals exceeds a preset duration, the outline of the first quadrilateral is presented in the preview frame; an instruction for photographing is detected; the first four in the preview frame After performing the preset processing on the first image surrounded by the outline of the edge, outputting the first image subjected to the preset processing, and the first image passing through the preset processing includes the contour line of the first quadrilateral The surrounding first content, the preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the first image, translating the recognized text into a text of a preset language, and the recognized text Replaced with the text of the preset language; in the first image The text performs optical character recognition, translates the recognized text into a text of a preset language, and adds the text of the preset language to the side of the recognized text.

在一种可能的实现方式中,该方法还包括:确定视线焦点位于该至少两个四边形中的第二四边形内超过预设时长,在该预览框中呈现该第二四边形的轮廓线;检测到用于拍照的指令;对该预览帧中该第二四边形的轮廓线包围的第二图像执行该预设处理后,输出经过所述预设处理的第二图像,该经过所述预设处理的第二图像中包括该第二四边形的轮廓线包围的第二内容。In a possible implementation manner, the method further includes: determining that a line of sight focus is within a second quadrilateral of the at least two quadriles beyond a preset duration, and presenting the contour of the second quadrilateral in the preview frame a line; detecting an instruction for photographing; after performing the preset processing on the second image surrounded by the outline of the second quadrilateral in the preview frame, outputting the second image subjected to the preset processing, the passing The second image of the preset process includes a second content surrounded by the outline of the second quadrilateral.

第三方面,提供一种处理方法,包括:对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;在该移动设备的预览框中显示该预览帧,该预览帧中呈现该至少两个四边形中的第一四边形的轮廓线;确定视线焦点位于该至少两个四边形中的第二四边形内超过预设时长,在该预览框中呈现该第二四边形的轮廓线,隐藏该第一四边形的轮廓线,该第二四边形为该至少两个四边形中除该第一四边形外的任一个;检测到用于拍照的指令;对该预览帧中该第二四边形的轮廓线包围的第二图像执行预设处理后,输出经过所述预设处理的第二图像,该经过所述预设处理的第二图像中包括该第二四边形的轮廓线包围的第二内容,该预设处理包括以下至少一种:四边形校正;对第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为该预设语言的文字;对该第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将该预设语言的文字添加到识别出的文字的旁边。In a third aspect, a processing method is provided, including: performing quadrilateral detection on a preview frame to be displayed, and storing the detected vertex coordinates of at least two quadrilaterals; displaying the preview frame in a preview frame of the mobile device, where the preview frame is presented a contour line of the first quadrilateral of the at least two quadrilaterals; determining that the line of sight focus is within the second quadrilateral of the at least two quadrilaterals for more than a preset duration, and presenting the second quadrilateral in the preview frame a contour that hides a contour of the first quadrilateral, the second quadrilateral being any one of the at least two quadrilaterals except the first quadrilateral; an instruction for photographing is detected; After performing a preset process on the second image surrounded by the outline of the second quadrilateral in the preview frame, outputting the second image that is subjected to the preset process, and the second image that is subjected to the preset process includes the first image a second content surrounded by a contour of the quadrilateral, the preset processing comprising at least one of the following: quadrilateral correction; performing optical character recognition on the text in the second image, and translating the recognized text into a preset language Text, and replacing the recognized text with the text of the preset language; performing optical character recognition on the text in the second image, translating the recognized text into a text of a preset language, and The text is added next to the recognized text.

第四方面提供了一种处理方法,包括:在该移动设备的预览框内显示预览帧,该预览帧包括至少两个四边形;检测到作用于第一四边形的操作,该第一四边形为该至少两个四边形中的一个;响应于该操作,对该第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;该预设处理包括:对该第一图像执行四边形校正;或者,对该第一图像中的文字执行光学字符识别。应理解:该操作包括但不限于点击,双击,轻按,长按或重按。A fourth aspect provides a processing method, comprising: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; detecting an operation acting on the first quadrilateral, the first four sides Forming one of the at least two quadrilaterals; in response to the operation, after performing a preset process on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset processing; The preset process includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. It should be understood that the operation includes, but is not limited to, clicking, double clicking, tapping, long pressing or repressing.

在一种可能的实现方式中,对该第一图像中的文字执行光学字符识别,包括:将该第一图像中的文字翻译为预设语言的文字,并将该第一图像中的文字替换为翻译后的该预设语言的文字;或者,将该第一图像中的文字翻译为预设语言的文字,并将翻译后的该预设语言的文字添加到该第一图像中。In a possible implementation manner, performing optical character recognition on the text in the first image includes: translating the text in the first image into a text in a preset language, and replacing the text in the first image The translated text of the preset language; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image.

在一种可能的实现方式中,该预设处理还包括:对该第一图像执行裁剪操作。In a possible implementation manner, the preset processing further includes: performing a cropping operation on the first image.

第五方面提供了一种处理方法,包括:在该移动设备的预览框内显示预览帧,该预览帧包括至少两个四边形;检测到作用于第一四边形的第一操作,该第一四边形为该至少两个四边形中的一个;响应于该第一操作,突出显示该第一四边形的轮廓线;检测到作用于拍照键的第二操作;响应于该第二操作,对该第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;该预设处理包括:对该第一图像执行四边形校正; 或者,对该第一图像中的文字执行光学字符识别。应理解:该第一操作与该第二操作可以相同,也可以不同,此处并不做限定。该第一操作包括但不限于:点击,双击,轻按,长按或重按。该第二操作包括但不限于:点击,轻按,长按或重按。A fifth aspect provides a processing method, including: displaying a preview frame in a preview frame of the mobile device, the preview frame including at least two quadrilaterals; detecting a first operation acting on the first quadrilateral, the first a quadrilateral is one of the at least two quadrilaterals; in response to the first operation, highlighting the contour of the first quadrilateral; detecting a second operation acting on the camera key; in response to the second operation, After performing a preset process on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset process; the preset process includes: performing quadrilateral correction on the first image; Alternatively, optical character recognition is performed on the text in the first image. It should be understood that the first operation and the second operation may be the same or different, and are not limited herein. The first operation includes but is not limited to: clicking, double clicking, tapping, long pressing or re-pressing. The second operation includes but is not limited to: clicking, tapping, long pressing or re-pressing.

在一种可能的实现方式中,该预设处理还包括:对该第一图像执行裁剪操作。In a possible implementation manner, the preset processing further includes: performing a cropping operation on the first image.

第六方面提供了一种移动设备,包括:显示单元,用于在该移动设备的预览框内显示预览帧,该预览帧包括至少两个四边形;确定单元,用于确定用户的视线焦点停留在第一四边形内的时长大于预设时长,该第一四边形为该至少两个四边形中的一个;该显示单元,还用于在该预览框内突出显示该第一四边形的轮廓线;处理单元,用于检测到第一拍照指令后,对该第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;该预设处理包括:对该第一图像执行四边形校正;或者,对该第一图像中的文字执行光学字符识别。A sixth aspect provides a mobile device, comprising: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a determining unit, configured to determine that a user's line of sight focus stays at The duration of the first quadrilateral is greater than a preset duration, the first quadrilateral being one of the at least two quadrilaterals; the display unit is further configured to highlight the first quadrilateral in the preview frame a contour processing unit, configured to: after detecting the first photographing instruction, performing a preset process on the first image surrounded by the outline of the first quadrilateral, and outputting the first image subjected to the preset processing; The preset process includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image.

在一种可能的实现方式中,还用于确定用户的视线焦点停留在第二四边形内的时长大于该预设时长,该第二四边形为该至少两个四边形中除该第一四边形外的任一个四边形;该显示单元,还用于在该预览框内正常显示该第一四边形的轮廓线;在该预览框内突出显示该第二四边形的轮廓线。In a possible implementation manner, the method further includes determining that a duration in which the user's line of sight focus stays in the second quadrilateral is greater than the preset duration, and the second quadrilateral is the first one of the at least two quadrilaterals Any one of the quadrilaterals outside the quadrilateral; the display unit is further configured to normally display the outline of the first quadrilateral in the preview frame; and highlight the outline of the second quadrilateral in the preview frame.

在一种可能的实现方式中,该处理单元,还用于检测到第二拍照指令后,对该第二四边形的轮廓线所包围的第二图像执行该预设处理后,输出经过所述预设处理的第二图像。In a possible implementation, the processing unit is further configured to: after detecting the second photographing instruction, perform the preset processing on the second image surrounded by the outline of the second quadrilateral, and output the passing through The second image of the preset process.

在一种可能的实现方式中,该显示单元,还用于在该预览帧中突出显示默认四边形的轮廓线,该默认四边形为该至少两个四边形中面积最大的四边形。In a possible implementation manner, the display unit is further configured to highlight a contour of a default quadrilateral in the preview frame, where the default quadrilateral is the largest quadrilateral of the at least two quadrilaterals.

在一种可能的实现方式中,移动设备还包括:检测单元,用于对该预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标。In a possible implementation, the mobile device further includes: a detecting unit, configured to perform quadrilateral detection on the preview frame, and save the detected vertex coordinates of the at least two quadrilaterals.

在一种可能的实现方式中,该确定单元,具体用于检测该视线焦点在该预览帧内的焦点位置;确定该焦点位于该第一四边形内的时长大于该预设时长。In a possible implementation, the determining unit is specifically configured to detect a focus position of the line of sight focus in the preview frame, and determine that the focus is located in the first quadrilateral for a longer duration than the preset duration.

在一种可能的实现方式中,该处理单元,具体用于将该第一图像中的文字翻译为预设语言的文字,并将该第一图像中的文字替换为翻译后该预设语言的文字;或者,将该第一图像中的文字翻译为预设语言的文字,并将翻译后该预设语言的文字添加到该第一图像中。In a possible implementation, the processing unit is specifically configured to translate the text in the first image into a text in a preset language, and replace the text in the first image with the translated language in the preset language. Text; or, the text in the first image is translated into the text of the preset language, and the translated text of the preset language is added to the first image.

第七方面提供了一种移动设备,包括:检测单元,用于对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;显示单元,用于在所述移动设备的预览框中显示所述预览帧;确定单元,用于确定视线焦点位于所述至少两个四边形中的第一四边形内超过预设时长,在所述预览框中呈现所述第一四边形的轮廓线;检测单元,用于检测到用于拍照的指令;处理单元,用于对所述预览帧中所述第一四边形的轮廓线包围的第一图像执 行预设处理后,输出经过所述预设处理的第一图像,所述经过所述预设处理的第一图像中包括所述第一四边形的轮廓线包围的第一内容,所述预设处理包括以下至少一种:四边形校正;对第一图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为所述预设语言的文字;对所述第一图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将所述预设语言的文字添加到识别出的文字的旁边。A seventh aspect provides a mobile device, comprising: a detecting unit, configured to perform quadrilateral detection on a preview frame to be displayed, save coordinates of at least two quadrilateral vertices detected; and display unit, in a preview box of the mobile device And displaying, in the preview frame, the first quadrilateral a detection unit configured to detect an instruction for photographing; a processing unit configured to perform a first image surrounded by the outline of the first quadrilateral in the preview frame After the line preset processing, the first image subjected to the preset processing is output, and the first image that is subjected to the preset processing includes the first content surrounded by the outline of the first quadrilateral, The preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the first image, translating the recognized text into a text of a preset language, and replacing the recognized text with the preset language a text; performing optical character recognition on the text in the first image, translating the recognized text into a text of a preset language, and adding the text of the preset language to the side of the recognized text.

第八方面提供了一种移动设备,包括:检测单元,用于对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;显示单元,用于在所述移动设备的预览框中显示所述预览帧,所述预览帧中呈现所述至少两个四边形中的第一四边形的轮廓线;确定单元,用于确定视线焦点位于所述至少两个四边形中的第二四边形内超过预设时长,在所述预览框中呈现所述第二四边形的轮廓线,隐藏所述第一四边形的轮廓线,所述第二四边形为所述至少两个四边形中除所述第一四边形外的任一个;检测单元,还用于检测到用于拍照的指令;处理单元,用于对所述预览帧中所述第二四边形的轮廓线包围的第二图像执行预设处理后,输出经过所述预设处理的第二图像,所述经过所述预设处理的第二图像中包括所述第二四边形的轮廓线包围的第二内容,所述预设处理包括以下至少一种:四边形校正;对第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为所述预设语言的文字;对所述第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将所述预设语言的文字添加到识别出的文字的旁边。An eighth aspect provides a mobile device, comprising: a detecting unit, configured to perform quadrilateral detection on a preview frame to be displayed, save coordinates of the detected at least two quadrilateral vertices; and display unit, in a preview box of the mobile device The preview frame is displayed, wherein the preview frame presents an outline of the first quadrilateral of the at least two quadrilaterals; and the determining unit is configured to determine that the line of sight focus is located in the second four of the at least two quadrilaterals The contour of the second quadrilateral is presented in the preview frame to hide the contour line of the first quadrilateral, and the second quadrilateral is the at least two Any one of the quadrilaterals except the first quadrilateral; the detecting unit is further configured to detect an instruction for photographing; and the processing unit is configured to: contour the second quadrilateral in the preview frame After performing the preset processing on the second image surrounded by the line, the second image subjected to the preset processing is output, and the second image subjected to the preset processing includes the contour line surrounded by the second quadrilateral Second content, The preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the second image, translating the recognized text into a text of a preset language, and replacing the recognized text with the preset language a text; performing optical character recognition on the text in the second image, translating the recognized text into a text of a preset language, and adding the text of the preset language to the side of the recognized text.

第九方面提供了一种移动设备,包括:显示单元,用于在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;检测单元,用于检测到作用于第一四边形的操作,所述第一四边形为所述至少两个四边形中的一个;处理单元,用于响应于所述操作,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。A ninth aspect provides a mobile device, including: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a detecting unit configured to detect a role a quadrilateral operation, the first quadrilateral being one of the at least two quadrilaterals; a processing unit responsive to the operation, surrounding the outline of the first quadrilateral After the first image performs the preset processing, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or performing execution on the text in the first image Optical character recognition.

第十方面提供了一种移动设备,包括:显示单元,用于在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;检测单元,用于检测到作用于第一四边形的第一操作,所述第一四边形为所述至少两个四边形中的一个;显示单元,还用于响应于所述第一操作,突出显示所述第一四边形的轮廓线;检测单元,还用于检测到作用于拍照键的第二操作;处理单元,用于响应于所述第二操作,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。A tenth aspect provides a mobile device, comprising: a display unit, configured to display a preview frame in a preview frame of the mobile device, the preview frame includes at least two quadrilaterals; and a detecting unit, configured to detect a role a first operation of a quadrilateral, the first quadrilateral being one of the at least two quadrilaterals; the display unit further configured to highlight the first quadrilateral in response to the first operation a contouring unit; the detecting unit is further configured to detect a second operation acting on the camera key; the processing unit, configured to, in response to the second operation, the first surrounded by the contour line of the first quadrilateral After the image performs the preset processing, outputting the first image subjected to the preset processing; the preset processing includes: performing quadrilateral correction on the first image; or performing optical characters on the text in the first image Identification.

第十一方面提供了一种移动设备,包括:处理器、存储器、总线和通信 接口;该存储器用于存储计算机执行指令,该处理器与该存储器通过该总线连接,当该移动设备运行时,该处理器执行该存储器存储的该计算机执行指令,以使该移动设备执行上述任意一项处理方法。The eleventh aspect provides a mobile device comprising: a processor, a memory, a bus, and a communication An interface for storing a computer execution instruction, the processor being coupled to the memory via the bus, the processor executing the computer-executable instruction stored by the memory when the mobile device is running, to cause the mobile device to perform the arbitrary A treatment method.

第十二方面提供了一种计算机可读存储介质,用于储存为上述移动设备所用的计算机软件指令,其包含用于执行上述方面为该移动设备所设计的程序。A twelfth aspect provides a computer readable storage medium for storing computer software instructions for use in the mobile device described above, comprising a program designed to perform the above aspects for the mobile device.

第十三方面提供了一种计算机程序,该计算机程序包括指令,当该计算机程序被计算机执行时,使得计算机可以执行上述任一项的处理方法。A thirteenth aspect provides a computer program comprising instructions which, when executed by a computer, cause a computer to perform the processing method of any of the above.

上述移动设备的名字对设备或功能模块本身不构成限定,在实际实现中,这些设备或功能模块可以以其他名称出现。只要各个设备或功能模块的功能和本发明类似,属于本发明权利要求及其等同技术的范围之内。The names of the above mobile devices are not limited to the devices or the functional modules themselves. In actual implementation, these devices or functional modules may appear under other names. As long as the functions of the respective devices or functional modules are similar to the present invention, they are within the scope of the claims and equivalents thereof.

另外,第二方面至第十三方面中任一种设计方式所带来的技术效果可参见第一方面中不同设计方式所带来的技术效果,此处不再赘述。In addition, the technical effects brought by the design method of any one of the second aspect to the thirteenth aspect can be referred to the technical effects brought by different design modes in the first aspect, and details are not described herein again.

本申请的这些方面或其他方面在以下实施例的描述中会更加简明易懂。These and other aspects of the present application will be more readily apparent from the following description of the embodiments.

附图说明DRAWINGS

图1为本发明实施例提供的预览帧的应用场景示意图;FIG. 1 is a schematic diagram of an application scenario of a preview frame according to an embodiment of the present disclosure;

图2为本发明实施例提供的内置摄像头的应用场景示意图;2 is a schematic diagram of an application scenario of a built-in camera according to an embodiment of the present invention;

图3为本发明实施例提供的一种移动设备的结构示意图一;FIG. 3 is a schematic structural diagram 1 of a mobile device according to an embodiment of the present disclosure;

图4为本发明实施例提供的一种处理方法的应用场景示意图一;FIG. 4 is a schematic diagram 1 of an application scenario of a processing method according to an embodiment of the present disclosure;

图5为本发明实施例提供的一种处理方法的流程示意图一;FIG. 5 is a schematic flowchart 1 of a processing method according to an embodiment of the present disclosure;

图6为本发明实施例提供的处理方法的应用场景示意图二;FIG. 6 is a schematic diagram 2 of an application scenario of a processing method according to an embodiment of the present disclosure;

图7为本发明实施例提供的处理方法的应用场景示意图三;FIG. 7 is a schematic diagram 3 of an application scenario of a processing method according to an embodiment of the present disclosure;

图8为本发明实施例提供的处理方法的应用场景示意图四;FIG. 8 is a schematic diagram 4 of an application scenario of a processing method according to an embodiment of the present disclosure;

图9为本发明实施例提供的处理方法的应用场景示意图五;FIG. 9 is a schematic diagram 5 of an application scenario of a processing method according to an embodiment of the present disclosure;

图10为本发明实施例提供的处理方法的应用场景示意图六;FIG. 10 is a schematic diagram 6 of an application scenario of a processing method according to an embodiment of the present disclosure;

图11为本发明实施例提供的处理方法的应用场景示意图七;FIG. 11 is a schematic diagram 7 of an application scenario of a processing method according to an embodiment of the present disclosure;

图12为本发明实施例提供的处理方法的应用场景示意图八;FIG. 12 is a schematic diagram 8 of an application scenario of a processing method according to an embodiment of the present disclosure;

图13为本发明实施例提供的一种处理方法的流程示意图二;FIG. 13 is a second schematic flowchart of a processing method according to an embodiment of the present disclosure;

图14为本发明实施例提供的处理方法的应用场景示意图九;FIG. 14 is a schematic diagram 9 of an application scenario of a processing method according to an embodiment of the present disclosure;

图15为本发明实施例提供的一种处理方法的流程示意图三;FIG. 15 is a schematic flowchart 3 of a processing method according to an embodiment of the present disclosure;

图16为本发明实施例提供的处理方法的应用场景示意图十;FIG. 16 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present disclosure;

图17为本发明实施例提供的处理方法的应用场景示意图十一;FIG. 17 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention;

图18为本发明实施例提供的处理方法的应用场景示意图十二;FIG. 18 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention;

图19为本发明实施例提供的一种处理方法的流程示意图四;FIG. 19 is a schematic flowchart 4 of a processing method according to an embodiment of the present disclosure;

图20为本发明实施例提供的处理方法的应用场景示意图十三;FIG. 20 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present disclosure;

图21为本发明实施例提供的处理方法的应用场景示意图十四;FIG. 21 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention; FIG.

图22为本发明实施例提供的处理方法的应用场景示意图十五;FIG. 22 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention;

图23为本发明实施例提供的处理方法的应用场景示意图十六; FIG. 23 is a schematic diagram of an application scenario of a processing method according to an embodiment of the present invention;

图24为本发明实施例提供的一种移动设备的结构示意图二;FIG. 24 is a schematic structural diagram 2 of a mobile device according to an embodiment of the present disclosure;

图25为本发明实施例提供的一种移动设备的结构示意图三。FIG. 25 is a schematic structural diagram 3 of a mobile device according to an embodiment of the present invention.

具体实施方式detailed description

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行详细地描述。The technical solutions in the embodiments of the present invention are described in detail below with reference to the accompanying drawings in the embodiments of the present invention.

另外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本发明实施例的描述中,除非另有说明,“多个”的含义是两个或两个以上。In addition, the terms "first" and "second" are used for descriptive purposes only, and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, features defining "first" and "second" may include one or more of the features either explicitly or implicitly. In the description of the embodiments of the present invention, the meaning of "a plurality" is two or more unless otherwise specified.

本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。The term "and/or" in this context is merely an association describing the associated object, indicating that there may be three relationships, for example, A and / or B, which may indicate that A exists separately, and both A and B exist, respectively. B these three situations. In addition, the character "/" in this article generally indicates that the contextual object is an "or" relationship.

为了方便理解本发明实施例,首先在此介绍本发明实施例描述中会引入的几个术语:In order to facilitate the understanding of the embodiments of the present invention, several terms introduced in the description of the embodiments of the present invention are first introduced herein:

预览帧,是指在移动设备在拍摄过程中通过摄像头捕捉到的一个最小单位的单幅拍摄画面,该拍摄画面可以显示在移动设备的显示界面中,例如,以手机作为上述移动设备举例说明,用户触发手机内的相机应用后,如图1所示,可以在手机的显示界面内显示一个预览框12,进而,手机将摄像头捕捉到的拍摄画面实时的显示在该预览框12内,此时,该预览框12内的每一幅拍摄画面可作为一个预览帧11。当然,上述移动设备还可以为可穿戴眼镜,可穿戴头盔等增强现实(Augmented Reality,AR)设备或便携式设备(例如,手机),本发明实施例对此不作任何限制。The preview frame refers to a single-frame shooting picture of a minimum unit captured by the mobile device during the shooting process, and the shooting picture can be displayed in the display interface of the mobile device, for example, using the mobile phone as the mobile device as an example. After the user triggers the camera application in the mobile phone, as shown in FIG. 1 , a preview frame 12 can be displayed in the display interface of the mobile phone, and then the mobile phone captures the captured image captured by the camera in the preview frame 12 in real time. Each shot in the preview frame 12 can be used as a preview frame 11. Of course, the above-mentioned mobile device may also be an Augmented Reality (AR) device or a portable device (for example, a mobile phone) such as a wearable eyeglass, a wearable helmet, and the like.

移动设备的内置摄像头13,是指用户观看移动设备的显示屏时,与用户视线相对的摄像头,内置摄像头13在工作时一般可以拍摄到用户的眼球状态,例如,手机的前置摄像头(如图2中的(a)所示),或者,在可穿戴眼镜(如图2中的(b)所示)或可穿戴头盔内设置的摄像头,内置摄像头13具体可以为红外摄像头、R(red,红)G(green,绿)B(blue,蓝)摄像头等,本发明实施例对此不作任何限制。The built-in camera 13 of the mobile device refers to a camera that is opposite to the user's line of sight when the user views the display screen of the mobile device. The built-in camera 13 can generally capture the state of the user's eyeball during work, for example, the front camera of the mobile phone (as shown in the figure) 2 (a)), or, in a wearable eyeglass (shown as (b) in FIG. 2) or a camera provided in a wearable helmet, the built-in camera 13 may specifically be an infrared camera, R (red, The red (green) green (green) B (blue, blue) camera or the like is not limited in this embodiment of the present invention.

移动设备的外置摄像头,是指用户观看移动设备的显示屏时,在背离显示屏的一侧设置的摄像头,例如,手机的后置摄像头或者在可穿戴眼镜或可穿戴头盔外设置的摄像头,外置摄像头具体可以为RGB摄像头、Mono(黑白)摄像头、或深度摄像头等,本发明实施例对此不作任何限制。The external camera of the mobile device refers to the camera set on the side facing away from the display when the user views the display screen of the mobile device, for example, the rear camera of the mobile phone or the camera set outside the wearable glasses or the wearable helmet. The external camera may specifically be an RGB camera, a Mono (black and white) camera, or a depth camera, etc., which is not limited in this embodiment of the present invention.

本发明的实施例提供一种处理方法,该方法可应用于任意头戴式电子设备,例如,增强现实(Augmented Reality,AR)眼镜、虚拟现实(Virtual Reality,VR)头盔等,也可应用于任意手持式电子设备,例如,手机、平板电脑、笔记本电脑、超级移动个人计算机(Ultra-mobile Personal Computer,UMPC)、上网本、个人数字助理(Personal Digital Assistant,PDA)等,本发明实施例 对此不作任何限制。Embodiments of the present invention provide a processing method, which can be applied to any head mounted electronic device, for example, Augmented Reality (AR) glasses, Virtual Reality (VR) helmets, etc., and can also be applied to Any handheld electronic device, such as a mobile phone, a tablet computer, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a personal digital assistant (PDA), etc., in the embodiment of the present invention There are no restrictions on this.

该移动设备的结构示意图如图3所示。移动设备300包括:通信接口320、存储器330、输入单元340、显示单元350、内置摄像头13、外置摄像头14、音频电路370以及处理器380等部件。应理解:图3中示出的移动设备300的结构并不构成对移动设备的限定,移动设备可以包括比图示更多或更少的部件,或者组合某些部件,或者其他不同的部件。The structure of the mobile device is shown in Figure 3. The mobile device 300 includes components such as a communication interface 320, a memory 330, an input unit 340, a display unit 350, a built-in camera 13, an external camera 14, an audio circuit 370, and a processor 380. It should be understood that the structure of the mobile device 300 illustrated in FIG. 3 does not constitute a limitation to the mobile device, which may include more or fewer components than those illustrated, or some components, or other different components.

下面结合图3对移动设备300的各个构成部件进行具体的介绍:The components of the mobile device 300 will be specifically described below with reference to FIG. 3:

通信接口320,用于收发信息或通话过程中,信号的接收和发送。通常,通信接口包括但不限于天线、至少一个放大器、收发信机、耦合器、低噪声放大器(low noise amplifier,LNA)、双工器等。The communication interface 320 is configured to receive and transmit signals during and after receiving or transmitting information or during a call. Typically, communication interfaces include, but are not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, and the like.

存储器330可用于存储软件程序以及模块,处理器380通过运行存储在存储器330的软件程序以及模块,从而执行移动设备300的各种功能应用以及数据处理。存储器330可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如相机功能、图像播放功能等)等;存储数据区可存储根据移动设备300的使用所创建的数据(比如音频数据、图像数据、电话本等)等。The memory 330 can be used to store software programs and modules, and the processor 380 executes various functional applications and data processing of the mobile device 300 by running software programs and modules stored in the memory 330. The memory 330 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a camera function, an image playing function, etc.), and the like; the storage data area may be stored according to the movement. The data created by the use of the device 300 (such as audio data, image data, phone book, etc.) and the like.

输入单元340可用于接收输入的数字或字符信息,以及产生与移动设备300的用户设置以及功能控制有关的键信号输入。具体地,输入单元340可包括触摸屏341以及其他输入设备342。触摸屏341,也称为触控面板,可检测用户在其上或附近的触摸操作,并根据预先设定的程式驱动相应的连接装置。除了触摸屏341,输入单元340还可以包括其他输入设备342。具体地,其他输入设备342可以包括但不限于物理键盘、功能键(比如音量控制按键、电源开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种。The input unit 340 can be configured to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the mobile device 300. In particular, the input unit 340 can include a touch screen 341 as well as other input devices 342. The touch screen 341, also referred to as a touch panel, can detect a touch operation on or near the user and drive the corresponding connection device according to a preset program. In addition to the touch screen 341, the input unit 340 may also include other input devices 342. In particular, other input devices 342 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, power switch buttons, etc.), trackballs, mice, joysticks, and the like.

显示单元350可用于显示由用户输入的信息或提供给用户的信息以及移动设备300的各种菜单。显示单元350可包括显示面板351。进一步的,当触摸屏341检测到在其上或附近的触摸操作后,可传送给处理器380以确定触摸事件的类型,随后处理器380根据触摸事件的类型在显示面板351上提供相应的视觉输出。The display unit 350 can be used to display information input by the user or information provided to the user and various menus of the mobile device 300. The display unit 350 may include a display panel 351. Further, when the touch screen 341 detects a touch operation on or near it, it can be transmitted to the processor 380 to determine the type of the touch event, and then the processor 380 provides a corresponding visual output on the display panel 351 according to the type of the touch event. .

音频电路370、扬声器371、麦克风372可提供用户与移动设备300之间的音频接口。音频电路370可将接收到的音频数据转换后的电信号,传输到扬声器371,由扬声器371转换为声音信号输出;另一方面,麦克风372将检测到的声音信号转换为电信号,由音频电路370接收后转换为音频数据,再将音频数据输出至通信接口320以发送给比如另一移动设备,或者将音频数据输出至存储器330以便进一步处理。Audio circuitry 370, speaker 371, microphone 372 can provide an audio interface between the user and mobile device 300. The audio circuit 370 can transmit the converted electrical data of the received audio data to the speaker 371 for conversion to the sound signal output by the speaker 371; on the other hand, the microphone 372 converts the detected sound signal into an electrical signal by the audio circuit. The 370 is converted to audio data after reception, and the audio data is output to the communication interface 320 for transmission to, for example, another mobile device, or the audio data is output to the memory 330 for further processing.

处理器380是移动设备300的控制中心,利用各种接口和线路连接整个移动设备的各个部分,通过运行或执行存储在存储器330内的软件程序和/或模块,以及调用存储在存储器330内的数据,执行移动设备300的各种功能和处理数据,从而对移动设备300进行整体监控。可选的,处理器380可包括一个或多个处理单元。 Processor 380 is the control center of mobile device 300, which connects various portions of the entire mobile device using various interfaces and lines, by running or executing software programs and/or modules stored in memory 330, and by calling stored in memory 330. Data, performing various functions and processing data of the mobile device 300, thereby performing overall monitoring of the mobile device 300. Alternatively, processor 380 can include one or more processing units.

移动设备300还可以包括至少一个内置摄像头13和至少一个外置摄像头14。内置摄像头13或外置摄像头14可以将捕捉到的预览帧发送给处理器380,由处理器380对预览帧进行图像处理后,将处理后的图像发送至显示单元350进行显示。The mobile device 300 can also include at least one built-in camera 13 and at least one external camera 14. The built-in camera 13 or the external camera 14 can transmit the captured preview frame to the processor 380. After the image processing is performed on the preview frame by the processor 380, the processed image is sent to the display unit 350 for display.

具体的,在本发明实施例提供的处理方法中,如图4所示,当用户触发移动设备(图4中以头戴式电子设备为例)内的目标内容定位功能时,例如,用户设置了文档校正功能或文档翻译功能等,处理器380可调用外置摄像头14获取当前的预览帧11,并在显示单元350内显示该预览帧11,进而,处理器380可调用存储器330中预置的四边形检测算法,获取该预览帧11内至少两个四边形的顶点坐标,例如,图4中四边形A的顶点坐标、四边形B的顶点坐标、四边形C的顶点坐标以及四边形D的顶点坐标。Specifically, in the processing method provided by the embodiment of the present invention, as shown in FIG. 4, when the user triggers the target content positioning function in the mobile device (taking the head mounted electronic device in FIG. 4 as an example), for example, user setting The processor 380 can call the external camera 14 to acquire the current preview frame 11 and display the preview frame 11 in the display unit 350. Further, the processor 380 can call the preset in the memory 330. The quadrilateral detection algorithm acquires vertex coordinates of at least two quadrilaterals in the preview frame 11, for example, the vertex coordinates of the quadrilateral A in FIG. 4, the vertex coordinates of the quadrilateral B, the vertex coordinates of the quadrilateral C, and the vertex coordinates of the quadrilateral D.

此时,处理器380可调用内置摄像头13检测用户视线,确定用户的视线焦点在预览帧11内停留的时长以及停留的位置,如果用户的视线焦点在四边形B内停留的时长大于预设时长,则如图4所示,处理器380通过显示单元350在预览帧11内突出显示四边形B的轮廓线,即处理器380将四边形B的轮廓线所包围的第一图像作为该预览帧11中的目标内容。At this time, the processor 380 can call the built-in camera 13 to detect the user's line of sight, determine the duration of the user's line of sight focus staying in the preview frame 11 and the position of the stay. If the user's line of sight focus stays in the quadrilateral B for longer than the preset duration, Then, as shown in FIG. 4, the processor 380 highlights the outline of the quadrilateral B in the preview frame 11 through the display unit 350, that is, the processor 380 uses the first image surrounded by the outline of the quadrilateral B as the preview image. Target content.

那么,如果处理器380检测到第一拍照指令,则处理器380对上述第一图像执行预设处理,并在显示单元350内输出预设处理后的第一图像。Then, if the processor 380 detects the first photographing instruction, the processor 380 performs preset processing on the first image, and outputs the preset processed first image in the display unit 350.

其中,如果上述第一拍照指令是用户启动文档校正功能时触发的,则上述预设处理具体可以为:对第一图像执行四边形校正,例如,对第一图像进行倾斜校正;如果上述第一拍照指令是用户启动文档翻译功能时触发的,则上述预设处理具体可以为:对第一图像中的文字执行光学字符识别(Optical Character Recognition,OCR),例如,将第一图像中的汉字翻译为英文。If the first photographing instruction is triggered when the user starts the document correction function, the preset processing may specifically be: performing quadrilateral correction on the first image, for example, performing tilt correction on the first image; The instruction is triggered when the user initiates the document translation function, and the preset processing may be: performing optical character recognition (OCR) on the text in the first image, for example, translating the Chinese characters in the first image into English.

可以看出,在本发明实施例提供的处理方法中,移动设备通过检测用户的视线焦点确定出用户视线在预览帧内的哪个四边形内停留的时间较长,由于视线停留时长较长的四边形内的图像通常为用户需要的目标内容,那么,移动设备将该四边形的边缘线突出显示,从而向用户提示已经为用户确定该四边形内的第一图像为目标内容,后续,移动设备在检测到拍照指令后,可直接输出经过预设处理的上述第一图像。这样,无需用户手动调整预览帧的画面,移动设备便可以准确便捷的为用户定位预览帧中的目标内容,并且,移动设备只需对预览帧中第一图像进行上述预设处理,避免了对第一图像外的无效数据的处理,从而提高图像处理效率。It can be seen that, in the processing method provided by the embodiment of the present invention, the mobile device determines, by detecting the focus of the user's line of sight, which quadrant in the preview frame the user's line of sight stays for a longer time, because the line of sight stays longer in the quadrilateral. The image is usually the target content that the user needs. Then, the mobile device highlights the edge line of the quadrilateral, thereby prompting the user that the first image in the quadrilateral has been determined as the target content for the user, and subsequently, the mobile device detects the photo. After the instruction, the first image subjected to the preset processing may be directly output. In this way, the mobile device can accurately and conveniently locate the target content in the preview frame without the user manually adjusting the preview frame, and the mobile device only needs to perform the preset processing on the first image in the preview frame, thereby avoiding Processing of invalid data outside the first image, thereby improving image processing efficiency.

需要说明的是,上述处理方法除了可以应用在上述文档校正和文档翻译等应用场景中,还可以应用于各类基于目标内容定位的应用场景,例如,名片识别、银行卡识别、身份证识别等,本发明实施例对此不作任何限制。It should be noted that the foregoing processing method can be applied to various application scenarios such as document correction and document translation, and can also be applied to various application scenarios based on target content positioning, for example, business card recognition, bank card identification, ID card recognition, and the like. The embodiment of the present invention does not impose any limitation on this.

应理解:尽管如图3中未示出,但移动设备300中还可以包括电源、重力传感器(gravity sensor)、光传感器、陀螺仪、气压计、湿度计、温度计、红外线传感器、无线保真(wireless fidelity,Wi-Fi)模块或蓝牙模块等,在此不再赘述。 It should be understood that although not shown in FIG. 3, the mobile device 300 may further include a power source, a gravity sensor, a light sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, and a wireless fidelity ( Wireless fidelity, Wi-Fi module or Bluetooth module, etc., will not be described here.

以下,将结合具体实施例详细阐述本发明实施例提供的一种处理方法,如图5所示,该方法包括:Hereinafter, a processing method provided by an embodiment of the present invention is described in detail in conjunction with a specific embodiment. As shown in FIG. 5, the method includes:

401、移动设备对预览帧内的内容执行四边形检测操作。401. The mobile device performs a quadrilateral detection operation on the content in the preview frame.

402、移动设备保存检测到的至少两个四边形中每个四边形的顶点坐标。402. The mobile device saves vertex coordinates of each of the detected at least two quadrilaterals.

当用户开启文档校正或文档识别等需要定位目标内容的功能时,移动设备可调用外置摄像头进入预览状态,此时,移动设备将外置摄像头捕捉到的画面以预览帧的形式显示在移动设备的显示界面内。When the user turns on the function of positioning the target content, such as document correction or document recognition, the mobile device can call the external camera to enter the preview state. At this time, the mobile device displays the image captured by the external camera in the form of a preview frame on the mobile device. Within the display interface.

具体的,在步骤401中,移动设备对预览帧内的画面内容执行四边形检测操作,从而确定出预览帧内包含的一个或多个四边形。Specifically, in step 401, the mobile device performs a quadrilateral detection operation on the picture content in the preview frame, thereby determining one or more quadrilaterals included in the preview frame.

示例性的,首先可以对预览帧内的画面进行边缘检测,进而提取出预览帧内包含的直线线条,最终,可根据预先设置的四边形检测算法,例如,基于开源计算机视觉库(Open Source Computer Vision Library,OpenCV)的矩形识别算法等,确定由这些直线线条组成的一个或多个四边形。For example, the edge of the preview frame may be edge-detected, and then the straight line included in the preview frame may be extracted, and finally, according to a preset quadrilateral detection algorithm, for example, based on an open source computer vision library (Open Source Computer Vision) Library, OpenCV) Rectangular recognition algorithm, etc., determines one or more quadrilaterals composed of these straight lines.

当然,由于预览帧内的四边形可能有很多个,而面积或周长过小的四边形内的画面内容一般可能不是用户需要的目标内容,因此,在执行四边形检测操作时还可以根据四边形的大小进行筛选,将面积或周长小于预设值的四边形丢弃,避免这些面积或周长较小的四边形干扰后续的图像处理过程。另外,还可以对预览帧内的画面内容进行噪声过滤等操作,以提高四边形检测操作的准确率。Of course, since there may be many quads in the preview frame, and the content of the quadrilateral in the area or the circumference is too small may not be the target content required by the user, the quadrilateral detection operation may also be performed according to the size of the quadrilateral. Screening, discarding quadrilaterals with an area or perimeter less than the preset value, avoiding these quadrilaterals with smaller areas or perimeters from interfering with subsequent image processing. In addition, it is also possible to perform noise filtering and the like on the screen content in the preview frame to improve the accuracy of the quadrilateral detection operation.

在步骤402中,移动设备保存检测到的至少两个四边形中每个四边形的顶点坐标,从而确定出预览帧内已检测出的每个四边形的位置。In step 402, the mobile device saves the vertex coordinates of each of the detected at least two quadrilaterals to determine the position of each quadrilateral detected within the preview frame.

如图6所示,移动设备将预览帧显示在显示单元的预览框11中,假设预览框11内的拍摄画面为一张A4纸,那么,通过四边形检测操作,可以确定出该预览帧内包括5个四边形,即A4纸的边缘形成的四边形1,以及A4纸内包含的四边形2-四边形5,那么,移动设备可以通过执行四边形检测,将检测出的这5个四边形中每个四边形的顶点坐标,从而确定出这5个四边形在预览帧11中的位置。As shown in FIG. 6, the mobile device displays the preview frame in the preview frame 11 of the display unit. If the captured image in the preview frame 11 is an A4 paper, it can be determined that the preview frame is included by the quadrilateral detection operation. Five quadrangles, that is, a quadrilateral 1 formed by the edge of the A4 paper, and a quadrilateral 2-quadrant 5 included in the A4 paper, then the mobile device can detect the vertices of each of the five quadrilaterals by performing quadrilateral detection. Coordinates to determine the position of the five quadrilaterals in the preview frame 11.

403(可选的)、移动设备在预览帧中突出显示默认四边形的轮廓线;该默认四边形为上述至少两个四边形中面积最大的四边形。403 (optional), the mobile device highlights a contour of the default quadrilateral in the preview frame; the default quadrilateral is the quadrilateral having the largest area among the at least two quadrilaterals.

可选的,在步骤403中,移动设备可以从上述检测出的至少两个四边形中,确定出面积最大的四边形,并将该四边形作为默认四边形,在预览帧中突出显示默认四边形的轮廓线。Optionally, in step 403, the mobile device may determine the quadrilateral with the largest area from the at least two quadrilaterals detected above, and use the quadrilateral as the default quadrilateral to highlight the outline of the default quadrilateral in the preview frame.

其中,突出显示是指使用不同颜色或不同粗细的线条突出显示文本或图像的功能;突出显示与正常显示相比,用户通过肉眼观察可以明显的区分出来。Among them, highlighting refers to the function of highlighting text or image using lines of different colors or different thicknesses; highlighting can be clearly distinguished by visual observation compared with normal display.

如图7所示,在四边形1-四边形5中,面积最大的四边形为四边形1,即A4纸的边缘形成的四边形,那么,移动设备可以将四边形1的轮廓线进行高亮显示(图7中以加粗显示作为高亮显示进行示意),从而提示用户当前预览帧内面积最大的四边形为四边形1,四边形1的轮廓线包围的内容为当前移 动设备默认的等待预设处理的图像。As shown in FIG. 7, in the quadrilateral 1-quadrant 5, the largest quadrilateral is a quadrilateral 1, that is, a quadrangle formed by the edge of the A4 paper, then the mobile device can highlight the outline of the quadrilateral 1 (in FIG. 7). The bold display is indicated as a highlight, so that the user is prompted to display the quadrilateral with the largest area in the current preview frame as the quadrilateral 1, and the content surrounded by the outline of the quadrilateral 1 is the current shift. The default device waiting for the preset processing image.

其中,显示默认四边形的轮廓线的方法可以有多种,上述实施例中以高亮显示为例,还使用虚线等方式显示,本发明实施例对此不作任何限制。The method for displaying the outline of the default quadrilateral may be various. In the above embodiment, the highlighting is taken as an example, and the display is also performed by using a dotted line or the like, which is not limited in this embodiment of the present invention.

404、移动设备根据每个四边形的顶点坐标,检测用户的视线焦点在预览帧内的焦点位置。404. The mobile device detects, according to vertex coordinates of each quadrilateral, a focus position of the user's line of sight focus within the preview frame.

405、若确定视线焦点停留在第一四边形内的时长大于预设时长,则移动设备在预览帧内突出显示第一四边形的轮廓线,第一四边形为至少两个四边形中的一个。405. If it is determined that the duration of the line of sight focus staying in the first quadrilateral is greater than the preset duration, the mobile device highlights the contour of the first quadrilateral in the preview frame, where the first quadrilateral is at least two quadrilaterals. one of.

在步骤404和405中,移动设备可以通过调用内置摄像头,捕捉用户眼球的运动状态,进而利用眼球追踪技术检测用户的视线焦点在预览帧内的位置,即焦点位置。In steps 404 and 405, the mobile device can capture the motion state of the user's eyeball by calling the built-in camera, and then use the eyeball tracking technology to detect the position of the user's line of sight focus within the preview frame, that is, the focus position.

示例性的,通过内置摄像头可以捕捉到用户和眼球和眼球周边的图像,进而,移动设备可以根据眼球和眼球周边的特征变化进行跟踪,以确定用户的视线焦点位置;或者,移动设备可以根据虹膜角度变化进行跟踪,以确定用户的视线焦点位置,又或者,移动设备还可以主动投射红外线等光束到虹膜后提取特征信息进行跟踪,以确定用户的视线焦点位置。Illustratively, the user and the eyeball and the image around the eyeball can be captured by the built-in camera. Further, the mobile device can track the eyeball and the feature changes around the eyeball to determine the user's line of sight focus position; or, the mobile device can be based on the iris The angle change is tracked to determine the user's line of sight focus position, or the mobile device can actively project a beam of infrared light to the iris to extract feature information for tracking to determine the user's line of sight focus position.

在一种可能的设计方法中,移动设备确定焦点位置或焦点位置发生变化后,可以并行地启动两条线程,一条线程用于根据上述每个四边形的顶点坐标,确定当前的焦点位置具体位于哪个四边形内,另一条线程可用于记录用户的视线焦点在该焦点位置停留的时长。In a possible design method, after the mobile device determines that the focus position or the focus position changes, two threads can be started in parallel, and one thread is used to determine which current focus position is located according to the vertex coordinates of each quadrature described above. Within the quadrilateral, another thread can be used to record how long the user's line of sight focus stays at that focus position.

由于用户的视线焦点在一定时间内可能会有微小的变动,因此,可以当视线焦点落入以该焦点位置为中心,半径为R(R为预设值)的区域内时,均可认为用户的视线焦点停留在上述焦点位置。Since the user's line of sight focus may change slightly within a certain period of time, the user may be considered to be the user when the line of sight focus falls within the area centered on the focus position and the radius is R (R is the preset value). The line of sight focus stays at the above focus position.

那么,如果移动设备确定出该焦点位置位于第一四边形(该第一四边形为至少两个四边形中的一个)内,且用户的视线焦点在该焦点位置停留的时长大于预设时长(例如,2秒),则说明用户当前的注意力集中在了第一四边形内,此时,触发移动设备在预览帧内显示该第一四边形的轮廓线,以提示用户当前移动设备为用户确定的目前文档为第一四边形的轮廓线所包围的第一图像。Then, if the mobile device determines that the focus position is in the first quadrilateral (the first quadrilateral is one of the at least two quadrilaterals), and the user's line of sight focus stays at the focus position for longer than the preset duration (for example, 2 seconds), indicating that the user's current attention is concentrated in the first quadrilateral. At this time, the triggering mobile device displays the outline of the first quadrilateral in the preview frame to prompt the user to move. The current document determined by the device for the user is the first image surrounded by the outline of the first quadrilateral.

在另一种可能的设计方法中,移动设备确定焦点位置后,可以先根据上述每个四边形的顶点坐标确定该焦点位置具体位于哪个四边形内,例如,位于第一四边形内,进而,确定用户的视线焦点在该第一四边形内停留的时长是否大于预设时长。In another possible design method, after the mobile device determines the focus position, it may first determine, according to the vertex coordinates of each quadrilateral, which quadrilateral the focus position is located in, for example, located in the first quadrilateral, and further, determine Whether the duration of the user's line of sight focus staying within the first quadrilateral is greater than a preset duration.

或者,移动设备确定焦点位置后,可以先确定用户的视线焦点在该焦点位置停留的时长是否大于预设时长,如果大于预设时长,则进一步根据上述每个四边形的顶点坐标确定该焦点位置具体位于哪个四边形内。Alternatively, after determining the focus position, the mobile device may first determine whether the duration of the user's line of sight focus staying at the focus position is greater than a preset duration. If the duration is greater than the preset duration, the focus position is further determined according to the coordinates of the vertices of each of the quadrilaterals. Which quadrilateral is located.

如图8所示,移动设备通过调用内置摄像头,检测到用户的视线焦点在预览帧内的焦点位置为点A。那么,移动设备可以上述每个四边形的顶点坐标,确定点A位于四边形2内,此时,如果用户的视线焦点在该焦点位置停 留的时长大于2秒,则移动设备在预览帧内高亮显示四边形2的轮廓线(图8中以加粗显示作为高亮显示进行示意),即提示用户当前移动设备为用户确定的目前文档为四边形2的轮廓线所包围的图像。As shown in FIG. 8, the mobile device detects that the focus position of the user's line of sight focus in the preview frame is point A by calling the built-in camera. Then, the mobile device can determine the point A of each quadrilateral as described above, and determine that the point A is located in the quadrilateral 2, at this time, if the user's line of sight focus is at the focus position If the duration of the stay is greater than 2 seconds, the mobile device highlights the outline of the quadrilateral 2 in the preview frame (indicated by a bold display in FIG. 8 as a highlight), that is, prompting the user that the current mobile device is the current document determined by the user. An image enclosed by the outline of the quadrilateral 2.

此时,移动设备还可以正常显示步骤403中已显示的默认四边形(即四边形1)的轮廓线。At this time, the mobile device can also normally display the outline of the default quadrilateral (ie, quadrilateral 1) that has been displayed in step 403.

当然,如图9所示,如果检测到用户的视线焦点在预览帧内的焦点位置为点B,由于点B不在四边形2-四边形5内,而是位于四边形1内,此时,移动设备继续显示步骤403中已突出显示的四边形1的轮廓线。Of course, as shown in FIG. 9, if it is detected that the focus position of the user's line of sight focus in the preview frame is point B, since the point B is not in the quadrilateral 2-quadrant 5 but in the quadrilateral 1, the mobile device continues. The outline of the quadrilateral 1 that has been highlighted in step 403 is displayed.

406、检测到第一拍照指令后,移动设备对第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像。406. After detecting the first photographing instruction, the mobile device performs a preset process on the first image surrounded by the outline of the first quadrilateral, and then outputs the first image subjected to the preset processing.

综上,本文所涉及的“输出经过所述预设处理的第一图像”可以理解为用户点击拍照键后获得的图像,经过所述预设处理的第一图像保存在移动设备的存储器中。例如,该经过所述预设处理的第一图像如图11所示,应理解:该经过所述预设处理的第一图像可以包括如图9中四边形2内的全部内容或部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图9中四边形2内的全部内容或部分内容以外,还包括如图9中四边形1内的部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图9中四边形2内的全部内容或部分内容以外,还可以包括其他内容。In summary, the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile device. For example, the first image subjected to the preset processing is as shown in FIG. 11, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 9; The first image subjected to the preset processing may include a part of the content in the quadrilateral 1 as shown in FIG. 9 in addition to all or part of the content in the quadrilateral 2 in FIG. 9; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.

在步骤406中,如果用户确定第一四边形为目标内容,则可触发移动设备生成第一拍照指令,例如,如图10所示,用户点击移动设备显示界面内的拍照按钮后,移动设备生成第一拍照指令。进一步地,生成第一拍照指令后,移动设备对第一四边形的轮廓线所包围的第一图像进行拍照,并对拍照得到的第一图像执行预设处理。In step 406, if the user determines that the first quadrilateral is the target content, the mobile device may be triggered to generate a first photographing instruction. For example, as shown in FIG. 10, after the user clicks the photographing button in the mobile device display interface, the mobile device A first photographing instruction is generated. Further, after the first photographing instruction is generated, the mobile device photographs the first image surrounded by the outline of the first quadrilateral, and performs preset processing on the first image obtained by photographing.

仍以图10所示的预览帧11为例,如果用户开启文档校正功能后触发移动设备的拍照按钮,此时,移动设备响应该操作后生成第一拍照指令,进而,根据该第一拍照指令对四边形2的轮廓线所包围的第一图像拍照,并对得到的第一图像执行四边形校正,例如,对第一图像进行倾斜校正,如图11所示,得到倾斜校正后的第一图像,然后为用户在移动设备内显示经过倾斜校正后的第一图像,即完成文档校正功能。For example, the preview frame 11 shown in FIG. 10 is used as an example. If the user activates the document correction function and then triggers the camera button of the mobile device, the mobile device generates a first camera instruction in response to the operation, and further, according to the first camera instruction. Taking a picture of the first image enclosed by the outline of the quadrilateral 2, and performing quadrilateral correction on the obtained first image, for example, performing tilt correction on the first image, as shown in FIG. 11, obtaining the first image after tilt correction, Then, for the user to display the tilt-corrected first image in the mobile device, the document correction function is completed.

又或者,如果用户开启文档翻译功能后触发移动设备的拍照按钮,此时,移动设备响应该操作后生成第一拍照指令,进而,根据该第一拍照指令对四边形2的轮廓线所包围的第一图像拍照,与上述方法不同的是,此时的预设处理为:对第一图像中的文字执行OCR。Or, if the user initiates the document translation function and triggers the camera button of the mobile device, at this time, the mobile device generates a first camera instruction in response to the operation, and further, according to the first camera instruction, the outline surrounded by the outline of the quadrilateral 2 An image is taken. Unlike the above method, the preset processing at this time is: performing OCR on the text in the first image.

具体的,在执行OCR时,首先可以通过预设的字符识别算法识别第一图像内的文字,得到移动设备可识别的计算机文字,进一步地,可以通过预设的翻译软件或程序将该计算机文字翻译为预设语言的文字,例如,将第一图像内的中文翻译为英文,后续,如图12中的(a)所示,可以直接在第一图像中使用翻译后的英文替换翻译前的中文,或者,如图12中的(b)所示,也可以在第一图像中将翻译后的英文添加到第一图像中,例如,将翻译后的 英文添加到翻译前的中文旁边,以完成文档翻译功能。Specifically, when the OCR is executed, the text in the first image may be first recognized by a preset character recognition algorithm to obtain computer characters recognizable by the mobile device, and further, the computer text may be obtained through a preset translation software or program. Translating into the text of the default language, for example, translating the Chinese in the first image into English, and subsequently, as shown in (a) of FIG. 12, the translated English can be directly used in the first image to replace the pre-translation. Chinese, or, as shown in (b) of FIG. 12, the translated English may also be added to the first image in the first image, for example, the translated image English is added next to the pre-translation Chinese to complete the document translation function.

可以看出,在执行上述文档翻译功能时,仅需要对第一四边形内的文字进行文字识别和翻译,可以有效的避免第一四边形外部区域中无用数据的干扰,从而可以提升文字识别和翻译的速度和准确性。It can be seen that when performing the above document translation function, only the characters in the first quadrilateral need to be recognized and translated, which can effectively avoid the interference of useless data in the outer area of the first quadrilateral, thereby improving the text. Speed and accuracy of recognition and translation.

进一步地,上述预设处理还可以包括对第一图像执行裁剪操作等,本发明实施例对此不作任何限制。另外,在执行上述文档翻译功能之前,还可以对第一图像执行四边形校正,从而提高后续文字识别和翻译的准确率。Further, the foregoing preset processing may further include performing a cropping operation or the like on the first image, which is not limited in this embodiment of the present invention. In addition, before performing the above document translation function, quadrilateral correction can also be performed on the first image, thereby improving the accuracy of subsequent character recognition and translation.

进一步地,在执行上述步骤404之后,如果移动设备检测到用户的视线焦点位置发生变化,则移动设备可以执行下述步骤501-504。Further, after performing the above step 404, if the mobile device detects that the user's line of sight focus position changes, the mobile device may perform the following steps 501-504.

示例性的,移动设备可以周期性的确定用户的视线焦点位置,当连续两次确定的焦点位置之间的距离大于预设距离时(该预设距离大于上述半径R),可确定用户的视线焦点位置发生变化,此时移动设备可执行下述步骤501-504;当连续两次确定的焦点位置之间的距离小于或等于上述预设距离时,可确定用户的视线焦点位置未发生变化,此时移动设备可继续执行上述步骤405-406。Exemplarily, the mobile device may periodically determine the position of the user's line of sight focus. When the distance between the two determined focus positions is greater than the preset distance (the preset distance is greater than the radius R), the user's line of sight may be determined. The focus position changes. At this time, the mobile device may perform the following steps 501-504. When the distance between the focus positions determined twice consecutive times is less than or equal to the preset distance, it may be determined that the user's line of sight focus position has not changed. At this point, the mobile device can continue to perform steps 405-406 above.

具体的,如图13所示,步骤501-504包括:Specifically, as shown in FIG. 13, steps 501-504 include:

501、移动设备确定用户视线停留在第二四边形内的时长大于预设时长,第二四边形为至少两个四边形中除第一四边形外的任一个四边形。501. The mobile device determines that a duration in which the user's line of sight stays in the second quadrilateral is greater than a preset duration, and the second quadrilateral is any one of the at least two quadrilaterals except the first quadrilateral.

示例性的,与步骤405类似的,当用户的视线焦点位置发生变化后,移动设备可以根据上述每个四边形的顶点坐标,确定新的焦点位置位于第二四边形内。例如,如图14中的(a)所示,新的焦点位置C位于四边形5内,进而,移动设备可以确定用户的视线焦点在该四边形5内停留的时长是否大于预设时长,如果大于预设时长,则继续执行下述步骤502-204。Exemplarily, similar to step 405, after the user's line of sight focus position changes, the mobile device can determine that the new focus position is within the second quadrilateral according to the vertex coordinates of each of the above quadrilaterals. For example, as shown in (a) of FIG. 14, the new focus position C is located in the quadrilateral 5, and further, the mobile device can determine whether the duration of the user's line of sight focus staying within the quadrilateral 5 is greater than a preset duration, if greater than the pre- If the duration is set, the following steps 502-204 are continued.

502、移动设备在预览帧内正常显示第一四边形的轮廓线。502. The mobile device displays the outline of the first quadrilateral normally in the preview frame.

503、移动设备在预览帧内突出显示第二四边形的轮廓线。503. The mobile device highlights a contour of the second quadrilateral in the preview frame.

与步骤405不同的是,在步骤502-203中,由于步骤405中移动设备已经在预览帧内突出显示了第一四边形的轮廓线,而此时用户的视线焦点位置发生了变化,新的焦点位置位于第二四边形内,因此,移动设备可以在预览帧内正常显示第一四边形的轮廓线,即取消或隐藏步骤405中已经突出显示的第一四边形的轮廓线,恢复预览帧内第一四边形的原始显示方式;并且,在预览帧内突出显示第二四边形的轮廓线,从而提示用户当前移动设备为用户确定的目前文档为第二四边形的轮廓线所包围的图像。Different from step 405, in steps 502-203, since the mobile device has highlighted the contour of the first quadrilateral in the preview frame in step 405, the user's line of sight focus position has changed, new The focus position is located within the second quadrilateral, so the mobile device can normally display the outline of the first quadrilateral within the preview frame, ie, cancel or hide the outline of the first quadrilateral that has been highlighted in step 405, Restores the original display mode of the first quadrilateral in the preview frame; and highlights the outline of the second quadrilateral in the preview frame, thereby prompting the user that the current mobile device determines that the current document is the second quadrilateral The image enclosed by the outline.

当然,移动设备也可以先执行步骤503,再执行步骤502,或者,移动设备也可以同时执行步骤502和步骤503,本发明实施例对此不作任何限制。Of course, the mobile device may also perform step 503 and then perform step 502. Alternatively, the mobile device may perform step 502 and step 503 at the same time.

示例性的,仍如图14中的(a)所示,当前的新的焦点位置位C点,移动设备确定C点位于四边形5内,且视线焦点在四边形5内停留的时长大于预设时长,那么,移动设备可以在预览帧11内取消或隐藏已突出显示的四边形2的轮廓线(即正常显示四边形2的轮廓线),同时,高亮显示四边形5的轮廓线(图14中中的(a)以加粗显示作为高亮显示进行示意),即提示用户当前移动设备为用户确定的目前文档为四边形5的轮廓线所包围的图像。 Exemplarily, as shown in (a) of FIG. 14, the current new focus position is C, the mobile device determines that the C point is within the quadrilateral 5, and the line of sight focus stays within the quadrilateral 5 for longer than the preset duration. Then, the mobile device can cancel or hide the outline of the highlighted quadrilateral 2 in the preview frame 11 (ie, the outline of the normal display quadrilateral 2), and at the same time, highlight the outline of the quadrilateral 5 (in FIG. 14 (a) The bold display is indicated as a highlight), that is, the image that the current mobile device determines for the user that the current document is surrounded by the outline of the quadrilateral 5.

作为一种可能的设计,当移动设备确定C点位于四边形5内,且视线焦点在四边形5内停留的时长大于预设时长时,移动设备也可以在预览帧11内仅显示四边形5中的内容,如图14中的(b),以提示用户当前移动设备为用户确定的目前文档为四边形5的轮廓线所包围的图像。此时,四边形5的轮廓线所包围的图像可以占满整个预览帧11,也可以位于预览帧11内的某个区域;四边形5的轮廓线可以突出显示,也可以正常显示,本发明实施例对此不作任何限制。As a possible design, when the mobile device determines that the C point is within the quadrilateral 5, and the line of sight focus stays within the quadrilateral 5 for longer than the preset duration, the mobile device can also display only the content of the quadrilateral 5 in the preview frame 11. As shown in (b) of FIG. 14, the current document identified by the user for the current mobile device is an image surrounded by the outline of the quadrilateral 5. At this time, the image surrounded by the outline of the quadrilateral 5 may occupy the entire preview frame 11 or may be located in a certain area in the preview frame 11; the outline of the quadrilateral 5 may be highlighted or displayed normally, in the embodiment of the present invention. There are no restrictions on this.

504、检测到第二拍照指令后,移动设备对第二四边形的轮廓线所包围的第二图像执行预设处理后,输出经过所述预设处理的第二图像。504. After detecting the second photographing instruction, the mobile device performs a preset process on the second image surrounded by the outline of the second quadrilateral, and outputs a second image that passes the preset processing.

综上,本文所涉及的“输出经过所述预设处理的第二图像”可以理解为用户点击拍照键后获得的图像,该经过所述预设处理的第一图像保存在手机的存储器中。应理解:该经过所述预设处理的第二图像可以包括如图14中四边形5内的全部内容或部分内容;或者,该经过所述预设处理的第二图像可以除了包括如图14中四边形5内的全部内容或部分内容以外,还包括如图14中四边形1内的部分内容;或者,该经过所述预设处理的第二图像可以除了包括如图14中四边形5内的全部内容或部分内容以外,还可以包括其他内容。In summary, the “outputting the second image subjected to the preset processing” as referred to herein may be understood as an image obtained after the user clicks the camera key, and the first image processed through the preset process is saved in the memory of the mobile phone. It should be understood that the second image subjected to the preset processing may include all content or partial content in the quadrilateral 5 as shown in FIG. 14; or the second image subjected to the preset processing may be included in FIG. 14 except In addition to all or part of the content in the quadrilateral 5, part of the content in the quadrilateral 1 in FIG. 14 is included; or the second image subjected to the preset processing may include all the contents in the quadrilateral 5 as shown in FIG. In addition to or in part, it may include other content.

与步骤406类似的,如果用户确定第二四边形为目标内容,则可触发移动设备生成第二拍照指令。进一步地,生成第二拍照指令后,触发移动设备对第二四边形的轮廓线所包围的第二图像进行拍照,并对拍照得到的第二图像执行预设处理。Similar to step 406, if the user determines that the second quadrilateral is the target content, the mobile device can be triggered to generate a second photographing instruction. Further, after the second photographing instruction is generated, the mobile device is triggered to take a photo of the second image surrounded by the outline of the second quadrilateral, and perform preset processing on the photographed second image.

其中,具体的预设处理方法可参见步骤406中的相关描述,故此处不再赘述。For the specific preset processing method, refer to the related description in step 406, and therefore, details are not described herein.

在另一种可能的设计方法中,如图15所示,本发明实施例还提供一种处理方法,包括:In another possible design method, as shown in FIG. 15, the embodiment of the present invention further provides a processing method, including:

601、移动设备在其预览框内显示预览帧,该预览帧包括至少两个四边形。601. The mobile device displays a preview frame in its preview frame, the preview frame including at least two quadrilaterals.

与步骤401类似的,当用户开启文档校正或文档识别等需要定位目标内容的功能时,移动设备可调用外置摄像头进入预览状态,此时,移动设备将外置摄像头捕捉到的画面以预览帧的形式显示在移动设备的显示界面内。Similar to step 401, when the user turns on the function of positioning the target content, such as document correction or document recognition, the mobile device can call the external camera to enter the preview state. At this time, the mobile device captures the image captured by the external camera as a preview frame. The form is displayed in the display interface of the mobile device.

具体的,移动设备可以对预览帧内的画面内容执行四边形检测,并保存检测到的至少两个四边形中每个四边形的顶点坐标,从而确定出预览帧内包含的一个或多个四边形。Specifically, the mobile device may perform quadrilateral detection on the content of the picture in the preview frame, and save the coordinates of the vertices of each of the detected at least two quadrilaterals, thereby determining one or more quadrilaterals included in the preview frame.

示例性的,如图16所示,移动设备通过执行四边形检测,可以确定出该预览帧11内包括四边形1-四边形5,此时,移动设备在预览帧中突出显示四边形1-四边形5中的每个四边形,即提醒用户在当前的预览帧为用户检测到四边形1-四边形5。应理解:作为备选或替换,移动设备通过执行四边形检测后确定出该预览帧11内包括四边形1-四边形5,移动设备也可以在预览帧中正常显示四边形1-四边形5。Exemplarily, as shown in FIG. 16, the mobile device can determine that the preview frame 11 includes a quadrilateral 1-quadrant 5 by performing quadrilateral detection. At this time, the mobile device highlights the quadrilateral 1-quadrant 5 in the preview frame. Each quad, that is, reminds the user that the user detects a quadrilateral 1-quadrant 5 in the current preview frame. It should be understood that, as an alternative or alternative, the mobile device determines that the preview frame 11 includes a quadrilateral 1-quadrant 5 by performing quadrilateral detection, and the mobile device can also normally display the quadrilateral 1-quadrant 5 in the preview frame.

602、移动设备检测到作用于第一四边形的操作,该第一四边形为至少两个四边形中的一个。 602. The mobile device detects an operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals.

603、响应于操作,移动设备对第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像。603. In response to the operation, after performing a preset process on the first image surrounded by the contour line of the first quadrilateral, the mobile device outputs the first image that passes the preset process.

进而,在步骤602-603中,用户可以点击预览帧内显示的任意一个四边形,例如,如图17所示,用户点击四边形2内的任意位置,此时,移动设备可检测到用户作用于四边形2的点击操作,触发移动设备对第一四边形的轮廓线所包围的第一图像执行预设处理,并输出经过预设处理的第一图像。Further, in steps 602-603, the user can click on any quadrilateral displayed in the preview frame. For example, as shown in FIG. 17, the user clicks on any position within the quadrilateral 2, and at this time, the mobile device can detect that the user acts on the quadrilateral. The click operation of 2 triggers the mobile device to perform a preset process on the first image surrounded by the outline of the first quadrilateral, and outputs the first image subjected to the preset process.

综上,本文所涉及的“输出经过所述预设处理的第一图像”可以理解为用户点击拍照键后获得的图像,该经过所述预设处理的第一图像保存在手机的存储器中。例如,该经过所述预设处理的第一图像如图18所示,应理解:该经过所述预设处理的第一图像可以包括如图17中四边形2内的全部内容或部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图17中四边形2内的全部内容或部分内容以外,还包括如图17中四边形1内的部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图17中四边形2内的全部内容或部分内容以外,还可以包括其他内容。In summary, the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile phone. For example, the first image subjected to the preset processing is as shown in FIG. 18, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 17; The first image subjected to the preset processing may include, in addition to all contents or partial contents in the quadrilateral 2 in FIG. 17, a partial content in the quadrilateral 1 in FIG. 17; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.

示例性的,上述预设处理包括:对第一图像执行四边形校正;或者,对第一图像中的文字执行光学字符识别。其中,具体的预设处理方法可参见步骤406中的相关描述,故此处不再赘述。Exemplarily, the foregoing preset processing includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. For the specific preset processing method, refer to the related description in step 406, and therefore, details are not described herein.

在另一种可能的设计方法中,如图19所示,本发明实施例还提供一种处理方法,包括:In another possible design method, as shown in FIG. 19, an embodiment of the present invention further provides a processing method, including:

701、移动设备在其预览框内显示预览帧,该预览帧包括至少两个四边形。701. The mobile device displays a preview frame in its preview frame, where the preview frame includes at least two quadrilaterals.

与步骤401类似的,当用户开启文档校正或文档识别等需要定位目标内容的功能时,移动设备可调用外置摄像头进入预览状态,此时,移动设备可以对预览帧内的画面内容执行四边形检测,确定出预览帧内包含的一个或多个四边形,并在其预览框内显示该预览帧。Similar to step 401, when the user turns on the function of positioning the target content, such as document correction or document recognition, the mobile device can call the external camera to enter the preview state. At this time, the mobile device can perform quadrilateral detection on the screen content in the preview frame. Determines one or more quads contained within the preview frame and displays the preview frame in its preview frame.

702、移动设备检测到作用于第一四边形的第一操作,第一四边形为至少两个四边形中的一个。702. The mobile device detects a first operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals.

703、响应于第一操作,移动设备突出显示第一四边形的轮廓线。703. In response to the first operation, the mobile device highlights the outline of the first quadrilateral.

在步骤702-703中,用户可以点击预览帧内显示的任意一个四边形,例如,如图20所示,用户点击四边形2内的任意位置,此时,如图20所示,移动设备可检测到用户作用于四边形2的点击操作,触发移动设备将四边形2的轮廓线进行高亮显示(图21中以加粗显示作为高亮显示进行示意),从而提示用户当前移动设备为用户确定的目前文档为四边形2的轮廓线所包围的图像。In steps 702-703, the user can click on any quadrilateral displayed in the preview frame. For example, as shown in FIG. 20, the user clicks on any position within the quadrilateral 2. At this time, as shown in FIG. 20, the mobile device can detect The user acts on the click operation of the quadrilateral 2, triggering the mobile device to highlight the outline of the quadrilateral 2 (indicated by the bold display as a highlight in FIG. 21), thereby prompting the user that the current mobile device is the current document determined by the user. An image enclosed by the outline of the quadrilateral 2.

704、移动设备检测到作用于拍照键的第二操作。704. The mobile device detects a second operation that acts on the camera button.

705、响应于第二操作,移动设备对第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像。705. After performing a preset process on the first image surrounded by the outline of the first quadrilateral, in response to the second operation, outputting the first image that passes the preset process.

其中,上述拍照键可以为显示界面内用于触发拍照的虚拟按钮,也可以为移动设备上用于触发拍照的实体按键,本发明实施例对此不作任何限制。The camera button may be a virtual button for triggering the photographing in the display interface, or may be a physical button for triggering the photographing on the mobile device, which is not limited in this embodiment of the present invention.

在步骤704-705中,如果用户确定第一四边形为目标内容,如图22所示, 则可触发移动设备上的拍照键,即第二操作,进而,移动设备响应于第二操作,对第一四边形的轮廓线所包围的第一图像进行拍照,对拍照得到的第一图像执行预设处理后,输出经过所述预设处理的第一图像。In steps 704-705, if the user determines that the first quadrilateral is the target content, as shown in FIG. Then, the camera button on the mobile device can be triggered, that is, the second operation, and further, the mobile device takes a photo of the first image surrounded by the outline of the first quadrilateral in response to the second operation, and the first image obtained by photographing After the preset process is performed, the first image subjected to the preset process is output.

综上,本文所涉及的“输出经过所述预设处理的第一图像”可以理解为用户点击拍照键后获得的图像,该经过所述预设处理的第一图像保存在手机的存储器中。例如,该经过所述预设处理的第一图像如图23所示,应理解:该经过所述预设处理的第一图像可以包括如图22中四边形2内的全部内容或部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图22中四边形2内的全部内容或部分内容以外,还包括如图22中四边形1内的部分内容;或者,该经过所述预设处理的第一图像可以除了包括如图22中四边形2内的全部内容或部分内容以外,还可以包括其他内容。In summary, the “outputting the first image processed by the preset process” as referred to herein may be understood as an image obtained by the user after clicking the camera key, and the first image processed by the preset process is saved in the memory of the mobile phone. For example, the first image subjected to the preset processing is as shown in FIG. 23, and it should be understood that the first image subjected to the preset processing may include all or part of the content in the quadrilateral 2 in FIG. 22; The first image subjected to the preset processing may include a part of the content in the quadrilateral 1 as shown in FIG. 22 in addition to all or part of the content in the quadrilateral 2 in FIG. 22; or, the It is assumed that the processed first image may include other content in addition to all or part of the content in the quadrilateral 2 in FIG.

示例性的,上述预设处理包括:对第一图像执行四边形校正;或者,对第一图像中的文字执行光学字符识别。其中,具体的预设处理方法可参见步骤406中的相关描述,故此处不再赘述。Exemplarily, the foregoing preset processing includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. For the specific preset processing method, refer to the related description in step 406, and therefore, details are not described herein.

可以理解的是,上述移动设备等为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本发明实施例能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明实施例的范围。It can be understood that, in order to implement the above functions, the above mobile device or the like includes a hardware structure and/or a software module corresponding to each function. Those skilled in the art will readily appreciate that the embodiments of the present invention can be implemented in a combination of hardware or hardware and computer software in combination with the elements and algorithm steps of the various examples described in the embodiments disclosed herein. Whether a function is implemented in hardware or computer software to drive hardware depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods to implement the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the embodiments of the invention.

本发明实施例可以根据上述方法示例对移动设备等进行功能模块的划分,例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。需要说明的是,本发明实施例中对模块的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。The embodiment of the present invention may divide a function module into a mobile device or the like according to the foregoing method example. For example, each function module may be divided according to each function, or two or more functions may be integrated into one processing module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. It should be noted that the division of the module in the embodiment of the present invention is schematic, and is only a logical function division, and the actual implementation may have another division manner.

在采用对应各个功能划分各个功能模块的情况下,图24示出了上述实施例中所涉及的移动设备的一种可能的结构示意图。In the case where the respective functional modules are divided by corresponding functions, FIG. 24 shows a possible structural diagram of the mobile device involved in the above embodiment.

该移动设备包括:检测单元1101,显示单元1102,确定单元1103和处理单元1104。检测单元1101用于支持移动设备执行图4中的过程401-402,或者,图15中的过程602,或者,图18中的过程702、704;显示单元1102用于支持移动设备执行图4中的过程403、405,或者,图10中的过程502-503,或者,图15中的过程601,或者,图18中的过程701、703;确定单元1103用于支持移动设备执行图4中的过程404,或者,图10中的过程501;处理单元1104用于支持移动设备执行图4中的406,或者,图10中的过程504,或者,图15中的过程603,或者,图18中的过程705。其中,上述方法实施例涉及的各步骤的所有相关内容均可以援引到对应功能模块的功能描述,在此不再赘述。 The mobile device includes a detecting unit 1101, a display unit 1102, a determining unit 1103, and a processing unit 1104. The detecting unit 1101 is configured to support the mobile device to perform the processes 401-402 in FIG. 4, or the process 602 in FIG. 15, or the processes 702, 704 in FIG. 18; the display unit 1102 is configured to support the mobile device to perform the process in FIG. Process 403, 405, or process 502-503 in FIG. 10, or process 601 in FIG. 15, or process 701, 703 in FIG. 18; determination unit 1103 is used to support the mobile device to perform the process in FIG. Process 404, or process 501 in FIG. 10; processing unit 1104 is configured to support the mobile device to perform 406 in FIG. 4, or process 504 in FIG. 10, or process 603 in FIG. 15, or, in FIG. Process 705. All the related content of the steps involved in the foregoing method embodiments may be referred to the functional descriptions of the corresponding functional modules, and details are not described herein again.

在采用集成的单元的情况下,图25示出了上述实施例中所涉及的移动设备的一种可能的结构示意图。该移动设备包括:处理模块1302和通信模块1303。处理模块1302用于对移动设备的动作进行控制管理,例如,处理模块1302用于支持移动设备执行图5中的过程401-406,或者,图13中的过程501-504,或者,图15中的过程601-603,或者,图19中的过程701-705,和/或用于本文所描述的技术的其它过程。通信模块1303用于支持移动设备与其他网络实体的通信。移动设备还可以包括存储模块1301,用于存储移动设备的程序代码和数据。In the case of employing an integrated unit, FIG. 25 shows a possible structural diagram of the mobile device involved in the above embodiment. The mobile device includes a processing module 1302 and a communication module 1303. The processing module 1302 is configured to control and manage the actions of the mobile device. For example, the processing module 1302 is configured to support the mobile device to perform the processes 401-406 in FIG. 5, or the processes 501-504 in FIG. 13, or, in FIG. Processes 601-603, or processes 701-705 in Figure 19, and/or other processes for the techniques described herein. The communication module 1303 is configured to support communication of the mobile device with other network entities. The mobile device can also include a storage module 1301 for storing program code and data of the mobile device.

其中,处理模块1302可以是处理器或控制器,例如可以是中央处理器(Central Processing Unit,CPU),通用处理器,数字信号处理器(Digital Signal Processor,DSP),专用集成电路(Application-Specific Integrated Circuit,ASIC),现场可编程门阵列(Field Programmable Gate Array,FPGA)或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本发明实施例公开内容所描述的各种示例性的逻辑方框,模块和电路。所述处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等等。通信模块1303可以是收发器、收发电路(例如,RF电路)或通信接口等。存储模块1301可以是存储器。The processing module 1302 may be a processor or a controller, for example, may be a central processing unit (CPU), a general-purpose processor, a digital signal processor (DSP), and an application specific integrated circuit (Application-Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA) or other programmable logic device, transistor logic device, hardware component, or any combination thereof. It is possible to implement or carry out various exemplary logical blocks, modules and circuits described in connection with the disclosure of the embodiments of the invention. The processor may also be a combination of computing functions, for example, including one or more microprocessor combinations, a combination of a DSP and a microprocessor, and the like. The communication module 1303 may be a transceiver, a transceiver circuit (eg, an RF circuit), a communication interface, or the like. The storage module 1301 may be a memory.

当处理模块1302为处理器,通信模块1303为通信接口,存储模块1301为存储器时,本发明实施例所涉及的移动设备的具体硬件结构可以为图3所示的移动设备300。When the processing module 1302 is a processor, the communication module 1303 is a communication interface, and the storage module 1301 is a memory, the specific hardware structure of the mobile device according to the embodiment of the present invention may be the mobile device 300 shown in FIG.

进一步地,本发明实施例还提供一种计算机程序,该计算机程序包括指令,当该计算机程序被计算机执行时,可以使得计算机可以执行上述步骤401-406,或者,步骤501-504,或者,步骤601-603,或者,步骤701-705中相关的处理方法。Further, an embodiment of the present invention further provides a computer program, where the computer program includes instructions, when the computer program is executed by a computer, may cause the computer to perform the above steps 401-406, or steps 501-504, or steps 601-603, or the related processing methods in steps 701-705.

进一步地,本发明实施例还提供一种计算机可读存储介质,用于储存为上述移动设备所用的计算机软件指令,其包含用于执行为上述移动设备所设计的任意程序。Further, an embodiment of the present invention further provides a computer readable storage medium for storing computer software instructions used by the mobile device, including any program designed to execute the mobile device.

通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Through the description of the above embodiments, those skilled in the art can clearly understand that for the convenience and brevity of the description, only the division of the above functional modules is illustrated. In practical applications, the above functions can be allocated according to needs. It is completed by different functional modules, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above. For the specific working process of the system, the device and the unit described above, reference may be made to the corresponding process in the foregoing method embodiments, and details are not described herein again.

在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间 接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present application, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be used. Combinations can be integrated into another system, or some features can be ignored or not executed. Alternatively, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interface, device or unit. The coupling or communication connection can be in electrical, mechanical or other form.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:快闪存储器、移动硬盘、只读存储器、随机存取存储器、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application, in essence or the contribution to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) or processor to perform all or part of the steps of the methods described in various embodiments of the present application. The foregoing storage medium includes: a flash memory, a mobile hard disk, a read only memory, a random access memory, a magnetic disk, or an optical disk, and the like, which can store program codes.

在上述实施例中,可以全部或部分的通过软件,硬件,固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式出现。所以计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质,(例如,软盘,硬盘、磁带)、光介质(例如,DVD)或者半导体介质(例如固态硬盘Solid State Disk(SSD))。In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented in software, it may occur in whole or in part in the form of a computer program product. The computer program product therefore includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in accordance with embodiments of the present invention are generated in whole or in part. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions can be stored in a computer readable storage medium or transferred from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions can be from a website site, computer, server or data center Transfer to another website site, computer, server, or data center by wire (eg, coaxial cable, fiber optic, digital subscriber line (DSL), or wireless (eg, infrared, wireless, microwave, etc.). The computer readable storage medium can be any available media that can be accessed by a computer or a data storage device such as a server, data center, or the like that includes one or more available media. The usable medium can be a magnetic medium (eg, a floppy disk, a hard disk, a magnetic tape), an optical medium (eg, a DVD), or a semiconductor medium (eg, a Solid State Disk (SSD)).

以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。 The foregoing is only a specific embodiment of the present application, but the scope of protection of the present application is not limited thereto, and any changes or substitutions within the technical scope of the present application should be covered by the scope of the present application. . Therefore, the scope of protection of the present application should be determined by the scope of the claims.

Claims (22)

一种处理方法,所述方法应用于移动设备,其特征在于,包括:A processing method, the method being applied to a mobile device, comprising: 在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;Displaying a preview frame in a preview box of the mobile device, the preview frame including at least two quadrilaterals; 确定用户的视线焦点停留在第一四边形内的时长大于预设时长,所述第一四边形为所述至少两个四边形中的一个;Determining that the duration of the line of sight of the user staying within the first quadrilateral is greater than a preset duration, the first quadrilateral being one of the at least two quadrilaterals; 在所述预览框内突出显示所述第一四边形的轮廓线;Highlighting a contour line of the first quadrilateral in the preview frame; 检测到第一拍照指令后,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。After the first photographing instruction is detected, after the preset processing is performed on the first image surrounded by the outline of the first quadrilateral, the first image subjected to the preset processing is output; the preset processing includes: Performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. 根据权利要求1所述的方法,其特征在于,在所述预览框内突出显示所述第一四边形的轮廓线之后,还包括:The method according to claim 1, wherein after highlighting the outline of the first quadrilateral in the preview frame, the method further comprises: 确定用户的视线焦点停留在第二四边形内的时长大于所述预设时长,所述第二四边形为所述至少两个四边形中除所述第一四边形外的任一个四边形;Determining that a duration in which the user's line of sight focus stays within the second quadrilateral is greater than the predetermined length of time, and the second quadrilateral is any one of the at least two quadrilaterals except the first quadrilateral ; 在所述预览框内正常显示所述第一四边形的轮廓线;Displaying the outline of the first quadrilateral normally in the preview frame; 在所述预览框内突出显示所述第二四边形的轮廓线。An outline of the second quadrilateral is highlighted within the preview frame. 根据权利要求2所述的方法,其特征在于,在所述预览框帧内突出显示所述第二四边形的轮廓线之后,还包括:The method according to claim 2, further comprising: after highlighting the outline of the second quadrilateral in the preview frame frame, further comprising: 检测到第二拍照指令后,对所述第二四边形的轮廓线所包围的第二图像执行所述预设处理后,输出经过所述预设处理的第二图像。After detecting the second photographing instruction, after performing the preset processing on the second image surrounded by the outline of the second quadrilateral, outputting the second image subjected to the preset processing. 根据权利要求1-3中任一项所述的方法,其特征在于,在所述移动设备的预览框内显示预览帧之后,还包括:The method according to any one of claims 1 to 3, further comprising: after displaying the preview frame in the preview box of the mobile device, 在所述预览框内突出显示默认四边形的轮廓线,所述默认四边形为所述至少两个四边形中面积最大的四边形。A contour of a default quadrilateral is highlighted within the preview frame, and the default quadrilateral is a quadrilateral having the largest area among the at least two quadrilaterals. 根据权利要求1-4中任一项所述的方法,其特征在于,在所述移动设备的预览框内显示预览帧之前,还包括:The method according to any one of claims 1 to 4, further comprising: before displaying the preview frame in the preview box of the mobile device, 对所述预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标。Quadrilateral detection is performed on the preview frame, and the detected vertex coordinates of at least two quadrilaterals are saved. 根据权利要求1-5中任一项所述的方法,其特征在于,确定用户的视线焦点停留在第一四边形内的时长大于预设时长,包括:The method according to any one of claims 1 to 5, wherein determining that the user's line of sight focus stays within the first quadrilateral is greater than a preset duration includes: 检测所述视线焦点在所述预览帧内的焦点位置;Detecting a focus position of the line of sight focus within the preview frame; 确定所述焦点位置位于所述预览帧的第一四边形内的时长大于所述预设时长。Determining that the focus position is within the first quadrilateral of the preview frame is longer than the preset duration. 根据权利要求1-6中任一项所述的方法,其特征在于,对所述第一图像中的文字执行光学字符识别,包括:The method of any of claims 1-6, wherein performing optical character recognition on the text in the first image comprises: 将所述第一图像中的文字翻译为预设语言的文字,并将所述第一图像中的文字替换为翻译后的所述预设语言的文字;或者,Translating the text in the first image into a text of a preset language, and replacing the text in the first image with the translated text in the preset language; or 将所述第一图像中的文字翻译为预设语言的文字,并将翻译后的所述预设语言的文字添加到所述第一图像中。Translating the text in the first image into a text of a preset language, and adding the translated text of the preset language to the first image. 根据权利要求1-7中任一项所述的方法,其特征在于,所述预设处理 还包括:对所述第一图像执行裁剪操作。The method according to any one of claims 1 to 7, wherein the preset processing The method further includes performing a cropping operation on the first image. 一种处理方法,所述方法应用于移动设备,其特征在于,包括:A processing method, the method being applied to a mobile device, comprising: 对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;Performing a quadrilateral detection on the display preview frame, and storing the detected vertex coordinates of at least two quadrilaterals; 在所述移动设备的预览框中显示所述预览帧;Displaying the preview frame in a preview box of the mobile device; 确定视线焦点位于所述至少两个四边形中的第一四边形内超过预设时长,在所述预览框中呈现所述第一四边形的轮廓线;Determining that the line of sight focus is within the first quadrilateral of the at least two quadriles for more than a preset duration, and presenting the outline of the first quadrilateral in the preview frame; 检测到用于拍照的指令;An instruction for photographing is detected; 对所述预览帧中所述第一四边形的轮廓线包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像,所述经过所述预设处理的第一图像中包括所述第一四边形的轮廓线包围的第一内容,所述预设处理包括以下至少一种:四边形校正;对第一图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为所述预设语言的文字;对所述第一图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将所述预设语言的文字添加到识别出的文字的旁边。After performing a preset process on the first image surrounded by the contour line of the first quadrilateral in the preview frame, outputting the first image subjected to the preset process, the first passing the preset process The image includes a first content surrounded by the contour line of the first quadrilateral, and the preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the first image, and identifying the recognized text Translating into a text of a preset language, and replacing the recognized text with the text of the preset language; performing optical character recognition on the text in the first image, and translating the recognized text into a text of a preset language And adding the text of the preset language to the side of the recognized text. 一种处理方法,所述方法应用于移动设备,其特征在于,包括:A processing method, the method being applied to a mobile device, comprising: 对待显示预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标;Performing a quadrilateral detection on the display preview frame, and storing the detected vertex coordinates of at least two quadrilaterals; 在所述移动设备的预览框中显示所述预览帧,所述预览帧中呈现所述至少两个四边形中的第一四边形的轮廓线;Displaying the preview frame in a preview box of the mobile device, and presenting a contour line of the first quadrilateral of the at least two quadrilaterals in the preview frame; 确定视线焦点位于所述至少两个四边形中的第二四边形内超过预设时长,在所述预览框中呈现所述第二四边形的轮廓线,隐藏所述第一四边形的轮廓线,所述第二四边形为所述至少两个四边形中除所述第一四边形外的任一个;Determining that the line of sight focus is within a second quadrilateral of the at least two quadriles for more than a preset duration, presenting the outline of the second quadrilateral in the preview frame, hiding the first quadrilateral a contour line, wherein the second quadrilateral is any one of the at least two quadrilaterals except the first quadrilateral; 检测到用于拍照的指令;An instruction for photographing is detected; 对所述预览帧中所述第二四边形的轮廓线包围的第二图像执行预设处理后,输出经过所述预设处理的第二图像,所述经过所述预设处理的第二图像中包括所述第二四边形的轮廓线包围的第二内容,所述预设处理包括以下至少一种:四边形校正;对第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将识别出的文字替换为所述预设语言的文字;对所述第二图像中的文字执行光学字符识别,将识别出的文字翻译为预设语言的文字,并将所述预设语言的文字添加到识别出的文字的旁边。After performing a preset process on the second image surrounded by the outline of the second quadrilateral in the preview frame, outputting a second image that passes the preset process, and the second process that passes the preset process The image includes a second content surrounded by the contour line of the second quadrilateral, the preset processing includes at least one of the following: quadrilateral correction; performing optical character recognition on the text in the second image, and identifying the recognized text Translating into a text of a preset language, and replacing the recognized text with the text of the preset language; performing optical character recognition on the text in the second image, and translating the recognized text into a text of a preset language And adding the text of the preset language to the side of the recognized text. 一种处理方法,所述方法应用于移动设备,其特征在于,包括:A processing method, the method being applied to a mobile device, comprising: 在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;Displaying a preview frame in a preview box of the mobile device, the preview frame including at least two quadrilaterals; 检测到作用于第一四边形的操作,所述第一四边形为所述至少两个四边形中的一个;Detecting an operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals; 响应于所述操作,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。In response to the operation, after performing a preset process on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset process; the preset process includes: The first image performs quadrilateral correction; or, optical character recognition is performed on the text in the first image. 一种处理方法,所述方法应用于移动设备,其特征在于,包括: A processing method, the method being applied to a mobile device, comprising: 在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;Displaying a preview frame in a preview box of the mobile device, the preview frame including at least two quadrilaterals; 检测到作用于第一四边形的第一操作,所述第一四边形为所述至少两个四边形中的一个;Detecting a first operation acting on the first quadrilateral, the first quadrilateral being one of the at least two quadrilaterals; 响应于所述第一操作,突出显示所述第一四边形的轮廓线;Highlighting the outline of the first quadrilateral in response to the first operation; 检测到作用于拍照键的第二操作;A second operation acting on the camera button is detected; 响应于所述第二操作,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。After performing the preset processing on the first image surrounded by the outline of the first quadrilateral, outputting the first image subjected to the preset processing in response to the second operation; the preset processing includes: Performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. 一种移动设备,其特征在于,包括:A mobile device, comprising: 显示单元,用于在所述移动设备的预览框内显示预览帧,所述预览帧包括至少两个四边形;a display unit, configured to display a preview frame in a preview frame of the mobile device, where the preview frame includes at least two quadrangles; 确定单元,用于确定用户的视线焦点停留在第一四边形内的时长大于预设时长,所述第一四边形为所述至少两个四边形中的一个;a determining unit, configured to determine that a duration in which the user's line of sight focus stays within the first quadrilateral is greater than a preset duration, the first quadrilateral being one of the at least two quadrilaterals; 所述显示单元,还用于在所述预览框内突出显示所述第一四边形的轮廓线;The display unit is further configured to highlight an outline of the first quadrilateral in the preview frame; 处理单元,用于检测到第一拍照指令后,对所述第一四边形的轮廓线所包围的第一图像执行预设处理后,输出经过所述预设处理的第一图像;所述预设处理包括:对所述第一图像执行四边形校正;或者,对所述第一图像中的文字执行光学字符识别。a processing unit, configured to perform a preset process on the first image surrounded by the outline of the first quadrilateral after detecting the first photographing instruction, and output a first image that passes the preset processing; The preset processing includes: performing quadrilateral correction on the first image; or performing optical character recognition on the text in the first image. 根据权利要求13所述的移动设备,其特征在于,The mobile device of claim 13 wherein: 所述确定单元,还用于确定用户的视线焦点停留在第二四边形内的时长大于所述预设时长,所述第二四边形为所述至少两个四边形中除所述第一四边形外的任一个四边形;The determining unit is further configured to determine that a duration of the line of sight of the user staying in the second quadrilateral is greater than the preset duration, and the second quadrilateral is the first of the at least two quadrilaterals Any quadrilateral outside the quadrilateral; 所述显示单元,还用于在所述预览框内正常显示所述第一四边形的轮廓线;在所述预览框内突出显示所述第二四边形的轮廓线。The display unit is further configured to normally display an outline of the first quadrilateral in the preview frame; and highlight an outline of the second quadrilateral in the preview frame. 根据权利要求14所述的移动设备,其特征在于,The mobile device of claim 14 wherein: 所述处理单元,还用于检测到第二拍照指令后,对所述第二四边形的轮廓线所包围的第二图像执行所述预设处理后,输出经过所述预设处理的第二图像。The processing unit is further configured to: after detecting the second photographing instruction, performing the preset processing on the second image surrounded by the contour line of the second quadrilateral, and outputting the Two images. 根据权利要求13-15中任一项所述的移动设备,其特征在于,Mobile device according to any of the claims 13-15, characterized in that 所述显示单元,还用于在所述预览框内突出显示默认四边形的轮廓线,所述默认四边形为所述至少两个四边形中面积最大的四边形。The display unit is further configured to highlight a contour of a default quadrilateral in the preview frame, wherein the default quadrilateral is a quadrilateral having the largest area among the at least two quadrilaterals. 根据权利要求13-16中任一项所述的移动设备,其特征在于,所述移动设备还包括:The mobile device according to any one of claims 13 to 16, wherein the mobile device further comprises: 检测单元,用于对所述预览帧执行四边形检测,保存检测到的至少两个四边形的顶点坐标。And a detecting unit, configured to perform quadrilateral detection on the preview frame, and save the detected vertex coordinates of at least two quadrilaterals. 根据权利要求13-17中任一项所述的移动设备,其特征在于,A mobile device according to any of claims 13-17, wherein 所述确定单元,具体用于检测所述用户的视线焦点在所述预览帧内的焦点位置;确定所述焦点位置位于所述预览帧的第一四边形内的时长大于所述预设时长。The determining unit is specifically configured to detect a focus position of the user's line of sight focus in the preview frame, and determine that the focus position is within the first quadrilateral of the preview frame is greater than the preset duration . 根据权利要求13-18中任一项所述的移动设备,其特征在于, Mobile device according to any of the claims 13-18, characterized in that 所述处理单元,具体用于将所述第一图像中的文字翻译为预设语言的文字,并将所述第一图像中的文字替换为翻译后的所述预设语言的文字;或者,将所述第一图像中的文字翻译为预设语言的文字,并将翻译后的所述预设语言的文字添加到所述第一图像中。The processing unit is configured to translate the text in the first image into a text in a preset language, and replace the text in the first image with the translated text in the preset language; or Translating the text in the first image into a text of a preset language, and adding the translated text of the preset language to the first image. 根据权利要求13-19中任一项所述的移动设备,其特征在于,Mobile device according to any of the claims 13-19, characterized in that 所述处理单元,具体用于将所述第一图像中的文字翻译为预设语言的文字,并将所述第一图像中的文字替换为翻译后所述预设语言的文字;或者,将所述第一图像中的文字翻译为预设语言的文字,并将翻译后所述预设语言的文字添加到所述第一图像中。The processing unit is specifically configured to translate the text in the first image into a text in a preset language, and replace the text in the first image with the text in the preset language after translation; or The text in the first image is translated into a text of a preset language, and the translated text of the preset language is added to the first image. 一种移动设备,其特征在于,包括:一个或多个处理器、存储器、总线系统、以及一个或多个程序,所述处理器和所述存储器通过所述总线系统相连;A mobile device, comprising: one or more processors, a memory, a bus system, and one or more programs, the processor and the memory being connected by the bus system; 其中,所述一个或多个程序被存储在所述存储器中,所述一个或多个程序包括指令,所述指令当被所述移动设备执行时使所述移动设备执行如权利要求1-12中任一项所述的处理方法。Wherein the one or more programs are stored in the memory, the one or more programs comprising instructions that, when executed by the mobile device, cause the mobile device to perform as claimed in claims 1-12 The processing method according to any one of the preceding claims. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质包括指令,当所述指令被移动设备执行时使所述移动设备执行如权利要求1-12中任一项所述的处理方法。 A computer readable storage medium, comprising: instructions for causing the mobile device to perform the method of any of claims 1-12 when the instructions are executed by a mobile device Approach.
PCT/CN2017/072496 2016-12-16 2017-01-24 Processing method and mobile device Ceased WO2018107566A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201780009378.XA CN108604128B (en) 2016-12-16 2017-01-24 A processing method and mobile device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201611169528 2016-12-16
CN201611169528.7 2016-12-16

Publications (1)

Publication Number Publication Date
WO2018107566A1 true WO2018107566A1 (en) 2018-06-21

Family

ID=62557699

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/072496 Ceased WO2018107566A1 (en) 2016-12-16 2017-01-24 Processing method and mobile device

Country Status (2)

Country Link
CN (1) CN108604128B (en)
WO (1) WO2018107566A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109495687A (en) * 2018-12-21 2019-03-19 努比亚技术有限公司 Photographic method, device, terminal and storage medium
WO2021175125A1 (en) * 2020-03-06 2021-09-10 Guangdong Oppo Mobile Telecommunications Corp., Ltd. System and method for automatically adjusting focus of a camera
CN114302054B (en) * 2021-11-30 2023-06-20 歌尔科技有限公司 Photographing method of AR equipment and AR equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101620595A (en) * 2009-08-11 2010-01-06 上海合合信息科技发展有限公司 Method and system for translating text of electronic equipment
US20120194418A1 (en) * 2010-02-28 2012-08-02 Osterhout Group, Inc. Ar glasses with user action control and event input based control of eyepiece application
CN103488630A (en) * 2013-09-29 2014-01-01 小米科技有限责任公司 Method, device and terminal for processing picture
CN104395857A (en) * 2012-05-09 2015-03-04 英特尔公司 Eye tracking based selective accentuation of portions of a display
CN105739691A (en) * 2016-01-26 2016-07-06 宋宏 Virtual experience type electronic direction system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010026520A2 (en) * 2008-09-03 2010-03-11 Koninklijke Philips Electronics N.V. Method of performing a gaze-based interaction between a user and an interactive display system
CN101667251B (en) * 2008-09-05 2014-07-23 三星电子株式会社 OCR recognition method and device with auxiliary positioning function
JP6127839B2 (en) * 2013-09-02 2017-05-17 ソニー株式会社 Information processing apparatus, information processing method, and program
KR102173109B1 (en) * 2014-09-05 2020-11-02 삼성전자주식회사 Method of processing a digital image, Computer readable storage medium of recording the method and digital photographing apparatus
CN105786804B (en) * 2016-02-26 2018-10-19 维沃移动通信有限公司 A translation method and mobile terminal

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101620595A (en) * 2009-08-11 2010-01-06 上海合合信息科技发展有限公司 Method and system for translating text of electronic equipment
US20120194418A1 (en) * 2010-02-28 2012-08-02 Osterhout Group, Inc. Ar glasses with user action control and event input based control of eyepiece application
CN104395857A (en) * 2012-05-09 2015-03-04 英特尔公司 Eye tracking based selective accentuation of portions of a display
CN103488630A (en) * 2013-09-29 2014-01-01 小米科技有限责任公司 Method, device and terminal for processing picture
CN105739691A (en) * 2016-01-26 2016-07-06 宋宏 Virtual experience type electronic direction system

Also Published As

Publication number Publication date
CN108604128B (en) 2021-03-30
CN108604128A (en) 2018-09-28

Similar Documents

Publication Publication Date Title
US11604574B2 (en) Electronic device and method for electronic device displaying image
TWI724736B (en) Image processing method and device, electronic equipment, storage medium and computer program
JP7110412B2 (en) LIFE DETECTION METHOD AND DEVICE, ELECTRONIC DEVICE, AND STORAGE MEDIUM
CN111079576B (en) Living body detection method, living body detection device, living body detection equipment and storage medium
KR102666977B1 (en) Electronic device and method for photographing image thereof
JP7058760B2 (en) Image processing methods and their devices, terminals and computer programs
KR102593824B1 (en) Method for controlling a camera and electronic device thereof
US10623661B2 (en) Image composition method with image sensors having different angles of view and electronic device for supporting the same
EP3001247B1 (en) Method and terminal for acquiring panoramic image
KR102491564B1 (en) foldable electronic device with flexible display
WO2019101021A1 (en) Image recognition method, apparatus, and electronic device
KR102018887B1 (en) Image preview using detection of body parts
EP3440829B1 (en) Apparatus and method for processing image
US10303933B2 (en) Apparatus and method for processing a beauty effect
KR102547104B1 (en) Electronic device and method for processing plural images
WO2019219065A1 (en) Video analysis method and device
US10319086B2 (en) Method for processing image and electronic device supporting the same
CN111541845A (en) Image processing method and device and electronic equipment
CN108495045B (en) Image capturing method, device, electronic device, and storage medium
CN112488914B (en) Image stitching method, device, terminal and computer readable storage medium
WO2017124899A1 (en) Information processing method, apparatus and electronic device
CN111656391B (en) Image correction method and terminal
CN117063146A (en) Information display device and method
WO2018184260A1 (en) Correcting method and device for document image
WO2021218551A1 (en) Photographing method and apparatus, terminal device, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17881390

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17881390

Country of ref document: EP

Kind code of ref document: A1