Hong et al., 2024 - Google Patents
Learning subject-aware cropping by outpainting professional photosHong et al., 2024
View PDF- Document ID
- 2349585078635675270
- Author
- Hong J
- Yuan L
- Gharbi M
- Fisher M
- Fatahalian K
- Publication year
- Publication venue
- Proceedings of the AAAI Conference on Artificial Intelligence
External Links
Snippet
How to frame (or crop) a photo often depends on the image subject and its context; eg, a human portrait. Recent works have defined the subject-aware image cropping task as a nuanced and practical version of image cropping. We propose a weakly-supervised …
- 238000000034 method 0 abstract description 27
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Jiang et al. | Photohelper: portrait photographing guidance via deep feature retrieval and fusion | |
Rossler et al. | Faceforensics++: Learning to detect manipulated facial images | |
Rössler et al. | Faceforensics: A large-scale video dataset for forgery detection in human faces | |
US10657652B2 (en) | Image matting using deep learning | |
Bhattacharya et al. | A framework for photo-quality assessment and enhancement based on visual aesthetics | |
Liu et al. | Optimizing photo composition | |
Wu et al. | Motionbooth: Motion-aware customized text-to-video generation | |
Su et al. | Preference-aware view recommendation system for scenic photos based on bag-of-aesthetics-preserving features | |
Zobaed et al. | Deepfakes: Detecting forged and synthetic media content using machine learning | |
Jin et al. | Neural gaffer: Relighting any object via diffusion | |
Wang et al. | PalGAN: Image colorization with palette generative adversarial networks | |
Ni et al. | Learning to photograph: A compositional perspective | |
Yin et al. | Instance-level facial attributes transfer with geometry-aware flow | |
Hong et al. | Learning subject-aware cropping by outpainting professional photos | |
Hou et al. | Object-level attention for aesthetic rating distribution prediction | |
Huang et al. | Temporally coherent video harmonization using adversarial networks | |
WO2024131565A1 (en) | Garment image extraction method and apparatus, and device, medium and product | |
Shahrian et al. | Temporally coherent and spatially accurate video matting | |
Ai et al. | DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation | |
Tous | Pictonaut: movie cartoonization using 3D human pose estimation and GANs | |
Chen et al. | Zero-shot image harmonization with generative model prior | |
Song et al. | Photo squarization by deep multi-operator retargeting | |
Jiang et al. | SPAC-Net: synthetic pose-aware animal ControlNet for enhanced pose estimation | |
Gomez-Nieto et al. | Quality aware features for performance prediction and time reduction in video object tracking | |
CN116415019B (en) | Virtual reality (VR) image recognition method and device, electronic device, and storage medium |