Hong et al., 2024 - Google Patents

Learning subject-aware cropping by outpainting professional photos

Hong et al., 2024

Document ID: 2349585078635675270
Author: Hong J; Yuan L; Gharbi M; Fisher M; Fatahalian K
Publication year: 2024
Publication venue: Proceedings of the AAAI Conference on Artificial Intelligence

External Links

Cited by

Snippet

How to frame (or crop) a photo often depends on the image subject and its context; eg, a human portrait. Recent works have defined the subject-aware image cropping task as a nuanced and practical version of image cropping. We propose a weakly-supervised …

Continue reading at ojs.aaai.org (PDF) (other versions)

238000000034 method 0 abstract description 27

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing

Similar Documents

Publication	Publication Date	Title
Jiang et al.	2022	Photohelper: portrait photographing guidance via deep feature retrieval and fusion
Rossler et al.	2019	Faceforensics++: Learning to detect manipulated facial images
Rössler et al.	2018	Faceforensics: A large-scale video dataset for forgery detection in human faces
US10657652B2 (en)	2020-05-19	Image matting using deep learning
Bhattacharya et al.	2010	A framework for photo-quality assessment and enhancement based on visual aesthetics
Liu et al.	2010	Optimizing photo composition
Wu et al.	2024	Motionbooth: Motion-aware customized text-to-video generation
Su et al.	2012	Preference-aware view recommendation system for scenic photos based on bag-of-aesthetics-preserving features
Zobaed et al.	2022	Deepfakes: Detecting forged and synthetic media content using machine learning
Jin et al.	2024	Neural gaffer: Relighting any object via diffusion
Wang et al.	2022	PalGAN: Image colorization with palette generative adversarial networks
Ni et al.	2013	Learning to photograph: A compositional perspective
Yin et al.	2019	Instance-level facial attributes transfer with geometry-aware flow
Hong et al.	2024	Learning subject-aware cropping by outpainting professional photos
Hou et al.	2020	Object-level attention for aesthetic rating distribution prediction
Huang et al.	2019	Temporally coherent video harmonization using adversarial networks
WO2024131565A1 (en)	2024-06-27	Garment image extraction method and apparatus, and device, medium and product
Shahrian et al.	2014	Temporally coherent and spatially accurate video matting
Ai et al.	2024	DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Tous	2023	Pictonaut: movie cartoonization using 3D human pose estimation and GANs
Chen et al.	2025	Zero-shot image harmonization with generative model prior
Song et al.	2018	Photo squarization by deep multi-operator retargeting
Jiang et al.	2023	SPAC-Net: synthetic pose-aware animal ControlNet for enhanced pose estimation
Gomez-Nieto et al.	2022	Quality aware features for performance prediction and time reduction in video object tracking
CN116415019B (en)	2025-09-12	Virtual reality (VR) image recognition method and device, electronic device, and storage medium