[go: up one dir, main page]

Hong et al., 2024 - Google Patents

Learning subject-aware cropping by outpainting professional photos

Hong et al., 2024

View PDF
Document ID
2349585078635675270
Author
Hong J
Yuan L
Gharbi M
Fisher M
Fatahalian K
Publication year
Publication venue
Proceedings of the AAAI Conference on Artificial Intelligence

External Links

Snippet

How to frame (or crop) a photo often depends on the image subject and its context; eg, a human portrait. Recent works have defined the subject-aware image cropping task as a nuanced and practical version of image cropping. We propose a weakly-supervised …
Continue reading at ojs.aaai.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G06K9/00268Feature extraction; Face representation
    • G06K9/00281Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30244Information retrieval; Database structures therefor; File system structures therefor in image databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/60Editing figures and text; Combining figures or text
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00Indexing scheme for image data processing or generation, in general
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing

Similar Documents

Publication Publication Date Title
Jiang et al. Photohelper: portrait photographing guidance via deep feature retrieval and fusion
Rossler et al. Faceforensics++: Learning to detect manipulated facial images
Rössler et al. Faceforensics: A large-scale video dataset for forgery detection in human faces
US10657652B2 (en) Image matting using deep learning
Bhattacharya et al. A framework for photo-quality assessment and enhancement based on visual aesthetics
Liu et al. Optimizing photo composition
Wu et al. Motionbooth: Motion-aware customized text-to-video generation
Su et al. Preference-aware view recommendation system for scenic photos based on bag-of-aesthetics-preserving features
Zobaed et al. Deepfakes: Detecting forged and synthetic media content using machine learning
Jin et al. Neural gaffer: Relighting any object via diffusion
Wang et al. PalGAN: Image colorization with palette generative adversarial networks
Ni et al. Learning to photograph: A compositional perspective
Yin et al. Instance-level facial attributes transfer with geometry-aware flow
Hong et al. Learning subject-aware cropping by outpainting professional photos
Hou et al. Object-level attention for aesthetic rating distribution prediction
Huang et al. Temporally coherent video harmonization using adversarial networks
WO2024131565A1 (en) Garment image extraction method and apparatus, and device, medium and product
Shahrian et al. Temporally coherent and spatially accurate video matting
Ai et al. DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Tous Pictonaut: movie cartoonization using 3D human pose estimation and GANs
Chen et al. Zero-shot image harmonization with generative model prior
Song et al. Photo squarization by deep multi-operator retargeting
Jiang et al. SPAC-Net: synthetic pose-aware animal ControlNet for enhanced pose estimation
Gomez-Nieto et al. Quality aware features for performance prediction and time reduction in video object tracking
CN116415019B (en) Virtual reality (VR) image recognition method and device, electronic device, and storage medium