Chaki, 2025 - Google Patents

The Art of Deep Learning Image Augmentation: The Seeds of Success

Chaki, 2025

Document ID: 3488012495309056184
Author: Chaki J
Publication year: 2025

External Links

Cited by

Snippet

In the realm of deep learning, the adage “garbage in, garbage out” rings particularly true. The performance of computer vision models hinges heavily on the quality and quantity of the training data. Limited data poses significant challenges, leading to overfitting, poor …

Continue reading at link.springer.com (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
- G06T2207/30201—Face
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general

Similar Documents

Publication	Publication Date	Title
Anantrasirichai et al.	2022	Artificial intelligence in the creative industries: a review
US12340518B2 (en)	2025-06-24	Figure-ground neural radiance fields for three-dimensional object category modelling
US12260530B2 (en)	2025-03-25	Generating a modified digital image utilizing a human inpainting model
AU2005286823B2 (en)	2009-10-01	System, method, and apparatus for generating a three-dimensional representation from one or more two-dimensional images
Liu et al.	2021	Structure-guided arbitrary style transfer for artistic image and video
US20240087265A1 (en)	2024-03-14	Multidimentional image editing from an input image
Wan et al.	2021	Generative adversarial learning for detail-preserving face sketch synthesis
CN117079313A (en)	2023-11-17	Image processing method, device, equipment and storage medium
Manaswi	2020	Generative Adversarial Networks with Industrial Use Cases
US20240135510A1 (en)	2024-04-25	Utilizing a generative machine learning model and graphical user interface for creating modified digital images from an infill semantic map
US20240331247A1 (en)	2024-10-03	Animated facial expression and pose transfer utilizing an end-to-end machine learning model
Tous	2023	Pictonaut: movie cartoonization using 3D human pose estimation and GANs
Gao	2022	A method for face image inpainting based on generative adversarial networks
Behrouzi et al.	2025	Maskrenderer: 3D-infused multi-mask realistic face reenactment
Asad et al.	2025	MGAN-CRCM: a novel multiple generative adversarial network and coarse refinement-based cognizant method for image inpainting
CN118015142B (en)	2025-03-28	Face image processing method, device, computer equipment and storage medium
US20250054115A1 (en)	2025-02-13	Deep learning-based high resolution image inpainting
Zhao et al.	2019	Purifying naturalistic images through a real-time style transfer semantics network
Yan	2001	Image analysis for digital media applications
Chaki	2025	The Art of Deep Learning Image Augmentation: The Seeds of Success
Šoberl	2023	Mixed reality and deep learning: Augmenting visual information using generative adversarial networks
Das	2023	3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation
Chaki	2025	Generative Adversarial Networks Based Image Augmentation
Jain	2024	Generative Adversarial Networks: A Review of Developments and Diverse Applications
Bukar	2019	Automatic age progression and estimation from faces