Chaki, 2025 - Google Patents
The Art of Deep Learning Image Augmentation: The Seeds of Success
Chaki, 2025
- Document ID
- 3488012495309056184
- Author
- Chaki J
- Publication year
- 2025
Snippet
In the realm of deep learning, the adage “garbage in, garbage out” rings particularly true. The performance of computer vision models hinges heavily on the quality and quantity of the training data. Limited data poses significant challenges, leading to overfitting, poor …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00275—Holistic features and representations, i.e. based on the facial image taken as a whole
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
- G06T2207/30201—Face
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Anantrasirichai et al. | Artificial intelligence in the creative industries: a review | |
US12340518B2 (en) | Figure-ground neural radiance fields for three-dimensional object category modelling | |
US12260530B2 (en) | Generating a modified digital image utilizing a human inpainting model | |
AU2005286823B2 (en) | System, method, and apparatus for generating a three-dimensional representation from one or more two-dimensional images | |
Liu et al. | Structure-guided arbitrary style transfer for artistic image and video | |
US20240087265A1 (en) | Multidimentional image editing from an input image | |
Wan et al. | Generative adversarial learning for detail-preserving face sketch synthesis | |
CN117079313A (en) | Image processing method, device, equipment and storage medium | |
Manaswi | Generative Adversarial Networks with Industrial Use Cases | |
US20240135510A1 (en) | Utilizing a generative machine learning model and graphical user interface for creating modified digital images from an infill semantic map | |
US20240331247A1 (en) | Animated facial expression and pose transfer utilizing an end-to-end machine learning model | |
Tous | Pictonaut: movie cartoonization using 3D human pose estimation and GANs | |
Gao | A method for face image inpainting based on generative adversarial networks | |
Behrouzi et al. | Maskrenderer: 3D-infused multi-mask realistic face reenactment | |
Asad et al. | MGAN-CRCM: a novel multiple generative adversarial network and coarse refinement-based cognizant method for image inpainting | |
CN118015142B (en) | Face image processing method, device, computer equipment and storage medium | |
US20250054115A1 (en) | Deep learning-based high resolution image inpainting | |
Zhao et al. | Purifying naturalistic images through a real-time style transfer semantics network | |
Yan | Image analysis for digital media applications | |
Chaki | The Art of Deep Learning Image Augmentation: The Seeds of Success | |
Šoberl | Mixed reality and deep learning: Augmenting visual information using generative adversarial networks | |
Das | 3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation | |
Chaki | Generative Adversarial Networks Based Image Augmentation | |
Jain | Generative Adversarial Networks: A Review of Developments and Diverse Applications | |
Bukar | Automatic age progression and estimation from faces |