Zhang et al., 2022 - Google Patents
A self-supervised monocular depth estimation approach based on UAV aerial imagesZhang et al., 2022
View PDF- Document ID
- 9659381451565736689
- Author
- Zhang Y
- Yu Q
- Low K
- Lv C
- Publication year
- Publication venue
- 2022 IEEE/AIAA 41st Digital Avionics Systems Conference (DASC)
External Links
Snippet
The Unmanned Aerial Vehicles (UAVs) have gained increasing attention recently, and depth estimation is one of the essential tasks for the safe operation of UAVs, especially for drones at low altitudes. Considering the limitations of UAVs' size and payload, innovative methods …
- 238000000034 method 0 abstract description 9
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10032—Satellite or aerial image; Remote sensing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/05—Geographic models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Moreau et al. | Lens: Localization enhanced by nerf synthesis | |
Lin et al. | Depth estimation from monocular images and sparse radar data | |
CN108428255B (en) | Real-time three-dimensional reconstruction method based on unmanned aerial vehicle | |
Miclea et al. | Monocular depth estimation with improved long-range accuracy for UAV environment perception | |
CN108230361B (en) | Method and system for enhancing target tracking by fusing unmanned aerial vehicle detector and tracker | |
US11064178B2 (en) | Deep virtual stereo odometry | |
Kumar et al. | Monocular fisheye camera depth estimation using sparse lidar supervision | |
KR102718665B1 (en) | Object pose estimating method and apparatus | |
US11948309B2 (en) | Systems and methods for jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator | |
WO2019230339A1 (en) | Object identification device, system for moving body, object identification method, training method of object identification model, and training device for object identification model | |
CN113566825B (en) | Unmanned aerial vehicle navigation method, system and storage medium based on vision | |
Florea et al. | Wilduav: Monocular uav dataset for depth estimation tasks | |
Wang et al. | Unsupervised learning of 3d scene flow from monocular camera | |
He et al. | Neural radiance field in autonomous driving: A survey | |
CN112950786A (en) | Vehicle three-dimensional reconstruction method based on neural network | |
CN117197388A (en) | A method and system for constructing real-life three-dimensional virtual reality scenes based on generative adversarial neural networks and oblique photography | |
Liu et al. | A multi-modality sensor system for unmanned surface vehicle | |
Narayanan et al. | A multi-purpose realistic haze benchmark with quantifiable haze levels and ground truth | |
Byrne et al. | Expansion segmentation for visual collision detection and estimation | |
CN116912488A (en) | Three-dimensional panorama segmentation method and device based on multi-view camera | |
Dionisio et al. | Nereon-an underwater dataset for monocular depth estimation | |
Pinard et al. | End-to-end depth from motion with stabilized monocular videos | |
Jaisawal et al. | Airfisheye dataset: A multi-model fisheye dataset for uav applications | |
Zhang et al. | A self-supervised monocular depth estimation approach based on UAV aerial images | |
Bhutani et al. | Unsupervised Depth and Confidence Prediction from Monocular Images using Bayesian Inference |