Nosheen, Iqra, Iqbal, Talha, Ullah, Ihsan, Ennis, Cathy and Madden, Michael (2024) Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency. 26th Irish Machine Vision and Image Processing Conference (IMVIP 2024). pp. 1-4.
Preview
nosheen-et-al-investigating-the-impact-of-encoder-architectures-and-batch-size-on-depth-estimation-through-semantic.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (75kB) | Preview
Abstract
Traditional methods for depth estimation rely on supervised learning with resource-intensive LiDAR data. Virtual synthetic datasets provide a cost-effective alternative, but bridging the domain gap between synthetic and real-world data remains a significant challenge. In existing work, this gap is addressed through domain adaptation techniques, aligning the feature distributions of synthetic (source) and real-world (target) domains. Our study explores the efficacy of different encoder architectures (ResNet variants with 35, 50, 101, 101-with-attention, and 152 convolution layers) and two batch sizes (2 and 4) for the depth estimation task. Our experiments show that ResNet101 without and with attention mechanisms provide the best performance across 2 and 4 batch sizes, respectively, compared to the other models. Conversely, the deeper architecture considered, ResNet152, shows the lowest performance, indicating that increasing the network depth does not necessarily lead to improved results for depth estimation tasks. This study's findings provide valuable insights for developing more effective depth estimation algorithms, and it suggests future directions in hyperparameter optimization and semantic consistency modeling.
Item Type: | Article |
---|---|
Keywords: | Depth Estimation; Semantic Consistency; Encoder Architectures; Batch sizes; Image translation; |
Academic Unit: | Faculty of Science and Engineering > Computer Science Faculty of Science and Engineering > Research Institutes > Hamilton Institute |
Item ID: | 20319 |
Identification Number: | 10.1049/icp.2024.3295 |
Depositing User: | IR Editor |
Date Deposited: | 12 Aug 2025 10:47 |
Journal or Publication Title: | 26th Irish Machine Vision and Image Processing Conference (IMVIP 2024) |
Publisher: | The Institution of Engineering and Technology (IET) |
Refereed: | Yes |
Related URLs: | |
URI: | https://mural.maynoothuniversity.ie/id/eprint/20319 |
Use Licence: | This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here |
Repository Staff Only (login required)
Downloads
Downloads per month over past year