Investigating the Impact of Encoder Architectures and Batch Size on
Depth Estimation through Semantic Consistency

Nosheen, Iqra; Iqbal, Talha; Ullah, Ihsan; Ennis, Cathy; Madden, Michael

Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency

Share and Export

Nosheen, Iqra, Iqbal, Talha, Ullah, Ihsan, Ennis, Cathy and Madden, Michael (2024) Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency. 26th Irish Machine Vision and Image Processing Conference (IMVIP 2024). pp. 1-4.

[thumbnail of nosheen-et-al-investigating-the-impact-of-encoder-architectures-and-batch-size-on-depth-estimation-through-semantic.pdf]

Preview

Text
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (75kB) | Preview

Abstract

Traditional methods for depth estimation rely on supervised learning with resource-intensive LiDAR data. Virtual synthetic datasets provide a cost-effective alternative, but bridging the domain gap between synthetic and real-world data remains a significant challenge. In existing work, this gap is addressed through domain adaptation techniques, aligning the feature distributions of synthetic (source) and real-world (target) domains. Our study explores the efficacy of different encoder architectures (ResNet variants with 35, 50, 101, 101-with-attention, and 152 convolution layers) and two batch sizes (2 and 4) for the depth estimation task. Our experiments show that ResNet101 without and with attention mechanisms provide the best performance across 2 and 4 batch sizes, respectively, compared to the other models. Conversely, the deeper architecture considered, ResNet152, shows the lowest performance, indicating that increasing the network depth does not necessarily lead to improved results for depth estimation tasks. This study's findings provide valuable insights for developing more effective depth estimation algorithms, and it suggests future directions in hyperparameter optimization and semantic consistency modeling.

Item Type:	Article
Keywords:	Depth Estimation; Semantic Consistency; Encoder Architectures; Batch sizes; Image translation;
Academic Unit:	Faculty of Science and Engineering > Computer Science Faculty of Science and Engineering > Research Institutes > Hamilton Institute
Item ID:	20319
Identification Number:	10.1049/icp.2024.3295
Depositing User:	IR Editor
Date Deposited:	12 Aug 2025 10:47
Journal or Publication Title:	26th Irish Machine Vision and Image Processing Conference (IMVIP 2024)
Publisher:	The Institution of Engineering and Technology (IET)
Refereed:	Yes
Related URLs:	Publisher
Use Licence:	This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

MURAL - Maynooth University Research Archive Library

Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency

Abstract

Downloads

Origin of downloads

Altmetric Badge

Repository Staff Only (login required)