MURAL - Maynooth University Research Archive Library



    Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency


    Nosheen, Iqra, Iqbal, Talha, Ullah, Ihsan, Ennis, Cathy and Madden, Michael (2024) Investigating the Impact of Encoder Architectures and Batch Size on Depth Estimation through Semantic Consistency. 26th Irish Machine Vision and Image Processing Conference (IMVIP 2024). pp. 1-4.

    [thumbnail of nosheen-et-al-investigating-the-impact-of-encoder-architectures-and-batch-size-on-depth-estimation-through-semantic.pdf]
    Preview
    Text
    nosheen-et-al-investigating-the-impact-of-encoder-architectures-and-batch-size-on-depth-estimation-through-semantic.pdf
    Available under License Creative Commons Attribution Non-commercial Share Alike.

    Download (75kB) | Preview

    Abstract

    Traditional methods for depth estimation rely on supervised learning with resource-intensive LiDAR data. Virtual synthetic datasets provide a cost-effective alternative, but bridging the domain gap between synthetic and real-world data remains a significant challenge. In existing work, this gap is addressed through domain adaptation techniques, aligning the feature distributions of synthetic (source) and real-world (target) domains. Our study explores the efficacy of different encoder architectures (ResNet variants with 35, 50, 101, 101-with-attention, and 152 convolution layers) and two batch sizes (2 and 4) for the depth estimation task. Our experiments show that ResNet101 without and with attention mechanisms provide the best performance across 2 and 4 batch sizes, respectively, compared to the other models. Conversely, the deeper architecture considered, ResNet152, shows the lowest performance, indicating that increasing the network depth does not necessarily lead to improved results for depth estimation tasks. This study's findings provide valuable insights for developing more effective depth estimation algorithms, and it suggests future directions in hyperparameter optimization and semantic consistency modeling.
    Item Type: Article
    Keywords: Depth Estimation; Semantic Consistency; Encoder Architectures; Batch sizes; Image translation;
    Academic Unit: Faculty of Science and Engineering > Computer Science
    Faculty of Science and Engineering > Research Institutes > Hamilton Institute
    Item ID: 20319
    Identification Number: 10.1049/icp.2024.3295
    Depositing User: IR Editor
    Date Deposited: 12 Aug 2025 10:47
    Journal or Publication Title: 26th Irish Machine Vision and Image Processing Conference (IMVIP 2024)
    Publisher: The Institution of Engineering and Technology (IET)
    Refereed: Yes
    Related URLs:
    URI: https://mural.maynoothuniversity.ie/id/eprint/20319
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Repository Staff Only (login required)

    Item control page
    Item control page

    Downloads

    Downloads per month over past year

    Origin of downloads