IEEE Access (Jan 2021)

Cocktail Glass Network: Fast Depth Estimation Using Channel to Space Unrolling

  • Jung-Jae Yu,
  • Jong-Gook Ko,
  • Junmo Kim

DOI
https://doi.org/10.1109/ACCESS.2021.3105136
Journal volume & issue
Vol. 9
pp. 114680 – 114689

Abstract

Depth estimation from a single input image can be used in applications such as robotics and autonomous driving. Recently, depth-estimation networks with UNet encoder/decoder structures have been widely used. In these decoders, operations are repeated to gradually increase the image resolution while decreasing the number of channels. If upsampling at a high magnification can be performed in a single step, the amount of computation in the decoder can be dramatically reduced. To achieve this, we propose a new network structure, the cocktail glass network. In this network, the number of convolution layers in the decoder is reduced, and a novel fast upsampling method, channel-to-space unrolling, converts thick channel data (feature maps with many channels) into high-resolution data. The proposed method can be implemented using simple reshaping operations; therefore, it is suitable for reducing the complexity of depth-estimation networks. Experiments on the NYU V2 and KITTI datasets demonstrate that the proposed method halves the amount of computation in the decoder while maintaining the same level of accuracy, and that it can be used in both lightweight and large-model-capacity networks.
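To make the reshaping idea concrete, the following is a minimal NumPy sketch of a channel-to-space rearrangement. It assumes a depth-to-space style layout in which each group of r*r channels fills one r-by-r block of output pixels; the abstract does not specify the exact channel ordering the authors use, so the function name `channel_to_space`, the factor `r`, and the ordering are illustrative assumptions rather than the paper's implementation.

```python
import numpy as np


def channel_to_space(x: np.ndarray, r: int) -> np.ndarray:
    """Rearrange a (C*r*r, H, W) array into (C, H*r, W*r).

    Hypothetical helper: the channel ordering is an assumption,
    not taken from the paper.
    """
    c_in, h, w = x.shape
    if c_in % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    c_out = c_in // (r * r)
    # Split the channel axis into (output channel, row offset, column offset).
    x = x.reshape(c_out, r, r, h, w)
    # Move each offset next to its spatial axis: (c_out, h, r, w, r).
    x = x.transpose(0, 3, 1, 4, 2)
    # Merge (h, r) into h*r rows and (w, r) into w*r columns.
    return x.reshape(c_out, h * r, w * r)


# 16 channels at 2x3 resolution become 1 channel at 8x12 (r = 4).
features = np.arange(16 * 2 * 3, dtype=np.float32).reshape(16, 2, 3)
depth = channel_to_space(features, 4)
```

No values are computed or interpolated here: the array is only reinterpreted and its axes permuted, which is why a single large-factor upsampling step of this kind costs far less than a stack of upsample-and-convolve stages.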

Keywords