IMAGE LABELING FOR LIDAR INTENSITY IMAGE USING K-NN OF FEATURE  OBTAINED BY CONVOLUTIONAL NEURAL NETWORK

M. Umemura; K. Hotta; H. Nonaka; K. Oda

doi:10.5194/isprs-archives-XLI-B3-931-2016

The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (Jun 2016)

IMAGE LABELING FOR LIDAR INTENSITY IMAGE USING K-NN OF FEATURE OBTAINED BY CONVOLUTIONAL NEURAL NETWORK

M. Umemura,
K. Hotta,
H. Nonaka,
K. Oda

Affiliations

M. Umemura: Meijo University, 1-501 Shiogamaguchi, Tempaku-ku, Nagoya 468-8502, Japan
K. Hotta: Meijo University, 1-501 Shiogamaguchi, Tempaku-ku, Nagoya 468-8502, Japan
H. Nonaka: Asia Air Survey Co.,Ltd., 1-2-2 Manpukuji, Asao-ku, Kawasaki, Kanagawa, Japan
K. Oda: Asia Air Survey Co.,Ltd., 1-2-2 Manpukuji, Asao-ku, Kawasaki, Kanagawa, Japan

DOI: https://doi.org/10.5194/isprs-archives-XLI-B3-931-2016
Journal volume & issue: Vol. XLI-B3
pp. 931 – 935

Abstract

Read online

We propose an image labeling method for LIDAR intensity image obtained by Mobile Mapping System (MMS) using K-Nearest Neighbor (KNN) of feature obtained by Convolutional Neural Network (CNN). Image labeling assigns labels (e.g., road, cross-walk and road shoulder) to semantic regions in an image. Since CNN is effective for various image recognition tasks, we try to use the feature of CNN (Caffenet) pre-trained by ImageNet. We use 4,096-dimensional feature at fc7 layer in the Caffenet as the descriptor of a region because the feature at fc7 layer has effective information for object classification. We extract the feature by the Caffenet from regions cropped from images. Since the similarity between features reflects the similarity of contents of regions, we can select top K similar regions cropped from training samples with a test region. Since regions in training images have manually-annotated ground truth labels, we vote the labels attached to top K similar regions to the test region. The class label with the maximum vote is assigned to each pixel in the test image. In experiments, we use 36 LIDAR intensity images with ground truth labels. We divide 36 images into training (28 images) and test sets (8 images). We use class average accuracy and pixel-wise accuracy as evaluation measures. Our method was able to assign the same label as human beings in 97.8% of the pixels in test LIDAR intensity images.

Published in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences

ISSN: 1682-1750 (Print); 2194-9034 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Technology: Engineering (General). Civil engineering (General): Applied optics. Photonics
Website: http://www.isprs.org/publications/archives.aspx

About the journal