26 - 29 June 2023
Munich, Germany
Conference 12623 > Paper 12623-9
Paper 12623-9

Multi-sensor data fusion using deep learning for bulky waste image classification

On demand | Presented live 28 June 2023

Abstract

Deep learning techniques are commonly utilized to tackle various computer vision problems, including recognition, segmentation, and classification from RGB images. With the availability of a diverse range of sensors, industry-specific datasets are acquired to address specific challenges. These collected datasets have varied modalities, indicating that the images possess distinct channel numbers and pixel values that have different interpretations. Implementing deep learning methods to attain optimal outcomes on such multimodal data is a complicated procedure. To enhance the performance of classification tasks in this scenario, one feasible approach is to employ a data fusion technique. Data fusion aims to use all the available information from all sensors and integrate them to obtain an optimal outcome. This paper investigates early fusion, intermediate fusion, and late fusion in deep learning models for bulky waste image classification. For training and evaluation of the models, a multimodal dataset is used. The dataset consists of RGB, hyperspectral Near Infrared (NIR), Thermography, and Terahertz images of bulky waste. The results of this work show that multimodal sensor fusion can enhance classification accuracy compared to a single-sensor approach for the used dataset. Hereby, late fusion performed the best with an accuracy of 0.921 compared to intermediate and early fusion, on our test data.

Presenter

Manuel Bihler
Karlsruher Institut für Technologie (Germany)
Manuel Bihler studied Physics at the University of Stuttgart, focusing on solid-state physics and light-material interaction. During a one-year internship at Bosch, he developed an interest in image processing, computer vision, image classification, and automated optical inspection. Now he is pursuing his Ph.D. at the KIT/IIIT, doing research in the field of image classification and data fusion using Convolutional Neuronal Networks and multimodal/multispectral image data. He is part of the ASKIVIT Project, focusing on improving the automated sorting of wood from bulky waste using a multi-sensor approach.
Presenter/Author
Manuel Bihler
Karlsruher Institut für Technologie (Germany)
Author
Fraunhofer Institute of Optronics, System Technology and Image Exploitation (Germany)
Author
Karlsruher Institut für Technologie (Germany)
Author
Karlsruher Institut für Technologie (Germany)
Author
Jochen Aderhold
Fraunhofer Institute for Wood Research, Wilhelm-Klauditz-Institut (WKI) (Germany)
Author
Fraunhofer Institute for Industrial Mathematics (ITWM) (Germany)
Author
Helmholtz-Institut Freiberg für Ressourcentechnologie (Germany)
Author
Helmholtz-Institut Freiberg für Ressourcentechnologie (Germany)
Author
Fraunhofer Institute of Optronics (Germany)
Author
Karlsruher Institut für Technologie (Germany)