Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection

Object detection is a fundamental computer vision task for many real-world applications. In the maritime environment, this task is challenging due to varying light, view distances, weather conditions, and sea waves. In addition, light reflection, camera motion and illumination changes may cause to f...

Full description

Bibliographic Details
Main Authors: Fahimeh Farahnakian, Jukka Heikkonen
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/16/2509
id doaj-1045511b6c7c43b18b3a684ea4b82444
record_format Article
spelling doaj-1045511b6c7c43b18b3a684ea4b824442020-11-25T02:59:24ZengMDPI AGRemote Sensing2072-42922020-08-01122509250910.3390/rs12162509Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel DetectionFahimeh Farahnakian0Jukka Heikkonen1Department of Future Technologies, University of Turku, 20500 Turku, FinlandDepartment of Future Technologies, University of Turku, 20500 Turku, FinlandObject detection is a fundamental computer vision task for many real-world applications. In the maritime environment, this task is challenging due to varying light, view distances, weather conditions, and sea waves. In addition, light reflection, camera motion and illumination changes may cause to false detections. To address this challenge, we present three fusion architectures to fuse two imaging modalities: visible and infrared. These architectures can provide complementary information from two modalities in different levels: pixel-level, feature-level, and decision-level. They employed deep learning for performing fusion and detection. We investigate the performance of the proposed architectures conducting a real marine image dataset, which is captured by color and infrared cameras on-board a vessel in the Finnish archipelago. The cameras are employed for developing autonomous ships, and collect data in a range of operation and climatic conditions. Experiments show that feature-level fusion architecture outperforms the state-of-the-art other fusion level architectures.https://www.mdpi.com/2072-4292/12/16/2509multi-sensor fusionobject detectiondeep learningconvolutional neural networksautonomous vehiclesmarine environment
collection DOAJ
language English
format Article
sources DOAJ
author Fahimeh Farahnakian
Jukka Heikkonen
spellingShingle Fahimeh Farahnakian
Jukka Heikkonen
Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
Remote Sensing
multi-sensor fusion
object detection
deep learning
convolutional neural networks
autonomous vehicles
marine environment
author_facet Fahimeh Farahnakian
Jukka Heikkonen
author_sort Fahimeh Farahnakian
title Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
title_short Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
title_full Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
title_fullStr Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
title_full_unstemmed Deep Learning Based Multi-Modal Fusion Architectures for Maritime Vessel Detection
title_sort deep learning based multi-modal fusion architectures for maritime vessel detection
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-08-01
description Object detection is a fundamental computer vision task for many real-world applications. In the maritime environment, this task is challenging due to varying light, view distances, weather conditions, and sea waves. In addition, light reflection, camera motion and illumination changes may cause to false detections. To address this challenge, we present three fusion architectures to fuse two imaging modalities: visible and infrared. These architectures can provide complementary information from two modalities in different levels: pixel-level, feature-level, and decision-level. They employed deep learning for performing fusion and detection. We investigate the performance of the proposed architectures conducting a real marine image dataset, which is captured by color and infrared cameras on-board a vessel in the Finnish archipelago. The cameras are employed for developing autonomous ships, and collect data in a range of operation and climatic conditions. Experiments show that feature-level fusion architecture outperforms the state-of-the-art other fusion level architectures.
topic multi-sensor fusion
object detection
deep learning
convolutional neural networks
autonomous vehicles
marine environment
url https://www.mdpi.com/2072-4292/12/16/2509
work_keys_str_mv AT fahimehfarahnakian deeplearningbasedmultimodalfusionarchitecturesformaritimevesseldetection
AT jukkaheikkonen deeplearningbasedmultimodalfusionarchitecturesformaritimevesseldetection
_version_ 1724702631269498880