Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition

Human action recognition is one of the most pressing questions in societal emergencies of any kind. Technology is helping to solve such problems at the cost of stealing human privacy. Several approaches have considered the relevance of privacy in the pervasive process of observing people. New algori...

Full description

Bibliographic Details
Main Authors:	Paolo Russo, Salvatore Ticca, Edoardo Alati, Fiora Pirri
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Action recognition activity recognition deep learning computer vision multi-modal low resolution
Online Access:	https://ieeexplore.ieee.org/document/9319249/

id	doaj-82de210728d64ba7b1e2bbd753109e6d
record_format	Article
spelling	doaj-82de210728d64ba7b1e2bbd753109e6d2021-04-05T17:37:24ZengIEEEIEEE Access2169-35362021-01-019120191202610.1109/ACCESS.2021.30505149319249Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action RecognitionPaolo Russo0https://orcid.org/0000-0002-1886-3491Salvatore Ticca1Edoardo Alati2Fiora Pirri3https://orcid.org/0000-0001-8665-9807Dipartimento di Ingegneria informatica, automatica e gestionale Antonio Ruberti, University of Rome La Sapienza, Rome, ItalyDipartimento di Ingegneria informatica, automatica e gestionale Antonio Ruberti, University of Rome La Sapienza, Rome, ItalyDipartimento di Ingegneria informatica, automatica e gestionale Antonio Ruberti, University of Rome La Sapienza, Rome, ItalyDipartimento di Ingegneria informatica, automatica e gestionale Antonio Ruberti, University of Rome La Sapienza, Rome, ItalyHuman action recognition is one of the most pressing questions in societal emergencies of any kind. Technology is helping to solve such problems at the cost of stealing human privacy. Several approaches have considered the relevance of privacy in the pervasive process of observing people. New algorithms have been proposed to deal with low-resolution images hiding people identity. However, many of these methods do not consider that social security asks for real-time solutions: active cameras require flexible distributed systems in sensible areas as airports, hospitals, stations, squares and roads. To conjugate both human privacy and real-time supervision, we propose a novel deep architecture, the Multi Streams Network. This model works in real-time and performs action recognition on extremely low-resolution videos, exploiting three sources of information: RGB images, optical flow and slack mask data. Experiments on two datasets show that our architecture improves the recognition accuracy compared to the two-streams approach and ensure real-time execution on Edge TPU (Tensor Processing Unit).https://ieeexplore.ieee.org/document/9319249/Action recognitionactivity recognitiondeep learningcomputer visionmulti-modallow resolution
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Paolo Russo Salvatore Ticca Edoardo Alati Fiora Pirri
spellingShingle	Paolo Russo Salvatore Ticca Edoardo Alati Fiora Pirri Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition IEEE Access Action recognition activity recognition deep learning computer vision multi-modal low resolution
author_facet	Paolo Russo Salvatore Ticca Edoardo Alati Fiora Pirri
author_sort	Paolo Russo
title	Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition
title_short	Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition
title_full	Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition
title_fullStr	Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition
title_full_unstemmed	Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition
title_sort	learning to see through a few pixels: multi streams network for extreme low-resolution action recognition
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	Human action recognition is one of the most pressing questions in societal emergencies of any kind. Technology is helping to solve such problems at the cost of stealing human privacy. Several approaches have considered the relevance of privacy in the pervasive process of observing people. New algorithms have been proposed to deal with low-resolution images hiding people identity. However, many of these methods do not consider that social security asks for real-time solutions: active cameras require flexible distributed systems in sensible areas as airports, hospitals, stations, squares and roads. To conjugate both human privacy and real-time supervision, we propose a novel deep architecture, the Multi Streams Network. This model works in real-time and performs action recognition on extremely low-resolution videos, exploiting three sources of information: RGB images, optical flow and slack mask data. Experiments on two datasets show that our architecture improves the recognition accuracy compared to the two-streams approach and ensure real-time execution on Edge TPU (Tensor Processing Unit).
topic	Action recognition activity recognition deep learning computer vision multi-modal low resolution
url	https://ieeexplore.ieee.org/document/9319249/
work_keys_str_mv	AT paolorusso learningtoseethroughafewpixelsmultistreamsnetworkforextremelowresolutionactionrecognition AT salvatoreticca learningtoseethroughafewpixelsmultistreamsnetworkforextremelowresolutionactionrecognition AT edoardoalati learningtoseethroughafewpixelsmultistreamsnetworkforextremelowresolutionactionrecognition AT fiorapirri learningtoseethroughafewpixelsmultistreamsnetworkforextremelowresolutionactionrecognition
_version_	1721539182258552832

Learning to See Through a Few Pixels: Multi Streams Network for Extreme Low-Resolution Action Recognition

Similar Items