A Comprehensive Video Dataset for Multi-Modal Recognition Systems

This paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This d...

Full description

Bibliographic Details
Main Authors: Anand Handa, Rashi Agarwal, Narendra Kohli
Format: Article
Language:English
Published: Ubiquity Press 2019-11-01
Series:Data Science Journal
Subjects:
Online Access:https://datascience.codata.org/articles/909
id doaj-8ef5302b315c465e9c61bd4e4dfa948f
record_format Article
spelling doaj-8ef5302b315c465e9c61bd4e4dfa948f2020-11-25T01:32:36ZengUbiquity PressData Science Journal1683-14702019-11-0118110.5334/dsj-2019-055749A Comprehensive Video Dataset for Multi-Modal Recognition SystemsAnand Handa0Rashi Agarwal1Narendra Kohli2Dr. APJ Abdul Kalam Technical University, KanpurDepartment of IT, University Institute of Engineering and Technology, KanpurDepartment of CSE, Harcourt Butler Technical University, KanpurThis paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This dataset can be used as a unique resource for researchers and analysts for training deep neural networks to build highly efficient and accurate recognition models in various domains of computer vision such as face recognition model, expression recognition model, speech recognition model, text recognition, etc. In this paper, we also train models related to face recognition and speech recognition on our dataset and also compare the results with the publically available datasets to show the effectiveness of our dataset. The experimental results show that our comprehensive dataset is more accurate than other dataset on which the models are tested.https://datascience.codata.org/articles/909machine leaningdeep learningvideo datasetsconvolutional neural network
collection DOAJ
language English
format Article
sources DOAJ
author Anand Handa
Rashi Agarwal
Narendra Kohli
spellingShingle Anand Handa
Rashi Agarwal
Narendra Kohli
A Comprehensive Video Dataset for Multi-Modal Recognition Systems
Data Science Journal
machine leaning
deep learning
video datasets
convolutional neural network
author_facet Anand Handa
Rashi Agarwal
Narendra Kohli
author_sort Anand Handa
title A Comprehensive Video Dataset for Multi-Modal Recognition Systems
title_short A Comprehensive Video Dataset for Multi-Modal Recognition Systems
title_full A Comprehensive Video Dataset for Multi-Modal Recognition Systems
title_fullStr A Comprehensive Video Dataset for Multi-Modal Recognition Systems
title_full_unstemmed A Comprehensive Video Dataset for Multi-Modal Recognition Systems
title_sort comprehensive video dataset for multi-modal recognition systems
publisher Ubiquity Press
series Data Science Journal
issn 1683-1470
publishDate 2019-11-01
description This paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This dataset can be used as a unique resource for researchers and analysts for training deep neural networks to build highly efficient and accurate recognition models in various domains of computer vision such as face recognition model, expression recognition model, speech recognition model, text recognition, etc. In this paper, we also train models related to face recognition and speech recognition on our dataset and also compare the results with the publically available datasets to show the effectiveness of our dataset. The experimental results show that our comprehensive dataset is more accurate than other dataset on which the models are tested.
topic machine leaning
deep learning
video datasets
convolutional neural network
url https://datascience.codata.org/articles/909
work_keys_str_mv AT anandhanda acomprehensivevideodatasetformultimodalrecognitionsystems
AT rashiagarwal acomprehensivevideodatasetformultimodalrecognitionsystems
AT narendrakohli acomprehensivevideodatasetformultimodalrecognitionsystems
AT anandhanda comprehensivevideodatasetformultimodalrecognitionsystems
AT rashiagarwal comprehensivevideodatasetformultimodalrecognitionsystems
AT narendrakohli comprehensivevideodatasetformultimodalrecognitionsystems
_version_ 1725081031622524928