A Comprehensive Video Dataset for Multi-Modal Recognition Systems
This paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This d...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2019-11-01
|
Series: | Data Science Journal |
Subjects: | |
Online Access: | https://datascience.codata.org/articles/909 |
id |
doaj-8ef5302b315c465e9c61bd4e4dfa948f |
---|---|
record_format |
Article |
spelling |
doaj-8ef5302b315c465e9c61bd4e4dfa948f2020-11-25T01:32:36ZengUbiquity PressData Science Journal1683-14702019-11-0118110.5334/dsj-2019-055749A Comprehensive Video Dataset for Multi-Modal Recognition SystemsAnand Handa0Rashi Agarwal1Narendra Kohli2Dr. APJ Abdul Kalam Technical University, KanpurDepartment of IT, University Institute of Engineering and Technology, KanpurDepartment of CSE, Harcourt Butler Technical University, KanpurThis paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This dataset can be used as a unique resource for researchers and analysts for training deep neural networks to build highly efficient and accurate recognition models in various domains of computer vision such as face recognition model, expression recognition model, speech recognition model, text recognition, etc. In this paper, we also train models related to face recognition and speech recognition on our dataset and also compare the results with the publically available datasets to show the effectiveness of our dataset. The experimental results show that our comprehensive dataset is more accurate than other dataset on which the models are tested.https://datascience.codata.org/articles/909machine leaningdeep learningvideo datasetsconvolutional neural network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Anand Handa Rashi Agarwal Narendra Kohli |
spellingShingle |
Anand Handa Rashi Agarwal Narendra Kohli A Comprehensive Video Dataset for Multi-Modal Recognition Systems Data Science Journal machine leaning deep learning video datasets convolutional neural network |
author_facet |
Anand Handa Rashi Agarwal Narendra Kohli |
author_sort |
Anand Handa |
title |
A Comprehensive Video Dataset for Multi-Modal Recognition Systems |
title_short |
A Comprehensive Video Dataset for Multi-Modal Recognition Systems |
title_full |
A Comprehensive Video Dataset for Multi-Modal Recognition Systems |
title_fullStr |
A Comprehensive Video Dataset for Multi-Modal Recognition Systems |
title_full_unstemmed |
A Comprehensive Video Dataset for Multi-Modal Recognition Systems |
title_sort |
comprehensive video dataset for multi-modal recognition systems |
publisher |
Ubiquity Press |
series |
Data Science Journal |
issn |
1683-1470 |
publishDate |
2019-11-01 |
description |
This paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This dataset can be used as a unique resource for researchers and analysts for training deep neural networks to build highly efficient and accurate recognition models in various domains of computer vision such as face recognition model, expression recognition model, speech recognition model, text recognition, etc. In this paper, we also train models related to face recognition and speech recognition on our dataset and also compare the results with the publically available datasets to show the effectiveness of our dataset. The experimental results show that our comprehensive dataset is more accurate than other dataset on which the models are tested. |
topic |
machine leaning deep learning video datasets convolutional neural network |
url |
https://datascience.codata.org/articles/909 |
work_keys_str_mv |
AT anandhanda acomprehensivevideodatasetformultimodalrecognitionsystems AT rashiagarwal acomprehensivevideodatasetformultimodalrecognitionsystems AT narendrakohli acomprehensivevideodatasetformultimodalrecognitionsystems AT anandhanda comprehensivevideodatasetformultimodalrecognitionsystems AT rashiagarwal comprehensivevideodatasetformultimodalrecognitionsystems AT narendrakohli comprehensivevideodatasetformultimodalrecognitionsystems |
_version_ |
1725081031622524928 |