Ethnographically Oriented Repository of Assamese Telephonic Speech
Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental wo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2018-01-01
|
Series: | MATEC Web of Conferences |
Online Access: | https://doi.org/10.1051/matecconf/201821005019 |
id |
doaj-fd7d955541f44cad9c0ed32596c5946b |
---|---|
record_format |
Article |
spelling |
doaj-fd7d955541f44cad9c0ed32596c5946b2021-02-02T00:04:27ZengEDP SciencesMATEC Web of Conferences2261-236X2018-01-012100501910.1051/matecconf/201821005019matecconf_cscc2018_05019Ethnographically Oriented Repository of Assamese Telephonic SpeechSharma MridusmitaSarma Kandarpa KumarMastorakis Nikos E.Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental works has been faced. We have considered Assamese language for our case study and since it is less computationally aware, there is a need to develop the speech corpus having dialect and mood variations. Also, the development of corpus is an ongoing process and the initial task is reported in this paper.https://doi.org/10.1051/matecconf/201821005019 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sharma Mridusmita Sarma Kandarpa Kumar Mastorakis Nikos E. |
spellingShingle |
Sharma Mridusmita Sarma Kandarpa Kumar Mastorakis Nikos E. Ethnographically Oriented Repository of Assamese Telephonic Speech MATEC Web of Conferences |
author_facet |
Sharma Mridusmita Sarma Kandarpa Kumar Mastorakis Nikos E. |
author_sort |
Sharma Mridusmita |
title |
Ethnographically Oriented Repository of Assamese Telephonic Speech |
title_short |
Ethnographically Oriented Repository of Assamese Telephonic Speech |
title_full |
Ethnographically Oriented Repository of Assamese Telephonic Speech |
title_fullStr |
Ethnographically Oriented Repository of Assamese Telephonic Speech |
title_full_unstemmed |
Ethnographically Oriented Repository of Assamese Telephonic Speech |
title_sort |
ethnographically oriented repository of assamese telephonic speech |
publisher |
EDP Sciences |
series |
MATEC Web of Conferences |
issn |
2261-236X |
publishDate |
2018-01-01 |
description |
Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental works has been faced. We have considered Assamese language for our case study and since it is less computationally aware, there is a need to develop the speech corpus having dialect and mood variations. Also, the development of corpus is an ongoing process and the initial task is reported in this paper. |
url |
https://doi.org/10.1051/matecconf/201821005019 |
work_keys_str_mv |
AT sharmamridusmita ethnographicallyorientedrepositoryofassamesetelephonicspeech AT sarmakandarpakumar ethnographicallyorientedrepositoryofassamesetelephonicspeech AT mastorakisnikose ethnographicallyorientedrepositoryofassamesetelephonicspeech |
_version_ |
1724314767416360960 |