Ethnographically Oriented Repository of Assamese Telephonic Speech

Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental wo...

Full description

Bibliographic Details
Main Authors: Sharma Mridusmita, Sarma Kandarpa Kumar, Mastorakis Nikos E.
Format: Article
Language:English
Published: EDP Sciences 2018-01-01
Series:MATEC Web of Conferences
Online Access:https://doi.org/10.1051/matecconf/201821005019
id doaj-fd7d955541f44cad9c0ed32596c5946b
record_format Article
spelling doaj-fd7d955541f44cad9c0ed32596c5946b2021-02-02T00:04:27ZengEDP SciencesMATEC Web of Conferences2261-236X2018-01-012100501910.1051/matecconf/201821005019matecconf_cscc2018_05019Ethnographically Oriented Repository of Assamese Telephonic SpeechSharma MridusmitaSarma Kandarpa KumarMastorakis Nikos E.Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental works has been faced. We have considered Assamese language for our case study and since it is less computationally aware, there is a need to develop the speech corpus having dialect and mood variations. Also, the development of corpus is an ongoing process and the initial task is reported in this paper.https://doi.org/10.1051/matecconf/201821005019
collection DOAJ
language English
format Article
sources DOAJ
author Sharma Mridusmita
Sarma Kandarpa Kumar
Mastorakis Nikos E.
spellingShingle Sharma Mridusmita
Sarma Kandarpa Kumar
Mastorakis Nikos E.
Ethnographically Oriented Repository of Assamese Telephonic Speech
MATEC Web of Conferences
author_facet Sharma Mridusmita
Sarma Kandarpa Kumar
Mastorakis Nikos E.
author_sort Sharma Mridusmita
title Ethnographically Oriented Repository of Assamese Telephonic Speech
title_short Ethnographically Oriented Repository of Assamese Telephonic Speech
title_full Ethnographically Oriented Repository of Assamese Telephonic Speech
title_fullStr Ethnographically Oriented Repository of Assamese Telephonic Speech
title_full_unstemmed Ethnographically Oriented Repository of Assamese Telephonic Speech
title_sort ethnographically oriented repository of assamese telephonic speech
publisher EDP Sciences
series MATEC Web of Conferences
issn 2261-236X
publishDate 2018-01-01
description Recording of the speech samples is the first step in speech recognition and related tasks. For English, there are a bunch of readily available data sets. But standard data sets with regional dialect and mood variations are not available and the need to create our own data set for our experimental works has been faced. We have considered Assamese language for our case study and since it is less computationally aware, there is a need to develop the speech corpus having dialect and mood variations. Also, the development of corpus is an ongoing process and the initial task is reported in this paper.
url https://doi.org/10.1051/matecconf/201821005019
work_keys_str_mv AT sharmamridusmita ethnographicallyorientedrepositoryofassamesetelephonicspeech
AT sarmakandarpakumar ethnographicallyorientedrepositoryofassamesetelephonicspeech
AT mastorakisnikose ethnographicallyorientedrepositoryofassamesetelephonicspeech
_version_ 1724314767416360960