Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news

Abstract The task of speaker diarization is to answer the question "who spoke when?" In this paper, we present different clustering approaches which consist of Evolutionary Computation Algorithms (ECAs) such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO) algorithm, and Differ...

Full description

Bibliographic Details
Main Authors: Karim Dabbabi, Salah Hajji, Adnen Cherif
Format: Article
Language:English
Published: SpringerOpen 2017-09-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13636-017-0117-1
id doaj-5761d2d890c0427bbb12f23e96fe86c1
record_format Article
spelling doaj-5761d2d890c0427bbb12f23e96fe86c12020-11-25T01:35:48ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47222017-09-012017111510.1186/s13636-017-0117-1Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast newsKarim Dabbabi0Salah Hajji1Adnen Cherif2Research Unity of Analysis and Processing of Electrical and Energetic systems, Faculty of Sciences of Tunis, University of Tunis El ManarResearch Laboratory of Information and Image Processing, National School of Engineers of Tunis, University of Tunis El ManarResearch Unity of Analysis and Processing of Electrical and Energetic systems, Faculty of Sciences of Tunis, University of Tunis El ManarAbstract The task of speaker diarization is to answer the question "who spoke when?" In this paper, we present different clustering approaches which consist of Evolutionary Computation Algorithms (ECAs) such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO) algorithm, and Differential Evolution (DE) algorithm as well as Teaching-Learning-Based Optimization (TLBO) technique as a new optimization technique at the aim to optimize the number of clusters in the speaker clustering stage which remains a challenging problem. Clustering validity indexes, such as Within-Class Distance (WCD) index, Davies and Bouldin (DB) index, and Contemporary Document (CD) index, is also used in order to make a correction for each possible grouping of speakers' segments. The proposed algorithms are evaluated on News Broadcast database (NDTV), and their performance comparisons are made between each another as well as with some well-known clustering algorithms. Results show the superiority of the new AUTO-TLBO technique in terms of comparative results obtained on NDTV, RT-04F, and ESTER datasets of News Broadcast.http://link.springer.com/article/10.1186/s13636-017-0117-1Speaker diarizationPSO algorithmGA algorithmDE algorithmTLBO techniqueEA algorithms
collection DOAJ
language English
format Article
sources DOAJ
author Karim Dabbabi
Salah Hajji
Adnen Cherif
spellingShingle Karim Dabbabi
Salah Hajji
Adnen Cherif
Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
EURASIP Journal on Audio, Speech, and Music Processing
Speaker diarization
PSO algorithm
GA algorithm
DE algorithm
TLBO technique
EA algorithms
author_facet Karim Dabbabi
Salah Hajji
Adnen Cherif
author_sort Karim Dabbabi
title Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
title_short Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
title_full Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
title_fullStr Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
title_full_unstemmed Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
title_sort integration of evolutionary computation algorithms and new auto-tlbo technique in the speaker clustering stage for speaker diarization of broadcast news
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4722
publishDate 2017-09-01
description Abstract The task of speaker diarization is to answer the question "who spoke when?" In this paper, we present different clustering approaches which consist of Evolutionary Computation Algorithms (ECAs) such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO) algorithm, and Differential Evolution (DE) algorithm as well as Teaching-Learning-Based Optimization (TLBO) technique as a new optimization technique at the aim to optimize the number of clusters in the speaker clustering stage which remains a challenging problem. Clustering validity indexes, such as Within-Class Distance (WCD) index, Davies and Bouldin (DB) index, and Contemporary Document (CD) index, is also used in order to make a correction for each possible grouping of speakers' segments. The proposed algorithms are evaluated on News Broadcast database (NDTV), and their performance comparisons are made between each another as well as with some well-known clustering algorithms. Results show the superiority of the new AUTO-TLBO technique in terms of comparative results obtained on NDTV, RT-04F, and ESTER datasets of News Broadcast.
topic Speaker diarization
PSO algorithm
GA algorithm
DE algorithm
TLBO technique
EA algorithms
url http://link.springer.com/article/10.1186/s13636-017-0117-1
work_keys_str_mv AT karimdabbabi integrationofevolutionarycomputationalgorithmsandnewautotlbotechniqueinthespeakerclusteringstageforspeakerdiarizationofbroadcastnews
AT salahhajji integrationofevolutionarycomputationalgorithmsandnewautotlbotechniqueinthespeakerclusteringstageforspeakerdiarizationofbroadcastnews
AT adnencherif integrationofevolutionarycomputationalgorithmsandnewautotlbotechniqueinthespeakerclusteringstageforspeakerdiarizationofbroadcastnews
_version_ 1725066247353139200