A Preliminary Study on Speaker Diarization for Automatic Transcription of Broadcast Radio Speech

Master's === National Taipei University of Technology === Department of Electronic Engineering === 106 === We use a time-delay neural network (TDNN) for speaker diarization. Its average diarization error rate (DER) is 27.74%, lower than the 31.08% obtained with a GMM. We use the trained automatic speaker diarization system to label the unmarked speakers in the NER-210 corpus, retrain the ASR by ma...

Bibliographic Details
Main Authors:	Wu-Hua Hsu, 許吳華
Other Authors:	廖元甫
Format:	Others
Language:	zh-TW
Published:	2018
Online Access:	http://ndltd.ncl.edu.tw/handle/a3z9vr

Similar Items

Studies on Acoustic Features for Automatic Speech Recognition and Speaker Diarization in Real Environments
by: Ishizuka, Kentaro
Published: (2010)

Detection and handling of overlapping speech for speaker diarization
by: Zelenák, Martin
Published: (2012)

Speech Enhancement for Multimodal Speaker Diarization System
by: Rehan Ahmad, et al.
Published: (2020-01-01)

Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
by: Héctor Delgado, et al.
Published: (2015-06-01)

Speaker Diarization Based on Speech Signal Approximation by Step Function
by: Rustam Latypov, et al.
Published: (2021-01-01)

ROBUST SPEAKER DIARIZATION FOR MEETINGS
by: Anguera Miró, Xavier
Published: (2006)

Unsupervised methods for speaker diarization
by: Shum, Stephen (Stephen Hin-Chung)
Published: (2011)

COMPARATIVE STUDY OF TECHNIQUES TO SPEAKER DIARIZATION
by: MARCELO DE CAMPOS NIERO
Published: (2013)

Experiments in speaker diarization using speaker vectors
by: Cui, Ming
Published: (2021)

Fast cross-session speaker diarization
by: Delgado Flores, Héctor
Published: (2015)

Efficient speaker diarization and low-latency speaker spotting
by: Patino Villar, José María
Published: (2019)

[en] COMPARATIVE STUDY OF TECHNIQUES TO SPEAKER DIARIZATION
by: MARCELO DE CAMPOS NIERO
Published: (2014)

Integration of evolutionary computation algorithms and new AUTO-TLBO technique in the speaker clustering stage for speaker diarization of broadcast news
by: Karim Dabbabi, et al.
Published: (2017-09-01)

Unsupervised adaptation of PLDA models for broadcast diarization
by: Ignacio Viñals, et al.
Published: (2019-12-01)

Latent class model with application to speaker diarization
by: Liang He, et al.
Published: (2019-07-01)

Comparison of Diarization Tools for Building Speaker Database
by: Eva Kiktova, et al.
Published: (2015-01-01)

Speaker Diarization through Waveform and Neural Net
by: Rustam Latypov, et al.
Published: (2021-05-01)

Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream
by: J. Nouza, et al.
Published: (2006-09-01)

Steps towards end-to-end neural speaker diarization
by: Yin, Ruiqing
Published: (2019)

Robust speaker diarization for single channel recorded meetings
by: Fu, Rong
Published: (2009)

A sticky HDP-HMM with application to speaker diarization
by: Fox, Emily Beth, et al.
Published: (2013)

Speaker diarization system using HXLPS and deep neural network
by: V. Subba Ramaiah, et al.
Published: (2018-03-01)

Analysis of transition cost and model parameters in speaker diarization for meetings
by: Beatriz Martínez-González, et al.
Published: (2021-02-01)

Speaker Verification and Speaker Diarization based on GMM-HMM Forced Alignment and Recognition
by: Cheng-Jo Chang, et al.
Published: (2018)

Spatial features of reverberant speech : estimation and application to recognition and diarization
by: Peso, Pablo
Published: (2016)

Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research
by: Lukas Fürer, et al.
Published: (2020-07-01)

The use of long-term features for GMM- and i-vector-based speaker diarization systems
by: Abraham Woubie Zewoudie, et al.
Published: (2018-09-01)

Diarization, Localization and Indexing of Meeting Archives
by: Vajaria, Himanshu
Published: (2008)

Classifier Level Fusion of Accelerometer and sEMG Signals for Automatic Fitness Activity Diarization
by: Giorgio Biagetti, et al.
Published: (2018-08-01)

Speaker normalisation for automatic speech recognition
by: Deterding, David Henry
Published: (1990)

Chinese Input Method Based on First Mandarin Phonetic Alphabet for Mobile Devices and an Approach in Speaker Diarization with Divide-and-Conquer
by: Chun-han Tseng, et al.
Published: (2008)

Speech Recognition Quality Estimation-based Semi-Supervised Training for Broadcast Radio Program Transcription
by: Sing-Yue Wang, et al.
Published: (2017)

The Radio Positioning Simulation System Using AM Broadcast Stations
by: Wu Der-Hua, et al.
Published: (1993)

Speaker model adaptation in automatic speech recognition
by: Chan, Carlos Chun Ming
Published: (1993)

Characterization of speakers for improved automatic speech recognition
by: Lincoln, Michael
Published: (1999)

Sequence Models for Speech and Music Detection in Radio Broadcast
by: Lemaire, Quentin
Published: (2019)

A study of the automatic speech recognition process and speaker adaptation
by: Stokes-Rees, Ian
Published: (2006)

A Study of the Automatic Speech Recognition Process and Speaker Adaptation
by: Stokes-Rees, Ian James
Published: (2006)
