Expressive modulation of neutral visual speech

The need for animated graphical models of the human face is commonplace in the movies, video games and television industries, appearing in everything from low budget advertisements and free mobile apps, to Hollywood blockbusters costing hundreds of millions of dollars. Generative statistical models...

Full description

Bibliographic Details
Main Author:	Shaw, Felix
Published:	University of East Anglia 2015
Subjects:	004
Online Access:	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.679153

id	ndltd-bl.uk-oai-ethos.bl.uk-679153
record_format	oai_dc
spelling	ndltd-bl.uk-oai-ethos.bl.uk-6791532016-08-04T03:58:50ZExpressive modulation of neutral visual speechShaw, Felix2015The need for animated graphical models of the human face is commonplace in the movies, video games and television industries, appearing in everything from low budget advertisements and free mobile apps, to Hollywood blockbusters costing hundreds of millions of dollars. Generative statistical models of animation attempt to address some of the drawbacks of industry standard practices such as labour intensity and creative inflexibility. This work describes one such method for transforming speech animation curves between different expressive styles. Beginning with the assumption that expressive speech animation is a mix of two components, a high-frequency speech component (the content) and a much lower-frequency expressive component (the style), we use Independent Component Analysis (ICA) to identify and manipulate these components independently of one another. Next we learn how the energy for different speaking styles is distributed in terms of the low-dimensional independent components model. Transforming the speaking style involves projecting new animation curves into the lowdimensional ICA space, redistributing the energy in the independent components, and finally reconstructing the animation curves by inverting the projection. We show that a single ICA model can be used for separating multiple expressive styles into their component parts. Subjective evaluations show that viewers can reliably identify the expressive style generated using our approach, and that they have difficulty in identifying transformed animated expressive speech from the equivalent ground-truth.004University of East Angliahttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.679153https://ueaeprints.uea.ac.uk/56812/Electronic Thesis or Dissertation
collection	NDLTD
sources	NDLTD
topic	004
spellingShingle	004 Shaw, Felix Expressive modulation of neutral visual speech
description	The need for animated graphical models of the human face is commonplace in the movies, video games and television industries, appearing in everything from low budget advertisements and free mobile apps, to Hollywood blockbusters costing hundreds of millions of dollars. Generative statistical models of animation attempt to address some of the drawbacks of industry standard practices such as labour intensity and creative inflexibility. This work describes one such method for transforming speech animation curves between different expressive styles. Beginning with the assumption that expressive speech animation is a mix of two components, a high-frequency speech component (the content) and a much lower-frequency expressive component (the style), we use Independent Component Analysis (ICA) to identify and manipulate these components independently of one another. Next we learn how the energy for different speaking styles is distributed in terms of the low-dimensional independent components model. Transforming the speaking style involves projecting new animation curves into the lowdimensional ICA space, redistributing the energy in the independent components, and finally reconstructing the animation curves by inverting the projection. We show that a single ICA model can be used for separating multiple expressive styles into their component parts. Subjective evaluations show that viewers can reliably identify the expressive style generated using our approach, and that they have difficulty in identifying transformed animated expressive speech from the equivalent ground-truth.
author	Shaw, Felix
author_facet	Shaw, Felix
author_sort	Shaw, Felix
title	Expressive modulation of neutral visual speech
title_short	Expressive modulation of neutral visual speech
title_full	Expressive modulation of neutral visual speech
title_fullStr	Expressive modulation of neutral visual speech
title_full_unstemmed	Expressive modulation of neutral visual speech
title_sort	expressive modulation of neutral visual speech
publisher	University of East Anglia
publishDate	2015
url	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.679153
work_keys_str_mv	AT shawfelix expressivemodulationofneutralvisualspeech
_version_	1718372495335096320

Expressive modulation of neutral visual speech

Similar Items