Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

© 2019 Association for Computing Machinery. Generative audio models based on neural networks have led to considerable improvements across fields including speech enhancement, source separation, and text-to-speech synthesis. These systems are typically trained in a supervised fashion using simple ele...

Bibliographic Details
Main Authors: Ananthabhotla, Ishwarya (Author), Ewert, Sebastian (Author), Paradiso, Joseph A (Author)
Other Authors: Massachusetts Institute of Technology. Media Laboratory (Contributor), Program in Media Arts and Sciences (Massachusetts Institute of Technology) (Contributor)
Format: Article
Language: English
Published: Association for Computing Machinery (ACM), 2021-12-15T14:32:25Z.