Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

© 2019 Association for Computing Machinery. Generative audio models based on neural networks have led to considerable improvements across fields including speech enhancement, source separation, and text-to-speech synthesis. These systems are typically trained in a supervised fashion using simple ele...

Bibliographic Details
Main Authors: Ananthabhotla, Ishwarya (Author), Ewert, Sebastian (Author), Paradiso, Joseph A (Author)
Format: Article
Language: English
Published: Association for Computing Machinery (ACM), 2021-11-02.