Comparing Prosodic Frameworks: Investigating the Acoustic-Symbolic Relationship in ToBI and RaP

Raul Fernandez; Andrew Rosenberg

doi:10.1109/SLT.2018.8639539

SLT 2018

Conference paper

11 Feb 2019

Comparing Prosodic Frameworks: Investigating the Acoustic-Symbolic Relationship in ToBI and RaP

View publication

Abstract

ToBI is the dominant tool for symbolically describing prosodic content in American English speech material. This is due to its descriptive power and its theoretical grounding, but also to the amount of available annotated data. Recently, a modest amount of material annotated with the Rhythm and Pitch (RaP) framework was released publicly. In this paper, we investigate the acoustic-symbolic relationship under these two systems. We present experiments looking at this relationship in both directions. From acoustic to symbolic, we compare the automatic prediction of prosodic prominence as defined under the two systems. From symbolic to acoustic, we examine the utility of these annotation standards to correctly prescribe the acoustics of a given utterance from their symbolic sequences. We find RaP to be promising, showing a somewhat stronger acoustic-symbolic relationship than ToBI given a comparable amount of data for some aspects of these tasks. While with more annotated data ToBI results are stronger, it remains to be shown whether RaP performance can scale up.

Conference paper