Estimating Multiple Physical Parameters From Speech Data
Shareef Babu Kalluri, National Institute of Technology Karnataka Surathkal
Ashwin Kalyan Vijayakumar, National Institute of Technology Karnataka Surathkal
Deepu Vijayasenan, National Institute of Technology Karnataka Surathkal
Rita Singh, Language Technologies Institute, School of Computer Science, Carnegie Mellon University

Abstract:
In this work, we explore the prediction of multiple physical parameters from speech data. In addition to the conventional height and weight parameters, we aim to predict the shoulder size and waist size of speakers. A dataset with this information was collected from 207 volunteers. A bag-of-words representation based on the log magnitude spectrum is used as the feature set. Support vector regression predicts the physical parameters from the bag-of-words representation. The system achieves a root mean square error of 6.6 cm for height estimation, 2.6 cm for shoulder size, 7.1 cm for waist size, and 8.9 kg for weight estimation. The height estimation results are on par with state-of-the-art results.
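
The pipeline described in the abstract (log magnitude spectrum, bag-of-words features, support vector regression, root mean square error) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and not taken from the paper: the use of NumPy and scikit-learn, a k-means codebook as the way to form the "words", the frame length, hop size, codebook size, SVR settings, and the synthetic noise signals and made-up heights that stand in for the real recordings.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVR


def log_magnitude_frames(signal, frame_len=256, hop=128):
    """Split a 1-D signal into windowed frames and return log magnitude spectra."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Small constant avoids log(0) for silent bins.
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-8)


def bag_of_words(frames, codebook):
    """Normalized histogram of codeword assignments for one utterance."""
    labels = codebook.predict(frames)
    hist = np.bincount(labels, minlength=codebook.n_clusters).astype(float)
    return hist / hist.sum()


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))


rng = np.random.default_rng(0)

# Synthetic stand-in data (assumption): noise "utterances" and made-up heights in cm.
utterances = [rng.standard_normal(4000) for _ in range(20)]
heights_cm = rng.normal(170.0, 7.0, size=20)

# Frame-level log magnitude spectra for every utterance.
spectra = [log_magnitude_frames(u) for u in utterances]

# Learn the codebook (the "words") from the training utterances only.
codebook = KMeans(n_clusters=16, n_init=10, random_state=0)
codebook.fit(np.vstack(spectra[:15]))

# One fixed-length bag-of-words vector per utterance.
features = np.array([bag_of_words(s, codebook) for s in spectra])

# Support vector regression from bag-of-words features to height.
model = SVR(kernel="rbf", C=10.0, epsilon=0.5)
model.fit(features[:15], heights_cm[:15])

predictions = model.predict(features[15:])
print("held-out RMSE (cm):", rmse(heights_cm[15:], predictions))
```

Because the inputs are random noise, the printed error carries no meaning; the sketch only shows how the stages connect. Under the same assumptions, shoulder size, waist size, and weight would each be handled by fitting a separate regressor on the same feature vectors.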