Fifth European Conference on Speech Communication and Technology, Rhodes, Greece
This paper describes three experiments in using frame level observation probabilities as the basis for word confidence annotation in an HMM speech recognition system. One experiment is at the word level, one uses word classes, and the other uses phone classes. In each experiment we categorize hypotheses into correct and incorrect categories by aligning a best recognition hypothesis with the known transcript. The confidence of error prediction for each class is a measure of the resolvability between the correct and incorrect histograms.
Bergen, Z. and Ward, W., "A Senone Based Confidence Measure for Speech Recognition" (1997). Space Dynamics Lab Publications. Paper 13.