Semi-Automated Alignment and Vowel Extraction

This system runs a FAVE-style semi-automated analysis that uses both your audio and a manual transcription, and it accepts three different transcription types.

The pronunciations of the words in your transcriptions are taken from the CMU Pronouncing Dictionary, which contains Standard American English pronunciations. For words that are not in the dictionary, such as proper names or slang terms, we use a grapheme-to-phoneme model (Sequitur, trained on the same dictionary) to predict a pronunciation. This model is quite accurate, but some of its predicted pronunciations may be wrong.
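The lookup-then-predict behavior described above can be sketched roughly as follows. This is only an illustration, not the system's actual code: `TOY_DICT`, `lookup_pronunciations`, and `placeholder_g2p` are hypothetical names, the dictionary is a two-word stand-in for the CMU dictionary, and the placeholder function merely marks where a trained Sequitur model would be called.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical two-entry stand-in for the CMU dictionary (word -> ARPABET phones).
TOY_DICT: Dict[str, List[str]] = {
    "cat": ["K", "AE1", "T"],
    "dog": ["D", "AO1", "G"],
}


def placeholder_g2p(word: str) -> List[str]:
    """Hypothetical stand-in for a grapheme-to-phoneme model.

    It returns one uppercase symbol per letter, which is NOT a real
    pronunciation; a trained model would be called here instead.
    """
    return [ch.upper() for ch in word]


def lookup_pronunciations(
    words: List[str],
    dictionary: Dict[str, List[str]],
    g2p: Callable[[str], List[str]],
) -> Tuple[Dict[str, List[str]], List[str]]:
    """Use the dictionary where possible and fall back to g2p otherwise.

    Returns the pronunciations plus the list of words whose pronunciation
    was predicted, so that those words can be checked by hand.
    """
    pronunciations: Dict[str, List[str]] = {}
    predicted: List[str] = []
    for word in words:
        key = word.lower()
        if key in dictionary:
            pronunciations[key] = dictionary[key]
        else:
            pronunciations[key] = g2p(key)
            predicted.append(key)
    return pronunciations, predicted
```

Keeping a separate list of predicted words is a design choice of this sketch: those are the words where an alignment error is most likely, so they are worth reviewing first.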

The returned results are in formats convenient for analysis: a basic vowel plot, spreadsheets with raw and Lobanov-normalized data, a spreadsheet formatted for the online NORM system (which has other plotting and normalization options), the transcription, and the aligned TextGrid file.
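For readers unfamiliar with Lobanov normalization, it rescales each formant to z-scores within a single speaker: each value has the speaker's mean for that formant subtracted and is then divided by the speaker's standard deviation for that formant. The sketch below illustrates the idea only. The function names are hypothetical, and the use of the sample standard deviation is an assumption; the system's own spreadsheets may be computed differently.

```python
from statistics import mean, stdev
from typing import List, Tuple


def zscore(values: List[float]) -> List[float]:
    """Convert one speaker's values for one formant to z-scores.

    Assumption: the sample standard deviation is used (statistics.stdev).
    Needs at least two values that are not all identical.
    """
    mu = mean(values)
    sigma = stdev(values)
    return [(v - mu) / sigma for v in values]


def lobanov_normalize(
    f1: List[float], f2: List[float]
) -> List[Tuple[float, float]]:
    """Normalize one speaker's F1 and F2 measurements (in Hz) separately.

    f1[i] and f2[i] are the formants of the same vowel token.
    """
    return list(zip(zscore(f1), zscore(f2)))
```

For example, F1 values of 300, 500, and 700 Hz from one speaker have a mean of 500 and a sample standard deviation of 200, so they normalize to -1, 0, and 1. Normalized values are unitless, which is what makes vowel spaces from different speakers comparable.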