Completely Automated Alignment and Vowel Extraction
Our automated system takes uploaded audio files and returns ASR transcriptions, alignments, and vowel formant measurements.
We recommend reading the discussion of the completely automated system's functionality and limitations before you begin.
Audio with transcriptions provided by our in-house speech recognition
This system uses ASR built upon the CMU Sphinx framework to transcribe your data, then runs the result through automated alignment and extraction using Montreal Forced Aligner and FAVE-Extract. You can also edit the transcripts produced by the speech recognizer and rerun the analysis.
Automated data analysis requires a higher tolerance for potential noise in the alignment and formant extraction results. You can estimate this noise with our transcription evaluation tool, which compares a manual transcription of your recording with the ASR transcription of the same recording and uses weighted Levenshtein distance to compute error rates for words, phonemes, and stressed vowels.
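To give a rough idea of what a weighted Levenshtein error rate measures, here is a minimal Python sketch. It is not the evaluation tool's own code: the function names, the uniform default weights, and the choice to normalize by the length of the manual transcription are all assumptions made for illustration. The same function applies to any sequence of tokens, so it can be run on words or on phoneme labels.

```python
def weighted_levenshtein(ref, hyp, sub_cost=1.0, ins_cost=1.0, del_cost=1.0):
    """Minimum total cost of the edits that turn `ref` into `hyp`.

    `ref` and `hyp` are sequences of tokens (words, phonemes, ...).
    The three cost parameters are illustrative weights, not the values
    used by any particular tool.
    """
    # prev[j] = cost of turning the first i-1 tokens of ref into hyp[:j]
    prev = [j * ins_cost for j in range(len(hyp) + 1)]
    for i, r in enumerate(ref, start=1):
        curr = [i * del_cost]  # turning ref[:i] into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            match_or_sub = prev[j - 1] + (0.0 if r == h else sub_cost)
            deletion = prev[j] + del_cost       # ref token missing from hyp
            insertion = curr[j - 1] + ins_cost  # extra token in hyp
            curr.append(min(match_or_sub, deletion, insertion))
        prev = curr
    return prev[-1]


def error_rate(ref, hyp, **costs):
    """Weighted edit cost divided by the length of the reference (assumed normalization)."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return weighted_levenshtein(ref, hyp, **costs) / len(ref)


# Hypothetical example data: a manual transcription and an ASR transcription.
manual = "the cat sat on the mat".split()
asr = "the cat sat in the".split()
word_error_rate = error_rate(manual, asr)

# The same function on phoneme labels (ARPAbet-style, made up for illustration).
phoneme_error_rate = error_rate(["K", "AE1", "T"], ["K", "AH0", "T"])
```

In this example the ASR output has one substituted word and one missing word, so with uniform weights the word error rate is 2 edits over 6 reference words. Raising `sub_cost` relative to the other weights penalizes misrecognized tokens more heavily than dropped or extra ones.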