Systems and Methods for Natural Language Processing for Speech Content Scoring
- Author(s):
- Chen, Lei; Zechner, Klaus; Loukina, Anastassia
- Patent Issued:
- Oct 24, 2017
- Patent Number:
- 9,799,228
- Source:
- ETS Patent
- Document Type:
- Patent
- Family ID:
- 1000002908040
- Subject/Key Words:
- Patent, Active Patent, Automated Scoring and Natural Language Processing, Speech Scoring, Computer Systems, Training, Vectors (Mathematics), Spoken Language Assessment
Abstract
Computer-implemented systems and methods are provided for scoring content of a spoken response to a prompt. A scoring model is generated for a prompt, where generating the scoring model includes generating a transcript for each of a plurality of training responses to the prompt, dividing the plurality of training responses into clusters based on the transcripts of the training responses, selecting a subset of the training responses in each cluster for scoring, scoring the selected subset of training responses for each cluster, and generating content training vectors using the transcripts from the scored subset. A transcript is generated for a received spoken response to be scored, and a similarity metric is computed between the transcript of the spoken response to be scored and the content training vectors. A score is assigned to the spoken response based on the determined similarity metric.