skip to main content skip to footer

Systems and Methods for Identifying Collocation Errors in Text

Author(s):
Futagi, Yoko; Deane, Paul; Chodorow, Martin
Patent Issued:
Jun 25, 2013
Patent Number:
8,473,278
Source:
ETS Patent
Document Type:
Patent
Family ID:
41653735
Subject/Key Words:
Patent, Active Patent, Collocation (Linguistics), Automatic Error Detection, Automated Scoring and Natural Language Processing, Language Learning Tools, Syntactic Analysis, Linguistic Annotation, Text Analysis

Abstract

Systems and methods for detecting collocation errors in a text sample using a reference database from a corpus are provided. Collocation candidates are identified within the text sample based upon syntactic patterns in the text sample. Whether a given collocation candidate contains a collocation error is detected, the detecting including: determining a first association measure using the reference database for the given collocation candidate; determining whether the first association measure satisfies a predetermined condition and identifying the given collocation candidate as proper if the first association measure satisfies the predetermined condition; determining an additional association measure for a variation of the given collocation candidate using the reference database; and determining whether or not the collocation candidate contains an error based upon the additional association measure of the variation.

Read More