Syntactic Context; Semantic Preferences; Chi-Squared Goodness of Fit Test
Institute of Mathematics and Informatics Bulgarian Academy of Sciences
Pliska Studia Mathematica Bulgarica, Vol. 16, No. 1 (2004), pp. 171-182
Semantically related words are modelled as words that have the same probability distribution over the set of syntactic contexts occurring in text corpora. A learning algorithm for finding clusters of semantically related words is developed. In that algorithm, the chi-squared statistic is used as a performance measure.
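The modelling idea can be illustrated with a minimal sketch, which is not the paper's algorithm. It assumes each word is represented by counts of the syntactic contexts it occurs in, and it computes the Pearson chi-squared statistic for the hypothesis that two words share one context distribution. The function name `chi_squared_homogeneity`, the context labels, and all counts are hypothetical.

```python
def chi_squared_homogeneity(counts_a, counts_b):
    """Return (statistic, degrees_of_freedom) for two context-count dicts.

    Treats the two words as the rows of a 2 x k contingency table whose
    columns are the syntactic contexts observed for at least one word.
    """
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        raise ValueError("each word needs at least one observed context")
    grand_total = total_a + total_b

    # Keep only contexts that were actually observed for either word.
    contexts = [
        c for c in sorted(set(counts_a) | set(counts_b))
        if counts_a.get(c, 0) + counts_b.get(c, 0) > 0
    ]

    statistic = 0.0
    for context in contexts:
        observed_a = counts_a.get(context, 0)
        observed_b = counts_b.get(context, 0)
        column_total = observed_a + observed_b
        # Expected counts if both words shared one context distribution.
        expected_a = total_a * column_total / grand_total
        expected_b = total_b * column_total / grand_total
        statistic += (observed_a - expected_a) ** 2 / expected_a
        statistic += (observed_b - expected_b) ** 2 / expected_b

    return statistic, len(contexts) - 1


# Hypothetical context counts (illustrative only, not data from the paper).
coffee = {"obj_of:drink": 40, "mod_by:hot": 20}
tea = {"obj_of:drink": 20, "mod_by:hot": 10}
car = {"obj_of:drink": 2, "obj_of:drive": 28}

stat_near, df_near = chi_squared_homogeneity(coffee, tea)
stat_far, df_far = chi_squared_homogeneity(coffee, car)
```

A small statistic relative to the chi-squared critical value for the given degrees of freedom is consistent with the two words sharing a context distribution, so under this sketch `coffee` and `tea` would be candidates for the same cluster while `coffee` and `car` would not. How the paper turns the statistic into a clustering criterion is not stated in the abstract.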