Computation of Word Associations Based on the Co-Occurrences of Words in Large Corpora
Abstract: A statistical model is presented that predicts the strengths of
word associations from the relative frequencies of the co-occurrences
of words in large bodies of text. These predictions are compared with the
Minnesota association norms for 100 stimulus words. The average agreement
between the predicted and the observed responses is only slightly weaker
than the agreement between the responses of an arbitrary subject
and the responses of the other subjects. It is shown that the
approach leads to equally good results for both English and German.
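
The general idea of predicting associations from co-occurrence counts can be illustrated with a minimal sketch. This is not the model from the paper: the window size, the scoring rule (share of a stimulus word's co-occurrences that fall on each candidate response), and the function names `cooccurrence_counts` and `predict_associations` are all assumptions made for illustration.

```python
from collections import Counter


def cooccurrence_counts(tokens, window=2):
    """Count ordered pairs (word, neighbor) that occur within `window` tokens.

    The window size is an illustrative assumption, not a value from the paper.
    """
    pair_freq = Counter()
    for i, word in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pair_freq[(word, tokens[j])] += 1
    return pair_freq


def predict_associations(stimulus, pair_freq, top_n=3):
    """Rank candidate responses by their share of the stimulus' co-occurrences.

    Hypothetical scoring rule: count(stimulus, response) divided by the total
    number of co-occurrences of the stimulus with any other word.
    """
    counts = {
        b: n for (a, b), n in pair_freq.items()
        if a == stimulus and b != stimulus
    }
    total = sum(counts.values())
    if total == 0:
        return []
    scores = {b: n / total for b, n in counts.items()}
    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


if __name__ == "__main__":
    toy_corpus = "bread butter bread butter bread knife".split()
    pairs = cooccurrence_counts(toy_corpus, window=1)
    print(predict_associations("bread", pairs))
```

On a real corpus the predicted ranking for each stimulus word would then be compared with the responses recorded in association norms; a raw share like this tends to favor very frequent words, so some correction for response frequency would likely be needed.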
Home-page Reinhard Rapp