- 04-94 Beate Dorow, Dominic Widdows, Katarina Ling, Jean-Pierre Eckmann, Danilo Sergi, Elisha Moses
- Using Curvature and Markov Clustering in Graphs for Lexical
Acquisition and Word Sense Discrimination
(1037K, postscript)
Mar 29, 04
-
Abstract ,
Paper (src),
View paper
(auto. generated ps),
Index
of related papers
-
Abstract. We introduce two different approaches for clustering semantically
similar words. We accommodate ambiguity by allowing a word to belong to several clusters.
Both methods use a graph-theoretic representation of words and
their paradigmatic
relationships. The first approach is based on the
concept of {\em curvature} and divides the word graph into classes of
similar words by removing words of low curvature which connect several
dispersed clusters.
The second method, instead of clustering the nodes,
clusters the links in our graph. These contain more specific contextual
information than nodes representing just words. In so doing, we
naturally accommodate ambiguity by allowing multiple class membership.
Both methods are evaluated on a lexical acquisition task, using
clustering to add nouns to the WordNet taxonomy. The most effective
method is link clustering.
- Files:
04-94.src(
04-94.keywords ,
export.ps )