Title: An efficient algorithm for building a distributional thesaurus
Abstract: Gorman and Curran (2006) argue that thesaurus generation for
billion+-word corpora is problematic as the full computation
takes many days. We present an algorithm with which the
computation takes under two hours. We have created, and made
publicly available, thesauruses based on large corpora for (at
time of writing) seven major world languages. The development
is implemented in the Sketch Engine.
Publication Year: 2007
Publication Date: 2007-01-01
Language: en
Type: article
Access and Citation
Cited By Count: 4
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot