著者
M. D. Gordon
タイトル
User-based document clustering by redescribing subject descriptions with a genetic algorithm
ページ
311-322
日時
June 1991
概要
Information retrieval systems have used clustering of documents and queries to improve both retrieval efficiency and retrieval effectiveness. Normally, clustering involves grouping together static descriptions of documents by their similarity to each other, though user-based clustering suggests that usage patterns concerning co-relevance can form a basis for clustering. The author reports that clusters of co-relevant documents obtain increasingly similar descriptions when a genetic algorithm is used to adapt subject descriptions so that documents become more effective in matching relevant queries and failing to match nonrelevant queries. As a result of the increased similarity, clustering algorithms can more accurately group documents into useful clusters. The findings of this work were reached through simulation experiments
カテゴリ
GA
Category: GA
Organization: Graduate School of Business Administration,
        Michigan University, Ann Arbor, MI, USA
Abstract: Information retrieval systems have used clustering
        of documents and queries to improve both retrieval
        efficiency and retrieval effectiveness. Normally,
        clustering involves grouping together static
        descriptions of documents by their similarity to
        each other, though user-based clustering suggests
        that usage patterns concerning co-relevance can form
        a basis for clustering. The author reports that
        clusters of co-relevant documents obtain
        increasingly similar descriptions when a genetic
        algorithm is used to adapt subject descriptions so
        that documents become more effective in matching
        relevant queries and failing to match nonrelevant
        queries. As a result of the increased similarity,
        clustering algorithms can more accurately group
        documents into useful clusters. The findings of this
        work were reached through simulation experiments
Number: 5
Bibtype: Article
Author: M. D. Gordon
Pages: 311-322
Month: jun
Source: Journal of the American Society for Information
        Science
Title: User-based document clustering by redescribing
        subject descriptions with a genetic algorithm
Year: 1991
Volume: 42
Keyword: classification, genetic algorithms, information
        retrieval systems, IRS, information retrieval
        systems, document clustering, subject descriptions,
        genetic algorithm, retrieval efficiency, retrieval
        effectiveness, user-based clustering, usage
        patterns, co-relevance, co-relevant documents,
        simulation experiments