Coreference Analysis on XML Documents Utilizing Structural Features / 文書構造を考慮したXML文書の照応解析

  Keyword retrieval exploits frequencies and positions of search keywords in target documents. As for retrieval by two or more keywords, semantic relation between keywords is important to improve accuracy.
  For retrieving information about a person, it is common to search by a pair of keywords consisting of person's name and his/her attribute of the interest. By using dependency analysis and coreference analysis, correct occurrences of pairs of person and his/her attributes can be retrieved. However, existing natural language analysis does not consider the factor that logical structures of the documents strongly influence probabilistic patterns of coreference. We propose a new way of person retrieval by computing a maximum entropy model from linguistic features and structural features, where structural features are learned from probabilistic distribution of coreference over XML document structures. Our method can utilize strong correlation between XML document structures and coreference, thus having superior accuracy than existing methods.




About this Entry

This page contains a single entry by webadmin published on June 8, 2011 8:09 PM.

Risk-Aware Identity Management / リスクを考慮したアイデンティティ管理 was the previous entry in this blog.

Quality Evaluation of Wikipedia Articles / ウィキペディア記事の品質評価 is the next entry in this blog.

Find recent content on the main index or look in the archives to find all content.