Faculty Publications

Web mining from competitors' websites

Xin Chen, New Jersey Institute of Technology
Yi Fang Brook Wu, New Jersey Institute of Technology

Document Type

Conference Proceeding

Publication Date

1-1-2005

Abstract

This paper presents a framework for user-oriented text mining. It is then illustrated with an example of discovering knowledge from competitors' websites. The knowledge to be discovered is in the form of association rules. A user's background knowledge is represented as a concept hierarchy developed from documents on his/her own website. The concept hierarchy captures the semantic usage of words and relationships among words in background documents. Association rules are identified among the noun phrases extracted from documents on competitors' websites. The interestingness measure, i.e. novelty, which measures the semantic distance between the antecedent and the consequent of a rule in the background knowledge, is computed from the co-occurrence frequency of words and the connection lengths among words in the concept hierarchy. A user evaluation of the novelty of discovered rules demonstrates that the correlation between the algorithm and the human judges is comparable to that between human judges. Copyright 2005 ACM.

Identifier

32344432836 (Scopus)

Publication Title

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

External Full Text Location

https://doi.org/10.1145/1081870.1081935

First Page

550

Last Page

555

Recommended Citation

Chen, Xin and Wu, Yi Fang Brook, "Web mining from competitors' websites" (2005). Faculty Publications. 19826.
https://digitalcommons.njit.edu/fac_pubs/19826

This document is currently not available here.

COinS

DOI

10.1145/1081870.1081935

Faculty Publications

Web mining from competitors' websites

Document Type

Publication Date

Abstract

Identifier

Publication Title

External Full Text Location

First Page

Last Page

Recommended Citation

DOI

Search

Browse

Author Corner

Links

Faculty Publications

Web mining from competitors' websites

Authors

Document Type

Publication Date

Abstract

Identifier

Publication Title

External Full Text Location

First Page

Last Page

Recommended Citation

Share

DOI

Search

Browse

Author Corner

Links