Document Type
Thesis
Date of Award
Fall 10-31-1997
Degree Name
Master of Science in Computer Science - (M.S.)
Department
Computer and Information Science
First Advisor
Jason T. L. Wang
Second Advisor
James A. McHugh
Third Advisor
Peter A. Ng
Abstract
The usefulness and accessibility of programs and systems have become important issues for users and researchers alike. A program's usefulness can lw measured by the frequency with which it is used. From the author's or maintainer's point of view, the frequency of usage can be determined by how often a request for the software is received. In the past, a user became aware of a particular tool through various means, and contacted the author or maintainer to obtain a copy of it. This presented difficulties, ranging from language barriers to machine incompatibilities to control of the use of the program. Furthermore, accessibility of the program was limited to the particular machines on which it was installed (at the remote site). By porting programs and systems to the World Wide Web, the problems of accessibility and usefulness can be mitigated. Now programs can be advertised (in a non-commercial sense) to all interested parties, problems of machine incompatibility can be reduced (with the exception of browser incompatibilities), and control of the use and modification can be maintained. This thesis discusses the porting of two tools to the World Wide Web. The tools are SDISCOVER, a data mining tool used in protein string matching, and TREEDIFF, a structured document comparison toolkit. Spinoff of this research is the development of two home pages for conference registrations and Oracle user account applications in a university environment.
Recommended Citation
Johnson, Philip B., "Structured document comparison and scientific data mining on the world wide web" (1997). Theses. 1009.
https://digitalcommons.njit.edu/theses/1009