Designing a value based niche search engine using evolutionary strategies

Sourav Sengupta, Bernard J. Jansen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

Original languageEnglish (US)
Title of host publicationProceedings ITCC 2005 - International Conference on Information Technology
Subtitle of host publicationCoding and Computing
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages800-805
Number of pages6
ISBN (Print)0769523153, 9780769523156
DOIs
StatePublished - 2005
EventITCC 2005 - International Conference on Information Technology: Coding and Computing - Las Vegas, NV, United States
Duration: Apr 4 2005Apr 6 2005

Publication series

NameInternational Conference on Information Technology: Coding and Computing, ITCC
Volume1

Other

OtherITCC 2005 - International Conference on Information Technology: Coding and Computing
Country/TerritoryUnited States
CityLas Vegas, NV
Period4/4/054/6/05

All Science Journal Classification (ASJC) codes

  • General Engineering

Fingerprint

Dive into the research topics of 'Designing a value based niche search engine using evolutionary strategies'. Together they form a unique fingerprint.

Cite this