Text-based content search and retrieval in ad-hoc P2P communities

Francisco Matias Cuenca-Acuna, Thu D. Nguyen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

24 Scopus citations

Abstract

We consider the problem of content search and retrieval in peer-to-peer (P2P) communities. P2P computing is a potentially powerful model for information sharing between ad hoc groups of users because of its low cost of entry and natural model for resource scaling. As P2P communities grow, however, locating information distributed across the large number of peers becomes problematic. We address this problem by adapting a state-of-the-art text-based document ranking algorithm, the vector-space model instantiated with the TFxIDF ranking rule, to the P2P environment. We make three contributions: (a) we show how to approximate TFxIDF using compact summaries of individual peers' inverted indexes rather than the inverted index of the entire communal store; (b) we develop a heuristic for adaptively determining the set of peers that should be contacted for a query; and (c) we show that our algorithm tracks TFxIDF's performance very closely, giving P2P communities a search and retrieval algorithm as good as that possible assuming a centralized server.

Original languageEnglish (US)
Title of host publicationWeb Engineering and Peer-to-Peer Computing - NETWORKING 2002 Workshops, Revised Papers
Pages220-234
Number of pages15
StatePublished - Dec 1 2002
EventInternational Workshop on Web Engineering and Peer-to-Peer Computing, NETWORKING 2002 - Pisa, Italy
Duration: May 19 2002May 24 2002

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2376 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

OtherInternational Workshop on Web Engineering and Peer-to-Peer Computing, NETWORKING 2002
CountryItaly
CityPisa
Period5/19/025/24/02

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Text-based content search and retrieval in ad-hoc P2P communities'. Together they form a unique fingerprint.

  • Cite this

    Cuenca-Acuna, F. M., & Nguyen, T. D. (2002). Text-based content search and retrieval in ad-hoc P2P communities. In Web Engineering and Peer-to-Peer Computing - NETWORKING 2002 Workshops, Revised Papers (pp. 220-234). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2376 LNCS).