Mirror site maintenance based on evolution associations of web directories

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

External Research Organisations

  • Nanyang Technological University (NTU)
View graph of relations

Details

Original languageEnglish
Title of host publication16th International World Wide Web Conference, WWW2007
PublisherAssociation for Computing Machinery (ACM)
Pages1297-1298
Number of pages2
ISBN (print)1595936548, 9781595936547
Publication statusPublished - 8 May 2007
Event16th International World Wide Web Conference, WWW2007 - Banff, AB, Canada
Duration: 8 May 200712 May 2007

Publication series

Name16th International World Wide Web Conference, WWW2007

Abstract

Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.

Keywords

    Evolution correlation, Mirror maintenance, Web evolution

ASJC Scopus subject areas

Cite this

Mirror site maintenance based on evolution associations of web directories. / Chen, Ling; Bhowmick, Sourav; Nejdl, Wolfgang.
16th International World Wide Web Conference, WWW2007. Association for Computing Machinery (ACM), 2007. p. 1297-1298 (16th International World Wide Web Conference, WWW2007).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Chen, L, Bhowmick, S & Nejdl, W 2007, Mirror site maintenance based on evolution associations of web directories. in 16th International World Wide Web Conference, WWW2007. 16th International World Wide Web Conference, WWW2007, Association for Computing Machinery (ACM), pp. 1297-1298, 16th International World Wide Web Conference, WWW2007, Banff, AB, Canada, 8 May 2007. https://doi.org/10.1145/1242572.1242817
Chen, L., Bhowmick, S., & Nejdl, W. (2007). Mirror site maintenance based on evolution associations of web directories. In 16th International World Wide Web Conference, WWW2007 (pp. 1297-1298). (16th International World Wide Web Conference, WWW2007). Association for Computing Machinery (ACM). https://doi.org/10.1145/1242572.1242817
Chen L, Bhowmick S, Nejdl W. Mirror site maintenance based on evolution associations of web directories. In 16th International World Wide Web Conference, WWW2007. Association for Computing Machinery (ACM). 2007. p. 1297-1298. (16th International World Wide Web Conference, WWW2007). doi: 10.1145/1242572.1242817
Chen, Ling ; Bhowmick, Sourav ; Nejdl, Wolfgang. / Mirror site maintenance based on evolution associations of web directories. 16th International World Wide Web Conference, WWW2007. Association for Computing Machinery (ACM), 2007. pp. 1297-1298 (16th International World Wide Web Conference, WWW2007).
Download
@inproceedings{641052faeb554daf9afb8d9830543216,
title = "Mirror site maintenance based on evolution associations of web directories",
abstract = "Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the {"}freshness{"} of the mirrors.",
keywords = "Evolution correlation, Mirror maintenance, Web evolution",
author = "Ling Chen and Sourav Bhowmick and Wolfgang Nejdl",
year = "2007",
month = may,
day = "8",
doi = "10.1145/1242572.1242817",
language = "English",
isbn = "1595936548",
series = "16th International World Wide Web Conference, WWW2007",
publisher = "Association for Computing Machinery (ACM)",
pages = "1297--1298",
booktitle = "16th International World Wide Web Conference, WWW2007",
address = "United States",
note = "16th International World Wide Web Conference, WWW2007 ; Conference date: 08-05-2007 Through 12-05-2007",

}

Download

TY - GEN

T1 - Mirror site maintenance based on evolution associations of web directories

AU - Chen, Ling

AU - Bhowmick, Sourav

AU - Nejdl, Wolfgang

PY - 2007/5/8

Y1 - 2007/5/8

N2 - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.

AB - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.

KW - Evolution correlation

KW - Mirror maintenance

KW - Web evolution

UR - http://www.scopus.com/inward/record.url?scp=35348821234&partnerID=8YFLogxK

U2 - 10.1145/1242572.1242817

DO - 10.1145/1242572.1242817

M3 - Conference contribution

AN - SCOPUS:35348821234

SN - 1595936548

SN - 9781595936547

T3 - 16th International World Wide Web Conference, WWW2007

SP - 1297

EP - 1298

BT - 16th International World Wide Web Conference, WWW2007

PB - Association for Computing Machinery (ACM)

T2 - 16th International World Wide Web Conference, WWW2007

Y2 - 8 May 2007 through 12 May 2007

ER -

By the same author(s)