Details
Original language | English |
---|---|
Title of host publication | 16th International World Wide Web Conference, WWW2007 |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1297-1298 |
Number of pages | 2 |
ISBN (print) | 1595936548, 9781595936547 |
Publication status | Published - 8 May 2007 |
Event | 16th International World Wide Web Conference, WWW2007 - Banff, AB, Canada Duration: 8 May 2007 → 12 May 2007 |
Publication series
Name | 16th International World Wide Web Conference, WWW2007 |
---|
Abstract
Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
Keywords
- Evolution correlation, Mirror maintenance, Web evolution
ASJC Scopus subject areas
- Computer Science(all)
- Computer Networks and Communications
- Computer Science(all)
- Software
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
16th International World Wide Web Conference, WWW2007. Association for Computing Machinery (ACM), 2007. p. 1297-1298 (16th International World Wide Web Conference, WWW2007).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - Mirror site maintenance based on evolution associations of web directories
AU - Chen, Ling
AU - Bhowmick, Sourav
AU - Nejdl, Wolfgang
PY - 2007/5/8
Y1 - 2007/5/8
N2 - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
AB - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
KW - Evolution correlation
KW - Mirror maintenance
KW - Web evolution
UR - http://www.scopus.com/inward/record.url?scp=35348821234&partnerID=8YFLogxK
U2 - 10.1145/1242572.1242817
DO - 10.1145/1242572.1242817
M3 - Conference contribution
AN - SCOPUS:35348821234
SN - 1595936548
SN - 9781595936547
T3 - 16th International World Wide Web Conference, WWW2007
SP - 1297
EP - 1298
BT - 16th International World Wide Web Conference, WWW2007
PB - Association for Computing Machinery (ACM)
T2 - 16th International World Wide Web Conference, WWW2007
Y2 - 8 May 2007 through 12 May 2007
ER -