Automated delivery of Web documents through a caching infrastructure

Pablo Rodriguez, Ernst W. Biersack, Keith Ross

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.

    Original languageEnglish (US)
    Title of host publicationProceedings - 29th EUROMICRO Conference, EUROMICRO 2003
    Pages233-240
    Number of pages8
    DOIs
    StatePublished - 2003
    Event29th EUROMICRO Conference, EUROMICRO 2003 - Belek-Antalya, Turkey
    Duration: Sep 1 2003Sep 6 2003

    Other

    Other29th EUROMICRO Conference, EUROMICRO 2003
    CountryTurkey
    CityBelek-Antalya
    Period9/1/039/6/03

    Fingerprint

    Servers
    Scalability
    Internet
    Bandwidth

    ASJC Scopus subject areas

    • Hardware and Architecture
    • Control and Systems Engineering

    Cite this

    Rodriguez, P., Biersack, E. W., & Ross, K. (2003). Automated delivery of Web documents through a caching infrastructure. In Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003 (pp. 233-240). [1231595] https://doi.org/10.1109/EURMIC.2003.1231595

    Automated delivery of Web documents through a caching infrastructure. / Rodriguez, Pablo; Biersack, Ernst W.; Ross, Keith.

    Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003. 2003. p. 233-240 1231595.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Rodriguez, P, Biersack, EW & Ross, K 2003, Automated delivery of Web documents through a caching infrastructure. in Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003., 1231595, pp. 233-240, 29th EUROMICRO Conference, EUROMICRO 2003, Belek-Antalya, Turkey, 9/1/03. https://doi.org/10.1109/EURMIC.2003.1231595
    Rodriguez P, Biersack EW, Ross K. Automated delivery of Web documents through a caching infrastructure. In Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003. 2003. p. 233-240. 1231595 https://doi.org/10.1109/EURMIC.2003.1231595
    Rodriguez, Pablo ; Biersack, Ernst W. ; Ross, Keith. / Automated delivery of Web documents through a caching infrastructure. Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003. 2003. pp. 233-240
    @inproceedings{ca26c86417414511bbce6531b7096a81,
    title = "Automated delivery of Web documents through a caching infrastructure",
    abstract = "The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.",
    author = "Pablo Rodriguez and Biersack, {Ernst W.} and Keith Ross",
    year = "2003",
    doi = "10.1109/EURMIC.2003.1231595",
    language = "English (US)",
    isbn = "0769519962",
    pages = "233--240",
    booktitle = "Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003",

    }

    TY - GEN

    T1 - Automated delivery of Web documents through a caching infrastructure

    AU - Rodriguez, Pablo

    AU - Biersack, Ernst W.

    AU - Ross, Keith

    PY - 2003

    Y1 - 2003

    N2 - The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.

    AB - The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.

    UR - http://www.scopus.com/inward/record.url?scp=84889589974&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=84889589974&partnerID=8YFLogxK

    U2 - 10.1109/EURMIC.2003.1231595

    DO - 10.1109/EURMIC.2003.1231595

    M3 - Conference contribution

    SN - 0769519962

    SN - 9780769519968

    SP - 233

    EP - 240

    BT - Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003

    ER -