Pollution in P2P file sharing systems

Jian Liang, Rakesh Kumar, Yongjian Xi, Keith Ross

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Abstract

    One way to combat P2P file sharing of copyrighted content is to deposit into the file sharing systems large volumes of polluted files. Without taking sides in the file sharing debate, in this paper we undertake a measurement study of the nature and magnitude of pollution in the FastTrack P2P network, currently the most popular P2P file sharing system. We develop a crawling platform which crawls the majority of the FastTrack Network's 20,000+ supernodes in less than 60 minutes. From the raw data gathered by the crawler for popular audio content, we obtain statistics on the number of unique versions and copies available in a 24-hour period. We develop an automated procedure to detect whether a given version is polluted or not, and we show that the probabilities of false positives and negatives of the detection procedure are very small. We use the data from the crawler and our pollution detection algorithm to determine the fraction of versions and fraction of copies that are polluted for several recent and old songs. We observe that pollution is pervasive for recent popular songs. We also identify and describe a number of anti-pollution mechanisms.
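The abstract reports two separate metrics, the fraction of versions that are polluted and the fraction of copies that are polluted. The sketch below is only an illustration of how the two can differ. The `Version` record, the `pollution_fractions` helper, and all of the numbers are hypothetical and are not taken from the paper's data or code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Version:
    version_id: str  # hypothetical identifier, e.g. a content hash
    copies: int      # copies of this version seen in the observation window
    polluted: bool   # verdict from some detection procedure (assumed given)


def pollution_fractions(versions):
    """Return (fraction of versions polluted, fraction of copies polluted)."""
    total_versions = len(versions)
    total_copies = sum(v.copies for v in versions)
    if total_versions == 0 or total_copies == 0:
        raise ValueError("need at least one version and one copy")
    polluted_versions = sum(1 for v in versions if v.polluted)
    polluted_copies = sum(v.copies for v in versions if v.polluted)
    return polluted_versions / total_versions, polluted_copies / total_copies


# Made-up sample for one song title: four versions, 200 copies in total.
sample = [
    Version("a", 10, False),
    Version("b", 60, True),
    Version("c", 30, True),
    Version("d", 100, False),
]
version_fraction, copy_fraction = pollution_fractions(sample)
print(f"versions polluted: {version_fraction:.2f}, copies polluted: {copy_fraction:.2f}")
```

Weighting by copies matters because a handful of widely replicated versions can dominate what a user is likely to download, so the two fractions need not agree.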

    Original language: English (US)
    Title of host publication: Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies
    Editors: K. Makki, E. Knightly
    Pages: 1174-1185
    Number of pages: 12
    Volume: 2
    DOI: https://doi.org/10.1109/INFCOM.2005.1498344
    State: Published - 2005
    Event: IEEE INFOCOM 2005 - Miami, FL, United States
    Duration: Mar 13 2005 to Mar 17 2005

    Other

    Other: IEEE INFOCOM 2005
    Country: United States
    City: Miami, FL
    Period: 3/13/05 to 3/17/05

    Fingerprint

    Pollution
    Pollution detection
    Deposits
    Statistics

    Keywords

    • Network measurements

    ASJC Scopus subject areas

    • Electrical and Electronic Engineering
    • Hardware and Architecture

    Cite this

    Liang, J., Kumar, R., Xi, Y., & Ross, K. (2005). Pollution in P2P file sharing systems. In K. Makki, & E. Knightly (Eds.), Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies (Vol. 2, pp. 1174-1185) https://doi.org/10.1109/INFCOM.2005.1498344

    Pollution in P2P file sharing systems. / Liang, Jian; Kumar, Rakesh; Xi, Yongjian; Ross, Keith.

    Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies. ed. / K. Makki; E. Knightly. Vol. 2 2005. p. 1174-1185.

    Liang, J, Kumar, R, Xi, Y & Ross, K 2005, Pollution in P2P file sharing systems. in K Makki & E Knightly (eds), Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies. vol. 2, pp. 1174-1185, IEEE INFOCOM 2005, Miami, FL, United States, 3/13/05. https://doi.org/10.1109/INFCOM.2005.1498344
    Liang J, Kumar R, Xi Y, Ross K. Pollution in P2P file sharing systems. In Makki K, Knightly E, editors, Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies. Vol. 2. 2005. p. 1174-1185 https://doi.org/10.1109/INFCOM.2005.1498344
    Liang, Jian ; Kumar, Rakesh ; Xi, Yongjian ; Ross, Keith. / Pollution in P2P file sharing systems. Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies. editor / K. Makki ; E. Knightly. Vol. 2 2005. pp. 1174-1185
    @inproceedings{fdbfa3b6d44e414fa772ccf3027cfd1f,
    title = "Pollution in P2P file sharing systems",
    abstract = "One way to combat P2P file sharing of copyrighted content is to deposit into the file sharing systems large volumes of polluted files. Without taking sides in the file sharing debate, in this paper we undertake a measurement study of the nature and magnitude of pollution in the FastTrack P2P network, currently the most popular P2P file sharing system. We develop a crawling platform which crawls the majority of the FastTrack Network's 20,000+ supernodes in less than 60 minutes. From the raw data gathered by the crawler for popular audio content, we obtain statistics on the number of unique versions and copies available in a 24-hour period. We develop an automated procedure to detect whether a given version is polluted or not, and we show that the probabilities of false positives and negatives of the detection procedure are very small. We use the data from the crawler and our pollution detection algorithm to determine the fraction of versions and fraction of copies that are polluted for several recent and old songs. We observe that pollution is pervasive for recent popular songs. We also identify and describe a number of anti-pollution mechanisms.",
    keywords = "Network measurements",
    author = "Jian Liang and Rakesh Kumar and Yongjian Xi and Keith Ross",
    year = "2005",
    doi = "10.1109/INFCOM.2005.1498344",
    language = "English (US)",
    isbn = "0780389689",
    volume = "2",
    pages = "1174--1185",
    editor = "K. Makki and E. Knightly",
    booktitle = "Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies",

    }

    TY - GEN

    T1 - Pollution in P2P file sharing systems

    AU - Liang, Jian

    AU - Kumar, Rakesh

    AU - Xi, Yongjian

    AU - Ross, Keith

    PY - 2005

    Y1 - 2005

    N2 - One way to combat P2P file sharing of copyrighted content is to deposit into the file sharing systems large volumes of polluted files. Without taking sides in the file sharing debate, in this paper we undertake a measurement study of the nature and magnitude of pollution in the FastTrack P2P network, currently the most popular P2P file sharing system. We develop a crawling platform which crawls the majority of the FastTrack Network's 20,000+ supernodes in less than 60 minutes. From the raw data gathered by the crawler for popular audio content, we obtain statistics on the number of unique versions and copies available in a 24-hour period. We develop an automated procedure to detect whether a given version is polluted or not, and we show that the probabilities of false positives and negatives of the detection procedure are very small. We use the data from the crawler and our pollution detection algorithm to determine the fraction of versions and fraction of copies that are polluted for several recent and old songs. We observe that pollution is pervasive for recent popular songs. We also identify and describe a number of anti-pollution mechanisms.

    KW - Network measurements

    UR - http://www.scopus.com/inward/record.url?scp=25844529486&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=25844529486&partnerID=8YFLogxK

    U2 - 10.1109/INFCOM.2005.1498344

    DO - 10.1109/INFCOM.2005.1498344

    M3 - Conference contribution

    SN - 0780389689

    VL - 2

    SP - 1174

    EP - 1185

    BT - Proceedings - IEEE INFOCOM 2005. The Conference on Computer Communications - 24th Annual Joint Conference of the IEEE Computer and Communications Societies

    A2 - Makki, K.

    A2 - Knightly, E.

    ER -