Saving Data Journalism: New strategies for archiving interactive, born-digital news

Meredith Broussard, Katherine Boss

    Research output: Contribution to journalArticle

    Abstract

    Important works of data journalism are disappearing from the web because they are too technologically complex to be captured or archived by libraries or web archiving technologies. Research based on journalism depends on the existence of news archives. For the benefit of future scholars, it is imperative that libraries and newsrooms solve this problem. This research contends that dynamic web archiving of data journalism will require a new, emulation-based approach to capturing these works. This new approach in turn necessitates new web archiving tools and workflows to enable collaborative collection of the projects, because unlike in print-based archiving, the process will depend on detailed technical information sharing among stakeholders. Toward this end, this article summarizes the results of a questionnaire that described the most common frameworks, database technologies, and programming languages used to build 76 complex works of data journalism published between 2008 and 2017, as well the ways these works are being maintained and stored. This information can inform the development of emulation-based archiving tools to capture and preserve these stories using methods that would fit within the workflow of news organizations. This research is a first step toward devising an automated solution for long-term preservation of data journalism projects.

    Original languageEnglish (US)
    Pages (from-to)1206-1221
    Number of pages16
    JournalDigital Journalism
    Volume6
    Issue number9
    DOIs
    StatePublished - Oct 21 2018

    Fingerprint

    journalism
    news
    workflow
    Computer programming languages
    programming language
    stakeholder
    questionnaire

    Keywords

    • archiving
    • born-digital news
    • data journalism
    • news applications
    • software preservation
    • web archiving

    ASJC Scopus subject areas

    • Communication

    Cite this

    Saving Data Journalism : New strategies for archiving interactive, born-digital news. / Broussard, Meredith; Boss, Katherine.

    In: Digital Journalism, Vol. 6, No. 9, 21.10.2018, p. 1206-1221.

    Research output: Contribution to journalArticle

    Broussard, Meredith ; Boss, Katherine. / Saving Data Journalism : New strategies for archiving interactive, born-digital news. In: Digital Journalism. 2018 ; Vol. 6, No. 9. pp. 1206-1221.
    @article{9b835d26b53048759c08a3cc7c500f19,
    title = "Saving Data Journalism: New strategies for archiving interactive, born-digital news",
    abstract = "Important works of data journalism are disappearing from the web because they are too technologically complex to be captured or archived by libraries or web archiving technologies. Research based on journalism depends on the existence of news archives. For the benefit of future scholars, it is imperative that libraries and newsrooms solve this problem. This research contends that dynamic web archiving of data journalism will require a new, emulation-based approach to capturing these works. This new approach in turn necessitates new web archiving tools and workflows to enable collaborative collection of the projects, because unlike in print-based archiving, the process will depend on detailed technical information sharing among stakeholders. Toward this end, this article summarizes the results of a questionnaire that described the most common frameworks, database technologies, and programming languages used to build 76 complex works of data journalism published between 2008 and 2017, as well the ways these works are being maintained and stored. This information can inform the development of emulation-based archiving tools to capture and preserve these stories using methods that would fit within the workflow of news organizations. This research is a first step toward devising an automated solution for long-term preservation of data journalism projects.",
    keywords = "archiving, born-digital news, data journalism, news applications, software preservation, web archiving",
    author = "Meredith Broussard and Katherine Boss",
    year = "2018",
    month = "10",
    day = "21",
    doi = "10.1080/21670811.2018.1505437",
    language = "English (US)",
    volume = "6",
    pages = "1206--1221",
    journal = "Digital Journalism",
    issn = "2167-0811",
    publisher = "Taylor and Francis Ltd.",
    number = "9",

    }

    TY - JOUR

    T1 - Saving Data Journalism

    T2 - New strategies for archiving interactive, born-digital news

    AU - Broussard, Meredith

    AU - Boss, Katherine

    PY - 2018/10/21

    Y1 - 2018/10/21

    N2 - Important works of data journalism are disappearing from the web because they are too technologically complex to be captured or archived by libraries or web archiving technologies. Research based on journalism depends on the existence of news archives. For the benefit of future scholars, it is imperative that libraries and newsrooms solve this problem. This research contends that dynamic web archiving of data journalism will require a new, emulation-based approach to capturing these works. This new approach in turn necessitates new web archiving tools and workflows to enable collaborative collection of the projects, because unlike in print-based archiving, the process will depend on detailed technical information sharing among stakeholders. Toward this end, this article summarizes the results of a questionnaire that described the most common frameworks, database technologies, and programming languages used to build 76 complex works of data journalism published between 2008 and 2017, as well the ways these works are being maintained and stored. This information can inform the development of emulation-based archiving tools to capture and preserve these stories using methods that would fit within the workflow of news organizations. This research is a first step toward devising an automated solution for long-term preservation of data journalism projects.

    AB - Important works of data journalism are disappearing from the web because they are too technologically complex to be captured or archived by libraries or web archiving technologies. Research based on journalism depends on the existence of news archives. For the benefit of future scholars, it is imperative that libraries and newsrooms solve this problem. This research contends that dynamic web archiving of data journalism will require a new, emulation-based approach to capturing these works. This new approach in turn necessitates new web archiving tools and workflows to enable collaborative collection of the projects, because unlike in print-based archiving, the process will depend on detailed technical information sharing among stakeholders. Toward this end, this article summarizes the results of a questionnaire that described the most common frameworks, database technologies, and programming languages used to build 76 complex works of data journalism published between 2008 and 2017, as well the ways these works are being maintained and stored. This information can inform the development of emulation-based archiving tools to capture and preserve these stories using methods that would fit within the workflow of news organizations. This research is a first step toward devising an automated solution for long-term preservation of data journalism projects.

    KW - archiving

    KW - born-digital news

    KW - data journalism

    KW - news applications

    KW - software preservation

    KW - web archiving

    UR - http://www.scopus.com/inward/record.url?scp=85058220590&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85058220590&partnerID=8YFLogxK

    U2 - 10.1080/21670811.2018.1505437

    DO - 10.1080/21670811.2018.1505437

    M3 - Article

    AN - SCOPUS:85058220590

    VL - 6

    SP - 1206

    EP - 1221

    JO - Digital Journalism

    JF - Digital Journalism

    SN - 2167-0811

    IS - 9

    ER -