THE CANNON: A DATA-DRIVEN APPROACH to STELLAR LABEL DETERMINATION

M. Ness, David W. Hogg, H. W. Rix, Anna Y Q Ho, G. Zasowski

    Research output: Contribution to journal › Article

    Abstract

    New spectroscopic surveys offer the promise of stellar parameters and abundances ("stellar labels") for hundreds of thousands of stars; this poses a formidable spectral modeling challenge. In many cases, there is a subset of reference objects for which the stellar labels are known with high(er) fidelity. We take advantage of this with The Cannon, a new data-driven approach for determining stellar labels from spectroscopic data. The Cannon learns from the "known" labels of reference stars how the continuum-normalized spectra depend on these labels by fitting a flexible model at each wavelength; then, The Cannon uses this model to derive labels for the remaining survey stars. We illustrate The Cannon by training the model on only 542 stars in 19 clusters as reference objects, with T_eff, log g, and [Fe/H] as the labels, and then applying it to the spectra of 55,000 stars from APOGEE DR10. The Cannon is very accurate. Its stellar labels compare well to the stars for which APOGEE pipeline (ASPCAP) labels are provided in DR10, with rms differences that are basically identical to the stated ASPCAP uncertainties. Beyond the reference labels, The Cannon makes no use of stellar models nor any line-list, but needs a set of reference objects that span label-space. The Cannon performs well at lower signal-to-noise, as it delivers comparably good labels even at one-ninth the APOGEE observing time. We discuss the limitations of The Cannon and its future potential, particularly, to bring different spectroscopic surveys onto a consistent scale of stellar labels.
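
The two steps described in the abstract (fit a model at each wavelength from reference stars, then use it to derive labels for other stars) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The abstract says only that the model is "flexible", so the quadratic polynomial in the labels used here is an assumption, as are the function names (`design_matrix`, `train`, `predict`, `infer_labels`), the synthetic data, the rescaling of labels to order unity, and the neglect of per-pixel flux uncertainties.

```python
import numpy as np

def design_matrix(labels):
    """Quadratic features per star: 1, each label, and all pairwise products."""
    labels = np.atleast_2d(labels)
    n_stars, n_labels = labels.shape
    cols = [np.ones(n_stars)]
    cols += [labels[:, i] for i in range(n_labels)]
    cols += [labels[:, i] * labels[:, j]
             for i in range(n_labels) for j in range(i, n_labels)]
    return np.column_stack(cols)

def train(ref_flux, ref_labels):
    """Training step: linear least squares for one coefficient vector per pixel.

    ref_flux has shape (n_stars, n_pixels); the result has shape
    (n_features, n_pixels), i.e. an independent fit at each wavelength.
    """
    A = design_matrix(ref_labels)
    coeffs, *_ = np.linalg.lstsq(A, ref_flux, rcond=None)
    return coeffs

def predict(coeffs, labels):
    """Model spectrum (or spectra) for the given labels."""
    return design_matrix(labels) @ coeffs

def infer_labels(coeffs, flux, start, n_iter=50, eps=1e-6):
    """Test step: Gauss-Newton fit of the labels to a single spectrum,
    using a finite-difference Jacobian (an assumed optimizer choice)."""
    x = np.array(start, dtype=float)
    for _ in range(n_iter):
        model = predict(coeffs, x)[0]
        resid = flux - model
        J = np.empty((flux.size, x.size))
        for k in range(x.size):
            step = np.zeros_like(x)
            step[k] = eps
            J[:, k] = (predict(coeffs, x + step)[0] - model) / eps
        dx, *_ = np.linalg.lstsq(J, resid, rcond=None)
        x += dx
        if np.max(np.abs(dx)) < 1e-10:
            break
    return x

# Synthetic demonstration (all numbers here are made up for illustration).
rng = np.random.default_rng(0)
n_ref, n_pix, n_lab = 200, 50, 3           # e.g. three rescaled labels
true_coeffs = np.zeros((10, n_pix))        # 1 + 3 + 6 quadratic features
true_coeffs[0] = 1.0                       # continuum-normalized baseline
true_coeffs[1:4] = 0.05 * rng.normal(size=(3, n_pix))   # linear terms
true_coeffs[4:] = 0.01 * rng.normal(size=(6, n_pix))    # quadratic terms

ref_labels = rng.uniform(-1, 1, size=(n_ref, n_lab))
ref_flux = predict(true_coeffs, ref_labels)
ref_flux += rng.normal(scale=1e-3, size=ref_flux.shape)  # small flux noise

coeffs = train(ref_flux, ref_labels)

survey_labels = np.array([0.3, -0.5, 0.1])
survey_flux = predict(true_coeffs, survey_labels)[0]
recovered = infer_labels(coeffs, survey_flux, start=np.zeros(n_lab))
print(recovered)
```

Because the model is linear in its coefficients, the training step reduces to an ordinary least-squares solve at each pixel; only the label inference for an individual spectrum is nonlinear. The sketch also reflects the requirement stated in the abstract that the reference set span label-space: labels outside the range covered in training would rely on extrapolating the polynomial.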

    Original language: English (US)
    Article number: 16
    Journal: Astrophysical Journal
    Volume: 808
    Issue number: 1
    DOI: https://doi.org/10.1088/0004-637X/808/1/16
    State: Published - Jul 20 2015

    Keywords

    • methods: data analysis
    • methods: statistical
    • stars: abundances
    • stars: fundamental parameters
    • surveys
    • techniques: spectroscopic

    ASJC Scopus subject areas

    • Space and Planetary Science
    • Astronomy and Astrophysics

    Cite this

    Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. (2015). THE CANNON: A DATA-DRIVEN APPROACH to STELLAR LABEL DETERMINATION. Astrophysical Journal, 808(1), [16]. https://doi.org/10.1088/0004-637X/808/1/16

    @article{20146721888e4c4e95345e7b9c2825ab,
    title = "THE CANNON: A DATA-DRIVEN APPROACH to STELLAR LABEL DETERMINATION",
    keywords = "methods: data analysis, methods: statistical, stars: abundances, stars: fundamental parameters, surveys, techniques: spectroscopic",
    author = "M. Ness and Hogg, {David W.} and Rix, {H. W.} and Ho, {Anna Y Q} and G. Zasowski",
    year = "2015",
    month = "7",
    day = "20",
    doi = "10.1088/0004-637X/808/1/16",
    language = "English (US)",
    volume = "808",
    journal = "Astrophysical Journal",
    issn = "0004-637X",
    publisher = "IOP Publishing Ltd.",
    number = "1",

    }

    TY - JOUR

    T1 - THE CANNON

    T2 - A DATA-DRIVEN APPROACH to STELLAR LABEL DETERMINATION

    AU - Ness, M.

    AU - Hogg, David W.

    AU - Rix, H. W.

    AU - Ho, Anna Y Q

    AU - Zasowski, G.

    PY - 2015/7/20

    Y1 - 2015/7/20

    KW - methods: data analysis

    KW - methods: statistical

    KW - stars: abundances

    KW - stars: fundamental parameters

    KW - surveys

    KW - techniques: spectroscopic

    UR - http://www.scopus.com/inward/record.url?scp=84940118729&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=84940118729&partnerID=8YFLogxK

    U2 - 10.1088/0004-637X/808/1/16

    DO - 10.1088/0004-637X/808/1/16

    M3 - Article

    AN - SCOPUS:84940118729

    VL - 808

    JO - Astrophysical Journal

    JF - Astrophysical Journal

    SN - 0004-637X

    IS - 1

    M1 - 16

    ER -