The need for open source software in machine learning

Sören Sonnenburg, Mikio L. Braun, Soon Ong Cheng, Samy Bengio, Leon Bottou, Geoffrey Holmes, Yann LeCun, Klaus Robert Müller, Fernando Pereira, Carl Edward Rasmussen, Gunnar Rätsch, Bernhard Schölkopf, Alexander Smola, Pascal Vincent, Jason Weston, Robert C. Williamson

Research output: Contribution to journalArticle

Abstract

Open source tools have recently reached a level of maturity which makes them suitable for building large-scale real-world systems. At the same time, the field of machine learning has developed a large body of powerful learning algorithms for diverse applications. However, the true potential of these methods is not used, since existing implementations are not openly shared, resulting in software with low usability, and weak interoperability. We argue that this situation can be significantly improved by increasing incentives for researchers to publish their software under an open source model. Additionally, we outline the problems authors are faced with when trying to publish algorithmic implementations of machine learning methods. We believe that a resource of peer reviewed software accompanied by short articles would be highly valuable to both the machine learning and the general scientific community.

Original languageEnglish (US)
Pages (from-to)2443-2466
Number of pages24
JournalJournal of Machine Learning Research
Volume8
StatePublished - Oct 2007

Fingerprint

Open Source Software
Learning systems
Machine Learning
Open Source
Software
Incentives
Interoperability
Learning algorithms
Usability
Learning Algorithm
Resources
Open source software
Model

Keywords

  • Algorithms
  • Creditability
  • Machine learning
  • Open source
  • Reproducibility
  • Software

ASJC Scopus subject areas

  • Artificial Intelligence

Cite this

Sonnenburg, S., Braun, M. L., Cheng, S. O., Bengio, S., Bottou, L., Holmes, G., ... Williamson, R. C. (2007). The need for open source software in machine learning. Journal of Machine Learning Research, 8, 2443-2466.

The need for open source software in machine learning. / Sonnenburg, Sören; Braun, Mikio L.; Cheng, Soon Ong; Bengio, Samy; Bottou, Leon; Holmes, Geoffrey; LeCun, Yann; Müller, Klaus Robert; Pereira, Fernando; Rasmussen, Carl Edward; Rätsch, Gunnar; Schölkopf, Bernhard; Smola, Alexander; Vincent, Pascal; Weston, Jason; Williamson, Robert C.

In: Journal of Machine Learning Research, Vol. 8, 10.2007, p. 2443-2466.

Research output: Contribution to journalArticle

Sonnenburg, S, Braun, ML, Cheng, SO, Bengio, S, Bottou, L, Holmes, G, LeCun, Y, Müller, KR, Pereira, F, Rasmussen, CE, Rätsch, G, Schölkopf, B, Smola, A, Vincent, P, Weston, J & Williamson, RC 2007, 'The need for open source software in machine learning', Journal of Machine Learning Research, vol. 8, pp. 2443-2466.
Sonnenburg S, Braun ML, Cheng SO, Bengio S, Bottou L, Holmes G et al. The need for open source software in machine learning. Journal of Machine Learning Research. 2007 Oct;8:2443-2466.
Sonnenburg, Sören ; Braun, Mikio L. ; Cheng, Soon Ong ; Bengio, Samy ; Bottou, Leon ; Holmes, Geoffrey ; LeCun, Yann ; Müller, Klaus Robert ; Pereira, Fernando ; Rasmussen, Carl Edward ; Rätsch, Gunnar ; Schölkopf, Bernhard ; Smola, Alexander ; Vincent, Pascal ; Weston, Jason ; Williamson, Robert C. / The need for open source software in machine learning. In: Journal of Machine Learning Research. 2007 ; Vol. 8. pp. 2443-2466.
@article{6e1463121dfb481a912de0c48c7301c4,
title = "The need for open source software in machine learning",
abstract = "Open source tools have recently reached a level of maturity which makes them suitable for building large-scale real-world systems. At the same time, the field of machine learning has developed a large body of powerful learning algorithms for diverse applications. However, the true potential of these methods is not used, since existing implementations are not openly shared, resulting in software with low usability, and weak interoperability. We argue that this situation can be significantly improved by increasing incentives for researchers to publish their software under an open source model. Additionally, we outline the problems authors are faced with when trying to publish algorithmic implementations of machine learning methods. We believe that a resource of peer reviewed software accompanied by short articles would be highly valuable to both the machine learning and the general scientific community.",
keywords = "Algorithms, Creditability, Machine learning, Open source, Reproducibility, Software",
author = "S{\"o}ren Sonnenburg and Braun, {Mikio L.} and Cheng, {Soon Ong} and Samy Bengio and Leon Bottou and Geoffrey Holmes and Yann LeCun and M{\"u}ller, {Klaus Robert} and Fernando Pereira and Rasmussen, {Carl Edward} and Gunnar R{\"a}tsch and Bernhard Sch{\"o}lkopf and Alexander Smola and Pascal Vincent and Jason Weston and Williamson, {Robert C.}",
year = "2007",
month = "10",
language = "English (US)",
volume = "8",
pages = "2443--2466",
journal = "Journal of Machine Learning Research",
issn = "1532-4435",
publisher = "Microtome Publishing",

}

TY - JOUR

T1 - The need for open source software in machine learning

AU - Sonnenburg, Sören

AU - Braun, Mikio L.

AU - Cheng, Soon Ong

AU - Bengio, Samy

AU - Bottou, Leon

AU - Holmes, Geoffrey

AU - LeCun, Yann

AU - Müller, Klaus Robert

AU - Pereira, Fernando

AU - Rasmussen, Carl Edward

AU - Rätsch, Gunnar

AU - Schölkopf, Bernhard

AU - Smola, Alexander

AU - Vincent, Pascal

AU - Weston, Jason

AU - Williamson, Robert C.

PY - 2007/10

Y1 - 2007/10

N2 - Open source tools have recently reached a level of maturity which makes them suitable for building large-scale real-world systems. At the same time, the field of machine learning has developed a large body of powerful learning algorithms for diverse applications. However, the true potential of these methods is not used, since existing implementations are not openly shared, resulting in software with low usability, and weak interoperability. We argue that this situation can be significantly improved by increasing incentives for researchers to publish their software under an open source model. Additionally, we outline the problems authors are faced with when trying to publish algorithmic implementations of machine learning methods. We believe that a resource of peer reviewed software accompanied by short articles would be highly valuable to both the machine learning and the general scientific community.

AB - Open source tools have recently reached a level of maturity which makes them suitable for building large-scale real-world systems. At the same time, the field of machine learning has developed a large body of powerful learning algorithms for diverse applications. However, the true potential of these methods is not used, since existing implementations are not openly shared, resulting in software with low usability, and weak interoperability. We argue that this situation can be significantly improved by increasing incentives for researchers to publish their software under an open source model. Additionally, we outline the problems authors are faced with when trying to publish algorithmic implementations of machine learning methods. We believe that a resource of peer reviewed software accompanied by short articles would be highly valuable to both the machine learning and the general scientific community.

KW - Algorithms

KW - Creditability

KW - Machine learning

KW - Open source

KW - Reproducibility

KW - Software

UR - http://www.scopus.com/inward/record.url?scp=35748939406&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35748939406&partnerID=8YFLogxK

M3 - Article

VL - 8

SP - 2443

EP - 2466

JO - Journal of Machine Learning Research

JF - Journal of Machine Learning Research

SN - 1532-4435

ER -