Visus: An interactive system for automatic machine learning model building and curation

Aecio Santos, Sonia Castelo, Cristian Felix, Jorge Piazentin Ono, Bowen Yu, Sungsoo Hong, Claudio Silva, Enrico Bertini, Juliana Freire

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by synthesizing end-toend ML data processing pipelines. However, these follow a besteffort approach and a user in the loop is necessary to curate and refine the derived pipelines. Since domain experts often have little or no expertise in machine learning, easy-to-use interactive interfaces that guide them throughout the model building process are necessary. In this paper, we present Visus, a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. We describe the framework used to ground our design choices and a usage scenario enabled by Visus. Finally, we discuss the feedback received in user testing sessions with domain experts.

Original languageEnglish (US)
Title of host publicationProceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450367912
DOIs
StatePublished - Jul 5 2019
Event2019 Workshop on Human-In-the-Loop Data Analytics, HILDA 2019, co-located with SIGMOD 2019 - Amsterdam, Netherlands
Duration: Jul 5 2019 → …

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2019 Workshop on Human-In-the-Loop Data Analytics, HILDA 2019, co-located with SIGMOD 2019
CountryNetherlands
CityAmsterdam
Period7/5/19 → …

Fingerprint

Learning systems
Pipelines
Feedback
Testing

Keywords

  • Automatic machine learning
  • Data analytics
  • Data visualization

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Santos, A., Castelo, S., Felix, C., Ono, J. P., Yu, B., Hong, S., ... Freire, J. (2019). Visus: An interactive system for automatic machine learning model building and curation. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019 [a6] (Proceedings of the ACM SIGMOD International Conference on Management of Data). Association for Computing Machinery. https://doi.org/10.1145/3328519.3329134

Visus : An interactive system for automatic machine learning model building and curation. / Santos, Aecio; Castelo, Sonia; Felix, Cristian; Ono, Jorge Piazentin; Yu, Bowen; Hong, Sungsoo; Silva, Claudio; Bertini, Enrico; Freire, Juliana.

Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019. Association for Computing Machinery, 2019. a6 (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Santos, A, Castelo, S, Felix, C, Ono, JP, Yu, B, Hong, S, Silva, C, Bertini, E & Freire, J 2019, Visus: An interactive system for automatic machine learning model building and curation. in Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019., a6, Proceedings of the ACM SIGMOD International Conference on Management of Data, Association for Computing Machinery, 2019 Workshop on Human-In-the-Loop Data Analytics, HILDA 2019, co-located with SIGMOD 2019, Amsterdam, Netherlands, 7/5/19. https://doi.org/10.1145/3328519.3329134
Santos A, Castelo S, Felix C, Ono JP, Yu B, Hong S et al. Visus: An interactive system for automatic machine learning model building and curation. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019. Association for Computing Machinery. 2019. a6. (Proceedings of the ACM SIGMOD International Conference on Management of Data). https://doi.org/10.1145/3328519.3329134
Santos, Aecio ; Castelo, Sonia ; Felix, Cristian ; Ono, Jorge Piazentin ; Yu, Bowen ; Hong, Sungsoo ; Silva, Claudio ; Bertini, Enrico ; Freire, Juliana. / Visus : An interactive system for automatic machine learning model building and curation. Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019. Association for Computing Machinery, 2019. (Proceedings of the ACM SIGMOD International Conference on Management of Data).
@inproceedings{fe51b1264d2f4aa3be3a2572d6327881,
title = "Visus: An interactive system for automatic machine learning model building and curation",
abstract = "While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by synthesizing end-toend ML data processing pipelines. However, these follow a besteffort approach and a user in the loop is necessary to curate and refine the derived pipelines. Since domain experts often have little or no expertise in machine learning, easy-to-use interactive interfaces that guide them throughout the model building process are necessary. In this paper, we present Visus, a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. We describe the framework used to ground our design choices and a usage scenario enabled by Visus. Finally, we discuss the feedback received in user testing sessions with domain experts.",
keywords = "Automatic machine learning, Data analytics, Data visualization",
author = "Aecio Santos and Sonia Castelo and Cristian Felix and Ono, {Jorge Piazentin} and Bowen Yu and Sungsoo Hong and Claudio Silva and Enrico Bertini and Juliana Freire",
year = "2019",
month = "7",
day = "5",
doi = "10.1145/3328519.3329134",
language = "English (US)",
series = "Proceedings of the ACM SIGMOD International Conference on Management of Data",
publisher = "Association for Computing Machinery",
booktitle = "Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019",

}

TY - GEN

T1 - Visus

T2 - An interactive system for automatic machine learning model building and curation

AU - Santos, Aecio

AU - Castelo, Sonia

AU - Felix, Cristian

AU - Ono, Jorge Piazentin

AU - Yu, Bowen

AU - Hong, Sungsoo

AU - Silva, Claudio

AU - Bertini, Enrico

AU - Freire, Juliana

PY - 2019/7/5

Y1 - 2019/7/5

N2 - While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by synthesizing end-toend ML data processing pipelines. However, these follow a besteffort approach and a user in the loop is necessary to curate and refine the derived pipelines. Since domain experts often have little or no expertise in machine learning, easy-to-use interactive interfaces that guide them throughout the model building process are necessary. In this paper, we present Visus, a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. We describe the framework used to ground our design choices and a usage scenario enabled by Visus. Finally, we discuss the feedback received in user testing sessions with domain experts.

AB - While the demand for machine learning (ML) applications is booming, there is a scarcity of data scientists capable of building such models. Automatic machine learning (AutoML) approaches have been proposed that help with this problem by synthesizing end-toend ML data processing pipelines. However, these follow a besteffort approach and a user in the loop is necessary to curate and refine the derived pipelines. Since domain experts often have little or no expertise in machine learning, easy-to-use interactive interfaces that guide them throughout the model building process are necessary. In this paper, we present Visus, a system designed to support the model building process and curation of ML data processing pipelines generated by AutoML systems. We describe the framework used to ground our design choices and a usage scenario enabled by Visus. Finally, we discuss the feedback received in user testing sessions with domain experts.

KW - Automatic machine learning

KW - Data analytics

KW - Data visualization

UR - http://www.scopus.com/inward/record.url?scp=85072792719&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85072792719&partnerID=8YFLogxK

U2 - 10.1145/3328519.3329134

DO - 10.1145/3328519.3329134

M3 - Conference contribution

AN - SCOPUS:85072792719

T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data

BT - Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019

PB - Association for Computing Machinery

ER -