TY - JOUR
T1 - Declarative Recursive Computation on an RDBMS or, Why You Should Use a Database For Distributed Machine Learning
AU - Jankov, Dimitrije
AU - Luo, Shangyu
AU - Yuan, Binhang
AU - Cai, Zhuhua
AU - Zou, Jia
AU - Jermaine, Chris
AU - Gao, Zekai J.
N1 - Publisher Copyright:
© 2019, VLDB Endowment.
PY - 2018
Y1 - 2018
N2 - A number of popular systems, most notably Google's TensorFlow, have been implemented from the ground up to support machine learning tasks. We consider how to make a very small set of changes to a modern relational database management system (RDBMS) to make it suitable for distributed learning computations. Changes include adding better support for recursion, and optimization and execution of very large compute plans. We also show that there are key advantages to using an RDBMS as a machine learning platform. In particular, learning based on a database management system allows for trivial scaling to large data sets and especially large models, where different computational units operate on different parts of a model that may be too large to fit into RAM.
AB - A number of popular systems, most notably Google's TensorFlow, have been implemented from the ground up to support machine learning tasks. We consider how to make a very small set of changes to a modern relational database management system (RDBMS) to make it suitable for distributed learning computations. Changes include adding better support for recursion, and optimization and execution of very large compute plans. We also show that there are key advantages to using an RDBMS as a machine learning platform. In particular, learning based on a database management system allows for trivial scaling to large data sets and especially large models, where different computational units operate on different parts of a model that may be too large to fit into RAM.
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000497519200008
UR - https://openalex.org/W4288365054
UR - https://www.scopus.com/pages/publications/85077239058
U2 - 10.14778/3317315.3317323
DO - 10.14778/3317315.3317323
M3 - Conference article published in journal
SN - 2150-8097
VL - 12
SP - 822
EP - 835
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
IS - 7
T2 - 45th International Conference on Very Large Data Bases, VLDB 2019
Y2 - 26 August 2017 through 30 August 2017
ER -