Petuum: A New Platform for Distributed
Machine Learning on Big Data

Friday, February 12th, 2015
6:30pm – 7:30pm
SkullSpace, 2nd Floor, 374 Donald St.

(Please RSVP via the Meetup link at the bottom, so we have enough seating.)


How can one build a distributed framework that allows efficient deployment of a wide spectrum of modern advanced machine learning (ML) programs for industrial-scale problems using Big Models (100s of billions of parameters) on Big Data (terabytes or petabytes)? Contemporary parallelization strategies employ fine-grained operations and scheduling beyond the classic bulk-synchronous processing paradigm popularized by MapReduce, or even specialized operators relying on graphical representations of ML programs. The variety of approaches tends to pull systems and algorithms design in different directions, and it remains difficult to find a universal platform applicable to a wide range of different ML programs at scale. We propose a general-purpose framework that systematically addresses data- and model-parallel challenges in large-scale ML.

Check back for updates and slides of the presentation

Presenter Bio

Shibashis Bal completed his B. Sc. (Mathematics) at the University of Manitoba. He has obtained the MCSE, CNE, CISSP, and CISA designations. After using his skills in some far corners of the world, he currently owns and operates 6279040 Canada Inc., an information security, audit and analysis company.  When not engaged directly in IT security audit, he is applying statistical techniques to health, fitness and financial analysis.

What is Papers We Love?

What was the last paper within the realm of computing you read and loved? What did it inspire you to build or tinker with? Papers We Love is a Meetup group for sharing the ideas in awesome academic and research papers with fellow programmers, engineers, and paper-readers. While everyone is welcome to present, you can also just come, listen, and discuss.