Tensor Comprehensions: deep learning as a polyhedral compiler's killer app (ARRAY 2018)

Track

ARRAY 2018

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 19 Jun 2018 09:00 - 10:00 at Grand Ballroom C - Keynote

Abstract

Deep learning models with convolutional and recurrent networks analyze massive amounts of audio, image, video, text and graph data, with applications to automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc. Competing frameworks for building these networks such as TensorFlow, Chainer, CNTK, Torch/PyTorch, Caffe1/2, MXNet and Theano, explore different tradeoffs between usability and expressiveness, research or production orientation and supported hardware. They operate on a DAG of computational operators, wrapping high-performance libraries such as CUDNN (for NVIDIA GPUs) or NNPACK (for various CPUs), and automate memory allocation, synchronization, distribution. Custom operators are needed where the computation does not fit existing high-performance library calls, usually at a high engineering cost. Such operators suffer a severe performance penalty, which limits the pace of innovation. Furthermore, existing library primitives often do not offer optimal performance in a particular network architecture, missing optimizations between operators as well as specialization to the size and shape of data.

We will survey the work-in-progress design of

(1) a language close to the mathematics of deep learning called Tensor Comprehensions, featuring interesting developments in the areas of automatic range inference, declarative array programming, and data-flow modeling of recurrent networks;

(2) a polyhedral Just-In-Time compiler to convert a mathematical description of a deep learning DAG into a CUDA kernel with delegated memory management and synchronization, also providing optimizations such as operator fusion and specialization for specific sizes;

(3) a high level metaprogramming environment and compilation cache populated by an autotuner, acting as a built-to-order library.

Our first results demonstrate the suitability of the polyhedral framework to construct a fully automatic, domain-specific optimizer, effective on state-of-the-art deep learning models and targeting NVIDIA GPUs. Our compilation flow reaches up to 4x speedup over NVIDIA libraries on kernels relevant to the Machine Learning Community, and on an actual model used in production at Facebook. TC also facilitates algorithmic exploration, exposing up to 2 orders of magnitude speedup on research layers. It is open source, integrated with mainstream frameworks Caffe2 (production-oriented) and PyTorch (research-oriented). TC is still at an early stage, and looking for contributions and collaboration.

https://research.fb.com/announcing-tensor-comprehensions

http://pytorch.org/2018/03/05/tensor-comprehensions.html

Link to Preprint

https://arxiv.org/abs/1802.04730

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 19 Jun
Displayed time zone: Eastern Time (US & Canada) change

09:00 - 10:00	KeynoteARRAY at Grand Ballroom C

09:00 60m Talk		Tensor Comprehensions: deep learning as a polyhedral compiler's killer app ARRAY K: Albert Cohen Inria, France / ENS, France Pre-print

Tensor Comprehensions: deep learning as a polyhedral compiler's killer app

Tue 19 Jun
Displayed time zone: Eastern Time (US & Canada) change

Albert CohenKeynote Speaker

Inria, France / ENS, France

Tracks

Co-hosted Conferences

Workshops

Tensor Comprehensions: deep learning as a polyhedral compiler's killer app

Program Display Configuration

Program Display Configuration

Tue 19 JunDisplayed time zone: Eastern Time (US & Canada) change

Albert CohenKeynote Speaker

Inria, France / ENS, France

Tue 19 Jun
Displayed time zone: Eastern Time (US & Canada) change