With Theano support dropped, it should be easy to make our models use multiple GPUs, not just with batch parallelism, and to put some parts of the model on the CPU (e.g., the embedding layer, as recommended by Matt Peters). I think this is pretty straightforward, but I haven't done it before. We should:
- Write some documentation with recommendations for how and when to use this (thinking of people new to the codebase and to deep learning in general; can we give them some guidance on how to structure a model for optimal efficiency?).
- Implement some reasonable defaults, like putting the embedding layer on the CPU, in TextTrainer (a rough sketch of the idea follows this list).
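For illustration, here is a minimal sketch of what "embedding on the CPU" could look like, using plain Keras with the TensorFlow backend and a `tf.device` scope. The layer names and sizes are invented for the example and this is not the actual TextTrainer code; the point is just that anything built inside the scope (the embedding weights and the lookup op) gets pinned to the CPU while the rest of the model stays on the default device.

```python
import tensorflow as tf
from keras.layers import Dense, Embedding, Input, LSTM
from keras.models import Model

# Hypothetical sizes, purely for illustration.
vocab_size, embedding_dim, sequence_length = 50000, 300, 40

word_ids = Input(shape=(sequence_length,), dtype='int32')

# The large embedding matrix and the lookup op are placed on the CPU,
# so they do not consume GPU memory.
with tf.device('/cpu:0'):
    embedded = Embedding(vocab_size, embedding_dim)(word_ids)

# The rest of the model runs on the default device (a GPU, if one is available).
encoded = LSTM(128)(embedded)
predictions = Dense(2, activation='softmax')(encoded)

model = Model(inputs=word_ids, outputs=predictions)
model.compile(optimizer='adam', loss='categorical_crossentropy')
```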
With the batch parallelism PR merged, I'm renaming this issue to focus on the one remaining piece: I believe models can already use model parallelism if you want it, by wrapping parts of the model in device scopes. Making sure this actually works and providing some documentation for it would be nice, but it's not super high priority.
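As a rough sketch of what "model parallelism via device scopes" might look like here, again with plain Keras on the TensorFlow backend (the two-GPU split and the layer choices are made up for the example):

```python
import tensorflow as tf
from keras.layers import Dense, Embedding, Input, LSTM
from keras.models import Model

word_ids = Input(shape=(40,), dtype='int32')

# First part of the model on one GPU...
with tf.device('/gpu:0'):
    embedded = Embedding(50000, 300)(word_ids)
    encoded = LSTM(256, return_sequences=True)(embedded)

# ...and the rest on another; TensorFlow inserts the cross-device copies.
with tf.device('/gpu:1'):
    encoded = LSTM(256)(encoded)
    predictions = Dense(2, activation='softmax')(encoded)

model = Model(inputs=word_ids, outputs=predictions)
model.compile(optimizer='adam', loss='categorical_crossentropy')
```

Depending on the TensorFlow version, you may also want `allow_soft_placement=True` in the session config so that ops without a GPU kernel fall back to the CPU instead of erroring out.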
I think the more important remaining piece of parallelism is getting it to work with the various data generators and padding code we have, rather than model parallelism, but in general it would be nice to double-check that this works as smoothly as it should.