
Avoid instantiating huge tensors as input to similarity functions #308

Open
matt-gardner opened this issue Apr 20, 2017 · 4 comments

@matt-gardner
Contributor

I'm not sure exactly how this would work yet, but our current approach takes a huge amount of memory: we tile both inputs up to a common shape and then do an elementwise multiplication. There might be a way to compute the same thing with some kind of batch_dot or dot instead.
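For concreteness, here's a rough sketch of what the tiled formulation costs (the shapes and variable names are made up for illustration, not our actual code):

```python
import tensorflow as tf

# Illustrative shapes: batch=32, 200 rows on each side, embedding dim=300.
queries = tf.random_normal([32, 200, 300])
keys = tf.random_normal([32, 200, 300])

# Tiled approach: broadcast both tensors up to (batch, num_queries, num_keys, dim),
# multiply elementwise, then sum over the last axis.  The 4-D intermediate holds
# 32 * 200 * 200 * 300 floats (~1.5 GB at float32) just to get dot products.
tiled_queries = tf.expand_dims(queries, 2)   # (32, 200, 1, 300)
tiled_keys = tf.expand_dims(keys, 1)         # (32, 1, 200, 300)
similarities = tf.reduce_sum(tiled_queries * tiled_keys, axis=-1)  # (32, 200, 200)
```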

@matt-gardner
Contributor Author

It looks like tf.einsum might do the trick, at least for simple similarity functions. For more complicated ones, I'm not sure.
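Something along these lines (just a sketch, with illustrative shapes) would compute a plain dot-product similarity without ever building the 4-D intermediate:

```python
import tensorflow as tf

queries = tf.random_normal([32, 200, 300])
keys = tf.random_normal([32, 200, 300])

# Contract over the embedding dimension directly; the largest tensor that
# gets materialized is the (32, 200, 200) similarity matrix itself.
similarities = tf.einsum('bqd,bkd->bqk', queries, keys)
```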

@matt-peters

tf.matmul works well for generic dot-product-based similarities. It's probably a lot faster, too, since it calls the optimized matrix routines directly.
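For example (again just a sketch with made-up shapes), a batched matmul gives the (batch, num_rows_1, num_rows_2) similarity matrix directly:

```python
import tensorflow as tf

queries = tf.random_normal([32, 200, 300])
keys = tf.random_normal([32, 200, 300])

# Batched matrix multiply dispatches to the optimized BLAS / cuBLAS kernels
# and never materializes the tiled 4-D intermediate.
similarities = tf.matmul(queries, keys, transpose_b=True)  # (32, 200, 200)
```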

@matt-gardner
Contributor Author

The issue is that our similarity functions try to be flexible, letting you easily swap in different parameterized and non-parameterized functions when computing attentions. The trouble is that the way we make that easy is by taking a whole lot of memory, so we need to re-think the API a bit.
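One possible shape for a reworked API (purely hypothetical, not something we've implemented): hand the similarity function the two un-tiled (batch, num_rows, dim) tensors and let it decide how to produce the full similarity matrix, so a dot-product similarity can use a matmul while fancier parameterized ones do whatever they need internally.

```python
import tensorflow as tf

class DotProductMatrixSimilarity:
    """Hypothetical interface: take two un-tiled tensors of shape
    (batch, num_rows_1, dim) and (batch, num_rows_2, dim) and return a
    (batch, num_rows_1, num_rows_2) similarity matrix."""

    def compute_similarity_matrix(self, tensor_1, tensor_2):
        # A plain dot-product similarity never needs the tiled intermediate.
        return tf.matmul(tensor_1, tensor_2, transpose_b=True)
```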

@matt-gardner
Contributor Author

I'm decreasing the priority of this, as the adaptive batch size and dynamic padding stuff makes this not too big of an issue anymore.

It'd still be a nice optimization, and would likely make runtimes faster, but it's not blocking anything anymore.

@matt-gardner matt-gardner added P2 and removed P1 labels May 10, 2017