Commit Graph

6 Commits

Author SHA1 Message Date
Zach DeVito
c8b303e853 guard dump, guard cuda 2017-09-05 17:48:55 -04:00
Zach DeVito
f4b7178b59 track scalar type 2017-09-05 17:48:55 -04:00
Zach DeVito
b6175eb54d enable fusion group execution in autograd closure. implement chunk. propagate type information through fusion optimization. 2017-09-05 17:48:55 -04:00
Adam Paszke
233a66dcbe Remove SimpleMap from JIT IR 2017-09-05 17:48:55 -04:00
Edward Z. Yang
ea4aaa6b0b Document TemplateEnv & PR fixes 2017-09-05 17:48:55 -04:00
Zach DeVito
50e51eaa7f Fusion of simple map operations using nvrtc.
Approach is based on the approach of THC's pointwiseApply{1,2,3} family of kernels,
but doesn't have any dependencies on that code.

Adjacent contiguous dimensions of input tensors are compressed to reduce the complexity of indexing math.
For the completely contiguous case, the indexing logic simplifies to just the linear index.

In simple tests, this code matched or beat the equivalent from THC.
2017-09-05 17:48:55 -04:00