Skip to content

Workaround for 0 bytes allocation for SYCL device#42

Merged
benoitsteiner merged 46 commits into
benoitsteiner:masterfrom
lukeiwanski:master
Feb 3, 2017
Merged

Workaround for 0 bytes allocation for SYCL device#42
benoitsteiner merged 46 commits into
benoitsteiner:masterfrom
lukeiwanski:master

Conversation

@lukeiwanski
Copy link
Copy Markdown
Collaborator

No description provided.

luke iwanski added 30 commits December 30, 2016 10:44
This commit reduces number of failing tests when TensorFlow compiles
for OpenCL support.

 - Registration of Type Traits required for stride slise op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased
Removed FloorDiv, Reverse and ReverseV2 registration for SYCL device
as the cpu implementation of the kernels won't be able to access
the input data.
 - Expm1, Greater, GreaterEqual, Maximum, Reciprocal, Select, Sigmoid, Sign Ops for SYCL
 - Disables ComplexPow for SYCL device
 - Registers Cast, LessEqual, Minimum, MatMul Ops
 - Disables BatchMatMul test for SYCL device
 - MatMul only for floats
 - Int32 Add Op registration
 - Equal, Max, Mean, Min, Prod Op registered for SYCL
 - Disabled Mean_XXX tests for SYCL device due to lack
   of support from Eigen
     - ComputeCpp no longer compiles external projects nor llvm
     - ComputeCpp checks file to determin if the device compilation pass is needed
 - ReduceFunctor specialisation for MeanReducer on the SYCLDevice
 - Re-enable Mean tests
 - Fixes documentation issue mentioned in benoitsteiner#32 (comment)
Conflicts:
	tensorflow/workspace.bzl
@benoitsteiner benoitsteiner merged commit 80e7aba into benoitsteiner:master Feb 3, 2017
benoitsteiner added a commit that referenced this pull request Feb 25, 2017
* OpenCL improvements

Added Tile, Transpose and Range Ops double support for SYCL device.
Moved gpu_device_name() to test_util.py so now it can be used in force_gpu to pull either GPU or SYCL depending on what is available in the system.

* Improvements to the SYCL device support

 - Registration of Type Traits required for stride slice op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased

* SYCL support improvements (#31)

* Improvements to the SYCL device support

This commit reduces number of failing tests when TensorFlow compiles
for OpenCL support.

 - Registration of Type Traits required for stride slice op
 - Registration of ConcatOffset, _ListToArray, _ArrayToList
   Pad, Reverse ( CPU ), ReverseV2 ( CPU ), Size, ExpandDims,
   Squeeze, StridedSlice, StridedSliceGrad, StridedSliceAssign,
   TileGrad, InvertPermutation, Transpose
 - Registration of Sycl kernels only for essential data types
 - Floor_div_real has been disabled for SYCL device
 - Device in control_flow_ops_py_test.py needed to be lower cased

* Fixes & Version bump (#33)

* Fix Unbuntu typo. (#38)

unbuntu -> ubuntu

* Add problem descriptions and solutions (#35)

* Add ComputeCpp lib folder to LD_LIBRARY_PATH

* Add ImportError problem + solution

If you get the error message "ImportError: libComputeCpp.so: cannot open shared
object file: No such file or directory", make sure you have added the
path to ComputeCpp's lib folder to your `LD_LIBRARY_PATH`.

* Add another ImportError problem + solution

If you get the error message "ImportError: cannot import name
'pywrap_tensorflow'" you may be standing in the TensorFlow directory.

* Improvements to the SYCL device support

* Registers FloorDiv, FloorMod and SoftMax Ops for SYCL device

* Workaround for 0 bytes allocation for SYCL device (#42)

* Sycl improvements (#44)

- Eigen version bump
 - Extends Cast and Cwise ops benchmark to cover Sycl device
 - Extends device_lib_test.py to cover Sycl device
 - Registers int32, string and ResourceHandler to run on host for
   Enter and RefEnter Sycl Ops
 - Enables RecudeMax op for Sycl since Eigen implementation is ready
 - Registers Less op for Sycl device

* Improved the formatting of the SYCL code

* Fixed compilation error.

* Made sure that using test sessions with force_gpu=True forces the
placement on a gpu device even if none is detected.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants