v0.15.1
Summary
This release reaches the milestone of 100% compliance of dpctl.tensor functions with the Python Array API 2022.12 standard for the main namespace.
Added
- Added reduction functions dpctl.tensor.min, dpctl.tensor.max, dpctl.tensor.argmin, dpctl.tensor.argmax, and dpctl.tensor.prod per Python Array API specifications: #1399
- Added dedicated in-place operations for binary elementwise operations and deployed them in Python operators of dpctl.tensor.usm_ndarray type: #1431, #1447
- Added new elementwise functions dpctl.tensor.cbrt, dpctl.tensor.rsqrt, dpctl.tensor.exp2, dpctl.tensor.copysign, dpctl.tensor.angle, and dpctl.tensor.reciprocal: #1443, #1474
- Added statistical functions dpctl.tensor.mean, dpctl.tensor.std, dpctl.tensor.var per Python Array API specifications: #1465
- Added sorting functions dpctl.tensor.sort and dpctl.tensor.argsort, and set functions dpctl.tensor.unique_values, dpctl.tensor.unique_counts, dpctl.tensor.unique_inverse, dpctl.tensor.unique_all: #1483
- Added linear algebra functions from the Array API namespace dpctl.tensor.matrix_transpose, dpctl.tensor.matmul, dpctl.tensor.vecdot, and dpctl.tensor.tensordot: #1490, #1525, #1541
- Added dpctl.tensor.clip function: #1444, #1505
- Added custom reduction functions dpt.logsumexp (reduction using binary function dpctl.tensor.logaddexp), dpt.reduce_hypot (reduction using binary function dpctl.tensor.hypot): #1446
- Added inspection API to query capabilities of Python Array API specification implementation: #1469
- Support for compilation for NVIDIA(R) sycl target with use of CodePlay oneAPI plug-in: #1411, #1124
- Added dpctl.utils.intel_device_info function to query additional information about Intel(R) GPU devices: gh-1428 and gh-1445
- Added support for two new device descriptors, dpctl.SyclDevice.max_mem_alloc_size and dpctl.SyclDevice.max_clock_frequency: #1530
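As a rough illustration of how several of the functions added above might be called, here is a minimal sketch. It assumes a working dpctl installation with at least one SYCL device; when that is not available it falls back to NumPy, which exposes the same function names used here. The fallback, the helper `to_list`, and the sample data are assumptions made for this example and are not part of the release.

```python
# Sketch only: the NumPy fallback and the to_list helper are illustrative
# assumptions, not part of dpctl.
import numpy as np

try:
    import dpctl.tensor as dpt
    dpt.asarray([0])  # allocation raises if no SYCL device is available
    xp = dpt
except Exception:
    xp = np


def to_list(a):
    """Copy an array (or 0-d result) to the host and return Python values."""
    if xp is np:
        return np.asarray(a).tolist()
    return dpt.asnumpy(a).tolist()


x = xp.asarray([3.0, 1.0, 2.0, 4.0])

# Reductions and statistical functions
smallest = to_list(xp.min(x))
largest = to_list(xp.max(x))
index_of_min = to_list(xp.argmin(x))
index_of_max = to_list(xp.argmax(x))
product = to_list(xp.prod(x))
average = to_list(xp.mean(x))

# Sorting
sorted_values = to_list(xp.sort(x))
sorting_order = to_list(xp.argsort(x))

# Clipping values into the interval [1.5, 3.5]
clipped = to_list(xp.clip(x, 1.5, 3.5))

# Linear algebra: product of a 2x2 matrix with itself
m = xp.reshape(x, (2, 2))
m_squared = to_list(xp.matmul(m, m))

# In-place operators on the array type
y = xp.asarray([1.0, 2.0, 3.0])
y += 1.0
y *= y
in_place_result = to_list(y)
```

Functions that exist only in dpctl, such as the custom reductions logsumexp and reduce_hypot, are left out of the sketch because the fallback namespace does not provide them.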
Changed
- Functions dpctl.tensor.result_type and dpctl.tensor.can_cast became device-aware: #1488, #1473
- Implementation of method dpctl.SyclEvent.wait_for changed to use sycl::event::wait instead of sycl::event::wait_and_throw: gh-1436
- dpctl.tensor.astype was changed to support the device keyword as per Python Array API specification: #1511
- C++ header files in libtensor/include/kernels containing implementations of SYCL kernels no longer depend on "pybind11.h": #1516