Outputs a tensor containing the reduction across all input tensors.
tf.raw_ops.NcclAllReduce( input, reduction, num_devices, shared_name, name=None )
Outputs a tensor containing the reduction across all input tensors passed to ops within the same `shared_name.
The graph should be constructed so if one op runs with shared_name value c
, then num_devices
ops will run with shared_name value c
. Failure to do so will cause the graph execution to fail to complete.
input: the input to the reduction data: the value of the reduction across all num_devices
devices. reduction: the reduction operation to perform. num_devices: The number of devices participating in this reduction. shared_name: Identifier that shared between ops of the same reduction.
Returns | |
---|---|
A Tensor . Has the same type as input . |