scatter

paddle. scatter ( x, index, updates, overwrite=True, name=None ) [源代码] 

通过基于 updates 来更新选定索引 index 上的输入来获得输出。具体行为如下：

如下图，当 overwrite 为 True 的时候使用覆盖模式更新相同索引的输出，依次将 x[index[i]] 更新为 update[i] ；而当 overwrite 为 False 时使用累加模式更新相同索引的输出，先依次将 x[index[i]] 更新为与该行大小相同的元素值均为 0 的 Tensor ，再依次将 update[i] 加到 x[index[i]] 产生输出。

  >>> import paddle >>> #input: >>> x = paddle.to_tensor([[1, 1], [2, 2], [3, 3]], dtype='float32') >>> index = paddle.to_tensor([2, 1, 0, 1], dtype='int64') >>> # shape of updates should be the same as x >>> # shape of updates with dim > 1 should be the same as input >>> updates = paddle.to_tensor([[1, 1], [2, 2], [3, 3], [4, 4]], dtype='float32') >>> overwrite = False >>> # calculation: >>> if not overwrite: ... for i in range(len(index)): ... x[index[i]] = paddle.zeros([2]) >>> for i in range(len(index)): ... if (overwrite): ... x[index[i]] = updates[i] ... else: ... x[index[i]] += updates[i] >>> # output: >>> out = paddle.to_tensor([[3, 3], [6, 6], [1, 1]]) >>> print(out.shape) [3, 2]  

Notice： 因为 updates 的应用顺序是不确定的，因此，如果索引 index 包含重复项，则输出将具有不确定性。

参数

x (Tensor) - ndim> = 1 的输入 N-D Tensor。数据类型可以是 float32，float64。

index （Tensor）- 一维或者零维 Tensor。数据类型可以是 int32，int64。 index 的长度不能超过 updates 的长度，并且 index 中的值不能超过输入的长度。

updates （Tensor）- 根据 index 使用 update 参数更新输入 x。当 index 为一维 tensor 时，updates 形状应与输入 x 相同，并且 dim>1 的 dim 值应与输入 x 相同。当 index 为零维 tensor 时，updates 应该是一个 (N-1)-D 的 Tensor，并且 updates 的第 i 个维度应该与 x 的 i+1 个维度相同。

overwrite （bool，可选）- 指定索引 index 相同时，更新输出的方式。如果为 True，则使用覆盖模式更新相同索引的输出，如果为 False，则使用累加模式更新相同索引的输出。默认值为 True。

name (str，可选) - 具体用法请参见 api_guide_Name，一般无需设置，默认值为 None。

返回

Tensor，与 x 有相同形状和数据类型。

代码示例

  >>> import paddle >>> x = paddle.to_tensor([[1, 1], [2, 2], [3, 3]], dtype='float32') >>> index = paddle.to_tensor([2, 1, 0, 1], dtype='int64') >>> updates = paddle.to_tensor([[1, 1], [2, 2], [3, 3], [4, 4]], dtype='float32') >>> output1 = paddle.scatter(x, index, updates, overwrite=False) >>> print(output1) Tensor(shape=[3, 2], dtype=float32, place=Place(cpu), stop_gradient=True, [[3., 3.], [6., 6.], [1., 1.]]) >>> output2 = paddle.scatter(x, index, updates, overwrite=True) >>> # CPU device: >>> # [[3., 3.], >>> # [4., 4.], >>> # [1., 1.]] >>> # GPU device maybe have two results because of the repeated numbers in index >>> # result 1: >>> # [[3., 3.], >>> # [4., 4.], >>> # [1., 1.]] >>> # result 2: >>> # [[3., 3.], >>> # [2., 2.], >>> # [1., 1.]]  

使用本API的教程文档¶

PyTorch 写法，dim=0