torch_npu.npu_rotary_mul

torch_npu.npu_rotary_mul(Tensor x, Tensor r1, Tensor r2) -> Tensor
Implements rotary position embedding (RotaryEmbedding). The computation is equivalent to:
x1, x2 = torch.chunk(x, 2, -1)
x_new = torch.cat((-x2, x1), dim=-1)
output = r1 * x + r2 * x_new
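
The formula above can be checked on CPU with plain PyTorch. The following is a minimal sketch, not the operator's implementation: the function name rotary_mul_reference and the tensor shapes are hypothetical, and treating r1 as the cosine table and r2 as the sine table is an assumption based on the usual rotary embedding convention. It does not call torch_npu.

```python
import torch


def rotary_mul_reference(x, r1, r2):
    """CPU reference for the formula above (hypothetical helper, not part of torch_npu)."""
    # Split the last dimension into two equal halves.
    x1, x2 = torch.chunk(x, 2, dim=-1)
    # Rotate the halves: (x1, x2) -> (-x2, x1).
    x_new = torch.cat((-x2, x1), dim=-1)
    # Combine with the two coefficient tensors (broadcasting applies).
    return r1 * x + r2 * x_new


# Assumed example shapes; r1 and r2 broadcast against x.
x = torch.randn(2, 4, 3, 8)
r1 = torch.ones(1, 4, 1, 8)   # assumed cosine table; cos = 1 means no rotation
r2 = torch.zeros(1, 4, 1, 8)  # assumed sine table; sin = 0 means no rotation
out = rotary_mul_reference(x, r1, r2)
```

With r1 all ones and r2 all zeros the result equals x, which makes a convenient sanity check before comparing against the NPU operator.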