融合算子替换
RotaryMul & RotaryMulGrad
RmsNorm & RmsNormGrad
ScaledMaskedSoftmax & ScaledMaskedSoftmaxGrad
MatmulAllReduce
FlashAttentionScore
SwiGlu
父主题:
NPU亲和适配优化