torch.npu.npu.NPUPluggableAllocator

定义文件

torch_npu/npu/memory.py

接口原型

torch_npu.npu.NPUPluggableAllocator(path_to_so_file, alloc_fn_name, free_fn_name)

功能描述

从so文件加载的NPU内存分配器。

参数说明

path_to_so_file：(str) so文件路径。
alloc_fn_name：(str)内存申请函数名（与c/c++文件中函数名一致）。
free_fn_name：(str)内存释放函数名（与c/c++文件中函数名一致）。

约束说明

alloc_fn_name内存申请函数名必须与c/c++文件中函数名一致。

free_fn_name内存释放函数名必须与c/c++文件中函数名一致。

支持的型号

Atlas 训练系列产品
Atlas A2 训练系列产品
Atlas 推理系列产品

调用示例

Python代码示例：

import torch
import torch_npu
# Load the allocator
new_alloc = torch_npu.npu.memory.NPUPluggableAllocator('pluggable_allocator_extensions.so', 'my_malloc', 'my_free')
# Swap the current allocator
torch_npu.npu.memory.change_current_allocator(new_alloc)
# This will allocate memory in the device using the new allocator
npu_tensor = torch.zeros(10, device='npu')

风险提示：自定义的so建议严格参照安全代码示例，内存申请，释放函数必须正确实现。错误的so文件可能出现内存申请失败，内存泄漏等问题。

建议处理方式：用户可在内存申请，释放等内存相关操作函数中，增加日志记录内存行为，方便后续定位问题。

c++安全代码示例：

#include <sys/types.h>
#include <iostream>
#include <torch/extension.h>
 
#include "third_party/acl/inc/acl/acl_base.h"
#include "third_party/acl/inc/acl/acl_rt.h"
#include "torch_npu/csrc/core/npu/NPUCachingAllocator.h"
 
extern "C" {
 
void* my_malloc(ssize_t size, int device, aclrtStream stream)
{
    void *ptr;
    aclrtMallocAlign32(&ptr, size, aclrtMemMallocPolicy::ACL_MEM_MALLOC_HUGE_FIRST);
    std::cout<<"alloc ptr = "<<ptr<<", size = "<<size<<std::endl;
    useflag = true;
    return ptr;
}
 
void my_free(void* ptr, ssize_t size, int device, aclrtStream stream)
{
    std::cout<<"free ptr = "<<ptr<<std::endl;
    aclrtFree(ptr);
}
}

日志记录示例：可使用Ascend Extension for PyTorch自带的debug级别日志打印。日志需注意存放至安全路径。

#include "torch_npu/csrc/core/npu/npu_log.h"
 
ASCEND_LOGD("Pluggable Allocator malloc: malloc = %zu", size);
 
ASCEND_LOGD("Pluggable Allocator free: free= %zu", size);

父主题： Memory management