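Creating Log Directories
On each relevant node, create the parent log directory for the components and a log directory for each component, then set the owner and permissions of each directory.
Procedure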
- Run the following commands to create the parent log directory for the components.

      mkdir -m 755 /var/log/mindx-dl
      chown root:root /var/log/mindx-dl
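- Create the log directories for the components you actually use; see Table 1 for each component's log directory. To create log directories for all components, follow Step 3.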
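Table 1 Log paths of cluster scheduling components

| Component | Command for creating the log directory | Node where the log path is created | Description |
| --- | --- | --- | --- |
| Ascend Device Plugin | `mkdir -m 750 /var/log/mindx-dl/devicePlugin`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/devicePlugin` | Compute node | - |
| NPU Exporter | `mkdir -m 750 /var/log/mindx-dl/npu-exporter`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/npu-exporter` | Compute node | - |
| NodeD | `mkdir -m 750 /var/log/mindx-dl/noded`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/noded` | Compute node | - |
| Elastic Agent | `mkdir -m 750 /var/log/mindx-dl/elastic`<br>`chown <user-defined> /var/log/mindx-dl/elastic`<br>Note: Replace the owner with the Elastic Agent user name that applies to your environment. | Compute node | The directory owner is user-defined. |
| Ascend Docker Runtime | Created automatically; no manual creation is required. The log path is `/var/log/ascend-docker-runtime/`. | Compute node | - |
| HCCL Controller | `mkdir -m 750 /var/log/mindx-dl/hccl-controller`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/hccl-controller` | Management node | - |
| Ascend Operator | `mkdir -m 750 /var/log/mindx-dl/ascend-operator`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/ascend-operator` | Management node | - |
| Resilience Controller | `mkdir -m 750 /var/log/mindx-dl/resilience-controller`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/resilience-controller` | Management node | - |
| Volcano | `mkdir -m 750 /var/log/mindx-dl/volcano-controller`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/volcano-controller`<br>`mkdir -m 750 /var/log/mindx-dl/volcano-scheduler`<br>`chown hwMindX:hwMindX /var/log/mindx-dl/volcano-scheduler` | Management node | - |
| cert-importer | `/var/log/mindx-dl/cert-importer` | All nodes | Created automatically when a certificate is imported. The directory permission is 750 and the owner is root:root. |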
- (Optional) Run the following commands to create log directories for all components. Based on the components you actually use, you can delete the log directories of components you do not need.

      mkdir -m 750 /var/log/mindx-dl/devicePlugin    # Ascend Device Plugin
      chown hwMindX:hwMindX /var/log/mindx-dl/devicePlugin
      mkdir -m 750 /var/log/mindx-dl/npu-exporter    # NPU Exporter
      chown hwMindX:hwMindX /var/log/mindx-dl/npu-exporter
      mkdir -m 750 /var/log/mindx-dl/noded    # NodeD
      chown hwMindX:hwMindX /var/log/mindx-dl/noded
      mkdir -m 750 /var/log/mindx-dl/elastic    # Elastic Agent
      chown <user-defined> /var/log/mindx-dl/elastic
      mkdir -m 750 /var/log/mindx-dl/hccl-controller    # HCCL Controller
      chown hwMindX:hwMindX /var/log/mindx-dl/hccl-controller
      mkdir -m 750 /var/log/mindx-dl/ascend-operator    # Ascend Operator
      chown hwMindX:hwMindX /var/log/mindx-dl/ascend-operator
      mkdir -m 750 /var/log/mindx-dl/resilience-controller    # Resilience Controller
      chown hwMindX:hwMindX /var/log/mindx-dl/resilience-controller
      mkdir -m 750 /var/log/mindx-dl/volcano-controller    # Volcano
      chown hwMindX:hwMindX /var/log/mindx-dl/volcano-controller
      mkdir -m 750 /var/log/mindx-dl/volcano-scheduler
      chown hwMindX:hwMindX /var/log/mindx-dl/volcano-scheduler
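The repeated mkdir/chown pairs in this procedure can also be wrapped in a small helper. The following is a minimal sketch, not part of MindX DL: the function name `create_log_dirs` and its argument order are hypothetical, and it assumes the parent directory already exists. Because `mkdir -m` only applies the mode to a newly created directory, the sketch also runs `chmod` so that a rerun corrects the mode of an existing directory.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with MindX DL).
# Usage: create_log_dirs BASE_DIR OWNER COMPONENT_DIR...
# Creates BASE_DIR/COMPONENT_DIR with mode 750 and the given owner.
create_log_dirs() {
    base=$1
    owner=$2
    shift 2
    for name in "$@"; do
        dir="$base/$name"
        # Create the directory only if it is missing.
        [ -d "$dir" ] || mkdir -m 750 "$dir" || return 1
        # Enforce the mode and owner even when the directory already existed.
        chmod 750 "$dir" || return 1
        chown "$owner" "$dir" || return 1
    done
}

# Example call, to be run as root on a compute node (assumed component set):
# create_log_dirs /var/log/mindx-dl hwMindX:hwMindX devicePlugin npu-exporter noded
```

After running it, `ls -ld /var/log/mindx-dl/*` shows the resulting mode and owner of each directory. The Elastic Agent directory would need a separate call with the user-defined owner.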