EI0006 Get Socket Timeout

Symptom

Getting socket times out. Reason: %s

Possible Cause

N/A

Solution

1. In the cluster, check the rank service processes that report a different error or report no error at all.

2. If this error is reported on all NPUs, check whether the interval between the earliest and the latest error exceeds the connection timeout (120 s by default). If it does, increase the timeout by setting the HCCL_CONNECT_TIMEOUT environment variable.

3. Check the connectivity of the communication link between nodes.
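
The comparison in step 2 can be sketched as a small script. This is a minimal illustration, not part of any HCCL tooling: the function names, the timestamp format, and the sample timestamps are all assumptions, and the error times would have to be collected from each node's logs by hand or by a separate script.

```python
from datetime import datetime

# Default connect timeout in seconds, as stated in step 2.
DEFAULT_CONNECT_TIMEOUT_S = 120


def error_spread_seconds(timestamps, fmt="%Y-%m-%d %H:%M:%S"):
    """Return the seconds between the earliest and latest error timestamps.

    The timestamp format is an assumption; adjust `fmt` to match the logs.
    """
    times = [datetime.strptime(t, fmt) for t in timestamps]
    return (max(times) - min(times)).total_seconds()


def exceeds_timeout(timestamps, timeout_s=DEFAULT_CONNECT_TIMEOUT_S):
    """Return True if the error spread is greater than the connect timeout."""
    return error_spread_seconds(timestamps) > timeout_s


# Hypothetical times at which EI0006 was first reported on three NPUs.
sample = [
    "2024-01-01 10:00:00",
    "2024-01-01 10:01:30",
    "2024-01-01 10:03:10",
]

if exceeds_timeout(sample):
    print("Error spread exceeds the connect timeout; consider raising HCCL_CONNECT_TIMEOUT.")
else:
    print("Error spread is within the connect timeout; check steps 1 and 3.")
```

If the spread exceeds the timeout, set the variable in the environment of every rank before launching the job, for example `export HCCL_CONNECT_TIMEOUT=<seconds>`, choosing a value larger than the measured spread.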