본문 바로가기
소프트웨어/LSF

LSF 데몬 관련 ERROR

by yororing 2024. 5. 13.

00 개요

  • 목적: 흔한 LSF 문제를 다룸
  • 대부분의 문제들은 due to incorrect installation or configuration
  • 절차: error log files 먼저 확인해보기

01 LIM 

1. LIM dies quietly 

  • 절차
    • Run the following command to check for errors in the LIM configuration files.
    •  
    • # lsadmin ckconfig -v
    • This displays most configuration error
    • If this does not report any errors, check in the LIM error log

2. LIM unavailable

  • 설명
    • sometimes the LIM is up, but executing the lsload command prints the following error message: Communication time out
    • If the LIM has just been started, this is normal, because the LIM needs time to get initialized by reading configuration files and contacting other LIMs. If the LIM does not become available within one or two minutes, check the LIM error log for the host you are working on. To prevent communication timeouts when starting or restarting the local LIM, define the parameter LSF_SERVER_HOSTS in the lsf.conf file. The client will contact the LIM on one of the LSF_SERVER_HOSTS and execute the command, provided that at least one of the hosts defined in the list has a LIM that is up and running. When the local LIM is running but there is no master LIM in the cluster, LSF applications display the following message: Cannot locate master LIM now, try later.
  • 절차
    • Check the LIM error logs on the first few hosts listed in the Host section of the lsf.cluster.cluster_name file.
    • If LSF_MASTER_LIST is defined in lsf.conf, check the LIM error logs on the hosts listed in this parameter instead.

 

참조

  1. https://www.bsc.es/support/LSF/9.1.2/lsf_admin/index.htm?troubleshooting_common_problems_lsf.html~main 
  2.  

 

'소프트웨어 > LSF' 카테고리의 다른 글

Resource (자원)  (0) 2024.06.26
External Load Indices, External Load Info Manager (elim)  (1) 2024.06.13
lsfstartup, lsfrestart, lsfshutdown (LSF 명령어)  (0) 2024.04.29
LSF 빠른 참조  (0) 2024.04.22
LSF 클러스터, 잡, 큐  (0) 2024.04.22