Troubleshooting Hub Submit

The submit tool uses a daemon to monitor jobs that have been submitted to remote hosts. The machinery for this tool resides in the ~/Submit directory of the submit user on the host (e.g., ~vhub/submit on u2-grid). This machinery may occasionally stop working (for example, after filesystem changes). To troubleshoot, first examine the job monitor on the target host to see whether it has entries for any recently submitted jobs. If it does not, the monitoring server on the hub, which initiates monitoring requests, may be down.


Also examine the job monitor log on the hub (www.vhub.org).


The job monitor process on the hub is {{{/opt/submit/monitorJob.py}}}. If it has stopped, it can be restarted using:

{{{/etc/init.d/submon start}}}
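
Before restarting, it can help to confirm that the monitor has actually stopped. The following is a minimal sketch, not part of the submit distribution: the function names and the `MONITOR_PATTERN` and `RESTART_CMD` variables are made up for illustration, and it assumes `pgrep` is available on the hub and that the process command line contains the script path given above.

```shell
#!/bin/sh
# Hypothetical helper: start the job monitor only if it is not already running.
# The [m] bracket keeps the pattern from matching a shell whose own command
# line happens to contain this pattern text.
MONITOR_PATTERN='/opt/submit/[m]onitorJob\.py'
RESTART_CMD='/etc/init.d/submon start'

monitor_is_running() {
    # pgrep -f matches the pattern against each process's full command line
    # and exits 0 only if at least one process matches.
    pgrep -f "$MONITOR_PATTERN" >/dev/null 2>&1
}

ensure_monitor() {
    if monitor_is_running; then
        echo "job monitor is running"
    else
        echo "job monitor is not running; starting it"
        # Left unquoted on purpose so the command and its argument are split.
        $RESTART_CMD
    fi
}
```

After sourcing the file on the hub, calling `ensure_monitor` reports the monitor's state and invokes the init script only when no matching process is found.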

A list of available sites is stored on login.vhub.org in {{{/opt/submit/sites.dat}}} and can be modified by the apps user.




			
