Batch Job Submission on NYBlue/L

In your LoadLeveler batch script:

Class normal jobs (48 hour wall clock limit) must specify a particular predefined partition of compute nodes on which the job will run.

Class short jobs (24 hour wall clock limit) must instead specify only the number of nodes on which to run the job. The number must be 1024, 2048, 3072, or 4096 (i.e. one, two, three, or four racks).

If you will be running class normal jobs, see the NYBlue Predefined Partitions Naming Convention hyperlinks on the sidebar for details. If you will be running class short jobs, see only the last of those three hyperlinks.

Notes For Class Normal Jobs Only

To list the predefined partitions that are currently available, run:

readyblocks.pl

Then specify any one of the displayed predefined partitions in your LoadLeveler job control file.

Note: If there are no available partitions, the above command produces no output. In that case, select a partition by following the procedure described in the List of NYBlue Predefined Partitions hyperlink on the sidebar. Your LoadLeveler job will be queued and will run when that partition becomes available.

Note: At times readyblocks.pl will indicate that a predefined partition is available, yet a batch job specifying that partition will be queued rather than run immediately.

Notes for Both Class Normal and Class Short Jobs

The batch system uses backfilling and favors incoming jobs with smaller specified wall clock limits, so it is to your advantage to specify a wall clock limit no larger than what your job will actually need.

The Sample LoadLeveler Batch Script hyperlink on the sidebar at left describes how to cancel a job and how to use llq -s to learn about the status of a job.

There is a maximum wall clock limit of 48 hours on all class normal LoadLeveler jobs and 24 hours on all class short jobs.
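The class rules above can be checked before a job is submitted. The following is a minimal Python sketch of such a check, not part of LoadLeveler or of the NYBlue tools: the function name check_job, its parameters, and the partition name EXAMPLE_PARTITION are hypothetical and chosen only for illustration. The limits are the ones stated in the text.

```python
# Hypothetical pre-submission check; not a LoadLeveler or NYBlue utility.
# Limits are taken from the class rules described in the text.
MAX_WALL_CLOCK_HOURS = {"normal": 48, "short": 24}
SHORT_NODE_COUNTS = (1024, 2048, 3072, 4096)  # one, two, three, or four racks


def check_job(job_class, wall_clock_hours, partition=None, nodes=None):
    """Return a list of problems; an empty list means the request looks valid."""
    if job_class not in MAX_WALL_CLOCK_HOURS:
        return ["class must be 'normal' or 'short'"]

    problems = []
    limit = MAX_WALL_CLOCK_HOURS[job_class]
    if wall_clock_hours <= 0 or wall_clock_hours > limit:
        problems.append(
            f"wall clock limit for class {job_class} must be at most {limit} hours"
        )

    if job_class == "normal":
        # Class normal jobs name a predefined partition.
        if not partition:
            problems.append("class normal jobs must name a predefined partition")
    else:
        # Class short jobs give only a node count, in whole racks.
        if nodes not in SHORT_NODE_COUNTS:
            problems.append(
                "class short jobs must request 1024, 2048, 3072, or 4096 nodes"
            )
    return problems


if __name__ == "__main__":
    # EXAMPLE_PARTITION is a placeholder, not a real NYBlue partition name.
    print(check_job("normal", 36, partition="EXAMPLE_PARTITION"))
    print(check_job("short", 30, nodes=512))
```

The first call satisfies the class normal rules, so it reports no problems. The second call reports two: the wall clock limit exceeds 24 hours and 512 is not a whole number of racks.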
All jobs must specify the class to be used, and the class must be normal or short. See the Sample LoadLeveler Batch Script hyperlink on the sidebar at the left, which also discusses how to monitor LoadLeveler jobs and the meaning of the codes displayed when monitoring a batch job.

maxjobs and maxqueued both equal two for class short. The Notes section of the Sample LoadLeveler Batch Script hyperlink on the sidebar at the left explains the significance of this statement.

We are still in the process of testing LoadLeveler and implementing the best configuration for it; your patience is appreciated.

This site maintained by: bgwebmaster@bnl.gov