Figure 2.
Pipeline job management. The pipeline database tracks the status of jobs, analyses
and batches. As indicated by the ovals, a batch is a collection of analyses, and an
analysis is a set of jobs. A job is a single execution of a program on a single sequence
(for example, BLASTX similarity searching of a unit of genomic sequence). All three
have a current task status. The slowest running in the set dictates the status of
an analysis and a batch. Thus, in terms of analyses, the analysis status is the same
as the status of the slowest job in that analysis, and for batches, the status is
the same as the slowest analysis in that batch. The allowed values for the status
attribute are READY, RUN, FIN, PROCD, UNPRC and FAIL. With respect to jobs, READY
means the jobs are ready to be sent to the pipeline queue, RUN means the jobs are
on the queue or being run, FIN means the jobs have run but have not yet been processed
by BOP to extract the results from the raw data, UNPRC generally means there was an
error in the processing step, FAIL means there was an error in job execution, and
PROCD means the jobs have run and been processed by BOP.
Mungall et al. Genome Biology 2002 3:research0081.1 doi:10.1186/gb-2002-3-12-research0081 |