[maker-devel] PARALLELIZED DE NOVO GENOME ANNOTATION WITHOUT MPI
Quanwei Zhang
qwzhang0601 at gmail.com
Mon Feb 27 08:25:04 MST 2017
Hello:
I am doing genome annotation using Maker on our high performance
computational cluster (HPC). Due to some issues of MPI, I submitted the
Maker jobs several times under the same directory to HPC. Followed by the
example in the protocol (as shown below), when I submit the jobs I make
them as background processes by "&" except the first one. Is this necessary
when I submit a job to a HPC? I found it costed much much longer time than
I expected (according to a testing on a smaller data set). I am not sure
whether setting the process as background process lead to this issue?
The example in the protocol
% maker 2> maker1.error
% maker 2> maker2.error &
% maker 2> maker3.error &
......
BTW, will the annotation on shorter contig (e.g., 500bp) cost ~ 1/100 of
the time that cost for annotation a 50000bp contig? I am using SNAP for an
inito and RNA-seq assembly and protein sequences as evidence. I have more
than half contigs shorter than 300bp (whose total length is only about 5%
of the total length of all contigs), I want to know whether I can save
about half (or only about 5%) of the time if I ignore those short contigs.
Thanks
Best
Quanwei
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20170227/50a134e5/attachment-0002.html>
More information about the maker-devel
mailing list