[maker-devel] tradeoff between run time & file number

Wed Mar 19 19:19:27 MDT 2014

Hi -

I'm running maker on a dataset of >400,000 scaffolds with MPI -n 64. I've
gone through it once - and used the clean_up option because otherwise maker
exceeds the clusters file_quote. However, now I'm retraining SNAP and it is
taking a very long time - probably because it has to go through BLAST
again. Is there anyway of getting around this? I expect I may have to train
SNAP and rerun maker multiple times and it is taking about 3 weeks to get
through my dataset. Is there a way to prune down my original dataset based
on maker's output?

Thanks,
Rebecca
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140319/80de6463/attachment-0002.html>