<div dir="ltr">Hi Carson,<div><br></div><div>Looking at the progress more carefully, I learned that some query and database combinations cause tblastx to run forever. Typically, the tblastx search ends in reasonable times (a few hours maximum), but for those, it takes days ( and still running ) to search the 100 kb query against a 50 Mb database. And all CPUs are trapped by these searches, making MAKER to never finish. </div><div><br></div><div>Would it be possible to skip tblastx search for these queries + databases? I have intermediate files from a previous MAKER run produced with a smaller size of databases, so I attempted to copy some of these files into the current run folders. For instance, for atg000169l.12.Solanacea%2Ecds%2Efa.tblastx.temp_dir that causes the issue,</div><div><br></div><div>I first copied atg000169l.12.Solanacea%2Ecds%2Efa.tblastx from the previous run into the proper directory and deleted
atg000169l.12.Solanacea%2Ecds%2Efa.tblastx.temp_dir. </div><div><br></div><div>Then I modified run.log.child.12 to include FINISHED SH1353.alternative.noPlasmid.maker.output/SH1353.alternative.noPlasmid_datastore/42/CC/atg000169l//theVoid.atg000169l/1/atg000169l.12.Solanacea%2Ecds%2Efa.tblastx</div><div><br></div><div>However, it seems like MAKER still starts over from tblastx. I have a small number of contigs left, so manually working around this is feasible. Would there be a way to do this?</div><div><br></div><div>Thank you for your help!</div><div>Kyungyong</div><div><br></div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sun, Dec 19, 2021 at 10:02 AM Kyungyong Seong <<a href="mailto:s.kyungyong@berkeley.edu">s.kyungyong@berkeley.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Thank you for the tips! How about reducing the time for tblastx? My cluster has a 3 days run limit. I think what is happening is that MAKER is terminated because of out-of-memory issues or runtime cap, and when MAKER is restarted, tblastx needs to start from scratch. Do you think it would be better not to use MPI and set cpus=30? Or would it be okay to set up mpi = 3 and cpus=10 if I have 30 cores?<div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Dec 17, 2021 at 9:29 AM Carson Holt <<a href="mailto:carsonhh@gmail.com" target="_blank">carsonhh@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">1. Make sure your system is not configured with an in memory /tmp directory. If it is, every file written to temporary storage will use RAM.<br>
2. If running under MPI, cpu= in maker_opts.ctl must be set to 1.<br>
3. max_dna_len= should be 100000 (the default)<br>
4. In maker_bopts.ctl, set all the depth_blast= options to something like 10 or 20 (there are 3 depth values you will have to set). The default is to keep everything, and if you have really deep alignments that can use a lot of RAM with out any actual benefit for gene prediction.<br>
<br>
—Carson<br>
<br>
<br>
<br>
> On Dec 16, 2021, at 11:03 AM, Kyungyong Seong <<a href="mailto:s.kyungyong@berkeley.edu" target="_blank">s.kyungyong@berkeley.edu</a>> wrote:<br>
> <br>
> Hi!<br>
> <br>
> MAKER has been running fine on my genome (~1Gb; 800 contigs) but is now stuck with ~30 contigs that keep failing because of high memory consumption. I am using mpi, running 20-30 contigs for annotation in parallel, depending on the machine. I started with 64Gb memory machines but have moved up to 1.5 Tb machines as the job kept failing. Unfortunately, all memory of this machine is also saturated. It looks like tblastx is taking lots of time and resources. The databases I have are about 200 Mb for the proteins and 570 Mb for cDNAs. max_dna_len is set as 100000 in maker_opt.ctl. Would there be a way to improve this? Decreasing the number of jobs for MPI slowed down memory saturation but eventually the same happened. <br>
> <br>
> Thank you!<br>
> Kyungyong<br>
> <br>
> <br>
> _______________________________________________<br>
> maker-devel mailing list<br>
> <a href="mailto:maker-devel@yandell-lab.org" target="_blank">maker-devel@yandell-lab.org</a><br>
> <a href="http://yandell-lab.org/mailman/listinfo/maker-devel_yandell-lab.org" rel="noreferrer" target="_blank">http://yandell-lab.org/mailman/listinfo/maker-devel_yandell-lab.org</a><br>
<br>
</blockquote></div>
</blockquote></div>