<div dir="ltr"><div><div>Hi Panos,<br><br></div>Have you tried using webAugustus for the (re)training? I found it very convenient for generating the models for Augustus.<br><br></div>Cheers,<br></div><div class="gmail_extra"><br><div class="gmail_quote">2015-03-24 19:29 GMT+11:00 Panos Ioannidis <span dir="ltr"><<a href="mailto:panos.ioannidis@gmail.com" target="_blank">panos.ioannidis@gmail.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div><div><div>Hello All,<br><br></div>I'm trying to retrain Augustus using EST data from the same species and realized that quite a few of the gene models I get based on EST data are incomplete (i.e. no start and/or stop codon).<br><br></div>Now, when I get to the "etraining" step in Augustus retraining (right after the time-consuming "<a href="http://optimize_augustus.pl" target="_blank">optimize_augustus.pl</a>" step), I get a warning for each gene that doesn't contain a start or stop codon.<br><br><span style="font-family:monospace,monospace">.....<br>gene maker-scaffold4|size2210279-exonerate_est2genome-gene-20.1-mRNA-1 transcr. 1 in sequence scaffold4|size2210279_2021791-2044735: Initial exon does not begin with start codon but with acg<br>gene maker-scaffold4|size2210279-exonerate_est2genome-gene-20.2-mRNA-1 transcr. 1 in sequence scaffold4|size2210279_2045713-2064983: Terminal exon doesn't end in stop codon. Variable stopCodonExcludedFromCDS set right?<br>....</span><br><br></div>Does anyone know whether training is compromised by such incomplete gene models? Do you usually exclude them from the training set?<br><br></div>Oh, and by the way, the best guide to retraining Augustus is <a href="http://avrilomics.blogspot.ch/2013/04/training-augustus-gene-finding-software.html" target="_blank">here</a>. The <a href="http://bioinf.uni-greifswald.de/augustus/binaries/retraining.html" target="_blank">official</a> web page isn't bad, but doesn't explain in detail certain things.<br><div><div><br></div><div>Thanks,<br></div><div>Panos<br><br></div></div></div>
<br>_______________________________________________<br>
maker-devel mailing list<br>
<a href="mailto:maker-devel@box290.bluehost.com">maker-devel@box290.bluehost.com</a><br>
<a href="http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org" target="_blank">http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org</a><br>
<br></blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature">Xabier Vázquez Campos<br><i>PhD Candidate</i><br>Water Research Centre<br>School of Civil and Environmental Engineering<br>
The University of New South Wales<br>Sydney NSW 2052 AUSTRALIA<br></div>
</div>