<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div>Hi Maker author<br>
<div>
<div style="font-family:Calibri,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
I have been using Maker for long years and recently, I've tried to train agustus using the snap training files. To do this, I have used the train_augustus.pl script as follows:<br>
<br>
zff2genbank.pl export.ann export.dna > final_genes.gb<br>
train_augustus.pl final_genes.gb MyOrg<br>
<br>
For each of my gene models the error is the following:<br>
<br>
<span>Constructing GenBank feature: Feature begins after it ends: 1006..1001,2051..1917,7791..7689,7993..7880,8485..8374,8775..8628,9050..8873,10467..10459,13315..11920,13598..13511,14971..14945,18637..18471,18898..18821,20558..20389,21067..20923,23249..23004,23549..23354,23647..23624<br>
</span>
<div>GBProcessor::getGeneList(): GBFeature constructor:Format error when reading genbank format.<br>
</div>
<span>Encountered error after reading 0 annotations.</span><br>
<br>
The export files are generated with SNAP as described by your reference guides (two maker rounds). The issue seems related with the sense of the gene model that can be inspected here:<br>
(export.ann file)<br>
<span>>MODEL236<br>
</span>
<div>Eterm 23624 23647 MODEL236<br>
</div>
<div>Exon 23354 23549 MODEL236<br>
</div>
<div>Exon 23004 23249 MODEL236<br>
</div>
<div>Exon 20923 21067 MODEL236<br>
</div>
<div>Exon 20389 20558 MODEL236<br>
</div>
<div>Exon 18821 18898 MODEL236<br>
</div>
<div>Exon 18471 18637 MODEL236<br>
</div>
<div>Exon 14945 14971 MODEL236<br>
</div>
<div>Exon 13511 13598 MODEL236<br>
</div>
<div>Exon 11920 13315 MODEL236<br>
</div>
<div>Exon 10459 10467 MODEL236<br>
</div>
<div>Exon 8873 9050 MODEL236<br>
</div>
<div>Exon 8628 8775 MODEL236<br>
</div>
<div>Exon 8374 8485 MODEL236<br>
</div>
<div>Exon 7880 7993 MODEL236<br>
</div>
<div>Exon 7689 7791 MODEL236<br>
</div>
<div>Exon 1917 2051 MODEL236<br>
</div>
<span>Einit 1001 1006 MODEL236<br>
<br>
</span>(genbankfile)<br>
<span>LOCUS MODEL236 24647 bp dna linear UNK<br>
</span>
<div>ACCESSION unknown<br>
</div>
<div>FEATURES Location/Qualifiers<br>
</div>
<div> source 1..24647<br>
</div>
<div> CDS complement(join(1006..1001,2051..1917,7791..7689,<br>
</div>
<div> 7993..7880,8485..8374,8775..8628,9050..8873,10467..10459,<br>
</div>
<div> 13315..11920,13598..13511,14971..14945,18637..18471,<br>
</div>
<div> 18898..18821,20558..20389,21067..20923,23249..23004,<br>
</div>
<div> 23549..23354,23647..23624))<br>
<br>
</div>
<span></span>It seems that augustus needs the direct sense description of the gene model in order to read the gb file and perform the training. How could I fix the problem?<br>
Thank you in advance<br>
Pedro Seoane<br>
</div>
</div>
</div>
</body>
</html>