[maker-devel] Serious Start Codon Problem
Fields, Christopher J
cjfields at illinois.edu
Mon Oct 14 09:07:43 MDT 2013
Carson,
Regarding the CodonTable change below, should this be something that needs to be fixed, or made more flexible, in bioperl? I could see this being a problem if using MAKER for bacterial annotation (where the alternative start codons are actually more common).
chris
On Oct 14, 2013, at 9:45 AM, Carson Holt <carsonhh at gmail.com> wrote:
> Probably Tuesday or Wednesday.
>
> --Carson
>
>
> On 10/12/13 12:16 PM, "Felix Bemm" <felix.bemm at uni-wuerzburg.de> wrote:
>
>> Thx Carson! Do you now when 2.30 will be available? Bests
>>
>> On 12.10.2013 17:48, Carson Holt wrote:
>>> This issue just came up this week on another e-mail as well.
>>>
>>> One way to fix it, is to edit the Bio::Tools::CodonTable module from
>>> BioPerl (use maker --debug to have maker print out the locations of all
>>> modules it uses). Such a hack should only be done as a temporary work
>>> around.
>>>
>>> You change line 256 from -->
>>> ---M---------------M---------------M----------------------------
>>>
>>> To-->
>>> -----------------------------------M----------------------------
>>>
>>>
>>> Maker uses the BioPerl is_start_codon function to determine if the codon
>>> used in a result it receives is acceptable or if it should look upstream
>>> or downstream for a different one. Apparently there are actually 3
>>> acceptable start codons in the standard codon table (2 rare ones and one
>>> common), so making the edit removes the two rare ones from the list.
>>>
>>> I'm also preparing a 2.30 release that exports a "strict" codon table to
>>> the BioPerl module, that will be used by default and should fix the
>>> issue.
>>>
>>> Thanks,
>>> Carson
>>>
>>>
>>>
>>> On 10/12/13 11:06 AM, "Felix Bemm" <felix.bemm at uni-wuerzburg.de> wrote:
>>>
>>>> Hi,
>>>>
>>>> I have a serious problems with maker annotations especially the start
>>>> codon placement. It started happening with maker version 2.27! About
>>>> 1/3
>>>> of my genes are missing a proper start codon. When reviewing the GFF
>>>> files gene predictors and est evidence standalone would predict the
>>>> right gene structure including the start codon. I was thinking that
>>>> this
>>>> problem was somehow linked to my gene models but looking at their
>>>> standalone prediction I discarded that theory. Just some stats about
>>>> proteins with proper start codon (first column) and missing start codon
>>>> (second column).
>>>>
>>>> Maker 9268 4215
>>>> SNAP 16577 896
>>>> Genemark 18764 290
>>>>
>>>> Numbers look the same when Augustus is used (currently retraining).
>>>> Manually using EST's in Apollo often fixes the problems but I don't
>>>> want
>>>> to check > 4000 genes for this.
>>>>
>>>> Do you have any idea what could cause such a problem? If you need some
>>>> example gff files I can send you.
>>>>
>>>> Best regards
>>>> Felix
>>>>
>>>> --
>>>> Felix Bemm
>>>> Department of Bioinformatics
>>>> University of Würzburg, Germany
>>>> Tel: +49 931 - 31 83696
>>>> Fax: +49 931 - 31 84552
>>>> felix.bemm at uni-wuerzburg.de
>>>>
>>>> _______________________________________________
>>>> maker-devel mailing list
>>>> maker-devel at box290.bluehost.com
>>>> http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
>>
>> --
>> Felix Bemm
>> Department of Bioinformatics
>> University of Würzburg, Germany
>> Tel: +49 931 - 31 83696
>> Fax: +49 931 - 31 84552
>> felix.bemm at uni-wuerzburg.de
>
>
>
> _______________________________________________
> maker-devel mailing list
> maker-devel at box290.bluehost.com
> http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
More information about the maker-devel
mailing list