[maker-devel] CDS retrieve from augustus_masked

Carson Holt carsonhh at gmail.com
Sat Apr 6 15:00:19 MDT 2013


I additionally found the offset value some position after Œ>¹ letter. Is
that indicate the starting ATG?

> Only in the maker.transcripts.fasta will have offsets other than 0, you can
> use these to get the transcription offset.  All other *.transcript.fasta files
> will always have an offset of 0 for the reason previously mentioned.  Some
> genes will not start with ATG or have stop codons.  These are partial models.
> Set always_complete=1 to reduce these.


Secondly, there is several files named *.augustus_masked.proteins.fasta,
*.non_overlapping_ab_initio.proteins.fasta, and *.proteins.fasta. What is
the criteria of splitting those files?
> Final selected annotations go in the maker.proteins.fasta and
> maker.transcripts.fasta files.  Raw unfiltered ab initio prediction from
> augustus go in the augustus_masked.proteins.fasta and
> augustus_masked.transcripts.fasta file (these are for reference purposes).  A
> set of non-redundant rejected models go in the
> non-overlapping.transcripts.fasta and  non-overlapping.proteins.fasta files
> (if you are missing a gene you expected to find, look in this file first ­ you
> can add them back if you find protein domains in them for example).


The reason why I¹m asking is that some genes were redundant between
*.augustus_masked.proteins.fasta and *.proteins.fasta.
> This is because some of the augustus generated models made it into the final
> annotation set.
> 
> 
Thanks,
Carson

From: Carson Holt [mailto:carsonhh at gmail.com]
Sent: Sunday, April 07, 2013 12:13 AM
To: Michael Thon; Kang, Yang Jae
Cc: maker-devel at yandell-lab.org
Subject: Re: [maker-devel] CDS retrieve from augustus_masked
 

Augustus only predicts UTR for a handful of organisms.  I trim them off the
rejected models before outputting to the GFF3 as match/match_part features
(per my previous e-mail concerning the limitations of GFF3).

 

 --Carson

 

From: Michael Thon <mike.thon at gmail.com>
Date: Saturday, 6 April, 2013 10:37 AM
To: "Kang, Yang Jae" <kangyangjae at gmail.com>
Cc: <maker-devel at yandell-lab.org>
Subject: Re: [maker-devel] CDS retrieve from augustus_masked

 

Thats a good point because 'transcripts' implies that it would have the
UTRs. Does augustus predict the UTRs?  I manually checked the translations
of the .transcript. file and I only found valid translations but that does
not mean that UTRs could not be present...

On Apr 6, 2013, at 1:24 PM, "Kang, Yang Jae" <kangyangjae at gmail.com> wrote:


Thank for your quick response Mike

I looked the file named transcript, but it might include UTRs I suspect.
What I want to do is calculating Ka Ks values so that I need coding
sequences. Is there any indication where is exact START and STOP in the
transcript file?

 

Thank you

 

 

From: Michael Thon [mailto:mike.thon at gmail.com <http://gmail.com> ]
Sent: Saturday, April 06, 2013 8:20 PM
To: Kang, Yang Jae
Cc: maker-devel at yandell-lab.org
Subject: Re: [maker-devel] CDS retrieve from augustus_masked

 

Hi Kang - After running fasta_merge there should be a file:

 

[prefix].all.maker.augustus_masked.transcripts.fasta

 

in the output directory.  Is that what you need?

Mike

 

On Apr 6, 2013, at 9:25 AM, "Kang, Yang Jae" <kangyangjae at gmail.com
<mailto:kangyangjae at gmail.com> > wrote:




Dear everyone!
 
I want to retrieve CDS sequences from the output of maker; however, in the
augustus_masked feature there is no indication of CDS or Exon like maker
features. Is there any way for me to retrieve CDS from augustus_masked?
There were protein sequences in outdir but no CDS information.
 
Thank you!
 

Kang, Yang Jae

Ph.D.

Cropgenomics Lab.

College of Agriculture and Life Science

Seoul National University

Korea
 

_______________________________________________
maker-devel mailing list
maker-devel at box290.bluehost.com <mailto:maker-devel at box290.bluehost.com>
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
<http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org>
 
_______________________________________________ maker-devel mailing list
maker-devel at box290.bluehost.com
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20130406/ec7c3dec/attachment-0002.html>


More information about the maker-devel mailing list