[maker-devel] Maker output

Panos Ioannidis panos.ioannidis at gmail.com
Thu Apr 23 07:57:31 MDT 2015


Hello Maker community,

I have, at last, finished annotating my genome with Maker (!) and have a
few questions on the final output.

1. I used gff3_merge and fasta_merge in order to merge all the gffs and all
the different fasta files that were produced during the runs (I split my
assembly to smaller chunks that ran in parallel). Are these two scripts the
only ones I have to run after Maker has finished? Am I leaving anything
important behind?

2. I noticed that all my transcripts (both in the fasta files as well as in
the gff) have the name "XXX-mRNA-1". The fact that I can't find any of them
containing "mRNA-2" means that there are no splice variants from the same
gene?

3. In my *maker.proteins.fasta file I see that some proteins have a name
like

snap_masked-XXX

whereas others (apparently, also predicted by SNAP) have a name, like

maker-XXX-snap-gene-XXX

What is the difference between these two genes that are both predicted by
SNAP? By reading other posts in this list, I was left with the impression
that all genes predicted by SNAP/Augustus that lie in a masked region (as
the first name implies), are put to another fasta file, named
*maker.snap_masked.proteins.fasta.

4. By looking at a few genes in the *maker.transcripts.fasta file I came to
the conclusion that only complete genes (i.e. with a start and a stop
codon) are reported in this file. Am I right?

Thanks in advance,
Panos
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20150423/01a080c5/attachment-0002.html>


More information about the maker-devel mailing list