[maker-devel] Updating Reference Gene Models - Legacy - Strategy

Alessandro Rossoni Alessandro.Rossoni at uni-duesseldorf.de
Fri Feb 10 08:50:07 MST 2017


Dear makers of MAKER,
first of all - thank you for this awesome program! In the context of my 
project, I have been running MAKER on a set of novel genomes and it 
worked very well :)

During the last days, I realized that the reference sequences of species 
A that I have been using as starting point for gene model prediction for 
species B, C and D are a bit flawed. How do I know that? Through 
visualization of novel RNA-Seq data that was mapped to the "old" gene 
models of species A, I came to the conclusion that the "old" gene models 
are not really accurate. In fact, between 52–62% of the reads map within 
annotated exonic regions of the genome and up to 47% map within 
intergenic regions. Intron/Exon boarders are pretty messed up and there 
are a lot more transcribed sequences in species A than previously 
thought.

Hence, I would love to update the gene models of species A including the 
new RNA-Seq evidence and hope to get more accurate gene models out of 
it. The more accurate gene models of species A would be then used to 
predict genes for species B, C and D (which are hopefully going to 
benefit from the more accurate input). However, I there is a little 
understanding issue on how to set the parameters of the maker_opts.ctl.

My plan is to produce new RNA-Seq based gene models for species A using 
Cufflinks, Stringtie, Breaker, Trinity and Velvet. I would pass the 
output to the maker_opts.ctl as:

model_gff=Cufflinks.gff,Stringtie.gff,Breaker.gff,Trinity.gff,Velvet.gff 
#the new gene models
est=Species_A_reference_ests.fasta  #the old/flawed gene models
protein=swissprot.fasta

Question 1: is this correct so far?

But what sequences do I use for training the ab-initio predictors?
snaphmm=
augustus_species=

Question 2: Do I use the "old/flawed" sequences that I know are not 
really good? I am not sure what sequences to use for training.

Any help on this issue would be amazing!
Best,
Ale


-- 
Alessandro W. Rossoni, M.Sc.
Institute for Plant Biochemistry
Heinrich-Heine-University

--
http:///www.plant-biochemistry.hhu.de/
E-Mail:  alessandro.rossoni at hhu-duesseldorf.de




More information about the maker-devel mailing list