[maker-devel] Q on MAKER

hcma hcma at uci.edu
Wed Feb 10 15:27:41 MST 2016


Hi Mike,

Thanks for the reply. So i can input raw RNA-seq reads to Tophat and 
feed the output to maker?

Thanks.

Best Regards
KAren



On 2016-02-10 06:17, Michael Campbell wrote:
> HI Karen,
> 
> From my experience trimming reads will not make things worse and it
> generally makes things better. As far as the best program to use, one
> doesn’t really stand out above the others as far as I can tell.
> However, with paired end reads it is important to use a trimmer that
> preserves the pairing between the two files (i.e when an entire read
> is discarded the paired read is moved into a file for singletons).
> 
> Thanks
> Mike
> 
>> On Feb 9, 2016, at 5:35 PM, hcma <hcma at uci.edu> wrote:
>> 
>> Hi Carson,
>> 
>> For the final run of annotation, I would like to incorporate tophat 
>> results from RNA-seq data, from your experience, do you know if it is 
>> better to use raw RNA-seq (Illumina paired-end data) or trimmed 
>> (trimmed using Trimmomatuc) data for feeding into tophat? If trimmed, 
>> do you recommend a particular programme?
>> 
>> Thanks for your time.
>> 
>> Best Regards
>> KAren
>> 
>> 
>> 
>> 
>> On 2016-02-05 15:33, Carson Holt wrote:
>>> I recommend using both.  You probably don't have augustus installed.
>>> --Carson
>>> Sent from my iPhone
>>>> On Feb 5, 2016, at 4:20 PM, hcma <hcma at uci.edu> wrote:
>>>> Hi Carlson,
>>>> Thanks for the instruction and in maker_exe.ctl, i only see path to 
>>>> snap, but not to augustus, so my system admin is checking this for 
>>>> me.
>>>> From some manual i found, people use both snap and augustus when 
>>>> using MAKER to annotate genomes. Would you recommend using both or 
>>>> one of the 2 is sufficient?
>>>> Thanks for your valuable time and advise.
>>>> Best Regards
>>>> Karen
>>>>> On 2016-02-05 15:03, Carson Holt wrote:
>>>>> You need to find out where the augustus MAKER is using is 
>>>>> installed.
>>>>> Check the maker_exe.ctl file you are using, or type ‘which 
>>>>> augustus’.
>>>>> —Carson
>>>>>> On Feb 5, 2016, at 3:58 PM, hcma <hcma at uci.edu> wrote:
>>>>>> Hi Carlson,
>>>>>> These are the list of directories under maker/2.31.8
>>>>>> bin  data  GMOD  INSTALL  lib  LICENSE  MWAS  perl  README  
>>>>>> RELEASE  src
>>>>>> Where can i find augustus/? Or i have to ask my system admin to 
>>>>>> install this?
>>>>>> Thanks.
>>>>>> Best Regards
>>>>>> Karen
>>>>>>> On 2016-02-05 14:54, Carson Holt wrote:
>>>>>>> Augustus gives you an entire directory rather than just a single 
>>>>>>> file
>>>>>>> like SNAP.  You have to take the directory and copy it to the
>>>>>>> .../augustus/config/species/ directory.
>>>>>>> Example:
>>>>>>> …/augustus/config/species/arabidopsis/
>>>>>>> Then ‘arabidopsis’ would be the species name to use with MAKER.
>>>>>>> Sometimes you may have to do a second round of both SNAP and 
>>>>>>> Augustus
>>>>>>> training (called bootstrapping). Look at the models you get after 
>>>>>>> the
>>>>>>> first round, and if they look good then, the second round is 
>>>>>>> probably
>>>>>>> not going top be beneficial.
>>>>>>> —Carson
>>>>>>>> On Feb 5, 2016, at 3:42 PM, hcma <hcma at uci.edu> wrote:
>>>>>>>> Hi Dr Holt,
>>>>>>>> Thanks for the email. Here is my pipeline, does it seems 
>>>>>>>> acceptable? Any comments is welcome and much appreciated.
>>>>>>>> 1. Use maker to generate training gene set:
>>>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>>>> est=Trinity.fasta
>>>>>>>> est2genome=1
>>>>>>>> 2. Use output of Maker to train SNAP:
>>>>>>>> maker2zff dwil-all-chromosome-r1.04.all.gff
>>>>>>>> fathom genome.ann genome.dna –gene-stats
>>>>>>>> fathom genome.ann genome.dna –categorize 1000
>>>>>>>> fathom genome.ann genome.dna –gene-stats
>>>>>>>> fathom uni.ann uni.dna –export 1000 –plus
>>>>>>>> hmm-assembler.pl genome . > dwil_genome.hmm
>>>>>>>> 3. Use output of Maker to train Augustus on their webserver:
>>>>>>>> File used:
>>>>>>>> Upload ‘export.dna’ as the genome file
>>>>>>>> Upload ‘export.aa’ as the protein file
>>>>>>>> 4. second and final Maker run:
>>>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>>>> est=Trinity.fasta
>>>>>>>> est2genome=0
>>>>>>>> Snaphmm=output of 2
>>>>>>>> How do i incorporate the output of training set of gene from 
>>>>>>>> Augustus web server here into this step 4?
>>>>>>>> Thanks for your time.
>>>>>>>> Best Regards
>>>>>>>> Karen
>>>>>>>>> On 2016-02-05 06:36, Carson Holt wrote:
>>>>>>>>> Hi Karen,
>>>>>>>>> There are many ways to train Augustus. I prefer to identify 
>>>>>>>>> gene
>>>>>>>>> models in MAKER (GFF3) and use those to train both SNAP and 
>>>>>>>>> Augustus.
>>>>>>>>> Here is a previous post on the topic —>
>>>>>>>>> https://groups.google.com/forum/#!searchin/maker-devel/augustus/maker-devel/FWMSTdqWQqI/lC3miQtiCpwJ
>>>>>>>>> [1]
>>>>>>>>> In the end you need to look at the SNAP and Augustus models 
>>>>>>>>> together
>>>>>>>>> with evidence alignments in a genome browser (like desktop 
>>>>>>>>> Apollo).
>>>>>>>>> When everything is trained well, both SNAP and Augustus models 
>>>>>>>>> will
>>>>>>>>> look like each other and both seem to look like the evidence
>>>>>>>>> alignments.
>>>>>>>>> Thanks,
>>>>>>>>> Carson
>>>>>>>>>> On Feb 4, 2016, at 5:52 PM, hcma <hcma at uci.edu> wrote:
>>>>>>>>>> Hi,
>>>>>>>>>> I have a genome sequence and Trinity assembly for a new 
>>>>>>>>>> species and
>>>>>>>>>> I am wondering what are the best steps to take when using 
>>>>>>>>>> MAKER?
>>>>>>>>>> 1. I used the genome sequence and all assembled Trinity 
>>>>>>>>>> sequence to
>>>>>>>>>> do first run of MAKER in order to generate training set for 
>>>>>>>>>> SNAP and
>>>>>>>>>> Augustus.
>>>>>>>>>> In maker_opts.ctl:
>>>>>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>>>>>> est=Trinity.fasta
>>>>>>>>>> est2genome=1
>>>>>>>>>> 2. Train SNAP
>>>>>>>>>> 3. Train Augustus
>>>>>>>>>> When i train Augustus, i only supply genome and protein file, 
>>>>>>>>>> should
>>>>>>>>>> i also supply the trinity file here?
>>>>>>>>>> 4. what's the best parameter to use when running MAKER the 
>>>>>>>>>> second
>>>>>>>>>> time for obtaining the final annotation? I would prefer not to 
>>>>>>>>>> use
>>>>>>>>>> any external protein data.
>>>>>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>>>>>> est=Trinity.fasta
>>>>>>>>>> est2genome=0
>>>>>>>>>> SNAP
>>>>>>>>>> Augustus
>>>>>>>>>> Thanks.
>>>>>>>>>> Best Regards
>>>>>>>>>> KAren
>>>>>>>>> Links:
>>>>>>>>> ------
>>>>>>>>> [1]
>>>>>>>>> https://groups.google.com/forum/#!searchin/maker-devel/augustus/maker-devel/FWMSTdqWQqI/lC3miQtiCpwJ
>> 





More information about the maker-devel mailing list