[maker-devel] Q on MAKER
hcma
hcma at uci.edu
Fri Feb 5 16:20:26 MST 2016
Hi Carlson,
Thanks for the instruction and in maker_exe.ctl, i only see path to
snap, but not to augustus, so my system admin is checking this for me.
From some manual i found, people use both snap and augustus when using
MAKER to annotate genomes. Would you recommend using both or one of the
2 is sufficient?
Thanks for your valuable time and advise.
Best Regards
Karen
On 2016-02-05 15:03, Carson Holt wrote:
> You need to find out where the augustus MAKER is using is installed.
> Check the maker_exe.ctl file you are using, or type ‘which augustus’.
>
> —Carson
>
>
>> On Feb 5, 2016, at 3:58 PM, hcma <hcma at uci.edu> wrote:
>>
>> Hi Carlson,
>>
>> These are the list of directories under maker/2.31.8
>>
>> bin data GMOD INSTALL lib LICENSE MWAS perl README RELEASE
>> src
>>
>>
>> Where can i find augustus/? Or i have to ask my system admin to
>> install this?
>>
>> Thanks.
>>
>> Best Regards
>> Karen
>>
>>
>>
>>
>> On 2016-02-05 14:54, Carson Holt wrote:
>>> Augustus gives you an entire directory rather than just a single file
>>> like SNAP. You have to take the directory and copy it to the
>>> .../augustus/config/species/ directory.
>>> Example:
>>> …/augustus/config/species/arabidopsis/
>>> Then ‘arabidopsis’ would be the species name to use with MAKER.
>>> Sometimes you may have to do a second round of both SNAP and Augustus
>>> training (called bootstrapping). Look at the models you get after the
>>> first round, and if they look good then, the second round is probably
>>> not going top be beneficial.
>>> —Carson
>>>> On Feb 5, 2016, at 3:42 PM, hcma <hcma at uci.edu> wrote:
>>>> Hi Dr Holt,
>>>> Thanks for the email. Here is my pipeline, does it seems acceptable?
>>>> Any comments is welcome and much appreciated.
>>>> 1. Use maker to generate training gene set:
>>>> genome=all-chromosome-r1.04.fasta
>>>> est=Trinity.fasta
>>>> est2genome=1
>>>> 2. Use output of Maker to train SNAP:
>>>> maker2zff dwil-all-chromosome-r1.04.all.gff
>>>> fathom genome.ann genome.dna –gene-stats
>>>> fathom genome.ann genome.dna –categorize 1000
>>>> fathom genome.ann genome.dna –gene-stats
>>>> fathom uni.ann uni.dna –export 1000 –plus
>>>> hmm-assembler.pl genome . > dwil_genome.hmm
>>>> 3. Use output of Maker to train Augustus on their webserver:
>>>> File used:
>>>> Upload ‘export.dna’ as the genome file
>>>> Upload ‘export.aa’ as the protein file
>>>> 4. second and final Maker run:
>>>> genome=all-chromosome-r1.04.fasta
>>>> est=Trinity.fasta
>>>> est2genome=0
>>>> Snaphmm=output of 2
>>>> How do i incorporate the output of training set of gene from
>>>> Augustus web server here into this step 4?
>>>> Thanks for your time.
>>>> Best Regards
>>>> Karen
>>>> On 2016-02-05 06:36, Carson Holt wrote:
>>>>> Hi Karen,
>>>>> There are many ways to train Augustus. I prefer to identify gene
>>>>> models in MAKER (GFF3) and use those to train both SNAP and
>>>>> Augustus.
>>>>> Here is a previous post on the topic —>
>>>>> https://groups.google.com/forum/#!searchin/maker-devel/augustus/maker-devel/FWMSTdqWQqI/lC3miQtiCpwJ
>>>>> [1]
>>>>> In the end you need to look at the SNAP and Augustus models
>>>>> together
>>>>> with evidence alignments in a genome browser (like desktop Apollo).
>>>>> When everything is trained well, both SNAP and Augustus models will
>>>>> look like each other and both seem to look like the evidence
>>>>> alignments.
>>>>> Thanks,
>>>>> Carson
>>>>>> On Feb 4, 2016, at 5:52 PM, hcma <hcma at uci.edu> wrote:
>>>>>> Hi,
>>>>>> I have a genome sequence and Trinity assembly for a new species
>>>>>> and
>>>>>> I am wondering what are the best steps to take when using MAKER?
>>>>>> 1. I used the genome sequence and all assembled Trinity sequence
>>>>>> to
>>>>>> do first run of MAKER in order to generate training set for SNAP
>>>>>> and
>>>>>> Augustus.
>>>>>> In maker_opts.ctl:
>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>> est=Trinity.fasta
>>>>>> est2genome=1
>>>>>> 2. Train SNAP
>>>>>> 3. Train Augustus
>>>>>> When i train Augustus, i only supply genome and protein file,
>>>>>> should
>>>>>> i also supply the trinity file here?
>>>>>> 4. what's the best parameter to use when running MAKER the second
>>>>>> time for obtaining the final annotation? I would prefer not to use
>>>>>> any external protein data.
>>>>>> genome=all-chromosome-r1.04.fasta
>>>>>> est=Trinity.fasta
>>>>>> est2genome=0
>>>>>> SNAP
>>>>>> Augustus
>>>>>> Thanks.
>>>>>> Best Regards
>>>>>> KAren
>>>>> Links:
>>>>> ------
>>>>> [1]
>>>>> https://groups.google.com/forum/#!searchin/maker-devel/augustus/maker-devel/FWMSTdqWQqI/lC3miQtiCpwJ
>>
More information about the maker-devel
mailing list