[maker-devel] maker output- transcripts.fasta and proteins.fasta files missing
dhivya arasappan
darasappan at gmail.com
Thu Mar 13 10:47:25 MDT 2014
Thanks Carson for the response. I understand that est2genome=1 does
not use any ab initio gene predictions, but simply identifies ests
based on alignment. I'm a little confused because I ran maker on my
assembly before, using the same parameters ( including est2genome=1).
I got a very good result with > 20,000 transcripts and proteins.
Then I was able to get an improved assembly, where many scaffolds
were combined into superscaffolds. So I reran maker on this
assembly. Same parameters, same transcriptome and proteins files.
Now, I see such drastically different results: Only 500+ genes and
transcripts. My scaffolds are now bigger than before, so I'm not sure
how this is happening. These were the results I sent you.
Another odd thing I noticed (and I am hesitant to report this because
perhaps it is due to some sort of error on my part): I ran maker on
the improved assembly the first time and maker did not complete in the
48 hours I allocated. But I had 19,000+ transcripts in the
unfinished output. When I reran maker, just changing the time
allocated, it completed much faster, but is giving much fewer
transcripts and proteins as output. Could something like this happen?
If not, then I'm guessing I must have changed something although I'm
pretty sure that I did not change anything other than the time
allocated. I've attached the trascripts and proteins files from the
first time I ran maker on my improved assembly.
Thanks again for your help
Dhivya
On Mar 13, 2014, at 11:14 AM, Carson Holt wrote:
> Note protein/transcript fasts are only created when there are gene
> models to output to those files (so their absence means there were
> no gene models for that contig). Most sequences without protein/
> transcript fasts in your sample are very short and thus don’t
> contain anything. What is left either have no est2genome results or
> the est2genome alignments do not have sufficient open reading frame
> to be turned into a gene model (false merging of regions by trinity
> can cause this, so make sure you use the jaccard index option when
> assembling reads with trinity to avoid this).
>
> You are using only the est2genome=1 option. This will result in a
> limited set of genes that can be used for training SNAP/Augustus (so
> not getting results on all contigs is expected). You really won’t
> get much as far as results until you have one of the ab initio
> predictors turned on.
>
> Thanks,
> Carson
>
>
> From: dhivya arasappan <darasappan at gmail.com>
> Date: Tuesday, March 11, 2014 at 8:52 AM
> To: Carson Holt <carsonhh at gmail.com>
> Cc: Daniel Ence <dence at genetics.utah.edu>
> Subject: Re: maker output- transcripts.fasta and proteins.fasta
> files missing
>
> Alright done. My username is daras
>
> Thanks
> Dhivya
>
> On Mar 10, 2014, at 5:10 PM, Carson Holt wrote:
>
>> Input and compressed file of output.
>>
>> Thanks,
>> Carson
>>
>> From: dhivya arasappan <darasappan at gmail.com>
>> Date: Monday, March 10, 2014 at 2:09 PM
>> To: Carson Holt <carsonhh at gmail.com>
>> Cc: Daniel Ence <dence at genetics.utah.edu>
>> Subject: Re: maker output- transcripts.fasta and proteins.fasta
>> files missing
>>
>> Hi Carson,
>>
>> Do you mean the whole maker output?
>>
>> Thanks
>> dhivya
>>
>> On Mar 10, 2014, at 4:55 PM, Carson Holt wrote:
>>
>>> Could you upload everything here —> http://weatherby.genetics.utah.edu/cgi-bin/mwas/bug.cgi
>>>
>>> Than send us the link generated or your user ID.
>>>
>>> Thanks,
>>> Carson
>>>
>>>
>>>
>>> From: dhivya arasappan <darasappan at gmail.com>
>>> Date: Monday, March 10, 2014 at 1:50 PM
>>> To: Carson Holt <carsonhh at gmail.com>, Daniel Ence <dence at genetics.utah.edu
>>> >
>>> Subject: Fwd: maker output- transcripts.fasta and proteins.fasta
>>> files missing
>>>
>>> Hi Carson and Daniel,
>>>
>>> I'm sending this across to you separately since maker list is
>>> blocking my email due to attachment size.
>>>
>>> As always, thanks for any guidance you can provide.
>>> Dhivya
>>>
>>>
>>> Begin forwarded message:
>>>
>>>> From: dhivya arasappan <darasappan at gmail.com>
>>>> Date: March 10, 2014 3:14:03 PM CDT
>>>> To: maker-devel at yandell-lab.org
>>>> Subject: maker output- transcripts.fasta and proteins.fasta files
>>>> missing
>>>>
>>>> Hello,
>>>>
>>>> I've been running maker with different assembly files, reference
>>>> files etc and I check the output by:
>>>>
>>>> 1. concatenating the gff files
>>>> 2. concatenating the *transcripts.fasta files
>>>> 3. concatenating the *proteins.fasta files
>>>>
>>>> I'm noticing that when I ran maker twice with same parameters,
>>>> the second time around, many of the output subdirectories do not
>>>> have a *transcripts.fasta or *proteins.fasta file in it.
>>>> There are 251 subdirectories and only 97 of them have all 3
>>>> output files. Maker log looks ok to me, but I've attached it
>>>> here as well.
>>>>
>>>> What could be the reason for this?
>>>>
>>>> Thanks
>>>> dhivya
>>>>
>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140313/2048cfef/attachment-0006.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: transcripts.cat.fasta.old.gz
Type: application/x-gzip
Size: 7927581 bytes
Desc: not available
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140313/2048cfef/attachment.tgz>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140313/2048cfef/attachment-0007.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: proteins.cat.fasta.old.gz
Type: application/x-gzip
Size: 3668381 bytes
Desc: not available
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140313/2048cfef/attachment-0001.tgz>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140313/2048cfef/attachment-0008.html>
More information about the maker-devel
mailing list