[maker-devel] Possible ways to improve annotated gene numbers
Carson Holt
carsonhh at gmail.com
Mon Jul 3 15:04:40 MDT 2017
MAKER excludes models without evidence support (this is because gene predictors can overcall by as much as a factor of 10, i.e. lots of false positives). So you may be lacking in either protein or transcript evidence (you should alway supply a minimum of 2 related proteomes for any MAKER analysis - transcript evidence by itself is insufficient).
You can also try and rescue models based on protein domain content using iprscan. Details in this protocol paper —> https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4286374/ <https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4286374/>
—Carson
> On Jun 30, 2017, at 1:30 PM, Qihua Liang <qlian003 at ucr.edu> wrote:
>
> Dear Maker Development Team,
>
> Hi, I am using Maker for annotation and BUSCO to evaluate the completeness.
>
> For de novo perditions, I am using Augustus, GeneMark, and SNAP, and the annotated proteins have completeness of ~80%, ~50%, ~50% correspondingly. When I cat all de novo annotated proteins of these three tools, the completeness is much higher as ~92%.
>
> But for all.maker.proteins.fasta, the completeness is only ~80%.
>
> 1. Does this mean that some proteins annotated by Augustus/GeneMark/SNAP, are not included in the file all.maker.proteins.fasta? Does it because such excluded proteins do not have hits with the EST evidences?
>
> 2. To achieve a higher BUSCO completeness, what possible ways can be used? Including more EST evidences from other species?
>
>
> Thank you
> Qihua
> _______________________________________________
> maker-devel mailing list
> maker-devel at box290.bluehost.com
> http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20170703/eb71cb04/attachment-0002.html>
More information about the maker-devel
mailing list