[maker-devel] About split genes in MAKER annotation
Prashant Narendra SHINGATE
prashantns at imcb.a-star.edu.sg
Wed Dec 12 03:29:17 MST 2018
Hi Carson,
I am Prashant a Bioinformatics postdoctoral fellow from Prof B Venkatesh's lab, IMCB, A*STAR. I am using MAKER-tool to annotate an invertebrate genome (~2Gb). During annotation process, we found several instances of split genes even though we have full-length reference protein sequences from very closely related species. Hence we decided to look at one of the loci to understand the reason behind it and to optimize the parameters.
We looked at a gene ~110kb long and codes for a ~1200 amino acid protein. We have a highly identical reference protein (>90% identity and 100% coverage) from another species. In addition we also have a high coverage Trinity transcript assembly from our species. Still, this gene is split into 4 fragments during evidence-based MAKER run. On closer a look, we found that the above mentioned closely related protein is not aligned by exonerate (protein2genome) even though it is the closest protein to this gene in our dataset. It looks like the program is giving more weightage to transcripts which are typically fragments of the gene. So we are at a loss as to how to predict this gene in full.
For your reference, I am herewith enclosing maker_opts.ctl file and maker_bopts.ctl. I will be glad to share the scaffold sequence and other input files if required.
Can you please help me to understand the reason behind MAKER not able to use the full-length reference protein for gene prediction and how we can overcome this problem.
Thanks for your time and help.
Best regards,
Prashant Shingate, PhD<mailto:prashantns at imcb.a-star.edu.sg> :: Research Fellow :: Comparative and Medical Genomics Lab :: Institute of Molecular and Cell Biology (IMCB) :: Agency for Science, Technology and Research (A*STAR)
61 Biopolis Drive :: #05-04 Proteos :: Singapore 138673 :: DID (+65) 6586 9570 :: Fax (+65) 6779 1117:: http://www.imcb.a-star.edu.sg/
We advance science and develop innovative technology to further economic growth and improve lives.
Note: This message may contain confidential information. If this Email/Fax has been sent to you by mistake, please notify the sender and delete it immediately. Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20181212/14c5d5ca/attachment-0002.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: maker_opts.ctl
Type: application/octet-stream
Size: 5147 bytes
Desc: maker_opts.ctl
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20181212/14c5d5ca/attachment-0004.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: maker_opts.ctl
Type: application/octet-stream
Size: 5147 bytes
Desc: maker_opts.ctl
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20181212/14c5d5ca/attachment-0005.obj>
More information about the maker-devel
mailing list