[maker-devel] mapping cDNA to updated genome
Prashant S Hosmani
psh65 at cornell.edu
Tue Sep 20 14:33:42 MDT 2016
Hi Mike and Carson,
Thank you for your help. I used masked genome for aligning cDNAs. And yes, this was due to multiple aligning cDNA’s. I guess you could also filter according genes based on the alignment score from gff. I used GMAP (http://research-pub.gene.com/gmap/) to align cDNA on to the updated genome. GMAP has parameters to filter based on alignment scores and also can choose best path per cDNA.
Regards,
Prashant
Prashant Hosmani
Sol Genomics Network
Boyce Thompson Institute, Ithaca, NY, USA
On Aug 31, 2016, at 12:12 PM, Carson Holt <carsonhh at gmail.com<mailto:carsonhh at gmail.com>> wrote:
Also if you have multiple alignments of the same cDNA, you can use the score column of the mRNA feature to see which aligns best. If they have the same score, you will have to disambiguate manually or just remove all copies.
—Carson
On Aug 31, 2016, at 10:10 AM, Michael Campbell <michael.s.campbell1 at gmail.com<mailto:michael.s.campbell1 at gmail.com>> wrote:
Hi Prashant,
I’m almost positive that the additional genes are coming from multiply aligning cDNAs. Did you repeat mask your genome before mapping things forward?
Another thought, what kind of whole genome duplications has your plant been through. it may be that the multiple alignments are to pseudogenes is some stage of decay. If that is the case it would probably be safe to keep the the gene from longest/best aligned cDNA.
Thanks,
Mike
On Aug 31, 2016, at 10:35 AM, Prashant S Hosmani <psh65 at cornell.edu<mailto:psh65 at cornell.edu>> wrote:
Hi All,
I am working on updating a plant genome annotation. I would like to map genes from previous annotation to a new genome build. There is a protocol about this in Campbell et al 2014, current protocols in bioinformatics (basic protocol 4 - Mapping annotations to a new assembly). I followed that protocol exactly with setting est_forward=1. But in output I’m getting large number of genes. My input cDNA fasta contains ~35K genes and after mapping there are ~58K genes.
I’m using maker version 3.0. There are few changes in the genome and I’m not expecting many changes in the mapping previous genes.
Please let me know if there are any other parameters to control mapping of EST’s. I was hoping to get similar number of genes mapped on to new assembly with very few changes.
Thank you for your help in advance.
Prashant
Prashant Hosmani
Sol Genomics Network
Boyce Thompson Institute, Ithaca, NY, USA
_______________________________________________
maker-devel mailing list
maker-devel at box290.bluehost.com<mailto:maker-devel at box290.bluehost.com>
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
_______________________________________________
maker-devel mailing list
maker-devel at box290.bluehost.com<mailto:maker-devel at box290.bluehost.com>
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20160920/4a212ad4/attachment-0002.html>
More information about the maker-devel
mailing list