<html><head><meta http-equiv="Content-Type" content="text/html charset=us-ascii"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div>Dear all,</div><div><br></div><div>First of all, I'd like to thank everyone in this forum for all the tips and comments on the best strategies for running MAKER, they have been really helpful so far.</div><div>However, I still don't fully understand the behaviour of MAKER when ran iteratively, and I compare the predictions from each round. Let me explain:</div><div><br></div><div>My input data are the following:</div><div>- the repeat-masked genome of a vertebrate (~2Gb);</div><div>- mRNA data for this species mapped to the genome with tophat2 and assembled into transcripts with cufflinks;</div><div>- exonerate-mapped proteins in gff3 format to the reference genome, from closely related species (global alignment)</div><div><br></div><div>For the first round of MAKER, I provided both cufflinks and exonerate-mapped proteins with the options est2genome and protein2genome = 1. From maker output, I generated the SNAP .hmm file (as the instructions in <a href="http://gmod.org/wiki/MAKER_Tutorial">http://gmod.org/wiki/MAKER_Tutorial</a>) and provided it as input to the second round of MAKER.</div><div>For this second round I still gave cufflinks + exonerated proteins, but switched both est2genome ad protein2genome to 0. After finished, I generated SNAP .hmm once more and provided it for the 3rd and final round of MAKER, along with cufflinks and exonerated-mapped prots and est/prot2genome=0</div><div><br></div><div>As sort of a sanity check, I went on and ran a 4th round of MAKER with the SNAP .hmm file from round3, cufflinks and exonerated-mapped prots and est/prot2genome=0, and this time specifying alt_splice=1.</div><div>For all the rounds, I also specified single_exon=1.</div><div><br></div><div><br></div><div>I loaded the gene predictions from each round plus the cufflink transcripts and the exonerated proteins to the genome browser to visually inspect the output. I saw a few strange cases where MAKER doesn't seem to use the protein/mRNA evidences for the gene predictions, and I would greatly appreciate any feedback/ideas on what I could possible be doing wrong. Here are a few screenshots so you know what I'm talking about:</div><div><br></div><div>In this first example, MAKER misses a conserved exon for which there is both protein and mRNA evidence, and only if I specify alt_splice I get the exon 'back'.</div><div><br></div><div><img height="458" width="1012" apple-width="yes" apple-height="yes" id="598d4de4-caa0-4f94-9e73-6f9c0872aba5" src="cid:2E1CA7B1-EA4D-4408-80FE-A3DCC194F64D@t-mobile.de"></div><div><br></div><div>In this second example, MAKER completely ignores lots of exons, all conserved across vertebrates, and supported by protein/mRNA evidence.</div><div><br></div><div><img height="440" width="1012" apple-width="yes" apple-height="yes" id="12463682-5a6a-4b96-ae81-1561ded9867d" src="cid:84DA9F9A-3E2A-46C4-86EF-5583D27F61E2@t-mobile.de"></div><div><br></div><div>In the third example, there is no prediction from round1, the one from round2 matches the protein/mRNA evidence, and then in the final round3 and 4, an extra exon appears.</div><div><br></div><div><img height="430" width="1012" apple-width="yes" apple-height="yes" id="8ffe1ed1-5737-4e80-afc1-171267cb8e07" src="cid:90905F87-EBCD-4247-AF49-EBF7344D9286@t-mobile.de"></div><div><br></div><div><br></div><div>(hope you'l be able to see the images above)</div><div>As I said, I would greatly appreciate any feedback on these strange cases. Perhaps I'm missing some parameter(s)?</div><div><br></div><div>Thanks a lot.</div><div>All the best,</div><div>Juliana</div></body></html>