<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:#954F72;
text-decoration:underline;}
p.msonormal0, li.msonormal0, div.msonormal0
{mso-style-name:msonormal;
mso-margin-top-alt:auto;
margin-right:0in;
mso-margin-bottom-alt:auto;
margin-left:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
span.EmailStyle18
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72">
<div class="WordSection1">
<p class="MsoNormal">Hi Lior,<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Fun! The short answer is I don’t know. Obviously, the good stuff is on the right side of 0.5.
<span style="font-size:12.0pt"><o:p></o:p></span></p>
<p class="MsoNormal">That said, I can think of a couple of things to look into to explain the left side of the graph. Are you allowing single-exon genes? Are you using RNA-seq data, protein, or both? What about repeat masking: are you doing it, and do you have your own repeat library? <o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">My first guess would be low-complexity/repeat sequences generating more or less random blastx hits across the genome. Carson, what do you think?<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">And finally, what does the AED look like for the genes included in the final build?
<o:p></o:p></p>
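<p class="MsoNormal">A rough sketch of how to pull those numbers out, assuming the final build is a MAKER GFF3 whose mRNA features carry an _AED= attribute reported to two decimals. The function names aed_values and aed_histogram are made up for illustration. Binning is done on integer hundredths so that a value sitting exactly on a bin edge, such as 0.50, cannot slip into the neighbouring bin through floating-point rounding.<o:p></o:p></p>
<pre>
```python
import re
from collections import Counter

# Matches the _AED attribute in GFF3 column 9 (assumed MAKER convention).
AED_RE = re.compile(r"(?:^|;)_AED=([0-9.]+)")


def aed_values(gff_lines):
    """Yield the _AED value of every mRNA feature found in GFF3 lines."""
    for line in gff_lines:
        if line.startswith("#"):
            continue  # skip directives and comments
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != "mRNA":
            continue  # only transcript-level features are counted here
        match = AED_RE.search(cols[8])
        if match:
            yield float(match.group(1))


def aed_histogram(values, bin_hundredths=5):
    """Count AED values per bin and return {bin_index: count}.

    Assumes bin_hundredths divides 100 evenly (5 means 0.05-wide bins).
    """
    n_bins = 100 // bin_hundredths
    counts = Counter()
    for value in values:
        hundredths = round(value * 100)  # integer arithmetic from here on
        # AED = 1.00 is folded into the last bin instead of a bin of its own.
        counts[min(hundredths // bin_hundredths, n_bins - 1)] += 1
    return counts
```
</pre>
<p class="MsoNormal">Calling aed_histogram(aed_values(open("final_build.gff"))), with a hypothetical file name, gives per-bin counts. With the default 0.05-wide bins, bin 10 covers AED 0.50 to 0.54, so the counts in bins 9 and 10 show directly how sharp the surge and the drop are.<o:p></o:p></p>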
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Sorry for all the questions, Lior. That’s your punishment for asking an interesting one.
<span style="font-family:'Apple Color Emoji'">😉</span><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">--mark<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-size:12.0pt;color:black">From: </span></b><span style="font-size:12.0pt;color:black">maker-devel &lt;maker-devel-bounces@yandell-lab.org&gt; on behalf of Lior Glick &lt;liorglic@mail.tau.ac.il&gt;<br>
<b>Date: </b>Sunday, April 7, 2019 at 7:26 AM<br>
<b>To: </b>"maker-devel@yandell-lab.org" &lt;maker-devel@yandell-lab.org&gt;<br>
<b>Subject: </b>[maker-devel] Curious pattern in AED distributions<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<div>
<p class="MsoNormal">Hi MAKER users,<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Lately I've been performing annotations for multiple genomes from the same species.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">When plotting the histogram of AED scores over all genes, I repeatedly see a very specific pattern that looks something like this:<o:p></o:p></p>
</div>
<div>
<div>
<div>
<p class="MsoNormal"><img width="497" height="301" style="width:5.177in;height:3.1354in" id="_x0000_i1025" src="cid:image001.png@01D4ED21.E6C1A0E0" alt="AED_hist.png"><o:p></o:p></p>
</div>
</div>
<div>
<p class="MsoNormal">This pattern is a bit surprising to me in two respects:<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">1) Why is there a surge towards 0.5?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">2) Why is there a sudden drop right after that surge?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Has anyone else seen this, or is this a specific outcome of my data/configuration?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Any ideas of what may cause such a distribution?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">While this is not necessarily an indication of a problem or bug, it does seem a bit odd, and might imply some bias or artifact.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">I would appreciate your comments.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Thank you!<o:p></o:p></p>
</div>
</div>
</div>
</div>
</body>
</html>