<div dir="ltr">Hi,<div><br></div><div>I am attempting to annotate a plant genome. I have a couple of questions:</div><div><br></div><div><b>1) RNA-seq assembly</b></div><div>a) I assembled my RNA-seq data using Trinity and StringTie. The two produce drastically different numbers. When I compare the two assemblies for each sample using TransRate, StringTie produces a higher score. for most of the assemblies. I see in all of the threads that you recommend Trinity but doesn't trinity produce way too many transcripts (even after chucking out the "bad" ones using transrate).</div><div>b) During hint creation in MAKER, does it take into account that different transcripts have different read coverage (expression levels). I guess my question is should I filter transcripts that have a small read coverage.</div><div><br></div><div><b>2) Repeat Masking </b></div><div>I am following the advanced repeat library construction tutorial (<a href="http://weatherby.genetics.utah.edu/MAKER/wiki/index.php/Repeat_Library_Construction-Advanced">http://weatherby.genetics.utah.edu/MAKER/wiki/index.php/Repeat_Library_Construction-Advanced</a>). The initial steps find 15 sequences for the LTR and 159 for MITE. But, when I get to the perl DIR_CRL/CRL_Step4.pl step, both output files (Inner_Seq_For_BLAST.fasta, lLTRs_Seq_For_BLAST.fasta) are empty.</div><div><br></div><div>a) are these numbers normal because I was expecting a lot more than 16 for the LTR? </div><div>b) I don't get any errors when I run CRL_Step4.pl yet no output. What's going on?!</div><div><br></div><div>Many thanks,</div><div>/SB</div></div><div dir="ltr">-- <br></div><div data-smartmail="gmail_signature"><p dir="ltr">____________________________<br>
Sent from Inbox Mobile</p>
</div>