[maker-devel] The origin of te_proteins.fasta

Ole Kristian Tørresen ole.toerresen at gmail.com
Wed Sep 30 13:00:23 MDT 2015


Hi,
the file te_proteins.fasta is distributed with MAKER and is suggested as a
way to find more divergent transposable elements by searching in protein
level instead of at nucleotide level. I've been unable to find any
information about it's creation, and whether or not it has been kept
current. There is a file with mobile elements derived proteins distributed
with RepBase, called RepeatPeps.lib, which seem to contain the same amount
of sequences (about 9.4 Mbp in both), but half the number (10500 vs 25000).

Does anyone know how these two files compare? Could I use RepeatPeps.lib
instead, or combine them (with some clustering maybe?)?

Thank you.

Sincerely,
Ole Kristian Tørresen
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20150930/05eb4fea/attachment-0002.html>


More information about the maker-devel mailing list