[maker-devel] Size of initial EST training set for SNAP
Carson Holt
carsonhh at gmail.com
Tue Mar 18 10:14:29 MDT 2014
That sounds good. 1,500 initial models should be more than sufficient for
the first round of training.
—Carson
From: Felipe Barreto <fbarreto at ucsd.edu>
Date: Tuesday, March 18, 2014 at 10:08 AM
To: MAKER group <maker-devel at yandell-lab.org>
Subject: [maker-devel] Size of initial EST training set for SNAP
Hi, all,
I've been learning a lot from reading posts from this group, and finally
started doing actual runs of Maker on our current genome assembly
(arthropod, genome size ~230Mb). I started by training SNAP, but would like
to check my approach before continuing with longer runs.
>From our full set of ~40,000 ESTs (RNA-seq assembly), I chose ~2000 that I
deemed of very high quality based on blast alignments to Swiss-Prot (based
on query-subject coverage, bit score, etc). I then used only these 2000
ESTs in a first Maker run using est2genome=1. The output returned 1500
models (with the 500 "missing" models probably a result of single-exon
issues; not a concern at this point).
I now plan on training SNAP with this first output, and then doing another
Maker run now using: 1) all ESTs (but est2genome=0), 2) my chosen protein
evidence, and 3) SNAP with the first HMM file. The output of this second
run will be used to re-train SNAP, and this second HMM file will be used in
a final "official" run (while continuing to provide the EST and protein
evidence, of course).
Does this sound like a reasonable approach? Simply put, my main concern is
whether I'm using too few ESTs in my first est2genome step.
Thanks for any insight!
--
Felipe Barreto
Post-doctoral Scholar
Scripps Institution of Oceanography
University of California, San Diego
La Jolla, CA 92093
_______________________________________________ maker-devel mailing list
maker-devel at box290.bluehost.com
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20140318/2cd5fce1/attachment-0003.html>
More information about the maker-devel
mailing list