[maker-devel] gene:multiple_Einit and overlaps_prev_exon errors in first round of SNAP training

Carneiro,Celine M c143dad at ufl.edu
Wed May 6 11:36:40 MDT 2020


Hello,

I am getting the errors gene:multiple_Einit, gene:multiple_Eterm, and exon:overlaps_prev_exon, at just about every gene model. I've ran the first round of maker on a bird genome I'm annotating with no errors and have started the steps to train SNAP. However, after running fathom -categorize, just about every single gene model has the same set of errors. Here is an example from my log file after running fathom -categorize:

MODEL117 1 1 8 - errors(6): gene:multiple_Einit gene:multiple_Eterm exon-7:overlaps_prev_exon exon-5:overlaps_prev_exon exon-3:overlaps_prev_exon exon-1:overlaps_prev_exon
MODEL851 1 1 100 - errors(78): gene:multiple_Einit gene:multiple_Eterm exon-99:overlaps_prev_exon exon-98:overlaps_prev_exon exon-97:overlaps_prev_exon exon-95:overlaps_prev_exon exon-94:overlaps_prev_exon exon-93:overlaps_prev_exon exon-91:overlaps_prev_exon exon-90:overlaps_prev_exon exon-89:overlaps_prev_exon exon-87:overlaps_prev_exon exon-86:overlaps_prev_exon exon-85:overlaps_prev_exon exon-83:overlaps_prev_exon exon-82:overlaps_prev_exon exon-81:overlaps_prev_exon exon-79:overlaps_prev_exon exon-78:overlaps_prev_exon exon-77:overlaps_prev_exon exon-75:overlaps_prev_exon exon-74:overlaps_prev_exon exon-73:overlaps_prev_exon exon-71:overlaps_prev_exon exon-70:overlaps_prev_exon exon-69:overlaps_prev_exon exon-67:overlaps_prev_exon exon-66:overlaps_prev_exon exon-65:overlaps_prev_exon exon-63:overlaps_prev_exon exon-62:overlaps_prev_exon exon-61:overlaps_prev_exon exon-59:overlaps_prev_exon exon-58:overlaps_prev_exon exon-57:overlaps_prev_exon exon-55:overlaps_prev_exon exon-54:overlaps_prev_exon exon-53:overlaps_prev_exon exon-51:overlaps_prev_exon exon-50:overlaps_prev_exon exon-49:overlaps_prev_exon exon-48:overlaps_prev_exon exon-47:overlaps_prev_exon exon-46:overlaps_prev_exon exon-45:overlaps_prev_exon exon-43:overlaps_prev_exon exon-42:overlaps_prev_exon exon-41:overlaps_prev_exon exon-39:overlaps_prev_exon exon-38:overlaps_prev_exon exon-37:overlaps_prev_exon exon-35:overlaps_prev_exon exon-34:overlaps_prev_exon exon-33:overlaps_prev_exon exon-31:overlaps_prev_exon exon-30:overlaps_prev_exon exon-29:overlaps_prev_exon exon-27:overlaps_prev_exon exon-26:overlaps_prev_exon exon-25:overlaps_prev_exon exon-23:overlaps_prev_exon exon-22:overlaps_prev_exon exon-21:overlaps_prev_exon exon-19:overlaps_prev_exon exon-18:overlaps_prev_exon exon-17:overlaps_prev_exon exon-15:overlaps_prev_exon exon-14:overlaps_prev_exon exon-13:overlaps_prev_exon exon-11:overlaps_prev_exon exon-10:overlaps_prev_exon exon-9:overlaps_prev_exon exon-7:overlaps_prev_exon exon-6:overlaps_prev_exon exon-5:overlaps_prev_exon exon-3:overlaps_prev_exon exon-2:overlaps_prev_exon exon-1:overlaps_prev_exon
MODEL190 1 1 39 + errors(35): gene:multiple_Einit gene:multiple_Eterm exon-2:overlaps_prev_exon exon-3:overlaps_prev_exon exon-4:overlaps_prev_exon exon-5:overlaps_prev_exon exon-6:overlaps_prev_exon exon-7:overlaps_prev_exon exon-8:overlaps_prev_exon exon-9:overlaps_prev_exon exon-11:overlaps_prev_exon exon-12:overlaps_prev_exon exon-13:overlaps_prev_exon exon-14:overlaps_prev_exon exon-15:overlaps_prev_exon exon-16:overlaps_prev_exon exon-17:overlaps_prev_exon exon-18:overlaps_prev_exon exon-20:overlaps_prev_exon exon-21:overlaps_prev_exon exon-22:overlaps_prev_exon exon-23:overlaps_prev_exon exon-24:overlaps_prev_exon exon-25:overlaps_prev_exon exon-26:overlaps_prev_exon exon-27:overlaps_prev_exon exon-29:overlaps_prev_exon exon-30:overlaps_prev_exon exon-32:overlaps_prev_exon exon-33:overlaps_prev_exon exon-34:overlaps_prev_exon exon-35:overlaps_prev_exon exon-36:overlaps_prev_exon exon-38:overlaps_prev_exon exon-39:overlaps_prev_exon
MODEL424 1 1 10 - errors(8): gene:multiple_Einit gene:multiple_Eterm exon-9:overlaps_prev_exon exon-7:overlaps_prev_exon exon-6:overlaps_prev_exon exon-5:overlaps_prev_exon exon-3:overlaps_prev_exon exon-1:overlaps_prev_exon
MODEL902 1 1 20 - errors(14): gene:multiple_Einit gene:multiple_Eterm exon-19:overlaps_prev_exon exon-18:overlaps_prev_exon exon-17:overlaps_prev_exon exon-15:overlaps_prev_exon exon-13:overlaps_prev_exon exon-11:overlaps_prev_exon exon-9:overlaps_prev_exon exon-7:overlaps_prev_exon exon-6:overlaps_prev_exon exon-5:overlaps_prev_exon exon-3:overlaps_prev_exon exon-1:overlaps_prev_exon
MODEL238 1 1 14 - errors(11): gene:multiple_Einit gene:multiple_Eterm exon-13:overlaps_prev_exon exon-12:overlaps_prev_exon exon-11:overlaps_prev_exon exon-9:overlaps_prev_exon exon-7:overlaps_prev_exon exon-6:overlaps_prev_exon exon-5:overlaps_prev_exon exon-3:overlaps_prev_exon exon-1:overlaps_prev_exon
MODEL39 1 1 6 - errors(1): exon-3:overlaps_prev_exon
MODEL119 1 1 10 + errors(8): gene:multiple_Einit gene:multiple_Eterm exon-2:overlaps_prev_exon exon-4:overlaps_prev_exon exon-6:overlaps_prev_exon exon-7:overlaps_prev_exon exon-8:overlaps_prev_exon exon-10:overlaps_prev_exon

Furthermore, I checked my genome.ann file and noticed that my Einit and Exon sites are duplicated. For example:

>ScdimlH_1004;HRSCAF=1084
Einit  38730 38677 MODEL851
Exon  38255 38178 MODEL851
Exon  38074 38021 MODEL851
Exon  24755 24717 MODEL851
Exon  24213 24149 MODEL851
Exon  23176 23098 MODEL851
Exon  22037 21961 MODEL851
Exon  21269 21080 MODEL851
Exon  20232 20167 MODEL851
Exon  19742 19704 MODEL851
Exon  14705 14590 MODEL851
Exon  14255 13980 MODEL851
Exon  14169 13980 MODEL851
Exon  13303 13223 MODEL851
Exon  13303 13223 MODEL851
Exon  12782 12639 MODEL851
Exon  12782 12639 MODEL851
Exon  5761 5592 MODEL851
Exon  5482 5404 MODEL851
Exon  5140 5064 MODEL851
Exon  4951 4750 MODEL851
Exon  4567 4502 MODEL851
Exon  4256 4185 MODEL851
Exon  3569 3403 MODEL851
Exon  3157 3076 MODEL851
Exon  2936 2800 MODEL851
Eterm  2186 2000 MODEL851
Einit  38730 38677 MODEL851
Exon  38255 38178 MODEL851
Exon  38074 38021 MODEL851
Exon  24755 24717 MODEL851
Exon  24213 24149 MODEL851
Exon  23176 23098 MODEL851
Exon  22037 21961 MODEL851
Exon  21269 21080 MODEL851
Exon  20232 20167 MODEL851
Exon  19742 19704 MODEL851
Exon  14705 14590 MODEL851
Exon  14255 13980 MODEL851
Exon  14169 13980 MODEL851
Exon  13303 13223 MODEL851
Exon  13303 13223 MODEL851
Exon  12782 12639 MODEL851
Exon  12782 12639 MODEL851
Exon  5761 5592 MODEL851
Exon  5482 5404 MODEL851
Exon  5140 5064 MODEL851
Exon  4951 4750 MODEL851
Exon  4567 4502 MODEL851
Exon  4256 4185 MODEL851
Exon  3569 3403 MODEL851
Exon  3157 3076 MODEL851
Exon  2936 2800 MODEL851
Eterm  2186 2000 MODEL851

Any ideas why I'm seeing this duplication? Lastly, any ideas why my exons are overlapping so much? I appreciate any input and please let me know if you require any more information.

Thank you!

Celine


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20200506/f429999e/attachment-0003.html>


More information about the maker-devel mailing list