[maker-devel] Problem with cegma2zff

David Powell david.powell at monash.edu
Thu Sep 6 18:23:57 MDT 2012


Greetings,

I am using CEGMA to train SNAP for use with maker.  However, I had a
problem with the cegma2zff script that comes with maker.  This script
converts the gff file from CEGMA into a zff file for SNAP.

The problem is that it was producing a zff file with every multi-exon gene
as being "invalid" from SNAPs point of view.  My fix was to
modify cegma2zff to ignore any feature with the tag "Exon" - as these are
always duplicated by cegma as another feature (one of First, Internal,
Terminal, Single).

Just wanted to post this here in case this fix is useful to anyone else.

Cheers,

-- David Powell

diff --git a/cegma2zff b/cegma2zff
index c795da8..a3bbb77 100755
--- a/cegma2zff
+++ b/cegma2zff
@@ -39,6 +39,8 @@ while(my $line = <IN>){
     my @F = split("\t", $line);
     ($F[3], $F[4]) = ($F[4], $F[3]) if($F[6] eq '-');

+    next if $F[2] =~ /Exon/;
+
     $F[2] =~ s/First/Einit/;
     $F[2] =~ s/Terminal/Eterm/;
     $F[2] =~ s/Internal/Exon/;
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20120907/a2821cd8/attachment-0002.html>


More information about the maker-devel mailing list