[maker-devel] duplicate CDS in annotation

Barry Moore barry.moore at genetics.utah.edu
Mon Mar 11 05:32:44 MDT 2013


Hi Sasha,

This gene model appears to be correctly formatted to me.  In GFF3 format the CDS features are allowed to span multiple lines and they share the same ID to indicate that it is all the same features.  See the GFF3 specification on the Sequence Ontology website (http://www.sequenceontology.org/resources/gff3.html), and in particular the description of the ID attribute specifies:

ID Indicates the ID of the feature. IDs for each feature must be unique within the scope of the GFF file. In the case of discontinuous features (i.e. a single feature that exists over multiple genomic locations) the same ID may appear on multiple lines. All lines that share an ID collectively represent a single feature. 

So each of those CDS lines forms one part of the single CDS feature for this gene.

B
 
On Mar 11, 2013, at 3:46 AM, Sasha Mikheyev wrote:

> Dear Yandell lab,
> 
> I am re-annotating the harvester and genome using protein and RNA-seq data. However, I get many artifacts like the one below. It seems that there are several CDS records that should tie in to the same mRNA, but they are really hanging out separately, and produce several nucleotide sequences with the same name when extracted from the gff. I would appreciate any guidance about how to fix this!
> 
> Thank you,
> 
> Sasha
> 
> grep "pbar_scf7180000350377:hit:2506" Pbar.2.0.gff 
> pbar_scf7180000350377	protein2genome	protein_match	172004	172162	150	-	.	ID=pbar_scf7180000350377:hit:2506;Name=Hsal|HS9704;score=150;
> pbar_scf7180000350377	protein2genome	match_part	172004	172162	150	-	.	ID=pbar_scf7180000350377:hsp:2798;Parent=pbar_scf7180000350377:hit:2506;Name=Hsal|HS9704;Target=Hsal|HS9704 1 53 +;Gap=M159;
> pbar_scf7180000350377	maker	mRNA	538308	558769	.	+	.	ID=pbar_scf7180000350377:hit:2506;Parent=augustus_masked-pbar_scf7180000350377-abinit-gene-5.29;Name=augustus_masked-pbar_scf7180000350377-abinit-gene-5.29-mRNA-1;_AED=0.48;_eAED=0.39;_QI=0|0|0|0.5|1|1|6|0|395;score=0.01;
> pbar_scf7180000350377	maker	exon	538308	538334	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:305;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	exon	538748	538968	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:306;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	exon	539842	540242	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:307;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	exon	542624	542798	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:308;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	exon	555823	556025	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:309;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	exon	558609	558769	0.01	+	.	ID=pbar_scf7180000350377:hit:2506:exon:310;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	538308	538334	.	+	0	ID=pbar_scf7180000350377:hit:2506:cds:305;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	538748	538968	.	+	0	ID=pbar_scf7180000350377:hit:2506:cds:306;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	539842	540242	.	+	1	ID=pbar_scf7180000350377:hit:2506:cds:307;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	542624	542798	.	+	2	ID=pbar_scf7180000350377:hit:2506:cds:308;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	555823	556025	.	+	1	ID=pbar_scf7180000350377:hit:2506:cds:309;Parent=pbar_scf7180000350377:hit:2506;
> pbar_scf7180000350377	maker	CDS	558609	558769	.	+	2	ID=pbar_scf7180000350377:hit:2506:cds:310;Parent=pbar_scf7180000350377:hit:2506;
> 
> _______________________________________________
> maker-devel mailing list
> maker-devel at box290.bluehost.com
> http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org

Barry Moore
Research Scientist
Dept. of Human Genetics
University of Utah
Salt Lake City, UT 84112
--------------------------------------------
(801) 585-3543




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20130311/faf74c55/attachment-0003.html>


More information about the maker-devel mailing list