<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/>
</head><body style="">
<div>
Hi Carson,
</div>
<div>
</div>
<div>
Thank you for the quick answer.
</div>
<div>
I ran gff3_merge to merge all the gff files and this resulted in a gff file, which has these type of fields:
</div>
<div>
scaffold32239 blastx protein_match 22905 34500 174 + . ID=scaffold32239:hit:976144;Name=ENSTGUG00000000198|ENSTGUT00000000219|DSCAML1-2039;
<br/>scaffold32239 blastx match_part 22905 23045 174 + . ID=scaffold32239:hsp:2806529;Parent=scaffold32239:hit:976144;Name=ENSTGUG00000000198|ENSTGUT00000000219|DSCAML1-2039;Target=ENSTGUG00000000198|ENSTGUT00000000219|DSCAML1-2039 172 218;Gap=M47;
</div>
<div>
In comparison to the dpp_contig test file, I am missing est2genome evidence, most probably because my est data set is pretty poor. I have blastx and protein2genome evidence though.
</div>
<div>
</div>
<div>
My goal is to extract the genes that could be annotated on the scaffolds. In the gff files the hits overlap most of the times, I can visualize this properly in apollo: for example one scaffold hits DSCAML gene in both zebrafinch and chicken, but extracting the coordinates between which this scaffold fits this annotated gene is difficult from the gff. Manually curating the genes is also not an option, since I am trying to do this for a 1.7Gb genome.
</div>
<div>
</div>
<div>
I hope this explains better what we are after.
</div>
<div>
</div>
<div>
Thank you once again.
</div>
<div>
</div>
<div>
Best regards,
</div>
<div>
</div>
<div>
Diana
<br/>On May 10, 2013 at 6:13 PM Carson Holt <carsonhh@gmail.com> wrote:
</div>
<div style="position: relative;">
<blockquote style="margin-left: 0px; padding-left: 10px; border-left: solid 1px blue;" type="cite">
<div>
I'm sorry I don’t' understand question 1. You are you missing resulting fasta files, correct? Did your resulting GFF3 file have any features of type "gene"? Did you run fasta_merge after running gff3_merge?
</div>
<div>
</div>
<div>
Could you give me more details on what you are trying to do, so I can take a stab at question 2 as well.
</div>
<div>
</div>
<div>
Thanks,
</div>
<div>
Carson
</div>
<div>
</div>
<div>
</div>
<div>
</div>
<div style="font-family: Calibri; font-size: 11pt; text-align: left; color: black; border-bottom: none; border-left: none; padding-bottom: 0in; padding-left: 0in; padding-right: 0in; border-top: #b5c4df 1pt solid; border-right: none; padding-top: 3pt;">
<span style="font-weight: bold;">From: </span> Diana LeDuc <
<a href="mailto:diana_leduc@eva.mpg.de">diana_leduc@eva.mpg.de</a>>
<br/>
<span style="font-weight: bold;">Reply-To: </span> Diana LeDuc <
<a href="mailto:diana_leduc@eva.mpg.de">diana_leduc@eva.mpg.de</a>>
<br/>
<span style="font-weight: bold;">Date: </span> Friday, 10 May, 2013 10:44 AM
<br/>
<span style="font-weight: bold;">To: </span> <
<a href="mailto:maker-devel@yandell-lab.org">maker-devel@yandell-lab.org</a>>
<br/>
<span style="font-weight: bold;">Cc: </span> Gabriel Renaud <
<a href="mailto:gabriel_renaud@eva.mpg.de">gabriel_renaud@eva.mpg.de</a>>, Janet Kelso <
<a href="mailto:kelso@eva.mpg.de">kelso@eva.mpg.de</a>>, Torsten Schoeneberg <
<a href="mailto:torsten.schoeneberg@medizin.uni-leipzig.de">torsten.schoeneberg@medizin.uni-leipzig.de</a>>
<br/>
<span style="font-weight: bold;">Subject: </span> [maker-devel] Maker consensus
</div>
<div>
</div>
<div>
<div>
<div>
<p>Dear maker developers,</p>
<p>I am a phD student working on de novo assembly and annotation of a bird genome. I used Maker as annotation pipeline, which ran very well, and I obtained different annotations with evidence from Augustus gene predictor, small EST dataset from my organism and protein sequences from chicken, turkey and zebrafinch. I could combine the different gff files from different scaffolds into one gff file with annotations for the entire genome.</p>
<p>I now have two questions:</p>
<p>1. What could be the reason that I haven't gotten the protein.fasta and trancript.fasta files</p>
<p>2. How can I obtain a consensus gene list of different evidences from maker? What I would actually need is the scaffold, coordinates and annotation (gene name) according to the 3 other bird species.</p> Thank you in advance.
</div>
<div>
</div>
<div>
Best regards,
</div>
<div>
</div>
<div>
Diana Le Duc
</div>
<div>
</div>
<div>
--
<br/>
<br/>Max Planck Institute for Evolutionary Anthropology
<br/>Department of Evolutionary Genetics
<br/>Deutscher Platz 6
<br/>D-04103 Leipzig
<br/>
<br/>Phone +49 (0)341-3550-554
<br/>
<a target="_blank" href="http://www.eva.mpg.de">www.eva.mpg.de</a>
</div>
</div>
</div>
<span>_______________________________________________ maker-devel mailing list <a href="mailto:maker-devel@box290.bluehost.com">maker-devel@box290.bluehost.com</a> <a href="http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org">http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org</a> </span>
</blockquote>
<br/>
</div>
</body></html>