<div dir="ltr"><div><div>Thank you! At first I built the database from both the canonical and isoform protein sequences from Swiss-Prot, which produced the error. When I rebuilt it using only the canonical protein sequences, it worked.<br><br></div>Best<br></div>Quanwei <br></div><div class="gmail_extra"><br><div class="gmail_quote">2017-02-20 18:56 GMT-05:00 Carson Holt <span dir="ltr"><<a href="mailto:carsonhh@gmail.com" target="_blank">carsonhh@gmail.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">You used either TrEMBL or the UniProtKB isoform sequence set. Their FASTA headers are slightly different and will not be parsed correctly.<br>
<br>
For example, here is the header for the same sequence as formatted in the Swiss-Prot dataset download:<br>
>sp|Q7Z5M8|AB12B_HUMAN Protein ABHD12B OS=Homo sapiens GN=ABHD12B PE=2 SV=1<br>
<br>
I think you used the UniProtKB Isoform sequence dataset instead.<br>
<br>
—Carson<br>
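<br>
As a rough illustration of the difference between the two header styles quoted in this thread, the Python sketch below flags headers that do not look like canonical Swiss-Prot entries. It is not MAKER's own parsing code: the pattern and the function name is_canonical_header are assumptions made for this example, based only on the two headers shown here (the canonical one ends in PE= and SV= fields, and its accession has no "-N" suffix).<br>

```python
import re

# Hypothetical check, NOT the regex used by maker_functional_gff.
# Assumption: a canonical Swiss-Prot header has an accession without a
# "-N" isoform suffix and ends with "PE=digits SV=digits".
CANONICAL_HEADER = re.compile(r"^>sp\|[A-Z0-9]+\|\S+ .* PE=\d+ SV=\d+$")


def is_canonical_header(header: str) -> bool:
    """Return True if the header matches the canonical pattern above."""
    return CANONICAL_HEADER.match(header.strip()) is not None


# The two headers quoted in this thread.
headers = [
    ">sp|Q7Z5M8|AB12B_HUMAN Protein ABHD12B OS=Homo sapiens GN=ABHD12B PE=2 SV=1",
    ">sp|Q7Z5M8-2|AB12B_HUMAN Isoform 2 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B",
]

for header in headers:
    status = "canonical" if is_canonical_header(header) else "needs attention"
    print(status, header.split()[0])
```

Running a check like this over the FASTA file before building the BLAST database would show how many entries use the isoform style.<br>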
<div><div class="h5"><br>
> On Feb 13, 2017, at 8:16 AM, Quanwei Zhang <<a href="mailto:qwzhang0601@gmail.com">qwzhang0601@gmail.com</a>> wrote:<br>
><br>
> Hello:<br>
><br>
> I am trying to add putative gene functions to the predicted gene models. First, I used UniProt/Swiss-Prot protein sequences to build the database: I ran "makeblastdb" on the canonical and isoform proteins of human, mouse and rat. I then ran "blastp" to generate "maker2uni.blastp", whose contents are shown below.<br>
> maker-CasCan_contig_64815-<wbr>snap-gene-0.0-mRNA-1 sp|Q6P5S2|LEG1H_HUMAN 69.97 303 91 0 1 303 1 303 7e-164 464<br>
> snap_masked-CasCan_contig_<wbr>14203-processed-gene-0.10-<wbr>mRNA-1 sp|Q91ZA8|NRARP_MOUSE 99.12 114 1 0 1 114 1 114 3e-80 236<br>
><br>
> After that, I tried to add the protein homology data to the MAKER GFF3 and FASTA files with maker_functional_gff and maker_functional_fasta, but got the messages below.<br>
><br>
> Can't parse details from FASTA header: >sp|Q7Z5M8-2|AB12B_HUMAN Isoform 2 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B<br>
><br>
> Use of uninitialized value $id in hash element at /public/apps/MAKER/2.31.9/bin/<wbr>maker_functional_gff line 139, <$IN> line 39.<br>
> Use of uninitialized value $id in hash element at /public/apps/MAKER/2.31.9/bin/<wbr>maker_functional_gff line 141, <$IN> line 39.<br>
> Can't parse details from FASTA header: >sp|Q7Z5M8-4|AB12B_HUMAN Isoform 4 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B<br>
><br>
> Use of uninitialized value $id in hash element at /public/apps/MAKER/2.31.9/bin/<wbr>maker_functional_gff line 139, <$IN> line 45.<br>
> Use of uninitialized value $id in hash element at /public/apps/MAKER/2.31.9/bin/<wbr>maker_functional_gff line 141, <$IN> line 45.<br>
> Can't parse details from FASTA header: >sp|Q7Z5M8-5|AB12B_HUMAN Isoform 5 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B<br>
> .....<br>
><br>
> I am not sure how to deal with this. I followed the command given in the protocol. Any suggestions?<br>
><br>
> Thanks<br>
><br>
> Best<br>
> Quanwei<br>
</div></div>> ______________________________<wbr>_________________<br>
> maker-devel mailing list<br>
> <a href="mailto:maker-devel@box290.bluehost.com">maker-devel@box290.bluehost.<wbr>com</a><br>
> <a href="http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org" rel="noreferrer" target="_blank">http://box290.bluehost.com/<wbr>mailman/listinfo/maker-devel_<wbr>yandell-lab.org</a><br>
<br>
</blockquote></div><br></div>
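<br>
The fix reported at the top of this thread (building the database from canonical sequences only) could also be applied to a mixed FASTA file by dropping the isoform records. The sketch below is one possible way to do that, under the assumption that isoform entries are exactly those whose accession carries a "-N" suffix, as in the headers quoted above. The function name drop_isoforms is made up for this example, and the sequence lines in the sample data are placeholders, not real protein sequences.<br>

```python
import re
from typing import Iterable, Iterator

# Assumption: isoform records have an accession such as "Q7Z5M8-2",
# i.e. a "-N" suffix between the first and second "|" of the header.
ISOFORM_HEADER = re.compile(r"^>\w+\|[A-Z0-9]+-\d+\|")


def drop_isoforms(lines: Iterable[str]) -> Iterator[str]:
    """Yield only the FASTA lines that belong to non-isoform records."""
    keep = True
    for line in lines:
        if line.startswith(">"):
            # Decide once per record, based on its header line.
            keep = ISOFORM_HEADER.match(line) is None
        if keep:
            yield line


# Headers taken from this thread; sequence lines are placeholders.
example = [
    ">sp|Q7Z5M8|AB12B_HUMAN Protein ABHD12B OS=Homo sapiens GN=ABHD12B PE=2 SV=1\n",
    "MPLACEHOLDER\n",
    ">sp|Q7Z5M8-2|AB12B_HUMAN Isoform 2 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B\n",
    "MPLACEHOLDERTWO\n",
]

kept = list(drop_isoforms(example))
print("".join(kept), end="")
```

To filter a real file, the same generator can be fed an open file handle and its output written to a new FASTA file, which is then passed to makeblastdb as in the original message.<br>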