[maker-devel] failed to assign putative gene function

Quanwei Zhang qwzhang0601 at gmail.com
Tue Feb 21 07:30:35 MST 2017


Thank you! I used both canonical and isoform of protein sequences from
swiss-prot in the beginning. It reported the error, but later I only used
the canonical protein sequences to build the database and then it worked.

Best
Quanwei

2017-02-20 18:56 GMT-05:00 Carson Holt <carsonhh at gmail.com>:

> You either uses TrEMBL or the UniProtKB Isoform sequence set. Their fasta
> headers are slightly different and will not be parsed correctly,
>
> For example, here is the header as formatted for the same sequence in the
> Swiss-prot dataset download —>
> >sp|Q7Z5M8|AB12B_HUMAN Protein ABHD12B OS=Homo sapiens GN=ABHD12B PE=2 SV=1
>
> I think you used the  UniProtKB Isoform sequence dataset instead.
>
> —Carson
>
>
>
>
>
> > On Feb 13, 2017, at 8:16 AM, Quanwei Zhang <qwzhang0601 at gmail.com>
> wrote:
> >
> > Hello:
> >
> > I am trying to add putative gene function to the predicted gene models.
> Firstly, I use uniProt/Swiss-Prot protein sequences to build the database.
> I used canonical and isoform proteins of human, mouse and rat with the
> script "makeblastdb". Then use "blastp" generated "maker2uni.blastp" whose
> context is as below.
> > maker-CasCan_contig_64815-snap-gene-0.0-mRNA-1
> sp|Q6P5S2|LEG1H_HUMAN    69.97    303    91    0    1    303    1    303
> 7e-164    464
> > snap_masked-CasCan_contig_14203-processed-gene-0.10-mRNA-1
> sp|Q91ZA8|NRARP_MOUSE    99.12    114    1    0    1    114    1    114
> 3e-80    236
> >
> > After that, I am trying to add the protein homology data to the Maker
> gff3 and fasta files with maker_functional_gff and maker_functional_fasta,
> but get the reports as below.
> >
> > Can't parse details from FASTA header: >sp|Q7Z5M8-2|AB12B_HUMAN Isoform
> 2 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B
> >
> > Use of uninitialized value $id in hash element at
> /public/apps/MAKER/2.31.9/bin/maker_functional_gff line 139, <$IN> line
> 39.
> > Use of uninitialized value $id in hash element at
> /public/apps/MAKER/2.31.9/bin/maker_functional_gff line 141, <$IN> line
> 39.
> > Can't parse details from FASTA header: >sp|Q7Z5M8-4|AB12B_HUMAN Isoform
> 4 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B
> >
> > Use of uninitialized value $id in hash element at
> /public/apps/MAKER/2.31.9/bin/maker_functional_gff line 139, <$IN> line
> 45.
> > Use of uninitialized value $id in hash element at
> /public/apps/MAKER/2.31.9/bin/maker_functional_gff line 141, <$IN> line
> 45.
> > Can't parse details from FASTA header: >sp|Q7Z5M8-5|AB12B_HUMAN Isoform
> 5 of Protein ABHD12B OS=Homo sapiens GN=ABHD12B
> > .....
> >
> > I am not sure how to deal with this. I followed the command given in the
> protocol. Any suggestions?
> >
> > Thanks
> >
> > Best
> > Quanwei
> > _______________________________________________
> > maker-devel mailing list
> > maker-devel at box290.bluehost.com
> > http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://yandell-lab.org/pipermail/maker-devel_yandell-lab.org/attachments/20170221/81191253/attachment-0003.html>


More information about the maker-devel mailing list