<div dir="ltr"><div><div><div><div><div><div><div><div><div><div><div>Hello Maker community,<br><br></div>I have, at last, finished annotating my genome with Maker (!) and have a few questions about the final output.<br><br></div>1. I used <span style="font-family:monospace,monospace">gff3_merge</span> and <span style="font-family:monospace,monospace">fasta_merge</span> to merge all the GFF files and all the different FASTA files that were produced during the runs (I split my assembly into smaller chunks that ran in parallel). Are these two scripts the only ones I have to run after Maker has finished? Am I leaving anything important behind?<br><br></div>2. I noticed that all my transcripts (in both the FASTA files and the GFF) have a name of the form "XXX-mRNA-1". Does the fact that I can't find any containing "mRNA-2" mean that there are no splice variants of the same gene?<br><br></div>3. In my <span style="font-family:monospace,monospace">*maker.proteins.fasta</span> file I see that some proteins have a name like<br><br></div>snap_masked-XXX<br><br></div>whereas others (apparently also predicted by SNAP) have a name like<br><br></div>maker-XXX-snap-gene-XXX<br><br></div>What is the difference between these two kinds of genes, both of which are predicted by SNAP? From reading other posts on this list, I was left with the impression that all genes predicted by SNAP/Augustus that lie in a masked region (as the first name implies) are put into a separate FASTA file named <span style="font-family:monospace,monospace">*maker.snap_masked.proteins.fasta</span>.<br><br></div>4. After looking at a few genes in the <span style="font-family:monospace,monospace">*maker.transcripts.fasta</span> file, I came to the conclusion that only complete genes (i.e. with a start and a stop codon) are reported in this file. Am I right?<br><br></div>Thanks in advance,<br></div>Panos<br></div>