Spine identifies a core genome from input genomic sequences. Sequences are aligned using Nucmer
and regions found to be in common between all or a user-defined subset of genomes will be returned. Sequences can be given in fasta (Example),
genbank (Example), or gff3 (Example) format. Annotation files in gff3 format must be accompanied
by a separate sequence file in fasta format. Core and accessory genes will be output for any sequences given in annotated genbank format with
locus_tag
tags for each CDS or gff3 files including CDS records that are uploaded with fasta sequence files.
Please enter only one genome per line. Records in multi-fasta files will be assumed to belong to the same genome (i.e. draft genomes consisting of scaffolds or contigs).
Limit 2 - 15 genomes per analysis. Maximum of 20 Mb total file(s) size per genome. If analysis of a larger number of genomes or longer genomes is desired, please download the perl scripts to run locally (Link).
IMPORTANT: Please note that the order in which genomes are entered may be important. Genomes entered first will be given highest priority in producing core genome sequence, with decreasing priority for each subsequently entered genome.
Email Egon with questions or bugs.