DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine)



Damon P. Little1

1Lewis B. and Dorothy Cullman Program for Molecular Systematics, The New York Botanical Garden, Bronx, NY, USA


description

This set of scripts is designed to transform a set of FASTA-formatted sequences into a queryable DNA barcoding reference database. These scripts were first used by Little and Stevenson (2007).

script usage

(1) Create a MySQL database: “mysql -u root -p barcode < db-tables.sql”.

(2) Create a table of motifs: “patterns.pl size motifs”.
Motifs of 8-10 bp are recommended.
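
To give a feel for how large a motif table of a given size can be, the sketch below enumerates every possible DNA motif of a chosen length. It is written in Python for illustration only and is an assumption about the general idea, not a description of how patterns.pl works; the function name all_motifs is hypothetical.

```python
from itertools import product

def all_motifs(size, alphabet="ACGT"):
    """Yield every possible motif of the given length over the alphabet."""
    for letters in product(alphabet, repeat=size):
        yield "".join(letters)

# Small demonstration with 2 bp motifs so the output stays readable.
motifs = list(all_motifs(2))
print(len(motifs))   # 4**2 = 16 motifs of length 2
print(motifs[:4])    # ['AA', 'AC', 'AG', 'AT']
```

The number of possible motifs grows as 4 to the power of the motif length: 65,536 for 8 bp and 1,048,576 for 10 bp.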

(3) Import FASTA-formatted sequences: “fst2mysql.pl sequences.fasta division locus”.
The file “sequences.fasta” is assumed to contain DNA sequences in FASTA format (either GenBank FASTA, or FASTA with headers of the form id_genus_species).
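
As an illustration of the second header style, the following Python sketch reads FASTA records and splits an id_genus_species header into its three parts. It is not taken from fst2mysql.pl: the function names read_fasta and split_header are hypothetical, the underscore is assumed to be the only separator, and the example sequences are made up.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)

def split_header(header):
    """Split an id_genus_species header (assumed layout) into three parts."""
    seq_id, genus, species = header.split("_", 2)
    return seq_id, genus, species

# Made-up records, for illustration only.
example = """>s1_Cycas_revoluta
acgtac
GTTA
>s2_Zamia_furfuracea
TTGACA
""".splitlines()

records = [(split_header(h), s) for h, s in read_fasta(example)]
for (seq_id, genus, species), seq in records:
    print(seq_id, genus, species, len(seq))
```

GenBank-style headers would need a different parsing rule, which this sketch does not attempt.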

(4) Barcode sequences: “barcode.pl locus”.
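
The program name suggests that barcoding involves finding oligo motifs that are diagnostic for a taxon. One simple reading of that idea is sketched below in Python: a motif counts as diagnostic if it occurs in the sequences of exactly one species. This is an assumption made for illustration, not a description of what barcode.pl does; the names kmers and diagnostic_motifs and the toy reference data are hypothetical.

```python
from collections import defaultdict

def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def diagnostic_motifs(reference, k):
    """Map each species to the motifs found in it and in no other species.

    reference: dict of species name -> list of sequences (toy input).
    """
    owners = defaultdict(set)  # motif -> set of species containing it
    for species, seqs in reference.items():
        for seq in seqs:
            for motif in kmers(seq, k):
                owners[motif].add(species)
    result = {species: set() for species in reference}
    for motif, found_in in owners.items():
        if len(found_in) == 1:
            result[next(iter(found_in))].add(motif)
    return result

# Toy reference set: the two sequences share the motif ACGT.
reference = {
    "species_a": ["ACGTAC"],
    "species_b": ["ACGTTT"],
}
diag = diagnostic_motifs(reference, 4)
print(sorted(diag["species_a"]))  # ['CGTA', 'GTAC']
print(sorted(diag["species_b"]))  # ['CGTT', 'GTTT']
```

The shared motif ACGT is excluded from both sets because it cannot distinguish the two species.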

(5) Identify sequences: “dome.pl sequence”.
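
A query could then be identified by checking which diagnostic motifs it contains. The Python sketch below counts, for each species, how many of its diagnostic motifs occur in the query sequence. It is a minimal illustration under stated assumptions, not the logic of dome.pl: the function name identify, the scoring by simple hit count, and the motif table are all hypothetical.

```python
from collections import Counter

def identify(query, diagnostic, k):
    """Rank species by how many of their diagnostic motifs occur in the query.

    diagnostic: dict of species name -> set of length-k motifs (toy input).
    """
    query = query.upper()
    found = {query[i:i + k] for i in range(len(query) - k + 1)}
    hits = Counter()
    for species, motifs in diagnostic.items():
        n = len(found & motifs)
        if n:
            hits[species] = n
    return hits.most_common()

# Hypothetical diagnostic motif table with 4 bp motifs.
diagnostic = {
    "species_a": {"CGTA", "GTAC"},
    "species_b": {"CGTT", "GTTT"},
}
print(identify("ttacgtacgg", diagnostic, 4))  # [('species_a', 2)]
```

A query that contains no diagnostic motif returns an empty list, which corresponds to no identification.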

requirements

Perl interpreter
MySQL

citation

Little, D. P. 2007. DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine). Program distributed by the author.

download

DOME ID