MIDORI is a reference dataset of DNA sequences that can be used for taxonomic assignment of metazoan mitochondrial DNA sequences. The dataset is currently available for download in five formats, compatible with the RDP Classifier, MOTHUR, QIIME, SPINGO, and SINTAX.

The MIDORI_LONGEST_SP_COI_GB249 FROGS-formatted database contains the LONGEST COI amplicon sequences with species-level (SP) taxonomic identification.

Taxonomies always contain 7 ranks (superkingdom, phylum, class, order, family, genus, species). For unknown taxa, the rank label has been simplified to keep only the current rank level. In the example below, spaces in taxon names are also replaced by underscores.

Example:

MH535936.1 Eukaryota_2759;phylum_class_order_family_genus_Amoebozoa sp._1892891;class_order_family_genus_Amoebozoa sp._1892891;order_family_genus_Amoebozoa sp._1892891;family_genus_Amoebozoa sp._1892891;genus_Amoebozoa sp._1892891;Amoebozoa sp._1892891

becomes

MH535936.1 Eukaryota_2759;phylum_Amoebozoa_sp._1892891;class_Amoebozoa_sp._1892891;order_Amoebozoa_sp._1892891;family_Amoebozoa_sp._1892891;genus_Amoebozoa_sp._1892891;Amoebozoa_sp._1892891

URL: http://reference-midori.info/index.html

Please cite: Metazoan mitochondrial gene sequence reference datasets for taxonomic assignment of environmental samples. Ryuji J. Machida et al., Scientific Data 2017, doi: 10.1038/sdata.2017.27.
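
The rank simplification illustrated by the MH535936.1 example can be sketched in a few lines of Python. This is only an illustration and not the script actually used to build the database. The function names (simplify_field, simplify_taxonomy) are hypothetical, and the sketch assumes that chained rank labels are always the lowercase rank names joined by underscores, exactly as in the example.

```python
import re

# The 7 ranks named in the description, in order.
RANKS = ("superkingdom", "phylum", "class", "order", "family", "genus", "species")

# Assumption: an unknown taxon is prefixed by a chain of rank names such as
# "phylum_class_order_family_genus_". Group 1 captures the first rank only.
_RANK_ALT = "|".join(RANKS)
_PREFIX_CHAIN = re.compile(rf"^({_RANK_ALT})_(?:(?:{_RANK_ALT})_)*")


def simplify_field(field: str) -> str:
    """Keep only the first rank label of a chain and replace spaces by underscores."""
    collapsed = _PREFIX_CHAIN.sub(r"\g<1>_", field)
    return collapsed.replace(" ", "_")


def simplify_taxonomy(taxonomy: str) -> str:
    """Simplify every field of a semicolon-separated 7-rank taxonomy string."""
    fields = taxonomy.split(";")
    if len(fields) != len(RANKS):
        raise ValueError(f"expected {len(RANKS)} ranks, got {len(fields)}")
    return ";".join(simplify_field(f) for f in fields)


if __name__ == "__main__":
    raw = (
        "Eukaryota_2759;"
        "phylum_class_order_family_genus_Amoebozoa sp._1892891;"
        "class_order_family_genus_Amoebozoa sp._1892891;"
        "order_family_genus_Amoebozoa sp._1892891;"
        "family_genus_Amoebozoa sp._1892891;"
        "genus_Amoebozoa sp._1892891;"
        "Amoebozoa sp._1892891"
    )
    print(simplify_taxonomy(raw))
```

Fields without a rank label, such as Eukaryota_2759 or the species name itself, pass through unchanged apart from the space replacement. A taxon whose real name begins with a lowercase rank word followed by an underscore would be mangled by this pattern, so the sketch should be checked against the actual data before reuse.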