A metabolomethe collection of comprehensive quantitative data on metabolites in an organismhas been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on is offered. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data. 1. Introduction A metabolome, which comprises comprehensive data on quantification of metabolites in an organism calculated using metabolomic technologies [9, 10], has been progressively utilized for the analysis and practical applications of biological and environmental systems. Within the data-intensive systems biology discipline, metabolomics is particularly important compared to other omics (genome, transcriptome, and proteome) disciplines since metabolomes are more closely related to phenotype and regulate gene and protein expression networks [11C13]. Mass spectrometry (MS) and nuclear magnetic resonance spectroscopy (NMR) are complementary techniques often utilized for the detection and identification of metabolites. MS technology has integrated separation techniques and is used in most cases because of its sensitivity, selectivity, velocity, and broad 208538-73-2 manufacture applicability [14C16]. Owing to the wide range of chemical diversity, there is no ideal apparatus that is capable of analyzing all possible metabolites. Combinations of separation techniques with MS, such as liquid chromatography- (LC-) MS, gas chromatography- (GC-) MS, and capillary electrophoresis- (CE-) MS, are chosen according to a study’s purpose [17C19]. Metabolomics technology, including instrumental analysis, detection and identification of metabolites, statistical interpretation, and generation of hypotheses with computational support, is used for a variety of studies, such as functional analysis of biological systems [20C22], biomarker discovery [23, 24], medical diagnostics [14, 25], quality assessment of foods [26, 27], evaluation of genetically altered crops [28, 29], and assessment of environmental pollution [30, 31]. A considerable number of software tools and databases have been developed for processing the complicated and multidimensional metabolome datasets generated by various types of MS-based devices [32C35]. A typical workflow of metabolomic data analysis includes the following processes: (a) preprocessing of natural data for extraction of metabolite features, annotation of the metabolites, and finally generation of metabolome data; (b) mining and visualization of metabolome data for statistical interpretation of its nature and hypothesis generation; (c) storing and disseminating the data for further utilization and comparison (Physique 1). XCMS2 , MzMine2 , MathDAMP , MetAlign , and MET-IDEA  are common tools for preprocessing including detection, alignment, and annotation of metabolite features. Some of these tools also provide statistical analysis functions for data interpretation. MassBank , METLIN , PRIMe , and HMDB  are available as recommendations of mass spectra for metabolite annotation. The metabolite data are interpreted by means of the genome information from compound databases such as KNApSAcK , PubChem , and Chemspider (http://www.chemspider.com/) and by means of metabolic pathway databases including KEGG , BioCyc , and Reactome , which enable data 208538-73-2 manufacture visualization on pathway maps. The natural and processed data are stored publicly in databases such as PlantMetabolomics.org , GMD@CSB.DB , SetupX (currently not available), MetabolomeExpress , MetaboLights , and Metabolomics Workbench (http://www.metabolomicsworkbench.org/). Physique 1 A typical workflow of a metabolomics study and KOMICS-relevant tools and databases. The process of data acquisition and preprocessing (a) is required for generating the metabolome data. A working hypothesis is generated 208538-73-2 manufacture by interpreting the metabolome … We statement here a portal website 208538-73-2 manufacture namedKOMICS(The Kazusa Metabolomics Portal, http://www.kazusa.or.jp/komics/), which hosts tools and databases that we developed for metabolomics. Although an increasing quantity of tools and databases have become available, two major issues remain to be resolved, that is, comprehensiveness of metabolites [54, 55] and data dissemination [53, 56, 57]. Our main aim in developing data preprocessing tools RPB8 is to help researchers with the manual annotation process that remains essential for nontarget metabolomics . PowerGet for LC-high-resolution-MS and FragmentAlign for GC-MS are tools that enable curation of peak alignment results. SpiceHit is usually a high-throughput metabolite identification tool for CE-MS analysis using the selected ion monitoring (SIM) method. We have also developed data mining and visualization tools for the generation of working hypotheses (KaPPA-View and RnR). Actual data is indispensable for comparative analysis and for the development and improvement of preprocessing tools [53, 58]. MassBase is one of the largest natural data repositories, and KomicMarket is usually a database of metabolic profiling data. We developed a metadata-specific database, Metabolonote, to promote data publication by.