Machine learning modeling of the effects of media formulated with various yeast extracts on heterologous protein production in Escherichia coli
Seiga Tachibana, Tai-Ying Chiou, Masaaki Konishi
In microbial manufacturing, yeast extract is an important component of the growth media. The production of heterologous proteins often varies because of the yeast extract composition. To identify why this reduces protein production, the effects of yeast extract composition on the growth and green fluorescent protein (GFP) production of engineered Escherichia coli were investigated using a deep neural network (DNN)-mediated metabolomics approach. We observed 205 peaks from the various yeast extracts using gas chromatography-mass spectrometry. Principal component analyses of the peaks identified at least three different clusters. Using 20 different compositions of yeast extract in M9 media, the yields of cells and GFP in the yeast extract-containing media were higher than those in the control without yeast extract by approximately 3.0- to 5.0-fold and 1.5- to 2.0-fold, respectively. We compared machine learning models and found that DNN best fit the data. To estimate the importance of each variable, we performed DNN with a mean increase error calculation based on a permutation algorithm. This method identified the significant components of yeast extract. DNN learning with varying numbers of input variables provided the number of significant components. The influence of specific components on cell growth and GFP production was confirmed with a validation cultivation.
microbial, proteins, DNN leaning, machine larning