Figure 1. The number of proteins cataloged in UniProt databases [16]. Swiss-Prot contains reviewed, manually annotated proteins. Its growth is barely noticeable compared to that of UniRef50, which comprises unreviewed, automatically annotated sequences.
Summary
- Understanding of microbial proteins is crucial for unlocking the microbiome’s clinical potential.
- Developing a precise protein function prediction method is still a significant challenge.
- Deep learning is a powerful tool that, with sufficient amounts of data, can take proteomics far further than current methods.
To be continued…
The next part in this series will summarize the recent adoption of deep learning advancements in proteomics, which is slowly leading to a better understanding of (microbial) proteins.
Bibliography
[1] N. Koppel and E. P. Balskus, “Exploring and Understanding the Biochemical Diversity of the Human Microbiota,” Cell Chem Biol, vol. 23, no. 1, pp. 18–30, Jan. 2016, doi: 10.1016/j.chembiol.2015.12.008.
[2] P. Amon and I. Sanderson, “What is the microbiome?,” Archives of Disease in Childhood – Education and Practice, vol. 102, no. 5, pp. 257–260, Oct. 2017, doi: 10.1136/archdischild-2016-311643.
[3] “CAFA | Bio Function Prediction.” [Online]. Available: https://www.biofunctionprediction.org/cafa/. [Accessed: 30-Apr-2020].
[4] “Home – Prediction Center.” [Online]. Available: http://predictioncenter.org/. [Accessed: 30-Apr-2020].
[5] S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, “Basic local alignment search tool,” J. Mol. Biol., vol. 215, no. 3, pp. 403–410, Oct. 1990, doi: 10.1016/S0022-2836(05)80360-2.
[6] S. F. Altschul et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Res., vol. 25, no. 17, pp. 3389–3402, Sep. 1997, doi: 10.1093/nar/25.17.3389.
[7] S. R. Eddy, “Profile hidden Markov models,” Bioinformatics, vol. 14, no. 9, pp. 755–763, 1998, doi: 10.1093/bioinformatics/14.9.755.
[8] M. Steinegger, M. Meier, M. Mirdita, H. Vöhringer, S. J. Haunsberger, and J. Söding, “HH-suite3 for fast remote homology detection and deep protein annotation,” BMC Bioinformatics, vol. 20, no. 1, 2019, doi: 10.1186/s12859-019-3019-7.
[9] Z. D. Ariel Schwartz, “Deep Learning Applied to Genomics, Deep Semantic Protein Representation.”
[10] C. Angermueller, T. Pärnamaa, L. Parts, and O. Stegle, “Deep learning for computational biology,” Mol. Syst. Biol., vol. 12, no. 7, p. 878, Jul. 2016, doi: 10.15252/msb.20156651.
[11] A. W. Senior et al., “Improved protein structure prediction using potentials from deep learning,” Nature, vol. 577, no. 7792, pp. 706–710, Jan. 2020, doi: 10.1038/s41586-019-1923-7.
[12] A. W. Senior et al., “Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13),” Proteins, vol. 87, no. 12, pp. 1141–1148, Dec. 2019, doi: 10.1002/prot.25834.
[13] A. Kryshtafovych, T. Schwede, M. Topf, K. Fidelis, and J. Moult, “Critical assessment of methods of protein structure prediction (CASP)-Round XIII,” Proteins, vol. 87, no. 12, pp. 1011–1020, Dec. 2019, doi: 10.1002/prot.25823.
[14] T. Ching et al., “Opportunities and obstacles for deep learning in biology and medicine: 2019 update.” [Online]. Available: https://greenelab.github.io/deep-review/. [Accessed: 28-Dec-2018].
[15] V. Boža, B. Brejová, and T. Vinař, “DeepNano: Deep recurrent neural networks for base calling in MinION nanopore reads,” PLoS One, vol. 12, no. 6, p. e0178751, Jun. 2017, doi: 10.1371/journal.pone.0178751.
[16] A. Bateman et al., “UniProt: the universal protein knowledgebase,” Nucleic Acids Res., vol. 45, no. D1, pp. D158–D169, Jan. 2017, doi: 10.1093/nar/gkw1099.