Neural network based integration of assays to assess pathogenic potential. Academic Article uri icon

abstract

  • Limited data significantly hinders our capability of biothreat assessment of novel bacterial strains. Integration of data from additional sources that can provide context about the strain can address this challenge. Datasets from different sources, however, are generated with a specific objective and which makes integration challenging. Here, we developed a deep learning-based approach called the neural network embedding model (NNEM) that integrates data from conventional assays designed to classify species with new assays that interrogate hallmarks of pathogenicity for biothreat assessment. We used a dataset of metabolic characteristics from a de-identified set of known bacterial strains that the Special Bacteriology Reference Laboratory (SBRL) of the Centers for Disease Control and Prevention (CDC) has curated for use in species identification. The NNEM transformed results from SBRL assays into vectors to supplement unrelated pathogenicity assays from de-identified microbes. The enrichment resulted in a significant improvement in accuracy of 9% for biothreat. Importantly, the dataset used in our analysis is large, but noisy. Therefore, the performance of our system is expected to improve as additional types of pathogenicity assays are developed and deployed. The proposed NNEM strategy thus provides a generalizable framework for enrichment of datasets with previously collected assays indicative of species.

published proceedings

  • Sci Rep

altmetric score

  • 4.5

author list (cited authors)

  • Eslami, M., Chen, Y., Nicholson, A. C., Weston, M., Bell, M., McQuiston, J. R., ... de Figueiredo, P.

citation count

  • 0

complete list of authors

  • Eslami, Mohammed||Chen, Yi-Pei||Nicholson, Ainsley C||Weston, Mark||Bell, Melissa||McQuiston, John R||Samuel, James||van Schaik, Erin J||de Figueiredo, Paul

publication date

  • April 2023