Introduction:
To best address antimicrobial resistance (AMR), clinicians must be provided with tools that accurately characterise the AMR profiles of bacterial infections [1]. Proteome informatics process data from experiments by analysing how the proteome varies in complexity and application [2]. Though important for the clinic, bioinformatics remains a barrier to entry for the clinician wanting to engage proteomics as a solution [3].
Aim:
This study sought to construct a workflow that maximally characterised a bacterial isolate's AMR capabilities and identify deficiencies in the field to improve proteomic informatics overall.
Method:
Escherichia coli strain 2009-52 was utilised for its "complete" genome [4], and previously-obtained AMR proteome profile, obtained via antimicrobial susceptibility testing and a bottom-up LC-MS/MS-based workflow examining AMR in response to a sub-lethal dose of Ampicillin. PATRIC BRC [5] was utilised to provide genome annotation and translation to canonical proteins, as well as information on AMR. Other informatics used were PEAKS Studio X Pro [6], MaxQuant [7], LFQ Analyst [8], ResFinder [9], VirulenceFinder [10], and STRING DB [11] (which uses UniProt KB [12] and KEGG [13]).
Results:
Many databases had little experimentally determined annotations on the protein level, even for well-characterised bacterial strains. There were many unresolvable contradictions between proposed biological meanings or associations, and it was not immediately apparent which result was most reliable. Contradictions could be overcome with the manual, time-consuming assessment of entries by the user.
Discussion:
There is a distinct lack of functional knowledge about non-model organisms in databases. Active participation in uploading findings must be prioritised so that databases better represent proteomic, not genomic, evidence. It is essential to standardise annotations and records management in the field to bring much-needed benefit to clinical and research spaces alike. Until this is resolved, the clinic cannot easily address its needs.