Top-down mass spectrometry (TD-MS) analysis of intact protein ions can be utilised to return information on the primary amino acid sequence of proteins and localise protein modifications and/or protein-ligand binding sites. In TD-MS experiments, intact protein ions are generated by electrospray ionisation, isolated, and subsequently fragmented in a mass spectrometer to generate fragment ions, that can be matched to the primary protein sequence. Traditionally, TD-MS experiments only utilise terminal fragments ions (containing the N- or C- terminus) for protein sequencing which leads to many mass spectral signals going unassigned, limiting protein sequence information.
In TD-MS, internal fragments (fragment ions containing neither the N- nor C- terminus) can also be generated during the fragmentation process. Internal fragments often go unassigned due to the computational complexity associated with assigning internal fragments.1,2 However, assignment of internal fragments can enhance TD-MS experiments and extend the information that can be gleaned from these experiments.
Here, we describe Clips-MS, a program written in Python that can precisely assign internal fragments from a TD-MS experiment.3 By including internal fragments, sequence information was increased from ~30% to >80%.4 Internal fragments were found to explain portions of the sequence within the centre of the protein, complementing terminal fragments which show limited coverage of the interior of the protein sequence. This suggests that modifications located within the centre of the protein can be interrogated using internal fragments. We demonstrate that for disulfide intact proteins, internal fragments can be utilised to extend the characterisation of disulphide intact proteins and localise disulfide connectivity. Further, internal fragments can aid in the localisation of ligand binding on membrane proteins. Combining terminal and internal fragment assignments in TD-MS should allow for more efficient TD-MS experiments which could further enhance localisation of protein modifications and/or protein-ligand binding sites.