par De Meutter, Joëlle ;Goormaghtigh, Erik
Référence Computational and Structural Biotechnology Journal, 18, page (1864-1876)
Publication Publié, 2020-01
Article révisé par les pairs
Résumé : While several Raman, CD or FTIR spectral libraries are available for well-characterized proteins of known structure, proteins themselves are usually very difficult to acquire, preventing a convenient calibration of new instruments and new recording methods. The problem is particularly critical in the field of FTIR spectroscopy where numerous new methods are becoming available on the market. The present papers reports the construction of a protein library (cSP92) including commercially available products, that are well characterized experimentally for their purity and solubility in conditions compatible with the recording of FTIR spectra and whose high-resolution structure is available. Overall, 92 proteins were selected. These proteins cover well the CATH space at the level of classes and architectures. In terms of secondary structure content, an analysis of their high-resolution structure by DSSP shows that the mean content in the different secondary structures present in cSP92 is very similar to the mean content found in the PDB. The 92-protein set is analyzed in details for the distribution of helix length, number of strands in β- sheets, length of β-strands and amino acid content, all features that may be important for the interpretation of FTIR spectra.