Proteins are present in all living cells where they perform a multitude of functions. Some of them play essential roles in various fields such as human health, biology or biotechnology. To discover them, a research team from the Computational and Quantitative Biology Laboratory (Sorbonne University, CNRS), in collaboration with the Chloroplast Biology and Light Perception in Microalgae Laboratory (Sorbonne University, CNRS), has developed an innovative computational approach for the functional classification of protein families. She presented this researchentitled "Multiple profile models extract features from protein sequence data and resolve the functional diversity of very different protein families" in Molecular Biology and Evolution.
Functional classification of biological sequences is necessary to understand genomic and metagenomic sequence data. But there are thousands of protein sequences from the same ancestor that have been mutated and are involved in interaction with nucleic acids, amino acids and small molecules.
ProfileView, a computational approach for the functional classification of protein families
The team of the Computational and Quantitative Biology Laboratory has developed ProfileView to classify these thousands of sequences with a common ancestor by function. This innovative approach is based on two concepts:- Use multiple probabilistic profile models to explore and extract evolutionary information from sequence databases;
- Define a new sequence representation space where sequences are analyzed from the point of view of the functional motifs encoded in the profiles.
