UNIverse - Public Research Portal
Project cover

The Protein Universe Atlas

Research Project
 | 
01.09.2022
 - 31.12.2023

The term "protein universe" refers to the collection of all possible proteins that can be constructed from the small alphabet of 22 proteinogenic amino acids1,2. In this representation, functionally characterised proteins correspond to stars, protein families to galaxies, and protein superfamilies to clusters of galaxies, surrounded by all those sequences which are evolutionary related but not hitherto functionally characterised or sampled by nature. In this project, we will develop a new web service to navigate through the landscape of this universe that is currently covered by all catalogued natural proteins - the "Protein Universe Atlas". We will apply deep learning protein language models (pLMs) and abstract protein structure representations to model this landscape in three dimensions (3D), providing users with an interactive and integrative platform that will facilitate the annotation, biocuration and further study of a protein, a set of proteins, or all proteins catalogued so far.

Funding

The Protein Universe Atlas

Foundations / Associations (GrantsTool), 09.2022-12.2023 (16)
PI : Schwede, Torsten,Soares Pereira, Joana Maria,Tauriello, Gerardo.

Publications

Durairaj, Janani et al. (2023) ‘Uncovering new families and folds in the natural protein universe’, Nature, 622(7983), pp. 646–653. Available at: https://doi.org/10.1038/s41586-023-06622-3.

URLs
URLs

Members (3)

MALE avatar

Gerardo Tauriello

Co-Investigator
FEMALE avatar

Joana Maria Soares Pereira

Principal Investigator
Profile Photo

Torsten Schwede

Principal Investigator