UNIverse - Public Research Portal
Profile Photo

Prof. Dr. Torsten Schwede

Department Biozentrum
Profiles & Affiliations

Projects & Collaborations

28 found
Show per page
Project cover

Evolutionary-scale interpretation of protein functions in the human gut microbiome

Research Project  | 3 Project Members

Thanks to over a decade of metagenomics efforts, we have a catalogue of over 170 million unique putative proteins from the human gut microbiome. About 40% of these do not have function assigned (dark proteins), hence limiting our understanding of the well-established relationship between the gut microbiome and human health. Current homology-based methods used for functional assignment have reached their limits because of the availability of reference data. But we are now in a new Era of computational biology, where deep-learning-based approaches allow us to predict protein structures (e.g. AlphaFold2), functions (e.g. deepFRI), and molecular mechanisms at extremely high levels of detail. This opens the door to further model protein interactions and remote evolutionary relationships (e.g. the Protein Universe Atlas) at unprecedented scales. Following up on these developments, our aim is to construct an integrative view of putative molecular functions and biological roles of proteins from the human gut microbiome by combining deep learning-driven, structure-based function prediction with a large-scale view of the protein universe.

Project cover

Open Research Data (ORD) best practices for computational macromolecular models. (Short title: ModelArchive)

Research Project  | 7 Project Members

Open Research Data (ORD) best practices for computational macromolecular models Biological macromolecules such as proteins, DNA or RNA are essential for almost all biological processes. To gain insights into their function, life science research relies on accurate information on their 3D structure. Typically, such structures are determined experimentally at atomic resolution via X-ray crystallography, NMR, and increasingly single particle cryo EM techniques. In recent years, computational methods for structure prediction have made impressive progress, achieving near-experimental accuracy in predicting 3D structures of proteins. This breakthrough has large implications for structure-based approaches in different research fields, including life sciences, biomedical research, ecology, protein engineering, biotechnology and green chemistry. Not surprisingly, the journal Nature has nominated protein structure prediction as "Method of the Year 2021". Since the creation of the Protein Data Bank in 1971, the structural biology community pioneered open research data principles. The PDB (https://www.wwpdb.org) is the single global archive of 3D structures of biological macromolecules determined by experimental techniques, but not for structures obtained through computational modelling. As a consequence, computational models are often stored in undefined locations in a variety of incompatible formats, and lack essential metadata indicating their usability (e.g. model quality estimates or licence information). Following a recommendation given in an international community workshop, we have developed an archive for computed macromolecular structures (https://modelarchive.org) and an extension of the mmCIF data format to store metadata. With the technical infrastructure of ModelArchive now established, we are in a good position to further develop respective ORD practices in our community. This includes promotion of best practices for data and metadata interoperability standards, collaborating with scientific journals and funding agencies on establishing deposition policies, improving reusability of protein models by promoting accuracy estimates, and interlinking with other ORD resources to make models easily findable and accessible.

Project cover

The Protein Universe Atlas

Research Project  | 3 Project Members

The term "protein universe" refers to the collection of all possible proteins that can be constructed from the small alphabet of 22 proteinogenic amino acids1,2. In this representation, functionally characterised proteins correspond to stars, protein families to galaxies, and protein superfamilies to clusters of galaxies, surrounded by all those sequences which are evolutionary related but not hitherto functionally characterised or sampled by nature. In this project, we will develop a new web service to navigate through the landscape of this universe that is currently covered by all catalogued natural proteins - the "Protein Universe Atlas". We will apply deep learning protein language models (pLMs) and abstract protein structure representations to model this landscape in three dimensions (3D), providing users with an interactive and integrative platform that will facilitate the annotation, biocuration and further study of a protein, a set of proteins, or all proteins catalogued so far.

Project cover

LIGATE - Ligand Generator and portable drug discovery platform AT Exascale

Research Project  | 6 Project Members

LIGATE is an EU funded project that aims to integrate and co-design best in class European components on Computer-Aided Drug Design (CADD) solutions exploiting today high-end supercomputer and tomorrow Exascale resources. The implementation of machine learning, extreme scale computer simulations, and big data analytics in the drug design and development process offer an excellent opportunity to lower the risk of investment and reduce the time to patient. The availability of powerful computing resources, new numerical models for simulations, and artificial intelligence increase the accuracy and predictability of CADD, reducing the costs and time for the design and the production of novel drugs.

Project cover

National research infrastructure project "Information and computational service infrastructure network to support biomedical research in Switzerland (BioMedITS)"

Research Project  | 2 Project Members

The BioMedIT project will establish a coordinated nationwide network of secure infrastructures to support computational biomedical research and clinical bioinformatics. BioMedIT builds on the centers of expertise established at our partner universities (regional nodes), coordinated by the SIB Swiss Institute of Bioinformatics. The purpose of the BioMedIT project is to support personalized health research by widening scope of the regional nodes to fulfil the stringent legal and security requirements with respect to technology, policies and competences. The BioMedIT is complementary to the activities of the Data Coordination Center of the Swiss Personalized Health Network initiative, which aims to establish interoperability of health-related information by building a dynamic scalable network of data providers based on common standards for formats, semantics, governance, and exchange mechanisms.

Project cover

Reconstituted Artificial Coronas: engineering nano-particles by in-silico protein mutagenesis

Research Project  | 5 Project Members

Nanoparticles (NP) are novel materials being used increasingly in a range of fields, including in medicine. When NP enter the blood stream, they often interact with proteins to form a "corona", which may change the properties and function of the nanoparticle; it is also thought that formation of the corona is related to toxic effects of nanoparticles. However, how proteins bind to NP and how exactly this protein-inorganic interaction occurs remains largely unknown; consequently, any prediction of corona formation and of potential toxicity of NP remains difficult. In this project, we aim to study the interaction at the biological-inorganic interface, in specific model proteins binding to silica and gold NP of defined size and surface chemistry. Using a reiterative approach combining experiment and computation, hypotheses generated compitationally will be tested experimentally using recombinant protein mutants designed to probe potential interaction sites.

Project cover

Data Life Cycle Management (DLCM) SUC-P2 Program

Research Project  | 5 Project Members

The primary objective of this project is to provide sustainable and tangible solutions at a national level to implement research data life-cycle management (DLCM). In doing so, this project endeavors to consolidate and further develop collaboration, while promoting coordination, between Switzerland's higher education institutions. Building on existing resources and tools at national and international levels, we target the setting up of the needed services that will allow efficient managing of active research data, and ensure publication, long-term reference and preservation of subsets of data selected by researchers. Those services will need adequate underlying infrastructures that the project partners intend to set up and test on specific use cases (in a first stage, mainly in life-sciences and humanities). Guidelines and data management plans (DMP), necessary for providing researchers with the incentive to care for their data, will be provided based on pre-existing national and international policies. Moreover, because DLCM of research data involves many questions, which include, but are not limited to data organization, file formats, metadata as well as legal and regulatory aspects, important outcomes of this project are the training of the end-users and the offering of consulting in some specific DLCM areas. These resources will be made available through a national portal.

Project cover

eScience Coordination Team eSCT

Research Project  | 5 Project Members

eSCT: eScience Coordination Team Creating a sustainable national coordination layer for local eScience units eSCT is part of the SUC P-2 "Scientific information: access, processing and safeguarding" program. The aim of the eSCT initiative is to address the growing need of scientists to access professionally supported information technology (IT) platforms in order to be competitive. The term 'eScience' refers to activities in science that are related to the computation and storage, analysis and publication of data or to the simulation of scientific processes. Several research institutions in Switzerland have created dedicated eScience support units to provide expert support and access to competitive infrastructure for eScience research on topics including complex data management and flexible cloud computing resources. However, the support levels and services provided is not homogeneous between institutions. Also, not all the institutions are capable of sustaining a dedicated eScience unit. This poses significant challenges on national collaborative research projects where data needs to be shared and exchanged across institutional boundaries. The aim of this project is to create a coordination layer on top of existing local eScience support units in Switzerland, with the objective of: supporting computing in the research sector coordinating and increasing the knowledge-base, quality, reach and impact of the already established local Swiss eScience support staff motivating the creation of units at institutions that still lag behind harmonizing the quality of eScience support across the country sciCORE is part of the eSCT project as a SciNex (Science IT Netwrok of Excellence) member.