HIV-1 whole-genome quasispecies analysis in longitudinal clinical samples
Research Project
|
01.04.2013
- 31.03.2014
Genetic diversity is a hallmark of pathogen populations, associated with disease progression, immune escape, and drug resistance. We propose to use next-generation sequencing (NGS) for the analysis of viral populations within infected patients to improve viral diagnostics and treatment decisions by reconstructing the entire population structure, including low-frequency mutants and co-occurring mutations. Based on the achievements in the predecessor project (CR32I2_127017), we will develop experimental protocols and computational methods for the analysis of NGS data obtained from intra-patient HIV-1 populations. Our goal is to infer the haplotype sequences and their frequencies over the full length of the 9.2 kb HIV genome. Since the limited read length of current NGS platforms turned out to be the main bottleneck in this endeavor, we will develop two major extensions of our software tools "PredictHaplo" and "QuasiRecomb" to address this limitation. Firstly, we will use Illumina's paired-end option to generate read pairs of 2x250bp length covering the HIV genome. These data are informative about long-range phasing of single-nucleotide variants (SNVs) and will be integrated into probabilistic global haplotype reconstruction using either soft constraints on SNV linkage (PredictHaplo) or silent delete states for the insert (QuasiRecomb). Secondly, we will explore Pacific Biosciences' PacBio RS technology as an alternative long-read sequencing platform with an average read length of 1,500 bp. Using these improved experimental and computational tools, we will analyze and interpret genetic diversity in the context of HIV-1 drug resistance. Specifically, over 100 samples from 40 patients will be analyzed for pre- versus post-treatment changes of viral populations, for low-frequency drug resistant variants and their role in treatment failure, for linkage among drug resistance mutations and evolutionary escape pathways, for recombinants, and for viral phenotypes such as drug resistance and co-receptor usage. Our full-length haplotype approach provides, for the first time, a complete picture of the virus population and will yield new insights into drug resistance development.
Publications
Giallonardo, Francesca Di et al. (2014) ‘Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations’, Nucleic Acids Research, 42(14), p. e115. Available at: https://doi.org/10.1093/nar/gku537.
URLs
URLs
Prabhakaran, Sandhya (2014) Machine learning methods for HIV/AIDS diagnostics and therapy planning. . Translated by Roth Volker. Dissertation.