Immunome Research was a scientific journal published from 2005-2010 about cutting edge immunology
research, integrating genomics, and bioinformatics. NeuEve supports science, so we have put the
articles originally published in Immunome Research back online.
Viruses are fast evolving pathogens that continuously adapt to the highly variable environments they live and reproduce in. Strategies devoted to inhibit virus replication and to control their spread among hosts need to cope with these extremely heterogeneous populations and with their potential to avoid medical interventions. Computational techniques such as phylogenetic methods have broadened our picture of viral evolution both in time and space, and mathematical modeling has contributed substantially to our progress in unraveling the dynamics of virus replication, fitness, and virulence. Integration of multiple computational and mathematical approaches with experimental data can help to predict the behavior of viral pathogens and to anticipate their escape dynamics. This piece of information plays a critical role in some aspects of vaccine development, such as viral strain selection for vaccinations or rational attenuation of viruses. Here we review several aspects of viral evolution that can be addressed quantitatively, and we discuss computational methods that have the potential to improve vaccine design.
Viruses are intracellular parasites that need the cellular machinery of the host to reproduce . They have the potential to generate huge population sizes in short generation times. Viruses in general, and RNA viruses in particular, exist in genetically heterogeneous populations because of their error-prone replication . These features make the evolution of viruses a phenomenon observable on short time scales of weeks to months. The consequences of the extreme viral evolutionary dynamics are of tremendous importance for disease control and prevention. For example, influenza vaccines need to be updated every year, viral variants develop resistance to antiviral drugs, and mild viral strains turn into virulent ones spontaneously. These global health care issues and others arise from the rapid evolution of viruses.
Molecular profiling techniques, including DNA sequencing, have produced an enormous amount of data on viral spread, genetic diversity, and infection dynamics. The integration and analysis of these data can provide valuable information on the evolution of viral pathogens. Mathematical, statistical, and computational methods are necessary to deal with those large data sets and to predict phenotypes form genetic data that ultimately can be used in vaccine development. In this review, we summarize some computational and mathematical techniques that play a critical role in understanding viral evolution and vaccine design. Specifically, we discuss phylogenetic methods for vaccine strain selection, statistical models of evolutionary escape from selective immune pressure, and virus dynamics models for therapeutic vaccines and attenuation strategies. Our major examples are Influenza, human immunodeficiency virus (HIV), and foot-and-mouth disease virus (FMDV). They are all RNA viruses of great medical or veterinary importance and have been studied extensively.
We also discuss current, approved experimental methodologies employed to the development of antiviral vaccines. Vaccine development deals with the four most important traits of viral evolution: virulence, fitness, diversity, and dynamics. Each one of these concepts has been subjected to intense studies but is still difficult to predict, probably because of the complexity of each trait and the intrinsic stochasticity of the evolutionary process. Progress in understanding the interplay of these factors will depend on quantitative descriptions and predictive models. Thus, mathematical evolutionary modeling is likely to play an increasingly important role in the development of new vaccines and the control of viral disease.
A quantitative description of viral evolution is necessary for monitoring the spread of viral pandemics and for developing effective therapies and vaccines . Viruses are not only a threat to human health, but they also provide attractive model systems for evolutionary studies due to their short genomes, large population sizes, and high genetic diversity . The extreme replication dynamics of RNA viruses, for example, allow for observing significant evolutionary changes over time. Hypotheses and theories about evolutionary mechanisms can often be tested directly with these measurably evolving viruses .
The mutation rate of RNA viruses is about a million times larger than the human mutation rate . Thus, RNA viruses display a huge genetic diversity and this feature is critical for survival of the virus [7, 8]. Virus populations are exposed to fluctuating environments when migrating through different organs and tissues of the host organism and when exposed to immune responses mounted by the host. Transmission to a new host is typically associated with both traversing various tissues and facing new immune responses, and therefore it represents a major bottleneck for the virus population. The genetic diversity of RNA viruses makes it likely that adapted variants preexist in the population even before the selective pressure has changed [9, 10].
Because the diversity of the virus population can determine its evolutionary fate, selection seems to operate on the population level rather than at the level of individual viruses . This idea was originally developed for self-replicating RNA molecules and termed quasispecies theory , and then applied to RNA viruses [13, 14]. One prediction of quasispecies theory is the existence of an upper bound on the mutation rate beyond which the population cannot maintain its essential genetic information. Many RNA viruses appear to have mutation rates close to this error threshold [9, 15, 16].
The evolutionary dynamics of viral infectious diseases can be analyzed at considerable detail today owing to advancements in high-throughput DNA sequencing technologies and statistical and computational modeling of these data . Viral evolution occurs on different temporal and spatial scales and is shaped by different ecological processes within and between hosts. Integrated modeling efforts across these scales that make use of phylogenetic, population genetics, virus dynamics, and epidemiological methods are termed phylodynamics approaches . Application of these techniques enabled the reconstruction of the molecular origin of the HIV pandemic [19, 20] and the explanation of influenza A epidemics by the interplay of natural selection and migration [21, 22]. Comparing viral DNA sequences is at the heart of the phylodynamics approach.
In general, the study of correlated evolution among genotypic and phenotypic traits or between traits and the environment across species is known as the comparative method . It has been extremely successful in analyzing DNA sequence data and it is the basis for predicting phenotypes, such as protein structure or function, from genotypes. In the case of RNA viruses, which display a large genetic diversity between hosts, the different viral quasispecies take the role of the different species in the traditional application of the comparative method. Viral phenotypes of interest include immunological escape and drug resistance.
Vaccine development strategies
Vaccine development can be considered one of the biggest achievements of modern medicine. While some bacterial families share common biochemical features, such as surface lipopolysacharide and therefore can be treated with common drugs or antibiotics , viruses are typically different enough between families or even inside families, making the development of broadly applicable antiviral drugs challenging [2, 25]. For example, the base analogue ribavirin alone or in combination with interferon, has been proven to act as an antiviral compound of broad activity in the clinical treatment of infections with hepatitis C virus (HCV), hepatitis B virus (HBV), and respiratory syncytial virus (RSV) [26, 27, 28, 29, 30, 31]. Overall, only few antiviral drugs targeting viral proteins are currently approved for use in humans . Viruses that can be fought with chemical compounds include HIV, influenza virus, RSV, HCV, HBV, and the herpes viruses, including herpes simplex virus, varicella-zoster virus (VZV), and cytomegalovirus.
However, currently the vast majority of viruses cannot be controlled today with any approved compound and frequently not even all subgroups inside the same viral species are drug sensitive. For example, the 2009 seasonal influenza A virus (H1N1) presents a natural resistance to oseltamivir . The efficiency of antiviral compounds is usually hampered by the generation of drug resistant viruses. Moreover, some compounds may help control infection, but only rarely can the virus be completely cleared from the host organism. By contrast, vaccines can boost the immune system response and, in principle, achieve complete clearance of a virus from an infected host. Therefore, vaccines are considered the best weapon to fight viruses.
The general idea of vaccination is to resemble a viral infection but avoiding the associated pathology. There are several strategies to address this goal with varying efficacies. Currently approved viral vaccines for use in humans are listed in Table 1. They have resulted from one out of three main types of vaccine design strategies referred to as (i) live attenuated vaccines, (ii) inactivated vaccines, and (iii) recombinant vaccines and peptide-derived vaccines.
Vaccine-preventable viral diseases as defined by CDC Atlanta .
Type of vaccine
recombinant (subunit, surface antigen)
live attenuated, MMR vaccine1,
live attenuated, MMR vaccine1
live attenuated, Sabin strain
inactivated, Salk strain
live attenuated, MMR vaccine1
Japanese encephalitis virus
Children severe diarrhea
live attenuated vaccinia virus2
Varicella (chickenpox, shingles), herpes zoster
yellow fever virus
1 MMR is a triple vaccine including Mumps, Measels, and Rubella vaccine.
2 Small pox vaccine is no longer routinely administered because of eradication of smallpox. The vaccine consists of a stock of vaccinia virus, a smallpox-related virus which does not cause disease in humans.
Live attenuated vaccines (LAVs). Immunization with LAV has been proved to be the most efficient vaccination strategy to date [34, 35]. LAV preparations include viruses with reduced virulence, which means that they do not produce the disease when infecting the host, or they produce a mild version of the disease. Viruses become attenuated for the original host after serial infections (passages) in cell culture of different organism. This is the case of the polio vaccine preparation in monkey cells . Infection of embryonated hen eggs is the standard protocol to obtain attenuated yellow fever or measles viruses suitable for vaccine preparation . The rationale for this attenuation strategy is that due to the high error rates during replication of viruses, especially of RNA viruses , the virus accumulates mutations in the genome that optimize their replication in the new host or new cell type, at the expense of replication efficiency in the original host [1, 34, 38].
The attenuated virus is still competent for replication and it retains the ability to infect host cells. For this reason, LAVs can elicit different effector mechanisms of the immune system. The intracellular replication of the virus can stimulate cytotoxic CD8+ cells because they can be presented by major histocompatibility complex (MHC) class I molecules. Particles released outside the cell can also be presented by class II MHC molecules . After immunization with the LAV, the immune system is exposed to multiple antigens of the virus in its native conformation. Once the infection with the LAV is cleared from the organism, the virus-specific immune cells remain as memory cells in the host. A future challenge with the wild type virus will trigger the correct response, i.e., predominantly cellular response or predominantly humoral response. LAVs are considered the most successful vaccines because the efficient and multiple stimulation of the immune system typically induces a potent and durable response .
Inactivated vaccines (IVs). Viral stocks are susceptible to inactivation by some chemical and physical treatments. IVs consist of a concentrated viral stock that has been treated with a chemical reagent, such as binary ethylenimine or formaldehyde, which completely abolish virus replication . Some viruses are difficult to attenuate because they may change their antigenic properties or they remain virulent after few passages in cell culture . The latter is the case for FMDV, an animal virus from the picornavirus family. When a virus cannot be attenuated with sufficient reliability, inactivation has been proven to be a successful vaccination strategy, as documented for FMDV, influenza, or hepatitis A vaccines [41, 42, 43, 44]. Some security issues may arise however, if the chemical compound does not reach all virus particles, for example because some viral particles tend to form compact aggregates, and the preparation still contains a portion of live viruses. Inactivated viruses are usually not efficiently presented by MHC class I molecules, which stimulate the cellular immune response. IV preparations include adjuvants which are chemical compounds that act on antigen-presenting cells enhancing the immunogenicity of the vaccine [45, 46]. However, the strength and duration of the protection induced by IVs is usually lower than that obtained with LAVs.
Recombinant and peptide-based vaccines. A great variety of vaccine strategies can be catalogued inside this category, most of them experimental. Recombinant vaccines are produced by the expression of a genetic construct that codifies viral peptides [47, 48], subunits of the virus , or whole viruses with genetic modifications including deletions of key proteins [50, 51]. Other strategies include live viral vectors that carry multiple copies of heterologous proteins of interest [52, 53]. Many of these strategies have failed, mainly because of the low immunogenic capacity of peptides or subunits, compared with the whole live or inactivated particle. The HBV vaccine is a yeast-derived recombinant vaccine. It contains the hepatitis B surface antigen which is one of the viral envelope proteins. HBV vaccine is the only recombinant vaccine currently approved and in use for humans .
DNA vaccination is a strategy based on injecting a DNA construct directly into the host [54, 55]. Such DNA constructs, which code for the immunizing protein or other parts of interest of the virus, can be transcribed and translated into the cell. Therefore, expressed gene products can elicit an immune response by presentation of peptides by MHC class I and II molecules.
Phylogenetic methods in influenza vaccine design
Influenza A virus is a negative-stranded RNA virus that infects about one fifth of the worldwide human population each year . The viral genome consists of eight segments and is categorized by the serology and genetics of its two surface glycoproteins neuraminidase (NA) and hemagglutinin (HA). Several subtypes of both genes have been isolated from mammalian and avian hosts, including the two most recent pandemic strains H3N2 and H1N1 currently circulating in the human population and responsible for the 1968–1969 Hong Kong Flu and the 2009 Swine Flu, respectively.
Influenza infects large portions of its host population every season and immunized hosts are resistant to infection with the subtype they have been exposed to for several years. Therefore, selective pressure exists for the virus to diversify and to generate immunological escape variants. Indeed, the HA gene has been shown to be under strong selective pressure through immune surveillance . Positive (diversifying) Darwinian selection acts at the antigen-determining sites of HA1, the most immunogenic part of HA. At these loci, significantly more non-synonymous than synonymous nucleotide substitutions are observed, and the rate of evolution is accelerated considerably as compared to other sites of the genome [58, 59].
Not only selection, but also neutral genetic drift seems to play an important role in the evolution of influenza virus. Both evolutionary forces, termed antigenic drift and antigenic shift, have been observed in human hosts over the last century. Antigenic drift refers to neutral evolutionary changes accumulating over time, whereas antigenic shift involves a change in genetic and serological properties of the virus due to new HA or NA subtypes.
The evolutionary dynamics of H3N2 epidemics have been studied in detail by allowing for different genetic and antigenic properties of the HA gene . Variations of this influenza subtype emerge and replace each other every 2 to 8 years. The mapping of HA genotypes to antigenic phenotypes is based on a neutral network model, i.e., mutants are organized in connected, phenotypically identical clusters. The model explains the observed pattern of viral evolution, with periods of antigenic stasis and with episodic cluster transitions. During antigenic stasis the population drifts through the neutral network and its diversity increases. If it hits the boundary of the neutral network, the antigenic phenotype changes and the population continues to evolve in the new network. The cluster transition presents a population bottleneck and gives rise to a selective sweep which reduces the genetic diversity of the population .
The influenza vaccine needs to be redesigned regularly to account for genetic changes in the virus population. Normally, the changes are made in response to the antigenic drift of the virus. For example, between 1968 and 2001 the H3N2 component of the influenza vaccine was changed a total of 17 times . The selection of viral strains to be included in the vaccine for the coming season is based on the antigenic properties of recent isolates, on epidemiological data, and on post-vaccination serological studies in humans.
The evolutionary dynamics of influenza drive its immune escape and give rise to a new dominant strain every season. Therefore, vaccine design is not only supported by immunoinformatics methods for epitope prediction [63, 64, 65], but also by statistical genetics and phylogenetic methods for analyzing genetic diversity and predicting evolutionary changes. To predict the evolution of the influenza HA gene, phylogenetic trees were constructed based on DNA sequences derived from viruses during the years 1983 through 1997 . Eighteen codons were identified to be under positive selective pressure and the genetic diversity at these loci was significantly higher than at the other loci of the HA gene . The rationale for predicting the next dominant virus is that extant strains with additional mutations at the 18 loci will be better adapted to evade the host immune response and thus have a selective advantage in the coming season. Phylogenetic analysis confirmed that the viral lineages with the greatest number of mutations in the positively selected codons were the ancestors of future H3 lineages in 9 out of 11 influenza seasons .
This approach to predicting the evolution of influenza relies on solving two classical evolutionary biology problems: the detection of genetic loci under selective pressure and the reconstruction of the evolutionary history of a set of individuals. Quantifying the relative contributions of selection versus random genetic drift is a longstanding task rooted in Kimura’s theory of neutral evolution which predicts that most mutations are selectively neutral [66, 67]. Selection is typically identified by testing for departure from neutrality, although such deviations can also have different causes. The statistical tests are either based on the allelic distribution or on comparing variability in different classes of mutations . Tajima’s D is the prototype test of the first kind . It detects differences in two distinct estimates of genetic diversity. The null distribution of the test statistic D is obtained from sampling genealogies according to the coalescent, a stochastic process describing the sampling variation [70, 71]. Similarly, the Ewens–Watterson test compares the observed to the expected homozygosity based on the Ewens sampling formula for the infinite-alleles model [72, 73]. In the second category of tests fall the McDonald–Kreitman test  and likelihood ratio tests based on the allelic distribution in non-synonymous versus synonymous sites [58, 75]. Codon usage in influenza sequences has also been analyzed based on codon volatility, which measures the degree to which a random nucleotide mutation is expected to change the corresponding amino acid .
Phylogenetic methods play an important role in the analysis of viral sequence data. They are used to reconstruct the evolutionary relationships between different viral strains. Three main approaches exist to phylogenetic tree reconstruction: distance-based methods, parsimony, and probabilistic methods based on maximum likelihood or Bayesian statistics [77, 78, 79, 80]. Distance-based methods start by defining an evolutionary distance between sequences and then apply hierarchical clustering algorithms to obtain a phylogenetic tree. Unweighted Pair Group Method with Arithmetic mean (UPGMA) , an average linkage clustering method, and Neighbor Joining  are two prominent examples of this approach. Distance-based methods are computationally efficient, but the reduction of the observed data to a distance matrix presents a loss of information. Both UPGMA and Neighbor Joining can reconstruct the correct tree only under strong assumptions about the metric defined by the distance matrix and they are sensitive to violations of these assumptions.
Unlike distance-based methods, character-based methods follow character substitutions in the sequence explicitly. Maximum Parsimony is based on the minimum evolution principle and tries to find the tree that explains the data by the minimum number of mutations . The method has been applied successfully to the analysis of influenza virus sequence data . It is also computationally efficient, but lacks an explicit evolutionary model (other than minimum evolution), and it is not statistically consistent. On the other hand, probabilistic phylogenetic models have these desirable properties, but they come at a computational cost usually rendering exact maximum likelihood estimation impossible. However, approximate likelihood methods, such as DNAML  or QuartetPuzzling , and methods making use of Bayesian inference, such as MrBayes  are applied in practice. For both distance-based methods and probabilistic phylogenetic methods, the choice of an appropriate model of nucleotide substitution is critical.
The application of statistical genetics and phylogenetics methods to influenza sequence data has not only improved our understanding of the evolutionary dynamics and the epidemiology of the virus, but it has also become an integral part of the yearly vaccine design cycle. However, the successful case of influenza does not seem to provide a practical model for HIV. One reason for this discrepancy might be the evolutionary dynamics of HIV which are strikingly different from those of influenza. Rather than the drift-and-shift pattern of influenza evolution which generates only a small amount of genetic diversity around the successful trunk lineage, HIV tends to spread out from an ancestor in a radial fashion and to generate much more variation. The worldwide diversity of influenza sequences in any given year appears to be comparable to the diversity of HIV sequences found within a single infected individual at one time point . Thus, an HIV vaccine must stimulate a very broad reactive immune response against a large set of diverse viral strains and the genetic makeup of these sequences is much more difficult to predict from the currently circulating strains as compared to influenza. It is for these and possibly other reasons that the same bioinformatics-assisted vaccine design approach that is established for influenza, has not been equally successful for HIV to date. In the following sections, we will discuss extensions of the models discussed above as well as complementary mathematical and computational approaches that might be of help in search for an HIV vaccine in the future.
Evolutionary escape of HIV from selective immune pressure
HIV populations display a high genetic diversity due to the quasispecies nature of RNA virus replication [2, 87, 88]. HIV occurs in three main groups, the principal of which, group M, is composed of nine subtypes. Moreover, HIV has a great capacity for recombination among subtypes, and as a consequence, new circulating recombinant forms are constantly arising [89, 90]. Each subtype itself represents a great genetic diversity as well and even during infection of a single patient, HIV exists as an ensemble of different sequences . The high genetic diversity of HIV populations hampers the development of a vaccine of broad applicability. Vaccine candidates need to elicit responses against multiple epitopes in order to counteract the immune evasion by virus mutation [91, 92].
With one exception, the few attempts to bring HIV vaccines to the last phases of clinical trials have been quite disappointing so far. One of the first and most prominent vaccine candidates, the VAXGEN vaccine, was intended to immunize subjects with a recombinant envelope protein of HIV (rgp120) [93, 94]. The envelope protein is located on the surface of HIV and is responsible for the attachment of the virus to the host cell surface receptor . Antibodies targeted against this protein could block HIV infection and subsequently block virus entry into cells. During VAXGEN trials, the immunization induced the production of antibodies in vaccinated individuals, but they were unable to control infection or viremia.
The STEP vaccine trials were designed to test the efficiency of a T cell vaccine in reducing viremia and enhancing the cellular immune response . This vaccine candidate is a therapeutic vaccine (see below), because it was intended to enhance the immune response against HIV even if sterilizing immunity would not be achieved. The STEP vaccine formulation was based in an adenovirus serotype 5 vector (Ad5). The vaccine included three independent Ad5 vectors, each one carrying one of the three HIV proteins Gag, Pol, and Nef. Although in early phases of the clinical trials, the vaccine was proven to elicit specific anti-HIV T cell responses, no significant protection was conferred to people receiving the vaccine in phase IIb trials. Indeed, individuals having immune memory against the Ad5 vector were more susceptible to infection by HIV . This unexpected result was probably due to an adding-fuel-to-the-fire effect, in which the Ad5 vector activated T cells creating a suitable environment for HIV replication .
Recently, a combined vaccine of recombinant canarypox vector vaccine  plus a recombinant glycoprotein 120 subunit vaccine has been tested in phase III clinical trials in Thailand involving more than 16.000 subjects. The results showed a slight trend of protection in the vaccinated group when compared to the control group. Although these numbers are still debated  and vaccination did not affect the degree of viremia or CD4+ T cell count in HIV infected subjects, the results offer insight for future research.
In order to address the challenge of HIV sequence diversity, several new vaccine design strategies are explored based on combining different epitopes. These methods include the construction of pseudo-protein strings of T cell epitopes  and the synthetic scrambled antigen vaccine, which combines consensus overlapping peptide sets from HIV-1 proteins . Both approaches select codons, peptides, or sequences according to codon usage, covered diversity, and predicted HLA affinity, and randomize sequence fragments.
Vaccine design strategies based on whole viral protein sequences make extensive use of phylogenetics. In addition to the basic methods discussed above, HIV-specific probabilistic models of protein evolution have been constructed which allow for improved phylogenetic inference using likelihood or Bayesian methods . The reconstructed phylogenetic tree can guide the selection of viral genomes to be included in the vaccine. Different selection strategies have been proposed to stimulate a broad immune response and to minimize the amount of sequence divergence between the antigen and contemporaneously circulating viruses. Natural strains that represent the total observed sequence space or derived consensus sequences have been selected as vaccine strains [104, 105, 106]. Probabilistic phylogenetic models also allow for inferring the DNA sequences at internal nodes of the tree which represent extinct common ancestors. The most recent common ancestor (MRCA) of a given set of viruses is the root of the phylogenetic tree for these sequences . It has been proposed as a vaccine strain stimulating cross-reactive immune responses against all of its descendants [104, 108]. However, in asymmetric phylogenies, both the consensus sequence and the MRCA can perform poorly at minimizing the distance to contemporary strains . To address this limitation, the center of tree node has been proposed. It is calculated as the node minimizing the least squares distance to all leaves of the phylogenetic tree .
The usefulness of phylogenetic trees is limited in the presence of reticulate evolutionary events, such as hybridization, horizontal gene transfer, or recombination, which cannot be represented by a tree. For this situation, phylogenetic network models have been developed . They generalize phylogenetic tree models and include reticulate networks and split networks . In most RNA viruses, homologous recombination can occur when a cell is coinfected with two different strains. In HIV, multiple infections are common  and the recombination rate is on the order of 2 to 3 times per genome per replication cycle . Several epidemiological circulating recombinant forms provide evidence for recombination in HIV. Intra-host evolutionary dynamics are also shaped by recombination affecting the generation of multidrug-resistant strains in treated patients [115, 116] and the development of immune escape variants. Efficient parsimony algorithms have been developed for computing recombination networks [117, 118].
Immune responses to HIV infection vary depending on the genetic constellation of the human leukocyte antigen (HLA) locus. Antigen-specific T cell immunity is HLA-restricted and therefore mutations in HIV epitopes that allow escape from host immune responses are HLA allele-specific [119, 120]. Cytotoxic T lymphocyte (CTL) escape mutations have been shown to be stable under vertical transmission of the virus . Thus, CTL escape presents an important driving force in shaping HIV evolutionary patterns. Differential HLA-restricted viral evolution has been observed in several HIV-1 genes . From a vaccine design point of view, it is pivotal to characterize CTL escape quantitatively in order to address limitations of immune stimulations and to minimize the risk of viral evolutionary escape. This task involves identification of HLA-specific escape mutations and prediction of mutational escape pathways.
HIV polymorphisms associated with CTL escape leave HLA-specific footprints at the population level and hence detection of escape mutations is based on the comparative method. However, there are at least three confounding factors in this approach. First, viral sequences obtained from different hosts cannot be regarded as independent observations of the same stochastic process. Rather, these sequences share a common evolutionary history. Second, structural and functional limitations, such as the biophysics of three-dimensional protein structures, impose constraints on codon covariation and give rise to linkage disequilibrium (LD) in the HIV genome. Third, the host’s HLA genes are also in LD because of their close physical distance on human chromosome 6.
Phylogenetic dependency networks (PDNs) are a class of probabilistic graphical models that account for these confounding factors . PDNs explicitly represent selection pressure from multiple sources and model the dependency structure among escape mutations conditioned on a phylogenetic tree of the observed viruses and on the HLA types of their hosts. By analyzing 1000 individuals from a multicenter cohort, the statistical model identified a dense network of interactions between HLA alleles and HIV codons, as well as among HIV codons, reflecting the complexity but also the consistency of HIV adaptation to immune response .
HIV mutational pathways have also been modeled in the context of evolutionary escape from the selective pressure of antiretroviral therapy. Several probabilistic models have been developed for describing the accumulation of amino acid changes in response to specific drugs or drug combinations, including Markov chains , Bayesian networks , mutagenetic trees [126, 127, 128], and conjunctive Bayesian networks (CBNs) . CBNs are a class of probabilistic graphical models that describe the order in which mutations occur. In this model, the partial order of mutations that maximizes the likelihood of the data can be learned efficiently from observed mutational patterns. CBNs allow for several escape paths with different probabilities and the partial order restricts the viral genotype space to the subset of those mutational patterns compatible with the partial order constraints (Figure 1). The rationale for this approach is that many combinations of mutations are never observed, for example, because they result in inviable viruses, or because they are too far away in sequence space from current strains to be reached on the relevant time scale. CBNs can explicitly represent the timeline of mutations occurring along escape pathways  and they have been extended to account for noisy observations .
Computational models of viral escape dynamics have been applied successfully in the design of optimal combination therapies [132, 133]. Because the development of drug resistance is a major factor for treatment failure, not only the current resistance profile, but also the likelihood of evolving resistant viruses is a strong predictor of therapeutic outcome. The difficulty for the virus to escape from the applied selective drug pressure is known as the genetic barrier and it can be computed based on probabilistic models of accumulating mutations . Retrospective analyses of large observational clinical databases have demonstrated that estimates of the genetic barrier based on viral progression models are independent predictors of treatment outcome. The genetic barrier improves therapy outcome predictions and the resulting models outperform standard-of-care expert rule-based treatment recommendations [134, 135]. Therefore, computational models of viral escape dynamics might also be useful for vaccine design. A successful HIV vaccine should not only minimize the distance to currently circulating strains, but also anticipate possible immune escape pathways of the virus. Although it is unlikely that the complete picture of escape pathways can be learned from data, improvements in terms of hindered and delayed escape might be possible, especially in the context of therapeutic vaccines where selective immune and drug pressure together may constrain virus evolution significantly and result in control of infection.
Besides genetic heterogeneity, the development of an efficient vaccine against HIV remains elusive because of the difficulties of inducing an efficient immune response [98, 136]. Individuals who control infection display a strong cellular response [137, 138, 139]. Many experimental vaccines have failed in directing the effector response to a more cellular profile . Furthermore, HIV infection induces a low titer of neutralizing antibodies [141, 142]. Gp120 and gp41 are the HIV proteins exposed at the surface of the virion. These proteins are responsible for the attachment of HIV to the cell surface and the virus has developed several strategies to avoid recognition and blocking by antibodies. The region of the protein that interacts with the CD4 cellular receptor is a hypervariable loop. A great number of antibody-escape mutants are mapped to this region of the HIV genome. The loop however, is highly glycosylated and it is only exposed at the surface of the protein in the precise moment of the interaction with the cellular receptor . These two combined features complicate the fitting of potentially neutralizing antibodies .
HIV is a member of the retrovirus family with the ability to integrate its genome into the host cell genome . Genome integration is another challenge to develop an effective vaccine because latently infected cells cannot be recognized by the immune system until the integrated provirus is activated .
After several attempts to obtain a protective vaccine against HIV, current efforts have shifted toward developing therapeutic vaccines which help to control infection. A vaccine that elicits an incomplete response may be sufficient to keep viral load in controllable levels. The vaccine would not prevent infection by HIV, but it would delay or prevent the progression to AIDS, the final stage of the disease . The reduction in viral load would also reduce the number of secondary transmissions of the infection, because the efficiency of transmission depends on viral load levels.
Virus dynamics and therapeutic vaccines
A viral infection is a complex molecular process, but it can be approximated by a few major steps (Figure 2). Initially, the virus needs to find a susceptible cell and enter into it. Once inside the cell, virus replication starts and viral offspring is released. Infected cells will eventually die. Released virus particles are either inactivated or they hit a new susceptible cell, in which case the infection cycle is closed and a chain reaction of sequential infections is triggered [144, 145, 146]. The dynamics of the viral replication cycle can be expressed in mathematical terms as follows:
This ordinary differential equation (ODE) system describes uninfected cells, x, being infected with efficiency β, infected cells, y, dying and releasing viral offspring at rate a, and free virus, v, being produced at rate k and inactivated at rate u. In the absence of viral infection, cells reproduce at rate λ and die at rate d. Oversimplifying the role of the immune system, the immune cells, z, grow and die with rates c and b, respectively. They remove infected cells from the system with efficiency p. Each specific viral family may give rise to modifications of this model due to variations in its life cycle. But the ODE system is the core of a family of mathematical models that describe the turnover of viruses and cells during an infection.
Virus dynamics models have been successfully employed to the study of simian immunodeficiency viruses (SIV), HIV, and HBV, among others [144, 146, 147, 148, 149, 150]. The dynamics of this model are shown in Figure 3. The model is an epidemiological SIR model , in which infection is treated as a microepidemic and host cells play the role of infected or susceptible individuals. Whether the virus infection can spread in the cell pool or not depends on a condition very similar to the spread of an epidemic in a population of individuals. The parameters of the model must satisfy the inequality R0 > 1, where R0 is the basic reproductive number, defined as the number of newly infected cells that arise from any one infected cell when almost all cells are uninfected [144, 151]. For a’ = cp/b, this number is given by
For generic parameters, if R0 > 1 uninfected cells become infected and produce progeny viruses exponentially. Activation of the immune system (a’ > 1) reduces the value of R0 and slows down the spread of the infection. At the beginning of the infection, before the immune response is mounted (a’ ≈ 0), and after the initial peak of viral load, viruses and infected and uninfected cells reach a stable equilibrium termed viral set point (Figure 3). While monitoring viral load of SIV in infected macaques, a correlation between the viral load at initial stages of the infection and the viral set point was observed [152, 153]. One can demonstrate that the equilibrium abundance of viruses, v*, and the logarithm of the virus load during the exponential growth phase, follow the linear relationship
This result is important in HIV research, because several studies indicate that there is a positive correlation between viral load and disease progression. Individuals who display a lower viral load during the first stage of the infection have higher chances to survive and control the infection .
From the virus dynamics point of view, a successful vaccine is one that boosts the immune response, i.e., increases the parameter a’, such that R0 is reduced below the critical value of 1. Under such conditions, the infection will initially grow, but then viral load will rapidly decline and the infection will eventually be cleared from the organism. However, even if the vaccine induces an immune response that is not strong enough to reduce R0 to a value below 1, it may still be beneficial. As described above, slowing down the initial exponential increase of the viral load has a negative effect on the viral set point. This type of imperfect vaccine is also called a therapeutic vaccine. In the case of HIV, it could help infected patients to control disease progression in two ways. First, it may reduce viral load during the chronic phase of the infection, and second it may reduce viremia below the threshold of inter-host virus transmission (Figure 3) [137, 144, 155, 156, 157].
On the epidemiological level, mathematical models of inter-host transmission of a given viral pathogen can predict the portion of individuals within a population that needs to be vaccinated in order to achieve herd immunity (HI). HI occurs if partial vaccination provides protection also to unvaccinated individuals and the population is protected against invasion of the pathogen. The critical proportion of immune individuals above which the disease cannot persist is the HI threshold
Here R0 is the basic reproductive number of the pathogen in the host population, rather than in the cell population of a single host as discussed above. The higher the reproductive ratio of the disease, the more individuals need to be effectively protected by vaccination. The HI threshold varies not only with the virulence of the disease, but also with the efficacy of the vaccine and the contact dynamics of the host population.
Attenuation strategies and the evolution of virulence in RNA viruses
Individual viruses inside the quasispecies can display very different degrees of virulence. This trait is measured as the cell killing capacity, when considering cell culture experiments, and as the host morbidity or mortality, when focusing on whole-organism pathology [158, 159]. Replicative fitness is also highly variable among viruses of the same quasispecies, and in general, not directly correlated to virulence . Understanding the evolution of these two viral traits is fundamental to understanding and controlling the spread of viral diseases and to the design of LAVs.
Since attenuated viruses may elicit a potent immune response without causing harm to the host, several strategies are being explored to obtain candidate viruses for LAVs, i.e., viruses displaying a reduced cell killing or replicative ability. Serial cytolytic transfers in cell culture tend to select viruses attenuated in the original host . This is the case for FMDV or yellow fever virus among other viruses, and is currently an approved technique for several LAV preparations. Viruses selected after severe bottlenecks, such as serial plaque-to-plaque transfers, present a reduced fitness due to the accumulation of mutations associated with the Müller’s ratchet effect [161, 162]. A new promising strategy for the attenuation of viruses is the rational design and synthesis of viral genomes with a strong codon bias. This approach has been implemented for Poliovirus and influenza virus [163, 164]. The viral genome synthesized encodes the same amino acid sequence as the wild type virus but encoded by infrequent codons in their host cells. Viruses harboring fidelity mutations in the replicase genes tend to produce a quasispecies of lower diversity and to be attenuated in vivo. This feature has also been employed to the rational design of a Poliovirus LAV . Other strategies conceived to limit viral replication include the design of specific microRNAs or zinc finger nucleases targeting the viral genome .
Many mathematical models have been developed to describe the evolution of virulence in diverse viral populations. One conclusion of these theoretical studies is that virulence can increase in the population under a variety of conditions . The basic model of virus dynamics, however, states that less cytopathic variants are more productive in the long term of the infection, because the abundance of both viruses and infected cells is inversely correlated with the cytopathogenicity, a, of the viral variants . For R0> 1,
Recently, a FMDV population has been reported to diversify into two genetically distinct subpopulations that also differ in virulence. The viral variants have been characterized phenotypically in considerable detail and their coevolutionary dynamics, when competing for the same cell pool in vitro, have been analyzed. The competition experiment has been described by an extension of the basic model of virus dynamics introduced above.
The experimental and theoretical results indicate that less virulent strains are more efficient in outcompeting the virulent ones in coinfected cells. Therefore, the fitness of variants of different virulence is density-dependent . The cell competition model offers an explanation of several previous observations of suppression of high fitness mutants in dissimilar viral systems [169, 170, 171, 172]. This density-dependent selection due to varying efficiency of viral replication alone or in coinfection is reminiscent of the concept of a competition-colonization trade-off in ecology . Here, virulent viral variants play the role of colonizers and viruses efficient within coinfected cells are competitors. The attenuating effect of competitor-colonizer competition appears even more pronounced if many viruses from a broad spectrum of virulence are considered.
Viral evolution and the genetic diversity it produces are fundamental factors for the success of vaccine candidates, because immune responses need to be stimulated against a potentially very broad spectrum of existing viruses and new viral immune escape variants are likely to be generated. Mathematical modeling of viral evolutionary dynamics will therefore play an increasingly important role in vaccine design. It can identify genomic regions that are under selective pressure, support the selection or construction of vaccine strains, predict evolutionary escape from immune pressure, guide vaccination campaigns, estimate the effect of therapeutic vaccines, and support the design of new attenuation strategies. Most of our discussion has been in the context of RNA viruses and many issues are most pronounced for this class of viruses. Nevertheless, we expect most statistical models and computational methods to be applicable to other viruses and different pathogens, too. On the other hand, the distinct evolutionary dynamics of influenza A and HIV-1, two of the most widely studied RNA viruses, have highlighted the need for careful analysis of viral infection dynamics within and among individuals.
This article has been published as part of Immunome Research Volume 6 Supplement 2, 2010: Computational Vaccinology: State-of-the-art Assessments. The full contents of the supplement are available online at http://www.immunome-research.com/supplements/6/S2.
Vignuzzi M, Stone JK, Arnold JJ, Cameron CE, Andino R: Quasispecies diversity determines pathogenesis through cooperative interactions in a viral population. Nature. 2006, 439 (7074): 344-348. 10.1038/nature04388.PubMed CentralPubMedGoogle Scholar
Bonhoeffer S, Sniegowski P: Virus evolution: the importance of being erroneous. Nature. 2002, 420 (6914): 367-369. 10.1038/420367a.PubMedGoogle Scholar
Bonhoeffer S, Nowak MA: Pre-existence and emergence of drug resistance in HIV-1 infection. Proc Biol Sci. 1997, 264 (1382): 631-637. 10.1098/rspb.1997.0089.PubMed CentralPubMedGoogle Scholar
Martin V, Domingo E: Influence of the mutant spectrum in viral evolution: focused selection of antigenic variants in a reconstructed viral quasispecies. Mol Biol Evol. 2008, 25: 1544-1554. 10.1093/molbev/msn099.PubMedGoogle Scholar
Eigen M, McCaskill J, Schuster P: Molecular quasi-species. J Phys Chem. 1988, 92 (24): 6881-6891. 10.1021/j100335a010.Google Scholar
Domingo E, Holland JJ: RNA virus mutations and fitness for survival. Annu Rev Microbiol. 1997, 51: 151-178. 10.1146/annurev.micro.51.1.151.PubMedGoogle Scholar
Holland JJ, Torre JCDL, Steinhauer DA: RNA virus populations as quasispecies. Curr Top Microbiol Immunol. 1992, 176: 1-20.PubMedGoogle Scholar
Kamp C, Wilke CO, Adami C, Bornholdt S: Viral evolution under the pressure of an adaptive immune system: Optimal mutation rates for viral escape. Complexity. 2003, 8 (2): 28-33. 10.1002/cplx.10067.Google Scholar
Ojosnegros S, Agudo R, Sierra M, Briones C, Sierra S, Gonzalez-Lopez C, Domingo E, Cristina J: Topology of evolving, mutagenized viral populations: quasispecies expansion, compression, and operation of negative selection. BMC Evol Biol. 2008, 8: 207-10.1186/1471-2148-8-207.PubMed CentralPubMedGoogle Scholar
Eriksson N, Pachter L, Mitsuya Y, Rhee S-Y, Wang C, Gharizadeh B, Ronaghi M, Shafer RW, Beerenwinkel N: Viral population estimation using pyrosequencing. PLoS Comput Biol. 2008, 4 (4): e1000074-10.1371/journal.pcbi.1000074.PubMed CentralPubMedGoogle Scholar
Pybus OG, Rambaut A: Evolutionary analysis of the dynamics of viral infectious disease. Nat Rev Genet. 2009, 10 (8): 540-550. 10.1038/nrg2583.PubMedGoogle Scholar
Korber B, Muldoon M, Theiler J, Gao F, Gupta R, Lapedes A, Hahn BH, Wolinsky S, Bhattacharya T: Timing the ancestor of the HIV-1 pandemic strains. Science. 2000, 288 (5472): 1789-1796. 10.1126/science.288.5472.1789.PubMedGoogle Scholar
Lemey P, Pybus OG, Wang B, Saksena NK, Salemi M, Vandamme A-M: Tracing the origin and history of the HIV-2 epidemic. Proc Natl Acad Sci U S A. 2003, 100 (11): 6588-6592. 10.1073/pnas.0936469100.PubMed CentralPubMedGoogle Scholar
Rambaut A, Pybus OG, Nelson MI, Viboud C, Taubenberger JK, Holmes EC: The genomic and epidemiological dynamics of human influenza A virus. Nature. 2008, 453 (7195): 615-619. 10.1038/nature06945.PubMed CentralPubMedGoogle Scholar
Bush RM, Bender CA, Subbarao K, Cox NJ, Fitch WM: Predicting the evolution of human influenza A. Science. 1999, 286 (5446): 1921-1925. 10.1126/science.286.5446.1921.PubMedGoogle Scholar
Martins : Adaptation and the comparative method. Trends Ecol Evol. 2000, 15 (7): 296-299. 10.1016/S0169-5347(00)01880-2.PubMedGoogle Scholar
Domingo E, Menendez-Arias L, Quinones-Mateu ME, Holguin A, Gutierrez-Rivas M, Martinez MA, Quer J, Novella IS, Holland JJ: Viral quasispecies and the problem of vaccine-escape and drug-resistant mutants. Prog Drug Res. 1997, 48: 99-128.PubMedGoogle Scholar
Smith RA, Kirkpatrick W: Ribavirin: a broad spectrum antiviral agent. 1980, New York: Academic Press, Inc.Google Scholar
McCormick JB, King IJ, Webb PA, Scribner CL, Craven RB, Johnson KM, Elliott LH, Belmont-Williams R: Lassa fever. Effective therapy with ribavirin. N Engl J Med. 1986, 314 (1): 20-26. 10.1056/NEJM198601023140104.PubMedGoogle Scholar
Graham BS, Crowe JE: Immunization against viral diseases. Fields Virology. Edited by: Knipe DM, Howley PM. 2007, Philadelphia: Lappincott Williams & Wilkins, 12 (z): 389-421. 5,Google Scholar
Sabin AB: Properties and behavior of orally administered attenuated poliovirus vaccine. J Am Med Assoc. 1957, 164 (11): 1216-1223.PubMedGoogle Scholar
Theiler M, Smith HH: The Use of Yellow Fever Virus Modified by in Vitro Cultivation for Human Immunization. J Exp Med. 1937, 65 (6): 787-800. 10.1084/jem.65.6.787.PubMed CentralPubMedGoogle Scholar
Murdin AD, Barreto L, Plotkin S: Inactivated poliovirus vaccine: past and present experience. Vaccine. 1996, 14 (8): 735-746. 10.1016/0264-410X(95)00211-I.PubMedGoogle Scholar
Janeway A, Travers P, Walport M, Shlomchik M: Immunobiology. The immune system in health and disease. 2001, New York: Garland Publishing, 5Google Scholar
Bahnemann HG: Inactivation of viral antigens for vaccine preparation with particular reference to the application of binary ethylenimine. Vaccine. 1990, 8 (4): 299-303. 10.1016/0264-410X(90)90083-X.PubMedGoogle Scholar
Barteling SJ: Modern Inactivated Foot-and-Mouth Disease (FMD) Vaccines: Historical Background and Key Elements in Production and Use. Foot-and-mouth Disease: Current Perspectives. Edited by: Sobrino F, Domingo E. Wymondham. 2004, England: Horizon Bioscience, 305-334.Google Scholar
Andre FE: Universal mass vaccination against hepatitis A. Curr Top Microbiol Immunol. 2006, 304: 95-114. full_text.PubMedGoogle Scholar
Mosca F, Tritto E, Muzzi A, Monaci E, Bagnoli F, Iavarone C, O'Hagan D, Rappuoli R, De Gregorio E: Molecular and cellular signatures of human vaccine adjuvants. Proc Natl Acad Sci U S A. 2008, 105 (30): 10501-10506. 10.1073/pnas.0804699105.PubMed CentralPubMedGoogle Scholar
Freund J, Thomson KJ: A Simple, Rapid Technic of Preparing Water-in-Oil Emulsions of Penicillin, Drugs and Biologics. Science. 1945, 101 (2627): 468-469. 10.1126/science.101.2627.468-a.PubMedGoogle Scholar
DiMarchi R, Brooke G, Gale C, Cracknell V, Doel T, Mowat N: Protection of cattle against foot-and-mouth disease by a synthetic peptide. Science. 1986, 232 (4750): 639-641. 10.1126/science.3008333.PubMedGoogle Scholar
Zanetti AR, Van Damme P, Shouval D: The global impact of vaccination against hepatitis B: a historical overview. Vaccine. 2008, 26 (49): 6266-6273. 10.1016/j.vaccine.2008.09.056.PubMedGoogle Scholar
Daniel MD, Kirchhoff F, Czajak SC, Sehgal PK, Desrosiers RC: Protective effects of a live attenuated SIV vaccine with a deletion in the nef gene. Science. 1992, 258 (5090): 1938-1941. 10.1126/science.1470917.PubMedGoogle Scholar
Mason PW, Piccone ME, McKenna TS, Chinsangaram J, Grubman MJ: Evaluation of a live-attenuated foot-and-mouth disease virus as a vaccine candidate. Virology. 1997, 227 (1): 96-102. 10.1006/viro.1996.8309.PubMedGoogle Scholar
Andino R, Silvera D, Suggett SD, Achacoso PL, Miller CJ, Baltimore D, Feinberg MB: Engineering poliovirus as a vaccine vector for the expression of diverse antigens. Science. 1994, 265 (5177): 1448-1451. 10.1126/science.8073288.PubMedGoogle Scholar
Sullivan NJ, Sanchez A, Rollin PE, Yang ZY, Nabel GJ: Development of a preventive vaccine for Ebola virus infection in primates. Nature. 2000, 408 (6812): 605-609. 10.1038/35046108.PubMedGoogle Scholar
Yang ZY, Kong WP, Huang Y, Roberts A, Murphy BR, Subbarao K, Nabel GJ: A DNA vaccine induces SARS coronavirus neutralization and protective immunity in mice. Nature. 2004, 428 (6982): 561-564. 10.1038/nature02463.PubMedGoogle Scholar
Fitch WM, Leiter JM, Li XQ, Palese P: Positive Darwinian evolution in human influenza A viruses. Proc Natl Acad Sci U S A. 1991, 88 (10): 4270-4274. 10.1073/pnas.88.10.4270.PubMed CentralPubMedGoogle Scholar
Ina Y, Gojobori T: Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses. Proc Natl Acad Sci U S A. 1994, 91 (18): 8388-8392. 10.1073/pnas.91.18.8388.PubMed CentralPubMedGoogle Scholar
Bush RM, Fitch WM, Bender CA, Cox NJ: Positive selection on the H3 hemagglutinin gene of human influenza virus A. Mol Biol Evol. 1999, 16 (11): 1457-1465.PubMedGoogle Scholar
Koelle K, Cobey S, Grenfell B, Pascual M: Epochal evolution shapes the phylodynamics of interpandemic influenza A (H3N2) in humans. Science. 2006, 314 (5807): 1898-1903. 10.1126/science.1132745.PubMedGoogle Scholar
van Nimwegen E: Epidemiology. Influenza escapes immunity along neutral networks. Science. 2006, 314 (5807): 1884-1886. 10.1126/science.1137300.PubMedGoogle Scholar
Korber B, Gaschen B, Yusim K, Thakallapally R, Kesmir C, Detours V: Evolutionary and immunological implications of contemporary HIV-1 variation. Br Med Bull. 2001, 58: 19-42. 10.1093/bmb/58.1.19.PubMedGoogle Scholar
Yang X, Yu X: An introduction to epitope prediction methods and software. Rev Med Virol. 2009, 19 (2): 77-96. 10.1002/rmv.602.PubMedGoogle Scholar
Toussaint NC, Dönnes P, Kohlbacher O: A mathematical framework for the selection of an optimal set of peptides for epitope-based vaccines. PLoS Comput Biol. 2008, 4 (12): e1000246-10.1371/journal.pcbi.1000246.PubMed CentralPubMedGoogle Scholar
Ewens WJ: The sampling theory of selectively neutral alleles. Theor Popul Biol. 1972, 3 (1): 87-112. 10.1016/0040-5809(72)90035-4.PubMedGoogle Scholar
McDonald JH, Kreitman M: Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991, 351 (6328): 652-654. 10.1038/351652a0.PubMedGoogle Scholar
Goldman N, Yang Z: A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol. 1994, 11 (5): 725-736.PubMedGoogle Scholar
Plotkin JB, Dushoff J: Codon bias and frequency-dependent selection on the hemagglutinin epitopes of influenza A virus. Proc Natl Acad Sci U S A. 2003, 100 (12): 7152-7157. 10.1073/pnas.1132114100.PubMed CentralPubMedGoogle Scholar
Michener CD, Sokal RR: A quantitative approach to a problem in classification. Evolution. 1957, 11: 130-162. 10.2307/2406046.Google Scholar
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.PubMedGoogle Scholar
Fitch WM, Margoliash E: Construction of phylogenetic trees. Science. 1967, 155 (760): 279-284. 10.1126/science.155.3760.279.PubMedGoogle Scholar
Fitch WM, Bush RM, Bender CA, Cox NJ: Long term trends in the evolution of H(3) HA1 human influenza type A. Proc Natl Acad Sci U S A. 1997, 94 (15): 7712-7718. 10.1073/pnas.94.15.7712.PubMed CentralPubMedGoogle Scholar
Strimmer K, von Haeseler A: Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies. Mol Biol Evol. 1996, 13: 964-969.Google Scholar
Wolinsky SM, Korber BT, Neumann AU, Daniels M, Kunstman KJ, Whetsell AJ, Furtado MR, Cao Y, Ho DD, Safrit JT: Adaptive evolution of human immunodeficiency virus-type 1 during the natural course of infection. Science. 1996, 272 (5261): 537-542. 10.1126/science.272.5261.537.PubMedGoogle Scholar
Meyerhans A, Cheynier R, Albert J, Seth M, Kwok S, Sninsky J, Morfeldt-Manson L, Asjo B, Wain-Hobson S: Temporal fluctuations in HIV quasispecies in vivo are not reflected by sequential HIV isolations. Cell. 1989, 58 (5): 901-910. 10.1016/0092-8674(89)90942-2.PubMedGoogle Scholar
Jung A, Maier R, Vartanian JP, Bocharov G, Jung V, Fischer U, Meese E, Wain-Hobson S, Meyerhans A: Multiply infected spleen cells in HIV patients. Nature. 2002, 418 (6894): 144-10.1038/418144a.PubMedGoogle Scholar
Wain-Hobson S, Renoux-Elbe C, Vartanian JP, Meyerhans A: Network analysis of human and simian immunodeficiency virus sequence sets reveals massive recombination resulting in shorter pathways. J Gen Virol. 2003, 84 (Pt 4): 885-895. 10.1099/vir.0.18894-0.PubMedGoogle Scholar
Novella IS, Domingo E, Holland JJ: Rapid viral quasispecies evolution: implications for vaccine and drug strategies. Molecular Medicine Today. 1995, 1: 248-253. 10.1016/S1357-4310(95)91551-6.PubMedGoogle Scholar
Domingo E, Menéndez-Arias L, Quiñones-Mateu ME, Holguín A, Gutiérrez-Rivas M, Martínez MA, Quer J, Novella IS, Holland JJ: Viral quasispecies and the problem of vaccine-escape and drug-resistant mutants. Prog Drug Res. 1997, 48: 99-128.PubMedGoogle Scholar
Gilbert PB, Ackers ML, Berman PW, Francis DP, Popovic V, Hu DJ, Heyward WL, Sinangil F, Shepherd BE, Gurwith M: HIV-1 virologic and immunologic progression and initiation of antiretroviral therapy among HIV-1-infected subjects in a trial of the efficacy of recombinant glycoprotein 120 vaccine. J Infect Dis. 2005, 192 (6): 974-983. 10.1086/432734.PubMedGoogle Scholar
Gilbert PB, Peterson ML, Follmann D, Hudgens MG, Francis DP, Gurwith M, Heyward WL, Jobes DV, Popovic V, Self SG, et al: Correlation between immunologic responses to a recombinant glycoprotein 120 vaccine and incidence of HIV-1 infection in a phase 3 HIV-1 preventive vaccine trial. J Infect Dis. 2005, 191 (5): 666-677. 10.1086/428405.PubMedGoogle Scholar
Coffin JM, Hughes SH, Varmus HE: Retroviruses. 1997, New York: Cold Spring Harbor Laboratory PressGoogle Scholar
Buchbinder SP, Mehrotra DV, Duerr A, Fitzgerald DW, Mogg R, Li D, Gilbert PB, Lama JR, Marmor M, Del Rio C, et al: Efficacy assessment of a cell-mediated immunity HIV-1 vaccine (the Step Study): a double-blind, randomised, placebo-controlled, test-of-concept trial. Lancet. 2008, 372 (9653): 1881-1893. 10.1016/S0140-6736(08)61591-3.PubMed CentralPubMedGoogle Scholar
Sekaly RP: The failed HIV Merck vaccine study: a step back or a launching point for future vaccine development?. J Exp Med. 2008, 205 (1): 7-12. 10.1084/jem.20072681.PubMed CentralPubMedGoogle Scholar
Walker BD, Burton DR: Toward an AIDS vaccine. Science. 2008, 320 (5877): 760-764. 10.1126/science.1152622.PubMedGoogle Scholar
Rerks-Ngarm S, Pitisuttithum P, Nitayaphan S, Kaewkungwal J, Chiu J, Paris R, Premsri N, Namwat C, de Souza M, Adams E, et al: Vaccination with ALVAC and AIDSVAX to prevent HIV-1 infection in Thailand. N Engl J Med. 2009, 361 (23): 2209-2220. 10.1056/NEJMoa0908492.PubMedGoogle Scholar
Cohen J: HIV/AIDS research. Beyond Thailand: making sense of a qualified AIDS vaccine "success". Science. 2009, 326 (5953): 652-653. 10.1126/science.326_652.PubMedGoogle Scholar
Groot ASD, Marcon L, Bishop EA, Rivera D, Kutzler M, Weiner DB, Martin W: HIV vaccine development by computer assisted design: the GAIA vaccine. Vaccine. 2005, 23 (17-18): 2136-2148. 10.1016/j.vaccine.2005.01.097.PubMedGoogle Scholar
Thomson SA, Jaramillo AB, Shoobridge M, Dunstan KJ, Everett B, Ranasinghe C, Kent SJ, Gao K, Medveckzy J, Ffrench RA, et al: Development of a synthetic consensus sequence scrambled antigen HIV-1 vaccine designed for global use. Vaccine. 2005, 23 (38): 4647-4657. 10.1016/j.vaccine.2005.04.045.PubMedGoogle Scholar
Nickle DC, Heath L, Jensen MA, Gilbert PB, Mullins JI, Pond SLK: HIV-specific probabilistic models of protein evolution. PLoS One. 2007, 2 (6): e503-10.1371/journal.pone.0000503.PubMed CentralPubMedGoogle Scholar
Gaschen B, Taylor J, Yusim K, Foley B, Gao F, Lang D, Novitsky V, Haynes B, Hahn BH, Bhattacharya T, et al: Diversity considerations in HIV-1 vaccine selection. Science. 2002, 296 (5577): 2354-2360. 10.1126/science.1070441.PubMedGoogle Scholar
Gao F, Korber BT, Weaver E, Liao H-X, Hahn BH, Haynes BF: Centralized immunogens as a vaccine strategy to overcome HIV-1 diversity. Expert Rev Vaccines. 2004, 3 (4 Suppl): S161-S168. 10.1586/14760584.3.4.S161.PubMedGoogle Scholar
Gao F, Weaver EA, Lu Z, Li Y, Liao H-X, Ma B, Alam SM, Scearce RM, Sutherland LL, Yu J-S, et al: Antigenicity and immunogenicity of a synthetic human immunodeficiency virus type 1 group m consensus envelope glycoprotein. J Virol. 2005, 79 (2): 1154-1163. 10.1128/JVI.79.2.1154-1163.2005.PubMed CentralPubMedGoogle Scholar
Stewart CB: Molecular evolution. Active ancestral molecules. Nature. 1995, 374 (6517): 12-13. 10.1038/374012a0.PubMedGoogle Scholar
Doria-Rose NA, Learn GH, Rodrigo AG, Nickle DC, Li F, Mahalanabis M, Hensel MT, McLaughlin S, Edmonson PF, Montefiori D, et al: Human immunodeficiency virus type 1 subtype B ancestral envelope protein is functional and elicits neutralizing antibodies in rabbits similar to those elicited by a circulating subtype B envelope. J Virol. 2005, 79 (17): 11214-11224. 10.1128/JVI.79.17.11214-11224.2005.PubMed CentralPubMedGoogle Scholar
Nickle DC, Jensen MA, Gottlieb GS, Shriner D, Learn GH, Rodrigo AG, Mullins JI: Consensus and ancestral state HIV vaccines. Science. 2003, 299 (5612): 1515-1518. 10.1126/science.299.5612.1515c. author reply 1515-1518PubMedGoogle Scholar
Nickle DC, Rolland M, Jensen MA, Pond SLK, Deng W, Seligman M, Heckerman D, Mullins JI, Jojic N: Coping with viral diversity in HIV vaccine design. PLoS Comput Biol. 2007, 3 (4): e75-10.1371/journal.pcbi.0030075.PubMed CentralPubMedGoogle Scholar
Huson DH, Bryant D: Application of Phylogenetic Networks in E volutionary Studies. Mol Biol Evol. 2005, 23 (2): 254-267. 10.1093/molbev/msj030.PubMedGoogle Scholar
Bandelt HJ, Dress AW: Split decomposition: a new and useful approach to phylogenetic analysis of distance data. Mol Phylogenet Evol. 1992, 1 (3): 242-252. 10.1016/1055-7903(92)90021-8.PubMedGoogle Scholar
Jung A, Maier R, Vartanian J-P, Bocharov G, Jung V, Fischer U, Meese E, Wain-Hobson S, Meyerhans A: Multiply infected spleen cells in HIV patients. Nature. 2002, 418 (6894): 144-10.1038/418144a.PubMedGoogle Scholar
Zhuang J, Jetzt AE, Sun G, Yu H, Klarmann G, Ron Y, Preston BD, Dougherty JP: Human immunodeficiency virus type 1 recombination: rate, fidelity, and putative hot spots. J Virol. 2002, 76 (22): 11273-11282. 10.1128/JVI.76.22.11273-11282.2002.PubMed CentralPubMedGoogle Scholar
Burke DS: Recombination in HIV: an important viral evolutionary strategy. Emerg Infect Dis. 1997, 3 (3): 253-259. 10.3201/eid0303.970301.PubMed CentralPubMedGoogle Scholar
Bretscher MT, Althaus CL, Müller V, Bonhoeffer S: Recombination in HIV and the evolution of drug resistance: for better or for worse?. Bioessays. 2004, 26 (2): 180-188. 10.1002/bies.10386.PubMedGoogle Scholar
Hein JJ: Method to Reconstruct the History of Sequences Subject to Recombination. J Mol Evol. 1993, 20: 402-411.Google Scholar
Moore CB, John M, James IR, Christiansen FT, Witt CS, Mallal SA: Evidence of HIV-1 adaptation to HLA- restricted immune responses at a population level. Science. 2002, 296 (5572): 1439-1443. 10.1126/science.1069660.PubMedGoogle Scholar
Ahlenstiel G, Roomp K, Däumer M, Nattermann J, Vogel M, Rockstroh JK, Beerenwinkel N, Kaiser R, Nischalke H-D, Sauerbruch T, et al: Selective pressures of HLA genotypes and antiviral therapy on human immunodeficiency virus type 1 sequence mutation at a population level. Clin Vaccine Immunol. 2007, 14 (10): 1266-1273. 10.1128/CVI.00169-07.PubMed CentralPubMedGoogle Scholar
Goulder PJ, Brander C, Tang Y, Tremblay C, Colbert RA, Addo MM, Rosenberg ES, Nguyen T, Allen R, Trocha A, et al: Evolution and transmission of stable CTL escape mutations in HIV infection. Nature. 2001, 412 (6844): 334-338. 10.1038/35085576.PubMedGoogle Scholar
Brumme ZL, Brumme CJ, Heckerman D, Korber BT, Daniels M, Carlson J, Kadie C, Bhattacharya T, Chui C, Szinger J, et al: Evidence of Differential HLA Class I-Mediated Viral Evolution in Functional and Accessory/Regulatory Genes of HIV-1. PLoS Pathog. 2007, 3 (7): e94-10.1371/journal.ppat.0030094.PubMed CentralPubMedGoogle Scholar
Carlson JM, Brumme ZL, Rousseau CM, Brumme CJ, Matthews P, Kadie C, Mullins JI, Walker BD, Harrigan PR, Goulder PJR, et al: Phylogenetic dependency networks: inferring patterns of CTL escape and codon covariation in HIV-1 Gag. PLoS Comput Biol. 2008, 4 (11): e1000225-10.1371/journal.pcbi.1000225.PubMed CentralPubMedGoogle Scholar
Foulkes AS, DeGruttola V: Characterizing the Progression of Viral Mutations Over Time. J Am Stat Assoc. 2003, 98 (464): 859-867. 10.1198/016214503000000792.Google Scholar
Deforche K, Silander T, Camacho R, Grossman Z, Soares MA, Laethem KV, Kantor R, Moreau Y, Vandamme AM, non BW: Analysis of HIV-1 pol sequences using B ayesian Networks: implications for drug resistance. Bioinformatics. 2006, 22 (24): 2975-2979. 10.1093/bioinformatics/btl508.PubMedGoogle Scholar
Beerenwinkel N, Rahnenführer J, Däumer M, Hoffmann D, Kaiser R, Selbig J, Lengauer T: Learning multiple evolutionary pathways from cross-sectional data. J Comput Biol. 2005, 12 (6): 584-598. 10.1089/cmb.2005.12.584.PubMedGoogle Scholar
Beerenwinkel N, Däumer M, Sing T, Rahnenführer J, Lengauer T, Selbig J, Hoffmann D, Kaiser R: Estimating HIV evolutionary pathways and the genetic barrier to drug resistance. J Infect Dis. 2005, 191 (11): 1953-1960. 10.1086/430005.PubMedGoogle Scholar
Beerenwinkel N, Drton M: A mutagenetic tree hidden Markov model for longitudinal clonal HIV sequence data. Biostatistics. 2007, 8 (1): 53-71. 10.1093/biostatistics/kxj033.PubMedGoogle Scholar
Beerenwinkel N, Eriksson N, Sturmfels B: Conjunctive Bayesian networks. Bernoulli. 2007, 13 (4): 893-909. 10.3150/07-BEJ6133.Google Scholar
Beerenwinkel N, Sullivant S: Markov models for accumulating mutations. Biometrika. 2009, 96 (3): 645-661. 10.1093/biomet/asp023.Google Scholar
Gerstung M, Baudis M, Moch H, Beerenwinkel N: Quantifying cancer progression with conjunctive Bayesian networks. Bioinformatics. 2009, 25 (21): 2809-2815. 10.1093/bioinformatics/btp505.PubMed CentralPubMedGoogle Scholar
Beerenwinkel N, Sing T, Lengauer T, Rahnenführer J, Roomp K, Savenkov I, Fischer R, Hoffmann D, Selbig J, Korn K, et al: Computational methods for the design of effective therapies against drug resistant HIV strains. Bioinformatics. 2005, 21 (21): 3943-3950. 10.1093/bioinformatics/bti654.PubMedGoogle Scholar
Beerenwinkel N, Lengauer T, Däumer M, Kaiser R, Walter H, Korn K, Hoffmann D, Selbig J: Methods for optimizing antiviral combination therapies. Bioinformatics. 2003, 19 (Suppl 1): i16-i25. 10.1093/bioinformatics/btg1001.PubMedGoogle Scholar
Altmann A, Däumer M, Beerenwinkel N, Peres Y, Schülter E, Büch J, Rhee S-Y, Sönnerborg A, Fessel WJ, Shafer RW, et al: Predicting the Response to Combination Antiretroviral Therapy: Retrospective Validation of geno2pheno-THEO on a Large Clinical Database. J Infect Dis. 2009, 199 (7): 999-1006. 10.1086/597305.PubMedGoogle Scholar
Altmann A, Beerenwinkel N, Sing T, Savenkov I, Däumer M, Kaiser R, Rhee S-Y, Fessel WJ, Shafer RW, Lengauer T: Improved prediction of response to antiretroviral combination therapy using the genetic barrier to drug resistance. Antivir Ther. 2007, 12 (2): 169-178.PubMedGoogle Scholar
Richman DD, Margolis DM, Delaney M, Greene WC, Hazuda D, Pomerantz RJ: The challenge of finding a cure for HIV infection. Science. 2009, 323 (5919): 1304-1307. 10.1126/science.1165706.PubMedGoogle Scholar
Johnston MI, Fauci AS: An HIV vaccine--evolving concepts. N Engl J Med. 2007, 356 (20): 2073-2081. 10.1056/NEJMra066267.PubMedGoogle Scholar
Rosenberg ES, Altfeld M, Poon SH, Phillips MN, Wilkes BM, Eldridge RL, Robbins GK, D'Aquila RT, Goulder PJ, Walker BD: Immune control of HIV-1 after early treatment of acute infection. Nature. 2000, 407 (6803): 523-526. 10.1038/35035103.PubMedGoogle Scholar
Rosenberg ES, Billingsley JM, Caliendo AM, Boswell SL, Sax PE, Kalams SA, Walker BD: Vigorous HIV-1-specific CD4+ T cell responses associated with control of viremia. Science. 1997, 278 (5342): 1447-1450. 10.1126/science.278.5342.1447.PubMedGoogle Scholar
Korber BT, Letvin NL, Haynes BF: T-cell vaccine strategies for human immunodeficiency virus, the virus with a thousand faces. J Virol. 2009, 83 (17): 8300-8314. 10.1128/JVI.00114-09.PubMed CentralPubMedGoogle Scholar
Srivastava IK, Ulmer JB, Barnett SW: Neutralizing antibody responses to HIV: role in protective immunity and challenges for vaccine design. Expert Rev Vaccines. 2004, 3 (4 Suppl): S33-52. 10.1586/14760584.3.4.S33.PubMedGoogle Scholar
Burton DR, Desrosiers RC, Doms RW, Koff WC, Kwong PD, Moore JP, Nabel GJ, Sodroski J, Wilson IA, Wyatt RT: HIV vaccine design and the neutralizing antibody problem. Nat Immunol. 2004, 5 (3): 233-236. 10.1038/ni0304-233.PubMedGoogle Scholar
Yuste E, Bixby J, Lifson J, Sato S, Johnson W, Desrosiers R: Glycosylation of gp41 of simian immunodeficiency virus shields epitopes that can be targets for neutralizing antibodies. J Virol. 2008, 82 (24): 12472-12486. 10.1128/JVI.01382-08.PubMed CentralPubMedGoogle Scholar
Nowak MA, May RM: Virus Dynamics. Mathematical Principles of Immunology and Virology. 2000, New York: Oxford University Press Inc.Google Scholar
Bonhoeffer S, May RM, Shaw GM, Nowak MA: Virus dynamics and drug therapy. Proc Natl Acad Sci U S A. 1997, 94 (13): 6971-6976. 10.1073/pnas.94.13.6971.PubMed CentralPubMedGoogle Scholar
Perelson AS, Neumann AU, Markowitz M, Leonard JM, Ho DD: HIV-1 dynamics in vivo: virion clearance rate, infected cell life-span, and viral generation time. Science. 1996, 271 (5255): 1582-1586. 10.1126/science.271.5255.1582.PubMedGoogle Scholar
Mohri H, Bonhoeffer S, Monard S, Perelson AS, Ho DD: Rapid turnover of T lymphocytes in SIV-infected rhesus macaques. Science. 1998, 279 (5354): 1223-1227. 10.1126/science.279.5354.1223.PubMedGoogle Scholar
Nowak MA, Bangham CR: Population dynamics of immune responses to persistent viruses. Science. 1996, 272 (5258): 74-79. 10.1126/science.272.5258.74.PubMedGoogle Scholar
Nowak MA, Bonhoeffer S, Hill AM, Boehme R, Thomas HC, McDade H: Viral dynamics in hepatitis B virus infection. Proc Natl Acad Sci U S A. 1996, 93 (9): 4398-4402. 10.1073/pnas.93.9.4398.PubMed CentralPubMedGoogle Scholar
Wei X, Ghosh SK, Taylor ME, Johnson VA, Emini EA, Deutsch P, Lifson JD, Bonhoeffer S, Nowak MA, Hahn BH, et al: Viral dynamics in human immunodeficiency virus type 1 infection. Nature. 1995, 373 (6510): 117-122. 10.1038/373117a0.PubMedGoogle Scholar
Anderson RM, May RM: Infectious diseases of humans. 1992, New York: Oxford University PressGoogle Scholar
Nowak MA, Lloyd AL, Vasquez GM, Wiltrout TA, Wahl LM, Bischofberger N, Williams J, Kinter A, Fauci AS, Hirsch VM, et al: Viral dynamics of primary viremia and antiretroviral therapy in simian immunodeficiency virus infection. J Virol. 1997, 71 (10): 7518-7525.PubMed CentralPubMedGoogle Scholar
Lifson JD, Nowak MA, Goldstein S, Rossio JL, Kinter A, Vasquez G, Wiltrout TA, Brown C, Schneider D, Wahl L, et al: The extent of early viral replication is a critical determinant of the natural history of simian immunodeficiency virus infection. J Virol. 1997, 71 (12): 9508-9514.PubMed CentralPubMedGoogle Scholar
Schooley RT: Correlation between viral load measurements and outcome in clinical trials of antiviral drugs. AIDS. 1995, 9 (Suppl 2): S15-S19.PubMedGoogle Scholar
Autran B, Murphy RL, Costagliola D, Tubiana R, Clotet B, Gatell J, Staszewski S, Wincker N, Assoumou L, El-Habib R, et al: Greater viral rebound and reduced time to resume antiretroviral therapy after therapeutic immunization with the ALVAC-HIV vaccine (vCP1452). AIDS. 2008, 22 (11): 1313-1322. 10.1097/QAD.0b013e3282fdce94.PubMedGoogle Scholar
Sodora DL, Allan JS, Apetrei C, Brenchley JM, Douek DC, Else JG, Estes JD, Hahn BH, Hirsch VM, Kaur A, et al: Toward an AIDS vaccine: lessons from natural simian immunodeficiency virus infections of African nonhuman primate hosts. Nat Med. 2009, 15 (8): 861-865. 10.1038/nm.2013.PubMed CentralPubMedGoogle Scholar
Fruth U: Considerations regarding efficacy endpoints in HIV vaccine trials: executive summary and recommendations of an expert consultation jointly organized by WHO, UNAIDS and ANRS in support of the Global HIV Vaccine Enterprise. Vaccine. 2009, 27 (14): 1989-1996. 10.1016/j.vaccine.2008.12.029.PubMedGoogle Scholar
Herrera M, Garcia-Arriaza J, Pariente N, Escarmis C, Domingo E: Molecular basis for a lack of correlation between viral fitness and cell killing capacity. PLoS Pathog. 2007, 3 (4): e53-10.1371/journal.ppat.0030053.PubMed CentralPubMedGoogle Scholar
Carrasco P, de la Iglesia F, Elena SF: Distribution of fitness and virulence effects caused by single-nucleotide substitutions in Tobacco Etch virus. J Virol. 2007, 81 (23): 12979-12984. 10.1128/JVI.00524-07.PubMed CentralPubMedGoogle Scholar
Salguero FJ, Sanchez-Martin MA, Diaz-San Segundo F, de Avila A, Sevilla N: Foot-and-mouth disease virus (FMDV) causes an acute disease that can be lethal for adult laboratory mice. Virology. 2005, 332 (1): 384-396. 10.1016/j.virol.2004.11.005.PubMedGoogle Scholar
Chao L: Fitness of RNA virus decreased by Muller's ratchet. Nature. 1990, 348 (6300): 454-455. 10.1038/348454a0.PubMedGoogle Scholar
Escarmis C, Davila M, Charpentier N, Bracho A, Moya A, Domingo E: Genetic lesions associated with Muller's ratchet in an RNA virus. J Mol Biol. 1996, 264 (2): 255-267. 10.1006/jmbi.1996.0639.PubMedGoogle Scholar
Coleman JR, Papamichail D, Skiena S, Futcher B, Wimmer E, Mueller S: Virus attenuation by genome-scale changes in codon pair bias. Science. 2008, 320 (5884): 1784-1787. 10.1126/science.1155761.PubMed CentralPubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.