PLoS ONE (Jan 2024)
A unified classification system for HIV-1 5' long terminal repeats.
Abstract
The HIV-1 provirus mainly consists of internal coding region flanked by 1 long terminal repeats (LTRs) at each terminus. The LTRs play important roles in HIV-1 reverse transcription, integration, and transcription. However, despite of the significant study advances of the internal coding regions of HIV-1 by using definite reference classification, there are no systematic and phylogenetic classifications for HIV-1 5' LTRs, which hinders our elaboration on 5' LTR and a better understanding of the viral origin, spread and therapy. Here, by analyzing all available resources of 5' LTR sequences in public databases following 4 recognized principles for the reference classification, 83 representatives and 14 consensus sequences were identified as representatives of 2 groups, 6 subtypes, 6 sub-subtypes, and 9 CRFs. To test the reliability of the supplemented classification system, the constructed references were applied to identify the 5' LTR assignment of the 22 clinical isolates in China. The results revealed that 16 out of 22 tested strains showed a consistent subtype classification with the previous LTR-independent classification system. However, 6 strains, for which recombination events within 5' LTR were demonstrated, unexpectedly showed a different subtype classification, leading a significant change of binding sites for important transcription factors including SP1, p53, and NF-κB. The binding change of these transcriptional factors would probably affect the transcriptional activity of 5' LTR. This study supplemented a unified classification system for HIV-1 5' LTRs, which will facilitate HIV-1 characterization and be helpful for both basic and clinical research fields.