Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea; School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
Dohoon Lee
Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea; BK21 FOUR Intelligence Computing, Seoul National University, Seoul, Republic of Korea
Jiseok Jeong
Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea; School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
Sungwon Lee
Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea; School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
Sun Kim
Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea; School of Biological Sciences, Seoul National University, Seoul, Republic of Korea; SNU Institute for Virus Research, Seoul National University, Seoul, Republic of Korea
Although HIV-1 integration sites favor active transcription units in the human genome, high-resolution analysis of individual HIV-1 integration sites has shown that the virus can integrate into a variety of host genomic locations, including non-genic regions. The invisible infection by HIV-1 integrating into non-genic regions, challenging the traditional understanding of HIV-1 integration site selection, is more problematic because they are selected for preservation in the host genome during prolonged antiretroviral therapies. Here, we showed that HIV-1 integrates its viral genome into the vicinity of R-loops, a genomic structure composed of DNA-RNA hybrids. VSV-G-pseudotyped HIV-1 infection initiates the formation of R-loops in both genic and non-genic regions of the host genome and preferentially integrates into R-loop-rich regions. Using a HeLa cell model that can independently control transcriptional activity and R-loop formation, we demonstrated that the exogenous formation of R-loops directs HIV-1 integration-targeting sites. We also found that HIV-1 integrase proteins physically bind to the host genomic R-loops. These findings provide novel insights into the mechanisms underlying retroviral integration and the new strategies for antiretroviral therapy against HIV-1 latent infection.