Page 33 - GTM-2-4
P. 33

Global Translational Medicine                                      The research advances in HPV integration



            pipeline with a pair-assembling strategy, integration   capacity to reveal more extensive changes, such as changes
            sites are identified through data analysis. This approach   in gene copy number resulting from the integration of
            has proven successful in several studies  for pinpointing   the HPV genome into the host genome. The development
            meaningful integration sites . Another strategy, tagging,   of long-read single-molecule sequencing technologies
                                  [65]
            enrichment, and NGS of HPV16 (TEN16), relies on targeted   by Pacific Biosciences (PacBio) and Oxford Nanopore
            next-generation DNA sequencing . The process involves   Technologies (ONT) has introduced new possibilities for
                                       [66]
            Nextera in vitro transposition for DNA fragmentation and   the application of direct nucleic acid enrichment tools.
            adapter-tagging, followed by multiplex PCR with HPV16-  Long-read sequencing enables the elucidation of complex
            specific  primers for  HPV16  DNA enrichment. This   gene rearrangements, and Nanopore analysis requires
            approach successfully identified 75 HPV16 integration   smaller amounts of sequencing data compared to Illumina
            sites  in  a  mixed  tumor  sample  pool.  RNA-sequencing   methods while maintaining similar accuracy [73,74] .
            stands out as another method of choice for detecting   One study successfully identified four new candidate
            viral-host fusion transcripts. Koneva et al.  successfully   target genes for CC by integrating multi-omics data
                                              [67]
            identified 320 integration breakpoints distributed across   from The Cancer Genome Atlas with patient-matched
            the human and HPV genomes using RNA-seq data from   long-read sequencing of HPV integration sites. The
            84 HNSCC tumors. In addition, whole-exome sequencing   identified genes are BNC1, RSBN1, USP36, and TAOK3.
            and high-throughput chromosome conformation capture   The study employed the HPV-targeted PacBio long-
            sequencing have been adapted for the identification of   read sequencing method and validated the potential
            HPV integration sites [56,65,68] .
                                                               involvement of these genes in tumorigenesis by assessing
              Parsing the NGS data is crucial for the identification of   their expression levels. It is essential to note that the study
            integration sites, and various novel bioinformatic pipelines   had a limited sample size, including samples from only
            have been developed to process the data to obtain the   eight women, and as such, its findings may not be broadly
            necessary information. ViFi, a computational approach in   representative .
                                                                          [75]
            2018, utilizes phylogenetic methods combined with reference-  Pooled  CRISPR  inverse  PCR  sequencing  (PCIP-seq)
            based read mapping for calling integration sites . ViFi   is a method employing a pool of CRISPR guide RNAs
                                                    [69]
            achieves not only high precision and sensitivity in identifying   for selective cleavage of circular DNA fragments carrying
            fusion reads but also detects circular extrachromosomal   viral DNA. Subsequently, reverse long-fragment PCR and
            DNA (ecDNA) human-viral fusion structures. SearcHPV is
            another novel pipeline based on targeted capture technology,   multiplex sequencing are performed using the MinION
            specifically designed for the analysis of targeted capture DNA   platform. The DNA is then sheared into approximately 8 kb
            sequencing data. Notably, SearcHPV demonstrates higher   fragments, followed by intramolecular ligation and removal
            sensitivity and specificity for detecting HPV-host integration   of remaining linear DNA through digestion. To selectively
            sites compared to VirusFinder and VirusSeq, two integration   linearize circular DNA containing viral sequences, specific
            callers developed earlier. It is important to acknowledge that   regions of the HPV18 genome are targeted for selective
            a significant limitation of SearcHPV is its validation on only   CRISPR-mediated cleavage, and PCR amplification is
            three samples . In addition  to SearcHPV, deep-learning   performed before sequencing. The identified insertion
                       [70]
            models such as DeepHPV have been designed for predicting   sites support the hypothesis that rearrangements occur
            HPV integration sites . A  comprehensive database is   through a “looping” mechanism, as the two sets of breaks
                              [71]
                                                                                     [76]
            essential for systematic searches, comparisons, and analyses   share a common breakpoint .
            of viral integration sites. The Viral Integration Site Database   Xdrop technology is a novel microfluidics system
            was developed as a user-friendly web tool for browsing,   designed for targeted enrichment in low-input and long-
            searching, analyzing, visualizing, and downloading data .   fragment DNA. The technology relies on the isolation
                                                        [72]
            The  utilization  of  well-organized,  comprehensive,  and   of long DNA fragments within millions of droplets,
            continuously updated databases, along with the availability   wherein the droplets containing a fluorescently labeled
            of various evolving intelligent analysis pipelines, has   target sequence of interest are sorted using fluorescence-
            significantly alleviated the burden of analyzing large amounts   activated cell sorting. The sorted DNA fragments
            of sequencing data in advanced biomedical research.  undergo amplification through multiple displacement
                                                               amplification (MDA) and are subsequently subjected to
            4.3. Third-generation sequencing-based techniques  sequencing. This approach successfully resolved three
            Short reads are suitable for investigating specific   HPV18-chr8 integration sites at base-pair resolution .
                                                                                                           [77]
            structural variations at insertion sites, such as deletions   Table 1 summarizes the studies conducted on the
            or chromosomal translocations. However, they lack the   identification of HPV integration sites for various types of


            Volume 2 Issue 4 (2023)                         7                        https://doi.org/10.36922/gtm.2034
   28   29   30   31   32   33   34   35   36   37   38