长读长测序方法学

面向 PacBio 和 ONT 长读长数据的算法,用于解析转录本异构体、结构变异和调控扰动。

我们发展面向 long-read RNA-seq and DNA-seq 的计算方法,用于解析短读长技术难以捕获的复杂转录本异构体、融合转录本、结构变异和调控扰动。这些方法覆盖 bulk 和单细胞数据模式,同时关注基于参考基因组和基于组装的基因组表示。

课题组相关贡献包括 CTAT-LR-Fusion,用于在 bulk 或单细胞分辨率下准确识别融合转录本 (Qin et al., 2025);结合 pangenome 和 de novo personal genome assembly 的体细胞结构变异检测方法 (Qin et al., 2025);以及将 diploid donor-specific assemblies (DSAs) 作为癌症参考基因组,恢复 GRCh38 或 CHM13 上遗漏的 20% 以上体细胞 SV (Zhang et al., 2025)。相关工作还包括 colorSV (Le et al., 2025)、LongcallD (Gao et al., 2026) 和 FuFiHLA (Hu et al., 2026)。这些工具共同提升了对短读长流程常常遗漏的临床相关变异的检测灵敏度和精确度。

参考文献

2026

  1. bioRxiv
    LongcallD: joint calling and phasing of small, structural and mosaic variants from long reads
    Yan Gao, Wen-Wei Liao, Qian Qin, and 2 more authors
    bioRxiv, Mar 2026
  2. Bioinformatics
    FuFiHLA: A tool for full-field HLA typing from long-read data
    Jingqing Hu, Qian Qin, Heng Li, and 1 more author
    Bioinformatics, May 2026

2025

  1. Genome Res
    Accurate fusion transcript identification from long- and short-read isoform sequencing at bulk or single-cell resolution
    Qian Qin, Victoria Popic, Kirsty Wienand, and 11 more authors
    Genome Research, Apr 2025
  2. bioRxiv
    Improving long-read somatic structural variant calling with pangenome and de novo personal genome assembly
    Qian Qin, Jakob Heinz, and Heng Li
    bioRxiv, Oct 2025
  3. bioRxiv
    Diploid donor-specific assembly enhances somatic structural variant detection in cancer genomes
    Yuwei Zhang, Han Qu, Qian Qin, and 2 more authors
    bioRxiv, Oct 2025
  4. GPB
    colorSV: Long-range Somatic Structural Variation Calling from Matched Tumor-normal Co-assembly Graphs
    M K Le, Qian Qin, and Heng Li
    Genomics, Proteomics & Bioinformatics, Sep 2025