QTL pipeline

Lopes, K.d.P., Snijders, G.J.L., Humphrey, J. et al. Genetic analysis of the human microglial transcriptome across brain regions, aging and disease pathologies. Nat Genet 54, 4–17 (2022). https://doi.org/10.1038/s41588-021-00976-y

Import QTL data

These data are merged GWAS-QTL colocalization results rather than raw eQTL summary statistics, but they contain all of the columns from the original eQTL summary stats that we need to perform eQTL fine-mapping.

coloc_res <- echodata::get_Kunkle2019_coloc(return_path = TRUE)

Prepare colmap

Prepare a column mapping object for the summary statistics. We’ll reuse this for both the import_topSNPs and finemap_loci steps.

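colmap <- echodata::construct_colmap(
  CHR = "chr",
  POS = "pos",
  N = "qtl.N",
  SNP = "snp",
  P = "qtl.pvalues",
  Effect = "qtl.beta",
  ## NOTE: if "qtl.varbeta" follows coloc's convention (variance of beta),
  ## it is the squared standard error, not the standard error itself.
  StdErr = "qtl.varbeta",
  MAF = "qtl.MAF",
  Locus = "Locus",
  Gene = "gene")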
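
To make the role of the column mapping concrete, here is a minimal base-R sketch on made-up data. It is not how echodata::construct_colmap is implemented; toy, toy_map, and the VarBeta name are hypothetical. It also assumes that qtl.varbeta follows the coloc convention of storing the variance of the effect estimate, in which case the standard error is its square root.

```r
# Toy eQTL summary statistics using column names from this vignette.
# All values are made up for illustration.
toy <- data.frame(
  snp         = c("rs1", "rs2", "rs3"),
  chr         = c(1, 1, 2),
  pos         = c(100, 250, 300),
  qtl.beta    = c(0.20, -0.10, 0.05),
  qtl.varbeta = c(0.0025, 0.0100, 0.0004),
  stringsAsFactors = FALSE
)

# Hypothetical mapping: standard name -> column name in the input data.
toy_map <- c(SNP = "snp", CHR = "chr", POS = "pos",
             Effect = "qtl.beta", VarBeta = "qtl.varbeta")

# Rename each mapped column to its standard name.
idx <- match(toy_map, names(toy))
names(toy)[idx] <- names(toy_map)

# Assumption: VarBeta is a variance, so the standard error is its square root.
toy$StdErr <- sqrt(toy$VarBeta)
toy
```

If the variance assumption holds for your data, a derived column like StdErr above is what a standard-error field would normally contain.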

Prepare top_SNPs data.frame

  • We don’t have a top SNPs file ready here, so we’ll make one directly from the full summary stats file (NOTE: this approach only works if the entire file fits in memory).
  • For QTL data, make sure to set grouping_vars=c("Locus","Gene") so that you get a top SNP for each eGene-locus pair (not just one SNP per locus).
topSNPs <- echodata::import_topSNPs(
  topSS = coloc_res$path,
  colmap = colmap,
  ## Important for QTLs: group by both Locus and Gene
  grouping_vars = c("Locus","Gene"))
## Returning results as nested list.

## All loci done in: 0.27 min
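
The effect of grouping by both Locus and Gene can be illustrated with a small base-R sketch on made-up data. This is only an illustration of the idea of selecting the lowest p-value per group, not the implementation of echodata::import_topSNPs; toy_ss and pick_top are hypothetical names.

```r
# Toy summary statistics: two loci, with two eGenes tested at locus L1.
# All values are made up for illustration.
toy_ss <- data.frame(
  Locus = c("L1", "L1", "L1", "L1", "L2", "L2"),
  Gene  = c("G1", "G1", "G2", "G2", "G1", "G1"),
  SNP   = c("rs1", "rs2", "rs3", "rs4", "rs5", "rs6"),
  P     = c(1e-4, 1e-8, 0.03, 0.001, 0.5, 0.02),
  stringsAsFactors = FALSE
)

# Hypothetical helper: sort by p-value, then keep the first
# (most significant) row seen for each combination of grouping_vars.
pick_top <- function(dat, grouping_vars) {
  dat <- dat[order(dat$P), ]
  dat[!duplicated(dat[, grouping_vars, drop = FALSE]), ]
}

# One SNP per locus: the G2 signal at L1 is lost.
top_by_locus <- pick_top(toy_ss, "Locus")

# One SNP per eGene-locus pair: L1 keeps a top SNP for both G1 and G2.
top_by_pair <- pick_top(toy_ss, c("Locus", "Gene"))
```

In this toy example, top_by_locus has two rows while top_by_pair has three, which is why the grouping matters when several genes are tested at the same locus.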

Session info

## R Under development (unstable) (2023-01-11 r83598)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 22.04.1 LTS
## Matrix products: default
## BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3 
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.20.so;  LAPACK version 3.10.0
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## time zone: UTC
## tzcode source: system (glibc)
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## other attached packages:
## [1] snpStats_1.49.0   Matrix_1.5-3      survival_3.5-0    echolocatoR_2.0.3
## [5] BiocStyle_2.27.0 
## loaded via a namespace (and not attached):
##   [1] ProtGenerics_1.31.0         fs_1.5.2                   
##   [3] matrixStats_0.63.0          bitops_1.0-7               
##   [5] EnsDb.Hsapiens.v75_2.99.0   httr_1.4.4                 
##   [7] RColorBrewer_1.1-3          Rgraphviz_2.43.0           
##   [9] tools_4.3.0                 backports_1.4.1            
##  [11] utf8_1.2.2                  R6_2.5.1                   
##  [13] DT_0.27                     lazyeval_0.2.2             
##  [15] withr_2.5.0                 prettyunits_1.1.1          
##  [17] GGally_2.1.2                gridExtra_2.3              
##  [19] cli_3.6.0                   Biobase_2.59.0             
##  [21] textshaping_0.3.6           labeling_0.4.2             
##  [23] ggbio_1.47.0                sass_0.4.4                 
##  [25] mvtnorm_1.1-3               readr_2.1.3                
##  [27] proxy_0.4-27                pkgdown_2.0.7              
##  [29] mixsqp_0.3-48               Rsamtools_2.15.1           
##  [31] systemfonts_1.0.4           foreign_0.8-84             
##  [33] R.utils_2.12.2              dichromat_2.0-0.1          
##  [35] maps_3.4.1                  BSgenome_1.67.3            
##  [37] readxl_1.4.1                susieR_0.12.27             
##  [39] pals_1.7                    rstudioapi_0.14            
##  [41] RSQLite_2.2.20              httpcode_0.3.0             
##  [43] generics_0.1.3              BiocIO_1.9.1               
##  [45] echoconda_0.99.9            dplyr_1.0.10               
##  [47] zip_2.2.2                   interp_1.1-3               
##  [49] fansi_1.0.3                 DescTools_0.99.47          
##  [51] S4Vectors_0.37.3            catalogueR_1.0.1           
##  [53] R.methodsS3_1.8.2           lifecycle_1.0.3            
##  [55] yaml_2.3.6                  SummarizedExperiment_1.29.1
##  [57] BiocFileCache_2.7.1         echoplot_0.99.6            
##  [59] grid_4.3.0                  blob_1.2.3                 
##  [61] crayon_1.5.2                dir.expiry_1.7.0           
##  [63] lattice_0.20-45             GenomicFeatures_1.51.2     
##  [65] mapproj_1.2.11              KEGGREST_1.39.0            
##  [67] MungeSumstats_1.7.15        pillar_1.8.1               
##  [69] knitr_1.41                  GenomicRanges_1.51.4       
##  [71] rjson_0.2.21                osfr_0.2.9                 
##  [73] boot_1.3-28.1               gld_2.6.6                  
##  [75] codetools_0.2-18            glue_1.6.2                 
##  [77] data.table_1.14.6           coloc_5.1.0.1              
##  [79] vctrs_0.5.1                 png_0.1-8                  
##  [81] XGR_1.1.8                   cellranger_1.1.0           
##  [83] gtable_0.3.1                assertthat_0.2.1           
##  [85] cachem_1.0.6                dnet_1.1.7                 
##  [87] xfun_0.36                   openxlsx_4.2.5.1           
##  [89] Rfast_2.0.6                 gargle_1.2.1               
##  [91] ellipsis_0.3.2              nlme_3.1-161               
##  [93] bit64_4.0.5                 progress_1.2.2             
##  [95] filelock_1.0.2              googleAuthR_2.0.0          
##  [97] GenomeInfoDb_1.35.12        rprojroot_2.0.3            
##  [99] bslib_0.4.2                 irlba_2.3.5.1              
## [101] rpart_4.1.19                colorspace_2.0-3           
## [103] BiocGenerics_0.45.0         DBI_1.1.3                  
## [105] Hmisc_4.7-2                 nnet_7.3-18                
## [107] Exact_3.2                   tidyselect_1.2.0           
## [109] bit_4.0.5                   compiler_4.3.0             
## [111] curl_5.0.0                  graph_1.77.1               
## [113] htmlTable_2.4.1             expm_0.999-7               
## [115] basilisk.utils_1.11.1       xml2_1.3.3                 
## [117] desc_1.4.2                  DelayedArray_0.25.0        
## [119] bookdown_0.32               rtracklayer_1.59.1         
## [121] checkmate_2.1.0             scales_1.2.1               
## [123] hexbin_1.28.2               echoLD_0.99.9              
## [125] RBGL_1.75.0                 RCircos_1.2.2              
## [127] rappdirs_0.3.3              stringr_1.5.0              
## [129] supraHex_1.37.0             digest_0.6.31              
## [131] piggyback_0.1.4             rmarkdown_2.20             
## [133] basilisk_1.11.2             XVector_0.39.0             
## [135] htmltools_0.5.4             pkgconfig_2.0.3            
## [137] jpeg_0.1-10                 base64enc_0.1-3            
## [139] MatrixGenerics_1.11.0       echodata_0.99.16           
## [141] highr_0.10                  ensembldb_2.23.1           
## [143] dbplyr_2.3.0                fastmap_1.1.0              
## [145] rlang_1.0.6                 htmlwidgets_1.6.1          
## [147] farver_2.1.1                echofinemap_0.99.5         
## [149] jquerylib_0.1.4             jsonlite_1.8.4             
## [151] BiocParallel_1.33.9         R.oo_1.25.0                
## [153] VariantAnnotation_1.45.0    RCurl_1.98-1.9             
## [155] magrittr_2.0.3              Formula_1.2-4              
## [157] GenomeInfoDbData_1.2.9      ggnetwork_0.5.10           
## [159] patchwork_1.1.2             munsell_0.5.0              
## [161] Rcpp_1.0.9                  ggnewscale_0.4.8           
## [163] ape_5.6-2                   viridis_0.6.2              
## [165] reticulate_1.27             RcppZiggurat_0.1.6         
## [167] stringi_1.7.12              rootSolve_1.8.2.3          
## [169] zlibbioc_1.45.0             MASS_7.3-58.1              
## [171] plyr_1.8.8                  parallel_4.3.0             
## [173] ggrepel_0.9.2               lmom_2.9                   
## [175] deldir_1.0-6                echoannot_0.99.10          
## [177] Biostrings_2.67.0           splines_4.3.0              
## [179] hms_1.1.2                   igraph_1.3.5               
## [181] reshape2_1.4.4              biomaRt_2.55.0             
## [183] stats4_4.3.0                crul_1.3                   
## [185] XML_3.99-0.13               evaluate_0.20              
## [187] latticeExtra_0.6-30         biovizBase_1.47.0          
## [189] BiocManager_1.30.19         tzdb_0.3.0                 
## [191] tidyr_1.2.1                 purrr_1.0.1                
## [193] reshape_0.8.9               ggplot2_3.4.0              
## [195] echotabix_0.99.9            restfulr_0.0.15            
## [197] AnnotationFilter_1.23.0     e1071_1.7-12               
## [199] downloadR_0.99.6            viridisLite_0.4.1          
## [201] class_7.3-20.1              ragg_1.2.5                 
## [203] OrganismDbi_1.41.0          tibble_3.1.8               
## [205] memoise_2.0.1               AnnotationDbi_1.61.0       
## [207] GenomicAlignments_1.35.0    IRanges_2.33.0             
## [209] cluster_2.1.4