From 5025818dd4bf1a63eb121a60cf7c25de3d2b0393 Mon Sep 17 00:00:00 2001 From: Astrid Deschenes Date: Mon, 21 Oct 2024 20:54:03 -0400 Subject: [PATCH] Update vignette --- vignettes/RAIDS.Rmd | 18 +++++++++++------- 1 file changed, 11 insertions(+), 7 deletions(-) diff --git a/vignettes/RAIDS.Rmd b/vignettes/RAIDS.Rmd index 5dd83da1..2501ef03 100644 --- a/vignettes/RAIDS.Rmd +++ b/vignettes/RAIDS.Rmd @@ -116,7 +116,7 @@ The main steps are: **Step 1.** Set-up and provide population reference files -**Step 2** Sample the reference data for donor genotypes, to be used for synthesis and optimize ancestry inference parameters +**Step 2** Sample the reference data for donor genotypes to be used for synthesis and optimize ancestry inference parameters **Step 3** Infer ancestry for the subjects of the external study @@ -238,6 +238,9 @@ file.copy(fileAnnotGDS, file.path(pathWorkingDirectoryData, "refGDS")) ### 2.1 Set-up required directories +All required directories need to be created. In addition, the path to +the reference files are kept in variables that will be used later. + ```{r installRaids, echo=TRUE, eval=TRUE, collapse=TRUE, warning=FALSE, message=FALSE} ############################################################################# @@ -295,23 +298,21 @@ dataRef <- select1KGPopForSynthetic(fileReferenceGDS=refGenotype, ``` -The output object is going to be used later at the ancestry inference step. +The output object is going to be used later.
### 2.3 Perform the ancestry inference -Within a single function call, data synthesis is performed, the synthetic +Ancestry inference can be done in one function call. Within a single function +call, data synthesis is performed, the synthetic data are used to optimize the inference parameters and, with these, the ancestry of the input profile donor is inferred. According to the type of input data (RNA or DNA), a specific function -is available. The *inferAncestry()* function is used for DNA profiles while +should be called. The *inferAncestry()* function is used for DNA profiles while the *inferAncestryGeneAware()* function is RNA specific. -In this example, the profile is from DNA source and requires the use of the -*inferAncestry()* function. - The *inferAncestry()* function requires a specific profile input format. The format is set by the *genoSource* parameter. @@ -337,6 +338,9 @@ Beware that the starting position in the **population reference GDS file** is zero (like BED files). The generic SNP file should also start at position zero. +In this example, the profile is from DNA source and requires the use of the +*inferAncestry()* function. + ```{r infere, echo=TRUE, eval=TRUE, collapse=TRUE, warning=FALSE, message=FALSE}