-
Hi all, I have PLINK dosage files (e.g., CHT1_2_chr1.plink.dosage.gz), one for each chromosome (1 to 22), containing genetic data for two cohorts, CHT_1 and CHT_2. In the same directory, there are corresponding .fam and .map files for each chromosome. Initially, I provided filepaths to the chromosomes with data of the two cohorts in the "path_prefix" column of the samplesheet. Then, on running pgsc_calc, I got an error message saying that 2 sample sets had been detected. So, I wrote a script to filter out samples from CHT in which I am interested. These samples have "IID", the 1st column of the .fam file, containing the string "CHT1". Here is the code: ---
---Then I run the pgsc_calc as follows, with the "path_prefix" in the sample sheet pointing to the directory containing the plink2 fileset (pgen, pvar, psam) without the extensions: ---
---Then, I get the following message: "Invalid format: plink" I also keep getting this "WARNING: Could not load nf-core/config profiles: https://raw.githubusercontent.com/nf-core/configs/master/nfcore_custom.config" (I got in the first instance too. Could it be because I am running the pipeline in offline mode?) System Information Version: 24.10.2 build 5932 PBS executor |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
@mathewrm, if you're supplying |
Beta Was this translation helpful? Give feedback.
@mathewrm, if you're supplying
.pgen
the format in the sample sheet needs to bepfile
(see docs).