tutorials:population-diversity:snp-chips
| | | ||
| tutorials:population-diversity:snp-chips [2020/09/18 08:34] – [Data analysis workflow with Plink 1.9] bngina | tutorials:population-diversity:snp-chips [2020/09/22 10:21] (current) – [Data analysis workflow with Plink 1.9] bngina | ||
|---|---|---|---|
| Line 153: | Line 153: | ||
| | | ||
| + | </ | ||
| + | The command above creates three files in the specified output directory '' | ||
| + | *// | ||
| + | *// | ||
| + | | ||
| + | |||
| + | Now we use the created binary files, indicated to plink using '' | ||
| + | |||
| + | -Look at the individuals with missing data and at the SNPs that were not typed in all individuals | ||
| + | |||
| + | < | ||
| + | |||
| + | |||
| + | |||
| + | ######### summary statistics ######## | ||
| + | |||
| + | # | ||
| + | |||
| + | plink --bfile ${out}/ | ||
| + | --out ${out}/ | ||
| + | | ||
| </ | </ | ||
| + | This creates two files. | ||
| + | *// | ||
| + | *// | ||
| + | The missing-data information found in the '' | ||
| + | < | ||
| + | FID IID MISS_PHENO | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | 10 | ||
| + | </ | ||
| + | |||
| + | The information in each column is as follows: | ||
| + | < | ||
| + | FID Family ID | ||
| + | IID Individual ID | ||
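| + | MISS_PHENO Phenotype missing? (Y/N) | ||
| + | N_MISS Number of missing genotype calls for this individual | ||
| + | N_GENO Number of potentially valid genotype calls for this individual | ||
| + | F_MISS Missing call rate (N_MISS / N_GENO) | ||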
| + | </ | ||
| + | |||
| + | The information found in the '' | ||
| + | < | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | | ||
| + | </ | ||
| + | |||
| + | The information in each column is as follows: | ||
| + | < | ||
| + | SNP SNP identifier | ||
| + | CHR Chromosome number | ||
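| + | N_MISS Number of missing genotype calls for this SNP | ||
| + | N_GENO Number of potentially valid genotype calls for this SNP | ||
| + | F_MISS Missing call rate (N_MISS / N_GENO) | ||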
| + | </ | ||
| + | |||
| + | We can generate a file with filters applied for the rate of missing data in individuals '' | ||
| + | |||
| + | The thresholds for these filters should be adjusted to suit each data set. | ||
| + | |||
| + | < | ||
| + | |||
| + | #### filter data ### | ||
| + | # --maf 0.01 removes SNPs with a minor allele frequency below 1% | ||
| + | |
| + | plink --file ${file} \ | ||
| + | |
| + | --maf 0.01 \ | ||
| + | | ||
| + | --out ${out}/ | ||
| + | | ||
| + | |||
| + | </ | ||
| ===== Data analysis workflow with R and adegenet ===== | ===== Data analysis workflow with R and adegenet ===== | ||
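
Before choosing a threshold for the per-individual missingness filter, it can help to see which samples it would affect. The sketch below is not part of the original workflow. It assumes the ''.imiss'' report from ''--missing'' has the column order listed above, with ''F_MISS'' as the sixth column. The function name ''high_missing'' and the file name in the usage line are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: list individuals whose missing call rate (F_MISS, assumed to be
# column 6 of a plink .imiss report) is above a chosen threshold.
# Usage: high_missing <imiss-file> <threshold>
high_missing() {
    local imiss="$1" threshold="$2"
    # NR > 1 skips the header row. awk's default field splitting ignores
    # the leading and repeated spaces that plink uses to pad its columns.
    # Adding 0 forces a numeric (not string) comparison.
    awk -v t="$threshold" 'NR > 1 && $6 + 0 > t + 0 { print $1, $2, $6 }' "$imiss"
}

# Example call (hypothetical file name; substitute your own --out prefix):
# high_missing "${out}/missing_report.imiss" 0.1
```

Each output line holds the family ID, individual ID, and missing call rate of one sample above the threshold. The same approach should work for the ''.lmiss'' report if the column number is changed to the position of ''F_MISS'' in that file.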
tutorials/population-diversity/snp-chips.1600418040.txt.gz · Last modified: by bngina
