Difference between revisions of "Advgenemap2021"

From Statistical Genetics Courses

(Handouts)
(Shamil Sunyaev)
 
(62 intermediate revisions by the same user not shown)
Line 7: Line 7:
  
 
==General Information==
 
==General Information==
An Advanced Gene Mapping course will be held in online from Monday through Friday, January 27-31, 2020. The cost of the 5-day course is $100 for student, academic, and government researchers and $1,500 for researchers working in industry. This fee covers tuition and course-related expenses (cloud computing, etc.).
+
An Advanced Gene Mapping course will be held in online from Monday through Friday, January 25-29, 2021. The cost of the 5-day course is $100 for student, academic, and government researchers and $1,500 for researchers working in industry. This fee covers tuition and course-related expenses (cloud computing, etc.).
  
 
The course emphasis is on analyzing sequence and other omics data to elucidate the genetic etiology of complex human disease traits. Topics will include: data quality control of sequence and other types of data; single variant and aggregate rare variant association analysis of whole-genome data (genotype, sequence, and imputed) for qualitative and quantitative traits (population and family-based data); controlling for population admixture and substructure; generalized linear mix models and linear mixed models; meta-analysis; sample size estimation and power calculations; detecting gene x gene and gene x environmental interactions; analysis of epigenomic data, e.g methylation, and chromatin; heritability estimation using variant and RNA-Seq data; analysis of RNA-Seq data; eQTL mapping; elucidating pleiotropy; functional prediction and variant annotation; estimation of polygenic risk scores; Mendelian randomization; mediation analysis; and fine mapping.''' '''As mandated by the NIH there will also be a special session on responsible conduct of research that will include sessions on conflict of interest, research ethics, data management (security), and ethical use of human research subjects.
 
The course emphasis is on analyzing sequence and other omics data to elucidate the genetic etiology of complex human disease traits. Topics will include: data quality control of sequence and other types of data; single variant and aggregate rare variant association analysis of whole-genome data (genotype, sequence, and imputed) for qualitative and quantitative traits (population and family-based data); controlling for population admixture and substructure; generalized linear mix models and linear mixed models; meta-analysis; sample size estimation and power calculations; detecting gene x gene and gene x environmental interactions; analysis of epigenomic data, e.g methylation, and chromatin; heritability estimation using variant and RNA-Seq data; analysis of RNA-Seq data; eQTL mapping; elucidating pleiotropy; functional prediction and variant annotation; estimation of polygenic risk scores; Mendelian randomization; mediation analysis; and fine mapping.''' '''As mandated by the NIH there will also be a special session on responsible conduct of research that will include sessions on conflict of interest, research ethics, data management (security), and ethical use of human research subjects.
Line 14: Line 14:
  
 
==Course Instructors==
 
==Course Instructors==
The instructors for the course are: Heather Cordell (University of Newcastle), Nancy Cox (Vanderbilt University), Andrew DeWan (Yale University), Suzanne Leal (The Rockefeller University & Columbia University),  [http://www.fas.harvard.edu/~biophys/Shamil_Sunyaev.htm Shamil Sunyaev] (Harvard University) & Gao Wang (Columbia University)
+
The instructors for the course are Heather Cordell (University of Newcastle), Nancy Cox (Vanderbilt University), Andrew DeWan (Yale University), Suzanne Leal (The Rockefeller University & Columbia University), [http://www.fas.harvard.edu/~biophys/Shamil_Sunyaev.htm Shamil Sunyaev] (Harvard University) & Gao Wang (Columbia University). Judy Matuk (HRP Consulting Group) will lecture on ethics and the regulation of human subject research. A special guest lecture will be given by Jurg Ott (Rockefeller University).
  
==More Information==
+
==Additional Information==
The maximum number of participants for this online course is 25.  
+
The maximum number of participants for this online course is 34.  
  
 
Knowledge genetic association analysis, genetic epidemiology and/or statistical genetics are screening criteria for the selection of participants.  Please describe your experience in detail in your application. It is helpful if you also enclose a copy of your CV. We may contact you personally to discuss your application. Although experience of using LINUX is not necessary it is highly beneficial to have basic knowledge of this operating system before the start of the course.
 
Knowledge genetic association analysis, genetic epidemiology and/or statistical genetics are screening criteria for the selection of participants.  Please describe your experience in detail in your application. It is helpful if you also enclose a copy of your CV. We may contact you personally to discuss your application. Although experience of using LINUX is not necessary it is highly beneficial to have basic knowledge of this operating system before the start of the course.
Line 23: Line 23:
 
For additional information, please contact [mailto:montagk@rockefeller.edu?subject=Rockefeller Advanced Gene Mapping Course 2021 Katherine Montague]: <br />email: [mailto:montagk@rockefeller.edu montagk@rockefeller.edu] <br /><br /> 
 
For additional information, please contact [mailto:montagk@rockefeller.edu?subject=Rockefeller Advanced Gene Mapping Course 2021 Katherine Montague]: <br />email: [mailto:montagk@rockefeller.edu montagk@rockefeller.edu] <br /><br /> 
  
For additional information on the scientific program contact the course organizer [mailto:suzannemleal@gmail.com?subject=Rockefeller Advanced Gene Mapping Course 2021 Suzanne Leal]: <br />email: [mailto:suzannemleal@gmail.com suzannemleal@gmail.com] or [mailto:sml3@cumc.columbia.edu sml3@cumc.columbia.edu] 
+
For additional information on the scientific program contact the course organizer [mailto:suzannemleal@gmail.com?subject=Rockefeller Advanced Gene Mapping Course 2021 Suzanne Leal]: <br />email: [mailto:suzannemleal@gmail.com suzannemleal@gmail.com] or [mailto:sml3@cumc.columbia.edu sml3@cumc.columbia.edu]
  
==Course Photo - 2020==
 
[[File:142FBA93-B26F-45A4-A0C5-264B812AB558.JPG|thumb|none|300px]]
 
  
==Handouts - 2020==
+
Applications are no longer being accepted.
====[//www.statgen.us/files/2020/01/docs/lectures/Lecture_booklet.pdf All Lectures]====
+
 
====[//www.statgen.us/files/2020/01/docs/exercises/Booklet_exercises.pdf All Exercises]====
+
 
 +
[http://statgen.us/files/2021/02/adv_gene_mapping_schedule_2021.pdf Click here for course schedule]
 +
 
 +
[http://statgen.us/files/2021/02/Application_Advanced_Gene_Mapping_Course_2021_New_York.html Click here for the application form]
 +
 
 +
[http://statgen.us/files/2021/02/Rockefeller_advanced_course_flyer_2021.pdf Click here for course flyer] (please post and distribute)
 +
 
 +
===[https://github.com/statgenetics/statgen-courses/wiki/How-to-launch-course-tutorials How to run the exercises]===
 +
==Handouts==
 
===Heather Cordell===
 
===Heather Cordell===
 +
[http://www.statgen.us/files/2021/02/bios/about_HJC.pdf About Heather Cordell]
 +
 
====Lectures====
 
====Lectures====
* [//www.statgen.us/files/2020/01/docs/lectures/NY2020-lecture1-nopause_4.pdf Genome-wide association studies (GWAS) - Part 1]
+
* [http://statgen.us/files/2021/02/lectures/NY2021-lecture1-nopause.pdf Genome-wide association studies - Part 1]
* [//www.statgen.us/files/2020/01/docs/lectures/NY2020-lecture2-nopause_4.pdf Genome-wide association studies (GWAS) - Part 2]
+
* [http://statgen.us/files/2021/02/lectures/NY2021-lecture2-nopause.pdf Genome-wide association studies - Part 2]
  
 
====Exercises====
 
====Exercises====
* [[Tutorials#FastLMM | Family-based Association using FaST-LMM, PLINK and R]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/fastlmm-gcta.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/fastlmm-gcta/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/FASTLMM-NY2020.pdf Family-based Association using FaST-LMM, PLINK and R]
* [[Tutorials#GCTA|GCTA]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/fastlmm-gcta.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/fastlmm-gcta/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/GCTA-NY2021.pdf GCTA]
* [[Tutorials#Epistasis_.28PLINK_and_CASSI.29 | Interaction analysis using PLINK and CASSI]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/epistasis.dockerfile dockerfile] and [https://hub.docker.com/r/statisticalgenetics/epistasis/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/Epistasis-NY2020.pdf Interaction analysis using PLINK and CASSI]
  
 
===Nancy Cox===
 
===Nancy Cox===
 +
[http://statgen.us/files/2021/02/bios/about_njc.pdf about Nancy Cox]
 +
 
====Lectures====
 
====Lectures====
* [//www.statgen.us/files/2020/01/docs/lectures/LealCourse_Cox_Jan2020.pdf Integrative Approaches in Biobanks: Getting to Biological Mechanisms of Disease]
+
[http://statgen.us/files/2021/02/lectures/LealCourse_Cox_Jan2021.pdf Integrative Approaches in Biobanks: Getting to Biological Mechanisms of Disease]
  
 
===Andrew DeWan===
 
===Andrew DeWan===
====Lectures====
+
[http://www.statgen.us/files/2021/02/bios/about_atd.pdf About Andy DeWan]
* [//www.statgen.us/files/2020/01/docs/lectures/Rockefeller_Adv_Course_DeWan_pleiotropy_4.pdf From cross-phenotype associations to pleiotropy in human genetic studies]
+
 
 +
====Lecture====
 +
* [http://statgen.us/files/2021/02/lectures/DeWan_Adv_Gene_Mapping_Mendelian%20Randomization.pdf Mendelian randomization]
 +
* [http://statgen.us/files/2021/02/lectures/Rockefeller_Adv_Course_DeWan_pleiotropy.pdf Pleiotropy]
  
 
====Exercises====
 
====Exercises====
* [[Tutorials#Pleiotropy|Pleiotropy]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/pleiotropy.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/pleiotropy/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/pleio_exercise.pdf Pleiotropy]
 +
* [http://statgen.us/files/2021/02/exercises/pleio_exercise_answers.pdf Answers to Pleitropy Exercise]
 +
* [http://statgen.us/files/2021/02/exercises/MR_exercise.pdf Mendelian Randomization]
 +
* [http://www.statgen.us/files/2021/02/exercises/MR_exercise_answers.pdf Mendelian Randomization Answers]
  
 
===Suzanne Leal===
 
===Suzanne Leal===
 +
[http://www.statgen.us/files/2021/02/bios/about_sml.pdf About Suzanne Leal]
 +
 
====Lectures====
 
====Lectures====
* [//www.statgen.us/files/2020/01/docs/lectures/data_quality_control_1.pdf Data Quality Control]
+
* [http://statgen.us/files/2021/02/lectures/data_quality_control_1.pdf Data Quality Control]
* [//www.statgen.us/files/2020/01/docs/lectures/NGS_data_qc.pdf NGS Data Quality Control]
+
* [http://statgen.us/files/2021/02/lectures/NGS_data_qc.pdf NGS Data Quality Control]
* [//www.statgen.us/files/2020/01/docs/lectures/analysis_rare_variants_part_1.pdf Complex Trait Association Analysis of Rare Variants]
+
* [http://statgen.us/files/2021/02/lectures/analysis_rare_variants_part_1.pdf Complex Trait Association Analysis of Rare Variants Obtained from Sequence Data: Population-Based Data]
* [//www.statgen.us/files/2020/01/docs/lectures/power_sample_sizes.pdf Power Analysis for Single and Rare Variant Aggregate Association Analyses]
+
* [http://statgen.us/files/2021/02/lectures/power_sample_sizes.pdf Power Analysis for Single and Rare Variant Aggregate Association Analyses]
* [//www.statgen.us/files/2020/01/docs/lectures/Analysis_rare_variants_part_2.pdf National Heart Lung and Blood Institute Exome Sequencing Project]
+
  
 
====Exercises====
 
====Exercises====
* [[Tutorials#PSEQ|PLINK/SEQ (PSEQ)]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/pseq.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/pseq/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/PSEQ_exercise.pdf PLINK/SEQ (PSEQ)]
* [[Tutorials#Variant_Association_Tools|Association Analysis of Sequence Data using Variant Association Tools for Complex Traits]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/vat.dockerfile dockerfile] and [https://hub.docker.com/r/statisticalgenetics/vat/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/VAT_2021.pdf Association Analysis of Sequence Data using Variant Association Tools for Complex Traits]
* [[Tutorials#Cochran_Armitage_Trend_Test|Cochran Armitage Trend Test for GWAS power analysis]]
+
* [http://statgen.us/files/2021/02/exercises/Armitage_sample_size.pdf Cochran Armitage Trend Test for GWAS power analysis]
* [[Tutorials#Annotation|ANNOVAR Annotation]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/annovar.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/annovar/tags docker image]
+
* [http://statgen.us/files/2021/02/exercises/FunctionalAnnotation.2021.pdf ANNOVAR Annotation]
 +
* [http://statgen.us/files/2021/02/exercises/GWAS_Data_Controlling_for_Substructure_V2.pdf Genome-Wide Association - Association Analysis Controlling for Population Substructure]
 +
* [http://statgen.us/files/2021/02/exercises/PLINK_data_QC_V2.pdf Genome-Wide Association Analysis - Data Quality Control]
 +
 
 +
===Judy Matuk===
 +
[http://www.statgen.us/files/2021/02/bios/about_jm.pdf About Judy Matuk]
  
===Bogdan Pasanuic===
 
 
====Lectures====
 
====Lectures====
* [//www.statgen.us/files/2020/01/docs/lectures/finemapping_pasaniuc.pdf Statistical fine-­mapping of GWAS signals]
+
* [http://statgen.us/files/2021/02/lectures/Rockefeller_draft_HSR%20Ethics%20and%20Regulations%20PPT_%20jm%201.18.2021.pdf Human Subject Training]
<!--
+
 
====Exercises====
+
===Jurg Ott===
* finemap
+
[http://www.statgen.us/files/2021/02/bios/about_jo.pdf About Jurg Ott]
-->
+
 
 +
====Lectures====
 +
[http://www.statgen.us/files/2021/02/lectures/FPMforInteractions2021-01.pptx Frequent Pattern Mining Methods for Finding SNP-SNP Interactions]
  
 
===Shamil Sunyaev===
 
===Shamil Sunyaev===
 +
[http://www.statgen.us/files/2021/02/bios/about_ss.pdf About Shamil Sunyaev]
 +
 +
====Exercises====
 +
* [http://statgen.us/files/2021/02/exercises/NPS.pdf Polygenic risk prediction using non-parametric shrinkage] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/nps.dockerfile dockerfile] and [https://hub.docker.com/r/statisticalgenetics/nps/tags docker image]
 +
 
====Lectures====
 
====Lectures====
* [//www.statgen.us/files/2020/01/docs/lectures/Rockefeller_pop_gen_2020.pdf Population Genetics]
+
* [http://statgen.us/files/2021/02/lectures/Rockefeller_complex_traits_2021.pdf Complex Traits]
* [//www.statgen.us/files/2020/01/docs/lectures/Rockefeller_complex_traits_2020.pdf Evolution, maintenance and allelic architecture of complex traits]
+
* [http://statgen.us/files/2021/02/lectures/Rockefeller_functional_annotation_2021.pdf Functional Annotation]
* [//www.statgen.us/files/2020/01/docs/lectures/Rockefeller_polygenic_2020.pdf Polygenic Risk Score]
+
* [http://statgen.us/files/2021/02/lectures/Rockefeller_polygenic_risk_score_2021.pdf Polygenic Risk Score]
* [//www.statgen.us/files/2020/01/docs/lectures/Rockefeller_function_2019.pdf Functional Annotation]
+
* [http://statgen.us/files/2021/02/lectures/Rockefeller_population_genetics_2021.pdf Population Genetics]
 +
 
 +
===Gao Wang===
 +
[http://www.statgen.us/files/2021/02/bios/about_gw.pdf About Gao Wang]
 +
 
 +
====Lectures====
 +
* [http://statgen.us/files/2021/02/lectures/Lecture_Univariate_Fine_Mapping.pdf Statistical fine-mapping]
 +
* [http://statgen.us/files/2021/02/lectures/Lecture_Multivariate_Analysis.pdf Multivariate analysis]
  
 
====Exercises====
 
====Exercises====
* [[Tutorials#Polygenic_risk_prediction_.28NPS_method.29|Polygenic risk prediction using non-parametric shrinkage]] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/nps.dockerfile dockefile] and [https://hub.docker.com/r/statisticalgenetics/nps/tags docker image]
+
* [https://github.com/statgenetics/statgen-courses/raw/master/handout/finemapping.docx Statistical fine-mapping in association studies] | [https://github.com/statgenetics/statgen-courses/blob/master/docker/finemap.dockerfile dockerfile] and [https://hub.docker.com/r/statisticalgenetics/finemap/tags docker image]
 +
* [https://github.com/statgenetics/statgen-courses/raw/master/handout/finemapping_answers.docx Answers to fine-mapping exercise questions]
  
 
This course is supported by a grant from the National Institute of Health (NIH) - National Human Genome Research Institute (NHGRI).
 
This course is supported by a grant from the National Institute of Health (NIH) - National Human Genome Research Institute (NHGRI).
 +
 +
===Dan Zhou===
 +
[http://www.statgen.us/files/2021/02/bios/about_dz.pdf About Dan Zhou]
 +
====Computer Demonstration====
 +
* [http://statgen.us/files/2021/02/lectures/MR-JTI.pdf MR-JTI: a framework for Joint Tissue Imputation-based TWAS and Mendelian Randomization]

Latest revision as of 18:03, 29 January 2021

Advanced Gene Mapping Course

The Rockefeller University, New York 
Online
Monday through Friday, January 25-29, 2021

General Information

An Advanced Gene Mapping course will be held online from Monday through Friday, January 25-29, 2021. The cost of the 5-day course is $100 for student, academic, and government researchers and $1,500 for researchers working in industry. This fee covers tuition and course-related expenses (cloud computing, etc.).

The course emphasis is on analyzing sequence and other omics data to elucidate the genetic etiology of complex human disease traits. Topics will include: data quality control of sequence and other types of data; single variant and aggregate rare variant association analysis of whole-genome data (genotype, sequence, and imputed) for qualitative and quantitative traits (population and family-based data); controlling for population admixture and substructure; generalized linear mixed models and linear mixed models; meta-analysis; sample size estimation and power calculations; detecting gene x gene and gene x environment interactions; analysis of epigenomic data, e.g., methylation and chromatin; heritability estimation using variant and RNA-Seq data; analysis of RNA-Seq data; eQTL mapping; elucidating pleiotropy; functional prediction and variant annotation; estimation of polygenic risk scores; Mendelian randomization; mediation analysis; and fine mapping. As mandated by the NIH, there will also be a special session on responsible conduct of research that will include sessions on conflict of interest, research ethics, data management (security), and ethical use of human research subjects.

A variety of freely available software packages will be used for the practical exercises because the packages differ in functionality. PSEQ and VAT will be used to annotate sequence data and to perform quality control, rare variant association analysis, and meta-analysis. FaST-LMM, GCTA-MLMA, and REGENIE will be used to analyze population- and family-based data by applying generalized linear mixed models (qualitative traits) and linear mixed models (quantitative traits). For rare variant association analysis of trio data, RV-TDT will be applied. MultiPhen (multivariate) and PLINK (univariate) will be contrasted in their ability to detect pleiotropy. Mediation analysis will be performed using R to aid in distinguishing between biological, mediated, and spurious pleiotropy. To make inferences on causality, Mendelian randomization will be performed using MR-Base. Polygenic risk scores will be estimated using LDpred and non-parametric shrinkage. SuSiE will be used for fine mapping to aid in the detection of causal susceptibility variants. Heritability will be estimated using GCTA. For analysis of eQTLs, Matrix eQTL will be used. Imputed expression data will be analyzed by applying PrediXcan. To perform analytical and empirical power analysis for single and rare variant aggregate tests, a variety of tools will be used, including the Armitage Power Tool and the SKAT R library. Additionally, variant annotation will be performed with ANNOVAR as well as directly using a variety of functional prediction and conservation tools, e.g., CADD, GERP, MutationTaster, MutPred, PolyPhen-2, and SIFT.

Course Instructors

The instructors for the course are Heather Cordell (University of Newcastle), Nancy Cox (Vanderbilt University), Andrew DeWan (Yale University), Suzanne Leal (The Rockefeller University & Columbia University), Shamil Sunyaev (Harvard University) & Gao Wang (Columbia University). Judy Matuk (HRP Consulting Group) will lecture on ethics and the regulation of human subject research. A special guest lecture will be given by Jurg Ott (Rockefeller University).

Additional Information

The maximum number of participants for this online course is 34.  

Knowledge of genetic association analysis, genetic epidemiology, and/or statistical genetics is a screening criterion for the selection of participants. Please describe your experience in detail in your application. It is helpful if you also enclose a copy of your CV. We may contact you personally to discuss your application. Although experience with Linux is not required, basic knowledge of this operating system before the start of the course is highly beneficial.

For additional information, please contact Katherine Montague
email: montagk@rockefeller.edu 

 

For additional information on the scientific program, contact the course organizer, Suzanne Leal
email: suzannemleal@gmail.com or sml3@cumc.columbia.edu


Applications are no longer being accepted.


Click here for course schedule

Click here for the application form

Click here for course flyer (please post and distribute)

How to run the exercises

Handouts

Heather Cordell

About Heather Cordell

Lectures

Exercises

Nancy Cox

About Nancy Cox

Lectures

Integrative Approaches in Biobanks: Getting to Biological Mechanisms of Disease

Andrew DeWan

About Andy DeWan

Lectures

Exercises

Suzanne Leal

About Suzanne Leal

Lectures

Exercises

Judy Matuk

About Judy Matuk

Lectures

Jurg Ott

About Jurg Ott

Lectures

Frequent Pattern Mining Methods for Finding SNP-SNP Interactions

Shamil Sunyaev

About Shamil Sunyaev

Exercises

Lectures

Gao Wang

About Gao Wang

Lectures

Exercises

This course is supported by a grant from the National Institutes of Health (NIH) - National Human Genome Research Institute (NHGRI).

Dan Zhou

About Dan Zhou

Computer Demonstration