This is assignment is due by 5pm, Tuesday 9th June.
All questions are to be answered on the same R Markdown / PDF, regardless of if they require a plain text answer, or require execution of code.
Marks directly correspond to the amount of time and effort we expect for each question, so please answer with this is mind.
We strongly advise working in the folder ~/transcriptomics/assignment4
on your virtual machine. Using an R Project for each individual assignment is also strongly advised.
If all files required for submission are contained on your VM:
.zip
If all files are on your on your local Windows machine:
Send to > Compressed (zipped) folder
.zip
If all the files are on your *local macOS machine`:
A Module Eigengene (ME) is one of the fundamental concepts of gene co-expression network analysis using WGCNA.
We have discussed that the key point of WGCNA analysis is to discover biological significance. Describe how can we build the link between our co-expression network analysis and biological significance.
Bulk RNA-seq and scRNA-seq differ in many ways. Describe two common aspects shared between the two approaches, and provide details about two key differences between the two.
Use the given gene expression dataset and corresponding clinical data to identify co-expression module(s) with the strongest correlation with clinical data. The gene expression dataset includes raw counts for human reference genes across 37 samples (skin cancer patients), and the clinical data includes diagnosed cancer stage status of 37 patients. The results should minimally include:
Using the SingleCellExperiment object provided, perform the following steps. Please note that undetectable genes and low quality cells have already been removed from this dataset. Counts are already normalised and log transformed, and contain expression patterns from 622 mouse neuronal cells.