Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

Assignment 1

BS6203

Task 1:

• Go to Pinterest, and review some data science or machine learning or programming infographics (e.g. https://www.pinterest.com/kourouklides/data-science/?autologin=true or https://www.pinterest.com/gohwils/data-science/)

• Find one (or a few) that you really like (for presentation and content)

• Try to list down why you enjoyed it.

Task 2:

The UPR (Unfolded Protein Response) is a cellular stress response that is activated by an accumulation of unfolded or misfolded proteins in the lumen of the endoplasmic reticulum. In this scenario, the UPR has three aims: initially to restore normal function of the cell by halting protein translation, degrading misfolded proteins, and activating the signalling pathways that lead to increasing the production of molecular chaperones involved in protein folding. If these objectives are not achieved within a certain time span or the disruption is prolonged, the UPR aims towards apoptosis. There are 3 pathways that feed into a mechanism known as the UPR. This requires turning on any of 3 potential paths via IRE1, PERK and ATF6. Let us also assume that when turning on these 3 paths, all downstream targets are also all turned on, and there is no suppression.

A student performed a series of knock outs. And for each knock out, measured which genes are still inducible to create the UPR. The results are shown in the Venn diagram here.

1. Does "a" show the set of genes downstream of IRE1? If not, what is it?

2. What is the expected value of "v" if only 3 paths exist?

3. What should you expect the value of v to be if > 3 paths exist?

4. Do you expect "b" and "c" to be 0?

5. Is "x" the common set of genes shared between IRE1 and Perk?

6. What is the expected value of "u"? Is it possible to even get "u"?

7. Are there situations where regions a b and c are non-empty?

8. Does v correspond to triple knock out?