-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathppi-network.Rmd
54 lines (37 loc) · 1.35 KB
/
ppi-network.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
---
# Protein-Protein Network
### The data
- The PPI network in the neighborhood of ER composed by 741 proteins
- Valued dyads: $\omega_{ij} \in (0,1]$ reflecting the level of confidence in the interaction
- Binarization of the network with a threshold $\gamma$:
$$\mathbf{Y}^\gamma = \left(Y^\gamma\right)_{ij} = \left\{
\begin{array}{ll}
1 & \text{if } \omega_{ij} > 1-\gamma, \\
\texttt{NA} & \text{if } \gamma \leq \omega_{ij} \leq 1-\gamma, \\
0 & \text{if } \omega_{ij} < \gamma. \\
\end{array}
\right.$$
### Questions
- What $\gamma$?
- What sampling design: MAR or NMAR?
---
# Model Selection
```{r, echo = FALSE}
knitr::include_graphics("slides_files/ICL-PPI.png")
```
- The ICL criterion selects $\gamma = .35$ and MNAR sampling as the one that better fit the data
- Number of selected clusters: $11$ (MAR) and $13$ (NMAR)
- ARI between NMAR clustering and MAR clustering: $.39$
- MNAR clustering somehow coherent with gene ontology
---
# Imputation
```{r, echo = FALSE}
knitr::include_graphics("slides_files/net_esr1_NA_nu_both.jpg")
```
---
# Gene Ontology (GO)
Enrichment analysis *i.e.* identifying classes of genes over-represented in a large set of genes
MNAR founde $13$ significant biological process founded (MAR: only $1$)
```{r, echo = FALSE, fig.height=3}
knitr::include_graphics("slides_files/ontology.png")
```