Essentially in this code we wanted to take microarray data and be able to turn it into more readable and usable information that the general population could understand.
The overall framework of this program includes:
-
Loading in the microarray data and confirting it into red, green and blue as an array. This was done so that we have a way to distinctly quantify the expressions.
-
From there the cell size was assessed in order to determine how exactly we would partition the groupings.
-
We created a blank image of the cells to begin to compare how many different cells would exist. This was particularly important because not all cells exhibit any pixels.
-
The circles are than overlaid to match the image of the microarray provided. You can visually confirm that this is correct by showing the plot.
-
From there, we calculated the average color in each circle this will help us to understand what expression rate is present.
-
For each circle created, we stored both the poition and the RBG values to create a dataframe with.
-
Then the corresponding color was added to the corresponding circle on the blank cells. This helped to simplify the overall expression and create an image of such.
-
Then a new image was created with the averaged colors of each cell. Any cells that were black or did not show any expression at all whatsoever were minimized to be black.
-
Then each of the colors red and green were saved as a tiff file to isolate the values. Each of these intensities was added to a dataframe to store.
-
Because each cell represents a gene, we added the correlating gene to the dataframe to begin to associate genes with expressions other than being just values.
-
From there we plotted red and green intensities to overall determine what would be an appropriate threshold for our heat map so that we can leave out the wells with no binding.
- Based on threshold values we created a dataframe with those thresholds that we determined in step 11. Then, we created a heat map based on this.