Intro cluster ml #935

dsbuddy · 2024-05-27T19:41:11Z

No description provided.

drelliche · 2024-06-21T17:42:38Z

drelliche · 2024-06-21T18:37:49Z

This module is very much in conversation with #908, from which it was broken out. Here is some general feedback:

General Structure

Learning objectives

I think these are just copied over from when you split the module in two. For this intro module, the learning objectives might be more along these lines: learners will be able to

Define crucial vocabulary, including "clustering" and "supervised vs. unsupervised"
Understand how the k-means clustering algorithm works, broadly speaking.
Identify appropriate applications of k-means clustering

Pre-Reqs

Since no programming will be done in this module, the pre-reqs should also be updated.

Timing

I think 20 minutes is an underestimate, but let's see where it ends up after editing.

Quizzes

I love that there are so many quiz questions. I think we might want to move and group them into specific "Quiz" sections. We will also want to double check that they align with the learning objectives once those are smoothed out.

Examples

The biomedical examples are great. I think they would be more effective if they were used to illustrate specific concepts or algorithm types/parts. The classic example of customer segmentation can probably be omitted and replaced with a (general or specific) example of patient segmentation.

Suggested Table of Contents

Introduction to Clustering

What is Clustering?

Example: Patient Stratification

This seems to be a classic example of clustering.

Quiz: Clustering

Key Vocabulary

Encouragement box here! Use one of the biomedical examples to illustrate outcome/response/dependent variables/labels, input/predictors/features, etc.

Supervised vs Unsupervised Learning

For this and the next 3 sections, you already have a description. Would it be possible to join it with one of your biomedical research examples to illustrate each of these concepts?

Normalization

Distance to the centroid

Visualization

Quiz: what vocabulary is it crucial people know?

Types of Clustering

"There are many different clustering algorithms, each with its own strengths and weaknesses. Some of the most common clustering algorithms include K-Means clustering, hierarchical clustering, and Gaussian Mixture Models (GMMs)" Expand a little more on this.

K-Means Clustering

This is the "how it works" description

A K-Means Example

Is there a video/image/illustration/example that you can link to or embed?
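If nothing ready-made turns up, a minimal sketch like the one below could serve as the basis for a step-by-step illustration. The module itself does no programming, so this would only be used to generate a figure or walk-through, not shown to learners as an exercise. Everything here is an assumption for illustration: the function name `kmeans`, the made-up patient values (not taken from the heart or polyps data), and the choice to pass initial centroids explicitly so the run is reproducible. Real implementations usually pick starting centroids randomly.

```python
import math


def kmeans(points, centroids, n_iter=10):
    """Plain k-means on (x, y) tuples, starting from the given centroids."""
    k = len(centroids)
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                mean = tuple(sum(col) / len(members) for col in zip(*members))
                new_centroids.append(mean)
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:
            break  # nothing moved, so the assignments are stable
        centroids = new_centroids
    return labels, centroids


# Hypothetical values: (age in years, resting heart rate in bpm).
patients = [(25, 62), (27, 65), (30, 60), (68, 88), (72, 90), (70, 85)]

# Hypothetical starting centroids: one younger and one older patient.
labels, centroids = kmeans(patients, centroids=[(25, 62), (72, 90)])
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Starting both centroids inside the same group, for example `[(68, 88), (70, 85)]`, makes this toy run settle on a mixed split instead, which could double as a concrete case for the Potential Pitfalls section.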

Potential Pitfalls

This is the "important notes" page

Quiz: K-Means Clustering

Additional Resources

Feedback

dsbuddy added 7 commits November 2, 2023 09:47

Initial draft of clustering module

44149ea

Added heart data for clustering python exercise

be925e8

Added clustering python exercise

c043653

Added polyps data for clustering real world example

46656de

Updated module to include real-world example

3af59e3

Addressed Rose's comments from the following module

981f599

Split up clustering module and extract overview

a53001b

dsbuddy assigned rosemm May 27, 2024

rosemm added the Quality Assurance label Jun 12, 2024

Updated changes based off Elizabeth's comments

5e436b1

drelliche marked this pull request as draft July 17, 2024 15:28
