-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
no clip labels question #19
Comments
I think you would need to modify the get_best_kmeans function, which relies on the silhouette score. Some other options for clustering metrics can be found here: |
@LumenPallidium in hierarchical clustering cant we use clip and tags like we compare the first tags ex "art,realism,design" then each image picks one label and goes down the tree, if it pick art it now compares for example "watercolor,pointilist,oilpainting,graphitti etc" etc so we can define a strcuture and tell the clip to pick the best of options and at the end it organizes everything nicelly, like an image of a lion could be in [realism -> wild photography -> lion ] folder i was thinking like we define in a json that each node has children, so instead of clip comparing all the tags at the same time , we compare level by level of the tree until it reaches a final leaf iam not sure how accurate hierarchical clustering is if it doesnt use clip, it tried it and in the 3d plot it looked off, |
When i use no clip labels and at the same time i use estimate_k = True i get only 2-3 clusters, is there a way to increase this number and force more cluster that have similar features, without disabling estimate_k ? if i disable estimate_k i have to guess moreless how many clusters i need and end up with too many clusters
The text was updated successfully, but these errors were encountered: