-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implemented Various Distance Metric #107
base: main
Are you sure you want to change the base?
Changes from 25 commits
4900b40
341ebc0
34f6cc2
0c12221
d6d077b
16031e4
3fb7402
019f9f2
e955eee
9718f25
c2d485e
3512f9b
5d0550e
93d1b7a
0d73928
c3fb7c9
004c5c2
8d19454
2e4d110
a55d7cb
4e01ffb
6601058
cf7e2c6
313e84e
fbbeb70
56ead03
77e2b07
deb0d05
87d1367
5c3a18e
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
# Distance Metric | ||
|
||
## Euclidean Distance | ||
Euclidean Distance represents the shortest distance between two points. | ||
|
||
## Manhattan Distance | ||
Manhattan Distance is the sum of absolute differences between points across all the dimensions. | ||
|
||
## Manhattan Distance | ||
Minkowski Distance is the generalized form of Euclidean and Manhattan Distance. | ||
|
||
## Cosine Similarity | ||
Cosine similarity is a metric, helpful in determining, how similar the data objects are irrespective of their size. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Add sentence like
for all four metric. |
||
|
||
## Parameters | ||
|
||
| Name | Definition | Type | | ||
| -----------------| --------------------------------------------------------------------------------------- | ---- | | ||
| x | A vector of values | `T`| | ||
| y | A vector of values | `T`| | ||
|
||
|
||
## Methods | ||
|
||
| Name | Definition | Return value | | ||
| ------------------------------- | ----------------------------------------------------- | ----------------- | | ||
| `euclidean()` | To find the euclidean distance | `double` | | ||
| `manhattan()` | To find the manhattan distance | `int, double` | | ||
| `minkowski(int p)` | To find the minkowski distance | `double` | | ||
| `magnitude(vector<T> &x)` | To find the magnitude of the vector | `double` | | ||
| `cosineSimilarity()` | To find the cosine similarity | `double` | | ||
|
||
|
||
|
||
## Example | ||
|
||
```cpp | ||
std::vector<double> dist1 = {1, 4, 4, 4}; | ||
std::vector<double> dist2 = {1, 2, 3, 4}; | ||
DistanceMetric Dist(dist1, dist2); | ||
std::cout << "Minkowski Distance is " << Dist.minkowski(3) << std::endl; | ||
std::cout << "Euclidean Distance is " << Dist.euclidean() << std::endl; | ||
std::cout << "Manhattan Distance is " << Dist.manhattan() << std::endl; | ||
std::cout << "Cosine Similarity is " << Dist.cosineSimilarity() << std::endl; | ||
``` | ||
|
||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
// #include "../src/slowmokit/methods/metrics/distance_metric/distance_metric.hpp" | ||
|
||
|
||
// int main() | ||
// { | ||
// std::vector<double> dist1 = {1, 4, 4, 4}; | ||
// std::vector<double> dist2 = {1, 2, 3, 4}; | ||
// DistanceMetric Dist(dist1, dist2); | ||
// std::cout << "Minkowski Distance is " << Dist.minkowski(3) << std::endl; | ||
// std::cout << "Euclidean Distance is " << Dist.euclidean() << std::endl; | ||
// std::cout << "Manhattan Distance is " << Dist.manhattan() << std::endl; | ||
// std::cout << "Cosine Similarity is " << Dist.cosineSimilarity() << std::endl; | ||
// return 0; | ||
// } |
Original file line number | Diff line number | Diff line change | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
@@ -0,0 +1,72 @@ | ||||||||||||||
/** | ||||||||||||||
* @file methods/metrics/distance_metric/distance_metric.hpp | ||||||||||||||
* | ||||||||||||||
* Easy include to calculate distance metrics | ||||||||||||||
*/ | ||||||||||||||
|
||||||||||||||
#include "distance_metric.hpp" | ||||||||||||||
|
||||||||||||||
template<class T> | ||||||||||||||
DistanceMetric<T>::DistanceMetric(std::vector<T> &x, std::vector<T> &y) | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
We don't want user to feel insecure about their data. Use const in cpp but not in hpp?
void f(int);
void f(const int param1) {
return;
} is valid and recommended. |
||||||||||||||
{ | ||||||||||||||
this->x = x; | ||||||||||||||
this->y = y; | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
Use initialiser list, they are faster. |
||||||||||||||
if (x.size() != y.size()) | ||||||||||||||
{ | ||||||||||||||
throw std::domain_error("Size of the two vectors must be same"); | ||||||||||||||
} | ||||||||||||||
} | ||||||||||||||
|
||||||||||||||
template<class T> double DistanceMetric<T>::euclidean() | ||||||||||||||
{ | ||||||||||||||
double distance = 0; | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
You will have to downcast UPDATE: Do this wherever I have added |
||||||||||||||
int n = x.size(); | ||||||||||||||
for (int i = 0; i < n; i++) | ||||||||||||||
{ | ||||||||||||||
distance += (x[i] - y[i]) * (x[i] - y[i]); | ||||||||||||||
} | ||||||||||||||
return std::sqrt(distance); | ||||||||||||||
} | ||||||||||||||
|
||||||||||||||
template<class T> T DistanceMetric<T>::manhattan() | ||||||||||||||
{ | ||||||||||||||
T distance = 0; | ||||||||||||||
int n = x.size(); | ||||||||||||||
for (int i = 0; i < n; i++) | ||||||||||||||
{ | ||||||||||||||
distance += std::abs(x[i] - y[i]); | ||||||||||||||
} | ||||||||||||||
return distance; | ||||||||||||||
} | ||||||||||||||
template<class T> double DistanceMetric<T>::minkowski(int power) | ||||||||||||||
{ | ||||||||||||||
double distance = 0; | ||||||||||||||
int n = x.size(); | ||||||||||||||
for (int i = 0; i < n; i++) | ||||||||||||||
{ | ||||||||||||||
distance += std::pow(x[i] - y[i], power); | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Use |
||||||||||||||
} | ||||||||||||||
return std::pow(distance, 1.0 / power); | ||||||||||||||
} | ||||||||||||||
|
||||||||||||||
template<class T> double DistanceMetric<T>::magnitude(std::vector<T> &x) | ||||||||||||||
{ | ||||||||||||||
double result = 0; | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||
int n = x.size(); | ||||||||||||||
for (int i = 0; i < n; i++) | ||||||||||||||
{ | ||||||||||||||
result += x[i] * x[i]; | ||||||||||||||
} | ||||||||||||||
return std::sqrt(result); | ||||||||||||||
} | ||||||||||||||
|
||||||||||||||
template<class T> double DistanceMetric<T>::cosineSimilarity() | ||||||||||||||
{ | ||||||||||||||
double dotProduct = 0; | ||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||
int n = x.size(); | ||||||||||||||
for (int i = 0; i < n; i++) | ||||||||||||||
{ | ||||||||||||||
dotProduct += x[i] * y[i]; | ||||||||||||||
} | ||||||||||||||
return dotProduct / (magnitude(x) * magnitude(y)); | ||||||||||||||
} |
Original file line number | Diff line number | Diff line change | ||||||
---|---|---|---|---|---|---|---|---|
@@ -0,0 +1,58 @@ | ||||||||
/** | ||||||||
* @file methods/metrics/distance_metric/distance_metric.hpp | ||||||||
* | ||||||||
* Easy include to calculate distances | ||||||||
*/ | ||||||||
|
||||||||
#ifndef SLOWMOKIT_DISTANCE_METRIC_HPP | ||||||||
#define SLOWMOKIT_DISTANCE_METRIC_HPP | ||||||||
#include "../../../core.hpp" | ||||||||
|
||||||||
/** | ||||||||
* Takes predicted and actual values of classes | ||||||||
* @param x | ||||||||
* @param y | ||||||||
Comment on lines
+13
to
+14
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Add short statement describing |
||||||||
* @returns the distance metrics | ||||||||
* @throws domain_error exception when size of the two vectors is not equal | ||||||||
*/ | ||||||||
template<class T> class DistanceMetric | ||||||||
{ | ||||||||
private: | ||||||||
std::vector<T> x; | ||||||||
std::vector<T> y; | ||||||||
|
||||||||
public: | ||||||||
DistanceMetric(std::vector<T> &x, std::vector<T> &y); | ||||||||
|
||||||||
/** | ||||||||
* @returns euclidean distance between the two vectors | ||||||||
*/ | ||||||||
double euclidean(); | ||||||||
|
||||||||
|
||||||||
/** | ||||||||
* @returns manhattan distance between the two vectors | ||||||||
*/ | ||||||||
T manhattan(); | ||||||||
|
||||||||
|
||||||||
/** | ||||||||
* @param power The order of the norm | ||||||||
* @returns minkowski distance between the two vectors | ||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||
*/ | ||||||||
double minkowski(int); | ||||||||
|
||||||||
/** | ||||||||
* @brief to find the magnitude of the vector | ||||||||
* @param x a vector | ||||||||
* @returns magnitude of x | ||||||||
*/ | ||||||||
double magnitude(std::vector<T> &); | ||||||||
Ishwarendra marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||||
|
||||||||
/** | ||||||||
* @returns cosine similarity between the two vectors | ||||||||
*/ | ||||||||
double cosineSimilarity(); | ||||||||
}; | ||||||||
|
||||||||
#endif // SLOWMOKIT_DISTANCE_METRIC_HPP |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Change distance name to minkowski