YOUTUBE Under Armour Innovation Search Project
This is an innovation project concept for finding specific, niche innovations online.
THE PROTOTYPE:
The demo and supporting documentation outline what it takes to find innovative videos pertaining to shoes
on YouTube in near real time (NRT).
See the main algorithm, query-shoe.conf.
The prototype was designed and built by me (Ian Robertson), and provides a detailed analysis of shoe videos
returned by the Google YouTube API.
I use scoring as a means of determining statistically whether a newly-uploaded video contains the relevant content (i.e. words)
that are directly associated with innovation.
To do this, I must consider four components of any form of innovation:
1. Nomenclature
2. Anatomy
3. Creative Expressions
4. Uniqueness
The main part of the "algorithm" explores Distance-Vector analysis of key words and phrases
commonly associated with innovation, measured on a base-10 logarithmic scale.
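One possible reading of the log-scaled distance idea, as a minimal sketch only. The weighting formula and the name distance_weight are assumptions made for illustration; they are not taken from query-shoe.conf.

```ruby
# Hypothetical weighting (an assumption, not the project's formula):
# damp the contribution of a keyword pair as the word distance between
# the two terms grows, using a base-10 logarithm.
# `distance` is the number of words from one term to the other (>= 1).
def distance_weight(distance)
  raise ArgumentError, 'distance must be >= 1' if distance < 1

  1.0 / (1.0 + Math.log10(distance))
end

[1, 10, 100].each do |d|
  puts format('distance %3d -> weight %.3f', d, distance_weight(d))
end
```

With this form, adjacent terms keep their full weight and terms ten words apart count for half as much, so distant matches still contribute but cannot dominate.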
A Premise for any Search Framework:
An organization searching for a specific product enhancement or feature (Innovation) should leverage an
iterative algorithmic search for external knowledge that encompasses situated search paths, analogical search paths,
sophisticated search paths, and scientific search paths.
Once a specific instance is found and deemed relevant to the initial objective,
I believe that a recursive search pattern should be automated to determine the strength of the iterative search path,
and whether any adjustments need to be made to the former.
The "Algorithm":
*Written in Ruby
*Runs within Logstash, Outputs to Elasticsearch, viewed on Kibana (UI)
*Performs:
Tagging => preprocessing => parsing => aggregation of terms => word-by-word parsing and looping => scoring =>
labeling => insert back into the backend.
Location: /etc/logstash/conf.d/query-shoe.conf
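The stages listed above can be sketched in plain Ruby as a chain of small functions over an event hash. This is an illustration under stated assumptions, not the contents of query-shoe.conf: the field names, the tag and label values, the shortened term lists, and the array standing in for the backend are all hypothetical.

```ruby
# Shortened excerpts of the term lists, for the example only.
WORDS = %w[welt insole design designed technology new my].freeze
NEGATE_WORDS = %w[nike review sale cheap subscribe].freeze

def tag(event)
  event.merge('tags' => ['shoe_video'])            # hypothetical tag name
end

def preprocess(event)
  text = "#{event['title']} #{event['description']}".downcase
  event.merge('text' => text.gsub(/[^a-z0-9\s']/, ' '))
end

def parse(event)
  event.merge('tokens' => event['text'].split)
end

def aggregate_terms(event)
  counts = event['tokens'].each_with_object(Hash.new(0)) { |word, acc| acc[word] += 1 }
  event.merge('term_counts' => counts)
end

# Loop word by word: add the count for positive terms, subtract it for negative ones.
def score(event)
  total = event['term_counts'].sum do |word, count|
    if WORDS.include?(word) then count
    elsif NEGATE_WORDS.include?(word) then -count
    else 0
    end
  end
  event.merge('terms_score' => total)
end

def label(event)
  event.merge('label' => event['terms_score'].positive? ? 'candidate' : 'rejected')
end

def run_pipeline(event, backend)
  stages = %i[tag preprocess parse aggregate_terms score label]
  result = stages.reduce(event) { |ev, stage| send(stage, ev) }
  backend << result                                # stand-in for the backend insert
  result
end

backend = []
video = { 'title' => 'My new welt design',
          'description' => 'Handmade insole technology. Subscribe!' }
result = run_pipeline(video, backend)
puts "#{result['label']} (terms_score=#{result['terms_score']})"
```

Each stage returns a new hash rather than mutating its input, which keeps the stages easy to test one at a time.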
Project Highlights:
1. Set up server
2. Installed/upgraded software
3. Configured applications
4. Created Dashboards & Visualizations
5. Created Email and Dashboard Alerting
6. Developed and ran ingest pipeline
7. Built data storage and retrieval process
Shoe Innovation Search:
OBJECTIVES:
Find videos that express innovative uses pertaining to shoes.
Identify context, creative expression, innovative components (nomenclature) on various parts of the anatomy.
Creative, Anatomy, Nomenclature (CAN) words/expressions:
Only score CAN words: Creative, Anatomy, Nomenclature
Usage: for each CAN word in C(sub-i), recursively move back to the last starter word and count the distance between them for the Distance Vector.
Identify the distance between critical words and the directional flow of the words.
e.g. “My new welt design” or “This new insole technology I designed”
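A minimal sketch of the distance count described above, applied to the two example phrases. The README does not list the starter words, so STARTER_WORDS is an assumption, CAN_WORDS is a short excerpt, and the backward search is done by remembering the most recent starter while scanning forward.

```ruby
STARTER_WORDS = %w[my this i].freeze                             # assumed, not from the source
CAN_WORDS = %w[welt insole design designed technology].freeze   # excerpt only

# Returns [word, distance] pairs: for every CAN word, the number of words
# back to the most recent starter word. CAN words with no preceding
# starter word are skipped.
def distance_vector(text)
  tokens = text.downcase.scan(/[a-z']+/)
  last_starter = nil
  vector = []
  tokens.each_with_index do |token, index|
    last_starter = index if STARTER_WORDS.include?(token)
    next unless CAN_WORDS.include?(token) && last_starter

    vector << [token, index - last_starter]
  end
  vector
end

p distance_vector('My new welt design')
# => [["welt", 2], ["design", 3]]
p distance_vector('This new insole technology I designed')
# => [["insole", 2], ["technology", 3], ["designed", 1]]
```

In the second phrase the starter resets at "I", so "designed" sits at distance 1 even though it is the last word of the sentence.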
The Abstract Feed Query:
https://<GoogleAPI>/search?part=snippet&order=date&q=shoe%7Cshoes&relevanceLanguage=en&maxResults=20&fields=items(id,snippet(title,description,channelId,channelTitle,publishedAt,liveBroadcastContent))&type=video&key=<KEY>
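The query above can be assembled from its parameters as follows. This is a sketch: the method name build_feed_query is hypothetical, and the base URL and key passed in the usage line are placeholders for the elided <GoogleAPI> and <KEY> values.

```ruby
require 'uri'

# Builds the feed query string from its parts. The base URL and API key are
# supplied by the caller, because the README leaves both elided.
def build_feed_query(base_url, api_key, max_results: 20)
  params = {
    'part' => 'snippet',
    'order' => 'date',
    'q' => 'shoe|shoes',                 # encoded as shoe%7Cshoes
    'relevanceLanguage' => 'en',
    'maxResults' => max_results.to_s,
    'fields' => 'items(id,snippet(title,description,channelId,channelTitle,' \
                'publishedAt,liveBroadcastContent))',
    'type' => 'video',
    'key' => api_key
  }
  "#{base_url}/search?#{URI.encode_www_form(params)}"
end

# Placeholder values for illustration only.
puts build_feed_query('https://api.example.invalid', 'PLACEHOLDER_KEY')
```

URI.encode_www_form also percent-encodes the parentheses and commas in the fields value, which is an equivalent encoding of the same query.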
Terms Consideration Discussion:
Assess which terms are positive and which are negative.
This impacts scoring. See “terms_score” in Kibana.
+1 for positive terms, -1 for negative terms.
Videos with a score at or below 0 are typically not conducive to the objective.
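The scoring rule above can be sketched as a small function plus a filter. The term lists here are short excerpts for the example, and the tokenizing regex is an assumption about how words are split.

```ruby
POSITIVE_TERMS = %w[welt insole design designed technology new my].freeze
NEGATIVE_TERMS = %w[nike review sale cheap subscribe buy best].freeze

# +1 for every positive term, -1 for every negative term, 0 otherwise.
def terms_score(text)
  text.downcase.scan(/[a-z']+/).sum do |word|
    if POSITIVE_TERMS.include?(word) then 1
    elsif NEGATIVE_TERMS.include?(word) then -1
    else 0
    end
  end
end

titles = ['My new welt design', 'Best cheap Nike sale review']
kept = titles.select { |title| terms_score(title).positive? }
p kept
# => ["My new welt design"]
```

The first title scores 4 and is kept; the second scores -5 and falls at or below the 0 cutoff.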
Vital Considerations:
Nomenclature
Anatomy
Creative Expressions
Uniqueness
In all probability, a video of a unique innovation will have attributes of all four.
---
Nomenclature (in this case of a shoe):
Laces, arch, outsole, insole, vamp, welt, eyelet, leather, material, quarter, tongue, feather, lining, seat, shank,
throat, waist, puff, upper, lower
---
Anatomy (of how the user uses a shoe) i.e. End User considerations:
Ankle, arch*, men, women, toes, calf, legs, comfort*, fit, motion, muscles, length, width, health, foot, feet, round, flat, curve
---
Uniqueness:
Creative expression in the use or form of prepositions and individual and possessive terms:
For, my, mine, I, I am, I’m, you’re, your, man, woman
---
Creative Expressions:
Design, designed, pattern, patent, tech, technology, latest, innovation, idea, create, creation, created, running,
develop, developed, tested, test
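To check whether a title or description touches all four components, one option is a lookup per category. This is a sketch: the lists are excerpts of the ones above, and the method names components_present and all_four? are hypothetical.

```ruby
# Excerpts of the four lists; the full lists would be used in practice.
COMPONENTS = {
  nomenclature: %w[laces outsole insole vamp welt eyelet tongue],
  anatomy: %w[ankle toes calf comfort fit foot feet],
  uniqueness: %w[for my mine i your],
  creative: %w[design designed pattern patent technology innovation idea]
}.freeze

# Names of the components that have at least one term in the text.
def components_present(text)
  tokens = text.downcase.scan(/[a-z']+/)
  COMPONENTS.select { |_name, terms| (terms & tokens).any? }.keys
end

def all_four?(text)
  components_present(text).size == COMPONENTS.size
end

p components_present('My new insole design for ankle comfort')
# => [:nomenclature, :anatomy, :uniqueness, :creative]
p all_four?('Leather welt repair')
# => false
```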
---
What are the components and functions of a shoe?
words = ['laces', 'ankle','arch','foot','feet','wear','design','designed','heel','tested','test',
'outsole','sole','puff','shank','waist','upper','lower','throat','seat','tongue','feather',
'lining','mens','womans','going','completed','muscles','muscle','motion','length','width',
'health','fitness','vamp','welt','insole','eyelet','toe','toes','leather','material','calf',
'leg','pattern','quarter','patent','man','men','woman','women','resist','resistant', 'weather',
'develop','developed','built','comfort','stability', 'grip', 'lacing', 'round', 'flat', 'curve',
'fit', 'tech', 'technology', 'latest', 'lace', 'for', 'running', 'run', 'my', 'new', 'innovation',
'idea', 'create', 'creation', 'created']
What are the negative words that may negate the relevancy of the specific video in question?
example:
negate_words = [ 'nike', 'brooks', 'collection', 'collect', 'jordan', 'curry','lonzo','lebron',
'durant','lavar','subscribe','reebok', 'adidas', 'northface', 'newbalance', 'review', 'sale',
'offer', 'live','cheap','ebay','discount','amazon', 'buy', 'top', 'best', 'how', 'free' ]
How negation works:
We must assess the negative categories that run counter to our objective of finding innovative shoe videos.
In theory this can and should be automated; however, automation was not considered for
this algorithm because of financial and time constraints.
OBJECTIVES:
Remove Reviews, How-to’s
Remove Competitors and their references
Remove sales pitches (free, cheap, ebay, amazon, discount)
Negate Athletes (Jordan, Curry, LeBron)
Negate professional channels (subscribe)
---
Additional things to consider for alerting algorithm:
Frequency:
Uses: Occurrences of any given word
Example: 3 references to Under Armour in Title.
New Term:
Uses: For finding new uses of specified value
Example: Finding New Channels in Fitness data.
Spike:
Uses: Trending Events
Example: Increase in references to “shoe”
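The three alert types can be illustrated with plain Ruby checks. This is a sketch of the ideas only, not the configuration of any alerting tool; the method names, the thresholds, and the spike ratio are assumptions.

```ruby
require 'set'

# Frequency: does `term` occur at least `threshold` times in the text?
def frequency_alert?(text, term, threshold)
  pattern = /\b#{Regexp.escape(term.downcase)}\b/
  text.downcase.scan(pattern).size >= threshold
end

# New term: values in the current batch that have not been seen before.
def new_terms(seen, current)
  current.uniq.reject { |value| seen.include?(value) }
end

# Spike: the current window count is at least `ratio` times the reference count.
def spike?(reference_count, current_count, ratio: 2.0)
  return current_count.positive? if reference_count.zero?

  current_count >= reference_count * ratio
end

title = 'Under Armour shoes, Under Armour socks, Under Armour run'
p frequency_alert?(title, 'Under Armour', 3)                      # => true
p new_terms(Set['ChannelA'], %w[ChannelA ChannelB ChannelB])      # => ["ChannelB"]
p spike?(10, 25)                                                  # => true
```

For the spike check, the reference count would be the number of mentions of “shoe” in an earlier window and the current count the number in the latest window.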
The YouTube API:
Allows you to gather videos as they are uploaded to YouTube or search existing content.
Filter results by topic, keyword, relevance, channel, or user.
Initial Project Proposal and Design:
(pull data from a broad range of fashion topics)
-General (All new)
-Lifestyle
-Fashion
-Fitness
-Technology
-Under Armour Channel
-Competition (Nike, Reebok, Adidas) Channels
**Topics (General Pipeline Parsing, TBD)
Dashboards in Kibana:
-Home dashboard
-Alerting history
-Visualizations
-Competition
-Innovation
-Live Broadcast
Additional considerations for integration:
*Decay function – The distance between key words
*Propagation – Occurrences, duplicates and synonyms
*Coercion - synonyms (spread patterns or numerical analysis into metadata)
Innovation precursors:
A silly easy way to keep your running shoes in top condition.
https://www.youtube.com/watch?v=uawNiGjAcgY
Differences Between Hard and Clay Court Tennis Shoes (w/ Bethanie Mattek-Sands)
https://www.youtube.com/watch?v=MyANkWDXUyQ
Ladies Gym Trainers Walking Trainers Womens Fitness Lightweight Sports Running Shoes
A1nxV3WWFg
Crayola Shoe Studio Unboxing Create Your Own Designer Shoes | Design your Shoes
Dc6RUdG40Fk
Dr Comfort Diabetic Shoes Men's
I_jULmsvLlc
Get inspired - passion and craftmanship for cycling shoes @ Northwave -
“The development of a cycling shoe. Inspired by the needs of the athletes
and realised with a combination of pure craftmanship and high technology.”
3RNkEeiIQJ0