Twitter Stock Big Data Project

This project demonstrates the lambda architecture applied on big data being generated by the twitter firehose and dailying/historical stock information.

This project makes use of a number of big data technologies (Kafka, Spark, Hive, HBase, etc.) and the details on how these are applied are contained in their relevant sub-project folder of this submission.

Breakdown of Project Stucture

Please visit the sub-project folders for more information. They each contain their own README.md on architecture considerations/decisions as well as installation and deployment information.

project root
|
|
|_ big-data
|  |_ batch-ingest (spark, kafka, scala)
|  |_ batch-processing (spark, hive)
|  |_ speed-layer (kafka, hbase, python)
|
|
|_ scripts
|  |_ batch-to-serving (hql, manual workaround)
|  |_ misc-stuff
|
|
|_ web
   |_ backend-web-app (flask, kafka, hbase, python)
   |_ frontend-web-app (client interface, vuejs)

Project Status

Batch Ingest -- completed
Batch Processing -- completed
- Individual views -- completed
- Join views -- completed, not turned on
- ML-Sentiment analysis -- completed
- ML-LinRegression -- not completed
Batch to Serving -- manual workaround (see scripts)
Serving Layer --completed
Speed Layer -- completed
Backend Web -- completed
Frontend Web -- completed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Stock Big Data Project

Breakdown of Project Stucture

Project Status

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
big-data		big-data
scripts		scripts
web		web
.gitignore		.gitignore
README.md		README.md

huarngpa/huarngpa-twitter-stocks-big-data

Folders and files

Latest commit

History

Repository files navigation

Twitter Stock Big Data Project

Breakdown of Project Stucture

Project Status

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages