Skip to content

Latest commit

 

History

History
9 lines (7 loc) · 285 Bytes

README.md

File metadata and controls

9 lines (7 loc) · 285 Bytes

Spark-Text-Clustering

The following project demonstrates how to use LDA Models in Scala in a Spark environment on TF-IDF matrixs of texts, in order to cluster those in different topics.

Requirements

Java (jdk-13.0.1) Scala (scala-sdk-2.12.10) Spark (Spark-3.0.0 and sbt-1.3.10)