Skip to content

lauragalera/Model-PySpark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Model-PySpark

Code to build and evaluate predictive models with a dataset comprising movie reviews using PySpark.

The task consists of classifying movies as positive (1) or negative (0) based on the user's text comment. The data is formed by 2000 examples, 1000 each class, and stored in HDFS. The steps are: feature engineering, model selection and model evaluation. There is also a part of data exploration that has not been included.

About

Predictive Model using PySpark

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages