This notebook is my first attempt at using PySpark for EDA and Machine Learning models.
-
Updated
Jul 22, 2022 - Jupyter Notebook
This notebook is my first attempt at using PySpark for EDA and Machine Learning models.
Build and evaluate classification model using PySpark 3.0.1 library.
Student Performance Tracker
One of the important steps towards People based marketing which is essential for targeting audience
Binary Classification Models with pySpark in Apache Spark
This is the material for Jose Portilla's Spark and Python for Big Data and ML course.
Amazon Review Sentiment Classification with PySpark ML: This project applies NLP and machine learning techniques using PySpark to classify Amazon product reviews as positive or negative. The pipeline includes tokenization, stopword removal, TF-IDF vectorization, chi-square feature selection, and logistic regression modeling.
Comparing Logistic Regression and ML algorithms for predicting defaults on consumer loans.
Add a description, image, and links to the gbt-classification topic page so that developers can more easily learn about it.
To associate your repository with the gbt-classification topic, visit your repo's landing page and select "manage topics."