A Comprehensive Guide For Text Classification using PySpark MLlib

Himanshu Tripathi
Towards AI
Published in
6 min readAug 24, 2022

--

Introduction

Have you ever wondered if you ever post something on social media websites that goes against their community standard, how can they identify it and perform appropriate action on that?

Well, the idea behind this is called Classification, whether it’s text classification, image classification, video classification, or audio classification. Still, the concept stays the same; we’re trying to classify relevant and irrelevant content.

--

--

NLP || Machine Learning || Deep Learning || Data Science || Web Developer || Android Developer (UI) ||