The below paper is the final term project for the fall 2024 CIS 9665 - Applied Natural Language Processing course at Baruch College.
Using a dataset of categorized Amazon reviews hosted by the University of California - San Diego
Rady School of Management, our team created a binary classification model to predict customer review verification status.
Given the increasing importance of online marketplaces and the value of customer trust,
we sought to create a solution with a wide range of applications.
Implementing several natural language processing techniques such as sentiment analysis as well as classifying
review phrasing and -contents, we built a Random Forest classification achieving an accuracy of 86%. Since the model was created on a subset of the data, the preprocessing steps and model
creation can be easily reproduced and applied in academic and commercial settings.
Team member names have been redacted for privacy considerations.