A dataset was created from live web data collected from Reddit.com. Using machine learning together with the Python requests library, multiple models were built and tested against the project scopes. A logistic regression model showed that whether a post would meet the target number of comments could be predicted with 73% accuracy. This fulfilled scope #1.
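As a rough illustration of the dataset step, the sketch below turns a Reddit listing payload into rows with a binary comment label. It is not the project's actual code: the function name `listing_to_rows`, the `comment_target` value, the `>=` comparison, and the sample payload are all assumptions made for the example. In practice the payload would come from something like `requests.get(url, headers={"User-Agent": "..."}).json()`.

```python
from typing import Any


def listing_to_rows(listing: dict[str, Any], comment_target: int) -> list[dict[str, Any]]:
    """Flatten a Reddit listing payload into one row per post.

    `comment_target` is a hypothetical threshold; the real target value
    is not stated in this summary.
    """
    rows = []
    for child in listing["data"]["children"]:
        post = child["data"]
        rows.append(
            {
                "title": post["title"],
                "subreddit": post["subreddit"],
                "num_comments": post["num_comments"],
                "score": post["score"],
                # 1 if the post meets the target number of comments, else 0
                "meets_comment_target": int(post["num_comments"] >= comment_target),
            }
        )
    return rows


# Invented sample shaped like a Reddit listing response (not real data).
sample_listing = {
    "kind": "Listing",
    "data": {
        "children": [
            {"kind": "t3", "data": {"title": "First post", "subreddit": "python",
                                    "num_comments": 120, "score": 900}},
            {"kind": "t3", "data": {"title": "Second post", "subreddit": "news",
                                    "num_comments": 3, "score": 15}},
        ]
    },
}

rows = listing_to_rows(sample_listing, comment_target=50)
```

The resulting rows can be loaded into a table and reused for both the comment model and the score model.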
Using the same data, a logistic regression model was then used to predict whether a post's score would exceed the target. This model reached 95.4% accuracy in predicting whether a post's score exceeded the target, based on the subreddit and title words.
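A model of this kind, using subreddit and title words as features, could be set up roughly as follows. This is a minimal sketch assuming scikit-learn and pandas; the summary does not say which libraries or feature encodings were actually used. The toy posts and labels are invented, and the column name `high_score` is hypothetical.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Invented toy data; `high_score` is 1 when the score exceeds the target.
posts = pd.DataFrame(
    {
        "title": [
            "Cute cat learns a new trick",
            "Question about tax forms",
            "Cute dog meets cute cat",
            "Help with my tax return",
            "Cat video compilation",
            "Tax deadline reminder",
        ],
        "subreddit": ["aww", "personalfinance", "aww",
                      "personalfinance", "aww", "personalfinance"],
        "high_score": [1, 0, 1, 0, 1, 0],
    }
)

# Bag-of-words counts for the title, one-hot encoding for the subreddit.
features = ColumnTransformer(
    [
        ("title_words", CountVectorizer(), "title"),
        ("subreddit", OneHotEncoder(handle_unknown="ignore"), ["subreddit"]),
    ]
)

model = Pipeline(
    [
        ("features", features),
        ("clf", LogisticRegression(max_iter=1000)),
    ]
)
model.fit(posts[["title", "subreddit"]], posts["high_score"])

new_posts = pd.DataFrame(
    {
        "title": ["Another cute cat", "Filing tax forms late"],
        "subreddit": ["aww", "personalfinance"],
    }
)
predictions = model.predict(new_posts)
```

On real data, accuracy would be measured on a held-out test split rather than on the training rows, and compared against the majority-class baseline to put a figure such as 95.4% in context.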