Developed a program that determines whether an individual can be associated with a given internet sub-community based on their messages. While the focus of this project was the "gamers" sub-community, it is applicable to any group with an online presence.
Data was collected from five "gaming" subreddits and five "normal" subreddits and compared in order to create a database of "gamer" words.
Messages were also collected from various users and a few stats were calculated: z-score for language similarity to the "normal words," z-score for closeness to the "gamer words," and the ratio of gamer words to all words in a user's messages. Based on the calculated scores, users were either considered part of the "gamers" sub-community or not.
Team: Aditi Vinod & Luke Witten