A Machine Learning Classification Model for Predicting Depression Using Social Media Posts of User
DOI:
https://doi.org/10.31305/rrijm.2024.v09.n02.025Keywords:
Depression, sentiment analysis, social media, natural language processing, machine learning, Depressive PostsAbstract
Depression is a prevalent mental health disorder, with social media platforms serving as a rich source of data for identifying and understanding depressive symptoms. This paper presents a machine learning stratification model designed to predict depression using social media data collected from Twitter, Facebook, and Reddit. The model incorporates sentiment analysis, keyword labeling, and stratification techniques to accurately categorize depressive posts. The research evaluates the model's performance using various metrics and compares it against traditional methods for identifying depressive content.
References
Asad, Nafiz Al, et al. "Detecting Depression in Social Media Posts Using Negative Sentiment and Social Network Information." 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2021, pp. 1522-1527. Institute of Electrical and Electronics Engineers (IEEE). https://ieeexplore.ieee.org/document/9065101
Al-Saggaf, N., et al. "Identifying Users with Depression in Social Networks Using Data Mining and Machine Learning Techniques." 2017 IEEE International Conference on Smart Cloud (SmartCloud), pp. 205-210, doi: 10.1109/SmartCloud.2017.83
P. V. Narayanrao & P. Lalitha Surya Kumari. (2020). Analysis of Machine Learning Algorithms for Predicting Depression. In 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA) (pp. 1-4). Institute of Electrical and Electronics Engineers (IEEE). doi: 10.1109/ICCSEA49143.2020.9132963
M. M. Aldarwish and H. F. Ahmad, "Predicting Depression Levels Using Social Media Posts," 2017 IEEE 13th International Symposium on Autonomous Decentralized System (ISADS), Bangkok, Thailand, 2017, pp. 277-280, doi: 10.1109/ISADS.2017.41
M. R. Islam, A. R. M. Kamal, N. Sultana, R. Islam, M. A. Moni and A. ulhaq, "Detecting Depression Using K-Nearest Neighbors (KNN) Classification
Technique," 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), Rajshahi, Bangladesh, 2018, pp. 1-4, doi: 10.1109/IC4ME2.2018.8465641.
A. U. Hassan, J. Hussain, M. Hussain, M. Sadiq and S. Lee, "Sentiment analysis of social networking sites (SNS) data using machine learning approach for the measurement of depression," 2017 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Korea (South), 2017, pp. 138- 140, doi: 10.1109/ICTC.2017.8190959.
L. Ma and Y. Wang, "Constructing a Semantic Graph with Depression Symptoms Extraction from Twitter," 2019 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), Siena, Italy, 2019, pp. 1-5, doi: 10.1109/CIBCB.2019.8791452.
X. Wang, C. Zhang and L. Sun, "An Improved Model for Depression Detection in Micro-Blog Social Network," 2013 IEEE 13th International Conference on Data Mining Workshops, Dallas, TX, USA, 2013, pp. 80-87, doi: 10.1109/ICDMW.2013.132.
Tao, X., Dharmalingam, R., Zhang, J., Zhou, X., Li, L., & Gururajan, R. (2019). Twitter analysis for depression on social networks based on sentiment and stress. In Proceedings of the 6th International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC 2019) (pp. 8963550). Institute of Electrical and Electronics Engineers (IEEE).doi: 10.1109/BESC48373.2019.8963550
P. Karthika, R. Murugeswari and R. Manoranjithem, "Sentiment Analysis of Social Media Network Using Random Forest Algorithm," 2019 IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing (INCOS), Tamilnadu, India, 2019, pp. 1-5, doi: 10.1109/INCOS45849.2019.8951367.
A. Ramachandran, A. Gadwe, D. Poddar, S. Satavalekar and S. Sahu, "Performance Evaluation of Different Machine Learning Techniques using Twitter Data for Identification of Suicidal Intent," 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2020, pp. 223-227, doi: 10.1109/ICESC48915.2020.9155747.
Y. Jia, Y. Liang and T. Zhu, "A linguistic representation scheme for depression prediction - with a case study," 2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Cebu, Philippines, 2019, pp. 1-5, doi: 10.1109/O-COCOSDA46868.2019.9060849.
Rathi, Megha & Malik, Aditya & Varshney, Daksh & Sharma, Rachita & Mendiratta, Sarthak. (2018). Sentiment Analysis of Tweets Using Machine Learning Approach. 1-3. 10.1109/IC3.2018.8530517.
A. Shrestha and F. Spezzano, "Detecting Depressed Users in Online Forums," 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Vancouver, BC, Canada, 2019, pp. 945-951, doi: 10.1145/3341161.3343511.
S. Jain, S. P. Narayan, R. K. Dewang, U. Bhartiya, N. Meena and V. Kumar, "A Machine Learning based Depression Analysis and Suicidal Ideation Detection System using Questionnaires and Twitter," 2019 IEEE Students Conference on Engineering and Systems (SCES), Allahabad, India, 2019, pp. 1-6, doi: 10.1109/SCES46477.2019.8977211.
Saini, "Anuj@IEEE BigData 2019: A Novel Code-Switching Behavior Analysis in Social Media Discussions Natural Language Processing," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 5957-5961, doi: 10.1109/BigData47090.2019.9005961.
Li G, Li B, Huang L, Hou S. Automatic Construction of a Depression-Domain Lexicon Based on Microblogs: Text Mining Study. JMIR Med Inform. 2020 Jun 23;8(6):e17650. doi: 10.2196/17650. PMID: 32574151; PMCID: PMC7381008.
Nadeem, Moin. (2016). Identifying Depression on Twitter.
Q. Hu, A. Li, F. Heng, J. Li and T. Zhu, "Predicting Depression of Social Media User on Different Observation Windows," 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), Singapore, 2015, pp. 361-364, doi: 10.1109/WI-IAT.2015.166.
Q. Cong, Z. Feng, F. Li, Y. Xiang, G. Rao and C. Tao, "X-A-BiLSTM: a Deep Learning Approach for Depression Detection in Imbalanced Data," 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Madrid, Spain, 2018, pp. 1624-1627, doi: 10.1109/BIBM.2018.8621230.
Islam MR, Kabir MA, Ahmed A, Kamal ARM, Wang H, Ulhaq A. Depression detection from social network data using machine learning techniques. Health Inf Sci Syst. 2018 Aug 27;6(1):8. doi: 10.1007/s13755-018-0046-0. PMID: 30186594; PMCID: PMC6111060.
M. L. Tlachac and E. A. Rundensteiner, "Depression Screening from Text Message Reply Latency," 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 2020, pp. 5490-5493, doi: 10.1109/EMBC44109.2020.9175690.
Nadeem, Moin. (2016). Identifying Depression on Twitter.
Q. Hu, A. Li, F. Heng, J. Li and T. Zhu, "Predicting Depression of Social Media User on Different Observation Windows," 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), Singapore, 2015, pp. 361-364, doi: 10.1109/WI-IAT.2015.166.
Q. Cong, Z. Feng, F. Li, Y. Xiang, G. Rao and C. Tao, "X-A-BiLSTM: a Deep Learning Approach for Depression Detection in Imbalanced Data," 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Madrid, Spain, 2018, pp. 1624-1627, doi: 10.1109/BIBM.2018.8621230.
Islam MR, Kabir MA, Ahmed A, Kamal ARM, Wang H, Ulhaq A. Depression detection from social network data using machine learning techniques. Health Inf Sci Syst. 2018 Aug 27;6(1):8. doi: 10.1007/s13755-018-0046-0. PMID: 30186594; PMCID: PMC6111060.
M. L. Tlachac and E. A. Rundensteiner, "Depression Screening from Text Message Reply Latency," 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 2020, pp. 5490-5493, doi: 10.1109/EMBC44109.2020.9175690.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
This is an open access article under the CC BY-NC-ND license Creative Commons Attribution-Noncommercial 4.0 International (CC BY-NC 4.0).