ALTRUIST: A Python Package to Emulate a Virtual Digital Cohort Study Using Social Media Data | IEEE Journals & Magazine | IEEE Xplore

ALTRUIST: A Python Package to Emulate a Virtual Digital Cohort Study Using Social Media Data


Abstract:

Epidemiological cohort studies play a crucial role in identifying risk factors for various outcomes among participants. These studies are often time-consuming and costly ...Show More

Abstract:

Epidemiological cohort studies play a crucial role in identifying risk factors for various outcomes among participants. These studies are often time-consuming and costly due to recruitment and long-term follow-up. Social media (SM) data has emerged as a valuable complementary source for digital epidemiology and health research, as online communities of patients regularly share information about their illnesses. Unlike traditional clinical questionnaires, SM offer unstructured but insightful information about patients’ disease burden. Yet, there is limited guidance on analyzing SM data as a prospective cohort. We presented the concept of virtual digital cohort studies (VDCS) as an approach to replicate cohort studies using SM data. In this paper, we introduce ALTRUIST, an open-source Python package enabling standardized generation of VDCS on SM. ALTRUIST facilitates data collection, preprocessing, and analysis steps that mimic a traditional cohort study. We provide a practical use case focusing on diabetes to illustrate the methodology. By leveraging SM data, which offers large-scale and cost-effective information on users’ health, we demonstrate the potential of VDCS as an essential tool for specific research questions. ALTRUIST is customizable and can be applied to data from various online communities of patients, complementing traditional epidemiological methods and promoting minimally disruptive health research.
Published in: IEEE Transactions on Big Data ( Volume: 10, Issue: 4, August 2024)
Page(s): 568 - 575
Date of Publication: 05 February 2024

ISSN Information:

Funding Agency:


References

References is not available for this document.