Depression Intensity Classification from Tweets Using FastText Based Weighted Soft Voting Ensemble
Artículo
Materias > Ingeniería
Materias > Psicología
Universidad Europea del Atlántico > Investigación > Producción Científica
Fundación Universitaria Internacional de Colombia > Investigación > Artículos y libros
Universidad Internacional Iberoamericana México > Investigación > Producción Científica
Universidad Internacional Iberoamericana Puerto Rico > Investigación > Producción Científica
Universidad Internacional do Cuanza > Investigación > Producción Científica
Abierto
Inglés
Predicting depression intensity from microblogs and social media posts has numerous benefits and applications, including predicting early psychological disorders and stress in individuals or the general public. A major challenge in predicting depression using social media posts is that the existing studies do not focus on predicting the intensity of depression in social media texts but rather only perform the binary classification of depression and moreover noisy data makes it difficult to predict the true depression in the social media text. This study intends to begin by collecting relevant Tweets and generating a corpus of 210000 public tweets using Twitter public application programming interfaces (APIs). A strategy is devised to filter out only depression-related tweets by creating a list of relevant hashtags to reduce noise in the corpus. Furthermore, an algorithm is developed to annotate the data into three depression classes: ‘Mild,’ ‘Moderate,’ and ‘Severe,’ based on International Classification of Diseases-10 (ICD-10) depression diagnostic criteria. Different baseline classifiers are applied to the annotated dataset to get a preliminary idea of classification performance on the corpus. Further FastText-based model is applied and fine-tuned with different preprocessing techniques and hyperparameter tuning to produce the tuned model, which significantly increases the depression classification performance to an 84% F1 score and 90% accuracy compared to baselines. Finally, a FastText-based weighted soft voting ensemble (WSVE) is proposed to boost the model’s performance by combining several other classifiers and assigning weights to individual models according to their individual performances. The proposed WSVE outperformed all baselines as well as FastText alone, with an F1 of 89%, 5% higher than FastText alone, and an accuracy of 93%, 3% higher than FastText alone. The proposed model better captures the contextual features of the relatively small sample class and aids in the detection of early depression intensity prediction from tweets with impactful performances.
metadata
Rizwan, Muhammad; Mushtaq, Muhammad Faheem; Rafiq, Maryam; Mehmood, Arif; Diez, Isabel de la Torre; Gracia Villar, Mónica; Garay, Helena y Ashraf, Imran
mail
SIN ESPECIFICAR, SIN ESPECIFICAR, SIN ESPECIFICAR, SIN ESPECIFICAR, SIN ESPECIFICAR, monica.gracia@uneatlantico.es, helena.garay@uneatlantico.es, SIN ESPECIFICAR
(2024)
Depression Intensity Classification from Tweets Using FastText Based Weighted Soft Voting Ensemble.
Computers, Materials & Continua, 78 (2).
pp. 2047-2066.
ISSN 1546-2226
|
Texto
TSP_CMC_37347.pdf Available under License Creative Commons Attribution. Descargar (861kB) |
Resumen
Predicting depression intensity from microblogs and social media posts has numerous benefits and applications, including predicting early psychological disorders and stress in individuals or the general public. A major challenge in predicting depression using social media posts is that the existing studies do not focus on predicting the intensity of depression in social media texts but rather only perform the binary classification of depression and moreover noisy data makes it difficult to predict the true depression in the social media text. This study intends to begin by collecting relevant Tweets and generating a corpus of 210000 public tweets using Twitter public application programming interfaces (APIs). A strategy is devised to filter out only depression-related tweets by creating a list of relevant hashtags to reduce noise in the corpus. Furthermore, an algorithm is developed to annotate the data into three depression classes: ‘Mild,’ ‘Moderate,’ and ‘Severe,’ based on International Classification of Diseases-10 (ICD-10) depression diagnostic criteria. Different baseline classifiers are applied to the annotated dataset to get a preliminary idea of classification performance on the corpus. Further FastText-based model is applied and fine-tuned with different preprocessing techniques and hyperparameter tuning to produce the tuned model, which significantly increases the depression classification performance to an 84% F1 score and 90% accuracy compared to baselines. Finally, a FastText-based weighted soft voting ensemble (WSVE) is proposed to boost the model’s performance by combining several other classifiers and assigning weights to individual models according to their individual performances. The proposed WSVE outperformed all baselines as well as FastText alone, with an F1 of 89%, 5% higher than FastText alone, and an accuracy of 93%, 3% higher than FastText alone. The proposed model better captures the contextual features of the relatively small sample class and aids in the detection of early depression intensity prediction from tweets with impactful performances.
| Tipo de Documento: | Artículo |
|---|---|
| Palabras Clave: | Depression classification; deep learning; FastText; machine learning |
| Clasificación temática: | Materias > Ingeniería Materias > Psicología |
| Divisiones: | Universidad Europea del Atlántico > Investigación > Producción Científica Fundación Universitaria Internacional de Colombia > Investigación > Artículos y libros Universidad Internacional Iberoamericana México > Investigación > Producción Científica Universidad Internacional Iberoamericana Puerto Rico > Investigación > Producción Científica Universidad Internacional do Cuanza > Investigación > Producción Científica |
| Depositado: | 14 Mar 2024 23:30 |
| Ultima Modificación: | 14 Mar 2024 23:30 |
| URI: | https://repositorio.unincol.edu.co/id/eprint/11264 |
Acciones (logins necesarios)
![]() |
Ver Objeto |
<a href="/28577/1/PIIS0002944026001367.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
An Integrated Machine Learning and Genomic Framework for Precise Detection of Gastric Cancer
This study presents a novel integrative approach for the analysis of high-dimensional gene expression data, leveraging the complementary strengths of unsupervised clustering and supervised classification. Using K-means clustering, the dataset is stratified into three distinct clusters, revealing intrinsic biological patterns and relationships. The resulting cluster assignments are subsequently employed as pseudo-labels to train machine learning models, including support vector machines, random forest, and a stacking ensemble classifier. To validate and enhance the robustness of clustering, complementary methodologies such as hierarchical clustering and DBSCAN are employed, with results visualized through PCA-driven dimensionality reduction. The high predictive accuracy achieved by the classifiers underscores the separability and reliability of the identified clusters. Furthermore, feature importance analysis highlighted key genetic determinants within each cluster, offering actionable insights into potential biomarkers and critical genomic features. This framework bridges the gap between exploratory unsupervised learning and predictive supervised modeling, providing a scalable and interpretable methodology for analyzing complex genomic datasets. Its applicability extends to biomarker discovery, patient stratification, and other precision medicine applications, emphasizing its utility in advancing genomic research and clinical practice.
Eshmal Iman mail , Sohail Jabbar mail , Shabana Ramzan mail , Ali Raza mail , Farwa Raoof mail , Stefanía Carvajal-Altamiranda mail stefania.carvajal@uneatlantico.es, Vivian Lipari mail vivian.lipari@uneatlantico.es, Imran Ashraf mail ,
Iman
<a href="/28319/1/s41598-026-45575-1_reference.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
A novel approach for disease and pests detection in potato production system based on deep learning
Vulnerability of potato crops to diseases and pest infestation can affect its quality and lead to significant yield losses. Timely detection of such diseases can help take effective decisions. For this purpose, a deep learning-based object detection framework is designed in this study to identify and classify major potato diseases and pests under real-world field conditions. A total of 2,688 field images were collected from two research farms in Punjab, Pakistan, across multiple growth stages in various seasonal conditions. Excluding 285 symptoms-free images from the earliest collection led to 2,403 images which were annotated into four biotic-stress classes: blight disease (n = 630), leaf spot disease (n = 370), leafroll virus (viral symptom complex; n = 888), and Colorado potato beetle (larvae/adults; n = 515), indicating class imbalance. Several state-of-the-art models were used including YOLOv8 variants (n/s/m), YOLOv7, YOLOv5, and Faster R-CNN, and the results are discussed in relation to recent potato disease classification studies involving cropped leaf images. Stratified splitting (70% training, 20% validation, 10% testing) was applied to preserve class distribution across all subsets. YOLOv8-medium achieve the best performance with mean average precision (mAP)@0.5 of 98% on the held-out test images. Results for stable 5-fold cross-validation show a mean mAP@0.5 of 97.8%, which offers a balance between accuracy and inference time. Model robustness was evaluated using 5-fold cross-validation and repeated training with different random seeds, showing a low variance of ±0.4% mAP. Results demonstrate promising outcomes under the real-world field conditions, while, broader cross-region and cross-season validation is intended for the future.
Ahmed Abbas mail , Saif Ur Rehman mail , Khalid Mahmood mail , Santos Gracia Villar mail santos.gracia@uneatlantico.es, Luis Alonso Dzul López mail luis.dzul@uneatlantico.es, Aseel Smerat mail , Imran Ashraf mail ,
Abbas
<a class="ep_document_link" href="/28323/1/s40520-026-03363-x_reference.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Fish consumption and brain structure: a comprehensive systematic review of observational studies
Background Age-related structural changes in the human brain, including cortical atrophy, reductions in grey and white matter volumes, and the accumulation of small vessel–related lesions such as white matter hyperintensities (WMH) and cerebral microbleeds, represent critical biological substrates underlying cognitive decline and dementia. Fish consumption has been associated with slower cognitive decline and reduced risk of dementia, but a comprehensive evaluation of its relation with brain structures is lacking. Aims The aim of this study was to systematically review current scientific literature providing evidence of relation between fish intake and brain structures in human studies. Methods Studies indexed in two major electronic databases have been screened based on a combination of keywords and MeSH terms. Studies were eligible whether they assessed fish consumption in relation to brain structures in the adult populations. Results A total of 24 studies conducted predominantly on older adults met inclusion criteria. Most brain volume measures were obtained via magnetic resonance imaging (MRI) procedures. Higher fish consumption was associated with reduced severity of white matter hyperintensities (a biomarker of cerebral small vessel disease and white matter damage) and cerebral micro-bleed, preservation of certain brain areas volumes (i.e., hippocampus, temporal lobe and periventricle white matter) and cortical thickness of specific areas (i.e., precuneus, parietal, and cingulate grey matter), among others, compared to lower intake. Some analyses found no association and isolated findings suggested possible adverse associations that were not consistently replicated. Studies reporting null findings may underline the possible relevance of the overall diet (i.e., adherence to the Mediterranean diet). Conclusions Inclusion of fish in a healthy and balanced diet is associated with better white matter grades on MRI and slower progression of white matter hyperintensities and reduction of vascular-related lesions of the aging brain, suggesting a potential role in preventing neurocognitive deterioration. Heterogeneity across studies underscores the need for additional studies.
Justyna Godos mail , Giuseppe Caruso mail , Agnieszka Micek mail , Alberto Dolci mail , Zoltan Ungvari mail , Andrea Lehoczki mail , Lisandra León Brizuela mail , Evelyn Frias-Toral mail , Andrea Di Mauro mail , Mario Siervo mail , Michelino Di Rosa mail , Giuseppe Grosso mail ,
Godos
<a class="ep_document_link" href="/28563/1/fnut-13-1809163.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Background: Mild Cognitive Impairment (MCI) is viewed as a transitional stage between normal brain aging and dementia and is characterized by subtle cognitive deficits without significant impairment in daily functioning. Growing evidence supports the contribution of neuroinflammation and modifiable lifestyle factors, including diet, in the progression of cognitive decline.Aim: This study aimed to investigate the association between adherence to the Mediterranean diet, neuroinflammatory biomarkers, and MCI status in older adults.Design: Ninety-two participants were enrolled in this cross-sectional study, including 37 subjects with MCI. Dietary intake was assessed using a validated food frequency questionnaire (FFQ) and adherence to the Mediterranean diet explored through the MedDietScore. Plasma levels of TGF-β1 and TNF-α were measured by ELISA. Cognitive status was evaluated using the Mini Mental Examination (MMSE) and the Montreal Cognitive Assessment (MoCA), both adjusted for age and education. Statistical analyses included non-parametric tests, correlation analysis, and logistic regression models.Results: MCI patients showed significantly reduced plasma levels of TGF-β1 and increased TNF-α concentrations compared to other participants. After adjustment for potential confounding factors, greater adherence to the Mediterranean diet was associated with a lower likelihood of MCI in a dose–response manner (highest versus lowest adherence quartile, odds ratio: 0.07, 95% confidence interval: 0.01–0.60). Additional adjustment for inflammatory biomarkers attenuated the associations, suggesting a potential mediating role.Conclusion: Our findings showed that higher adherence to Mediterranean diet is associated with lower likelihood of being MCI. Such a relation might be, at least in part, mediated by inflammatory biomarkers. Overall, these results support the role of dietary modulation in preventive strategies against cognitive decline and progression into MCI.
Margherita Grasso mail , Francesca L’Episcopo mail , Annamaria Fidilio mail , Marco Antonio Olvera-Moreira mail , Giuseppe Toscano mail , Stefano Muratore mail , Margherita Drago mail , Sabrina Musso mail , Veronica Bentivegna mail , Lucrezia Costanzo mail , Melannie Toral-Noristz mail , Raynier Zambrano-Villacres mail , Lisandra León Brizuela mail , Giuseppe Lanza mail , Raffaele Ferri mail , Filippo Caraci mail ,
Grasso
<a class="ep_document_link" href="/28572/1/s41598-026-47906-8.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Correction: Enhancing fault detection in new energy vehicles via novel ensemble approach
In the original version of this Article, Umair Shahid was incorrectly listed as a corresponding author. The correct corresponding authors for this Article are Imran Ashraf and Kashif Munir. Correspondence and request for materials should be addressed to ashrafimran@live.com and kashif.munir@kfueit.edu.pk.
Iqra Akhtar mail , Mahnoor Nabeel mail , Umair Shahid mail , Kashif Munir mail , Ali Raza mail , Irene Delgado Noya mail irene.delgado@uneatlantico.es, Santos Gracia Villar mail santos.gracia@uneatlantico.es, Imran Ashraf mail ,
Akhtar
