Published on in Vol 12 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/63515, first published .
Evaluation of a Guided Chatbot Intervention for Young People in Jordan: Feasibility Randomized Controlled Trial

Evaluation of a Guided Chatbot Intervention for Young People in Jordan: Feasibility Randomized Controlled Trial

Evaluation of a Guided Chatbot Intervention for Young People in Jordan: Feasibility Randomized Controlled Trial

Original Paper

1Department of Mental Health, Brain Health and Substance Use, World Health Organization, Geneva, Switzerland

2Institute for Family Health, King Hussein Foundation, Amman, Jordan

3School of Psychology, University of New South Wales, Sydney, Australia

4Department of Clinical Neuroscience, Division of Insurance Medicine, Karolinska Institutet, Stockholm, Sweden

5Community Health Nursing Department, School of Nursing, University of Jordan, Amman, Jordan

*these authors contributed equally

Corresponding Author:

Anne Marijn de Graaff, MSc

Department of Mental Health, Brain Health and Substance Use

World Health Organization

Av. Appia 20

Geneva, 1211

Switzerland

Phone: 41 792493513

Email: degraaffa@who.int


Background: Depression and anxiety are a leading cause of disability worldwide and often start during adolescence and young adulthood. The majority of young people live in low- and middle-income countries where there is a lack of mental health services. The World Health Organization (WHO) developed a guided, nonartificial intelligence chatbot intervention called Scalable Technology for Adolescents and youth to Reduce Stress (STARS) to reduce symptoms of depression and anxiety among young people affected by adversity.

Objective: The objective of this study was to evaluate the feasibility of the STARS intervention and study procedures among young people in Jordan.

Methods: A 2-arm, single-blind, feasibility randomized controlled trial was conducted among 60 young people aged 18 years to 21 years living in Jordan with self-reported elevated levels of psychological distress. Immediately after baseline, participants were randomized 1:1 into the STARS intervention or enhanced care as usual (ECAU). STARS consisted of 10 lessons in which participants interacted with a chatbot and learned several cognitive behavioral therapy strategies, with optional guidance by a trained e-helper through 5 weekly phone calls. ECAU consisted of a static web page providing basic psychoeducation. Online questionnaires were administered at baseline (week 0) and postassessment (week 8) to assess depression (Hopkins Symptom Checklist-25 [HSCL-25]), anxiety (HSCL-25), functional impairment (WHO Disability Assessment Schedule [WHODAS] 2.0), psychological well-being (WHO-Five Well-Being Index [WHO-5]), and agency (State Hope Scale). Process evaluation interviews with stakeholders were conducted after the postassessment.

Results: Participants were recruited in December 2022 and January 2023. Of 700 screening website visits, 160 participants were eligible, and 60 participants (mean age 19.7, SD 1.16 years; 49/60, 82% female) continued to baseline and were randomized into STARS (n=30) or ECAU (n=30). Of those who received STARS, 37% (11/30) completed at least 8 chatbot lessons, and 13% (4/30) completed all 5 support calls. The research protocol functioned well in terms of balanced randomization, high retention at postassessment (48/60, 80%), and good psychometric properties of the online questionnaires. Process evaluation interviews with STARS participants, ECAU participants, e-helpers, and the clinical supervisor indicated the acceptability of the study procedures and the STARS and ECAU conditions and highlighted several aspects that could be improved, including the e-helper support and features of the STARS chatbot.

Conclusions: This study demonstrated the feasibility and acceptability of the STARS intervention and research procedures. A fully powered, definitive randomized controlled trial will be conducted to evaluate the effectiveness of STARS.

Trial Registration: ISRCTN ISRCTN19217696; https://doi.org/10.1186/ISRCTN19217696

JMIR Ment Health 2025;12:e63515

doi:10.2196/63515

Keywords



Globally, 1 in 8 people live with a mental disorder, and among adolescents, the rate is 1 in 7 [GBD Results Tool. Institute for Health Metrics and Evaluation. URL: https://vizhub.healthdata.org/gbd-results/ [accessed 2024-03-23] 1]. Anxiety and depression are the most common mental disorders, affecting 5.5% and 3.5%, respectively, of young people aged 15 years to 19 years globally in 2021 [GBD Results Tool. Institute for Health Metrics and Evaluation. URL: https://vizhub.healthdata.org/gbd-results/ [accessed 2024-03-23] 1]. The majority of young people live in low- and middle-income countries (LMIC) where services, especially for young people, are often scarce due to a lack of government policy on mental health, funding, and trained mental health professionals [McGorry PD, Mei C, Dalal N, Alvarez-Jimenez M, Blakemore S, Browne V, et al. The Lancet Psychiatry Commission on youth mental health. Lancet Psychiatry. Sep 2024;11(9):731-774. [FREE Full text] [CrossRef] [Medline]2]. Additionally, there is a paucity of research on psychological interventions for young people in LMIC [Uppendahl JR, Alozkan-Sever C, Cuijpers P, de Vries R, Sijbrandij M. Psychological and psychosocial interventions for PTSD, depression and anxiety among children and adolescents in low- and middle-income countries: a meta-analysis. Front Psychiatry. 2019;10:933. [FREE Full text] [CrossRef] [Medline]3-Alozkan-Sever C, Uppendahl J, Cuijpers P, de Vries R, Rahman A, Mittendorfer-Rutz E, et al. Research Review: Psychological and psychosocial interventions for children and adolescents with depression, anxiety, and post-traumatic stress disorder in low- and middle-income countries - a systematic review and meta-analysis. J Child Psychol Psychiatry. Dec 2023;64(12):1776-1788. [FREE Full text] [CrossRef] [Medline]5].

Digital mental health or e–mental health shows promise as a strategy for reducing symptoms of common mental health conditions. A recent meta-analysis showed that, among adults and young people, mental health smartphone apps have small but significant effects on reducing depression and anxiety, with larger effect sizes when cognitive behavioral therapy strategies, chatbot technology (ie, a computer simulating a human conversation), or mood monitoring are included [Linardon J, Torous J, Firth J, Cuijpers P, Messer M, Fuller-Tyszkiewicz M. Current evidence on the efficacy of mental health smartphone apps for symptoms of depression and anxiety. A meta-analysis of 176 randomized controlled trials. World Psychiatry. Feb 2024;23(1):139-149. [FREE Full text] [CrossRef] [Medline]6]. In addition, digital interventions for young people (12-25 years) were found to be mostly effective when guided (in which a client is provided motivation and support by a therapist or researcher) [Garrido S, Millington C, Cheers D, Boydell K, Schubert E, Meade T, et al. What works and what doesn't work? A systematic review of digital mental health interventions for depression and anxiety in young people. Front Psychiatry. 2019;10:759. [FREE Full text] [CrossRef] [Medline]7]. Among the advantages of digital interventions are reduced reliance on human resources, greater access and availability, and the potential to overcome concerns about stigmatization. Among the challenges are poor engagement and adherence [Karyotaki E, Kleiboer A, Smit F, Turner D, Pastor A, Andersson G, et al. Predictors of treatment dropout in self-guided web-based interventions for depression: an ‘individual patient data’ meta-analysis. Psychol. Med. Apr 17, 2015;45(13):2717-2726. [FREE Full text] [CrossRef]8].

Research on chatbots that provide psychological interventions has grown significantly. The majority of chatbots included in a 2019 review investigated rule-based chatbots (ie, depending on decision trees), whereas few investigated artificial intelligence (AI)–based chatbots (ie, reliant on machine learning). Although AI-based chatbots better mimic natural conversations, can apply therapeutic techniques dynamically, and learn over time, they are a new technology, and possible risks and challenges with large language models are currently unclear [Opel DJ, Kious B, Cohen I. AI as a mental health therapist for adolescents. JAMA Pediatr. Dec 01, 2023;177(12):1253-1254. [FREE Full text] [CrossRef] [Medline]9]. Rule-based chatbots operate with predefined scripts and pathways and are therefore predictable and allow for standardization of interventions. The World Health Organization (WHO) has developed, tested, and released low-intensity or potentially scalable psychological interventions [Scalable psychological interventions for people in communities affected by adversity: a new area of mental health and psychosocial work at WHO. Geneva. World Health Organization; 2017. URL: https://iris.who.int/handle/10665/25458110,Scalable psychological interventions for people in communities affected by adversity: a new area of mental healthpsychosocial work at WHO. World Health Organization. 2017. URL: https://iris.who.int/bitstream/handle/10665/254581/WHO-MSD-MER-17.1-eng.pdf [accessed 2025-01-08] 11]. This includes digital interventions to reach a larger proportion of the population, due in part to the majority of the global population (66%) now having access to the internet [Petrosyan A. Worldwide digital population 2024. Statista. 2024. URL: https://www.statista.com/statistics/617136/digital-population-worldwide/ [accessed 2024-02-23] 12]. Previous studies have confirmed the effectiveness of a WHO digital intervention that uses behavioral activation for adults, including Syrian refugees, with depression [Cuijpers P, Heim E, Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Guided digital health intervention for depression in Lebanon: randomised trial. Evid Based Ment Health. Dec 2022;25(e1):e34-e40. [FREE Full text] [CrossRef] [Medline]13,Cuijpers P, Heim E, Abi Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Effects of a WHO-guided digital health intervention for depression in Syrian refugees in Lebanon: a randomized controlled trial. PLoS Med. Jun 2022;19(6):e1004025. [FREE Full text] [CrossRef] [Medline]14]. Building on this promising work, WHO developed a digital intervention specifically aimed at young people called Scalable Technology for Adolescents and youth to Reduce Stress (STARS), which uses a rule-based chatbot interface to increase engagement and teach a range of psychological techniques [Hall J, Jordan S, van Ommeren M, Au T, Sway RA, Crawford J, et al. Sustainable Technology for Adolescents and youth to Reduce Stress (STARS): a WHO transdiagnostic chatbot for distressed youth. World Psychiatry. Feb 2022;21(1):156-157. [FREE Full text] [CrossRef] [Medline]15]. The intervention was developed using a human-centered design approach in which 269 youth and 86 community members from Pakistan, Jamaica, occupied Palestinian territories, South Africa, and Nepal and 20 clinical psychology experts were involved [Hall J, Jordan S, van Ommeren M, Au T, Sway RA, Crawford J, et al. Sustainable Technology for Adolescents and youth to Reduce Stress (STARS): a WHO transdiagnostic chatbot for distressed youth. World Psychiatry. Feb 2022;21(1):156-157. [FREE Full text] [CrossRef] [Medline]15]. Because, at the time of testing and development, AI-powered chatbots were a very new technology, a decision was made to use a rule-based chatbot to ensure a greater level of predictability and control.

The objective of this study was to evaluate the feasibility of the STARS intervention and study procedures among young people in Jordan. WHO had initially planned to test STARS with 15-year-olds to 18-year-olds in schools in Jordan as there are few evidence-based psychological interventions available for this age group, especially in LMIC. However, COVID-19 meant that accessing schools was not feasible. To test STARS with a group as close to the intended original age range as possible, while managing the pragmatic realities of recruitment of people younger than 18 years, a decision was taken to test it instead with youth aged 18 years to 21 years.


Design and Setting

A 2-arm, single-blind, feasibility randomized controlled trial (RCT) including a process evaluation was conducted in Jordan, with study implementation coordinated by the Institute of Family Health (IFH). Jordan is a lower middle-income country [World Bank Country and Lending Groups. The World Bank. URL: https:/​/datahelpdesk.​worldbank.org/​knowledgebase/​articles/​906519-world-bank-country-and-lending-groups [accessed 2025-01-14] 16] with a population of 11.5 million people [Population. Department of Statistics. URL: https://dosweb.dos.gov.jo/population/population-2/ [accessed 2025-01-14] 17], of which there are about 3 million Palestinians [Minority Rights Group International. World Directory of Minorities and Indigenous Peoples - Jordan : Palestinians. UNHCR. URL: https://www.refworld.org/reference/countryrep/mrgi/2008/en/65027 [accessed 2025-01-08] 18] and 1.3 million Syrians [Jordan Operational Data Portal. UNHCR Operational Data Portal (ODP). URL: https://data.unhcr.org/en/country/jor [accessed 2025-01-08] 19]. Young people in Jordan, especially refugees, face barriers to education [The situation of children in Jordan: country factsheet. UNICEF. 2024. URL: https://www.unicef.org/mena/media/26706/file [accessed 2025-01-14] 20], high unemployment rates [The situation of children in Jordan: country factsheet. UNICEF. 2024. URL: https://www.unicef.org/mena/media/26706/file [accessed 2025-01-14] 20,Jordan: youth unemployment rate from 2004 to 2023. Statista. URL: https://www.statista.com/statistics/812127/youth-unemployment-rate-in-jordan/ [accessed 2025-01-08] 21], and increasing poverty [Aljaloudi J. Increase the state of poverty in Jordan during the period 2010-2017. SocioEconomic Challenges. 2020;4(4):39-47. [CrossRef]22].

Ethical Considerations

The trial was approved by the research committee of the School of Nursing, University of Jordan (PF.22.9 on March 23, 2022) and the Ethics Review Committee at WHO (ERC.0003729 on July 1, 2022). The CONSORT (Consolidated Standards of Reporting Trials) guideline for feasibility studies is available in

Multimedia Appendix 1

CONSORT checklist.

PDF File (Adobe PDF File), 93 KBMultimedia Appendix 1 [Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, et al. CONSORT 2010 statement: extension to randomised pilot and feasibility trials. BMJ. Oct 24, 2016;355:i5239. [FREE Full text] [CrossRef] [Medline]23]. Digital informed consent was obtained from all participants before screening on the screening website. Participant data were collected online through the Qualtrics data collection software and deidentified after data collection was complete. Participants received a compensation of JD 3 (US $4.23) upon completion of the postassessment. Only members of the research team had access to the data.

Procedures

We aimed to recruit 60 participants (30 in each condition; cf [Teresi JA, Yu X, Stewart A, Hays R. Guidelines for designing and evaluating feasibility pilot studies. Med Care. Jan 01, 2022;60(1):95-103. [FREE Full text] [CrossRef] [Medline]24]) through IFH peer educators (educators and volunteers at IFH who provide health-related information to youth through various activities), universities, and online advertising on Facebook and Instagram. Participants were included if they met the following criteria: (1) aged between 18 years and 21 years; (2) lived in Jordan; (3) elevated levels of psychological distress, as indicated by a score of 20 or higher on the Kessler Psychological Distress Scale (K10) [Easton SD, Safadi N, Wang Y, Hasson R. The Kessler psychological distress scale: translation and validation of an Arabic version. Health Qual Life Outcomes. Oct 27, 2017;15(1):215. [FREE Full text] [CrossRef] [Medline]25,Kessler RC, Andrews G, Colpe L, Hiripi E, Mroczek D, Normand SLT, et al. Short screening scales to monitor population prevalences and trends in non-specific psychological distress. Psychol Med. Aug 2002;32(6):959-976. [CrossRef] [Medline]26]; and (4) had access to a device for intervention delivery or were willing to use one at a participating center. Participants who met all inclusion criteria were asked to complete screening questions on the imminent risk of suicide [Self-Help Plus (SH+): a group-based stress management course for adults. World Health Organization. Sep 23, 2021. URL: https://www.who.int/publications/i/item/9789240035119 [accessed 2025-01-08] 27]. Those who answered “yes” to having taken actions to end their life or who answered “yes” or “unsure” to having a plan to end their life in the next 2 weeks were shown a message encouraging them to seek professional care as soon as possible. The message provided referral options to mental health services in Jordan, including the phone number of the clinical supervisor at IFH. Participants at imminent risk of suicide were excluded from the trial.

Participants who screened positive were then contacted by phone by the project manager for a research engagement call in which participants were reminded of the study procedures and could ask any further questions; after the call, they received a personalized baseline link. Upon completion of the baseline link, participants were randomized by the Qualtrics questionnaire tool on a 1:1 ratio to either STARS or enhanced care as usual (ECAU). The research manager then created participants’ accounts for the STARS or ECAU website and allocated STARS participants to an e-helper (a trained and supervised nonspecialist offering additional support sessions over the phone). The intervention period lasted 8 weeks followed by the postassessment.

Conditions

STARS Intervention

The STARS intervention uses a cognitive behavioral therapy framework to address symptoms of depression and anxiety in young people. This approach is consistent with WHO guidelines of psychological treatments for young people [Mental Health Gap Action Programme (mhGAP) guideline for mental, neurological and substance use disorders. World Health Organization. 2023. URL: https://www.who.int/publications/i/item/9789240084278 [accessed 2025-01-08] 28]. STARS is a non-AI decision tree logic-based conversational agent (chatbot) delivered through a website. The intervention consists of 10 brief lessons, each taking approximately 20 minutes to complete. The lessons are delivered in a conversational, text messaging style by a chatbot called “Salam.” The name Salam was chosen during preliminary adaptation work in Jordan as it is a gender-neutral Arabic name and greeting that means peace. During the chatbot lessons, participants are introduced to the intervention (lesson 1) and learn several self-help strategies, including psychoeducation (lesson 2), emotion regulation (lessons 3 and 4), behavioral activation (lessons 5 and 6), problem management (lesson 7), thought challenging (lessons 8 and 9), and relapse prevention (lesson 10). Participants are encouraged to practice and apply the strategies in their daily lives. A “toolbox” provides support files, such as stress management audios and short video clips, and information on other sources of mental health and psychosocial support in Jordan, including emergency help and contact details to local service providers. Participants are encouraged to complete 2 chatbot lessons a week and to complete all 10 chatbot lessons within the 8-week intervention period. Participants can continue to access the chatbot and their account during the whole study period (ie, also after their postassessment).

In addition to the chatbot, participants are offered 5 calls lasting approximately 15 minutes by an e-helper delivered over a maximum period of 8 weeks. The aim of the calls is to support participants and increase motivation and adherence to the intervention. e-Helpers received 5 days of training and weekly group supervision with a clinical psychologist. The project coordinator attended 10% of calls to rate fidelity to the e-helper manual using a fidelity checklist.

Enhanced Care as Usual

This condition included psychoeducation about depression and anxiety. Psychoeducation was delivered via a static website (not a chatbot) that participants could access for the whole study period after registering via a personal invitation link. The content of the psychoeducation was very similar to the content of lesson 2 of the STARS chatbot and included psychoeducation about emotions, a personal story about a fictional character who talks about her emotions, and information about where to access mental health support. It included a list of organizations providing mental health and psychosocial support in Jordan. This list was also included in the “toolbox” section of the chatbot for STARS intervention participants (see the STARS Intervention section). Participants in the ECAU condition did not receive additional e-helper support.

Feasibility Measures

The primary outcome of this study was the feasibility of the STARS intervention and study among youth aged 18 years to 21 years in Jordan. Feasibility was assessed using the following criteria: (1) recruitment rate recorded as the number of eligible participants who consented to participate in the study, (2) dropout rates recorded as the number of randomized participants who did not complete the intervention nor the postassessment, (3) number of adverse events and serious adverse events, and (4) evaluation of (secondary) outcome measures and estimation of differences across the 2 groups. In addition, a qualitative process evaluation was conducted to understand aspects related to implementing the STARS intervention and study.

Secondary Outcome Measures

Psychological distress was measured with the K10 [Easton SD, Safadi N, Wang Y, Hasson R. The Kessler psychological distress scale: translation and validation of an Arabic version. Health Qual Life Outcomes. Oct 27, 2017;15(1):215. [FREE Full text] [CrossRef] [Medline]25,Kessler RC, Andrews G, Colpe L, Hiripi E, Mroczek D, Normand SLT, et al. Short screening scales to monitor population prevalences and trends in non-specific psychological distress. Psychol Med. Aug 2002;32(6):959-976. [CrossRef] [Medline]26]. It includes 10 questions scored on a scale from 1 to 5 (sum score range: 10-50), with higher scores indicating higher psychological distress. The measure has been validated among Palestinians with good internal consistency [Easton SD, Safadi N, Wang Y, Hasson R. The Kessler psychological distress scale: translation and validation of an Arabic version. Health Qual Life Outcomes. Oct 27, 2017;15(1):215. [FREE Full text] [CrossRef] [Medline]25]. Symptoms of anxiety and depression in the last week were assessed with the 25-item Hopkins Symptom Checklist (HSCL-25) [Derogatis LR, Lipman RS, Rickels K, Uhlenhuth EH, Covi L. The Hopkins Symptom Checklist (HSCL): a self-report symptom inventory. Behav Sci. Jan 1974;19(1):1-15. [CrossRef] [Medline]29]. Questions are scored on a scale from 1 to 4, with higher scores indicating greater distress. The total scores for the subscales are the average of the 10 anxiety symptoms and the average of the 15 depression symptoms. The instrument has been validated in Arabic-speaking populations with good psychometric properties [Selmo P, Koch T, Brand J, Wagner B, Knaevelsrud C. Psychometric properties of the online Arabic versions of BDI-II, HSCL-25, and PDS. European Journal of Psychological Assessment. 2019;35(1):46-54. [FREE Full text] [CrossRef]30,Mahfoud Z, Kobeissi L, Peters T, Araya R, Ghantous Z, Khoury B. The Arabic validation of the Hopkins Symptoms Checklist-25 against MINI in a disadvantaged suburb of Beirut, Lebanon. The International Journal of Educational and Psychological Assessment. 2013;13(1):17-33. [FREE Full text]31]. Functional impairment was measured with the 12-item version of the WHO Disability Assessment Schedule (WHODAS 2.0) [Measuring health and disability: manual for WHO Disability Assessment Schedule (‎WHODAS 2.0)‎. World Health Organization. URL: https:/​/www.​who.int/​publications/​i/​item/​measuring-health-and-disability-manual-for-who-disability-assessment-schedule-(-whodas-2.​0) [accessed 2025-01-08] 32], which covers 6 domains of functioning (ie, cognition, mobility, self-care, getting along, life activities, and participation) over the past 30 days. Items are scored on a scale from 0 to 4 (total sum score: 0-48), with a higher score indicating greater functional impairment. The instrument has been widely used and across various populations [Federici S, Bracalenti M, Meloni F, Luciano JV. World Health Organization disability assessment schedule 2.0: An international systematic review. Disabil Rehabil. Nov 2017;39(23):2347-2380. [FREE Full text] [CrossRef] [Medline]33] and performed well in terms of psychometric properties among Syrian refugees [Akhtar A, Cuijpers P, Morina N, Sijbrandij M, Bryant R. Exploratory and confirmatory factor analysis of the 12-item Arabic World Health Organization Disability Assessment Schedule (WHODAS 2.0) as a screening tool for Syrian refugees. BJPsych Open. Oct 11, 2021;7(6):1. [FREE Full text] [CrossRef]34]. The sociodemographic section of the WHODAS 2.0 was adapted to collect data on age, gender, nationality, living situation, education level, marital status, and work status. Self-identified problems were assessed with the Psychological Outcomes Profiles (PSYCHLOPS) instrument [Ashworth M, Shepherd M, Christey J, Matthews V, Wright K, Parmentier H, et al. A client‐generated psychometric instrument: The development of ‘PSYCHLOPS’. Couns and Psychother Res. Aug 18, 2006;4(2):27-31. [FREE Full text] [CrossRef]35]. Items are rated on a scale from 0 to 5 (total sum score 0-20) referring to the last week. The instrument was previously used in Arabic-speaking populations [de Graaff AM, Cuijpers P, Twisk J, Kieft B, Hunaidy S, Elsawy M, et al. Peer-provided psychological intervention for Syrian refugees: results of a randomised controlled trial on the effectiveness of Problem Management Plus. BMJ Ment Health. Feb 2023;26(1):e300637. [FREE Full text] [CrossRef] [Medline]36]. Psychological well-being was measured with the WHO-Five Well-Being Index (WHO-5) [Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167-176. [CrossRef] [Medline]37], a widely used instrument in which 5 positively framed statements about well-being in the past 2 weeks are rated on a scale from 0 to 5, with higher scores indicating better well-being. The Arabic translation has been validated in a Saudi sample [Kassab Alshayea A. Development and validation of an Arabic version of the World Health Organization Well-Being Index (WHO-5). J Psychopathol Behav Assess. Feb 01, 2023;45(1):247-255. [CrossRef]38]. Agency was assessed through the agency subscale of the State Hope Scale [Snyder CR, Sympson S, Ybasco F, Borders T, Babyak M, Higgins R. Development and validation of the State Hope Scale. J Pers Soc Psychol. Feb 1996;70(2):321-335. [CrossRef] [Medline]39], which consists of 3 items of perceived levels of goal-directed energy measured on a scale from 1 to 8. Higher scores indicate higher levels of perceived agency.

Additionally, satisfaction with the intervention was assessed with the Client Satisfaction Questionnaire for internet-based interventions (CSQ-I) [Boß L, Lehr D, Reis D, Vis C, Riper H, Berking M, et al. Reliability and validity of assessing user satisfaction with web-based health interventions. J Med Internet Res. Aug 31, 2016;18(8):e234. [FREE Full text] [CrossRef] [Medline]40] at the postassessment only. All secondary outcome measures are presented in Table 1.

Table 1. Secondary outcome measures.
DomainInstrumentTime frameNumber of itemsScreening (T0)Baseline (T1)InterventionPostassessment (T2)
Psychological distressK10a30 days10XbX
Depression symptomsHSCL-25c7 days15XX
Depression symptomsPHQ-2d14 days2X
Anxiety symptomsHSCL-257 days10XX
Functional impairmentWHODASe 2.030 days12XX
Subjective well-beingWHO-5f14 days5XX
Self-identified problemsPSYCHLOPSgCurrently7 (T1) or 8 (T2)XX
AgencySHS-AhCurrently3XX
User satisfactionCSQ-IiCurrently8X

aK10: Kessler Psychological Distress Scale.

bNot applicable.

cHSCL-25: Hopkins Symptom Checklist-25.

dPHQ-2: Patient Health Questionnaire.

eWHODAS: World Health Organization Disability Assessment Schedule.

fWHO-5: World Health Organization-Five Well-being Index.

gPSYCHLOPS: Psychological Outcomes Profiles.

hSHS-A: State Hope Scale agency subscale.

iCSQ-I: Client Satisfaction Questionnaire for internet-based interventions.

Quantitative and Qualitative Analyses

Descriptive statistics are presented using mean values, SDs, frequencies (n), and percentages. Cronbach α was used to evaluate the internal reliability of study instruments at baseline, and correlation analysis was used to evaluate convergent validity across instruments. Baseline differences were tested using chi-square tests for frequencies and independent-sample t tests for continuous variables.

As this was a feasibility RCT aimed to evaluate the feasibility of the STARS intervention, it was not powered to detect significant differences. However, to evaluate the secondary outcome measures and estimate differences across the 2 groups, univariate completers analyses of covariance (ANCOVA) for all outcome measures were performed, with their respective baseline scores as covariates. Only participants who completed the postassessment were included in the statistical analyses. Prorated imputation was used for participants with maximally 20% missing values on the scale (ie, using the participant’s mean score of the other items of the scale). Total scores were considered missing for participants if more than 20% of items were missing on the scale.

Subsequent to the trial, a qualitative process evaluation was conducted with 5 key informant groups: (1) STARS completers, (2) STARS noncompleters, (3) ECAU participants, (4) e-helpers, and (5) the clinical supervisor. The number of key informants per group was predefined in the study protocol based on comparable studies (eg, [Harper Shehadeh MJ, Abi Ramia J, Cuijpers P, El Chammay R, Heim E, Kheir W, et al. Step-by-Step, an e-mental health intervention for depression: a mixed methods pilot study from Lebanon. Front Psychiatry. 2019;10:986. [FREE Full text] [CrossRef] [Medline]41,de Graaff AM, Cuijpers P, McDaid D, Park A, Woodward A, Bryant RA, et al. Peer-provided Problem Management Plus (PM+) for adult Syrian refugees: a pilot randomised controlled trial on effectiveness and cost-effectiveness. Epidemiol Psychiatr Sci. Aug 18, 2020;29:1. [CrossRef]42]) and feasibility given the low number of participants and staff in this study. We followed the following procedure to select key informants for an interview. The local research coordinator first categorized participants as being a STARS completer (ie, having completed at least 8 chatbot lessons), STARS noncompleter (ie, having completed less than 8 chatbot lessons), or ECAU participant. The research coordinator then contacted participants from each group in order of enrollment date and stopped inviting new participants when a sufficient number of key informants per group was reached (ie, 4 to 5 key informants per group). Given the small helper team during the feasibility study, all helpers (n=2) and the only clinical supervisor were invited to participate in the process evaluation. Key informants all provided informed consent to participate in the process evaluation interview including being audio recorded and did not receive (additional) compensation for their participation. The topics covered in the semistructured interviews are outlined in Table 2. Interviews were conducted over the phone (STARS and ECAU participants) or face-to-face (e-helpers and supervisor) and audio recorded for transcription. Transcripts were transcribed verbatim to the language of the interview and deidentified prior to analysis. Deidentified Arabic transcripts were independently analyzed by 2 Arabic-speaking researchers (RH and SF) using thematic analysis. Categorization of responses was checked between the researchers. Key informant groups were analyzed separately, and data were compared to identify commonalities and differences in responses. Results were discussed with a third researcher (AMDG).

Table 2. Process evaluation interview topics.
Key informant groupInterview topics
STARSa completers and noncompleters
  • Overall impressions of the STARS intervention and study
  • Reason for discontinuation (if applicable)
  • Views on the STARS intervention
  • Rapport with the e-helper
  • Research process
ECAUb participants
  • Overall impressions of the ECAU condition and study
  • Views on the ECAU condition
  • Research process
e-Helpers
  • Overall impressions of the STARS intervention and study
  • Views on the STARS intervention
  • Rapport with participants
  • e-Helper support calls
  • Local management and supervision
Clinical supervisor
  • Overall impressions of the STARS intervention and study
  • Integrating the role of supervisor in the workload
  • Training, supervision, and retention of e-helpers

aSTARS: Scalable Technology for Adolescents and youth to Reduce Stress.

bECAU: enhanced care as usual.


Feasibility of STARS

Recruitment

Participants were recruited through IFH peer educators, universities in Amman, and online advertising between December 1, 2022, and January 26, 2023. Of 700 visits to the screening website, 476 provided informed consent and completed the screening questionnaires. Of those, 160 participants were eligible to participate and called by the project manager for an “engagement call.” Of those, 54 did not respond to the engagement call, 25 did not proceed to baseline, 12 indicated they did not want to participate, 5 did not provide correct contact details, and 4 registered twice (see Figure 1). The 60 participants who continued to baseline heard about the study through a friend (n=28), social media (n=13), an IFH peer educator (n=10), or from another source (n=9).

Figure 1. CONSORT (Consolidated Standards of Reporting Trials) flow diagram. IC: informed consent; K10: Kessler Psychological Distress Scale; STARS: Scalable Technology for Adolescents and youth to Reduce Stress.
Group Allocation and Registration to the STARS Website

Upon completion of the baseline assessment, 60 participants were randomized into STARS (n=30) or ECAU (n=30). The project manager assigned participants to either a control facilitator or e-helper who created a condition-specific account for the participant. Of the 60 participants invited, 52 participants registered at the STARS website, of which 26 were STARS chatbot participants and 26 were ECAU participants.

Participant Characteristics

The baseline sample of 60 participants consisted of 49 women (82%) and 11 men (18%) between the ages of 18 years and 21 years (mean 19.7, SD 1.16 years; Table 3). Participants identified as Jordanian (31/60, 52%), Syrian (27/60, 45%), or other (2/60, 3%). The majority (53/60, 88%) were unmarried. Most participants (40/60, 67%) reported a general secondary education or a bachelor’s degree as their highest level of education started and indicated that they were currently a student (28/60, 47%). There were no statistically significant differences in baseline characteristics between the STARS and ECAU conditions.

Table 3. Baseline characteristics.
CharacteristicsTotal sample (N=60)STARSa intervention (n=30)Control condition (n=30)Comparison statistic (df)P value
Gender (male), n (%)11 (18)5 (17)7 (23)0.11 (1)b.73
Age (years), mean (SD)19.68 (1.16)19.43 (1.10)19.93 (1.27)1.70 (58)c.94
Age (years), range18-2118-2118-21d
Nationality, n (%)3.22 (2)b.20

Jordanian31 (52)17 (57)14 (47)


Syrian27 (45)11 (37)16 (53)


Other (Palestinian, Iraqi)2 (3)2 (7)0 (0)

Highest education started, n (%)6.94 (5)b.22

No education2 (3)2 (7)0 (0)


Basic education9 (15)5 (17)4 (13)


Technical/vocational secondary education5 (8)1 (3)4 (13)


Technical diploma4 (7)3 (10)1 (3)


General secondary education15 (25)5 (17)10 (33)


Bachelor’s degree25 (42)14 (47)11 (37)

Marital status, n (%)2.19 (3)b.52

Never married53 (88)27 (90)26 (87)


Currently married5 (8)2 (7)3 (10)


Separated1 (2)0 (0)1 (3)


Divorced1 (2)1 (3)0 (0)

Work status, n (%)7.39 (6)b.28

Paid work11 (18)3 (10)8 (27)


Nonpaid work1 (2)1 (3)0 (0)


Self-employed0 (0)0 (0)0 (0)


Student28 (47)14 (47)14 (47)


Keeping house8 (13)5 (17)3 (10)


Retired0 (0)0 (0)0 (0)


Unemployed (health reasons)0 (0)0 (0)0 (0)


Unemployed (other reasons)7 (12)5 (17)2 (7)


Other3 (5)2 (7)1 (3)


Not allowed to work in Jordan2 (3)0 (0)2 (7)

aSTARS: Scalable Technology for Adolescents and youth to Reduce Stress.

bChi-square test.

ct test.

dNot applicable.

Adherence to STARS and ECAU Conditions

In total, 26 STARS and 22 control participants accessed the content of their intervention following allocation. For the STARS intervention, this meant accessing at least the home page, and for the control group, this meant accessing the psychoeducation page. Within the 8-week intervention period, 22 of the 30 STARS participants (73%) completed at least 1 chatbot lesson, 11 participants (37%) completed at least 8 chatbot lessons, and 10 participants (33%) completed all 10 chatbot lessons, with an average of 4.47 lessons completed. Table 4 shows the number of participants completing each lesson.

Table 4. Scalable Technology for Adolescents and youth to Reduce Stress (STARS) intervention attendance of chatbot lessons and e-helper support calls (n=30).
Lesson or call numberParticipants who completed, n (%)Duration of call (minutes), mean (SD)
Lesson

122 (73)a

217 (57)

314 (47)

414 (47)

512 (40)

612 (40)

711 (37)

811 (37)

911 (37)

1010 (33)
e-Helper call

121 (70)7.3 (2.1)

213 (43)8.3 (2.6)

311 (37)7.5 (1.9)

49 (30)8.4 (3.2)

54 (13)7.3 (3.0)

aNot applicable.

In terms of the e-helper support calls, 21 of the 30 participants (70%) attended at least 1 support call, and 4 of the 30 participants (13%) attended all 5 support calls. The average number of calls attended was 1.93, and the average call duration was 7.8 (SD 2.4; range 4-14) minutes. Of the 26 participants who did not attend all support calls, 13 participants ran out of time (ie, 8 weeks had passed without completing all calls), and 13 participants did not continue attending the support calls. In addition, 7 participants stopped responding to the e-helper (ie, as per the standard operating procedure, the e-helper stopped contacting the participant after 3 consecutive unsuccessful attempts to get in touch), 2 participants indicated external events hindered them from continuing the intervention (ie, passing of family member, exams), 2 participants expressed the intervention did not meet their expectations (eg, looking for a job), 1 participant stopped after expressing concern about confidentiality of the chatbot, and 1 participant’s phone was disconnected. Table 4 presents the attendance at the STARS chatbot lessons and e-helper support calls.

STARS Protocol Fidelity Assessment of e-Helper Phone Calls

A total of 15 e-helper phone calls were attended and assessed on fidelity (call 1: n=4; call 2: n=3; call 3: n=4; call 4: n=2; call 5: n=2). Fidelity assessments indicated that 98% of the call content was carried out.

Serious Adverse Events

No adverse events nor serious adverse events were reported.

Retention at Postassessment

The postassessment, at 8 weeks after baseline, between January 26, 2023, and March 27, 2023, was completed by 48 (80%) of the 60 participants, of which 25 (25/30, 83%) were STARS intervention participants and 23 (23/30, 77%) were control group participants. Reasons for dropout at postassessment were not provided due to the online nature of the study.

Secondary Outcome Measures

All outcome measures had good internal consistency with a Cronbach α >0.80 (see Table 5). Across assessments completed by participants, 2.7% (180/6576) of the outcome questionnaire items were missing.

Table 5. Baseline comparison of scores on secondary outcome measures and internal consistency.
DomainMeasurePossible rangeValidated cutoffTotal (N=60), mean (SD; range)STARSa (n=30), mean (SD)ECAUb (n=30), mean (SD)ComparisonCronbach α







Statistic (df)P value
Suicidal thoughts past monthSingle itemYes/noc10 (17)d3 (10)d7 (23)d1.92 (1)e.29
Psychological distressK10f10-50≥20g32.77 (7.31; 20-48)31.53 (7.82; 20-48)34.00 (6.66; 21-46)–1.32 (58)h.190.84
Functional impairmentWHODASi0-48≥17j26.68 (8.66; 14-50)26.73 (10.11; 14-50)26.63 (7.34; 14-43)–0.07 (55)h.940.88
Anxiety and depression combinedHSCL-25k1-42.46 (0.69; 1.04-3.70)2.39 (0.73; 1.04-3.60)2.54 (0.65; 1.20-3.70)–0.86 (57)h.190.94
AnxietyHSCL-25 subscale1-4≥2.00l2.38 (0.70; 1.00-3.60)2.60 (0.67; 1.27-3.93)2.46 (0.68; 1.00-3.50)–0.89 (56)h.370.88
DepressionHSCL-25 subscale1-4≥2.10m2.52 (0.72; 1.07-3.93)2.45 (0.76; 1.07-3.67)2.30 (0.71; 1.00-3.60)–0.83 (57)h.410.91
Self-identified problemsPSYCHLOPSn0-2013.66 (5.49; 0-20)13.54 (4.96; 0-20)13.89 (6.14; 0-20)–0.17 (51)h.860.90
Psychological well-beingWHO-5o0-25<50p36.93 (22.29; 0-100)42.27 (24.38; 4-100)31.60 (18.91; 0-84)1.89 (58)h.060.87
AgencySHSq agency3-2412.32 (5.29; 3-23)12.73 (5.29; 3-23)11.90 (5.34; 3-21)0.61 (58)h.540.87

aSTARS: Scalable Technology for Adolescents and youth to Reduce Stress.

bECAU: enhanced care as usual.

cNot applicable.

dn (%).

eChi-square test.

fK10: Kessler Psychological Distress Scale.

gCutoff for probable anxiety or affective disorder with a sensitivity of 0.66 and specificity of 0.92 [Andrews G, Slade T. Interpreting scores on the Kessler Psychological Distress Scale (K10). Aust N Z J Public Health. Dec 2001;25(6):494-497. [FREE Full text] [CrossRef] [Medline]43].

ht test.

iWHODAS: World Health Organization Disability Assessment Schedule.

jIndicator of moderate impairment [Akhtar A, Cuijpers P, Morina N, Sijbrandij M, Bryant R. Exploratory and confirmatory factor analysis of the 12-item Arabic World Health Organization Disability Assessment Schedule (WHODAS 2.0) as a screening tool for Syrian refugees. BJPsych Open. Oct 11, 2021;7(6):1. [FREE Full text] [CrossRef]34].

kHSCL-25: Hopkins Symptom Checklist-25.

lCutoff for probable anxiety disorder with a sensitivity of 0.84 and a specificity of 0.59 [Mahfoud Z, Kobeissi L, Peters T, Araya R, Ghantous Z, Khoury B. The Arabic validation of the Hopkins Symptoms Checklist-25 against MINI in a disadvantaged suburb of Beirut, Lebanon. The International Journal of Educational and Psychological Assessment. 2013;13(1):17-33. [FREE Full text]31].

mCutoff for probable depressive disorder with a sensitivity of 0.82 and a specificity of 0.70 [Mahfoud Z, Kobeissi L, Peters T, Araya R, Ghantous Z, Khoury B. The Arabic validation of the Hopkins Symptoms Checklist-25 against MINI in a disadvantaged suburb of Beirut, Lebanon. The International Journal of Educational and Psychological Assessment. 2013;13(1):17-33. [FREE Full text]31].

nPSYCHLOPS: Psychological Outcomes Profiles.

oWHO-5: World Health Organization-Five Well-Being Index.

pCutoff for depression with a sensitivity of 0.86 and a specificity of 0.81 [Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167-176. [CrossRef] [Medline]37].

qSHS: State Hope Scale.

Table 6 presents the baseline mean values for the HSCL-25 (total score and anxiety and depression subscales), K10, WHODAS 2.0, PSYCHLOPS, WHO-5, and State Hope Scale agency subscale indicating no significant differences between groups at baseline. The ANCOVA of postassessment scores on the HSCL-25 controlling for baseline HSCL-25 scores indicated a nonsignificant effect of condition (STARS: mean 2.26, SD 0.79; ECAU: mean 2.52, SD 0.51; F1, 43=0.42, P=.52, ƞ=0.01). Univariate ANCOVAs also indicated nonsignificant effects of condition on all other outcomes (see Table 6). The mean score on the CSQ-I to evaluate satisfaction with the intervention was 26.79 (SD 5.07; n=24). Figure 2 summarizes the responses to each statement on the CSQ-I.

Table 6. Univariate completer analysis of covariance (ANCOVA) results for each outcome measure.
OutcomePostassessment scores, mean (SD)ANCOVA

STARSaECAUbF (df)P valueEffect size, ƞ
HSCL-25c total2.26 (0.79)2.52 (0.51)0.42 (1,43).520.01
HSCL-25 anxiety2.21 (0.78)2.39 (0.61)0.02 (1,42).880.00
HSCL-25 depression2.29 (0.84)2.62 (0.52)0.91 (1,43).340.02
WHODASd 2.025.39 (8.67)26.77 (6.82)0.45 (1,42).500.01
K10e28.20 (9.17)30.00 (8.63)0.02 (1,43).900.00
PSYCHLOPSf10.96 (5.46)13.30 (4.40)3.21 (1,41).080.07
WHO-5g49.12 (23.99)35.27 (20.27)2.18 (1,44).140.05
SHSh agency14.58 (4.37)13.86 (5.20)0.01 (1,43).900.00

aSTARS: Scalable Technology for Adolescents and youth to Reduce Stress.

bECAU: enhanced care as usual.

cHSCL-25: Hopkins Symptom Checklist-25.

dWHODAS: World Health Organization Disability Assessment Schedule.

eK10: Kessler Psychological Distress Scale.

fPSYCHLOPS: Psychological Outcomes Profiles.

gWHO-5: World Health Organization-Five Well-Being Index.

hSHS: State Hope Scale.

Figure 2. Results of the Client Satisfaction Questionnaire for internet-based interventions (CSQ-I; n=24).

Process Evaluation

Semistructured interviews were conducted between February 14, 2023, and March 19, 2023, with 4 STARS completers, 5 STARS noncompleters, 4 ECAU participants, 2 e-helpers, and 1 clinical supervisor (n=16). Themes included perspectives on the study procedures, STARS intervention, e-helper support, technical aspects of the chatbot, and ECAU condition.

Study Procedures

Most participants found completing the questionnaires “easy” but lengthy and suggested clearer explanations of the randomization process. Regarding the assessment of protocol fidelity, 1 e-helper expressed discomfort with live ratings and suggested the use of audio recordings of calls instead.

General Views of the STARS Intervention

Generally, participants held positive views regarding the STARS chatbot, found the lessons informative (eg, the skills learned), enjoyed the human-like interaction, and appreciated the use of informal Arabic to mimic text messaging among young people. The supervisor was positive about the intervention overall including its possible ability to tackle stigma yet was uncertain about the minimal level of human contact. Additional content such as more chatbot lessons or longer character “stories” was desired by 4 STARS completers. All 4 STARS completers reported using the therapeutic skills and stress management audios in their daily life. STARS noncompleters acknowledged the benefits of the skills, but they did not apply them routinely. One e-helper noticed the benefits of the skills for herself as well.

Views on and Engagement With e-Helper Support

Views on the e-helper support were mixed. Although e-helpers were described as “friendly,” “good listeners,” and showing understanding, there were also suggestions for improvements, such as using a more caring tone. One e-helper emphasized the importance of building rapport and to encourage participants to use the chatbot. There were also other opinions expressed but no clear consensus. For example, e-helpers noted varying participant preferences for the level of contact; 1 STARS completer suggested e-helpers should be available 24/7, while 1 participant dropped out because they thought their e-helper might be able to read the chatbot conversations. e-Helpers felt that young people were hesitant to share much over the phone, and some participants preferred expressing themselves through text messages. e-Helpers expressed a need for more training to conduct the 15-minute support calls. All STARS completers said the 15-minute call duration was suitable.

Engagement With the STARS Chatbot

Reasons for not completing the intervention included factors such as technical difficulties; concerns about the privacy of the chatbot lessons; the intervention being too “simple” to address problems; and external factors, such as feeling too stressed and having a health issue. Two participants suggested adding videos about the skills to enhance engagement, 3 participants suggested follow-up calls after the study was over (eg, a check-in), and 1 suggested a “modular” instead of sequential chatbot. Last, 1 e-helper suggested a call before participants start their chatbot lessons so that the e-helper can explain more about the intervention and support participants with registration.

Technical Aspects of the STARS Chatbot

STARS participants mentioned that the intervention was user-friendly (eg, conversational chatbot) but suggested several improvements. More flexibility in responding to the chatbot (eg, open response formats rather than predefined response options) was preferred by 2 STARS dropouts, and 3 suggested the use of notifications as reminders to complete the lessons. One e-helper and some participants noted the problems created by using email to register, as some people will not be familiar with it. Other technical limitations included expiry of a sign-up link within 24 hours, the need to restart a chatbot lesson in case of internet disconnection, and the presence of various bugs that hindered progress.

Views on the ECAU Website

All ECAU participants found the psychoeducation “not boring” but also too brief, although 1 respondent considered the succinctness of the information as something positive.


Principal Findings

This study aimed to evaluate the feasibility of the STARS intervention and study among young people in Jordan. Key findings include that the trial procedures were adequate and overall views of STARS were positive, with some suggested improvements. As this was a feasibility study, the sample size was not powered to find a significant effect. Nonetheless, we did not find any differences between conditions over time that were statistically significant.

This study showed that trial procedures were feasible in terms of recruitment, dropout rates from the intervention, and retention at postassessment. The target sample size of 60 participants was reached within 2 months, mostly through word-of-mouth, with limited investment in social media advertisements, suggesting the suitability of online recruitment for a fully powered RCT. In terms of the recruited population, the majority reported a higher level of education and a Jordanian background. The internal consistency of the online-administered measures was good, and the item nonresponse was low [Jakobsen JC, Gluud C, Wetterslev J, Winkel P. When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts. BMC Med Res Methodol. Dec 06, 2017;17(1):162. [FREE Full text] [CrossRef] [Medline]44], indicating that the set of questionnaires selected was acceptable. Adherence to the STARS chatbot lessons was adequate, with 73% of participants starting the intervention and 37% completing at least 8 chatbot lessons. These findings are in line with meta-analytic evidence on adherence rates in smartphone-delivered interventions for mental health problems [Linardon J, Fuller-Tyszkiewicz M. Attrition and adherence in smartphone-delivered interventions for mental health problems: a systematic and meta-analytic review. J Consult Clin Psychol. Jan 2020;88(1):1-13. [CrossRef] [Medline]45] and comparable to what has been found for the Step-by-Step intervention studies in Lebanon, where 19% [Cuijpers P, Heim E, Abi Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Effects of a WHO-guided digital health intervention for depression in Syrian refugees in Lebanon: a randomized controlled trial. PLoS Med. Jun 2022;19(6):e1004025. [FREE Full text] [CrossRef] [Medline]14] and 32% [Cuijpers P, Heim E, Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Guided digital health intervention for depression in Lebanon: randomised trial. Evid Based Ment Health. Dec 2022;25(e1):e34-e40. [FREE Full text] [CrossRef] [Medline]13] completed the intervention. Furthermore, retention at postassessment was good (80%) compared with what is commonly observed in smartphone-delivered interventions [Cuijpers P, Heim E, Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Guided digital health intervention for depression in Lebanon: randomised trial. Evid Based Ment Health. Dec 2022;25(e1):e34-e40. [FREE Full text] [CrossRef] [Medline]13,Cuijpers P, Heim E, Abi Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Effects of a WHO-guided digital health intervention for depression in Syrian refugees in Lebanon: a randomized controlled trial. PLoS Med. Jun 2022;19(6):e1004025. [FREE Full text] [CrossRef] [Medline]14,Torous J, Lipschitz J, Ng M, Firth J. Dropout rates in clinical trials of smartphone apps for depressive symptoms: a systematic review and meta-analysis. J Affect Disord. Feb 15, 2020;263:413-419. [CrossRef] [Medline]46]. During the trial, no serious adverse events were reported.

Stakeholder views on the STARS intervention—identified through qualitative interviews—were overall positive. Participants generally liked the chatbot interface and found the therapeutic skills useful. Several aspects hampered easy use of the chatbot, including technical barriers such as bugs and internet connection problems that prevented continuing lessons; access barriers such as reliance on email for login; and restrictions to the intervention’s flexibility such as content spacing, predefined response options, and limited availability of e-helpers. Improvements to the chatbot within the confines of the system will be made prior to the main RCT. For example, it is not feasible to use mainly open-text responses as it does not use machine learning.

Adherence to the e-helper support calls was relatively low, with only 13% of participants attending all 5 e-helper support calls. In addition, although the fidelity review of e-helper support calls indicated that most aspects of the calls were carried out, the calls were short in duration. The process evaluation interviews also revealed that some participants felt e-helpers could be more caring, and e-helpers expressed a need for more guidance on how to carry out the calls more effectively. These very important findings together point to a need for increased support for e-helpers, for example, more detailed call scripts that address common issues and additional training and supervision. This will be addressed for the main RCT.

Limitations of this study include the use of only an immediate postassessment, while the primary end point of the fully powered RCT will be at a 3-month follow-up. This study thus does not provide an estimate of the retention rate at the 3-month follow-up. Second, recruitment of participants was mainly done through IFH peer educators and universities and limited posts on social media, while for a fully powered RCT, the focus will likely be more on large social media advertisements. Finally, this study did not include an active control group, which did not allow delineation of the STARS content from nonspecific effects of engaging with a digital intervention. STARS needs to be initially evaluated to determine its efficacy, but subsequent dismantling studies are needed to determine the active components of the program by comparing STARS against more active controls. Strengths include the use of both quantitative and qualitative research methods allowing for identification of more subtle use and engagement factors with both the chatbot and e-helper calls that would not have been possible with quantitative data alone. This study aimed to test the feasibility of STARS prior to a fully powered definitive RCT to evaluate the effectiveness of STARS. We expect the fully powered RCT to provide an important contribution to the field in terms of the effectiveness of delivering such digital mental health interventions in countries with limited mental health resources, particularly as most mental health chatbots have been evaluated in higher income country settings [Abd-Alrazaq AA, Alajlani M, Alalwan A, Bewick B, Gardner P, Househ M. An overview of the features of chatbots in mental health: a scoping review. Int J Med Inform. Dec 2019;132:103978. [FREE Full text] [CrossRef] [Medline]47].

Conclusion

This feasibility RCT showed that the newly developed digital STARS intervention is feasible for use by young people in Jordan. The study revealed several aspects of the STARS intervention that could be improved, especially aspects related to ease of accessing the chatbot, flexibility in its use, and e-helper support.

Acknowledgments

We thank the e-helper team, the Institute for Family Health, Jinane Abi Ramia, and Sally Khoury for their inputs into the development of the research protocol and related research processes for the upcoming trial. We thank TPX Impact and OpenDialog for their support and technical expertise with the technical implementation of the Scalable Technology for Adolescents and youth to Reduce Stress intervention and enhanced care as usual website.

This work was supported by the Research for Health in Humanitarian Crises Programme (47467). The sponsor had no role in the collection, management, analysis, and interpretation of data, nor did they have a role in the decision to submit the report for publication. The authors alone are responsible for the views expressed in this article, and they do not necessarily represent the views, decisions, or policies of the institutions with which they are affiliated.

Data Availability

Data will be available from the corresponding author upon reasonable request.

Authors' Contributions

AMDG, RH, SF, DK, AA, LD, MVO, RB, and KC were responsible for the development of the study design and protocol. AMDG and RH wrote the first draft of the manuscript. All authors reviewed and approved the final manuscript as submitted.

Conflicts of Interest

None declared.

Multimedia Appendix 1

CONSORT checklist.

PDF File (Adobe PDF File), 93 KB

  1. GBD Results Tool. Institute for Health Metrics and Evaluation. URL: https://vizhub.healthdata.org/gbd-results/ [accessed 2024-03-23]
  2. McGorry PD, Mei C, Dalal N, Alvarez-Jimenez M, Blakemore S, Browne V, et al. The Lancet Psychiatry Commission on youth mental health. Lancet Psychiatry. Sep 2024;11(9):731-774. [FREE Full text] [CrossRef] [Medline]
  3. Uppendahl JR, Alozkan-Sever C, Cuijpers P, de Vries R, Sijbrandij M. Psychological and psychosocial interventions for PTSD, depression and anxiety among children and adolescents in low- and middle-income countries: a meta-analysis. Front Psychiatry. 2019;10:933. [FREE Full text] [CrossRef] [Medline]
  4. Pedersen GA, Smallegange E, Coetzee A, Hartog K, Turner J, Jordans M, et al. A systematic review of the evidence for family and parenting interventions in low- and middle-income countries: child and youth mental health outcomes. J Child Fam Stud. Apr 29, 2019;28(8):2036-2055. [FREE Full text] [CrossRef]
  5. Alozkan-Sever C, Uppendahl J, Cuijpers P, de Vries R, Rahman A, Mittendorfer-Rutz E, et al. Research Review: Psychological and psychosocial interventions for children and adolescents with depression, anxiety, and post-traumatic stress disorder in low- and middle-income countries - a systematic review and meta-analysis. J Child Psychol Psychiatry. Dec 2023;64(12):1776-1788. [FREE Full text] [CrossRef] [Medline]
  6. Linardon J, Torous J, Firth J, Cuijpers P, Messer M, Fuller-Tyszkiewicz M. Current evidence on the efficacy of mental health smartphone apps for symptoms of depression and anxiety. A meta-analysis of 176 randomized controlled trials. World Psychiatry. Feb 2024;23(1):139-149. [FREE Full text] [CrossRef] [Medline]
  7. Garrido S, Millington C, Cheers D, Boydell K, Schubert E, Meade T, et al. What works and what doesn't work? A systematic review of digital mental health interventions for depression and anxiety in young people. Front Psychiatry. 2019;10:759. [FREE Full text] [CrossRef] [Medline]
  8. Karyotaki E, Kleiboer A, Smit F, Turner D, Pastor A, Andersson G, et al. Predictors of treatment dropout in self-guided web-based interventions for depression: an ‘individual patient data’ meta-analysis. Psychol. Med. Apr 17, 2015;45(13):2717-2726. [FREE Full text] [CrossRef]
  9. Opel DJ, Kious B, Cohen I. AI as a mental health therapist for adolescents. JAMA Pediatr. Dec 01, 2023;177(12):1253-1254. [FREE Full text] [CrossRef] [Medline]
  10. Scalable psychological interventions for people in communities affected by adversity: a new area of mental health and psychosocial work at WHO. Geneva. World Health Organization; 2017. URL: https://iris.who.int/handle/10665/254581
  11. Scalable psychological interventions for people in communities affected by adversity: a new area of mental healthpsychosocial work at WHO. World Health Organization. 2017. URL: https://iris.who.int/bitstream/handle/10665/254581/WHO-MSD-MER-17.1-eng.pdf [accessed 2025-01-08]
  12. Petrosyan A. Worldwide digital population 2024. Statista. 2024. URL: https://www.statista.com/statistics/617136/digital-population-worldwide/ [accessed 2024-02-23]
  13. Cuijpers P, Heim E, Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Guided digital health intervention for depression in Lebanon: randomised trial. Evid Based Ment Health. Dec 2022;25(e1):e34-e40. [FREE Full text] [CrossRef] [Medline]
  14. Cuijpers P, Heim E, Abi Ramia J, Burchert S, Carswell K, Cornelisz I, et al. Effects of a WHO-guided digital health intervention for depression in Syrian refugees in Lebanon: a randomized controlled trial. PLoS Med. Jun 2022;19(6):e1004025. [FREE Full text] [CrossRef] [Medline]
  15. Hall J, Jordan S, van Ommeren M, Au T, Sway RA, Crawford J, et al. Sustainable Technology for Adolescents and youth to Reduce Stress (STARS): a WHO transdiagnostic chatbot for distressed youth. World Psychiatry. Feb 2022;21(1):156-157. [FREE Full text] [CrossRef] [Medline]
  16. World Bank Country and Lending Groups. The World Bank. URL: https:/​/datahelpdesk.​worldbank.org/​knowledgebase/​articles/​906519-world-bank-country-and-lending-groups [accessed 2025-01-14]
  17. Population. Department of Statistics. URL: https://dosweb.dos.gov.jo/population/population-2/ [accessed 2025-01-14]
  18. Minority Rights Group International. World Directory of Minorities and Indigenous Peoples - Jordan : Palestinians. UNHCR. URL: https://www.refworld.org/reference/countryrep/mrgi/2008/en/65027 [accessed 2025-01-08]
  19. Jordan Operational Data Portal. UNHCR Operational Data Portal (ODP). URL: https://data.unhcr.org/en/country/jor [accessed 2025-01-08]
  20. The situation of children in Jordan: country factsheet. UNICEF. 2024. URL: https://www.unicef.org/mena/media/26706/file [accessed 2025-01-14]
  21. Jordan: youth unemployment rate from 2004 to 2023. Statista. URL: https://www.statista.com/statistics/812127/youth-unemployment-rate-in-jordan/ [accessed 2025-01-08]
  22. Aljaloudi J. Increase the state of poverty in Jordan during the period 2010-2017. SocioEconomic Challenges. 2020;4(4):39-47. [CrossRef]
  23. Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, et al. CONSORT 2010 statement: extension to randomised pilot and feasibility trials. BMJ. Oct 24, 2016;355:i5239. [FREE Full text] [CrossRef] [Medline]
  24. Teresi JA, Yu X, Stewart A, Hays R. Guidelines for designing and evaluating feasibility pilot studies. Med Care. Jan 01, 2022;60(1):95-103. [FREE Full text] [CrossRef] [Medline]
  25. Easton SD, Safadi N, Wang Y, Hasson R. The Kessler psychological distress scale: translation and validation of an Arabic version. Health Qual Life Outcomes. Oct 27, 2017;15(1):215. [FREE Full text] [CrossRef] [Medline]
  26. Kessler RC, Andrews G, Colpe L, Hiripi E, Mroczek D, Normand SLT, et al. Short screening scales to monitor population prevalences and trends in non-specific psychological distress. Psychol Med. Aug 2002;32(6):959-976. [CrossRef] [Medline]
  27. Self-Help Plus (SH+): a group-based stress management course for adults. World Health Organization. Sep 23, 2021. URL: https://www.who.int/publications/i/item/9789240035119 [accessed 2025-01-08]
  28. Mental Health Gap Action Programme (mhGAP) guideline for mental, neurological and substance use disorders. World Health Organization. 2023. URL: https://www.who.int/publications/i/item/9789240084278 [accessed 2025-01-08]
  29. Derogatis LR, Lipman RS, Rickels K, Uhlenhuth EH, Covi L. The Hopkins Symptom Checklist (HSCL): a self-report symptom inventory. Behav Sci. Jan 1974;19(1):1-15. [CrossRef] [Medline]
  30. Selmo P, Koch T, Brand J, Wagner B, Knaevelsrud C. Psychometric properties of the online Arabic versions of BDI-II, HSCL-25, and PDS. European Journal of Psychological Assessment. 2019;35(1):46-54. [FREE Full text] [CrossRef]
  31. Mahfoud Z, Kobeissi L, Peters T, Araya R, Ghantous Z, Khoury B. The Arabic validation of the Hopkins Symptoms Checklist-25 against MINI in a disadvantaged suburb of Beirut, Lebanon. The International Journal of Educational and Psychological Assessment. 2013;13(1):17-33. [FREE Full text]
  32. Measuring health and disability: manual for WHO Disability Assessment Schedule (‎WHODAS 2.0)‎. World Health Organization. URL: https:/​/www.​who.int/​publications/​i/​item/​measuring-health-and-disability-manual-for-who-disability-assessment-schedule-(-whodas-2.​0) [accessed 2025-01-08]
  33. Federici S, Bracalenti M, Meloni F, Luciano JV. World Health Organization disability assessment schedule 2.0: An international systematic review. Disabil Rehabil. Nov 2017;39(23):2347-2380. [FREE Full text] [CrossRef] [Medline]
  34. Akhtar A, Cuijpers P, Morina N, Sijbrandij M, Bryant R. Exploratory and confirmatory factor analysis of the 12-item Arabic World Health Organization Disability Assessment Schedule (WHODAS 2.0) as a screening tool for Syrian refugees. BJPsych Open. Oct 11, 2021;7(6):1. [FREE Full text] [CrossRef]
  35. Ashworth M, Shepherd M, Christey J, Matthews V, Wright K, Parmentier H, et al. A client‐generated psychometric instrument: The development of ‘PSYCHLOPS’. Couns and Psychother Res. Aug 18, 2006;4(2):27-31. [FREE Full text] [CrossRef]
  36. de Graaff AM, Cuijpers P, Twisk J, Kieft B, Hunaidy S, Elsawy M, et al. Peer-provided psychological intervention for Syrian refugees: results of a randomised controlled trial on the effectiveness of Problem Management Plus. BMJ Ment Health. Feb 2023;26(1):e300637. [FREE Full text] [CrossRef] [Medline]
  37. Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167-176. [CrossRef] [Medline]
  38. Kassab Alshayea A. Development and validation of an Arabic version of the World Health Organization Well-Being Index (WHO-5). J Psychopathol Behav Assess. Feb 01, 2023;45(1):247-255. [CrossRef]
  39. Snyder CR, Sympson S, Ybasco F, Borders T, Babyak M, Higgins R. Development and validation of the State Hope Scale. J Pers Soc Psychol. Feb 1996;70(2):321-335. [CrossRef] [Medline]
  40. Boß L, Lehr D, Reis D, Vis C, Riper H, Berking M, et al. Reliability and validity of assessing user satisfaction with web-based health interventions. J Med Internet Res. Aug 31, 2016;18(8):e234. [FREE Full text] [CrossRef] [Medline]
  41. Harper Shehadeh MJ, Abi Ramia J, Cuijpers P, El Chammay R, Heim E, Kheir W, et al. Step-by-Step, an e-mental health intervention for depression: a mixed methods pilot study from Lebanon. Front Psychiatry. 2019;10:986. [FREE Full text] [CrossRef] [Medline]
  42. de Graaff AM, Cuijpers P, McDaid D, Park A, Woodward A, Bryant RA, et al. Peer-provided Problem Management Plus (PM+) for adult Syrian refugees: a pilot randomised controlled trial on effectiveness and cost-effectiveness. Epidemiol Psychiatr Sci. Aug 18, 2020;29:1. [CrossRef]
  43. Andrews G, Slade T. Interpreting scores on the Kessler Psychological Distress Scale (K10). Aust N Z J Public Health. Dec 2001;25(6):494-497. [FREE Full text] [CrossRef] [Medline]
  44. Jakobsen JC, Gluud C, Wetterslev J, Winkel P. When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts. BMC Med Res Methodol. Dec 06, 2017;17(1):162. [FREE Full text] [CrossRef] [Medline]
  45. Linardon J, Fuller-Tyszkiewicz M. Attrition and adherence in smartphone-delivered interventions for mental health problems: a systematic and meta-analytic review. J Consult Clin Psychol. Jan 2020;88(1):1-13. [CrossRef] [Medline]
  46. Torous J, Lipschitz J, Ng M, Firth J. Dropout rates in clinical trials of smartphone apps for depressive symptoms: a systematic review and meta-analysis. J Affect Disord. Feb 15, 2020;263:413-419. [CrossRef] [Medline]
  47. Abd-Alrazaq AA, Alajlani M, Alalwan A, Bewick B, Gardner P, Househ M. An overview of the features of chatbots in mental health: a scoping review. Int J Med Inform. Dec 2019;132:103978. [FREE Full text] [CrossRef] [Medline]


AI: artificial intelligence
ANCOVA: analysis of covariance
CONSORT: Consolidated Standards of Reporting Trials
CSQ-I: Client Satisfaction Questionnaire for internet-based interventions
ECAU: enhanced care as usual
HSCL-25: Hopkins Symptom Checklist-25
IFH: Institute of Family Health
K10: Kessler Psychological Distress Scale
LMIC: low- and middle-income countries
PSYCHLOPS: Psychological Outcomes Profiles
RCT: randomized controlled trial
STARS: Scalable Technology for Adolescents and youth to Reduce Stress
WHO: World Health Organization
WHO-5: WHO-Five Well-Being Index
WHODAS: WHO Disability Assessment Schedule


Edited by J Torous; submitted 27.06.24; peer-reviewed by S Richmond, S Schueller; comments to author 04.11.24; revised version received 09.12.24; accepted 26.12.24; published 05.02.25.

Copyright

©Anne Marijn de Graaff, Rand Habashneh, Sarah Fanatseh, Dharani Keyan, Aemal Akhtar, Adnan Abualhaija, Muhannad Faroun, Ibrahim Said Aqel, Latefa Dardas, Chiara Servili, Mark van Ommeren, Richard Bryant, Kenneth Carswell. Originally published in JMIR Mental Health (https://mental.jmir.org), 05.02.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Mental Health, is properly cited. The complete bibliographic information, a link to the original publication on https://mental.jmir.org/, as well as this copyright and license information must be included.