A Digital Human for Delivering a Remote Loneliness and Stress Intervention to At-Risk Younger and Older Adults During the COVID-19 Pandemic: Randomized Pilot Trial

doi:10.2196/31586

Original Paper

¹Department of Psychological Medicine, The University of Auckland, Auckland, New Zealand

²Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand

³Soul Machines Ltd, Auckland, New Zealand

Corresponding Author:

Elizabeth Broadbent, PhD

Department of Psychological Medicine

The University of Auckland

Building 507, Level 3

22-30 Park Avenue, Grafton

Auckland, 1023

New Zealand

Phone: 64 9 923 0003

Email: e.broadbent@auckland.ac.nz

Background: Loneliness is a growing public health issue that has been exacerbated in vulnerable groups during the COVID-19 pandemic. Computer agents are capable of delivering psychological therapies through the internet; however, there is limited research on their acceptability to date.

Objective: The objectives of this study were to evaluate (1) the feasibility and acceptability of a remote loneliness and stress intervention with digital human delivery to at-risk adults and (2) the feasibility of the study methods in preparation for a randomized controlled trial.

Methods: A parallel randomized pilot trial with a mixed design was conducted. Participants were adults aged 18 to 69 years with an underlying medical condition or aged 70 years or older with a Mini-Mental State Examination score of >24 (ie, at greater risk of developing severe COVID-19). Participants took part from their place of residence (independent living retirement village, 20; community dwelling, 7; nursing home, 3). Participants were randomly allocated to the intervention or waitlist control group that received the intervention 1 week later. The intervention involved completing cognitive behavioral and positive psychology exercises with a digital human facilitator on a website for at least 15 minutes per day over 1 week. The exercises targeted loneliness, stress, and psychological well-being. Feasibility was evaluated using dropout rates and behavioral observation data. Acceptability was evaluated from behavioral engagement data, the Friendship Questionnaire (adapted), self-report items, and qualitative questions. Psychological measures were administered to evaluate the feasibility of the trial methods and included the UCLA Loneliness Scale, the 4-item Perceived Stress Scale, a 1-item COVID-19 distress measure, the Flourishing Scale, and the Scale of Positive and Negative Experiences.

Results: The study recruited 30 participants (15 per group). Participants were 22 older adults and 8 younger adults with a health condition. Six participants dropped out of the study. Thus, the data of 24 participants were analyzed (intervention group, 12; waitlist group, 12). The digital human intervention and trial methods were generally found to be feasible and acceptable in younger and older adults living independently, based on intervention completion, and behavioral, qualitative, and some self-report data. The intervention and trial methods were less feasible to nursing home residents who required caregiver assistance. Acceptability could be improved with additional content, tailoring to the population, and changes to the digital human’s design.

Conclusions: Digital humans are a promising and novel technological solution for providing at-risk adults with access to remote psychological support during the COVID-19 pandemic. Research should further examine design techniques to improve their acceptability in this application and investigate intervention effectiveness in a randomized controlled trial.

Trial Registration: Australia New Zealand Clinical Trials Registry ACTRN12620000786998; https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=380113

JMIR Ment Health 2021;8(11):e31586

doi:10.2196/31586

Keywords

COVID-19; loneliness; stress; well-being; eHealth; digital human; conversational agent; older adults; chronic illness

This study investigated the feasibility and acceptability of a digital human (DH) that delivered a psychological intervention to mitigate the effects of social restrictions on loneliness, stress, and well-being in vulnerable populations during the COVID-19 pandemic. The results will inform the design of a randomized controlled trial (RCT) to evaluate intervention effectiveness. To provide a rationale and context for the study, the introduction describes the effects of the COVID-19 pandemic on loneliness, the importance of treating loneliness, and previous work on robot and conversational agent (CA) interventions for loneliness.

Many countries have adopted socially restrictive public health measures over recent months to slow the spread of the COVID-19 pandemic, including the United Kingdom, Canada, the European Union, Japan, and Australia [1]. Precautions have included bans on mass gatherings, closure of schools and businesses, mandatory work from home conditions, and limits on social activities [2]. New Zealand adopted some of the strictest lockdown rules globally, which included 2-m physical distancing between people and staying at home, except for essential trips to a supermarket or pharmacy, or to seek medical care, with restrictions gradually eased as appropriate [3]. Individuals who were at risk of developing a severe illness should they contract COVID-19 were advised to take additional precautions to social distance and isolate [4]. This included older adults over the age of 70 years (who are at greater risk of dying from COVID-19) [5] and younger adults with an underlying medical condition who may be immunocompromised [6].

While these precautions can help protect vulnerable populations, there are mental health implications of strict social distancing, including increased loneliness [7]. Older adults and adults with underlying health conditions were already at greater risk of loneliness prepandemic [8,9], and these restrictions have exacerbated this risk. Interventions to reduce loneliness are especially important for this group given the long-term implications for health as described below.

Loneliness is a subjective psychological state in which a person perceives a mismatch between their actual and desired social relations [10]. While brief feelings of loneliness can serve as an adaptive motivator to seek social interaction, chronic loneliness has negative effects on physical and mental health outcomes [11,12]. Loneliness is associated with feelings of stress [13], which activate the body’s “fight or flight” response. The sympathetic nervous system becomes activated, and over a prolonged period, it creates negative downstream effects on the cardiovascular, neuroendocrine, and immune systems [14]. As a result, chronic loneliness has been associated with increased risks of morbidity (eg, coronary heart disease, high blood pressure, and stroke) [11] and mortality [15]. Loneliness can be improved through psychological interventions that target the following 4 key areas: changing maladaptive social cognitions, increasing social support, increasing opportunities for social interaction, and improving social skills [16].

Loneliness interventions can be delivered in-person or remotely through technology, and both have been shown to be effective [16], including for older adults [17]. In-person loneliness interventions have included individual psychotherapy involving social cognitive training as part of cognitive behavioral therapy [18], mindfulness-based therapies [19], and social support groups [20]. However, remote interventions may be more suitable for at-risk individuals in isolation as a result of the pandemic. Remote therapies for loneliness have included internet-based cognitive behavioral therapy [21] and internet skills training to access online support [22]. However, research has shown that engagement with technology-based interventions is often lower outside of a clinical trial context [23,24].

Artificial agents may be a particularly engaging way to provide psychological support to people during a pandemic. People have been shown to feel a sense of social presence with artificial agents, which can improve technology engagement [25,26]. Social robots are artificial agents with embodiment in a physical hardware form that are capable of social interaction and are programmed to autonomously interact with their physical environment [27]. CAs are artificial agents that include a dialogue system, and may or may not include a digital embodiment or face [28]. Under the umbrella term of CAs fall chatbots, embodied CAs, voice assistants, and DHs, among others. CAs may be more feasible for providing remote support than robots because they are less expensive and more scalable [29], as they can be accessed through websites or software applications on devices that many patients already own (eg, smartphones and computers).

A recent scoping review on robot-facilitated loneliness interventions found evidence supporting their use with older adults [30]. For example, Paro, a companion robot in the form of a fluffy baby harp seal, alleviated feelings of loneliness in older adults in nursing homes by providing direct companionship in a manner akin to a pet [31]. Other robots include Giraff (a telepresence robot that connects users and their families over video call [32]), MARIO (which includes a My Memories function where users can show photographs to others as a conversation starter [33]), and SYMPARTNER (which reminds people of their upcoming social engagements [34]). Social robots may also improve loneliness in younger adults [35]. Robots have been shown to be effective at delivering other kinds of psychological interventions, such as positive psychology interventions for well-being [36].

Research looking at the clinical effectiveness of CAs in health care is relatively limited, and a more robust methodology is required [28]. However, a study found that daily conversations with an animal-like embodied CA over the course of a hospital stay significantly improved loneliness in older adults [37]. Another study found that daily interactions over 1 week with a human-like embodied CA that used a proactive communication strategy improved loneliness and happiness in older adults [38].

CAs also show promise for delivering psychological therapies to improve stress and well-being; outcomes that may be worsened by chronic loneliness. Vivibot, a Facebook messenger chatbot that delivered positive psychology exercises over 4 weeks, was found to be acceptable and effective for reducing anxiety in young adults with a chronic health condition [39]. Other research has found that a Facebook messenger chatbot that delivered cognitive behavioral therapy exercises, such as mindfulness and gratitude activities, improved stress and well-being [40].

DHs are a new type of CA that use artificial intelligence to build social and emotional engagement with users [41], which could help to reduce loneliness. DHs differ from other CAs in that they are modeled off real people using Hollywood light room technology and computer-generated imagery (CGI) animation techniques [42]. This provides DHs with a very life-like appearance. In addition, DHs include a complex cognitive architecture modeled off humans and involve a digital brain with virtual neurotransmitters to influence behavior [43]. For example, while in “high oxytocin mode,” DHs show attachment and separation distress toward users, which can help to build a bond [44]. DHs use live neural networks while interacting with people to classify their emotional state, and respond to people using a combination of speech, facial behaviors, and body gestures. DHs may be a particularly promising technology to deliver a remote loneliness intervention given their engaging social abilities and scalability; all that users require to access one is a computer and an internet connection. However, as DHs are a relatively new technology, it is unknown whether they are a feasible and acceptable way to deliver a remote loneliness intervention.

This study aimed to investigate whether a DH was a feasible and acceptable method of delivering a remote loneliness and stress intervention to high-risk adults during the COVID-19 pandemic. In addition, this study evaluated the feasibility of the study methods in advance of a future definitive RCT. It was hypothesized that a DH would be a feasible and acceptable method of intervention delivery, and that the study methods would be feasible. The results will inform the design of an RCT to investigate the effectiveness of the DH intervention.

Trial Registration

This trial was reported in keeping with the CONSORT (Consolidated Standards of Reporting Trials) 2010 statement extension for randomized pilot and feasibility trials [45]. Ethics approval was obtained from the University of Auckland Human Participants Ethics Committee on July 06, 2020 (approval number: 024752). The trial was prospectively registered with the Australia New Zealand Clinical Trials Registry on August 04, 2020 (registration number: ACTRN12620000786998).

Trial Design

A randomized pilot trial was conducted involving a parallel mixed design with a waitlist control condition (1:1 allocation ratio). The primary outcomes were feasibility and acceptability, and the secondary outcomes were rapport with the DH, loneliness, stress, COVID-19 distress, positive and negative experiences, and psychological well-being. No major changes were made to the methods after commencing the trial.

DH Intervention

The DH facilitator (“Bella”) was developed by Soul Machines Ltd (Auckland, New Zealand) (Figure 1). Bella was autonomously animated and presented on a website that participants accessed from their personal computer, tablet, or smartphone. Bella was modeled to be a young adult female of mixed race (Māori and New Zealand European). She was synthesized from the visual features of several human models (ie, not modeled off a singular person). Bella was presented in front of a white background in a portrait view of her head and shoulders. Her appearance, background, and proximity to the screen remained consistent throughout the study.

Bella autonomously responded to participants’ language using a finite state conversation engine with preprogrammed responses. Bella was programmed to have some autonomous variation in her language for phrases that would not affect her intervention delivery (eg, she varied her greetings each day). Bella spoke using a computer-generated female voice with an Australian accent (“Wavenet C – female” by Google). Participants could communicate with Bella in 1 of the following 3 ways: (1) speech, (2) typing, and (3) clicking on-screen buttons (where present). Bella always responded to participants in speech; however, if participants opened the messenger window to type, they could see a typed version of Bella’s speech as well (see Figure 1 for an example). If Bella did not understand a participant’s language, she would say, “I’m sorry, I didn’t understand. Could you please repeat or reword your statement?” or similar. If she did not understand after a couple of attempts, she would redirect the participant back to her main menu.

Figure 1. The digital human's user interface when using the messenger function.

Bella engaged in human-like facial and body gestures as she spoke, including blinking, maintaining eye gaze, raising her eyebrows, and moving her head and shoulders. She showed emotional expressions on her face as she spoke to portray joy and concern, which were preprogrammed and triggered by phrases she spoke using text-to-speech emotional markup language. This involved a process of manually tagging language in her script to elicit particular facial emotions each time Bella spoke the phrase. Bella’s facial expressions were autonomously generated in real-time using visual computing and neurobehavioral modeling techniques (described in greater detail in previous reports [41-43,46]). Bella had a virtual nervous system that contained virtual neurotransmitters and live neural networks to process emotional data and inform her responses; however, these capabilities were not used in this study in order to maintain experimental control.

Bella was designed to deliver several relationship building strategies derived from psychology [47] and human-computer interaction research [48]. These included engaging in shared activities with the user, mutual self-disclosure, showing empathy, expressing the value of the friendship, and being nonjudgmental. These relationship building strategies were incorporated into Bella’s language at various points in the interaction.

Participants were informed that Bella continuously collected speech and video data in order to communicate (eg, to hear speech and to make eye contact). These data were not recorded, saved, or analyzed by the researchers. Bella’s data collection and use processes are in keeping with the European Union General Data Protection Regulation (GDPR) [49,50].

DH Intervention Content

Participants were asked to prioritize visiting the mental health and well-being content that Bella offered as part of their daily website visit. This content included evidence-based exercises to improve loneliness, stress, and psychological well-being, as described below.

The Expressing Kindness Challenge

Three challenges were delivered over 3 days and included evidence-based strategies to improve loneliness and psychological well-being. The first 2 challenges were (1) to make contact with an old friend, relative, or someone the participant had not been in touch with for a while and (2) to contact someone to let them know something that the participant appreciated about them. These tasks aimed to increase opportunities for social interaction, strengthen social support, and improve social skills. The third challenge asked the participant to make a list of 3 things that they were grateful for, as a positive psychology exercise. Each of the challenges was accompanied by examples to help the participant generate ideas (eg, on day 2, Bella told participants something that she appreciated about them). At the end of the module, participants were reminded to continue practicing kindness toward others and themselves.

The Brain and Stress Module

This module provided psychoeducation about stress and stress awareness through verbal explanations and diagrams over 1 visit (Figure 2). It covered how stress affects the body and symptoms that are associated with the stress response. The module encouraged participants to reflect on the sources of stress in their lives, and informed participants of behavioral strategies for stress management. These included educating participants about a deep breathing exercise that they could practice, and linking participants to the Headspace website [51], where participants could access audio recordings of deep breathing and meditation exercises. At the end, participants were encouraged to visit the mental health tips, which are described in further detail below.

Mental Health Tips

Six modules each focused on a separate psychological well-being tip. The tips encouraged social connection, exercise, acknowledging feelings, being mindful of anxiety-provoking news media consumption, doing activities that elicit positive emotions, and trying out behaviors from a self-care guide.

Other Conversation Modules

Participants were able to talk with Bella about a range of other topics beyond mental health and well-being. This included information about the COVID-19 pandemic (eg, New Zealand’s alert levels, details about the virus, symptoms and prevention, and New Zealand’s Healthline and health support resources), and information regarding business and entrepreneurship (eg, remote work and business support organizations).

Participants

Thirty participants were recruited. Participants were adults who were at greater risk of developing severe illness if they contracted COVID-19, and as a result, they were asked by the local New Zealand Government to self-isolate to a greater degree during the pandemic. They included (1) older adults aged 70 years or older and (2) adults aged 18 to 69 years who had at least one underlying medical condition that increased the risk of contracting severe COVID-19. The underlying medical condition could have included a serious respiratory disease (such as a chronic lung disease or moderate to severe asthma), a serious heart condition, an immunocompromised condition (such as cancer treatment, smoking-related illness, bone marrow or organ transplantation, hematologic neoplasms, immune deficiency, uncontrolled HIV or AIDS, and prolonged use of corticosteroids and/or other immune-weakening medications such as disease-modifying antirheumatic drugs), a BMI of 40 or higher, diabetes, chronic kidney disease, dialysis, liver disease, and/or pregnancy at the third trimester stage. Participants were required to have English fluency, and access to a computer and internet connection at home. Participants who were 70 years or older were required to achieve a score of 25 or higher on the Mini-Mental State Examination (MMSE). Potential participants were excluded if they received a score of 24 or lower on the MMSE, which would indicate cognitive decline to a moderate or greater degree. Participants aged 70 years or older were not excluded on the basis of whether or not they had an underlying health condition, as their age placed them at a higher risk of developing severe COVID-19.

Twenty-two older adult participants (aged 70 years or older) were recruited from 5 Summerset retirement village sites around the greater Auckland area. Recruitment methods involved presentations to residents about the research, email flyers, and caregiver word of mouth. Residents approached the research team if they were interested in participating. Eligibility screening involving the MMSE and an informed consent procedure (for those who were eligible) were conducted in-person at the retirement village with a member of the research team.

Eight younger adult participants (aged 18-69 years with an underlying medical condition) were recruited from a flyer posted to a staff email list at the University of Auckland, in addition to targeted Facebook advertising, word of mouth, and a Summerset retirement village presentation. Younger adults interested in taking part completed an eligibility screen and informed consent procedure online via a survey website (Qualtrics), except 1 participant who was recruited from a retirement village presentation. This participant completed an eligibility screen and informed consent procedure in-person.

A recruitment target of 30 participants was set, as a minimum of 12 participants per group is recommended for a feasibility study due to precision about the mean and variance [52], and to allow for 20% attrition. Recruitment took place between November 11, 2020, and March 04, 2021, with a 3-week break from late December. Recruitment stopped once the quota of 30 participants had been reached.

Data were collected from online questionnaires using Qualtrics, which participants completed from their place of residence. For older adult participants, this may have included completion from a Summerset retirement village independent living villa or apartment, or from the nursing home facility. For younger adult participants, participation took place online from their place of residence in the community or a Summerset care home facility. Data collection took place between November 16, 2020, and March 11, 2021. All participants in the study were provided with a NZ $30 (US $21.50) shopping voucher for their involvement in the research.

Randomization

Participants were randomly allocated to an intervention or waitlist control group by a member of the research team (EB) (1:1 allocation ratio). Simple randomization was performed using a computerized sequence generation software called Research Randomizer. Allocations were concealed in sealed opaque envelopes from the researchers who enrolled participants (KL and IP) until after participants were enrolled and allocated an ID code. At this point, the researcher was deblinded to assign participants to conditions and provide participants with the appropriate instructions. Participants were deblinded after their assignment to conditions.

Procedure

Once enrolled, participants were contacted over email with instructions for proceeding in the trial. For nursing home residents, their caregiver was copied in the email communications and facilitated the participant’s involvement in the study.

All participants completed an online baseline questionnaire on day 1 of their participation. Then, participants in the intervention group completed a DH training session with a member of the research team for 30 minutes. For all older adults (plus 1 younger adult participant), this took place in-person at their retirement village or nursing home facility. For 7 of 8 younger adults, this took place either in-person at the University of Auckland Clinical Research Centre or online over Zoom video conferencing software (Zoom Video Communications), depending on the lockdown conditions.

All participants received the same technology training, which involved learning how to interact with Bella and completing “day 1” of their intervention week with the researcher present to answer questions. The researcher ensured that the software worked on each participant’s computer. Participants were provided with written instructions and pictures of the user interface that summarized the training session content. For sessions over Zoom videoconferencing, the screen share feature was used and participants received a PDF copy of the interaction instructions. Three participants were trained over Zoom, and 23 participants were trained in-person.

Participants were asked to interact with Bella for at least 15 minutes per day over 1 week. Participants visited Bella’s website independently from their place of residence. The daily 15 minutes could include time spent interacting with Bella and doing therapy activities (eg, a deep breathing exercise). They were asked to prioritize completing the mental health and well-being modules before visiting other content. Participants interacted with Bella at their chosen time of day. Participants were sent a daily text reminder to engage in the intervention and were informed that they could text back to receive technical support.

On day 8, the intervention group finished their intervention week and completed an online postintervention questionnaire. One week later, on day 15, intervention group participants filled out an online follow-up questionnaire.

For participants in the waitlist control group, the order of the procedure was slightly different. Participants in the waitlist group completed an online baseline questionnaire on day 1 and then waited for 1 week. On day 8, waitlist participants completed a second online questionnaire, completed the technology training session, and began their intervention week. On day 15, at the end of their intervention week, waitlist participants completed the postintervention questionnaire.

Measures

Figure 3 depicts the time points at which each measure was administered. Questionnaires were administered online using Qualtrics, a secure survey website.

Figure 3. Time points for assessments. PSS-4: Perceived Stress Scale 4 items; SPANE: Scale of Positive and Negative Experiences.

Feasibility Measures

Feasibility of the DH Intervention

Observations were made by a member of the research team (KL) about how the intervention was used (eg, independently or with the aid of a caregiver), along with dropout rates and reasons. Observations were also made regarding the feasibility of the technology training methods for younger and older adults, and nursing home residents. Instances were recorded where participants refused to receive training through a particular delivery method (eg, video calling).

Feasibility of the Study Methods

Observations were recorded during recruitment and data collection by a member of the research team (KL). Observations pertained to the success rate of different recruitment strategies for younger and older adult participants, and challenges associated with data collection from the online forms that participants completed independently.

Acceptability Measures

Acceptability of the DH

Bella’s acceptability was measured using quantitative self-report items and open-ended qualitative questions designed for the study. Behavioral engagement data were also collected. The acceptability measures are outlined in further detail below.

Self-Report Items

Participants were asked to rate whether (1) they felt Bella was helpful for promoting resilience and psychological well-being, (2) they felt Bella was helpful for improving feelings of loneliness, and (3) they would be willing to use Bella again in the future, using a 7-point scale with response anchors (1, “definitely no” to 7, “definitely yes”).

Qualitative Responses

Participants provided written responses to the following open-ended questions: What did you like most about Bella? and How do you think Bella could be improved? These questions were intended to provide an overall indication of Bella’s acceptability and to identify aspects of the technology that could be improved.

Behavioral Engagement

Behavioral engagement with Bella over 1 week was evaluated by retrospective self-report. Participants reported on which days of the week they visited Bella and estimated approximately how long they used Bella each day in minutes.

Acceptability of the Intervention Content

The acceptability of each psychological intervention module was evaluated separately in the postintervention questionnaire. Participants rated how much they liked the brain and stress module, Headspace (if they visited), and the expressing kindness challenge (including each of its 3 activities) on a 7-point scale with response anchors (1, “not at all” to 7, “very much”). Participants rated how beneficial they found the expressing kindness challenge for well-being, and how well they felt the brain and stress module improved their understanding of the stress response on a 7-point scale (1, “not at all” to 7, “very much”). Participants who visited Headspace were asked whether they felt that Headspace was a helpful resource to link to with a dichotomous yes/no response option. The participants were also asked the following qualitative question: Were there any particular topics that you would have liked to talk about with Bella, which were not available? Participants provided written responses. Self-reported behavioral engagement data were collected on whether participants visited each module and whether they did the activity that the module asked of them.

Rapport With the DH

Rapport with Bella was measured using the 20-item Friendship Questionnaire developed by Johanson et al [53], with items adapted to suit a DH. It is comprised of items taken from multiple friendship scales, including the McGill Friendship Questionnaire, the McGill Friendship Questionnaire Functions scale, the Interactant Satisfaction Survey, and the Acquaintance Description Form-F2 [53]. Participants indicated their agreement with each item using a 5-point Likert scale from 1 (“strongly disagree”) to 5 (“strongly agree”). Responses were summed to derive a total score from 20 to 100, where a higher score indicated greater rapport. The friendship questionnaire has been shown to have good internal consistency reliability when used to evaluate a social robot in a New Zealand adult sample (α=.94) [53]. The scale showed good internal consistency reliability in this study sample when adapted for use with a DH (α=.95). The adapted scale has been included in Multimedia Appendix 1.

Loneliness

Loneliness was measured using the 20-item UCLA Loneliness Scale (Version 3) [54]. Participants rated how often they felt the way described in each statement using a 4-point scale. Responses could range from 1 (“never”) to 4 (“always”). Items were reverse coded where appropriate, and responses were summed to derive a total score from 20 to 80. A higher score indicated greater perceived loneliness. This scale was developed with language to improve readability and has demonstrated acceptable psychometric properties with older adults. This includes good internal consistency reliability (α=.89), discriminant validity with social support, and construct validity [54].

Psychological Stress

Perceived stress was measured using the 4-item Perceived Stress Scale (PSS-4) [55], which evaluated the degree of stress participants felt over the past week using a 5-point scale (0, “never” to 4, “very often”). Items 2 and 3 were reverse coded, and all responses were summed to form a total score from 0 to 16. A higher score indicated greater perceived stress. Although the psychometric properties of the PSS-10 and PSS-14 have been shown to be superior, the PSS-4 was chosen as it is a shorter measure of perceived stress that reduces participant burden and has adequate internal consistency reliability [56].

COVID-19 Distress

Worry about contracting COVID-19 was measured using a 1-item scale [57]. The scale evaluated participants’ degree of worry over the past week on a 4-point scale as follows: 0, “I do not worry about getting COVID-19;” 1, “I occasionally worry about getting COVID-19;” 2, “I spend much of my time worrying about getting COVID-19;” and 3, “I spend most of my time worrying about getting COVID-19.”

Positive and Negative Affect

The Scale of Positive and Negative Experiences (SPANE) has two 6-item subscales that measure positive emotions (SPANE-P) and negative emotions (SPANE-N) [58]. The subscales measured the extent to which positive or negative emotions were experienced over the past week using a 5-point scale (1, “very rarely or never” to 5, “very often or always”). For each subscale, responses were summed, and a total score was derived ranging from 6 to 30. A higher score indicated stronger positive or negative affect, depending on the subscale. Affect balance scores (SPANE-B) were calculated, which indicate the participant’s balance of positive and negative affect from −24 to 24, where positive scores indicate more positive than negative affect during the period. The scale has good internal consistency (SPANE-B: α=.89; SPANE-P: α=.87; SPANE-N: α=.81) and convergent validity [58].

Psychological Well-Being

Psychological well-being was measured using the 8-item Flourishing Scale [58]. Participants were asked to rate their perceived success across items pertaining to different aspects of psychological well-being, including purpose, relationships, self-esteem, and optimism, using a 7-point Likert scale (1, “strong disagreement” to 7, “strong agreement”). Responses were summed to derive a total well-being score between 8 and 56. Higher scores indicated greater well-being. The Flourishing Scale has been shown to have good psychometric properties including convergent validity and discriminant validity. It has also been shown to have good reliability and validity in a nationally representative New Zealand sample [59].

Data Analysis

Quantitative Data

Data were analyzed using SPSS software (version 27; IBM Corp). Missing data were addressed by imputing the mean score of the participant’s other responses to the scale at the timepoint. For 1-item scales, where it was not possible to impute a score or where the participant did not complete a full scale, the participant’s data were excluded from analysis of the relevant variable.

Baseline demographic and psychological variables were calculated for the overall sample, and compared between groups using chi-square tests and independent samples t tests. Average acceptability and rapport scores were calculated for the overall sample, and independent samples t tests were conducted to compare group means. A series of mixed factorial analyses of variance (ANOVA) were conducted to evaluate the main and interaction effects of condition and time on psychological outcomes. Data were checked for violations of test assumptions. Greenhouse-Geisser–adjusted values were reported for data where sphericity assumptions were violated (COVID-19 distress, SPANE-P, and SPANE-B). Exploratory pair-wise comparisons with Bonferroni corrections were conducted as follow-up analyses for significant or trending effects.

Qualitative Data

Written responses to 3 open-ended questions were analyzed using reflexive thematic analysis [60], which is theoretically flexible and suitable for analyzing the content of language data. One member of the research team (KL) conducted the analysis in keeping with recommendations by Braun & Clarke [60], using the following steps: (1) familiarization with the data, (2) coding, (3) generating initial themes, (4) reviewing themes, (5) defining and naming themes, and (6) writing results. An inductive approach was taken whereby coding and theme development were informed by the content of the data. As part of the theme development in stages 3, 4, and 5, themes and subthemes were checked against the original data set and each other to ensure that they were internally coherent (ie, organized around a clear central concept), consistent, and distinctive. Themes and subthemes were split or combined during the review process (stage 4) to improve specificity. All coded data for each theme and subtheme were collated to assist with result writeup. Data were combined across groups as both received the same intervention.

Participants

Participants were predominantly female (24/30, 80%) and Caucasian (22/30, 73%), and mainly had high school or less education (14/30, 47%). Half of the sample (15/30, 50%) reported an underlying medical condition. Participants reported low levels of loneliness (mean 37.79, SD 9.90) and stress at baseline (mean 3.86, SD 2.88). Participant characteristics at baseline are reported in Multimedia Appendix 2. A CONSORT diagram depicts participant flow through the study in Figure 4.

Figure 4. CONSORT (Consolidated Standards of Reporting Trials) diagram of participant flow.

Feasibility of the DH Intervention

DH Training Method

Older adults required technology training to be completed in-person at their retirement village with a member of the research team (KL or IP). This method worked well as it avoided any discomfort with using the video calling software. One older adult was offered training over the Zoom video calling software during the lockdown period and refused as she was not able to use the software. Only 1 of 22 older adult participants did not have a webcam as part of their computer, which was uncovered during the technology training (the DH website requires a webcam). To solve this, a webcam was borrowed from the retirement village reception and installed by a member of the research team (IP) during the training session.

Younger adult participants were generally able to be trained either in-person from a clinic room at the university (outside of the lockdown period) or online over Zoom (during the lockdown period). Video calling did not appear to impact the effectiveness of the training. Technical support requests were low for younger adults during the study, irrespective of how their training was delivered. One younger adult participant who was a nursing home resident required in-person technology training.

Dropout

Six participants withdrew from the study (all older adults). The reasons for withdrawal were as follows: (1) the Wi-Fi speed at the retirement village location was too slow for Bella to load properly (n=2); (2) cognitive health difficulties interfered with understanding study instructions (n=1); (3) the participant was too busy to take part after enrollment (n=2); and (4) technology training could not be scheduled (n=1).

Intervention Completion

Twenty-four participants completed the intervention, 22 of whom completed it independently after training. Two participants (1 younger adult and 1 older adult) who were both nursing home residents required caregiver assistance to access the website and interact with Bella.

Feasibility of Study Methods

Feasibility of Recruitment Methods

The majority of older adult participants were recruited through information sessions held at retirement villages (21/22, 96%). One older adult participant was recruited through an email flyer sent by a village staff member to residents.

For younger adults, the most effective recruitment method was by advertisement to a university junk email list (5/8, 63%). Facebook advertising and word of mouth each resulted in recruitment of 1 participant, and 1 participant was recruited from an information session at a nursing home.

Feasibility of Assessment Delivery Methods

Most participants completed assessments online using Qualtrics without significant issue. Two participants reported instances where they were unsure whether their responses had been submitted. Nursing home residents were unable to complete assessments independently on a website and required caregiver assistance.

Acceptability of the DH

Overall, participants reported that Bella was somewhat helpful for promoting resilience and psychological well-being (mean score 4.39 out of 7, SD 1.83) and for improving any feelings of loneliness (mean score 4.09 out of 7, SD 1.76), as responses on average were above the mid-point. Participants were somewhat willing to use Bella again in the future (mean score 4.09 out of 7, SD 1.98). Younger and older adults rated Bella similarly across the acceptability items, and no significant differences were found.

On average, participants interacted with Bella 6 out of 7 days (mean 6.23, SD 1.19). Participants interacted with Bella for approximately 20 minutes per day (mean 20.20, SD 13.95); 5 minutes longer than the 15 minutes per day requested by the researchers. The average total interaction time with Bella over 1 week was 128 minutes (mean 128.33, SD 102.77). There were no significant differences between younger and older adults in engagement behavior.

Participants identified several strengths and limitations of Bella through responses to the following 2 written open-ended questions: What did you like most about Bella? and How do you think Bella could be improved? Themes, subthemes, and representative quotes are presented below in Tables 1 and 2. Definitions of themes are presented in Multimedia Appendix 3. Overall, participants liked aspects of Bella’s appearance, speech, and interpersonal skills; the informational support Bella provided; the user experience; and the interaction with a new technology. Aspects of Bella that participants felt could be improved were the human likeness of her interaction behaviors and voice, and aspects of the conversation design (eg, more personalization and conversation topics). Some participants reported that they felt Bella would be improved with gradual advances in the underlying technology (eg, natural language understanding). Other participants reported that they would have preferred to interact with a real human, and some participants did not request any improvements.

Table 1. Themes, subthemes, and representative quotes describing what participants liked most about Bella.

Themes and subthemes		Representative quotes
Bella’s appearance
	Facial expressions	Her friendly smile. [Participant ID 106]
	Human-like	I liked the ‘human’ aspects of her. [Participant ID 124]
	Attractive face	She is attractive looking. [Participant ID 112]
	Similarity to user	I felt in some ways visually represented by Bella. [Participant ID 124]
Bella’s speech
	Gentle voice	Quite relaxing. Liked the soft voice. You can hear compassion in her voice. [Participant ID 115]
	Self-disclosure	I really appreciated how the conversation would be ‘softened’ by more personal statements from her. [Participant ID 124]
	Clear language	Clear speaking. [Participant ID 118]
Bella’s interpersonal skills
	Companionship	That she was there. [Participant ID 117]
	Nonjudgmental	Feel like you can tell her just about anything and she wouldn’t be shocked. Like talking to a priest in confession. [Participant ID 115]
	Friendly personality	Friendly and likeable. [Participant ID 119]
	Validating	Nice being told what you’re feeling is normal. [Participant ID 115]
	Calm personality	Her calmness. [Participant ID 125]
Informational support
	Quality resources	A good selection of resources. [Participant ID 126]
	Accessible delivery	Accessibility, most people would find her approachable. [Participant ID 103]
User experience
	Interaction modalities	The direct interaction. [Participant ID 130]
	User controls interaction	A good medium that allowed me to have plenty of control. [Participant ID 124]
	Easy to use	That it was easy to use. [Participant ID 127]
	Overall experience	Enjoyed the ‘experience’ of Bella and certainly a helpful person if you were lonely. [Participant ID 111]
Novel technology
	Something different	Something to do with somebody to talk to me. She was different. [Participant ID 104]
	A new technology	New technology is always intriguing, and I had heard of Bella before. [Participant ID 105]

Table 2. Themes, subthemes, and representative quotes describing what participants felt could be improved about Bella.

Themes and subthemes		Representative quotes
Interaction behaviors
	More human-like movements	The movement of Bella is still quite robotic and her eyes cannot really focus, which makes her sometimes not seem very engaged in the interaction. [Participant ID 125]
	Ability to touch	A sad thing is you can’t touch her. Make a doll out of her. A nice cuddly soft doll. [Participant ID 115]
	More positive emotional expression	Smiles. [Participant ID 118]
Conversation design
	Extra conversation topics	A wider range of subjects. [Participant ID 101]
	Greater interactivity	More interaction by talking to her, rather than just a yes or no. [Participant ID 107]
	More personalized responses	Answers need to be more individualized (e.g., welcoming the participant by name and building on each day’s responses). [Participant ID 112]
	Regularly update information	Information wasn’t as up to date (eg, COVID levels). [Participant ID 129]
	Avoid human-like backstories	I didn’t like the comments she made such as ‘I contacted my friend today.’ I found it weird that she was pretending to be real. I would have preferred if it was just accepted as an interface that had a good selection of resources that you could navigate in an interesting dynamic way. [Participant ID 126]
	Address user by name	By addressing each person by their name, that way we could feel in the moment. [Participant ID 116]
	Incorporate user’s life experience	Most older people (I am 82) have many years of life experience and perhaps some way could be found to take life experience into consideration. [Participant ID 112]
Robotic speech
	A more human-like voice	Perhaps maybe not sound so robotic? It might be hard to achieve but all the inflections and warmth that someone would have in their tone and delivery was missing and I think that’s what would have made Bella more engaging for me. [Participant ID 127]
	Formal speech delivery	Improve pronunciation. Use correct English (nope and yeah are not acceptable). [Participant ID 105]
Technology advances
	Natural language understanding	I think given advances in technology this will happen anyway. Found the response from her using the audio didn’t always work so found it easier to type the responses to her. [Participant ID 102]
	General technology advances	I guess technology will advance and make changes but pretty amazing now. [Participant ID 111]
Preference for a real human		Personally, I believe talking to a real person is far more desirable. [Participant ID 106]
No changes		I accept it for what she is. Saying that we are not all the same, she is different, she is what she is. [Participant ID 104]

Acceptability of Intervention Content