The impact of high-immersion virtual reality on foreign language anxiety
Smart Learning Environments volume 10, Article number: 46 (2023)
Public speaking, especially in a foreign language, is associated with increased anxiety. Research has shown the potential of virtual reality (VR) for simulating real-life experiences, allowing for public speaking practice in an ecological and safe environment. This between-subjects study investigated the effect of VR on foreign language anxiety (FLA) in public speaking practice. Intermediate learners of English participated in eight public speaking sessions over a three-month period, yielding 160 research observations. The experimental intervention took place in high-immersion VR with subjects wearing a VR headset and speaking in front of virtual audience. In the control intervention, subjects used a videoconferencing platform (Zoom) to speak in front of a real-life audience. Multivariate regression analysis revealed that practicing speaking in VR was associated with statistically significant lower FLA scores, compared with speaking practice using Zoom. The study found that VR technology had a positive effect on practicing public speaking in a foreign language. The research findings have practical implications for professionals and curriculum designers in various domains where public speaking skills are essential. For example, incorporating VR-based public speaking practice can benefit professionals preparing for a job interview, an elevator pitch, or a conference presentation. Curriculum designers can consider integrating VR simulations into language courses to provide students with realistic public speaking experiences. This approach can help students overcome language barriers, reduce anxiety, and develop their communication skills in a controlled and supportive environment.
Public speaking is an essential skill for many professionals (Kuai et al., 2020), yet speaking or presenting in public is associated with elevated levels of anxiety (Smith & Sodano, 2011). Speaking in public can trigger fear and the expectation of negative feedback from others (Schlenker & Leary, 1982). Public speaking anxiety is also common to language learners who are concerned about being misunderstood or ridiculed, due to their accent, limited vocabulary, or grammatical errors. This particular type of anxiety is called foreign language anxiety (FLA), and it has been investigated extensively in language learning research (see Jin et al., 2021; Li et al., 2021), seeking to help language learners reduce FLA or learn how to cope with it.
Some evidence exists that the use of technology can help reduce FLA (Stupar-Rutenfrans et al., 2017). Decreased speaking anxiety, relative to face-to-face interactions, may be due to technology creating a shield or comfort zone. Facing a real person while speaking may evoke undesirable emotions, including anxiety. Examples of technologies used by language educators for speaking practice are videoconferencing tools (e.g., Zoom, Skype), virtual worlds (e.g., Second Life), and high-immersion virtual reality (VR; e.g., Vtime XR, Immerse). The factors influencing the level of anxiety vary depending on which technology is used and, for instance, whether the speaking takes place in front of a real person, on Zoom, or in front of a virtual human in VR. Which platform, video-based or VR, is more effective for public speaking practice for language learners remains unknown.
Recent advances in VR technology and its growing availability have provided researchers with a novel experimental method, combining high ecological validity with experimental control (Parsons, 2015). VR mimics real world interactions, simultaneously sustaining experimental control needed for neurophysiological data collection (Tromp et al., 2018). The VR technology further offers new ways to practice public speaking, including speaking in a foreign language with the intention to reduce FLA.
This paper reports on an empirical, between-subjects design study of public speaking in a foreign language, which investigated settings that could reduce speakers’ FLA. The research question examined differences between using VR (the treatment group) and Zoom (the control group). The impact of VR on public speaking in a foreign language compared with an online format (e.g., Zoom) that was measured based on multiple VR interventions is understudied. Therefore, findings and implications of this study contribute to the body of knowledge in the educational technology, computer-assisted language learning, and VR-assisted language learning.
Virtual reality is a rapidly developing technology that allows users to experience simulations of real-life experiences. Two main types of VR exist (Kaplan-Rakowski & Gruber, 2019; Makransky & Petersen, 2021; Xie et al., 2019). One is low-immersion VR, with experiences taking place on a desktop monitor. The other is high-immersion VR, with experiences occurring within “a computer-generated 360° virtual space that can be perceived as being spatially realistic, due to the high immersion afforded by a head-mounted device” (Kaplan-Rakowski & Gruber, 2019, p. 552). The main distinction between the two types of VR lies within the degree of immersion available. High-immersion VR offers a higher sense of presence, and authenticity compared with low-immersion VR. In this study, we focus solely on high-immersion VR.
Immersion happens when users get involved in VR to such a point that they lose their awareness of time and the real world (Radianti et al., 2020). Some researchers posit that immersion is a technological capability of VR, with objective assessment being possible (Slater & Wilbur, 1997). The technological view is that the degree of immersion experienced by the user is determined by technological attributes, such as display resolution (Bowman & McMahan, 2007). From a psychological viewpoint, the degree of immersion is individualistic but simultaneously based on, as well as restricted by, technological attributes of the VR system (Mütterlein, 2018).
According to Slater (2018), presence can be defined as “the illusion of being there, notwithstanding that you know for sure that you are not” (p. 432). The sense of presence is elevated when a VR scenario triggers emotions (Diemer, et al., 2015). The illusion of presence is mainly a perceptual, not a cognitive, concept because VR triggers the perceptual system first, and the cognitive system reacts subsequently (Slater, 2018).
Slater (2009) showed that when a virtual human and a real human look at each other, the real human has a physical response, such as a change of heart rate, which is an indication that the particular situation triggered internal feelings. Similarly, some participants may feel that their experience is really happening; that is, a plausibility illusion takes place (Gruber & Kaplan-Rakowski, 2020; Slater, 2009). Consequently, participants’ responses reflect real-life behavior. For instance, a learner speaking in front of virtual classmates and addressing them with “Hello, class” is an indication that the learner is reacting to the situation as if the classmates were real (Gruber & Kaplan-Rakowski, 2020).
Virtual reality and language learning
Until 2015, literature on VR and computer-assisted language learning mainly covered studies on learning in the virtual world Second Life (e.g., Lin & Lan, 2015; Melchor-Couto, 2017), which is low-immersion VR. A synthesis of literature on VR for foreign language learning from 2015 to 2018 shows a wealth of studies on low-immersion VR, but a limited number regarding the use of high-immersion VR (Dhimolea et al., 2022). Technological advances have offered VR that is increasingly immersive and authentic, prompting foreign language learning researchers to explore the effect of using high-immersion VR for language learning (Gruber & Kaplan-Rakowski, 2020; Gruber et al., 2023; Kaplan-Rakowski & Gruber, 2021; Lan & Grant, 2021; Papin & Kaplan-Rakowski, 2022; Taguchi, 2021; Thrasher, 2022).
Various language-specific aspects in VR settings have been explored, for example, vocabulary (Alfadil, 2020; Papin & Kaplan-Rakowski, 2022; Vázquez et al., 2018), listening (Tai et al., 2020; Ye & Kaplan-Rakowski, 2023), reading (Kaplan-Rakowski & Gruber, 2022, 2023), writing (Barrett et al., 2021; Dolgunsöz et al., 2018), and culture (Cheng et al., 2017). Developing speaking skills in VR has also gained attention (Dooly et al., 2023; Gruber & Kaplan-Rakowski, 2020; Kaplan-Rakowski & Gruber, 2021; Nobrega & Rozenfeld, 2019; Thrasher, 2022; Xie et al., 2019). Another step forward was an exploration of how VR could improve communicative skills (Dooly et al., 2023; Yang et al., 2020) and contributions of ways to reduce FLA (Gruber & Kaplan-Rakowski, 2020; Thrasher, 2022; York et al., 2021).
Foreign language anxiety
Foreign language anxiety is referred to as “the feeling of tension and apprehension specifically associated with second language contexts, including speaking, listening, and learning” (MacIntyre & Gardner, 1994, p. 284). Language scholars have invested substantial effort into studying FLA (e.g., Horwitz et al., 1986; MacIntyre & Gardner, 1991; Zhang, 2019). FLA is a common phenomenon, negatively influencing second language acquisition (Gardner & MacIntyre, 1993) and considered potentially face-threatening for language learners (Dörnyei, 2001). Horwitz (2017) pointed out that individuals experiencing language anxiety have the trait of feeling state anxiety when learning or using a language. Sometimes a mere thought of having to use a foreign language may trigger anxiety. State anxiety is the level of anxiety a person feels from moment to moment while trait anxiety describes relatively consistent individual variations in anxiety propensity (Spielberger et al., 1970). A meta-analysis (Botes et al., 2020) of studies on the impact of FLA on academic performance confirmed earlier research (Teimouri et al., 2019) that anxiety affects achievement in all language skills (i.e., reading, writing, listening, and speaking). The impact of FLA on speaking exists in both in-person and online settings (Pichette, 2009; Russell, 2018).
Virtual reality and foreign language speaking anxiety
Researchers have made substantial efforts to study the effect of different computer-mediated communication modalities on FLA (Melchor-Couto, 2017). The means employed included audio and videoconferencing platforms (Hampel & Baber, 2003) and virtual worlds (Dickey, 2005). Study results using low-immersion VR have been inconsistent regarding the effectiveness of computer-mediated communication on FLA (Toyama & Yamazaki, 2021). The general finding of the studies testing the impact of low-immersion VR on FLA is that interacting in a virtual environment allows for shielding through a personal avatar (Kruk, 2020). Given that high-immersion VR can immerse learners in authentic learning settings (Kaplan-Rakowski & Gruber, 2021), VR has become an additional viable platform for language learners to practice speaking.
In computer-generated VR, users interact with avatars. VR offers language learners an environment in which they can make mistakes without feeling embarrassed because they are represented by an avatar and do not show their face. As a result, language learners are more likely to be willing to speak (Yang et al., 2020) and the anonymity in VR is likely to reduce FLA. This affordance of VR sets it apart from videoconferencing platforms which, when web cameras are activated, expose speakers’ real faces.
To our knowledge, as of 2023, only three studies investigated how high-immersion VR impacts FLA: Gruber and Kaplan-Rakowski (2020), Thrasher (2022), and York et al., 2021. In a qualitative study by Gruber and Kaplan-Rakowski (2020), 12 university students gave eight presentations in a foreign language (English) in VR. The intention of the study was to explore the potential of risk-free VR technology to simulate a high-anxiety setting such as a virtual classroom. The researchers studied subjects’ perceptions of the VR environment, the behavior of the virtual humans, the realism of the experiences, and the subjects’ attitudes toward the VR speaking practice. Through the analysis of post-intervention semi-structured interviews, the researchers concluded that speaking practice in VR has the potential to reduce FLA because the subjects perceived the sense of presence and the plausibility illusion of high-immersion VR as useful aspects of speaking practice.
Intermediate learners (N = 25) of French in the study by Thrasher (2022) completed oral production tasks over eight weeks. One group interacted in VR while another group interacted in a face-to-face format. The analysis of scores based on self-reported FLA questionnaires provided preliminary evidence that speaking in VR alleviates levels of FLA, making VR an attractive platform for language practice.
York et al. (2021) tested differences in the level of FLA of Japanese undergraduate learners of English (N = 30) while they practiced speaking using audio, video, or VR. The analysis of FLA and post-experiment questionnaires yielded no statistically significant differences between the conditions. Each condition equally diminished FLA, with learners reporting to be most entertained and motivated by the VR condition.
In sum, the existing studies (Gruber & Kaplan-Rakowski, 2020; Thrasher, 2022; York et al., 2021) show that high-immersion VR is a promising setting for language learners to practice public speaking and reduce their FLA. The study by Gruber and Kaplan-Rakowski (2020) was limited to qualitative analysis of interviews, focusing on public speaking in a foreign language in VR. The study design did not include a control group. We complement their study by adding a quantitative analysis of subjects’ self-reported measures of FLA, comparing them with a control group speaking on Zoom.
The study by Thrasher (2022) was of an exploratory nature and the types of interventions differed from ours. Thrasher compared VR versus face-to-face interventions, while our study compared VR versus Zoom interventions, making both studies contribute to research in a unique and complementary way.
While in York et al. (2021) subjects spoke only once, our subjects had multiple exposures to VR speaking practice. A systematic review of language research in VR by Dhimolea et al. (2022) showed that learners need multiple exposures to VR content to detect significant differences between conditions. In our study, the subjects participated in four speaking sessions, giving two presentations per session, which provided a total of eight opportunities to practice speaking and to measure FLA. In addition, we contribute with a rigorous methodology by employing a fixed effects regression model that is capable of correcting for within-subject and within-session patterns.
Drawing on the existing literature on potential affordances of VR for reducing FLA (Gruber & Kaplan-Rakowski, 2020; Thrasher, 2022; York et al., 2021), and on the theoretical foundations of immersion along with the sense of presence, this study explored the potential of VR to create a viable environment for language learners to engage in speaking practice. The main research question guiding our study was: “Is practicing speaking in high-immersion VR associated with lower FLA scores, as compared with practicing speaking using a videoconferencing tool, Zoom?” In this study, we use “Zoom” as a generic term for videoconferencing.
Zoom is a collaborative, videoconferencing platform that facilitates online meetings. It allows for synchronous communication where individuals can interact using video, sound, and chat. Zoom imitates face-to-face interactions as both verbal and nonverbal cues can be used for communication. Language learners can use Zoom for synchronous speaking practice, for instance, in intercultural online collaborations, where language learners can negotiate for meaning and practice conversational skills with their language partners. Research on online synchronous speaking practice and its impact on FLA (Fondo & Jacobetty, 2020) is growing. However, little empirical research exists on FLA when using Zoom for practicing public speaking in a foreign language. Unlike in VR, interlocutors on Zoom are typically humans who are synchronously in the virtual room. The presence of other humans might negatively influence speakers’ anxiety levels.
We hypothesize that speaking in VR in front of virtual humans reduces anxiety because practicing in front of virtual humans, instead of real people, provides a comfort zone for learners to make mistakes which might reduce embarrassment or a feeling of humiliation.
This experimental, between-subjects study used a repeated measures research design yielding quantitative data on subjects’ (N = 20) self-reported FLA. The subjects in the experimental group (n1 = 12) practiced speaking in a foreign language in high-immersion VR (see Fig. 1), while the subjects in the control group (n2 = 8) practiced public speaking on Zoom. Both groups practiced speaking on eight occasions over a three-month period. The FLA questionnaire scores served as a dependent variable, and the independent variable was the speaking setting (VR versus Zoom). The study generated 160 observations, drawn from 20 subjects and 8 presentations. The regression modeling included fixed effects to account for repeated observations of each subject (Wooldridge, 2002).
The study was advertised via newsletter and e-mail to approximately 600 students from Heilbronn University of Applied Sciences in Spring 2019. The researcher contacted the study volunteers on a first-come-first-serve-basis, and then followed with scheduling the volunteers’ speaking practice sessions. Initially, we targeted a sample of about 30 learners. However, only 20 learners were able to fulfil the requirement of attending all the public speaking sessions.
The experimental condition was applied to 12 subjects (10 male, two female) consisting of ten native speakers of German, one of French, and one of Korean. The control condition was applied to eight students (three male, five female) from the same university in different study programs. The participants’ mean age was 21.12 and they had studied English for 10 years on average. The proficiency level of English ranged from B1 to B2 on the Common European Framework of Reference. The study followed the ethics standards as stated in the Declaration of Helsinki. The participants received monetary incentives for completing all the study steps.
Participants in both the VR and the Zoom conditions followed comparable procedures. They completed the same number and types of questionnaires and spoke on the same topics for the same duration. What differed was the setting (either VR or Zoom) where they spoke. VR sessions took place in a VR laboratory at the university. Zoom sessions took place remotely.
After signing consent forms, all study participants proceeded with the following seven steps:
Completing a demographic questionnaire,
Completing a pre-intervention FLA questionnaire,
Speaking on topic #1,
Completing a post-intervention FLA questionnaire after discussing topic #1,
Speaking on topic #2,
Completing a post-intervention FLA questionnaire after discussing topic #2.
Steps 2–7 constitute a session. The sessions with each participant were scheduled individually. Under both VR and Zoom conditions, participants did not receive any prompts or preparation time. Instead, in a straightforward manner, the researcher initiated the intervention with “Could you talk about [topic #1]?”.
Approximately two minutes into speaking, the researcher said, “Now, could you talk about [topic #2]?” One topic differed from session to session and was intended to be unexpected so as to induce foreign language speaking anxiety. The other topic remained constant throughout the sessions.
All 20 subjects participated in four sessions, each scheduled on separate days. Two topics were discussed per session, drawing from a pool of topics which included: (1) communication and the internet, (2) healthy living, (3) hobbies and free time, (4) shopping and money, and (5) cities and countryside. Altogether, the study yielded 160 oral presentations and 480 FLA questionnaire responses. Each presentation was associated with three FLA questionnaires. Because each presentation took up to two minutes, the total presentation time for all the subjects was 320 min. We implemented a two-minute time limit for presentations to accommodate the needs of foreign language learners, aligning with the duration of established speaking exams like TELC English B2 of the Common European Framework of Reference. Additionally, given the difficulty for speakers at B2 level to sustain longer monologues on a single topic, the shorter time limit allowed for focused practice and challenged students to perform effectively within a confined timeframe.
During the VR sessions, the researcher indicated a change of topic by raising her hand after taking over one of the avatars, which was done by projecting her movements onto the virtual human, using a motion-tracking camera (Microsoft Kinect v2). The researcher talked to the participants behind a soundproof partition, and the participants heard her voice through headphones. At the end of each session, we conducted interviews on participants’ experience with VR and Zoom speaking practice (Gruber & Kaplan-Rakowski, 2020).
The VR and the Zoom settings
The VR system used in the study was HTC Vive, which consists of a headset with high-resolution displays (2160 × 1200) and a refresh rate of 90 Hz. The system uses room-scale motion tracking technology with advanced sensors accurately tracking the position of the user in real-time, enhancing the sense of presence and immersion. Figure 1 shows a participant in the research lab during the VR intervention. The participant is wearing a VR headset which is facilitating the public speaking simulation. Noteworthy are the participant’s hand gestures which distinctly indicate his active engagement in the public speaking simulation.
As Fig. 2 displays, the specific scene that the VR participants experienced depicted a virtual classroom.Footnote 1 The classroom consisted of virtual humans sitting at their desks, acting as the audience for the presenting students. Such a classroom offered a familiar, customized setting that resembled an everyday real-world context.
The body language, posture, and head orientation of the virtual humans were pre-programmed. The nonverbal affective expressions included nodding, hand gestures indicating reassurance (palm-down gesture), and forward-leaning posture that represents real-world behavior in a classroom. Eye contact with the speaker was simulated because most virtual characters were programmed to look toward the direction of the desk where the speaker was standing. This simulation added to the impression of a real-life audience. Background noises consisted of distant quiet talking, which was intended to make the situation more authentic.
The researcher controlled the visual, auditory, and haptic sensory input that the participants received. The researcher could interact through a selected avatar by speaking via microphone. A motion-tracking camera (Microsoft Kinect v2) could take over the avatar by projecting the researchers’ movements onto the avatar.
In the Zoom setting, the participants had their web cameras activated, allowing to simulate face-to-face interactions. During the presentations, the interlocutors used body movements such as nodding to give non-verbal feedback.
Data collection and instruments
The data collection lasted for over three months during the 2019 Spring semester. The scope of this article is narrowed down to the quantitative analysis, using Statistical Analysis Software (SAS). The study used two instruments: the demographic questionnaire and an operationalized FLA questionnaire. The demographic questionnaire consisted of 21 items and solicited information about the participants such as gender, age, native language, linguistic background, and language learning experience.
The FLA questionnaire was based on a validated instrument (Cronbach Alpha coefficient = 0.93) measuring foreign language classroom anxiety (Horwitz et al., 1986). We operationalized the Horwitz et al. (1986) questionnaire to fit the context of the study (see Appendixes A and B). Because foreign language speaking anxiety was the main focus of our intervention, we ensured that our instrument covered items relevant to speaking. We also omitted irrelevant statements (e.g., “I often feel like not going to my language class”) and made adjustments to fit the VR or Zoom settings. The questionnaire was further operationalized by replacing all mentions to “FL” with "English".
One of the researchers, a native speaker of German, translated the operationalized questionnaire into German and that translation was verified with another German language speaker. The operationalized scale had a high level of internal consistency, as determined by Cronbach Alpha of 0.96.
The modified instrument required the face and construct validity. As recommended by Ary et al., (2010), we formed a panel of experts to help us with evaluating the face and construct validity of our modified instrument. The evaluation was conducted independently by two second language acquisition (SLA) experts, two educational technology professors, and one neuropsychologist. Each expert held a doctoral degree from renowned universities and were actively engaged in research within their respective fields. To gather feedback on the first draft of our FLA questionnaire, we started by engaging two experts in SLA. We shared the draft as a Word document and requested them to evaluate each questionnaire item, specifically focusing on the appropriateness of the FLA concept. The experts tracked changes and provided their feedback, which we subsequently combined and thoroughly discussed. Through this iterative process, we actively sought consensus on the questionnaire items, ensuring that they accurately captured the essence of FLA. The process of gathering feedback from the two educational technology professors and the neuropsychologist followed a similar approach. However, in this instance, these three panel members were specifically requested to evaluate the questionnaire items from the lenses of educational technology and the psychological aspects of VR-based learning.
The feedback received from the panel allowed us to fine-tune our instrument, consequently, the face and construct validity was confirmed.
We generated four versions of the questionnaire to reflect the intervention type (i.e., VR/Zoom) and to reflect the intervention timing (i.e., pre-/post-). The four versions were:
Pre-intervention VR questionnaire (Appendix A),
Post-intervention VR questionnaire (Appendix B),
Pre-intervention Zoom questionnaire,
Post-intervention Zoom questionnaire.
All the versions were identical except for two minor differences. First, because the pre-intervention questionnaires were administered before the intervention, they used the present tense. Meanwhile, the post-intervention questionnaires used the past tense because they were administered after the intervention. Second, the VR questionnaires used relevant mentions of VR, while the Zoom questionnaires had relevant mentions of Zoom.
Our data are constructed from double-repeated measures for each subject. That is, each subject’s FLA was measured three times in each session, and sessions were conducted four times. To appropriately correct for repeated measurements of each subject, we estimated a fixed effects regression model (Wooldridge, 2002), with time fixed effects for both tests and sessions. The regression model is given in Eq. (1):
where the FLA measure was defined as the sum of the scores from the FLA instrument for subject i in session s and test t, minus the score for subject i on the initial session 1 pretest. Because each subject is compared to their own pretest, the measure automatically accounts for across-subject variation, such as from previous language experience, baseline individual anxiety levels, demographics, age, gender, and personality. This approach is referred to as a difference-in-differences methodology in the wider social sciences and is considered a reliable method of inference when subjects are evaluated before and after a treatment effect (Donald & Lang, 2007).
Our coefficient estimate on the VR variable provides a measure of the effect size and can then be interpreted as the marginal impact of VR on FLA relative to other subjects in the same test and the same session. The use of fixed effects has the added benefit of correcting for learning, adaptation, and novelty effects over time. The fixed effects are indicated in Eq. (1) by the terms sessions and testt, which indicate dummy variables for tests two through four and sessions two and three. Fixed effects are not indicated for the first test and first session, as this is captured by the model’s intercept term, β0.
Table 1 reports the multivariate regression results for the estimation of Eq. (1). The intercept term, β0, can be interpreted as the impact of the Zoom control condition on FLA and was not significantly different from zero. The VR treatment was significantly and negatively associated with FLA (t = − 2.00; p = 0.0468). The coefficient estimate of − 1.81 for β6 indicates that FLA was 1.81 points lower following the VR treatment, after correcting for the session and test sequence. To interpret the effect size, the coefficient estimate of the VR treatment was consistent with a reduction in FLA about ten times greater than the in the control condition (− 1.81 coefficient estimate on β6 compared to model intercept, β0, of − 0.18). The statistical strength of the VR effect is evident from the significance of the coefficient estimate for β6 (t-statistic of − 2.00, p value of 5%) on VR despite the small number of subjects and the multiple fixed effects terms included in the model. Dummy variables, or fixed effects, for the session and test sequence were all insignificant, indicating that FLA was not significantly different on later sessions or tests, relative to the initial pretest. The model R2 was 3.3%.
The main finding of the study is that practicing speaking in VR is associated with significantly lower anxiety scores, compared with practicing speaking on Zoom. While this finding is mostly aligned with the existing literature (Gruber & Kaplan-Rakowski, 2020; Thrasher, 2022), our contribution differs from previous studies. This previously lacking quantitative evidence (Parmaxi, 2023) was founded on repeated VR and Zoom interventions and contributes to the literature in several meaningful ways. First, our findings enrich research by confirming the qualitative findings by Gruber and Kaplan-Rakowski (2020) in which, based on semi-structured interviews, the researchers concluded that speaking practice in VR could serve as a useful setting in terms of FLA. Our finding adds to their evidence in that we used statistical calculations and a control group of students speaking on Zoom.
Second, York et al. (2021) found that practicing in VR lowered FLA, but the rate was insignificantly different compared with other media, such as voice or video. Our study provides support for the growing body of evidence that, compared with video format, VR-based public speaking can be more effective with regard to reducing FLA. One reason could be that our participants performed in front of programmed virtual humans, whereas participants in York et al. (2021) interacted with real classmates, who were represented by avatars.
Third, our participants had eight occasions to speak, increasing the chance for VR to be effective. According to the systematic review by Dhimolea et al. (2022), VR interventions are more likely to be statistically significant when subjects are exposed to VR interventions multiple times.
The interview data in (Gruber & Kaplan-Rakowski, 2020) support the quantitative finding in this study. During the semi-structured interviews, some participants who exhibited signs of public speaking anxiety employed different strategies to deal with their anxiety within the VR environment. For instance, one student expressed the belief that VR had the potential to replicate a classroom setting, thereby enabling him to conquer stage fright. Another student remarked that thanks to the repeated public speaking practice sessions, he developed a routine that made him feel less anxious. Yet, a participant who experienced high anxiety during real-life presentations reported that he wanted to use VR as a means to practice for an upcoming foreign language class presentation (Gruber & Kaplan-Rakowski, 2020). Participants cited the ability to move and talk in a classroom environment in a realistic setting as a benefit of practicing in VR as opposed to practicing speaking, for example, in front of a mirror (Gruber & Kaplan-Rakowski, 2020). Such iterations seem to reveal that subjects felt comfortable in the VR setting, which goes along with our study finding.
This study finding has several pedagogical implications and is useful to educators and curriculum designers who should consider recommending speaking practice sessions in VR to their learners. Speaking simulations in VR can be particularly useful for students who are prone to anxiety when giving presentations or individuals preparing for inherently stressful situations such as a job interview or an elevator pitch. The reason is that repeatedly practicing in front of a virtual audience in VR is potentially nonthreatening.
From a practical point of view, the equipment used in our study was high-end and therefore not accessible to everybody. However, low-cost devices exist which allow students to insert their mobile phones into a VR viewer (e.g., Google Cardboard) that can provide similar speaking simulations as described in this study. The number of higher education institutions with VR laboratories is growing, and VR devices are increasingly affordable. Therefore, the augmenting affordability and accessibility of VR devices increasingly allow students to practice speaking in a foreign language on their own, which fosters autonomy.
While certain benefits of practicing public speaking in VR exist, acknowledging its drawbacks is necessary. Typical issues include the risk of cybersickness, potential technical difficulties, and the limited ability of VR to accurately replicate and effectively practice body language and mimicry. However, this last limitation is being addressed with the newest VR systems, such as Oculus Pro (Gruber & Kaplan-Rakowski, 2022). Presenting in a foreign language in front of real humans using a videoconferencing tool (e.g., Zoom) is likely to make presenters more anxious due to features such as the heightened amount of eye gaze at a close distance, or the real-time camera feed (Bailenson, 2021). In contrast, speakers in VR are shielded through an avatar, which may have a positive effect on their anxiety levels.
Regarding the conduct of VR-based public speaking research, although it offers numerous benefits, it also presents several challenges and risks. For instance, potential emotional distress when simulating anxiety-inducing situations like public speaking might occur. Moreover, there is limited prior research to draw upon, making it necessary for researchers to navigate this underexplored research territory.
Limitations and future studies
A recognition of certain limitations within this study should be noted. A limitation in FLA studies is often the sole use of self-reported questionnaires which are prone to bias. In our study, we administered self-reported questionnaires, and a wearable Empatica E4 collected objective data (e.g., heartrate and electrodermal activity), but reporting these data extends the scope of this paper. To collect objective data to detect emotional states, future studies should use eye-trackers, wearable devices, or automatic emotion recognition tools. Alternatively, salivary cortisol samples could also provide alternative measures of FLA, as reported by Thrasher (2022).
Future endeavors should extend this type of study to more participants, subjects from different age groups, learners with different proficiency levels, various task complexities, and different presenting conditions (e.g., small or large audiences, with or without an activated camera). Future studies could explore the effect of diverse virtual environments. For instance, a comparative analysis could be conducted between presenting in VR using computer-generated content and realistic content using 360-degree videos. Alternatively, a comparison between the effectiveness of using VR as opposed to mixed-reality devices can constitute another future research endeavor.
Future research endeavors could study other factors that influence FLA including contextual factors (e.g., speaking task complexity, time constraints) or cultural factors in which learners’ backgrounds and experiences may play a role. Moreover, coping with FLA could be done using emotional regulation strategies such as relaxation techniques or mindfulness because such approaches have the potential to regulate anxiety (e.g., Kaplan-Rakowski et al., 2021) in high-stress situations.
Speaking in public, and especially in a foreign language, is associated with increased levels of anxiety. Given the foundations of VR-based theories and the related literature on the topic of public speaking, our study answered the research question: “Is practicing speaking in high-immersion VR associated with lower FLA scores, as compared with practicing speaking using a videoconferencing tool, Zoom?” Learners of English practiced speaking in public on eight occasions either in VR or on Zoom. We explored how VR and Zoom settings impacted the participants’ FLA levels. The quantitative analysis revealed that practicing speaking in VR was associated with significantly lower anxiety scores, compared with practicing speaking on Zoom. This research contributes to understanding the potential of VR technology to support students when practicing for public speaking in a foreign language, including presentations in an academic or corporate context.
Availability of data and materials
Data and materials can be accessed by contacting the lead author.
The virtual classroom was designed using Unity in UniTyLab at Heilbronn University of Applied Sciences. The setup of the classroom was originally designed for public speaking anxiety therapy.
Foreign language anxiety
Alfadil, M. (2020). Effectiveness of virtual reality game in foreign language vocabulary acquisition. Computers & Education, 153, 103893.
Ary, D., Jacobs, L. C., & Sorensen, C. (2010). Introduction to research in education (8th ed.). Wadsworth Cengage Learning.
Bailenson, J. N. (2021). Nonverbal overload: A theoretical argument for the causes of Zoom fatigue. Technology, Mind, and Behavior. https://doi.org/10.1037/tmb0000030
Barrett, A. J., Pack, A., & Quaid, E. D. (2021). Understanding learners’ acceptance of high-immersion virtual reality systems: Insights from confirmatory and exploratory PLS-SEM analyses. Computers & Education, 169, 104214.
Botes, E., Dewaele, J. M., & Greiff, S. (2020). The foreign language classroom anxiety scale and academic achievement: An overview of the prevailing literature and a meta-analysis. Journal for the Psychology of Language Learning, 2(1), 26–56.
Bowman, D. A., & McMahan, R. P. (2007). Virtual reality: How much immersion is enough? Computer, 40(7), 36–43.
Cheng, A., Yang, L., & Andersen, E. (2017, May). Teaching language and culture with a virtual reality game. In CHI ’17: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems (pp. 541–549). Association for Computing Machinery.
Dewaele, J.-M. (2007). The effect of multilingualism and socio-situational factors on communicative anxiety and foreign language anxiety of mature language learners. The International Journal of Bilingualism, 11(4), 391–410.
Dhimolea, T. K., Kaplan-Rakowski, R., & Lin, L. (2022). A systematic review of research on high-immersion virtual reality for language learning. TechTrends, 66, 810–824. https://doi.org/10.1007/s11528-022-00717-w
Dickey, M. D. (2005). Three-dimensional virtual worlds and distance learning: Two case studies of Active Worlds as a medium for distance education. British Journal of Educational Technology, 36(3), 439–451.
Diemer, J., Alpers, G. W., Peperkorn, H. M., Shiban, Y., & Mühlberger, A. (2015). The impact of perception and presence on emotional reactions: A review of research in virtual reality. Frontiers in Psychology, 6, 1–9.
Dolgunsöz, E., Yildirim, G., & Yildirim, S. (2018). The effect of virtual reality on EFL writing performance. Journal of Language and Linguistic Studies, 14(1), 278–292.
Donald, S. G., & Lang, K. (2007). Inference with difference-in-differences and other panel data. The Review of Economics and Statistics, 89(2), 221–233.
Dooly, M., Thrasher, T., & Sadler, R. (2023). Whoa! Incredible!: Language Learning Experiences in Virtual Reality. RELC Journal. https://doi.org/10.1177/00336882231167610
Dörnyei, Z. (2001). Motivational strategies in the language classroom. Cambridge University Press.
Fondo, M., & Jacobetty, P. (2020). Exploring affective barriers in virtual exchange: The telecollaborative foreign language anxiety scale. Journal of Virtual Exchange, 3, 37–61.
Gardner, R. C., & MacIntyre, P. D. (1993). On the measurement of affective variables in second language learning. Language Learning, 43(2), 157–194.
Gruber, A., & Kaplan-Rakowski, R. (2020). User experience of public speaking practice in virtual reality. In R. Z. Zheng (Ed.), Cognitive and affective perspectives on immersive technology in education (pp. 235–249). IGI Global.
Gruber, A., Canto, S., & Jauregi-Ondarra, K. (2023). Exploring the use of social virtual reality for virtual exchange. ReCALL, 35(3), 258–273. https://doi.org/10.1017/S0958344023000125
Gruber, A., & Kaplan-Rakowski, R. (2022). Verbal and nonverbal communication in high-immersion virtual reality for language learners. In B. Arnbjörnsdóttir, B. Bédi, L. Bradley, K. Friðriksdóttir, H. Garðarsdóttir, S. Thouësny, & M. J. Whelpton (Eds), Intelligent CALL, granular systems, and learner data: short papers from EUROCALL 2022 (pp. 129–134). https://doi.org/10.14705/rpnet.2022.61.1447
Hampel, R., & Baber, E. (2003). Using internet audio-graphic and video conferencing for language teaching and learning. In U. Felix (Ed.), Language learning online: Towards best practice (pp. 171–191). Swets & Zeitlinger.
Horwitz, E. K., Horwitz, M. B., & Cope, J. (1986). Foreign language classroom anxiety. The Modern Language Journal, 70(2), 125–132.
Horwitz, E. K. (2017). On the Misreading of Horwitz, Horwitz and Cope (1986) and the Need to Balance Anxiety Research and the Experiences of Anxious Language Learners. In New Insights into Language Anxiety (pp. 31–48). Multilingual Matters.
Jin, Y., Dewaele, J. M., & MacIntyre, P. D. (2021). Reducing anxiety in the foreign language classroom: A positive psychology approach. System, 101, 102604.
Kaplan-Rakowski, R., & Gruber, A. (2019). Low-immersion versus high-immersion virtual reality: Definitions, classification, and examples with a foreign language focus. In Proceedings of the Innovation in Language Learning International Conference 2019 (pp. 552–555). Pixel.
Kaplan-Rakowski, R., & Gruber, A. (2021). One-on-one foreign language speaking practice in high-immersion virtual reality. In Y. J. Lan & S. Grant (Eds.), Contextual Language Learning, (pp.187–202).
Kaplan-Rakowski, R., & Gruber, A. (2022). Motivation and reading in high-immersion virtual reality. In B. Arnbjörnsdóttir, B. Bédi, L. Bradley, K. Friðriksdóttir, H. Garðarsdóttir, S. Thouësny, & M. J. Whelpton (Eds.), Intelligent CALL, granular systems, and learner data: Short papers from EUROCALL 2022 (pp. 208–213). Research-publishing net.
Kaplan-Rakowski, R., & Gruber, A. (2023). An experimental study on reading in high-immersion virtual reality. British Journal of Educational Technology. https://doi.org/10.1111/bjet.13392
Kruk, M. (2020). Fluctuations in self-perceived foreign language anxiety during visits to Second Life: A case study. Innovation in Language Learning and Teaching. https://doi.org/10.1080/17501229.2020.1813737
Kuai, S. G., Liang, Q., & He, Y. Y. (2020). Higher anxiety rating does not mean poor speech performance: Dissociation of the neural mechanisms of anticipation and delivery of public speaking. Brain Imaging and Behavior. https://doi.org/10.1007/s11682-020-00387-3
Lan, Y. J., & Grant, S. (Eds.). (2021). Contextual Language Learning: Real Language Learning on the Continuum from Virtuality to Reality. Springer Nature. Springer.
Li, C., Huang, J., & Li, B. (2021). The predictive effects of classroom environment and trait emotional intelligence on Foreign Language Enjoyment and Anxiety. System, 96, 1023938.
Lin, T. J., & Lan, Y. J. (2015). Language learning in virtual reality environments: Past, present, and future. Journal of Educational Technology & Society, 18(4), 486–497.
MacIntyre, P. D., & Gardner, R. C. (1991). Language anxiety: Its relationship to other anxieties and to processing in native and second languages. Language Learning, 41(4), 513–534.
MacIntyre, P. D., & Gardner, R. C. (1994). The subtle effects of language anxiety on cognitive processing in the second language. Language Learning, 44(2), 283–305.
Makransky, G., & Petersen, G. B. (2021). The Cognitive Affective Model of Immersive Learning (CAMIL): A theoretical research-based model of learning in immersive virtual reality. Educational Psychology Review, 33, 937–958. https://doi.org/10.1007/s10648-020-09586-2
Melchor-Couto, S. (2017). Foreign language anxiety levels in Second Life oral interaction. ReCALL, 29(1), 99–119.
Mütterlein, J. (2018, January). The three pillars of virtual reality? Investigating the roles of immersion, presence, and interactivity. In Proceedings of the 51st Hawaii International Conference on System Sciences. Hawaii International Conference on System Sciences.
Nobrega, F. A., & de Rozenfeld, C. C. F. (2019). Virtual reality in the teaching of FLE in a Brazilian public school. Languages, 4(2), 1–13.
Papin, K., & Kaplan-Rakowski, R. (2022). A study on vocabulary learning using immersive 360° pictures. Computer Assisted Language Learning, 35. https://doi.org/10.1080/09588221.2022.2068613
Parmaxi, A. (2023). Virtual reality in language learning: A systematic review and implications for research and practice. Interactive Learning Environments, 31(1), 172–184. https://doi.org/10.1080/10494820.2020.1765392
Parsons, T. D. (2015). Ecological validity in virtual reality–based neuropsychological assessment. In M. Khosrow-Pour (Ed.), Encyclopedia of information science and technology (3rd ed., pp. 1006–1015). IGI Global.
Pichette, F. (2009). Second language anxiety and distance language learning. Foreign Language Annals, 42(1), 77–93.
Radianti, J., Majchrzak, T. A., Fromm, J., & Wohlgenannt, I. (2020). A systematic review of immersive virtual reality applications for higher education: Design elements, lessons learned, and research agenda. Computers & Education, 147, 103778.
Russell, V. (2018). Assessing the effect of pedagogical interventions on success rates and on students’ perceptions of connectedness online. Assessment Across Online Language Education, 49–70.
Schlenker, B. R., & Leary, M. R. (1982). Social anxiety and self-presentation: A conceptualization and model. Psychological Bulletin, 92(3), 641–669. https://doi.org/10.1037/0033-2909.92.3.641
Slater, M. (2009). Place illusion and plausibility can lead to realistic behaviour in immersive virtual environments. Philosophical Transactions of the Royal Society b: Biological Sciences, 364(1535), 3549–3557.
Slater, M. (2018). Immersion and the illusion of presence in virtual reality. British Journal of Psychology, 109(3), 431–433.
Slater, M., & Wilbur, S. (1997). A framework for immersive virtual environments (FIVE): Speculations on the role of presence in virtual environments. Presence Teleoperators and Virtual Environments, 6(6), 603–616.
Smith, C. M., & Sodano, T. M. (2011). Integrating lecture capture as a teaching strategy to improve student presentation skills through self-assessment. Active Learning in Higher Education, 12(3), 151–162.
Spielberger, C. D. (1970). Manual for the state-trait anxiety inventory. Consulting Psychologist.
Stupar-Rutenfrans, S., Ketelaars, L. E. H., & Van Gisbergen, M. S. (2017). Beat the fear of public speaking: Does the 360° video virtual reality exposure training in home environment reduces speech anxiety? Cyberpsychology, Behavior, and Social Networking, 20, 624–633.
Taguchi, N. (2021). Immersive Virtual Reality for Pragmatics Task Development. TESOL Quarterly.
Tai, T. Y., Chen, H. H. J., & Todd, G. (2020). The impact of a virtual reality app on adolescent EFL learners’ vocabulary learning. Computer Assisted Language Learning, 35, 1–26.
Teimouri, Y., Goetze, J., & Plonsky, L. (2019). Second language anxiety and achievement: A meta-analysis. Studies in Second Language Acquisition, 41(2), 363–387.
Thrasher, T. (2022). The impact of virtual reality on L2 French learners’ language anxiety and oral comprehensibility: An exploratory study. CALICO Journal, 39(2), 219–238.
Toyama, M., & Yamazaki, Y. (2021). Classroom interventions and foreign language anxiety: A systematic review with narrative approach. Frontiers in Psychology. https://doi.org/10.3389/fpsyg.2021.614184
Tromp, J., Peeters, D., Meyer, A. S., & Hagoort, P. (2018). The combined use of virtual reality and EEG to study language processing in naturalistic environments. Behavior Research Methods, 50(2), 862–869.
Vázquez, C., Xia, L., Aikawa, T., & Maes, P. (2018). Words in motion: Kinesthetic language learning in virtual reality. 2018 IEEE 18th International Conference on Advanced Learning Technologies (ICALT) (pp. 272–276). Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ICALT.2018.00069.
Wooldridge, J. M. (2002). Econometric analysis of cross section and panel data. MIT Press.
Xie, Y., Chen, Y., & Ryder, L. H. (2019). Effects of using mobile-based virtual reality on Chinese L2 students’ oral proficiency. Computer Assisted Language Learning. https://doi.org/10.1080/09588221.2019.1604551
Yang, F. C. O., Lo, F. Y. R., Hsieh, J. C., & Wu, W. C. V. (2020). Facilitating communicative ability of EFL learners via high-immersion virtual reality. Journal of Educational Technology & Society, 23(1), 30–49.
Ye, Y., & Kaplan-Rakowski, R. (2023). Practicing listening comprehension skills in high-immersion virtual reality. SSRN. https://ssrn.com/abstract=4335690
York, J., Shibata, K., Tokutake, H., & Nakayama, H. (2021). Effect of SCMC on foreign language anxiety and learning experience: A comparison of voice, video, and VR-based oral interaction. ReCALL, 33(1), 49–70.
Zhang, X. (2019). Foreign language anxiety and foreign language performance: A meta-analysis. The Modern Language Journal, 103(4), 763–781.
We would like to acknowledge and express our gratitude to the team of UniTyLab at Heilbronn University of Applied Sciences. Special thanks go to Professor Gerrit Meixner, Marius Koller, Philip Schäfer, Ketoma Vix Kemanji, and Daniel Martinez for their support with the equipment, the setting for the study, and technical advice during the VR experiment.
This research did not receive any specific Grant from funding agencies in the public, commercial, or not-for-profit sectors.
Ethics approval and consent to participate
This research was carried out with full ethical approval from the Research Ethics Committee at Heilbronn University of Applied Sciences.
The authors declare they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The German and English versions of the operationalized pre-intervention FLA questionnaire.
The German version
Bitte geben Sie an, inwieweit Sie den folgenden Aussagen zustimmen (stimme voll und ganz zu, stimme zu, weder noch, stimme nicht zu, stimme überhaupt nicht zu)
1. Ich fühle mich nie ganz sicher, wenn ich Englisch spreche.
2. Ich mache mir keine Sorgen, wenn ich Fehler im Englischen mache.
3. Ich zittere, wenn ich weiß, dass ich aufgefordert werde, Englisch zu sprechen.
4. Es macht mir große Angst, wenn ich Englisch sprechen muss, ohne vorbereitet zu sein.
5. Im Fremdsprachenunterricht werde ich so nervös, dass ich Dinge vergesse, die ich eigentlich weiß.
6. Auch wenn ich gut auf das Sprechen in Englisch vorbereitet bin, mache ich mir Sorgen.
7. Ich bin selbstbewusst, wenn ich Englisch rede.
8. Ich fühle, wie mein Herz bis zum Hals schlägt, wenn ich Englisch sprechen muss.
9. Ich fühle mich unsicher, wenn ich vor anderen Leuten Englisch sprechen soll.
10. Ich denke, ich bin angespannter und nervöser, wenn ich nicht meine Muttersprache, sondern Englisch sprechen soll.
11. Ich werde nervös und verunsichert, wenn ich Englisch spreche.
12. Wenn ich weiß, dass ich Englisch sprechen werde, fühle ich mich sehr sicher und entspannt.
13. Ich fühle mich von der Zahl der Regeln, die man in Englisch lernen muss, überfordert.
14. Ich habe Angst, dass andere Studierende über mich lachen werden, wenn ich Englisch spreche.
15. Ich würde mich unter Englisch-Muttersprachlern:innen wahrscheinlich wohl fühlen.
The English version
Please indicate your level of agreement with the following statements (strongly disagree, disagree, neutral, agree, strongly agree).
1. I never feel completely confident when I speak English.
2. I don't worry when I make mistakes in English.
3. I tremble when I know I am going to be asked to speak English.
4. It scares me a lot when I have to speak English without being prepared.
5. I get so nervous in foreign language classes that I forget things I actually know.
6. Even if I am well prepared for speaking in English, I am worried.
7. I feel self-confident when I speak English.
8. I feel my heart beating up to my throat when I have to speak English.
9. I feel insecure when I have to speak English in front of other people.
10. I think I am more tense and nervous when I have to speak English instead of my mother tongue.
11. I get nervous and insecure when I speak English.
12. When I know I am going to speak English, I feel very confident and relaxed.
13. I feel overwhelmed by the number of rules you need to learn in English.
14. I am afraid that other students will laugh at me when I speak English.
15. I would probably feel comfortable around native English speakers.
The German and English versions of the operationalized post-intervention FLA questionnaire.
The German version
Bitte beantworten Sie die Fragen in Hinblick auf das Thema, über das Sie gerade gesprochen haben (strongly disagree, disagree, neutral, agree, strongly agree).
1. Ich habe mich nie ganz sicher gefühlt, als ich gerade Englisch gesprochen habe.
2. Ich habe mir keine Sorgen gemacht, dass ich Fehler im Englischen mache.
3. Ich habe gezittert, als ich angefangen habe, Englisch zu sprechen.
4. Es hat mir große Angst gemacht, als ich Englisch sprechen musste, ohne vorbereitet zu sein.
5. Ich wurde so nervös, dass ich Dinge vergessen habe, die ich eigentlich weiß.
6. Auch wenn ich mich gut auf das Sprechen in Englisch vorbereitet fühlte, habe ich mir Sorgen gemacht.
7. Ich habe mich selbstbewusst gefühlt, als ich Englisch geredet habe.
8. Ich habe gefühlt, wie mein Herz bis zum Hals geschlagen hat, als ich Englisch sprechen musste.
9. Ich habe mich unsicher gefühlt, als ich vor anderen in VR Englisch sprechen sollte.
10. Ich denke, ich war angespannter und nervöser, da ich nicht meine Muttersprache, sondern Englisch sprechen sollte.
11. Ich war nervös und verunsichert, als ich Englisch spreche sollte.
12. Obwohl ich wusste, dass ich Englisch sprechen musste, fühlte ich mich sehr sicher und entspannt.
13. Ich hatte Angst, dass die andere Studierenden in VR über mich lachen werden, wenn ich Englisch spreche.
14. Ich wurde nervös, als ich auf Englisch in VR über dieses Thema sprechen musste, das ich nicht vorher vorbereitet hatte.
The English version
Please answer the questions with respect to the topic you just talked about (strongly disagree, disagree, neutral, agree, strongly agree).
1. I never felt completely confident when I spoke English just now.
2. I didn't worry about making mistakes in English.
3. I was shaking when I started speaking English.
4. It worried me a lot when I had to speak English without being prepared.
5. I got so nervous that I forgot things I actually know.
6. Even though I felt well prepared for speaking in English, I was worried.
7. I felt self-confident when I spoke English.
8. My heart was pounding when I had to speak English.
9. I felt insecure when I had to speak English in front of others in VR.
10. I think I was more tense and nervous because I had to speak English instead of my mother tongue.
11. I felt nervous and insecure when I was supposed to speak English.
12. Although I knew I had to speak English, I felt very confident and relaxed.
13. I was afraid that the other students in VR would laugh at me when I speak English.
14. I got nervous when I had to speak in English in VR about this topic that I had not prepared beforehand.
About this article
Cite this article
Kaplan-Rakowski, R., Gruber, A. The impact of high-immersion virtual reality on foreign language anxiety. Smart Learn. Environ. 10, 46 (2023). https://doi.org/10.1186/s40561-023-00263-9