Structured interaction between teacher and student in the flipped classroom enhances learning and interbrain synchrony

Structured interaction between teacher and student in the flipped classroom enhances learning and interbrain synchrony

Introduction

Teaching, a cornerstone of knowledge transfer in human society, has facilitated the sharing of experience and wisdom across generations. In the traditional lecture-based teaching (LT) model, teachers deliver lectures directly, with students primarily engaging as passive and note-taking listeners, although they can ask questions and receive immediate responses1,2,3. The flipped classroom teaching (FT) model, leveraging video lectures before class to make room for interactive discussions during class, is on the rise in modern educational settings4,5,6. Research has demonstrated that the FT improves students’ learning in subject areas such as humanities, mathematics, and engineering3,7. Moreover, numerous studies have found that FT enhances learning compared to LT5,7,8,9. Compared with the LT, the FT includes video lectures and structured teacher–student interaction phases. However, it is yet to be elucidated whether the structured teacher–student interaction plays a key role in improving learning outcomes in the FT and its underlying neural basis.

The FT model generally consists of two phases: the lecture phase and the knowledge consolidation phase (also known as the teacher–student interaction phase)10,11,12. In the lecture phase, students engage with lecture videos to acquire foundational knowledge. Subsequently, in the teacher–student interaction phase, they delve deeper into the material through face-to-face discussions with the teacher. Additionally, before attending the FT class, the teacher generally arranges the interaction process step by step, generating a structured teacher–student interaction phase. In contrast, the teacher–student interaction is spontaneous or voluntary in the LT model, which happens in the process of lecturing. If students have no questions and the teacher does not initiate queries, interaction between teacher and students would not occur. Hence, it is reasonable to conclude that the FT will bolster communication during the interaction phase, thanks to prior knowledge acquisition and the well-planned schedule.

According to the interactive-constructive-active-passive theory (ICAP)13, the interaction (e.g., teacher–student interaction), with a sufficient degree of turn-taking and predominantly constructive statements by interactants, promotes students’ learning. During the interaction, teachers’ explanations and feedback help to complete students’ knowledge schemas by filling in the gaps in their understanding. Moreover, teachers prompt students to reevaluate their existing knowledge, correct misconceptions, and infer new insights. Empirical studies have found that well-arranged teacher–student interactions are effective in improving learning14,15. For instance, a previous study found that students’ learning performance was significantly improved when teachers used scaffolding strategies (e.g., asking guiding questions or providing hints) to individualize instruction for students15. Therefore, the structured teacher–student interaction in the FT model may play a vital role in enhancing learning.

Moreover, the educational benefits of teacher–student interaction can be due to the enhanced interbrain synchrony (IBS), which is defined as the temporal coupling of neural systems between two individuals engaged in social interaction16. The burgeoning of “second-person neuroscience“17 has prompted an increasing number of researchers to use neuroimaging techniques (e.g., fNIRS) to simultaneously measure the brain activity of two or more individuals during interaction (known as “hyperscanning”)18. Studies have revealed that the teacher–student interaction enhances the IBS in the prefrontal cortex (PFC) and the temporal-parietal junction (TPJ)15,19,20. Importantly, students’ learning outcomes were improved with the increase in the IBS15,19,20,21. For example, a previous study found that when teachers adopted a face-to-face communication mode to teach students with prior knowledge, the IBS in the left PFC between teachers and students was enhanced, and the larger the IBS, the better the student’s academic performance22. Hence, the IBS may be the neural basis of the FT to enhance students’ learning.

In summary, the current study aimed to investigate whether the structured teacher–student interaction is crucial for enhancing learning and to explore the neural basis for improving students’ learning in the FT model. It is expected that the FT will enhance learning given its superior teacher–student interaction compared to the LT. Additionally, we expect that the IBS in the FT will be significantly higher than that in the LT and the IBS is positively correlated with the student’s learning. Taken together, it can be supposed that the structured interaction in the FT model played a key role in improving learning.

To examine the aforementioned proposals, an experiment was conducted in which a teacher taught three groups of students using three types of teaching models: the FT condition, which contained a video-based lecture phase and a structured teacher–student interaction phase; the LT condition, which included a face-to-face lecture phase and a spontaneous teacher–student interaction phase; and the control teaching (CT) condition, which comprised a video-based lecture phase and a spontaneous interaction phase. The CT condition was designed to directly compare the impact of structured and spontaneous interactions. Moreover, we recorded the brain activity of teacher–student dyads using the functional near-infrared spectroscopy (fNIRS)-based hyperscanning technique during both phases. We compared the learning improvements and the IBS across the three groups and further examined the relationship between learning improvements and IBS. This study seeks to elucidate the important role of structured teacher–student interactions in improving students’ learning outcomes, offering empirical evidence to inform future educational reforms.

Results

Behavioral results

Pre- and post-test scores of students in three conditions are displayed in Fig. 1. A two-way ANOVA analysis was performed on these scores. We found a significant main effect of the test phases (F (1,61) = 126.83, p < 0.001, ηp2 = 0.68, observed power = 1.00), and the lecture types (F (2,61) = 4.75, p = 0.012, ηp2 = 0.13, observed power = 0.78). Additionally, the interaction between the test phases and lecture types was significant (F (2,61) = 7.74, p = 0.0010, ηp2 = 0.20, observed power = 0.95). Then, the simple-effect test with Bonferroni correction showed that the post-test scores were higher than the pre-test scores in all conditions (ps < 0.001, Cohen’s ds > 1.00). Meanwhile, for the pre-test score, there was no significant difference among the FT, LT, and CT conditions (ps > 0.10, Cohen’s ds < 0.23). In contrast, for the post-test score, the FT condition (M = 9.90, SD = 2.68) was significantly higher than the LT condition (M = 7.71, SD = 2.93, p = 0.049, Cohen’s d = 0.78) and the CT condition (M = 6.75, SD = 2.99, p = 0.0019, Cohen’s d = 1.11); there was no significant difference between the latter two conditions (p = 0.83, Cohen’s d = 0.32).

Fig. 1: Pre- and post-test scores of students in three conditions (FT, LT, and CT).
Structured interaction between teacher and student in the flipped classroom enhances learning and interbrain synchrony

Each dot represents a student’s score. Diamonds indicate the mean score across all participants in each group. The error bars indicate the standard error of the mean. Significance levels are marked as follows: *p < 0.05; **p < 0.01; ***p < 0.001; Abbreviations: FT flipped classroom teaching condition, LT lecture-based teaching condition, CT control teaching condition.

Full size image

In addition, one-way ANOVAs were performed on likability and appearance attractiveness. No significant main effect of the lecture types was found on likability (teacher: F (2,61) = 0.38, p = 0.69, ηp2 = 0.012; student: F (2,61) = 1.42, p = 0.25, ηp2 = 0.045) and appearance attractiveness (teacher: F (2,61) = 0.23, p = 0.80, ηp2 = 0.0075; student: F (2,61) = 0.76, p = 0.47, ηp2 = 0.024). Due to the fact that the data did not meet the prerequisite assumption of equal variance, we used the Kruskal–Wallis test to calculate the perceived quantity of teacher–student interaction and found a significant main effect of the lecture types (χ2(2) = 18.95, p < 0.001, η2 = 0.28). The perceived quantity of teacher–student interaction was higher in the FT condition (M = 7.24, SD = 1.04) than in the LT (M = 4.33, SD = 2.24, p < 0.001, Cohen’s d = 1.67, Bonferroni corrected) and CT conditions (M = 4.45, SD = 2.60, p < 0.001, Cohen’s d = 1.41, Bonferroni corrected), with no significant difference between the LT and CT conditions (p > 0.10). Therefore, we suggest that the implementation of the experiment was in line with our expectations. That is, there were more interactive discussions between teacher and student in the FT condition, indicating the implementation of the experiment with fidelity.

GLM tests showed that no significant linear relationships were identified between perceived quantity of interaction and learning improvements (FT condition: t = −0.84, R2 = 0.036, β = −0.47, p = 0.41; LT condition: t = 0.64, R2 = 0.021, β = 0.17, p = 0.53; CT condition: t = 1.16, R2 = 0.063, β = 0.23, p = 0.26).

Interbrain synchrony results

Within the FOI, one-sample t-tests against zero (FDR corrected) revealed the enhanced class-related IBS during the FT condition (Fig. 2a) in the left dorsolateral prefrontal cortex (DLPFC, CH09, M = 0.038, SE = 0.011, t (20) = 3.37, p = 0.019, Cohen’s d = 0.73, observed power = 0.99; CH11, M = 0.052, SE = 0.014, t (19) = 3.73, p = 0.017, Cohen’s d = 0.84, observed power = 0.99), superior temporal gyrus (STG, CH13, M = 0.037, SE = 0.012, t (18) = 3.14, p = 0.027, Cohen’s d = 0.73, observed power = 0.99), and primary somatosensory cortex (S1, CH14, M = 0.038, SE = 0.011, t (19) = 3.63, p = 0.017, Cohen’s d = 0.81, observed power = 0.99). In contrast, no significant enhanced class-related IBS was observed for the LT (ts < 2.60, ps > 10, Cohen’s ds < 0.57, Fig. 2b) and the CT (ts < 2.02, ps > 0.10, Cohen’s ds < 0.44, Fig. 2c) conditions.

Fig. 2: T-map from one-sample t-test against zero on class-related IBS.
figure 2

Each row represents a teaching model: a flipped classroom teaching, b lecture-based teaching, and c control teaching, with brighter color representing a higher t value. Channels with significant results are labeled accordingly.

Full size image

For the channels showing significant enhanced class-related IBSs, a one-way ANOVA was conducted to compare the class-related IBS across the lecture types. The main effect of the lecture types was significant in the left DLPFC (CH11, F (2,56) = 5.30, p = 0.031, ηp2 = 0.16, observed power = 0.85, FDR corrected). It should be noted that 5 dyads were excluded from the analysis as the CH11 in them was determined to be a bad channel (see “Methods” for bad channel identification methods). Class-related IBS in the FT condition (M = 0.052, SE = 0.014) was higher than that in the LT condition (M = -0.012, SE = 0.012, p = 0.012, Cohen’s d = 1.09, Bonferroni corrected) and the CT condition (M = −0.0029, SE = 0.019, p = 0.040, Cohen’s d = 0.76, Bonferroni corrected); however, the latter two were not significantly different (p > 0.10, Fig. 3a). No significant main effect was found in other channels (ps > 0.10, FDR corrected; see Supplementary Table 1 for exact p values).

Fig. 3: Task-related IBS at CH09, CH11, CH13, and CH14 among three conditions (FT, LT, and CT).
figure 3

Class-related (a), lecture-related (b), and interaction-related (c) IBS at CH09, CH11, CH13, and CH14 among three conditions. The error bars indicate the standard error of the mean. Significance levels are marked as follows: *p < 0.05; **p < 0.01; Abbreviations: FT flipped classroom teaching condition, LT lecture-based teaching condition, CT control teaching condition.

Full size image

We further examined the lecture and interaction phases. We found that the main effect of the lecture types was significant in the interaction phase (F (2,56) = 6.00, p = 0.017, ηp2 = 0.18, observed power = 0.89, FDR corrected, Fig. 3c) but not in the lecture phase (F (2,56) = 2.69, p = 0.19, ηp2 = 0.088, FDR corrected, Fig. 3b). Also, it was not significant for other channels (ps > 0.10, FDR corrected; see Supplementary Table 1 for exact p values).

Correlation between behavioral indices and IBS

The IBS results have highlighted the important role of the left dorsolateral prefrontal cortex (CH11) in the FT condition. Next, we examined whether there was a linear relationship between the IBS in the left DLPFC and the learning improvements in each teaching model. GLM tests showed that the IBS showed a strong linear relationship with learning improvements in both FT and CT conditions, regardless of whether the IBS was class-related, lecture-related, or interaction-related. However, this relationship was not identified in LT. The statistical results are displayed in Table 1 and Fig. 4.

Table 1 Results of the GLM test between task-related IBS at CH11 and learning improvements
Full size table
Fig. 4: Linear regression results between task-related IBS at CH11 and learning improvements.
figure 4

Linear relationship between the class-related (a), lecture-related (b), and interaction-related (c) IBS in left dorsolateral prefrontal (CH11) and learning improvements in flipped classroom teaching (left), lecture-based teaching (middle), and control teaching (right) conditions. x-Axis indicates the IBS value, and y-axis indicates the learning improvements. Shaded areas indicate 95% confidence interval. Significance levels are marked as follows: *p < 0.05; **p < 0.01.

Full size image

In addition, we analyzed the relationships between the perceived quantity of teacher–student interaction and interaction-related IBS for each teaching model. GLM tests revealed no significant linear relationship between them (FT condition: t = 0.57, R2 = 0.018, β = -0.0082, p = 0.58; LT condition: t = −0.53, R2 = 0.015, β = −0.0036, p = 0.61; CT condition: t = 1.32, R2 = 0.093, β = 0.0090, p = 0.20).

Discussion

Since the earliest days of chalk on slate, teaching has relied on the dynamic interaction between the teachers and students. Technological advances have enhanced this interaction. The flipped classroom model (FT), emerging from such developments, encourages students to explore new materials through lecture videos before class. As a result, students can have deeper and more informed interactions with their teachers in the following class session6,10. The current study considers that the FT enhances the quality of teacher–student interactions and their neural synchrony, thereby leading to better learning outcomes. Moreover, we suggest that a well-designed, structured interaction is essential for augmenting this advantage.

In the present exploratory study, we conducted a comparative analysis among the FT condition, the traditional lecture-based teaching (LT) condition with a face-to-face lecture and spontaneous interaction, and a control teaching (CT) condition including a video lecture and spontaneous interaction. Through this approach, the study comprehensively examined the impact of structured interaction on improving learning outcomes. Importantly, we employed the fNIRS-based hyperscanning technology to uncover the role of interbrain synchrony (IBS) in the flipped classroom setting.

The behavioral results showed that the FT significantly improved students’ learning outcomes compared to both LT and CT, while the latter two conditions showed no significant difference. The distinction between LT and CT conditions lay solely in the lecture phase; LT involved an in-person lecture, whereas CT featured a video lecture by the same instructor. This variation in lecture format did not result in different learning outcomes, consistent with a prior study20, suggesting that the video lecture is comparable to the in-person lecture. Additionally, the FT condition differed from the CT condition solely in the interaction phase, with the FT containing a structured interaction and the CT allowing a spontaneous interaction. This distinction led to significantly enhanced learning in the FT condition, supporting the hypothesis that structured interaction plays a key role in improving learning. These findings indicate that spontaneous interaction offers lower educational benefits than structured interaction. Overall, a well-planned, structured interaction is crucial for achieving positive teaching outcomes, which supports and expands the ICAP theory13.

Another interesting finding was that the perceived quantity of teacher–student interaction was higher in the FT condition compared to the LT and CT conditions, but GLM tests revealed no significant linear relationship to learning improvements. These results suggest that the interaction frequency had a minor impact on students’ learning enhancement in the FT condition. However, previous evidence indicated that the greater the quantity of teacher–student interaction, the better the students learned23,24. This inconsistency could be explained by the differences in measurement methods and learning content. Specifically, Pan and his colleagues quantified interactions by coding videos23, while we relied on participants’ subjective reports. Additionally, the previous study focused on skill learning (a song)23, whereas our study centered on theoretical knowledge. Skill learning, which requires more practice25, benefits from frequent interactions that provide more practice opportunities. In contrast, theoretical knowledge learning benefits more from the depth or quality of interaction, where people share their insights revolving around a certain knowledge, facilitating each other’s thinking and thus promoting deeper learning. Hence, the key to enhancing student learning outcomes in FT may lie in the quality of structured interaction rather than the frequency alone.

The fNIRS results indicated that the neural synchrony captured by fNIRS-based hyperscanning appears to underlie the enhanced communication in the flipped classroom. The FT alone showed significantly enhanced IBS in the left dorsolateral prefrontal cortex (DLPFC), superior temporal gyrus (STG), and primary somatosensory cortex (S1). These findings align with previous hyperscanning studies15,22,26,27, which also observed enhanced IBS in these cortical areas during teacher–student instruction, collaborative learning, and group engagement in creative problem-solving. Specifically, the superior temporal cortex, a critical area for theory of mind and mentalizing28, is involved in processing syntactic and semantic information29,30. The lecture given in the current study required deep semantic processing of management psychology knowledge. Teacher and students in the FT condition formed a shared knowledge representation reflected by the increase in IBS15,22 in the STG. Furthermore, the primary somatosensory cortex is widely recognized as a key region for processing sensory and motor signals27,31, and the left DLPFC plays important roles in various cognitive functions, such as mentalizing32,33, information integration34,35, joint attention36, and monitoring and feedback37,38. The increased IBS in these regions indicated intense interaction dynamics between teacher and students, along with a joint attention to educational materials39,40.

We found that the FT condition exhibited significantly higher IBS in the left DLPFC compared to the LT and CT conditions in the interaction phase. However, no significant difference in the IBS was identified between the LT and CT conditions. These patterns were consistent with the behavioral results regarding learning improvement, implying that increased IBS underlies improved learning outcomes from structured interaction in the flipped classroom. These findings provide neurobiological evidence for the ICAP theory13.

Importantly, GLM analysis supported the linear relationship between learning improvements and IBS in left DLPFC in the structured interaction phase of FT. Specifically, a well-planned interaction schedule enables progressively deepening engagement between teacher and students, leading to a convergence in their understanding of knowledge. This alignment, driven by extensive cognitive involvement, is reflected by enhanced IBS and improved learning outcomes19. In addition, we observed that the perceived amount of teacher–student interaction did not significantly predict the IBS, which might suggest that the increased IBS mainly depended on high-quality teacher–student interaction41. In sum, our findings emphasize the critical role of structured designs in the interaction phase for enhancing IBS and learning.

Additionally, we observed that in the spontaneous interaction phase of the CT, IBS in the left DLPFC was correlated with learning improvements, an association not found in the same phase of the LT condition. Combined with the findings about the lecture phase, lecture-related IBS was linked to learning improvements in CT, but not in LT. We conjecture that the effects of the lecture phase might continue into the interaction phase; that is, a kind of “aftereffect” is generated. Specifically, the lecture and interaction phases of each teaching model are closely related. Students attending the video lecture may have a better learning experience than those in the face-to-face lecture39,40. The immersive experience of students during the video lecture (owing to minimal distractions) persists into the following interaction phase in CT. Likewise, the distracted state of students during the in-person lecture (due to unpredictable disruptions) continues into the interaction phase in LT. Consequently, the interaction phases show a similar correlation pattern to the lecture phases. Moreover, the spontaneous interaction could be subject to various potentially unpredictable factors42, which might lead to an unstable link between IBS and learning outcomes. However, empirical evidence is needed to further support this supposition; future studies could track the dynamics and changes in IBS and behavior over time and link coded behavioral labels with IBS point-by-point23,27 to explore this effect.

Regarding the IBS in the lecture phase, no significant difference in IBS in the left DLPFC was found among the FT, LT, and CT conditions. GLM analysis revealed significant links between lecture-related IBS and learning improvements in FT and CT, but not in LT. Specifically, the lecture videos in FT and CT conditions are prerecorded, allowing for careful control prior to the lecturing, and students watch them alone. As a result, they are exposed to fewer potential distractions, which enables the IBS to be determined by the joint attention on teaching materials39,40. In contrast, the on-site lecture in the LT condition exposes both the instructor and students to more potential distractions and additional sensory inputs43, which might account for the lack of a relationship between the IBS and learning improvements.

Moreover, the current study opens several avenues for future studies. First, this study highlights the benefits of the structured interaction of the flipped classroom. To determine whether a structured interaction following a face-to-face lecture can enhance learning as effectively as that following a video lecture, the former needs to be tested. Second, our study involves one-on-one teaching conducted in a controlled laboratory setting, which is a simulated and “micro” flipped class. It would be valuable to employ portable neuroimaging techniques such as EEG or fNIRS to explore the effects of FT in real-world classroom settings, as in previous studies39,40. Third, more types of knowledge (e.g., procedural knowledge) should be explored rather than declarative knowledge alone. Fourth, this study did not measure the number of questions asked by students across conditions. Future studies could collect such data to gain a more comprehensive understanding of the instructional process. Finally, given that this study is a preliminary attempt to explore the neural foundation of FT using fNIRS hyperscanning technology, it is encouraged that future studies increase the sample size to consolidate the current findings.

In conclusion, this study provides empirical evidence that the flipped classroom teaching model, characterized by pre-recorded lectures and structured interaction phases, improves learning outcomes significantly more than traditional lecture-based teaching or the flipped teaching model controlled with spontaneous interactions. This improvement is underpinned by increased teacher–student IBS in key brain regions, such as the left DLPFC, STG, and S1, which are associated with cognitive processing and teacher–student interaction dynamics. Such evidence highlights the crucial role of interaction in the learning process, which is promoted by the structured approach of flipped classrooms. The findings suggest that carefully planned interaction schedules can significantly benefit educational content delivery.

The current study, therefore, offers valuable insights for the optimization of flipped classroom methodologies, potentially guiding future educational strategies and policies to better meet the needs of both teachers and students in achieving optimal learning experiences. For example, in future teaching activities, teachers should design their interaction processes with students carefully and scientifically and strive to catch students’ attention and engage them, thereby fostering synchronization in learning behaviors and neural activities between teachers and students.

Methods

Participants

A previous study using a comparable experimental design reported large effects44. We thus performed a power analysis for large effects using G* Power 3.1. The analysis identified the minimum number of participants as 60, with a power (1 − β) = 0.80, an error probability of α = 0.05, and a large effect size of f = 0.42. Sixty-six university students (27 males, 39 females; M = 20.96, SD = 2.23) and 1 teacher (male, 23 years old, a postgraduate student with one year of teaching experience) were recruited through public announcements posted at our university to participate in this experiment. Note that the sample size is consistent with previous studies20,22,44. All participants were healthy and right-handed, had normal hearing and vision, and were naive to the purposes of the experiment. The students did not major in psychology. The teacher, professionally trained in psychology, was thoroughly familiar with the educational content. Each student was paired with the teacher, generating 66 teacher–student dyads. The members of each dyad were not familiar with each other. Note that the same teacher was involved in every dyad to maintain a consistent teaching style23.

The students were randomly assigned to three conditions, each with twenty-two students. The first group (8 males, M = 21.59, SD = 3.25; 14 females, M = 21.84, SD = 2.01) attended the FT condition. The second group (10 males, M = 19.91, SD = 2.03; 12 females, M = 21.66, SD = 1.87) attended the LT condition. The third group (9 males, M = 20.34, SD = 2.39; 13 females, M = 20.19, SD = 1.65) attended the CT condition. Note that two female students, one each from the FT and LT conditions, were removed because their post-test scores were lower than the pre-test scores and the learning improvements (the difference between post- and pre-test scores) deviated from the mean by more than three standard deviations, leaving 21 students in both FT and LT conditions. The aim of the study was to improve students’ learning outcomes through effective teaching. The decrease in the two participants’ learning outcomes suggests that they did not take the experiment seriously. Therefore, we removed them from the participant pool.

We got the written consent form from all participants before the experiment. The study was in accordance with the Declaration of Helsinki and supported by the Ethics Committee of Zhejiang normal university (ZSRT2022020).

Experimental materials

Students were asked to study two theories selected from “Management psychology”: the “achievement need theory”45 and the “ERG theory”46.

To ensure consistent teaching content and strategy across all dyads, the teacher underwent standardized training before attending the experiment. Specifically, a teaching script containing teaching contents and procedures was provided to the teacher. He was then asked to rehearse and prepare at home. Three days later, the teacher demonstrated his teaching to the experimenter in the lab. The main experiment started only after the teacher demonstrated consistency in the duration of instruction, speech rate, instructional content, and methodology15,22. Additionally, after training, we invited him to pre-record a 5-min lecture video that would be used in the FT and CT conditions in the lecture phase.

Two sets of parallel tests regarding the two theories were prepared for pre- and post-tests, respectively. They were two short case analysis questions with 15 points for each test. The tests were adapted from the textbook Human Resource Management: Theory, Practice and Art47.

To assess the difficulty between the pre- and post-tests, 12 new participants (6 males, 6 females; M = 19.45, SD = 0.98) were recruited to complete the two tests. Two raters who were naive to the experimental purposes were invited to score the tests. The Pearson correlation coefficients between the two raters for the pre- and post-tests were 0.72 (p = 0.0085) and 0.69 (p = 0.013), respectively, indicating that inter-rater reliability was acceptable. The final scores of each student were the average of the two raters’ scores. A paired t-test on the final score revealed no significant difference between the pre- and post-tests (t (11) = −0.95, p = 0.36, Cohens’ d = 0.28), suggesting that the difficulties of the two tests were comparable. In addition, a Pearson correlation analysis showed that the correlation coefficient between the two tests was 0.70 (p = 0.011), indicating that the two tests were reliable.

Experimental tasks and procedures

At the beginning of the experiment, all students completed the pre-test within 15 min. Then, they rested for 3-min (left panels in Fig. 5), with no communication allowed. Students were asked to relax and remain still.

Fig. 5: Experimental design.
figure 5

Students were randomly assigned to three conditions with different teaching models: a lecture-based teaching (LT), b flipped classroom teaching (FT), and c control teaching (CT). All models encompassed three phases: rest, lecture, and teacher–student interaction.

Full size image

After the rest, a 5-min lecture about the above-mentioned theories was given in either the LT, FT or CT condition. Note that the CT condition was included to underscore the impact of structured interaction on improving learning outcomes. In the LT condition (middle panel in Fig. 5a), the teacher taught the theories. The students listened and took notes, during this phase the students were allowed to raise questions and the teacher would give feedback immediately. In contrast, in the FT and CT conditions (middle panel in Fig. 5b, c), the students learned the theories by watching the lecture video and taking notes. The lecture video could not be paused, fast-forward, fast-backward, and replayed.

After a 1-min break, an 8-min teacher–student interaction phase was followed. In the LT and CT conditions (spontaneous interaction, right panel in Fig. 5a, c), the students could ask questions freely, and the teacher would give feedback immediately. If the students did not have any questions, they were instructed to review their notes. In the FT condition (structured interaction, right panel in Fig. 5b), the students were asked to summarize the teaching contents, and the teacher would give feedback. Next, the teacher encouraged the students to ask questions and provided answers. Following this, the teacher asked questions and the students responded, receiving feedback accordingly. Finally, the teacher summarized the knowledge concisely.

After the interaction phase, the students completed the post-test and reported their perceived quantity of teacher–student interaction. Additionally, both teachers and students were required to complete their subjective ratings of the likability and appearance attractiveness of the interacting partners. All ratings were on a 9-point Likert scale, with 1 denoting “not very much” and 9 denoting “very much”. No communication was allowed during the assessment.

During the experiment, the brain activities of the teacher and student were recorded using an fNIRS device. Note that, for the lecture phase in the CT and FT conditions, we collected the teacher’s brain activity while he recorded the lecture video before the main experiment began.

Behavioral data analyses

Each student’s pre- and post-tests were scored by the two raters. Pearson correlation analysis showed that the two raters’ scores were significantly correlated for the pre-test (r = 0.65, p < 0.001) and the post-test (r = 0.79, p < 0.001), indicating that the inter-rater reliabilities were acceptable.

For each student, the final score of each test, reflecting the learning outcome, was the average score of the two raters. To examine the effects of test phases and lecture types on learning, an ANOVA with test phases (pre vs. post) as the within-subjects factor and lecture types (FT, LT, vs. CT) as the between-subjects factor was conducted. If the FT can improve students’ learning outcomes better than the LT and CT conditions, then we anticipated a significant interaction effect. In specific, we expected no significant difference in the pre-test score among the three lecture types, but that the post-test score for the FT condition should be significantly higher than that of the LT and CT conditions.

In addition, we conducted one-way ANOVAs to analyze likability, appearance attractiveness, and students’ perceived quantity of teacher–student interaction among the three lecture types. We then calculated whether the perceived quantity of interaction could predict students’ learning improvements—the difference between post- and pre-test scores—with a general linear model (GLM).

fNIRS data acquisition

The brain activity of both the teacher and student was measured by the ETG-4000 (Hitachi Medical Corporation, Japan) fNIRS system with a sampling rate of 10 Hz. This system contains eight near-infrared light emitters at two wavelengths (695 and 830 nm) and seven detectors, which formed 19 measurement channels. This layout covered the left prefrontal cortex (PFC) and temporal-parietal junction (TPJ). Both brain areas have been recognized as playing a key role in teacher–student interactions15,20,21,22,34. A 3 × 3 probe patch (five emitters and four detectors, resulting in 12 measurement channels with 30 mm optode separation) was used to cover each participant’s left PFC and a 2 × 3 probe patch (three emitters and three detectors, resulting in 7 measurement channels with 30 mm optode separation) was placed over each participant’s left TPJ. According to the international 10–20 system, the rightmost optode of the lowest probe row of the 3 × 3 patch was placed at Fpz, and the leftmost optode of the lowest probe row of the 2 × 3 patch was placed at P5 (Fig. 6a). We applied a virtual registration method to obtain correspondence between the measured channels and the measured points on the cerebral cortex48,49.

Fig. 6: Optode probe patch and frequency band of interest.
figure 6

a The patches were placed over the left prefrontal and temporal-parietal junction regions. b t-value map. x-Axis denotes the frequency ranging from 0.01 to 1 Hz. y-Axis denotes the t-value resulting from the comparison between lecture and rest phases in each channel. The frequency band of interest, 0.13 to 0.17 Hz, is highlighted by a red rectangle.

Full size image

Preprocessing of fNIRS data

The fNIRS data were preprocessed with the NIRS-KIT toolbox50, based on MATLAB. In line with previous studies, the quality of fNIRS data was checked by visual inspection. All channels that did not show a clear heart band at around 1 Hz in the wavelet transform plot were identified as bad channels and were excluded from all further analysis (approximately 11% of the channels)51,52,53,54. Subsequently, the first and last 10 s of data in each phase were removed to ensure the measurement of stable brain activity. The remained data were first converted to changes in oxyhemoglobin (HbO) and deoxyhemoglobin (HbR), which were calculated according to the modified Beer-Lambert law. Since the HbO concentration is more sensitive to changes in cerebral blood flow than the HbR concentration55, this study focused on the HbO concentration, which aligns with previous fNIRS-based hyperscanning studies20,21,56,57. Then, motion artifacts were corrected by using a “correlation-based signal improvement” (CBSI) approach58. Finally, a bandpass filtering procedure (0.01–1 Hz) was performed to reduce low-frequency drifts and high-frequency noise21,59.

Interbrain synchrony (IBS) calculation

After preprocessing, we employed wavelet transform coherence (WTC) analysis to estimate the neural synchrony between analogous channels of each dyad60. In specific, we estimated the time-averaged (also averaged across all channels in each dyad) IBS across the frequencies from 0.01 to 1 Hz (frequency step = 0.058, after logarithmic transformation)61,62 and then a series of paired sample t-tests were conducted to compare the IBS between the rest phase and the whole teaching process (i.e., lecture phase plus interaction phase). The resulting p-values were corrected using false discovery rate (FDR) method63. The IBS was significantly increased during the entire teaching process compared to the rest phase within the frequencies between 0.13 Hz and 0.17 Hz (i.e., period 5.89–7.42 s). Consequently, as depicted in Fig. 6b, this frequency range was selected as the frequency of interest (FOI) for subsequent analyses, and the IBS values within this FOI were averaged.

We quantified enhancements in IBS throughout the class (class-related IBS), as well as during the lecture (lecture-related IBS) and interaction phases (interaction-related IBS), relative to the baseline established during the rest phase. Consequently, we derived three distinct categories of task-related IBS, which we then transformed into Fisher z-scores for further analysis64. Subsequently, we conducted one-sample t-tests against zero for each channel to identify the channels showing significant class-related IBS with FDR correction (p < 0.05). For all significant channels, one-way ANOVAs were performed to assess differences in class-related, lecture-related, and interaction-related IBS across three types of teaching models. The statistical results were adjusted using FDR correction. For results with a significant main effect of group, post hoc multiple comparisons with Bonferroni correction were performed.

Relationship between IBS and behavioral indices

We first calculated each student’s learning improvements (post-test score minus pre-test score). Next, we used GLM to predict learning improvements from IBS measures and analyzed whether the perceived quantity of teacher–student interaction predicted IBS. We then assessed the significance of the regression coefficient (β) to confirm the predictive power of IBS and the perceived quantity of interaction.

Related Articles

Exploring the potential of LLM to enhance teaching plans through teaching simulation

The introduction of large language models (LLMs) may change future pedagogical practices. Current research mainly focuses on the use of LLMs to tutor students, while the exploration of LLMs’ potential to assist teachers is limited. Taking high school mathematics as an example, we propose a method that utilizes LLMs to enhance the quality of teaching plans through guiding the LLM to simulate teacher-student interactions, generate teaching reflections, and subsequently direct the LLM to refine the teaching plan by integrating these teaching process and reflections. Human evaluation results show that this method significantly elevates the quality of the original teaching plans generated directly by LLM. The improved teaching plans are comparable to high-quality ones crafted by human teachers across various assessment dimensions and knowledge modules. This approach provides a pre-class rehearsal simulation and ideas for teaching plan refinement, offering practical evidence for the widespread application of LLMs in teaching preparation.

On opportunities and challenges of large multimodal foundation models in education

Recently, the option to use large language models as a middleware connecting various AI tools and other large language models led to the development of so-called large multimodal foundation models, which have the power to process spoken text, music, images and videos. In this overview, we explain a new set of opportunities and challenges that arise from the integration of large multimodal foundation models in education.

Effect of Nb contents on the passivation behavior of high-strength anti-seismic rebar in concrete environments

In this study, the surface analysis, cross-section analysis and electrochemical analysis were used to explore the formation mechanism of Nb contents on the passive film of high-strength anti-seismic rebar in simulated concrete pore solution. The passivation experiments confirmed that the addition of Nb promoted the stability and compactness of surface passive film of Nb-containing rebar, and the passivation efficiency of Nb-containing rebar was stronger than that of CS rebar. Firstly, with the decreases of pH, the increases of Nb promoted that the outer layer of the passive film were mainly composed of Fe oxides and Fe oxyhydroxides, the inner layer were mainly composed of Fe oxides and Nb oxides. Secondly, the increases of Nb were beneficial to the formation of Nb oxides, which enhanced the passivation rate of the passive film and inhibited the degradation of Fe oxides, thus enhancing the thickness of surface passive film of Nb-containing rebar.

Enhancing children’s numeracy and executive functions via their explicit integration

Executive functions (EF) are crucial to regulating learning and are predictors of emerging mathematics. However, interventions that leverage EF to improve mathematics remain poorly understood. 193 four-year-olds (mean age = 3 years; 11 months pre-intervention; 111 female, 69% White) were assessed 5 months apart, with 103 children randomised to an integrated EF and mathematics intervention. Our pre-registered hypotheses proposed that the intervention would improve mathematics more than practice as usual. Multi-level modelling and network analyses were applied to the data. The intervention group improved more than the control group in overall numeracy, even when controlling for differences across settings in EF and mathematics-enhancing practices. EF and mathematics measures showed greater interconnectedness post-intervention. In addition, disadvantaged children in the intervention group made greater gains than in the control group. Our findings emphasise the need to consider EFs in their integration with co-developing functions, and in their educational and socio-economic context.

Evaluating large language models in analysing classroom dialogue

This study explores the use of Large Language Models (LLMs), specifically GPT-4, in analysing classroom dialogue—a key task for teaching diagnosis and quality improvement. Traditional qualitative methods are both knowledge- and labour-intensive. This research investigates the potential of LLMs to streamline and enhance this process. Using datasets from middle school mathematics and Chinese classes, classroom dialogues were manually coded by experts and then analysed with a customised GPT-4 model. The study compares manual annotations with GPT-4 outputs to evaluate efficacy. Metrics include time efficiency, inter-coder agreement, and reliability between human coders and GPT-4. Results show significant time savings and high coding consistency between the model and human coders, with minor discrepancies. These findings highlight the strong potential of LLMs in teaching evaluation and facilitation.

Responses

Your email address will not be published. Required fields are marked *