Related Articles

A unified acoustic-to-speech-to-language embedding space captures the neural basis of natural language processing in everyday conversations

This study introduces a unified computational framework connecting acoustic, speech and word-level linguistic structures to study the neural basis of everyday conversations in the human brain. We used electrocorticography to record neural signals across 100 h of speech production and comprehension as participants engaged in open-ended real-life conversations. We extracted low-level acoustic, mid-level speech and contextual word embeddings from a multimodal speech-to-text model (Whisper). We developed encoding models that linearly map these embeddings onto brain activity during speech production and comprehension. Remarkably, this model accurately predicts neural activity at each level of the language processing hierarchy across hours of new conversations not used in training the model. The internal processing hierarchy in the model is aligned with the cortical hierarchy for speech and language processing, where sensory and motor regions better align with the model’s speech embeddings, and higher-level language areas better align with the model’s language embeddings. The Whisper model captures the temporal sequence of language-to-speech encoding before word articulation (speech production) and speech-to-language encoding post articulation (speech comprehension). The embeddings learned by this model outperform symbolic models in capturing neural activity supporting natural speech and language. These findings support a paradigm shift towards unified computational models that capture the entire processing hierarchy for speech comprehension and production in real-world conversations.

Language measures correlate with other measures used to study emotion

Researchers are increasingly using language measures to study emotion, yet less is known about whether language relates to other measures often used to study emotion. Building on previous work which focuses on associations between language and self-report, we test associations between language and a broader range of measures (self-report, observer report, facial cues, vocal cues). Furthermore, we examine associations across different dictionaries (LIWC-22, NRC, Lexical Suite, ANEW, VADER) used to estimate valence (i.e., positive versus negative emotion) or discrete emotions (i.e., anger, fear, sadness) in language. Associations were tested in three large, multimodal datasets (Ns = 193–1856; average word count = 316.7–2782.8). Language consistently related to observer report and consistently related to self-report in two of the three datasets. Statistically significant associations between language and facial cues emerged for language measures of valence but not for language measures of discrete emotions. Language did not consistently show significant associations with vocal cues. Results did not tend to significantly vary across dictionaries. The current research suggests that language measures (in particular, language measures of valence) are correlated with a range of other measures used to study emotion. Therefore, researchers may wish to use language to study emotion when other measures are unavailable or impractical for their research question.

Networks and identity drive the spatial diffusion of linguistic innovation in urban and rural areas

Cultural innovation (e.g., music, beliefs, language) tends to be adopted regionally. The geographic area where innovation is adopted is often attributed to one of two factors: (i) speakers adopting new behaviors that signal their demographic identities (i.e., an identity effect), or (ii) these behaviors spreading through homophilous networks (i.e., a network effect). In this study, we show that network and identity play complementary roles in determining where new language is adopted; thus, modeling the diffusion of lexical innovation requires incorporating both network and identity. We develop an agent-based model of cultural adoption, and validate geographic properties in our simulations against a dataset of innovative words that we identify from a 10% sample of Twitter (e.g., fleeky, birbs, ubering). Using our model, we are able to directly test the roles of network and identity by comparing a model that combines network and identity against simulated network-only and identity-only counterfactuals. We show that both effects influence different mechanisms of diffusion. Specifically, network principally drives spread among urban counties via weak-tie diffusion, while identity plays a disproportionate role in transmission among rural counties via strong-tie diffusion. Diffusion between urban and rural areas, a key component in innovation spreading nationally, requires both network and identity. Our work suggests that models must integrate both factors in order to understand and reproduce the adoption of innovation.

Land use conversion increases network complexity and stability of soil microbial communities in a temperate grassland

Soils harbor highly diverse microbial communities that are critical to soil health, but agriculture has caused extensive land use conversion resulting in negative effects on critical ecosystem processes. However, the responses and adaptations of microbial communities to land use conversion have not yet been understood. Here, we examined the effects of land conversion for long-term crop use on the network complexity and stability of soil microbial communities over 19 months. Despite reduced microbial biodiversity in comparison with native tallgrass prairie, conventionally tilled (CT) cropland significantly increased network complexity such as connectivity, connectance, average clustering coefficient, relative modularity, and the number of species acting at network hubs and connectors as well as resulted in greater temporal variation of complexity indices. Molecular ecological networks under CT cropland became significantly more robust and less vulnerable, overall increasing network stability. The relationship between network complexity and stability was also substantially strengthened due to land use conversion. Lastly, CT cropland decreased the number of relationships between network structure and environmental properties instead being strongly correlated to management disturbances. These results indicate that agricultural disturbance generally increases the complexity and stability of species “interactions”, possibly as a trade-off for biodiversity loss to support ecosystem function when faced with frequent agricultural disturbance.

Machine learning on interictal intracranial EEG predicts surgical outcome in drug resistant epilepsy

Surgical success for patients with focal drug resistant epilepsy (DRE) relies on accurate localization of the epileptogenic zone (EZ). Currently, no exam delineates this zone unambiguously. Instead, the EZ is approximated by the area where seizures begin, which is identified manually through a tedious process that is prone to errors and biases. More importantly, resection of this area does not always predict good surgical outcome. Here, we propose an artificially intelligent, patient-specific framework that automatically identifies the EZ requiring little to no input from clinicians, without having to wait for a seizure to occur. The framework transforms interictal intracranial electroencephalography data into spatiotemporal representations of brain activity discriminating the interictal epileptogenic network from background activity. The epileptogenic network delineates the EZ with high precision and predicts surgical outcome. Our framework eliminates the need for manual data inspection, reduces prolonged monitoring, and enhances surgical planning for DRE patients.

Responses

Your email address will not be published. Required fields are marked *