Volume 6 Issue 4, April 2024

Generating turbulence trajectories with diffusion models

Diffusion models can be used to generate intricate and detailed particle paths in turbulent flows, reflecting the complex nature of fluid motion. By means of statistical analysis, Li et al. show that diffusion models can capture the full complexity of turbulent dynamics and generalize to extreme events.

See Li et al.

Image: Michele Buzzicotti, University of Rome Tor Vergata. Cover design: Amie Fernandez

Editorial

The rewards of reusable machine learning code

Research papers can make a long-lasting impact when the code and software tools supporting the findings are made readily available and can be reused and built on. Our reusability reports explore and highlight examples of good code sharing practices.

Editorial 24 Apr 2024

Advertisement

Top of page ⤴

Comment & Opinion

Federated learning is not a cure-all for data ethics

Although federated learning is often seen as a promising solution to allow AI innovation while addressing privacy concerns, we argue that this technology does not fix all underlying data ethics concerns. Benefiting from federated learning in digital health requires acknowledgement of its limitations.

Marieke Bak
Vince I. Madai
Stuart McLennan
Comment 18 Mar 2024
The curious case of the test set AUROC

The area under the receiver operating characteristic curve (AUROC) of the test set is used throughout machine learning (ML) for assessing a model’s performance. However, when concordance is not the only ambition, this gives only a partial insight into performance, masking distribution shifts of model outputs and model instability.

Michael Roberts
Alon Hazan
Carola-Bibiane Schönlieb
Comment 04 Apr 2024
Dangers of speech technology for workplace diversity

Speech technology offers many applications to enhance employee productivity and efficiency. Yet new dangers arise for marginalized groups, potentially jeopardizing organizational efforts to promote workplace diversity. Our analysis delves into three critical risks of speech technology and offers guidance for mitigating these risks responsibly.

Mike Horia Mihail Teodorescu
Mingang K. Geiger
Lily Morse
Comment 22 Apr 2024

Top of page ⤴

News & Views

Artificial intelligence tackles the nature–nurture debate

A classic question in cognitive science is whether learning requires innate, domain-specific inductive biases to solve visual tasks. A recent study trained machine-learning systems on the first-person visual experiences of children to show that visual knowledge can be learned in the absence of innate inductive biases about objects or space.

Justin N. Wood
News & Views 19 Apr 2024

Top of page ⤴

Reviews

The benefits, risks and bounds of personalizing the alignment of large language models to individuals

Tailoring the alignment of large language models (LLMs) to individuals is a new frontier in generative AI, but unbounded personalization can bring potential harm, such as large-scale profiling, privacy infringement and bias reinforcement. Kirk et al. develop a taxonomy for risks and benefits of personalized LLMs and discuss the need for normative decisions on what are acceptable bounds of personalization.

Hannah Rose Kirk
Bertie Vidgen
Scott A. Hale
Perspective 23 Apr 2024

Top of page ⤴

Research

Synthetic Lagrangian turbulence by generative diffusion models

Modelling the statistical and geometrical properties of particle trajectories in turbulent flows is key to many scientific and technological applications. Li and colleagues introduce a data-driven diffusion model that can generate high-Reynolds-number Lagrangian turbulence trajectories with statistical properties consistent with those of the training set and even generalize to rare, intense events unseen during training.

T. Li
L. Biferale
M. Buzzicotti
Article Open Access 17 Apr 2024
Tandem mass spectrum prediction for small molecules using graph transformers

Identifying compounds in tandem mass spectrometry requires extensive databases of known compounds or computational methods to simulate spectra for samples not found in databases. Simulating tandem mass spectra is still challenging, and long-range connections in particular are difficult to model for graph neural networks. Young and colleagues use a graph transformer model to learn patterns of long-distance relations between atoms and molecules.

Adamo Young
Hannes Röst
Bo Wang
Article 05 Apr 2024
Equivariant 3D-conditional diffusion model for molecular linker design

Fragment-based molecular design uses chemical motifs and combines them into bio-active compounds. While this approach has grown in capability, molecular linker methods are restricted to linking fragments one by one, which makes the search for effective combinations harder. Igashov and colleagues use a conditional diffusion model to link multiple fragments in a one-shot generative process.

Ilia Igashov
Hannes Stärk
Bruno Correia
Article Open Access 11 Apr 2024
Geometry-enhanced pretraining on interatomic potentials

Using machine learning methods to model interatomic potentials enables molecular dynamics simulations with ab initio level accuracy at a relatively low computational cost, but requires a large number of labelled training data obtained through expensive ab initio computations. Cui and colleagues propose a geometric learning framework that leverages self-supervised learning pretraining to enhance existing machine learning based interatomic potential models at a negligible additional computational cost.

Taoyong Cui
Chenyu Tang
Wanli Ouyang
Article 05 Apr 2024
Invalid SMILES are beneficial rather than detrimental to chemical language models

Generative models for chemical structures are often trained to create output in the common SMILES notation. Michael Skinnider shows that training models with the goal of avoiding the generation of incorrect SMILES strings is detrimental to learning other chemical properties and that allowing models to generate incorrect molecules, which can be easily removed post hoc, leads to better performing models.

Michael A. Skinnider
Article Open Access 29 Mar 2024
A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions

The 5′ untranslated region is a critical regulatory region of mRNA, influencing gene expression regulation and translation. Chu, Yu and colleagues develop a language model for analysing untranslated regions of mRNA. The model, pretrained on data from diverse species, enhances the prediction of mRNA translation activities and has implications for new vaccine design.

Yanyi Chu
Dan Yu
Mengdi Wang
Article 05 Apr 2024
Reusability report: Uncovering associations in biomedical bipartite networks via a bilinear attention network with domain adaptation

In early 2023, Bai and colleagues presented DrugBAN, an interpretable method for drug–target prediction. In this Reusability Report, Xu and colleagues reproduce the original findings and provide a careful exploration of cross-domain adaptability.

Tao Xu
Haoyuan Shi
Zhenyu Yue
Article 04 Apr 2024
A neural speech decoding framework leveraging deep learning and speech synthesis

Recent research has focused on restoring speech in populations with neurological deficits. Chen, Wang et al. develop a framework for decoding speech from neural signals, which could lead to innovative speech prostheses.

Xupeng Chen
Ran Wang
Adeen Flinker
Article Open Access 08 Apr 2024
The synergy complement control approach for seamless limb-driven prostheses

Current limb-driven methods often result in suboptimal prosthetic motions. Kühn and colleagues develop a framework called synergy complement control (SCC) that advances prosthetics by learning ‘cyborg’ limb-driven control, ensuring natural coordination. Validated in diverse trials, SCC offers reliable and intuitive enhancement for limb functionality.

Johannes Kühn
Tingli Hu
Sami Haddadin
Article Open Access 19 Apr 2024

Top of page ⤴

Amendments & Corrections

Author Correction: A soft robot that adapts to environments through shape change

Dylan S. Shah
Joshua P. Powers
Rebecca Kramer-Bottiglio
Author Correction 01 Apr 2024
Publisher Correction: The curious case of the test set AUROC

Michael Roberts
Alon Hazan
Carola-Bibiane Schönlieb
Publisher Correction 12 Apr 2024

Top of page ⤴

Volume 6 Issue 4, April 2024

Generating turbulence trajectories with diffusion models

Editorial

The rewards of reusable machine learning code

Comment & Opinion

Federated learning is not a cure-all for data ethics

The curious case of the test set AUROC

Dangers of speech technology for workplace diversity

News & Views

Artificial intelligence tackles the nature–nurture debate

Reviews

The benefits, risks and bounds of personalizing the alignment of large language models to individuals

Research

Synthetic Lagrangian turbulence by generative diffusion models

Tandem mass spectrum prediction for small molecules using graph transformers

Equivariant 3D-conditional diffusion model for molecular linker design

Geometry-enhanced pretraining on interatomic potentials

Invalid SMILES are beneficial rather than detrimental to chemical language models

A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions

Reusability report: Uncovering associations in biomedical bipartite networks via a bilinear attention network with domain adaptation

A neural speech decoding framework leveraging deep learning and speech synthesis

The synergy complement control approach for seamless limb-driven prostheses

Amendments & Corrections

Author Correction: A soft robot that adapts to environments through shape change

Publisher Correction: The curious case of the test set AUROC

Search

Quick links