Research Radar — 2026-05-15

Generated 2026-05-15 09:15 +0800 DeepSeek-V4-Pro Academic articles only

Hot Topic

From Molecules to Migration: New Sensors, Models, and Evaluation Standards

This week's papers span scales from atomic trajectories to cellular migration, united by a demand for better models and evaluation rigor — TrajCast rethinks molecular dynamics, VIS-seq images variant effects at scale, Galvanin reveals an electric-field migration sensor, and multiple papers call out inflated benchmarks in perturbation prediction.

TrajCast (Nature Machine Intelligence) bypasses force fields to directly predict atomic trajectories, enabling 30x longer MD timesteps
VIS-seq (Cell) links thousands of genetic variants to multidimensional cell imaging phenotypes at scale
Galvanin/TMEM154 (Cell) identified as a conserved electric-field sensor for directed cell migration
Two independent papers reveal spurious correlation and nonidentifiability inflating perturbation prediction and BINN benchmarks
TxPert (Nature Biotechnology) achieves near-experimental accuracy for unseen transcriptomic perturbations using knowledge graphs

Methods & AI

Computational

6 selected

Computational #1 READ FULL

TxPert: using multiple knowledge graphs for prediction of transcriptomic perturbation effects

Nature Biotechnology Published 2026-05-01 research article DOI: 10.1038/s41587-026-03113-4

Authors: Rapsomaniki et al.

transcriptomic perturbation knowledge graphs drug discovery deep learning

Summary: TxPert predicts unseen single-gene transcriptomic perturbation effects by integrating multiple knowledge graphs, achieving accuracy approaching split-half experimental reproducibility. The framework generalizes across cell types and perturbation classes without requiring paired pre/post perturbation training data.

Why it matters: Achieving near-experimental accuracy for unseen perturbations is a major advance for in silico drug screening and CRISPR experimental design. Knowledge graph integration provides a principled way to incorporate prior biological knowledge into deep learning.

Why for Yiru: Transcriptomic perturbation prediction with knowledge graph priors directly relevant to modeling drug responses and genetic perturbations in cancer and immune cells — core to computational oncology research.

Computational #2 READ FULL

Spurious correlation inflates performance in single-cell perturbation prediction

bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.723486

Authors: Roohani et al.

perturbation prediction benchmarking single-cell evaluation bias

Summary: Demonstrates that standard evaluation metrics for single-cell perturbation prediction (correlation and cosine similarity of differential expression) are systematically inflated by statistical bias from reusing the same control population. A simple control-splitting procedure removes this bias, substantially reducing previously reported performance — even non-informative methods appear competitive under biased metrics.

Why it matters: This finding demands re-evaluation of published perturbation prediction benchmarks and raises the bar for method development. The field risks optimizing for artifacts rather than biological signal.

Why for Yiru: Methodological rigor in computational biology is essential. Boss's work on perturbation modeling must account for evaluation bias to avoid overoptimistic conclusions.

Computational #3 READ FULL

Task-Specialized Protein Language Models Decode the Sequence Grammar of Post-Translational Modification Sites

bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.723918

Authors: Rives et al.

protein language models post-translational modifications ESM2 fine-tuning

Summary: Fine-tunes ESM2, a 650M-parameter protein language model, for phosphorylation, acetylation, and ubiquitination-site prediction using parameter-efficient fine-tuning with focal loss to handle extreme class imbalance. The task-specialized models reveal sequence grammar governing PTM site selection, identifying motifs that distinguish modified from chemically eligible but unmodified residues at proteome scale.

Why it matters: PTM prediction at proteome scale with PLMs bridges sequence and function. Understanding what makes a residue modified vs. merely chemically eligible is a fundamental biochemical question with implications for signaling biology and drug targeting.

Why for Yiru: Protein language models applied to functional site prediction connect Boss's interests in AI methods with biological mechanism discovery. PTM regulation is central to immune signaling and cancer biology.

Computational #4 READ FULL

Towards the explainability of protein language models

Nature Machine Intelligence Published 2026-05-11 review DOI: 10.1038/s42256-026-01232-w

Authors: Hunklinger and Ferruz

protein language models explainable AI XAI computational biology

Summary: Comprehensive review of explainable artificial intelligence (XAI) methods tailored for protein language models. Covers attention analysis, probing classifiers, concept-based explanations, and structural interpretability approaches, with practical guidance on when each method is appropriate and what biological questions they can address.

Why it matters: As PLMs become central to computational biology, understanding what they learn is critical for trust and scientific discovery. This review provides a roadmap for extracting mechanistic insight rather than just predictive performance.

Why for Yiru: Interpretability of deep learning models in biology is a recurring challenge. The XAI methods surveyed here apply broadly to any biological sequence or structure model Boss might develop or use.

Computational #5 READ FULL

Quantifying Cross-Modal Association Confidence for Single-Cell RNA-ATAC Integration

bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.723400

Authors: Noble et al.

single-cell multi-omics RNA-ATAC integration cross-modal multiome

Summary: Introduces the CLIC (Cross-modality Link Confidence) score, a quantitative measure of empirical concordance between gene expression and nearby chromatin accessibility derived from diverse single-cell multiome datasets. CLIC enables filtering of low-confidence associations that compromise integration accuracy when combining separately profiled scRNA-seq and scATAC-seq data.

Why it matters: Most single-cell data remains unimodal. Better integration of separately profiled modalities is essential for maximizing the value of existing datasets. CLIC provides a principled confidence metric for a problem that's typically treated as uniformly reliable.

Why for Yiru: Multi-omics integration is central to spatial and single-cell analysis. A confidence framework for cross-modal associations directly improves the interpretability of integrated TME atlases.

Computational #6 BROWSE

Empirically determined baseline masking strategies and other considerations for gene-level burden tests

Nature Genetics Published 2026-05-08 research article DOI: 10.1038/s41588-026-02597-9

Authors: Neale et al.

rare variant association burden tests UK Biobank statistical genetics

Summary: Systematic comparison of different masking approaches for rare variant association tests across 54 traits in UK Biobank identifies optimal strategies that increase study power and replicability. Provides practical guidance for baseline masking, variant annotation, and covariate adjustment in gene-level burden analyses.

Why it matters: Rare variant association testing is a cornerstone of statistical genetics but methodological choices dramatically affect power. Empirically grounded best practices from this scale of analysis are highly actionable.

Why for Yiru: Statistical genetics methodology is relevant to understanding the genetic architecture of immune-related traits and cancer susceptibility, which contextualize Boss's translational research.

Biomedical discoveries

Biomedicine

5 selected

Biomedicine #1 READ FULL

Image-based, pooled phenotyping reveals multidimensional, disease-specific variant effects

Cell Published 2026-05-12 research article DOI:

Authors: Blainey et al.

variant effect imaging phenotyping LMNA PTEN functional genomics

Summary: VIS-seq (Variant in situ sequencing) links genetic variants to cell images at scale, revealing how thousands of LMNA and PTEN variants affect molecules, subcellular structures, and cellular phenotypes. The multidimensional phenotypic continuum revealed by imaging is not recapitulated by any single functional readout, demonstrating that variants produce complex, disease-specific phenotypic signatures.

Why it matters: Moves variant effect prediction beyond unidimensional functional scores to rich phenotypic profiles. The finding that no single assay captures the full variant effect has profound implications for clinical variant interpretation and deep mutational scanning.

Why for Yiru: Image-based phenotyping at scale connects to spatial omics interests. Understanding how genetic variants reshape cellular phenotypes is fundamental to cancer biology and personalized medicine.

Biomedicine #2 READ FULL

Galvanin (TMEM154) is an electric-field sensor for directed cell migration

Cell Published 2026-05-12 research article DOI:

Authors: Zhao et al.

cell migration electrotaxis galvanotaxis TMEM154 membrane protein

Summary: Identifies TMEM154 (renamed Galvanin) as a conserved transmembrane protein that functions as a direct electric-field sensor, linking extracellular electrical cues to directed cell migration. Galvanin mediates galvanotaxis in rapidly migrating cells including immune cells, revealing a molecular mechanism for how endogenous electric fields guide cell movement during development, wound healing, and immune responses.

Why it matters: The molecular identity of the electric-field sensor has been a decades-old mystery in cell biology. Galvanin's discovery opens a new axis of cell guidance with implications for wound healing, development, and immune cell trafficking.

Why for Yiru: Immune cell migration and trafficking are central to tumor immunology. A molecular electric-field sensor may explain how immune cells navigate complex tissue environments, including tumors.

Biomedicine #3 READ FULL

Antibody Blockade of Ly49/MHC-I interactions enhances Innate and Adaptive Immunity Against Cancer Metastasis

bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.722994

Authors: Mandelboim et al.

NK cells cancer immunotherapy MHC-I Ly49 metastasis immune checkpoint

Summary: Demonstrates that antibody-mediated blockade of Ly49/MHC-I interactions enhances both innate NK cell and adaptive T cell immunity against checkpoint inhibitor-resistant pancreatic and melanoma metastasis. Cryo-EM and X-ray crystallography reveal the structural basis of M1/42 antibody binding, while in vivo models show reduced metastatic burden with enhanced immune infiltration.

Why it matters: Ly49/MHC-I blockade represents a novel immune checkpoint axis distinct from PD-1/PD-L1 and CTLA-4, with potential to overcome resistance to existing immunotherapies in metastatic settings.

Why for Yiru: NK cell-based immunotherapy and novel immune checkpoint targets are highly relevant to tumor immunology research. The structural immunology and in vivo metastasis models provide a comprehensive mechanistic picture.

Biomedicine #4 READ FULL

The gut microbiota metabolite Urolithin A mitigates JAK signaling to suppress cytokine-mediated autoimmune diseases

bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.723914

Authors: Chen et al.

JAK signaling urolithin A autoimmune disease gut microbiota metabolite

Summary: Characterizes Urolithin A, a natural gut microbiota-derived metabolite, as a direct JAK1 inhibitor that broadly dampens JAK-STAT signaling induced by type I/II interferons and IL-6. UA binds the JAK1 JH1 domain and attenuates autoimmune pathogenesis in Trex1-KO mice and imiquimod-induced SLE and psoriasis models.

Why it matters: A gut-derived natural metabolite functioning as an endogenous JAK inhibitor opens new therapeutic possibilities for autoimmune diseases. The microbiota-immune axis mediated by specific metabolites is an emerging frontier.

Why for Yiru: Immunometabolism and the gut-immune axis are increasingly recognized as modulators of tumor immunity and immunotherapy response. JAK-STAT signaling is also central to cancer inflammation.

Biomedicine #5 BROWSE

TMEM119+ microglia MHC class I restricted antigen presentation impacts CD8 T cell memory, effector status, and blood-brain barrier disruption during neurotropic virus infection

bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.722741

Authors: Klein et al.

microglia MHC-I CD8 T cells neuroimmunology antigen presentation

Summary: Reveals that TMEM119+ microglia present antigen via MHC class I to CD8 T cells during neurotropic virus infection, shaping T cell memory formation, effector function, and blood-brain barrier integrity. Microglial MHC-I expression is dynamically regulated and its loss alters the outcome of CNS infection.

Why it matters: Microglia are traditionally viewed as innate immune cells; this study demonstrates they directly bridge to adaptive immunity through MHC-I-restricted antigen presentation, redefining their role in CNS immune surveillance.

Why for Yiru: The intersection of innate and adaptive immunity, antigen presentation by tissue-resident cells, and T cell memory formation in specialized niches parallels questions about immune function in the tumor microenvironment.

Cross-disciplinary watchlist

Other Fields

5 selected

Field #1 READ FULL

Peripheral control enabled by distributed sensing in an octopus-inspired soft robotic arm for autonomous underwater grasping

Nature Machine Intelligence Published 2026-05-12 research article DOI: 10.1038/s42256-026-01230-y

Authors: Del Dottore et al.

soft robotics embodied intelligence distributed sensing autonomous systems

Summary: Presents an octopus-inspired soft robotic arm that uses optoelectronic mechanosensors embedded in suction cups to detect contact forces and infer object positions, enabling autonomous underwater grasping without centralized processing. The distributed sensing architecture mimics the peripheral nervous system of octopuses, achieving robust manipulation in unstructured environments.

Why it matters: Embodied intelligence in soft robotics — where sensing and computation are distributed throughout the body rather than centralized — represents a fundamental shift from traditional AI-robotics architectures and could enable robots that operate in complex real-world environments.

Why for Yiru: Bio-inspired distributed computation and sensing architectures offer conceptual parallels to how biological systems process spatial information. The octopus nervous system is a model of decentralized intelligence relevant to thinking about multi-scale biological data integration.

Field #2 READ FULL

Force-free molecular dynamics through autoregressive equivariant networks

Nature Machine Intelligence Published 2026-05-05 research article DOI: 10.1038/s42256-026-01227-7

Authors: Thiemann et al.

molecular dynamics equivariant networks autoregressive models materials science

Summary: Introduces TrajCast, a neural network that bypasses force calculations to directly predict atomic trajectories using autoregressive equivariant networks. TrajCast enables molecular dynamics timesteps up to 30 times longer than conventional integrators while accurately reproducing physical properties of molecules and materials.

Why it matters: Molecular dynamics simulations are limited by femtosecond timesteps. A 30x acceleration through learned trajectory prediction could transform materials discovery, protein dynamics, and drug binding simulations.

Why for Yiru: Though applied to materials, the autoregressive trajectory prediction framework has direct analogies to modeling cellular state transitions and disease progression trajectories — core interests in computational biology.

Field #3 READ FULL

Platonic representation of foundation machine learning interatomic potentials

Nature Machine Intelligence Published 2026-05-07 research article DOI: 10.1038/s42256-026-01235-7

Authors: Li and Walsh

foundation models interatomic potentials representation learning materials science

Summary: Demonstrates that a unified 'Platonic' geometry emerges across the latent representations of independently trained foundation models for learning interatomic potentials. This shared structure enables cross-model comparison, embedding arithmetic, and ground-truth-free diagnostics — suggesting that these models converge to a universal representation of atomic environments.

Why it matters: The discovery of convergent representations across independent foundation models echoes the Platonic representation hypothesis in LLMs, suggesting deep structural regularities in how neural networks learn physical systems. This has implications for model interoperability and transfer learning in scientific ML.

Why for Yiru: Foundation model representation learning and the Platonic representation hypothesis are directly relevant to biological foundation models for single-cell and spatial omics, where convergence across models could enable universal cell state representations.

Field #4 READ FULL

Learning the chemical language of natural products

Nature Machine Intelligence Published 2026-05-07 research article DOI: 10.1038/s42256-026-01241-9

Authors: Stokes et al.

natural products chemical language models drug discovery foundation models

Summary: Develops a foundation model for natural product chemistry that learns biosynthetic grammar — predicting enzymatic transformations, scaffold relationships, and bioactivity from molecular representations. The model captures the chemical language of secondary metabolism, enabling mining of uncharted natural product space.

Why it matters: Natural products remain a rich source of drugs but their chemical space is poorly mapped. A language model that captures biosynthetic rules could accelerate natural product discovery by orders of magnitude.

Why for Yiru: Chemical language modeling with a foundation model framing connects to broader interests in learning biological sequence-structure-function relationships. The natural product grammar concept has parallels to gene regulatory grammar.

Field #5 BROWSE

Re-thinking human–machine interaction and the governance of AI in the military domain

Nature Machine Intelligence Published 2026-05-11 perspective DOI: 10.1038/s42256-026-01231-x

Authors: Bode and Chandler

AI governance military AI human-machine interaction AI policy

Summary: Analyzes how human-machine interactions across the AI lifecycle — from development and testing to deployment and oversight — affect meaningful human control and decision-making in military applications. Argues that governance frameworks must account for the full spectrum of human-AI interaction, not just the moment of autonomous decision-making.

Why it matters: As AI systems become embedded in high-stakes domains, understanding the governance of human-AI interaction is critical beyond military contexts — it applies to clinical AI, autonomous vehicles, and scientific AI assistants.

Why for Yiru: AI governance and human-AI interaction design are increasingly relevant to the responsible deployment of AI in biomedical research and clinical decision support systems.

Friday delivery

BioTech News Delivery

5 selected

BioTech #1 READ FULL

Five burning Qs for Isomorphic Labs after $2.1B raise for AI biotech

Endpoints News Published 2026-05-12 industry news DOI:

Authors: Endpoints News

AI drug discovery Isomorphic Labs biotech financing AlphaFold

Summary: Isomorphic Labs, the Alphabet subsidiary applying AlphaFold-derived AI to drug discovery, raises $2.1 billion in one of the largest AI-biotech financings ever. The raise signals enormous investor conviction in AI-native drug discovery platforms, but key questions remain about pipeline maturity, clinical validation timelines, and the AlphaFold-to-drug translation gap.

Why it matters: A $2.1B raise for an AI drug discovery company is a landmark event that will shape investment flows and talent migration across the AI-biotech ecosystem. Isomorphic's progress is a bellwether for whether computational protein structure prediction translates to therapeutic impact.

Why for Yiru: AI-driven drug discovery from structure prediction to therapeutic candidates represents the frontier where Boss's computational biology expertise meets translational impact.

BioTech #2 BROWSE

Bristol Myers joins Hengrui party in 13-asset deal worth up to $15.2B

Endpoints News Published 2026-05-12 industry news DOI:

Authors: Endpoints News

drug licensing Bristol Myers Squibb Hengrui pharma deal China biotech

Summary: Bristol Myers Squibb licenses 13 preclinical and clinical assets from China's Hengrui Medicine in a deal valued at up to $15.2 billion, continuing the trend of Western pharma companies sourcing innovation from Chinese biotech pipelines across oncology, immunology, and metabolic disease.

Why it matters: The scale of this deal underscores the growing role of Chinese biotech in global pharmaceutical pipelines. The flow of assets from East to West is reshaping drug development economics and competitive dynamics.

Why for Yiru: The globalization of drug development and the increasing prominence of Chinese biotech innovation has implications for the research ecosystem Boss operates within, including collaboration opportunities and competitive awareness.

BioTech #3 READ FULL

Flagship startup Serif says it has solved non-viral gene therapy in monkeys

Endpoints News Published 2026-05-12 industry news DOI:

Authors: Endpoints News

gene therapy non-viral delivery Flagship Pioneering Serif Therapeutics

Summary: Serif Therapeutics, a Flagship Pioneering startup, claims to have achieved durable therapeutic gene expression in non-human primates using a fully non-viral delivery platform — a milestone that has eluded the field for decades. If validated, non-viral gene therapy could dramatically reduce cost and immune complications compared to AAV and LNP approaches.

Why it matters: Non-viral gene therapy that works in primates would be a paradigm shift, removing the manufacturing complexity, immunogenicity, and size constraints of viral vectors. This could democratize genetic medicine.

Why for Yiru: Gene therapy delivery technologies have implications for engineering immune cells, creating disease models, and developing next-generation cell therapies for cancer — all areas adjacent to Boss's research.

BioTech #4 BROWSE

STAT+: Capsida says it still doesn't know what caused gene therapy death

STAT News Published 2026-05-12 industry news DOI:

Authors: STAT News

gene therapy safety Capsida clinical trial AAV

Summary: Capsida Biotherapeutics discloses that the cause of a patient death in its gene therapy clinical trial remains unknown months after the event, raising concerns about AAV safety and the adequacy of preclinical models for predicting rare toxicities. The company continues to investigate while the clinical hold remains in place.

Why it matters: Gene therapy safety remains the field's most critical challenge. A death with no identified mechanism undermines confidence in preclinical safety assessment and highlights the need for better predictive models.

Why for Yiru: Understanding gene therapy toxicities through better preclinical models is a systems biology challenge where computational approaches — including single-cell and spatial methods — could contribute to mechanistic understanding.

BioTech #5 BROWSE

Cirena Licenses Long RNA Purification Tech From Agilent

GenomeWeb Published 2026-05-09 industry news DOI:

Authors: GenomeWeb

RNA purification long-read sequencing Cirena Agilent technology licensing

Summary: Cirena licenses Agilent's long RNA purification technology to improve sample preparation for long-read RNA sequencing, aiming to enable full-length transcript analysis at higher throughput and quality for clinical and research applications.

Why it matters: Long-read RNA sequencing requires high-quality full-length RNA. Improved purification technology addresses a critical bottleneck in the long-read workflow, which is essential for detecting isoform diversity, fusion transcripts, and RNA modifications.

Why for Yiru: Long-read sequencing technologies complement spatial and single-cell transcriptomics by resolving isoform-level biology that short-read methods miss — relevant to building more complete molecular atlases of tissues and tumors.