Research Radar — 2026-05-15
Methods & AI
Computational
TxPert: using multiple knowledge graphs for prediction of transcriptomic perturbation effects
Nature Biotechnology Published 2026-05-01 research article DOI: 10.1038/s41587-026-03113-4
transcriptomic perturbation knowledge graphs drug discovery deep learning
Summary: TxPert predicts unseen single-gene transcriptomic perturbation effects by integrating multiple knowledge graphs, achieving accuracy approaching split-half experimental reproducibility. The framework generalizes across cell types and perturbation classes without requiring paired pre/post perturbation training data.
Why it matters: Achieving near-experimental accuracy for unseen perturbations is a major advance for in silico drug screening and CRISPR experimental design. Knowledge graph integration provides a principled way to incorporate prior biological knowledge into deep learning.
Why for Yiru: Transcriptomic perturbation prediction with knowledge graph priors directly relevant to modeling drug responses and genetic perturbations in cancer and immune cells — core to computational oncology research.
Spurious correlation inflates performance in single-cell perturbation prediction
bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.723486
perturbation prediction benchmarking single-cell evaluation bias
Summary: Demonstrates that standard evaluation metrics for single-cell perturbation prediction (correlation and cosine similarity of differential expression) are systematically inflated by statistical bias from reusing the same control population. A simple control-splitting procedure removes this bias, substantially reducing previously reported performance — even non-informative methods appear competitive under biased metrics.
Why it matters: This finding demands re-evaluation of published perturbation prediction benchmarks and raises the bar for method development. The field risks optimizing for artifacts rather than biological signal.
Why for Yiru: Methodological rigor in computational biology is essential. Boss's work on perturbation modeling must account for evaluation bias to avoid overoptimistic conclusions.
Task-Specialized Protein Language Models Decode the Sequence Grammar of Post-Translational Modification Sites
bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.723918
protein language models post-translational modifications ESM2 fine-tuning
Summary: Fine-tunes ESM2, a 650M-parameter protein language model, for phosphorylation, acetylation, and ubiquitination-site prediction using parameter-efficient fine-tuning with focal loss to handle extreme class imbalance. The task-specialized models reveal sequence grammar governing PTM site selection, identifying motifs that distinguish modified from chemically eligible but unmodified residues at proteome scale.
Why it matters: PTM prediction at proteome scale with PLMs bridges sequence and function. Understanding what makes a residue modified vs. merely chemically eligible is a fundamental biochemical question with implications for signaling biology and drug targeting.
Why for Yiru: Protein language models applied to functional site prediction connect Boss's interests in AI methods with biological mechanism discovery. PTM regulation is central to immune signaling and cancer biology.
Towards the explainability of protein language models
Nature Machine Intelligence Published 2026-05-11 review DOI: 10.1038/s42256-026-01232-w
protein language models explainable AI XAI computational biology
Summary: Comprehensive review of explainable artificial intelligence (XAI) methods tailored for protein language models. Covers attention analysis, probing classifiers, concept-based explanations, and structural interpretability approaches, with practical guidance on when each method is appropriate and what biological questions they can address.
Why it matters: As PLMs become central to computational biology, understanding what they learn is critical for trust and scientific discovery. This review provides a roadmap for extracting mechanistic insight rather than just predictive performance.
Why for Yiru: Interpretability of deep learning models in biology is a recurring challenge. The XAI methods surveyed here apply broadly to any biological sequence or structure model Boss might develop or use.
Quantifying Cross-Modal Association Confidence for Single-Cell RNA-ATAC Integration
bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.723400
single-cell multi-omics RNA-ATAC integration cross-modal multiome
Summary: Introduces the CLIC (Cross-modality Link Confidence) score, a quantitative measure of empirical concordance between gene expression and nearby chromatin accessibility derived from diverse single-cell multiome datasets. CLIC enables filtering of low-confidence associations that compromise integration accuracy when combining separately profiled scRNA-seq and scATAC-seq data.
Why it matters: Most single-cell data remains unimodal. Better integration of separately profiled modalities is essential for maximizing the value of existing datasets. CLIC provides a principled confidence metric for a problem that's typically treated as uniformly reliable.
Why for Yiru: Multi-omics integration is central to spatial and single-cell analysis. A confidence framework for cross-modal associations directly improves the interpretability of integrated TME atlases.
Empirically determined baseline masking strategies and other considerations for gene-level burden tests
Nature Genetics Published 2026-05-08 research article DOI: 10.1038/s41588-026-02597-9
rare variant association burden tests UK Biobank statistical genetics
Summary: Systematic comparison of different masking approaches for rare variant association tests across 54 traits in UK Biobank identifies optimal strategies that increase study power and replicability. Provides practical guidance for baseline masking, variant annotation, and covariate adjustment in gene-level burden analyses.
Why it matters: Rare variant association testing is a cornerstone of statistical genetics but methodological choices dramatically affect power. Empirically grounded best practices from this scale of analysis are highly actionable.
Why for Yiru: Statistical genetics methodology is relevant to understanding the genetic architecture of immune-related traits and cancer susceptibility, which contextualize Boss's translational research.
Biomedical discoveries
Biomedicine
Image-based, pooled phenotyping reveals multidimensional, disease-specific variant effects
Cell Published 2026-05-12 research article DOI:
variant effect imaging phenotyping LMNA PTEN functional genomics
Summary: VIS-seq (Variant in situ sequencing) links genetic variants to cell images at scale, revealing how thousands of LMNA and PTEN variants affect molecules, subcellular structures, and cellular phenotypes. The multidimensional phenotypic continuum revealed by imaging is not recapitulated by any single functional readout, demonstrating that variants produce complex, disease-specific phenotypic signatures.
Why it matters: Moves variant effect prediction beyond unidimensional functional scores to rich phenotypic profiles. The finding that no single assay captures the full variant effect has profound implications for clinical variant interpretation and deep mutational scanning.
Why for Yiru: Image-based phenotyping at scale connects to spatial omics interests. Understanding how genetic variants reshape cellular phenotypes is fundamental to cancer biology and personalized medicine.
Galvanin (TMEM154) is an electric-field sensor for directed cell migration
Cell Published 2026-05-12 research article DOI:
cell migration electrotaxis galvanotaxis TMEM154 membrane protein
Summary: Identifies TMEM154 (renamed Galvanin) as a conserved transmembrane protein that functions as a direct electric-field sensor, linking extracellular electrical cues to directed cell migration. Galvanin mediates galvanotaxis in rapidly migrating cells including immune cells, revealing a molecular mechanism for how endogenous electric fields guide cell movement during development, wound healing, and immune responses.
Why it matters: The molecular identity of the electric-field sensor has been a decades-old mystery in cell biology. Galvanin's discovery opens a new axis of cell guidance with implications for wound healing, development, and immune cell trafficking.
Why for Yiru: Immune cell migration and trafficking are central to tumor immunology. A molecular electric-field sensor may explain how immune cells navigate complex tissue environments, including tumors.
Antibody Blockade of Ly49/MHC-I interactions enhances Innate and Adaptive Immunity Against Cancer Metastasis
bioRxiv Published 2026-05-07 preprint DOI: 10.1101/2026.05.07.722994
NK cells cancer immunotherapy MHC-I Ly49 metastasis immune checkpoint
Summary: Demonstrates that antibody-mediated blockade of Ly49/MHC-I interactions enhances both innate NK cell and adaptive T cell immunity against checkpoint inhibitor-resistant pancreatic and melanoma metastasis. Cryo-EM and X-ray crystallography reveal the structural basis of M1/42 antibody binding, while in vivo models show reduced metastatic burden with enhanced immune infiltration.
Why it matters: Ly49/MHC-I blockade represents a novel immune checkpoint axis distinct from PD-1/PD-L1 and CTLA-4, with potential to overcome resistance to existing immunotherapies in metastatic settings.
Why for Yiru: NK cell-based immunotherapy and novel immune checkpoint targets are highly relevant to tumor immunology research. The structural immunology and in vivo metastasis models provide a comprehensive mechanistic picture.
The gut microbiota metabolite Urolithin A mitigates JAK signaling to suppress cytokine-mediated autoimmune diseases
bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.723914
JAK signaling urolithin A autoimmune disease gut microbiota metabolite
Summary: Characterizes Urolithin A, a natural gut microbiota-derived metabolite, as a direct JAK1 inhibitor that broadly dampens JAK-STAT signaling induced by type I/II interferons and IL-6. UA binds the JAK1 JH1 domain and attenuates autoimmune pathogenesis in Trex1-KO mice and imiquimod-induced SLE and psoriasis models.
Why it matters: A gut-derived natural metabolite functioning as an endogenous JAK inhibitor opens new therapeutic possibilities for autoimmune diseases. The microbiota-immune axis mediated by specific metabolites is an emerging frontier.
Why for Yiru: Immunometabolism and the gut-immune axis are increasingly recognized as modulators of tumor immunity and immunotherapy response. JAK-STAT signaling is also central to cancer inflammation.
TMEM119+ microglia MHC class I restricted antigen presentation impacts CD8 T cell memory, effector status, and blood-brain barrier disruption during neurotropic virus infection
bioRxiv Published 2026-05-08 preprint DOI: 10.1101/2026.05.08.722741
microglia MHC-I CD8 T cells neuroimmunology antigen presentation
Summary: Reveals that TMEM119+ microglia present antigen via MHC class I to CD8 T cells during neurotropic virus infection, shaping T cell memory formation, effector function, and blood-brain barrier integrity. Microglial MHC-I expression is dynamically regulated and its loss alters the outcome of CNS infection.
Why it matters: Microglia are traditionally viewed as innate immune cells; this study demonstrates they directly bridge to adaptive immunity through MHC-I-restricted antigen presentation, redefining their role in CNS immune surveillance.
Why for Yiru: The intersection of innate and adaptive immunity, antigen presentation by tissue-resident cells, and T cell memory formation in specialized niches parallels questions about immune function in the tumor microenvironment.
Cross-disciplinary watchlist
Other Fields
Peripheral control enabled by distributed sensing in an octopus-inspired soft robotic arm for autonomous underwater grasping
Nature Machine Intelligence Published 2026-05-12 research article DOI: 10.1038/s42256-026-01230-y
soft robotics embodied intelligence distributed sensing autonomous systems
Summary: Presents an octopus-inspired soft robotic arm that uses optoelectronic mechanosensors embedded in suction cups to detect contact forces and infer object positions, enabling autonomous underwater grasping without centralized processing. The distributed sensing architecture mimics the peripheral nervous system of octopuses, achieving robust manipulation in unstructured environments.
Why it matters: Embodied intelligence in soft robotics — where sensing and computation are distributed throughout the body rather than centralized — represents a fundamental shift from traditional AI-robotics architectures and could enable robots that operate in complex real-world environments.
Why for Yiru: Bio-inspired distributed computation and sensing architectures offer conceptual parallels to how biological systems process spatial information. The octopus nervous system is a model of decentralized intelligence relevant to thinking about multi-scale biological data integration.
Force-free molecular dynamics through autoregressive equivariant networks
Nature Machine Intelligence Published 2026-05-05 research article DOI: 10.1038/s42256-026-01227-7
molecular dynamics equivariant networks autoregressive models materials science
Summary: Introduces TrajCast, a neural network that bypasses force calculations to directly predict atomic trajectories using autoregressive equivariant networks. TrajCast enables molecular dynamics timesteps up to 30 times longer than conventional integrators while accurately reproducing physical properties of molecules and materials.
Why it matters: Molecular dynamics simulations are limited by femtosecond timesteps. A 30x acceleration through learned trajectory prediction could transform materials discovery, protein dynamics, and drug binding simulations.
Why for Yiru: Though applied to materials, the autoregressive trajectory prediction framework has direct analogies to modeling cellular state transitions and disease progression trajectories — core interests in computational biology.
Platonic representation of foundation machine learning interatomic potentials
Nature Machine Intelligence Published 2026-05-07 research article DOI: 10.1038/s42256-026-01235-7
foundation models interatomic potentials representation learning materials science
Summary: Demonstrates that a unified 'Platonic' geometry emerges across the latent representations of independently trained foundation models for learning interatomic potentials. This shared structure enables cross-model comparison, embedding arithmetic, and ground-truth-free diagnostics — suggesting that these models converge to a universal representation of atomic environments.
Why it matters: The discovery of convergent representations across independent foundation models echoes the Platonic representation hypothesis in LLMs, suggesting deep structural regularities in how neural networks learn physical systems. This has implications for model interoperability and transfer learning in scientific ML.
Why for Yiru: Foundation model representation learning and the Platonic representation hypothesis are directly relevant to biological foundation models for single-cell and spatial omics, where convergence across models could enable universal cell state representations.
Learning the chemical language of natural products
Nature Machine Intelligence Published 2026-05-07 research article DOI: 10.1038/s42256-026-01241-9
natural products chemical language models drug discovery foundation models
Summary: Develops a foundation model for natural product chemistry that learns biosynthetic grammar — predicting enzymatic transformations, scaffold relationships, and bioactivity from molecular representations. The model captures the chemical language of secondary metabolism, enabling mining of uncharted natural product space.
Why it matters: Natural products remain a rich source of drugs but their chemical space is poorly mapped. A language model that captures biosynthetic rules could accelerate natural product discovery by orders of magnitude.
Why for Yiru: Chemical language modeling with a foundation model framing connects to broader interests in learning biological sequence-structure-function relationships. The natural product grammar concept has parallels to gene regulatory grammar.
Re-thinking human–machine interaction and the governance of AI in the military domain
Nature Machine Intelligence Published 2026-05-11 perspective DOI: 10.1038/s42256-026-01231-x
AI governance military AI human-machine interaction AI policy
Summary: Analyzes how human-machine interactions across the AI lifecycle — from development and testing to deployment and oversight — affect meaningful human control and decision-making in military applications. Argues that governance frameworks must account for the full spectrum of human-AI interaction, not just the moment of autonomous decision-making.
Why it matters: As AI systems become embedded in high-stakes domains, understanding the governance of human-AI interaction is critical beyond military contexts — it applies to clinical AI, autonomous vehicles, and scientific AI assistants.
Why for Yiru: AI governance and human-AI interaction design are increasingly relevant to the responsible deployment of AI in biomedical research and clinical decision support systems.
Friday delivery
BioTech News Delivery
Five burning Qs for Isomorphic Labs after $2.1B raise for AI biotech
Endpoints News Published 2026-05-12 industry news DOI:
AI drug discovery Isomorphic Labs biotech financing AlphaFold
Summary: Isomorphic Labs, the Alphabet subsidiary applying AlphaFold-derived AI to drug discovery, raises $2.1 billion in one of the largest AI-biotech financings ever. The raise signals enormous investor conviction in AI-native drug discovery platforms, but key questions remain about pipeline maturity, clinical validation timelines, and the AlphaFold-to-drug translation gap.
Why it matters: A $2.1B raise for an AI drug discovery company is a landmark event that will shape investment flows and talent migration across the AI-biotech ecosystem. Isomorphic's progress is a bellwether for whether computational protein structure prediction translates to therapeutic impact.
Why for Yiru: AI-driven drug discovery from structure prediction to therapeutic candidates represents the frontier where Boss's computational biology expertise meets translational impact.
Bristol Myers joins Hengrui party in 13-asset deal worth up to $15.2B
Endpoints News Published 2026-05-12 industry news DOI:
drug licensing Bristol Myers Squibb Hengrui pharma deal China biotech
Summary: Bristol Myers Squibb licenses 13 preclinical and clinical assets from China's Hengrui Medicine in a deal valued at up to $15.2 billion, continuing the trend of Western pharma companies sourcing innovation from Chinese biotech pipelines across oncology, immunology, and metabolic disease.
Why it matters: The scale of this deal underscores the growing role of Chinese biotech in global pharmaceutical pipelines. The flow of assets from East to West is reshaping drug development economics and competitive dynamics.
Why for Yiru: The globalization of drug development and the increasing prominence of Chinese biotech innovation has implications for the research ecosystem Boss operates within, including collaboration opportunities and competitive awareness.
Flagship startup Serif says it has solved non-viral gene therapy in monkeys
Endpoints News Published 2026-05-12 industry news DOI:
gene therapy non-viral delivery Flagship Pioneering Serif Therapeutics
Summary: Serif Therapeutics, a Flagship Pioneering startup, claims to have achieved durable therapeutic gene expression in non-human primates using a fully non-viral delivery platform — a milestone that has eluded the field for decades. If validated, non-viral gene therapy could dramatically reduce cost and immune complications compared to AAV and LNP approaches.
Why it matters: Non-viral gene therapy that works in primates would be a paradigm shift, removing the manufacturing complexity, immunogenicity, and size constraints of viral vectors. This could democratize genetic medicine.
Why for Yiru: Gene therapy delivery technologies have implications for engineering immune cells, creating disease models, and developing next-generation cell therapies for cancer — all areas adjacent to Boss's research.
STAT+: Capsida says it still doesn't know what caused gene therapy death
STAT News Published 2026-05-12 industry news DOI:
gene therapy safety Capsida clinical trial AAV
Summary: Capsida Biotherapeutics discloses that the cause of a patient death in its gene therapy clinical trial remains unknown months after the event, raising concerns about AAV safety and the adequacy of preclinical models for predicting rare toxicities. The company continues to investigate while the clinical hold remains in place.
Why it matters: Gene therapy safety remains the field's most critical challenge. A death with no identified mechanism undermines confidence in preclinical safety assessment and highlights the need for better predictive models.
Why for Yiru: Understanding gene therapy toxicities through better preclinical models is a systems biology challenge where computational approaches — including single-cell and spatial methods — could contribute to mechanistic understanding.
Cirena Licenses Long RNA Purification Tech From Agilent
GenomeWeb Published 2026-05-09 industry news DOI:
RNA purification long-read sequencing Cirena Agilent technology licensing
Summary: Cirena licenses Agilent's long RNA purification technology to improve sample preparation for long-read RNA sequencing, aiming to enable full-length transcript analysis at higher throughput and quality for clinical and research applications.
Why it matters: Long-read RNA sequencing requires high-quality full-length RNA. Improved purification technology addresses a critical bottleneck in the long-read workflow, which is essential for detecting isoform diversity, fusion transcripts, and RNA modifications.
Why for Yiru: Long-read sequencing technologies complement spatial and single-cell transcriptomics by resolving isoform-level biology that short-read methods miss — relevant to building more complete molecular atlases of tissues and tumors.