Research Radar — 2026-06-04
Methods & AI
Computational
MetaSTAARlite: an all-in-one tool for biobank-scale whole-genome sequencing meta-analysis
Nature Computational Science Published 2026-06-03 research article DOI: 10.1038/s43588-026-00995-x
whole-genome sequencing meta-analysis rare variant biobank GWAS statistical genetics scalable bioinformatics
Summary: Introduces MetaSTAARlite, a summary-statistics-based pipeline that enables powerful, functionally informed rare-variant meta-analysis across biobank-scale whole-genome sequencing (WGS) cohorts. As WGS becomes the standard for large-scale population and disease studies — with biobanks such as UK Biobank, All of Us, and FinnGen generating WGS data for hundreds of thousands of participants — the computational challenge shifts from single-cohort analysis to meta-analysis across multiple cohorts. Individual-level data sharing is often restricted by privacy regulations and data use agreements, making summary-statistics-based meta-analysis the only practical approach. However, existing rare-variant meta-analysis tools are either not scalable to WGS-scale data (hundreds of millions of variants) or lack the functional annotation integration that gives methods like STAAR their power. MetaSTAARlite addresses both gaps: it operates on per-variant summary statistics (score statistics and covariance matrices) that can be shared without individual-level data, incorporates functional annotation weights across multiple annotation categories (epigenetic, evolutionary conservation, protein structure), and scales linearly with sample size through efficient matrix operations. The method supports both single-variant and aggregate rare-variant tests, gene-centric and sliding-window analyses, and conditional analysis to identify independent signals. Applied to biobank-scale WGS data, MetaSTAARlite substantially increases discovery power for rare-variant associations while maintaining well-calibrated type I error rates.
Why it matters: Rare variants (minor allele frequency < 1%) collectively explain a substantial portion of missing heritability for complex diseases, but individual rare variants have effect sizes too small to reach genome-wide significance without extremely large sample sizes. MetaSTAARlite provides the computational infrastructure to combine rare-variant evidence across all available WGS cohorts, potentially unlocking discoveries that no single cohort could achieve alone. The summary-statistics-based design also respects data privacy constraints, making it compatible with the federated analysis models increasingly mandated by biobanks and funding agencies. As WGS replaces genotyping arrays in population genomics, tools like MetaSTAARlite will become essential for maximizing the scientific return on these massive investments.
Why for Yiru: Rare-variant analysis is increasingly relevant to cancer genomics — germline rare variants may influence TME composition, immunotherapy response, and cancer susceptibility in ways that common variants cannot capture. MetaSTAARlite could be applied to combine rare-variant evidence across cancer cohorts from multiple biobanks, potentially identifying rare variants that affect immune-related phenotypes (cytokine levels, immune cell counts, checkpoint expression). The functional annotation weighting framework is also conceptually transferable — one could develop TME-specific annotation weights that prioritize variants affecting genes expressed in relevant immune or stromal cell types. More broadly, the computational approach to federated rare-variant analysis exemplifies the type of privacy-preserving, scalable methods needed for multi-center cancer genomics consortia.
Cell-type-resolved genetic variation shapes inflammatory bowel disease risk
Nature Published 2026-06-03 research article DOI: 10.1038/s41586-026-10627-z
eQTL single-cell inflammatory bowel disease GWAS genetic regulation enhancer cell-type-specific functional genomics
Summary: Presents the largest single-cell cis-expression quantitative trait locus (eQTL) study to date, mapping genetic regulation of gene expression across 2.2 million single cells from intestinal biopsies and peripheral blood of 421 individuals, including 125 with inflammatory bowel disease (IBD). Most genetic variants associated with complex diseases lie in non-coding regions, making it difficult to identify which genes they regulate and in which cell types. Traditional eQTL studies use bulk tissue RNA-seq, which averages gene expression across all cell types and misses cell-type-specific regulatory effects. This study addresses this limitation by performing single-cell eQTL mapping in intestinal tissue — directly in the disease-relevant organ — across multiple intestinal cell types including epithelial cells, stromal cells, and diverse immune populations. Three key findings emerge: first, cell-type-level eQTLs are systematically more distal to transcription start sites and enriched in enhancers compared to tissue-level eQTLs, consistent with enhancers being the primary mediators of cell-type-specific gene regulation. Second, enhancer-enriched, cell-type-resolved eQTLs co-localize with IBD GWAS loci at substantially higher rates than tissue-level eQTLs — meaning that cell-type resolution directly improves the ability to connect disease-associated variants to effector genes and cell types. Third, many IBD-associated variants regulate different genes in different cell types, highlighting the inadequacy of the "one variant, one gene, one cell type" model. The authors provide an interactive resource enabling querying of eQTL results across cell types and genetic variants.
Why it matters: This study represents the state of the art in connecting non-coding genetic variation to disease mechanisms at cellular resolution. The finding that cell-type-level eQTLs dramatically improve GWAS co-localization — and that enhancers are the key regulatory elements driving this improvement — has immediate implications for how post-GWAS functional studies should be designed. The observation that single variants can regulate different genes in different cell types challenges the prevailing paradigm of variant-to-gene (V2G) mapping and suggests that the regulatory consequences of disease-associated variants are more complex and context-dependent than appreciated. For IBD specifically, identifying the specific intestinal cell types through which genetic risk operates could guide cell-type-targeted therapeutic development.
Why for Yiru: The single-cell eQTL framework developed here is directly applicable to the TME. One could perform single-cell eQTL mapping in tumour and adjacent normal tissue to identify genetic variants that regulate gene expression specifically in TME cell types — tumour cells, T cells, macrophages, fibroblasts. These TME-specific eQTLs could reveal why certain germline variants influence cancer risk or immunotherapy response — for example, a variant that alters chemokine expression specifically in tumour-associated macrophages. The computational methods for cell-type-resolved eQTL mapping and GWAS co-localization provide a template for similar analyses in cancer. The finding that enhancers are the primary drivers of cell-type-specific regulation also connects to Yiru interest in the epigenetic basis of TME heterogeneity.
PepAnno: A structure-aware deep learning framework for bioactive peptide prediction, structural visualization, and physicochemical profiling
PLOS Computational Biology Published 2026-06-02 research article DOI:
peptide deep learning structure prediction drug discovery antimicrobial anticancer geometric deep learning web server
Summary: Presents PepAnno, a comprehensive web server for multi-functional peptide annotation powered by a structure-aware, multi-view geometric deep learning framework. Peptides are gaining prominence as therapeutic candidates — they occupy a unique niche between small molecules and biologics, combining the target specificity of antibodies with the tissue penetration of small molecules. However, computational tools for predicting peptide bioactivities have lagged behind those for small molecules and proteins, with many existing tools relying on sequence-only representations that ignore 3D structural information. PepAnno addresses this gap with a dual-stream architecture: a Transformer branch processes pre-trained sequence embeddings to capture evolutionary and biochemical information, while a GATv2 (graph attention network) branch processes predicted 3D structural graphs to capture geometric features such as secondary structure elements, surface properties, and folding patterns. A cross-modal attention mechanism fuses these two representations, enabling the model to learn which features — sequence-based or structure-based — are most predictive for each bioactivity task. The model supports simultaneous prediction across seven key bioactivities (antimicrobial, anticancer, antiviral, antifungal, antibiofilm, hemolytic, and cell-penetrating), consistently matching or outperforming task-specific methods. The web server also provides automated physicochemical profiling, 3D structure visualization, and access to integrated peptide databases.
Why it matters: Therapeutic peptides represent a rapidly growing drug class — over 80 peptide drugs are currently approved, with hundreds in clinical development for indications ranging from diabetes to cancer. Computational tools that can rapidly screen peptide libraries for desired bioactivities while avoiding undesirable properties (toxicity, hemolysis) could dramatically accelerate peptide drug discovery. The structure-aware design of PepAnno is particularly important because peptide function depends critically on 3D conformation — an alpha-helical antimicrobial peptide and a disordered peptide with the same sequence composition can have completely different activities. The web server format also democratizes access, making sophisticated deep learning tools available to experimental researchers without computational expertise.
Why for Yiru: Peptide-based therapeutics are an emerging modality for TME-targeted therapy. Therapeutic peptides could be designed to block immunosuppressive checkpoint interactions (PD-L1/PD-1), disrupt chemokine-receptor axes that recruit immunosuppressive cells, or deliver cytotoxic payloads selectively to tumour cells expressing specific surface markers. PepAnno could be used to screen candidate peptide libraries for TME-relevant properties — for example, identifying peptides predicted to penetrate tumour cells while avoiding hemolysis. More broadly, the dual-stream architecture combining sequence and structure representations provides a template for designing multimodal deep learning models for other biological molecules relevant to the TME, such as cytokines and chemokines.
A prototype-augmented graph representation learning framework for identifying brain disorder-associated genes and facilitating drug repurposing
PLOS Computational Biology Published 2026-05-29 research article DOI:
graph neural network drug repurposing brain disorder Parkinson disease multi-omics GWAS deep learning
Summary: Introduces MOGT (Multi-omics Graph Transformer Network), a semi-supervised graph neural network that models biological networks derived from multi-omics data to predict disease-associated genes and identify drug repurposing candidates. Genome-wide association studies have identified thousands of genetic loci associated with neuropsychiatric and neurodegenerative disorders, but translating these statistical associations into mechanistic understanding and therapeutic opportunities remains a major challenge. MOGT addresses this by integrating multiple types of biological networks — protein-protein interactions, gene co-expression, and functional annotation similarities — into a heterogeneous graph, then using a graph transformer architecture with prototype-based attention to learn which network neighborhoods are most predictive of disease gene status. The prototype augmentation is the key innovation: rather than learning gene representations solely from local network neighborhoods, MOGT learns a set of prototype vectors that represent archetypal "disease gene network patterns," and each gene representation is augmented by its similarity to these prototypes. Applied to Parkinson disease (PD), MOGT outperforms existing methods in disease gene prediction and identifies high-risk genes that are then used to query the Connectivity Map (CMAP) database for drugs that reverse the disease-associated gene expression signature. Among 10 predicted candidates, the drug UK-356618 was experimentally validated in a primary neuron model, reversing PD-associated gene expression abnormalities and improving cellular phenotypes.
Why it matters: The prototype-augmented graph learning approach is methodologically innovative — it addresses a fundamental limitation of graph neural networks, which tend to learn myopic representations biased toward local network structure. By augmenting each gene representation with global prototype information, MOGT learns richer representations that capture higher-order patterns in biological networks. The experimental validation of a predicted drug — going from GWAS hits through computational prediction to wet-lab validation — demonstrates a complete translational pipeline for drug repurposing that could be applied to many other diseases. The framework is general and could incorporate additional data modalities and network types.
Why for Yiru: The drug repurposing pipeline demonstrated here — disease gene prediction from multi-omics networks followed by CMAP-based drug identification — is directly applicable to TME research. One could apply MOGT to identify genes associated with immunotherapy resistance using GWAS or transcriptome-wide association study data from cancer cohorts, then query CMAP for drugs that reverse the resistance-associated gene expression signature, identifying candidates for combination immunotherapy. The prototype-augmented graph learning approach could also be adapted to TME-specific networks — for example, a cell-cell communication graph where prototypes might represent archetypal "immune-hot" or "immune-cold" TME architectures. The successful experimental validation provides confidence in computational drug repurposing pipelines for TME applications.
A comparative study of simulation-based inference methods for epidemic models with identifiability considerations
PLOS Computational Biology Published 2026-06-02 research article DOI:
simulation-based inference approximate Bayesian computation neural posterior estimation epidemic model benchmarking identifiability computational biology
Summary: Presents a systematic comparison of four simulation-based inference (SBI) methods — Approximate Bayesian Computation (ABC), Neural Posterior Estimation (NPE), NPE with temporal embeddings, and Preconditioned Neural Posterior Estimation (PNPE) — applied to epidemic models of increasing complexity with explicit attention to structural and practical identifiability. Epidemic models are essential for understanding disease transmission, forecasting outbreak trajectories, and evaluating intervention strategies, but calibrating these models to data is challenging because their likelihood functions are often analytically intractable. SBI methods circumvent the likelihood by learning the relationship between model parameters and simulated data, enabling Bayesian inference for models where traditional methods fail. The authors evaluate these four methods across epidemic models ranging from simple SIR (susceptible-infected-recovered) to complex age-structured and spatially explicit models, under varying observational noise and fixed simulation budgets. Key findings include: neural methods (NPE, PNPE) generally produce more faithful posterior distributions than ABC under constrained simulation budgets; PNPE achieves strong performance but requires substantially more computational resources and may need reconditioning for each new observation; temporal embeddings improve inference for models with complex epidemic dynamics by capturing sequential dependencies; and ABC remains the most computationally efficient option, though it produces more conservative (wider) posterior estimates. The study also highlights the critical importance of identifiability assessment — many epidemic model parameters cannot be uniquely estimated from available data, regardless of the inference method used.
Why it matters: Simulation-based inference is rapidly gaining adoption across computational biology — from ecology and epidemiology to systems biology and single-cell genomics — because it enables statistically rigorous parameter inference for models where the likelihood is intractable. However, practitioners face a bewildering choice of methods with limited guidance on which to use. This systematic benchmark provides that guidance, with actionable recommendations based on model complexity, simulation budget, and available computational resources. The explicit treatment of identifiability is particularly valuable — it serves as a reminder that no inference method can extract information that is not present in the data, and that identifiability analysis should be a mandatory step in any SBI workflow.
Why for Yiru: Simulation-based inference methods are directly applicable to mechanistic models of the TME. Mathematical models of tumour-immune interactions — including ODE-based models of cytokine signaling, agent-based models of cell migration and interaction, and stochastic models of immune repertoire dynamics — typically have intractable likelihoods but can be simulated forward. SBI could be used to calibrate these models to spatial transcriptomics, multiplexed imaging, or longitudinal tumour growth data, enabling inference of parameters such as immune cell infiltration rates, tumour cell killing rates, and cytokine diffusion coefficients. The benchmarking results directly inform which SBI method to choose for TME models of varying complexity. The identifiability analysis framework is also essential for TME models, which often have many parameters relative to available data — identifying which parameters can and cannot be estimated from a given experimental design is critical for model-based experimental planning.
Biomedical discoveries
Biomedicine
Single-cell spatial pharmacobiology identifies conserved stromal barriers to therapeutic antibody delivery in human solid tumors
Nature Biotechnology Published 2026-06-03 research article DOI: 10.1038/s41587-026-03152-x
spatial proteomics antibody delivery tumour microenvironment stromal barrier pharmacobiology single-cell cancer imaging
Summary: Introduces single-cell spatial pharmacobiology (SSP), an integrated experimental and analytical framework that combines in situ imaging of systemically infused fluorescently labeled therapeutic antibodies with high-plex spatial proteomics (CODEX, ~50 protein markers) to quantify antibody distribution, target engagement, and tumour microenvironment (TME) responses at single-cell resolution. A fundamental challenge in antibody-based cancer therapy — including checkpoint inhibitors, antibody-drug conjugates, and bispecific antibodies — is that antibody distribution within solid tumours is highly heterogeneous, and the determinants of this heterogeneity have been difficult to study due to a lack of methods that simultaneously measure drug localization and the cellular context. SSP addresses this by administering a fluorescently labeled therapeutic antibody to patients or animal models, collecting tumour tissue, and performing high-plex spatial proteomics that captures both the drug signal and ~50 protein markers of cell identity, signalling state, and extracellular matrix (ECM) composition — all in the same tissue section at single-cell resolution. Applied across multiple human solid tumour types (head and neck squamous cell carcinoma, lung adenocarcinoma, pancreatic ductal adenocarcinoma) and a therapeutic antibody (cetuximab, anti-EGFR), SSP reveals that antibody penetration is limited by conserved stromal features: dense collagen-rich ECM zones, perivascular fibroblast accumulation, and regions of high interstitial fluid pressure create physical barriers that antibodies cannot cross, regardless of tumour type. Critically, these barriers are spatially organized — they are not random but form structured zones around blood vessels — and their composition and extent predict antibody delivery efficiency better than tumour type or EGFR expression level.
Why it matters: This study directly addresses one of the most important unsolved problems in antibody-based cancer therapy: why do antibodies that bind their target with high affinity in vitro often fail to reach all tumour cells in vivo? The identification of conserved stromal barriers across multiple tumour types suggests that these barriers are a general feature of solid tumour biology — not idiosyncratic to specific tumour types or antibody targets — and therefore represent a universal therapeutic challenge. The SSP framework itself is a methodological advance that can be applied to any therapeutic antibody in any tumour type, enabling rational optimization of antibody design (e.g., engineering smaller formats with better penetration), dosing regimens, and combination therapies (e.g., co-administering ECM-degrading enzymes to improve penetration). The finding that barrier organization — not just composition — matters highlights the importance of spatial analysis in drug delivery research.
Why for Yiru: This study is directly and profoundly relevant to TME computational biology. The SSP framework generates spatially resolved, multi-parameter data on both drug distribution and TME composition — exactly the type of data that computational models can exploit to predict optimal dosing, identify which stromal features to target, and simulate the effects of barrier-disrupting combination therapies. The identification of quantitative spatial features that predict antibody delivery — ECM density, fibroblast proximity to vessels, interstitial pressure gradients — provides directly measurable inputs for computational models of drug transport in the TME. More broadly, SSP data could be integrated with spatial transcriptomics to build multi-modal models that predict drug delivery from more readily available transcriptomic data. The finding that stromal barriers are conserved across tumour types also suggests that computational signatures of these barriers could be developed and applied across cancer cohorts to stratify patients for antibody-based therapies.
Spermine is an endogenous iron chelator that inhibits ferroptosis
Nature Published 2026-06-03 research article DOI: 10.1038/s41586-026-10597-2
ferroptosis spermine polyamine iron chelation hepatocellular carcinoma metabolism cell death ALDH18A1
Summary: Identifies spermine — a polyamine metabolite best known for its roles in cell proliferation and nucleic acid stabilization — as an endogenous iron chelator that inhibits ferroptosis, and reveals a non-canonical spermine synthesis pathway mediated by ALDH18A1 that is co-opted by hepatocellular carcinoma (HCC) cells to evade this form of cell death. Ferroptosis is an iron-dependent form of regulated cell death driven by the peroxidation of polyunsaturated fatty acid-containing phospholipids in cellular membranes. It has emerged as a central mechanism in cancer biology — many cancer cells are primed for ferroptosis due to their high iron content and metabolic activity, and inducing ferroptosis is a promising therapeutic strategy, particularly for therapy-resistant cancers. The polyamine metabolic pathway has been extensively studied in cancer because polyamines (putrescine, spermidine, spermine) are elevated in proliferating cells and tumours, but their connection to ferroptosis was unknown. The authors discover that spermine directly chelates intracellular iron, reducing the labile iron pool that catalyzes lipid peroxidation, thereby acting as a natural ferroptosis inhibitor. This iron-chelating activity is independent of spermine other known functions and is mediated by its polycationic structure, which coordinates iron ions. In HCC, the enzyme ALDH18A1 — canonically known as a mitochondrial enzyme in proline synthesis — is found to catalyze a non-canonical reaction that produces spermine, and ALDH18A1 is upregulated in HCC as a mechanism of ferroptosis resistance. Genetic or pharmacological disruption of ALDH18A1-mediated spermine synthesis sensitizes HCC cells to ferroptosis and suppresses tumour growth in vivo.
Why it matters: This study establishes an entirely new axis connecting polyamine metabolism to ferroptosis regulation, with immediate therapeutic implications. Polyamine metabolism is one of the oldest and most studied metabolic pathways in cancer biology — elevated polyamines were noted in cancer cells decades ago — yet the connection to ferroptosis was completely unsuspected. This discovery recontextualizes decades of polyamine research: the oncogenic role of polyamines may be partly mediated through ferroptosis suppression rather than solely through proliferation support. The identification of ALDH18A1 as a non-canonical spermine synthase provides a druggable target — inhibiting ALDH18A1 would deplete spermine, release the iron chelation brake, and sensitize cancer cells to ferroptosis. Given that HCC has very limited therapeutic options and is frequently resistant to existing therapies, the ALDH18A1-spermine-ferroptosis axis represents a promising new vulnerability. More broadly, the concept of endogenous metabolites acting as ferroptosis regulators suggests that cellular metabolism and cell death are more intimately connected than appreciated.
Why for Yiru: Ferroptosis is increasingly recognized as an important mechanism in the TME — ferroptotic cancer cells release damage-associated molecular patterns (DAMPs) that can stimulate anti-tumour immunity, and certain immune cells (CD8+ T cells) induce ferroptosis in tumour cells through interferon-gamma signalling. The spermine-ferroptosis connection could be computationally explored in TME single-cell and spatial data — do tumours with high ALDH18A1 or spermine levels show reduced ferroptosis signatures and altered immune infiltration? Polyamine metabolism in the TME could be profiled to identify which cell types produce spermine and whether paracrine spermine signalling protects neighbouring tumour cells from immune-induced ferroptosis. More broadly, the computational identification of metabolic vulnerabilities in the TME — metabolites or pathways that specifically protect tumour cells but not immune cells from ferroptosis — could reveal therapeutic targets with favourable therapeutic windows.
Distinct transcription factors control tissue adaptation and effector function in infant and adult memory T cells
Nature Immunology Published 2026-06-02 research article DOI: 10.1038/s41590-026-02535-1
memory T cell infant immunity tissue adaptation transcription factor single-cell stem-like mucosal immunity development
Summary: Analyzes T cells from lymphoid and mucosal tissues of infant and adult human organ donors using single-cell transcriptomics and epigenomics to reveal that infant memory T cells have a unique stem-like transcriptional profile and tissue adaptation program distinct from their adult counterparts, governed by different transcription factor networks. The infant immune system is classically described as "immature" or "naive-biased" — more tolerant, less inflammatory, and biased toward TH2 responses — but the molecular basis of these differences has been poorly understood, particularly in tissue-resident memory T cells (TRM) that reside in mucosal barriers and provide frontline immune protection. The authors profiled T cells from multiple tissue sites (lung, jejunum, mesenteric lymph nodes, spleen) and blood from infant (0-2 years) and adult organ donors. Infant memory T cells across all tissues express a stem-like gene program characterized by TCF7, LEF1, and other WNT-pathway transcription factors typically associated with naive and stem cell memory T cells, while adult memory T cells express canonical effector and tissue-residency programs driven by distinct transcription factors including PRDM1 (BLIMP1), ZNF683 (HOBIT), and RUNX3. This is not simply a failure of maturation — infant memory cells are functional and clonally expanded — but rather reflects an alternative differentiation state optimized for the infant immunological context: rapid tissue adaptation and barrier protection without the inflammatory effector programs that could damage developing tissues. The distinct transcription factor dependencies are confirmed by chromatin accessibility analysis showing that infant and adult memory T cells have different regulatory landscapes at key effector loci.
Why it matters: This study fundamentally reframes how we understand infant immunity. Rather than being an immature version of adult immunity, the infant immune system appears to use a distinct transcriptional program for memory T cell differentiation — a program that prioritizes tissue adaptation and stemness over inflammatory effector function. This has immediate implications for vaccine design: infant vaccines may need to engage the specific transcription factor networks that drive infant memory T cell differentiation, which are different from those targeted by vaccines optimized for adults. The finding also has broader implications for T cell biology — it demonstrates that the canonical memory T cell differentiation hierarchy (naive → effector → memory) is not universal but can be tuned by developmental context. The concept of stem-like memory T cells in infants may also be relevant to the stem-like TCF7+ T cell populations that have recently been identified as critical for immunotherapy response in cancer.
Why for Yiru: The identification of distinct transcription factor programs for T cell tissue adaptation vs. effector function is directly relevant to understanding T cell states in the TME. The stem-like TCF7+ T cell program observed in infant memory cells is analogous to the TCF7+ progenitor exhausted T cell population that sustains anti-tumour immunity during checkpoint blockade — both represent a tissue-adapted, stem-like state distinct from terminal effector differentiation. Computationally, one could compare the transcription factor networks of infant memory T cells, adult memory T cells, and TME T cell states to identify shared and distinct regulatory programs. The finding that tissue adaptation and effector function are controlled by different transcription factors also suggests that these two axes could be independently manipulated therapeutically — for example, boosting T cell tissue residency without triggering exhaustion — which is relevant to optimizing adoptive T cell therapy for solid tumours.
Commensal-derived acetylcholine enhances mucosal immune education
Nature Published 2026-06-03 research article DOI: 10.1038/s41586-026-10592-7
microbiome acetylcholine mucosal immunity commensal diet-microbiome host-microbiota immune education metabolite
Summary: Reveals a diet-microbiome-host axis in which commensal gut bacteria produce the neurotransmitter acetylcholine (ACh) from dietary choline, and this bacteria-derived ACh acts on host immune cells in the intestinal mucosa to strengthen immune defences and reinforce host-microbiota mutualism. The gut microbiome is known to produce a vast array of metabolites that influence host physiology — short-chain fatty acids, bile acids, tryptophan metabolites — but the production of neurotransmitters by commensal bacteria and their direct effects on mucosal immunity have been less explored. The authors demonstrate that specific commensal bacterial species express choline acetyltransferase (ChAT) homologs and produce ACh when provided with dietary choline as a substrate. This bacteria-derived ACh is detected in the intestinal lumen and mucosa at physiologically relevant concentrations. Using germ-free and gnotobiotic mouse models colonized with ACh-producing or ACh-deficient bacterial strains, they show that commensal-derived ACh acts through muscarinic acetylcholine receptors on intestinal immune cells — particularly innate lymphoid cells (ILCs) and macrophages — to enhance mucosal immune education, including increased production of antimicrobial peptides, enhanced barrier integrity, and promotion of regulatory immune responses that maintain tolerance to commensals while enabling responses to pathogens. The diet-microbiome-host connection is completed by showing that dietary choline intake determines the level of bacterial ACh production, directly linking nutrition to mucosal immune function through the microbiome.
Why it matters: This study identifies a new class of microbiome-derived signaling molecules — neurotransmitters — that bridge diet, the microbiome, and host immunity. The concept that bacteria produce ACh is not entirely new (Lactobacillus species were previously shown to produce ACh), but the systematic demonstration that this is a widespread commensal function with direct immunological consequences — and that it is diet-dependent — is a significant advance. The diet-microbiome-immune axis has immediate implications for understanding how nutrition influences immune function and disease susceptibility, and for designing microbiome-targeted interventions (probiotics, prebiotics, dietary supplements) to enhance mucosal immunity. The neurotransmitter aspect is particularly intriguing because it suggests that the microbiome may influence not only local mucosal immunity but also systemic processes through neuro-immune circuits.
Why for Yiru: The gut microbiome is increasingly recognized as a modulator of cancer immunotherapy response — specific bacterial species have been associated with improved responses to checkpoint inhibitors in multiple cancer types. The identification of ACh as a microbiome-derived immunomodulatory molecule raises the possibility that bacterial ACh production could influence anti-tumour immunity systemically or in gut-associated tumours. Computationally, one could mine metagenomic and metabolomic data from cancer cohorts to test whether the abundance of ACh-producing bacterial species or ACh biosynthetic genes correlates with immunotherapy outcomes. The diet-microbiome connection also suggests that dietary interventions (choline supplementation or restriction) could be explored as adjuncts to immunotherapy — a hypothesis that could be tested using existing clinical trial data with dietary information. More broadly, the concept of microbiome-derived neurotransmitters influencing immune function opens new avenues for computational integration of microbiome, metabolomic, and immune profiling data in cancer.
Cross-disciplinary watchlist
Other Fields
Programmable control of bacterial operons with a single Cas13 RNA effector
Nature Biotechnology Published 2026-06-03 research article DOI: 10.1038/s41587-026-03159-4
CRISPR Cas13 RNA targeting synthetic biology gene regulation operon bacteria genetic circuit
Summary: Introduces SONAR (Synthetic Operon control by Nucleic Acid Recognition), an RNA-level gene regulation system for bacteria that uses a single engineered Cas13 RNA-targeting effector to achieve multiple regulatory modes — transcript degradation, translation inhibition, and translation activation — through simple changes in CRISPR RNA (crRNA) design, without modifying the Cas13 protein itself. CRISPR-Cas13 enzymes target RNA rather than DNA, providing reversible gene regulation without permanent genome modification, but wild-type Cas13 has limited regulatory versatility: it simply cleaves target transcripts, which turns genes off. SONAR expands this functionality dramatically using a single engineered Cas13 variant. By modifying the crRNA design — specifically, by altering the guide sequence, adding RNA aptamers, or changing the secondary structure of the crRNA — the same Cas13 protein can be directed to degrade the target mRNA (knockdown), block ribosome access to inhibit translation without degrading the mRNA, or recruit translational activators to enhance protein production from the target mRNA. This represents a major advance in versatility: a single protein component, combined with programmable crRNAs, replaces what previously required multiple different protein-based regulators. Applied to bacterial operons — clusters of genes transcribed as a single mRNA — SONAR can independently control each gene within the operon by targeting different regions of the polycistronic mRNA, enabling sophisticated multi-gene expression programs. The system is demonstrated in metabolic pathway optimization, where precise tuning of individual enzyme levels within a biosynthetic operon maximizes product yield.
Why it matters: The ability to achieve multiple regulatory modes with a single protein is a significant advance for synthetic biology. Current bacterial gene regulation tools typically require different proteins for different functions (CRISPRi for repression, CRISPRa for activation, separate inducible promoters, etc.), creating a burden on the host cell and limiting the number of independently controllable genes. SONAR collapses these functions into a single protein component, reducing the genetic payload and metabolic burden while expanding regulatory capabilities. The programmable nature means that complex gene expression programs can be implemented simply by designing new crRNAs — no protein engineering required. This has immediate applications in metabolic engineering (optimizing biosynthetic pathways), living therapeutics (programming therapeutic bacteria with sophisticated behaviors), and fundamental studies of bacterial gene regulation.
Why for Yiru: Engineered bacteria are an emerging modality for cancer therapy — certain bacterial strains (Salmonella, E. coli, Listeria) preferentially colonize tumours and can be engineered to deliver therapeutic payloads. SONAR could be used to program tumour-homing bacteria with sophisticated behaviours: expressing cytotoxic payloads only upon sensing TME-specific signals (hypoxia, low pH, specific metabolites), turning off payload expression upon dissemination to non-tumour tissues, and coordinating the expression of multiple immunomodulatory factors from a single operon. The ability to independently regulate multiple genes within an operon using the same Cas13 protein is particularly valuable for TME applications where coordinated delivery of multiple signals — a chemokine to recruit immune cells, a checkpoint inhibitor to activate them, and an enzyme to degrade the extracellular matrix — may be needed for effective TME remodeling.
Selection of human hematopoietic stem cells bearing the intended functional edit by transient AND-gate reporters
Nature Biotechnology Published 2026-06-01 research article DOI: 10.1038/s41587-026-03142-z
hematopoietic stem cell gene editing homology-directed repair AND gate selection cell therapy CRISPR genome engineering
Summary: Introduces SMArT (Selection by Means of Artificial Transactivators), a transient selection system that implements AND-gate reporter logic to identify and enrich hematopoietic stem and progenitor cells (HSPCs) that have undergone successful homology-directed repair (HDR)-mediated gene editing, substantially increasing the purity and yield of correctly edited cells for therapeutic applications. Gene therapy using CRISPR-Cas9-mediated HDR to correct disease-causing mutations in HSPCs holds enormous promise for treating genetic blood disorders (sickle cell disease, beta-thalassemia, primary immunodeficiencies), but the clinical translation is limited by two persistent challenges: low HDR efficiency (typically 1-20% of treated cells) and the presence of unintended on-target editing outcomes (non-homologous end joining, large deletions, vector integrations) in cells that undergo editing but not correct repair. SMArT addresses both challenges simultaneously through an elegant AND-gate strategy. Two transient reporter constructs are co-delivered with the editing machinery: one reporter is activated by Cas9-mediated DNA cleavage (indicating that editing occurred), and the second reporter is activated only by successful HDR-mediated integration of the therapeutic cassette (indicating correct repair). Only cells where BOTH reporters are active — the AND gate — are selected, ensuring that the enriched population contains cells that underwent editing AND received the correct repair, while excluding unedited cells and cells with incorrect repair outcomes. The reporters are transient (non-integrating), so no genetic trace is left in the final cell product. SMArT achieves substantial enrichment of correctly edited HSPCs, increasing HDR-edited cell purity to levels suitable for clinical transplantation while reducing the burden of undetected off-target effects.
Why it matters: The gap between editing efficiency and clinical-grade cell purity is one of the most important barriers to the widespread adoption of HDR-based gene therapy. Current approaches rely on either accepting low editing rates (and hoping that the edited cells engraft and expand in vivo) or using drug-selectable markers that permanently modify the genome, which is undesirable for clinical applications. SMArT transient AND-gate approach elegantly solves this problem without leaving a genetic scar. The concept is generalizable — the AND-gate logic could be adapted to select for other desirable editing outcomes (e.g., biallelic correction, specific integration sites) and other cell types beyond HSPCs. The demonstration in HSPCs is particularly important because these are the target cells for most ex vivo gene therapy applications.
Why for Yiru: Gene editing of HSPCs and immune cells is increasingly relevant to cancer immunotherapy. Engineered HSPCs could be used to generate immune cells with enhanced anti-tumour activity — for example, macrophages engineered to resist TME immunosuppression or T cells engineered with synthetic receptors and enhanced metabolic fitness. SMArT could be applied to select for HSPCs carrying precise genetic modifications that enhance anti-tumour immune function after differentiation. More broadly, the AND-gate logic could be adapted to select for engineered T cells (CAR-T, TCR-T) that have undergone correct gene editing, improving the manufacturing of adoptive cell therapies. Computationally, one could model the selection dynamics — what AND-gate stringency maximizes the yield of correctly edited cells while minimizing the risk of clonal outgrowth from rare incorrectly edited cells? This connects to the broader challenge of ensuring safety in genetically engineered cell therapies.
Integrative analyses elucidate transcriptional regulatory functions of risk alleles for metabolic liver disease
Nature Genetics Published 2026-06-02 research article DOI: 10.1038/s41588-026-02617-8
GWAS metabolic liver disease MPRA CRISPR chromatin accessibility functional genomics variant interpretation regulatory element
Summary: Combines massively parallel reporter assays (MPRA) and CRISPR-based perturbation assays to systematically interrogate chromatin accessibility quantitative trait loci (caQTLs) associated with metabolic dysfunction-associated steatotic liver disease (MASLD), identifying likely causal variants and their effector genes. MASLD affects approximately 30% of the global population and is a leading cause of cirrhosis and hepatocellular carcinoma, yet the genetic basis of disease risk — and specifically which non-coding variants causally contribute to disease — remains poorly characterized. The study takes a systematic functional genomics approach: first, the authors identify caQTLs from liver chromatin accessibility data, prioritizing variants that affect regulatory element activity. Then, they use MPRA to test thousands of candidate variants in parallel, measuring the regulatory activity of reference and alternative alleles in a pooled format. Finally, they use CRISPR interference (CRISPRi) and CRISPR activation (CRISPRa) to validate the effects of top candidate variants on the expression of putative target genes in hepatocyte cell models. This pipeline identifies multiple causal variants at established MASLD loci (PNPLA3, TM6SF2, MBOAT7, HSD17B13) and reveals new regulatory variants at additional loci. Several identified variants act through mechanisms that would not have been predicted by genomic annotation alone — for example, creating or disrupting transcription factor binding sites that regulate genes not immediately adjacent to the variant — highlighting the value of functional validation over computational prediction.
Why it matters: This study exemplifies the state-of-the-art pipeline for moving from GWAS associations to causal variants and mechanisms — a pipeline combining MPRA for high-throughput screening with CRISPR-based validation for mechanistic confirmation. The finding that many causal variants regulate non-nearest genes and act through unannotated regulatory elements reinforces the limitations of purely computational variant-to-gene mapping approaches. For MASLD specifically, identifying causal variants and their effector genes provides validated targets for therapeutic development and biomarkers for risk stratification. The study also provides a rich dataset of functional annotations for liver regulatory variants that will be a resource for the MASLD and broader hepatology research communities.
Why for Yiru: The functional genomics pipeline demonstrated here — MPRA screening followed by CRISPR validation — is directly applicable to studying genetic variants that influence the TME. GWAS and TWAS studies have identified germline variants associated with cancer risk and immunotherapy response, but the causal variants and mechanisms are largely unknown. One could adapt this pipeline to test variants associated with TME phenotypes — for example, using MPRA to screen variants that affect chromatin accessibility in macrophages or T cells, then CRISPR-validating their effects on immune function. The liver-specific regulatory annotations generated in this study are also relevant to liver cancer (HCC), where TME composition is a strong determinant of outcome and where germline variants may influence TME characteristics. More broadly, the integration of MPRA and CRISPR data provides a template for how computational predictions of variant function should be experimentally validated.
Optically detected and radio wave-controlled spin chemistry in flavoproteins
Nature Biotechnology Published 2026-05-29 research article DOI: 10.1038/s41587-026-03158-5
spin chemistry flavoprotein optically detected radio wave quantum sensing magnetic field radical pair biophysics
Summary: Demonstrates that photogenerated spin-correlated radical pairs in certain flavoproteins — specifically cryptochromes and light-oxygen-voltage (LOV) proteins — can be manipulated by radio waves, enabling magnetic field sensing and spatial modulation of photoluminescence using radiofrequency pulses and magnetic field gradients, thereby establishing proteins as a previously unexplored platform for optically addressable spin-based quantum technologies. Optically addressable spin systems — materials in which the spin state of electrons can be initialized, manipulated, and read out using light — are the foundation of quantum sensing, quantum information processing, and emerging biomedical imaging technologies. The most widely used system, the nitrogen-vacancy (NV) center in diamond, requires a crystalline diamond matrix, limiting its application in biological contexts. The authors discover that the same spin physics — photoinduced radical pairs with coherent spin dynamics — can be realized in common flavoproteins, which are soluble, genetically encodable, and naturally function in aqueous environments at room temperature. Upon blue light illumination, the flavin cofactor in these proteins forms a spin-correlated radical pair with a nearby amino acid residue (tryptophan or tyrosine). This radical pair is magnetically sensitive: its spin dynamics depend on the local magnetic field, and this dependence manifests as changes in photoluminescence. Critically, the authors show that radio waves can resonantly manipulate the spin state of these radical pairs — the defining capability of an optically addressable spin system — enabling optically detected magnetic resonance (ODMR) spectroscopy in proteins. Using magnetic field gradients, they demonstrate spatial encoding, where different regions of a sample experience different magnetic fields and therefore different radio wave resonance conditions, enabling spatial mapping of the protein distribution.
Why it matters: This discovery opens an entirely new direction for quantum sensing in biology. Unlike diamond NV centers, which must be physically introduced into biological samples as nanoparticles, flavoproteins can be genetically encoded and expressed in specific cell types, organelles, or subcellular compartments. This means that magnetic field sensing, temperature sensing, and potentially quantum-enhanced imaging could be performed with genetically targeted, biocompatible protein sensors — a transformative capability for cell biology and neuroscience. The radio wave control demonstration is the critical milestone that elevates this from an interesting photophysical phenomenon to a practical technology platform. The spatial encoding capability suggests that spin-labeled proteins could be used as genetically encoded MRI contrast agents or for magnetogenetics — controlling cellular processes with magnetic fields via protein-based mediators.
Why for Yiru: Genetically encoded sensors are transformative tools for studying the TME, enabling measurement of pH, oxygen, metabolites, and signalling events in specific cell types within the complex tumour ecosystem. Protein-based spin sensors could add magnetic field and temperature sensing to this toolkit — for example, measuring local temperature variations in tumour tissue (which can reflect metabolic activity and inflammation), or detecting magnetic field perturbations from iron-accumulating cells. The magnetogenetics aspect is particularly intriguing: one could potentially engineer T cells or macrophages that are activated by external magnetic fields to deliver localized immunotherapy — a remote-controlled cell therapy. While this is a long-term vision, the demonstration that flavoproteins can be controlled by radio waves provides the biophysical foundation. Computationally, spatial mapping of protein-based spin sensors would generate data requiring specialized image analysis and signal processing methods.