Welcome to OpenDEL™ Community

A central hub to connect with global DEL professionals, access the latest industry insights and product updates, and collaborate to accelerate drug discovery.

DEL Hunter

  • DEL-Related Publications

    Photochemical Synthesis of DNA-Encoded 3H-Azepines via Skeletal Editing of Nitroarenes

    Jia-ying Xue,Jia-hui Shi,Yuan Yao,Wei-en Xie,Yong Zou,Ming Yan,Xue-jing Zhang Organic Letters DOI: 10.1021/acs.orglett.6c00234 Abstract We report a skeletal editing strategy based on DNA-encoded nitroarenes for the direct conversion of benzene cores into valuable 3H-azepine scaffolds. This transformation is efficiently promoted by visible light in the presence of P(Oi-Pr)3, which serves as a reductant to generate reactive nitrene intermediates from the nitro group. Demonstrating broad substrate scope with applicability to pharmaceutical molecules, this protocol offers an efficient and versatile route to DNA-encoded 3H-azepine derivatives. It thus establishes a robust platform for skeletal diversification in DNA-encoded library synthesis.  

  • DEL-Related Publications

    Discovery and Optimization of Small Molecule Inhibitors of the SLIT2/ROBO1 Protein-Protein Interaction Using DNA-Encoded Libraries

    Nelson Garcia-Vazquez, Shaoren Yuan, Moustafa Gabr bioRxiv - Pharmacology and Toxicology DOI: 10.64898/2026.02.21.707154 Abstract Protein-protein interactions (PPIs) mediated by extracellular ligands remain challenging targets for small molecule intervention due to their large and dynamic interfaces. The interaction between SLIT2 and its receptor ROBO1 plays a critical role in cell migration and tumor progression, yet remains largely unexplored. Here, we report the discovery and optimization of small molecule inhibitors of the SLIT2/ROBO1 interaction enabled by DNA-encoded library (DEL) screening. Affinity selection against SLIT2 identified four structurally diverse hit compounds, which were subsequently validated using orthogonal biophysical assays. Among these, one hit exhibited measurable SLIT2 binding and functional inhibition of the SLIT2/ROBO1 interaction in a time-resolved FRET assay. Guided by physicochemical considerations, a solubility-optimized analog was designed, resulting in a ~50-fold improvement in binding affinity and an ~9-fold enhancement in functional potency. Molecular dynamics simulations and induced-fit docking revealed a stable binding mode within the SLIT2 LRR2 domain and suggested that a benzothiophene substituent was dispensable for target engagement. Fragment-based experimental validation confirmed this prediction, leading to the identification of a minimal azaindole-based pharmacophore that retained nanomolar binding affinity. Collectively, this study demonstrates how DEL-enabled hit discovery combined with rational optimization and fragment deconstruction can yield potent small molecule modulators of a challenging extracellular PPI, providing a foundation for further development of SLIT2/ROBO1 pathway inhibitors.

  • DEL-Related Publications

    DNA-Compatible Synthesis of β-Ketoamides as Intermediates for On-DNA Chemical Diversification

    Xianfeng Li ,  Zehao Yin ,  Qiuyi Chen ,  Xinlong Hu ,  Gong Zhang ,  Xiaohong Fan ,  Yizhou Li Organic Letters  DOI: 10.1021/acs.orglett.6c00490 Abstract The β-ketoamide motif represents both a privileged scaffold and a versatile synthetic intermediate in medicinal chemistry. Herein, we developed a DNA-compatible method for the efficient conversion of various DNA-conjugated amines into β-ketoamides. The resulting β-ketoamides facilitate rapid diversification into a panel of structurally diverse molecular scaffolds. Importantly, the synthetic route and subsequent derivatization steps were validated to be fully compatible with DNA encoding, offering a reliable and versatile platform for DNA-encoded library synthesis.  

  • DEL-Related Publications

    Discovery and dynamic pharmacology of µ-opioid receptor positive allosteric modulators

    Evan S. O’Brien, Junzheng Wang, Parthasaradhireddy Tanguturi, Mengchu Li, Elizabeth White, Yuki Shiimura, Barnali Paul, Kevin Appourchaux, Kaavya Krishna Kumar, Weijiao Huang, Susruta Majumdar, John R. Traynor, John M. Streicher, Chunlai Chen, Brian K. Kobilka bioRxiv - Biophysics DOI: 10.64898/2026.02.20.707058 Abstract Opioid agonists such as morphine and fentanyl exert analgesic effects by binding and activating the µ-opioid receptor (µOR), yet agonism of the µOR causes a slate of serious side effects. µOR-mediated addiction and respiratory depression are the major causes of the current opioid overdose crisis, largely driven by the explosion in illicit use of fentanyl, a potent opioid receptor full agonist. Given these serious side effects (and high resulting societal cost), molecules that act as analgesics with distinct mechanisms of action are of great interest. Positive allosteric modulators (PAMs) of the µOR have the potential to avoid many off-target side effects of conventional opioid orthosteric agonists by enhancing the signaling properties of natural opioid peptide systems. We used a DNA-encoded chemical library screening approach to selectively discover active-state-specific µOR PAMs. Two out of 3 selected prospective PAMs displayed the anticipated enhancement in agonist activity. The most effective of these compounds enhanced the activity of all orthosteric opioid agonists tested, including the native opioid peptide met-enkephalin. Little is known about the underlying dynamic basis of allosteric modulation of Family A GPCRs like the µOR. To that end, we used single-molecule fluorescence resonance energy transfer experiments to detail the impact that our novel µOR PAM has on the dynamic activation behavior of a key region on the intracellular face of the receptor. Our results here provide both a new chemical scaffold that acts as a µOR PAM and detailed pharmacological and dynamic insights into its mechanism of action.

  • DEL-Related Publications

    Toward generalizable predictive models for DNA-encoded libraries

    Vasanthanathan Poongavanam ,  S. Pauliina Turunen ,  Kristian Sandberg ,  Ulrika Yngve ,  Johan Wannberg Drug Discovery Today  DOI: 10.1016/j.drudis.2026.104629 Abstract DNA-encoded libraries (DELs) combined with machine learning (ML) offer a powerful paradigm for hit identification. However, sequencing-derived enrichment data are inherently noisy and biased, often resulting in models that overfit to specific chemical libraries. In this review, we critically evaluate the capabilities and limitations of DEL-ML, illustrating key challenges using Aurora Kinase A (AURKA) DEL affinity selection data. We demonstrate that standard ML models often struggle to generalize to unseen chemical space because of the specific structural constraints of combinatorial libraries. Furthermore, we discuss the necessity of rigorous denoising strategies and evaluate approaches, such as domain adaptation, to mitigate these limitations, offering a roadmap for building robust models capable of exploring diverse chemical space. Summary This review critically examines the integration of machine learning (ML) with DNA-encoded library (DEL) technology for drug discovery. While DEL-ML offers a powerful paradigm for hit identification by generating massive binding datasets (10⁶–10¹² data points), the authors identify a critical "generalizability gap" that limits the practical utility of current models. Using Aurora Kinase A (AURKA) as a case study with OpenDEL 4.0 screening data (~1.5 million data points), the authors demonstrate that standard ML models achieve high accuracy on internal validation but frequently fail to generalize to structurally novel scaffolds due to domain shift—the substantial difference between DEL chemical space and known pharmacological compounds. The review provides methodological best practices for data preprocessing, denoising, and validation, while evaluating advanced strategies such as domain adaptation to improve model robustness. The authors argue that future DEL-ML development must move beyond simple accuracy maximization toward explicit handling of distribution shifts to transform DEL-ML from a retrospective analysis tool into a reliable engine for novel chemical discovery. Highlights 1. The Generalizability Challenge in DEL-ML Models trained on DEL data often memorize library-specific building blocks rather than learning transferable structure-activity relationships The BELKA competition revealed that models perform well on test sets within the same chemical space but fail on structurally novel scaffolds Domain shift between DEL training data and external compound collections represents a fundamental barrier to practical application 2. Data Quality and Preprocessing Considerations DEL sequencing data contains unique noise profiles including matrix binding, DNA-tag interference, unequal synthesis yields, and "jackpot" effects Multiple denoising strategies are evaluated: fold-enrichment, Z-scores for ultra-large libraries, disynthon aggregation, and uncertainty-aware probabilistic loss functions Critical importance of subtracting background noise from control experiments (matrix/bead-only) to prevent false positives 3. Class Imbalance and Data Splitting Strategies DEL selections produce highly imbalanced datasets (10¹–10⁴ binders vs. up to ~10¹² nonbinders) Random splitting leads to overoptimistic metrics due to high structural similarity within DEL congeneric series Scaffold-based or library-based splitting provides more rigorous assessment of generalizability to novel chemotypes Undersampling nonbinders (e.g., 1:1 ratio) can boost external sensitivity from ~1% to 20–30%, though this may reflect bias exploitation rather than true generalization 4. Molecular Representation and Model Architectures Traditional fingerprints and physicochemical descriptors often fail to capture subtle variations in DEL compounds Graph neural networks (GNNs) and variational autoencoders (VAEs) show promise but require careful handling of linker/DNA-tag artifacts Compositional (disynthon) approaches reduce sparsity but risk losing "whole-molecule" structural fidelity Conformal prediction frameworks provide calibrated confidence intervals essential for prioritizing predictions in noisy DEL environments 5. Domain Adaptation as a Solution Strategy Covariate shift correction reduces divergence between source (DEL) and target (known binder) domains Using high-confidence predictions from diverse compound collections (e.g., Enamine REAL Diversity Set) as an intermediate domain improves generalization Domain adaptation reduced PCA centroid distance from 0.77 to 0.32 between DEL training data and known AURKA space Retraining with both predicted binders and nonbinders improved Matthews Correlation Coefficient (MCC) from 0.2 to 0.4 on external datasets while maintaining 20–39% sensitivity 6. AURKA Case Study Findings OpenDEL 4.0-derived binders tended to be larger, more lipophilic, and less polar compared to known AURKA inhibitors Despite overall domain shift, highly enriched DEL hits from sublibrary 27 shared conserved hinge-binding motifs with established inhibitors (e.g., VX-680) Mechanistic alignment between DEL hits and known binders confirms that domain shift, rather than fundamental binding mode differences, drives prediction failures Conclusion The integration of DELs with ML presents transformative opportunities for early drug discovery, but realizing this potential requires overcoming the critical generalizability gap. The primary challenge is not data volume but data nature: intrinsic structural biases and systematic false negatives (often linker-induced) cause models to memorize library-specific artifacts rather than learn transferable pharmacophore principles. High internal validation metrics frequently mask failures to extrapolate to novel, pharmacologically relevant scaffolds. The authors advocate for a paradigm shift in DEL-ML development emphasizing: Rigorous validation standards: Moving beyond random splits to scaffold-based and out-of-distribution evaluation Domain alignment strategies: Explicit handling of distribution shifts through domain adaptation and transfer learning Data diversity expansion: Open-source DEL datasets spanning broader drug-like chemical space to reduce single-library bias Integration of physics-based priors: Incorporating docking constraints to reduce overfitting to synthetic artifacts Uncertainty quantification: Systematic use of conformal prediction and applicability domain assessment By pivoting from simple accuracy maximization to robust domain alignment, DEL-ML can evolve from a retrospective analysis tool into a reliable engine for identifying novel chemical starting points. The establishment of standardized benchmarks and community resources will be essential to accelerate the development of generalizable predictive models capable of exploring the vast chemical space beyond individual DEL compositions.

  • DEL-Related Publications

    Hermes: Large DEL Datasets Train Generalizable Protein-Ligand Binding Prediction Models

    Maxwell Kleinsasser ,  Brayden J. Halverson ,  Edward Kraft ,  Sean Francis-Lyon ,  Sarah E. Hugo ,  Mackenzie R. Roman ,  Ben Miller ,  Andrew D. Blevins ,  Ian K. Quigley arXiv - QuanBio - Biomolecules  Abstract The quality and consistency of training data remain critical bottlenecks for protein-ligand binding prediction. Public affinity datasets, aggregated from thousands of labs and assay formats, introduce biases that limit model generalization and complicate evaluation. DNA-encoded chemical libraries (DELs) offer a potential solution: unified experimental protocols generating massive binding datasets across diverse chemical and protein target space. We present Hermes, a lightweight transformer trained exclusively on DEL data from screens against hundreds of protein targets, representing one of the largest and most protein-diverse DEL training sets applied to protein-ligand interaction (PLI) modeling to date. Despite never seeing traditional affinity measurements during training, Hermes generalizes to held-out targets, novel chemical scaffolds, and external benchmarks derived from public binding data and high-throughput screens. Our results demonstrate that DEL data alone captures transferable protein-ligand interaction representations, while Hermes' minimal architecture enables inference speeds suitable for large-scale virtual screening. Summary The paper introduces Hermes, a lightweight transformer-based model trained exclusively on DNA-encoded library (DEL) screening data across 239 protein targets. Despite never using traditional affinity measurements (e.g., IC50, Kd), Hermes generalizes to unseen protein targets, novel chemical scaffolds, and external benchmarks derived from public binding data. The model demonstrates that DEL data alone captures transferable protein-ligand interaction representations, with inference speeds 500–700× faster than state-of-the-art structure-based models like Boltz-2, making it highly suitable for large-scale virtual screening. Highlights Strong generalization: Achieves mean AUROC of 0.68 on the DEL Protein Split (unseen proteins) and 0.60 on Public Binders/Decoys (external benchmarks), with significantly better performance for kinase targets due to kinase-enriched training data. Speed advantage: Processes 28.2 samples/second/GPU on H200 hardware, far outpacing Boltz-2 (0.04 samples/second on H100), critical for cost-effective virtual screening. Limitations: Performance drops on the DEL Chemical Library Split (AUROC ~0.56), suggesting challenges in generalizing to entirely new chemical libraries. Data binarization (binary binding labels) and noise in DEL screening results constrain model expressivity. Practical impact: Highlights DEL datasets as a scalable, unified alternative to fragmented public affinity data (e.g., ChEMBL), with potential to accelerate drug discovery pipelines. Conclusion Hermes demonstrates that DEL-derived data alone can train generalizable protein-ligand binding prediction models without reliance on traditional affinity measurements. Its success underscores the value of large-scale, consistent DEL screening data for capturing transferable biological interactions. As DEL datasets continue to grow beyond public affinity resources, DEL-trained models like Hermes are poised to drive the next generation of computational drug discovery, particularly for targets underrepresented in existing public data. Future improvements could incorporate structural augmentation and continuous binding strength modeling to address current limitations.

Product & Services

OpenDEL™ - Small Molecule

Starting Your Journey to Access the Vast Chemical Space

The Kit

  • 57 Libraries
  • ~3.8Bn compounds
  • 10 DEL samples

 

To Access

  • Fully Enumerated Molecules
  • Building Block Structures
  • DNA Codon Sequences
  • Scaffolds Information

 

✔ No Structure Disclosure Fee

✔ No Compound IP License Fee
Learn More more Quote more
case_01
OpenDEL™ - Small Molecule
01

OpenDEL™ Screening

OpenDEL™ screening is carried out by our team of experienced professionals, proficient in handling over 50 different target types including protein-protein interactions, kinases, enzymes, transcription factors, and RNA targets. Our team typically completes the screening experiments within 1-2 weeks. 
Learn More more Quote more
case_01
OpenDEL™ Screening
02

OpenDEL™ Sequencing

HitGen offers high-quality and gold sequencing service includes. 
  • Global Sample Shipment

  • Outstanding Sequencing Quality

  • Lightning-speed Result Delivery

  • Diverse Sequencing Options

Learn More more Quote more
case_01
OpenDEL™ Sequencing
03

OpenDEL™ Hit Proposal

Analyzing DEL selection data and choosing the right compounds for follow-up necessitates multidisciplinary expertise encompassing biology, computational science, and chemistry. This includes a deep understanding of the experimental design and mechanisms of action (MOAs) in biology, data processing and analysis in computational science, and aspects of both synthetic and DEL chemistry
Learn More more Quote more
case_01
OpenDEL™ Hit Proposal
04

OpenDEL™ Off-DNA Synthesis

HitGen Chemical Services: Innovation-Driven and Precision-Empowered.

We transform your DEL hits into tangible results by delivering the pure, complex structures critical for validating discoveries and accelerating their advancement.

Choose Your Path:

A. Traditional Chemical Synthesis @ HitGen 
B. High Throughput Chemical Synthesis @ HitGen

Learn More more Quote more
case_01
OpenDEL™ Off-DNA Synthesis
05

What are people in the community saying?

Connect with peers. Access breakthrough science. Spark your next discovery.

  • HitGen
    HitGen

    Nelson Garcia-Vazquez, Shaoren Yuan, Moustafa Gabr

    bioRxiv - Pharmacology and Toxicology

    DOI: 10.64898/2026.02.21.707154

    Abstract

    Protein-protein interactions (PPIs) mediated by extracellular ligands remain challenging targets for small molecule intervention due to their large and dynamic interfaces. The interaction between SLIT2 and its receptor ROBO1 plays a critical role in cell migration and tumor progression, yet remains largely unexplored. Here, we report the discovery and optimization of small molecule inhibitors of the SLIT2/ROBO1 interaction enabled by DNA-encoded library (DEL) screening. Affinity selection against SLIT2 identified four structurally diverse hit compounds, which were subsequently validated using orthogonal biophysical assays. Among these, one hit exhibited measurable SLIT2 binding and functional inhibition of the SLIT2/ROBO1 interaction in a time-resolved FRET assay. Guided by physicochemical considerations, a solubility-optimized analog was designed, resulting in a ~50-fold improvement in binding affinity and an ~9-fold enhancement in functional potency. Molecular dynamics simulations and induced-fit docking revealed a stable binding mode within the SLIT2 LRR2 domain and suggested that a benzothiophene substituent was dispensable for target engagement. Fragment-based experimental validation confirmed this prediction, leading to the identification of a minimal azaindole-based pharmacophore that retained nanomolar binding affinity. Collectively, this study demonstrates how DEL-enabled hit discovery combined with rational optimization and fragment deconstruction can yield potent small molecule modulators of a challenging extracellular PPI, providing a foundation for further development of SLIT2/ROBO1 pathway inhibitors.

  • HitGen
    HitGen

    Evan S. O’Brien, Junzheng Wang, Parthasaradhireddy Tanguturi, Mengchu Li, Elizabeth White, Yuki Shiimura, Barnali Paul, Kevin Appourchaux, Kaavya Krishna Kumar, Weijiao Huang, Susruta Majumdar, John R. Traynor, John M. Streicher, Chunlai Chen, Brian K. Kobilka

    bioRxiv - Biophysics

    DOI: 10.64898/2026.02.20.707058

    Abstract

    Opioid agonists such as morphine and fentanyl exert analgesic effects by binding and activating the µ-opioid receptor (µOR), yet agonism of the µOR causes a slate of serious side effects. µOR-mediated addiction and respiratory depression are the major causes of the current opioid overdose crisis, largely driven by the explosion in illicit use of fentanyl, a potent opioid receptor full agonist. Given these serious side effects (and high resulting societal cost), molecules that act as analgesics with distinct mechanisms of action are of great interest. Positive allosteric modulators (PAMs) of the µOR have the potential to avoid many off-target side effects of conventional opioid orthosteric agonists by enhancing the signaling properties of natural opioid peptide systems. We used a DNA-encoded chemical library screening approach to selectively discover active-state-specific µOR PAMs. Two out of 3 selected prospective PAMs displayed the anticipated enhancement in agonist activity. The most effective of these compounds enhanced the activity of all orthosteric opioid agonists tested, including the native opioid peptide met-enkephalin. Little is known about the underlying dynamic basis of allosteric modulation of Family A GPCRs like the µOR. To that end, we used single-molecule fluorescence resonance energy transfer experiments to detail the impact that our novel µOR PAM has on the dynamic activation behavior of a key region on the intracellular face of the receptor. Our results here provide both a new chemical scaffold that acts as a µOR PAM and detailed pharmacological and dynamic insights into its mechanism of action.

  • HitGen
    HitGen

    Jia-ying Xue,Jia-hui Shi,Yuan Yao,Wei-en Xie,Yong Zou,Ming Yan,Xue-jing Zhang

    Organic Letters

    DOI: 10.1021/acs.orglett.6c00234

    Abstract

    Abstract Image

    We report a skeletal editing strategy based on DNA-encoded nitroarenes for the direct conversion of benzene cores into valuable 3H-azepine scaffolds. This transformation is efficiently promoted by visible light in the presence of P(Oi-Pr)3, which serves as a reductant to generate reactive nitrene intermediates from the nitro group. Demonstrating broad substrate scope with applicability to pharmaceutical molecules, this protocol offers an efficient and versatile route to DNA-encoded 3H-azepine derivatives. It thus establishes a robust platform for skeletal diversification in DNA-encoded library synthesis.
     

  • HitGen
    HitGen

    Congbao Kang ,  Hung T. Nguyen ,  David E. Heppner ,  Bin Yu ,  Weijun Xu

    Journal of Medicinal Chemistry

    DOI: 10.1021/acs.jmedchem.6c00070

     

    Over the past decade, RNA has emerged as an attractive and evolving landscape for small-molecule drug discovery. RNA functions as an intermediate macromolecule during gene expression and plays a diverse role in regulating cellular processes. (1) The promise in targeting RNA as therapeutic interventions arises from recent breakthroughs in structural biology, chemical biology, and computational modeling that collectively facilitate understanding of RNA biology. Multiple classes of RNA have been identified contributing to their functional diversity, including messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), long noncoding RNA (lncRNA), microRNA (miRNA), and other noncoding RNAs (ncRNAs). While only a tiny fraction of 1.5% of human genome is translated into proteins, approximately 75% is transcribed into RNA, (2−4) making RNA a vastly larger reservoir of potential drug targets than the traditional protein targets. Moreover, many disease-associated pathways including viral replication, cancer progression, neurodegeneration, and immune regulation are regulated by RNA or through its interactions with different proteins, making RNA an increasingly relevant and strategic focus for next-generation drug discovery. (5) Several classes of small molecules define the landscape of RNA-targeting drug discovery (Figure 1), and many other modulators of RNA have been discovered for further development. (6) Small molecules targeting RNA can act through diverse mechanisms, including direct RNA binding to modulate RNA function, disruption of RNA interactions with RNA binding proteins, and stimulation RNA binding to RNA degrading enzyme for degradation. (7) The success in developing RNA drugs and RNA ligands prove the feasibility of RNA-focused drug discovery. (2) Patients with spinal muscular atrophy (SMA) lack enough survival motor neuron (SMN) protein to maintain adequate muscle function. The lack of SMN protein is believed to drive the pathophysiology of SMA. In August 2020, the FDA approved risdiplam (Evrysdi, RG7916, RO7034067) developed by Roche for the treatment of SMA in patients of all ages. (8) As a small-molecule modulator of SMN2 splicing, risdiplam represents the first FDA-approved RNA-targeting small-molecule drug. Mechanically, risdiplam acts as a “glue”-like compound that enhances RNA binding protein (RBP) to RNA to facilitate splicing and increases the production of full-length functional SMN protein. In addition to risdiplam, there are other drugs or clinical candidates that directly bind to RNA, including small molecules targeting RNA G-quadruplexes such as Quarfloxin and CX-5461, ribosome-targeting antibiotics like macrolides, and viral RNA polymerase nucleosides like sofosbuvir. (9) Together, these precedences highlight the growing potential of RNA as a therapeutic target and lay the foundation for the rational discovery of small-molecule RNA modulators. Figure 1. Small-molecule drugs that target RNAs. While strong incentives have been directed to drug RNA, several fundamental challenges make RNA-targeted drug discovery inherently more intractable than conventional protein-directed approaches. (10) Although proteins can undergo complicated conformational changes and post-translational modifications, RNA is even more dynamic and flexible, adopting distinct conformations that depend on its sequence, cellular environment, and interactions with other biomolecules. (11) Said another way, the highly dynamic nature of RNA implies that robust pockets may be impossible to bind small molecules like unstructured regions of proteins. Unlike proteins formed by 20 amino acids, RNA is composed of only four nucleotides, presenting challenges for specific and tight interactions with small molecules. Additional hurdles include the limited identification of clearly druggable RNA motifs, the intrinsic structural dynamics of RNA, and the difficulty in achieving selectivity across closely related RNA sequences. Additionally, traditional protein-directed drug discovery often relies on both polar and nonpolar interactions within well-defined hydrophobic and hydrophilic pockets. By contrast, RNA features overwhelmingly negatively charged backbones and typically lacks hydrophobic pockets. As a result, the number and structural diversity of druggable RNA binding sites are expected to be limited, which further complicates efforts to achieve selective modulation. As such, the structural and functional nature of RNA intrinsically favors chemical space dominated by highly polar molecules manifesting to at least two major issues: limited selectivity due to the abundance of similarly polar features across cellular RNAs and significant challenges in optimizing ADME properties, particularly lipophilicity. As with drug discovery of protein targets, hit identification via high-throughput screening (HTS), fragment-based screening, and virtual screening are commonly adopted. A major challenge for HTS against RNA targets is that most existing HTS libraries were designed for proteins and are poorly suited for recognizing the unique physicochemical and structural characteristics of RNA. Therefore, the development of RNA-focused libraries has become a key priority, which requires pharmacophore-based strategies through incorporating scaffolds known to interact with RNA motifs. This can be further complemented by the rise of DNA-encoded libraries (DELs) that have emerged as powerful source for hit identification for RNA targets. (4) For RNA to become a more tractable and widely druggable class of biomolecules, advances are needed in generating well-behaved biochemical tools and obtaining detailed structural information on unique or transient RNA conformations. Such innovations will be essential for enabling robust discovery platforms analogous to those that have long supported successful drug development for proteins and enzymes. Despite the above-mentioned challenges, RNA-targeted therapeutics continue to gain momentum across oncology, virology, and neurology. Recent advances in structural biology and computation, artificial intelligence, and multiscale modeling have begun to overcome these barriers and pave a rational foundation for RNA-targeted drug discovery. Structural studies of RNA using chemical biology and biophysical methods have made progress in recent years, providing detailed insights into secondary and tertiary motifs such as hairpins, internal loops, bulges, and pseudoknots, which are considered important structural architectures for developing small molecules. NMR spectroscopy remains a powerful technique for probing RNA structure and dynamics in solution. (12) Its ability to resolve conformational equilibria, detect transient states, and characterize ligand binding has made it indispensable for understanding RNA functional motions and identifying druggable conformations. X-ray crystallography, usually hindered by the intrinsic flexibility of RNA, has become increasingly feasible due to improved construct design, presence of stabilizing ligands, and the use of RBPs to facilitate crystallization. These innovations have enabled high-resolution structures of diverse RNA motifs and RNA–ligand complexes to be solved. (13) Cryo-EM has transformed the field through efficient determination of large RNA molecules and RNA–protein complexes that were previously inaccessible. (14) Complementing experimental approaches, computational methods have evolved rapidly and played a critical role in RNA structural biology and drug discovery. (15) Several algorithms and servers have been developed to predict RNA secondary and tertiary structures, model conformational ensembles, binding pocket identification and screen compound libraries against RNA targets. (16) Machine learning based methods that are supported by continuously expanding high-quality structural data, have improved accuracy in predicting RNA structures, RNA–RBP interactions and RNA–ligand interactions. Collectively, these methods provide a robust foundation for rational discovery of RNA modulators. Advances in physics-based molecular dynamics (MD) simulations are now mapping RNA conformational landscapes to reveal hidden and metastable pockets. Enhanced sampling MD, (17) coarse-grained (CG) modeling, (18) and Markov state models (MSMs) (19) allow the identification of transient states that are invisible to standard spectroscopy and experimental determination. (20) Coupled with ensemble-based docking against multiple conformers, hit rates may improve compared to static docking against RNA targets. Furthermore, the integration of hybrid QM/MM and machine-learning (ML) corrected scoring functions is providing a more rigorous treatment of stacking interactions, hydration shells, and ion-mediated electrostatics, which are critical for accurate RNA-ligand affinity prediction. These methods are increasingly used to derive druggability maps for riboswitches, repeat RNAs, viral elements, and structured motifs within long non-coding RNAs. (21) On the other hand, AI-driven prediction and generative design for RNA-targeted chemistry is on the rise. However, a central barrier is the limited availability of high-resolution RNA–ligand structures, which constrains traditional structure-based design. Emerging AI approaches are addressing this gap through multimodal learning frameworks that integrate RNA sequence, chemical features, SHAPE/DMS reactivity, and evolutionary covariation. Deep learning models improve secondary and tertiary structure inference, enabling more accurate identification of ligandable motifs and conformational states relevant for binding. (22) Recently, prediction of small-molecule–RNA interactions was achieved without the need for RNA tertiary structures as input. (23) However, their broader utility in RNA-focused medicinal chemistry remains to be fully established and will likely require larger data sets and more rigorous experimental validation. Moreover, many disease-relevant RNAs function through multivalent interactions with RBPs. Small molecules that restore normal RNA–RBP equilibrium have demonstrated proof-of-principle activity in correcting splicing, transcriptional regulation, and translation. (24) It remains to be seen whether disrupting RNA–protein binding interaction would bring clinical benefits. As data sets from CLIP–seq, RNP–MaP, and ligand–RNA cross-linking expand, predictive modeling of RBP selectivity will continue to improve. Last but not least, a rapidly growing therapeutic direction targets ribonucleoprotein condensates, that feature dynamic assemblies whose physicochemical properties (viscoelasticity, aging kinetics, etc.) encode key regulatory functions. (25) Aberrant condensate behavior is implicated in many diseases including neurodegenerative diseases and cancer. (25) Small molecules that alter condensate properties, such as softening, hardening, dissolving, or preventing gelation, represent a new modality of RNA-focused therapeutics. (26) Computational modeling is expected to play a central role capturing the structures of mesoscale organization and emergent behaviors of ribonucleoprotein condensates. These approaches collectively support rational design of condensate-modulating drugs, a modality fundamentally different from classical lock-and-key pharmacology. Overall, deep understanding of RNA targets, innovative screening and validation as a collaborative effort from chemical biology and medicinal chemistry is key to the success of RNA targeted drug discovery campaigns. With rapidly advancing techniques in structural biology and artificial intelligence are being developed, we may witness whether small molecule drug discovery targeting RNAs will meet its inflection point or remain a niche curiosity in the years to come. This article references 26 other publications. (acccessed 2024/10/22) PubMed (acccessed 2026/01/04) (acccessed 2026/01/04) This article has not yet been cited by other publications.

  • HitGen
    HitGen

    Xianfeng Li ,  Zehao Yin ,  Qiuyi Chen ,  Xinlong Hu ,  Gong Zhang ,  Xiaohong Fan ,  Yizhou Li

    Organic Letters 

    DOI: 10.1021/acs.orglett.6c00490

    Abstract

    The β-ketoamide motif represents both a privileged scaffold and a versatile synthetic intermediate in medicinal chemistry. Herein, we developed a DNA-compatible method for the efficient conversion of various DNA-conjugated amines into β-ketoamides. The resulting β-ketoamides facilitate rapid diversification into a panel of structurally diverse molecular scaffolds. Importantly, the synthetic route and subsequent derivatization steps were validated to be fully compatible with DNA encoding, offering a reliable and versatile platform for DNA-encoded library synthesis.

     

  • HitGen
    HitGen

    Yulong An ,  Ruolan Zhou ,  Xiang Li

    ACS Medicinal Chemistry Letters 

    DOI: 10.1021/acsmedchemlett.5c00738

    Abstract

    DNA-encoded library (DEL) technology has emerged as a transformative platform for discovering chemical inducers of proximity (CIPs), addressing challenges in both degrader and non-degrader CIP development. This Microperspective analyzes the results of recent DEL technology screens (2021–2025) to enable medicinal chemistry programs, focusing on CIP development including CIP-focused DELs, DEL-derived ligands for proteins of interest (POIs) and E3 ligase in rational CIP design, and directly functional CIP identification. Finally, we address current limitations of DEL technology in CIP research and outline future directions. This Microperspective underscores DEL’s pivotal role in advancing CIP discovery, providing actionable insights for addressing “undruggable” targets and accelerating translational research in chemical biology and medicinal chemistry.

    Summary

    This MicroPerspective reviews recent advances (2021–2025) in DNA-encoded library (DEL) technology for discovering chemical inducers of proximity (CIPs), spanning both degraders (e.g., PROTACs, molecular glue degraders) and non-degrader modalities (e.g., protein stabilization, subcellular relocalization, transcriptional activation). It synthesizes three key strategies: (1) CIP-focused DELs (CIP-DELs), enabling simultaneous dual-target (POI + E3 ligase) selection to directly identify cooperatively binding bifunctional compounds; (2) Conversion of DEL-derived POI/E3 ligands—leveraging well-defined DNA attachment sites as “exit vectors”—into functional CIPs; and (3) Discovery of non-degradative CIPs, including FKBP12-recruiting molecular glues and function-driven DEL screening (e.g., direct ubiquitination readout). DEL overcomes longstanding limitations of traditional HTS—including library size, cost, and scarcity of E3 ligands—thereby accelerating CIP development against “undruggable” targets.

    Highlights

    • Dual-Target CIP-DEL Screening: CRBN- or VHL-targeted DELs enable concurrent selection against POIs and E3 ligases, directly identifying ternary complex stabilizers with high cooperativity (e.g., BRD4/BRD2-selective PROTACs, BRD9 molecular glue).
    • Ligand-to-CIP Conversion Paradigm: DEL-derived ligands for ERα, MAGE-A3, PIN1, DNPH1, and TRIM21 were optimized and converted into functional PROTACs or TrimTACs; the DNA attachment site serves as a built-in, precise “exit vector” for linker conjugation.
    • Expansion to Non-Degradative Functions: An FKBP12-biased CIP-DEL identified a molecular glue that stabilizes the Crohn’s disease-associated ATG16L1 T300A variant; function-driven DEL screening (in presence of E1/E2/ATP) directly enriches ubiquitination-competent PROTACs, eliminating affinity-only false positives.
    • New Frontier: RNA Targets: DEL screening against RNase L led to the design of RiboTACs targeting pre-miR-21—extending CIP therapeutics to RNA biology.

    Conclusion

    DEL technology has evolved from a single-target ligand discovery platform into a central engine driving the discovery of the full spectrum of CIPs—from degraders to non-degraders, and from proteins to RNA. Its core advantages lie in vast chemical space coverage, barcode-enabled precise hit identification, and intrinsic structural information (e.g., defined exit vectors). Future directions include integrating AI for POI–E3 interface–guided library design, developing robust in-cell DEL screening, expanding the repertoire of E3 ligase ligands, and strengthening functional phenotypic and preclinical translational studies—to fully unlock the therapeutic potential of CIPs against “undruggable” targets.

Messages and Feedback

By submitting your information, you acknowledge having received, read and understood our Privacy Notice as made available above.

logo
logo