Welcome to OpenDEL™ Community A central hub to connect with global DEL professionals, access the latest industry insights and product updates, and collaborate to accelerate drug discovery. Learn More
DEL-Related Publications 24 February 2026 Photochemical Synthesis of DNA-Encoded 3H-Azepines via Skeletal Editing of Nitroarenes Jia-ying Xue,Jia-hui Shi,Yuan Yao,Wei-en Xie,Yong Zou,Ming Yan,Xue-jing Zhang Organic Letters DOI: 10.1021/acs.orglett.6c00234 Abstract We report a skeletal editing strategy based on DNA-encoded nitroarenes for the direct conversion of benzene cores into valuable 3H-azepine scaffolds. This transformation is efficiently promoted by visible light in the presence of P(Oi-Pr)3, which serves as a reductant to generate reactive nitrene intermediates from the nitro group. Demonstrating broad substrate scope with applicability to pharmaceutical molecules, this protocol offers an efficient and versatile route to DNA-encoded 3H-azepine derivatives. It thus establishes a robust platform for skeletal diversification in DNA-encoded library synthesis. Learn More DEL-Related Publications 23 February 2026 Discovery and Optimization of Small Molecule Inhibitors of the SLIT2/ROBO1 Protein-Protein Interaction Using DNA-Encoded Libraries Nelson Garcia-Vazquez, Shaoren Yuan, Moustafa Gabr bioRxiv - Pharmacology and Toxicology DOI: 10.64898/2026.02.21.707154 Abstract Protein-protein interactions (PPIs) mediated by extracellular ligands remain challenging targets for small molecule intervention due to their large and dynamic interfaces. The interaction between SLIT2 and its receptor ROBO1 plays a critical role in cell migration and tumor progression, yet remains largely unexplored. Here, we report the discovery and optimization of small molecule inhibitors of the SLIT2/ROBO1 interaction enabled by DNA-encoded library (DEL) screening. Affinity selection against SLIT2 identified four structurally diverse hit compounds, which were subsequently validated using orthogonal biophysical assays. Among these, one hit exhibited measurable SLIT2 binding and functional inhibition of the SLIT2/ROBO1 interaction in a time-resolved FRET assay. Guided by physicochemical considerations, a solubility-optimized analog was designed, resulting in a ~50-fold improvement in binding affinity and an ~9-fold enhancement in functional potency. Molecular dynamics simulations and induced-fit docking revealed a stable binding mode within the SLIT2 LRR2 domain and suggested that a benzothiophene substituent was dispensable for target engagement. Fragment-based experimental validation confirmed this prediction, leading to the identification of a minimal azaindole-based pharmacophore that retained nanomolar binding affinity. Collectively, this study demonstrates how DEL-enabled hit discovery combined with rational optimization and fragment deconstruction can yield potent small molecule modulators of a challenging extracellular PPI, providing a foundation for further development of SLIT2/ROBO1 pathway inhibitors. Learn More DEL-Related Publications 22 February 2026 DNA-Compatible Synthesis of β-Ketoamides as Intermediates for On-DNA Chemical Diversification Xianfeng Li , Zehao Yin , Qiuyi Chen , Xinlong Hu , Gong Zhang , Xiaohong Fan , Yizhou Li Organic Letters DOI: 10.1021/acs.orglett.6c00490 Abstract The β-ketoamide motif represents both a privileged scaffold and a versatile synthetic intermediate in medicinal chemistry. Herein, we developed a DNA-compatible method for the efficient conversion of various DNA-conjugated amines into β-ketoamides. The resulting β-ketoamides facilitate rapid diversification into a panel of structurally diverse molecular scaffolds. Importantly, the synthetic route and subsequent derivatization steps were validated to be fully compatible with DNA encoding, offering a reliable and versatile platform for DNA-encoded library synthesis. Learn More DEL-Related Publications 21 February 2026 Discovery and dynamic pharmacology of µ-opioid receptor positive allosteric modulators Evan S. O’Brien, Junzheng Wang, Parthasaradhireddy Tanguturi, Mengchu Li, Elizabeth White, Yuki Shiimura, Barnali Paul, Kevin Appourchaux, Kaavya Krishna Kumar, Weijiao Huang, Susruta Majumdar, John R. Traynor, John M. Streicher, Chunlai Chen, Brian K. Kobilka bioRxiv - Biophysics DOI: 10.64898/2026.02.20.707058 Abstract Opioid agonists such as morphine and fentanyl exert analgesic effects by binding and activating the µ-opioid receptor (µOR), yet agonism of the µOR causes a slate of serious side effects. µOR-mediated addiction and respiratory depression are the major causes of the current opioid overdose crisis, largely driven by the explosion in illicit use of fentanyl, a potent opioid receptor full agonist. Given these serious side effects (and high resulting societal cost), molecules that act as analgesics with distinct mechanisms of action are of great interest. Positive allosteric modulators (PAMs) of the µOR have the potential to avoid many off-target side effects of conventional opioid orthosteric agonists by enhancing the signaling properties of natural opioid peptide systems. We used a DNA-encoded chemical library screening approach to selectively discover active-state-specific µOR PAMs. Two out of 3 selected prospective PAMs displayed the anticipated enhancement in agonist activity. The most effective of these compounds enhanced the activity of all orthosteric opioid agonists tested, including the native opioid peptide met-enkephalin. Little is known about the underlying dynamic basis of allosteric modulation of Family A GPCRs like the µOR. To that end, we used single-molecule fluorescence resonance energy transfer experiments to detail the impact that our novel µOR PAM has on the dynamic activation behavior of a key region on the intracellular face of the receptor. Our results here provide both a new chemical scaffold that acts as a µOR PAM and detailed pharmacological and dynamic insights into its mechanism of action. Learn More DEL-Related Publications 19 February 2026 Toward generalizable predictive models for DNA-encoded libraries Vasanthanathan Poongavanam , S. Pauliina Turunen , Kristian Sandberg , Ulrika Yngve , Johan Wannberg Drug Discovery Today DOI: 10.1016/j.drudis.2026.104629 Abstract DNA-encoded libraries (DELs) combined with machine learning (ML) offer a powerful paradigm for hit identification. However, sequencing-derived enrichment data are inherently noisy and biased, often resulting in models that overfit to specific chemical libraries. In this review, we critically evaluate the capabilities and limitations of DEL-ML, illustrating key challenges using Aurora Kinase A (AURKA) DEL affinity selection data. We demonstrate that standard ML models often struggle to generalize to unseen chemical space because of the specific structural constraints of combinatorial libraries. Furthermore, we discuss the necessity of rigorous denoising strategies and evaluate approaches, such as domain adaptation, to mitigate these limitations, offering a roadmap for building robust models capable of exploring diverse chemical space. Summary This review critically examines the integration of machine learning (ML) with DNA-encoded library (DEL) technology for drug discovery. While DEL-ML offers a powerful paradigm for hit identification by generating massive binding datasets (10⁶–10¹² data points), the authors identify a critical "generalizability gap" that limits the practical utility of current models. Using Aurora Kinase A (AURKA) as a case study with OpenDEL 4.0 screening data (~1.5 million data points), the authors demonstrate that standard ML models achieve high accuracy on internal validation but frequently fail to generalize to structurally novel scaffolds due to domain shift—the substantial difference between DEL chemical space and known pharmacological compounds. The review provides methodological best practices for data preprocessing, denoising, and validation, while evaluating advanced strategies such as domain adaptation to improve model robustness. The authors argue that future DEL-ML development must move beyond simple accuracy maximization toward explicit handling of distribution shifts to transform DEL-ML from a retrospective analysis tool into a reliable engine for novel chemical discovery. Highlights 1. The Generalizability Challenge in DEL-ML Models trained on DEL data often memorize library-specific building blocks rather than learning transferable structure-activity relationships The BELKA competition revealed that models perform well on test sets within the same chemical space but fail on structurally novel scaffolds Domain shift between DEL training data and external compound collections represents a fundamental barrier to practical application 2. Data Quality and Preprocessing Considerations DEL sequencing data contains unique noise profiles including matrix binding, DNA-tag interference, unequal synthesis yields, and "jackpot" effects Multiple denoising strategies are evaluated: fold-enrichment, Z-scores for ultra-large libraries, disynthon aggregation, and uncertainty-aware probabilistic loss functions Critical importance of subtracting background noise from control experiments (matrix/bead-only) to prevent false positives 3. Class Imbalance and Data Splitting Strategies DEL selections produce highly imbalanced datasets (10¹–10⁴ binders vs. up to ~10¹² nonbinders) Random splitting leads to overoptimistic metrics due to high structural similarity within DEL congeneric series Scaffold-based or library-based splitting provides more rigorous assessment of generalizability to novel chemotypes Undersampling nonbinders (e.g., 1:1 ratio) can boost external sensitivity from ~1% to 20–30%, though this may reflect bias exploitation rather than true generalization 4. Molecular Representation and Model Architectures Traditional fingerprints and physicochemical descriptors often fail to capture subtle variations in DEL compounds Graph neural networks (GNNs) and variational autoencoders (VAEs) show promise but require careful handling of linker/DNA-tag artifacts Compositional (disynthon) approaches reduce sparsity but risk losing "whole-molecule" structural fidelity Conformal prediction frameworks provide calibrated confidence intervals essential for prioritizing predictions in noisy DEL environments 5. Domain Adaptation as a Solution Strategy Covariate shift correction reduces divergence between source (DEL) and target (known binder) domains Using high-confidence predictions from diverse compound collections (e.g., Enamine REAL Diversity Set) as an intermediate domain improves generalization Domain adaptation reduced PCA centroid distance from 0.77 to 0.32 between DEL training data and known AURKA space Retraining with both predicted binders and nonbinders improved Matthews Correlation Coefficient (MCC) from 0.2 to 0.4 on external datasets while maintaining 20–39% sensitivity 6. AURKA Case Study Findings OpenDEL 4.0-derived binders tended to be larger, more lipophilic, and less polar compared to known AURKA inhibitors Despite overall domain shift, highly enriched DEL hits from sublibrary 27 shared conserved hinge-binding motifs with established inhibitors (e.g., VX-680) Mechanistic alignment between DEL hits and known binders confirms that domain shift, rather than fundamental binding mode differences, drives prediction failures Conclusion The integration of DELs with ML presents transformative opportunities for early drug discovery, but realizing this potential requires overcoming the critical generalizability gap. The primary challenge is not data volume but data nature: intrinsic structural biases and systematic false negatives (often linker-induced) cause models to memorize library-specific artifacts rather than learn transferable pharmacophore principles. High internal validation metrics frequently mask failures to extrapolate to novel, pharmacologically relevant scaffolds. The authors advocate for a paradigm shift in DEL-ML development emphasizing: Rigorous validation standards: Moving beyond random splits to scaffold-based and out-of-distribution evaluation Domain alignment strategies: Explicit handling of distribution shifts through domain adaptation and transfer learning Data diversity expansion: Open-source DEL datasets spanning broader drug-like chemical space to reduce single-library bias Integration of physics-based priors: Incorporating docking constraints to reduce overfitting to synthetic artifacts Uncertainty quantification: Systematic use of conformal prediction and applicability domain assessment By pivoting from simple accuracy maximization to robust domain alignment, DEL-ML can evolve from a retrospective analysis tool into a reliable engine for identifying novel chemical starting points. The establishment of standardized benchmarks and community resources will be essential to accelerate the development of generalizable predictive models capable of exploring the vast chemical space beyond individual DEL compositions. Learn More DEL-Related Publications 19 February 2026 Hermes: Large DEL Datasets Train Generalizable Protein-Ligand Binding Prediction Models Maxwell Kleinsasser , Brayden J. Halverson , Edward Kraft , Sean Francis-Lyon , Sarah E. Hugo , Mackenzie R. Roman , Ben Miller , Andrew D. Blevins , Ian K. Quigley arXiv - QuanBio - Biomolecules Abstract The quality and consistency of training data remain critical bottlenecks for protein-ligand binding prediction. Public affinity datasets, aggregated from thousands of labs and assay formats, introduce biases that limit model generalization and complicate evaluation. DNA-encoded chemical libraries (DELs) offer a potential solution: unified experimental protocols generating massive binding datasets across diverse chemical and protein target space. We present Hermes, a lightweight transformer trained exclusively on DEL data from screens against hundreds of protein targets, representing one of the largest and most protein-diverse DEL training sets applied to protein-ligand interaction (PLI) modeling to date. Despite never seeing traditional affinity measurements during training, Hermes generalizes to held-out targets, novel chemical scaffolds, and external benchmarks derived from public binding data and high-throughput screens. Our results demonstrate that DEL data alone captures transferable protein-ligand interaction representations, while Hermes' minimal architecture enables inference speeds suitable for large-scale virtual screening. Summary The paper introduces Hermes, a lightweight transformer-based model trained exclusively on DNA-encoded library (DEL) screening data across 239 protein targets. Despite never using traditional affinity measurements (e.g., IC50, Kd), Hermes generalizes to unseen protein targets, novel chemical scaffolds, and external benchmarks derived from public binding data. The model demonstrates that DEL data alone captures transferable protein-ligand interaction representations, with inference speeds 500–700× faster than state-of-the-art structure-based models like Boltz-2, making it highly suitable for large-scale virtual screening. Highlights Strong generalization: Achieves mean AUROC of 0.68 on the DEL Protein Split (unseen proteins) and 0.60 on Public Binders/Decoys (external benchmarks), with significantly better performance for kinase targets due to kinase-enriched training data. Speed advantage: Processes 28.2 samples/second/GPU on H200 hardware, far outpacing Boltz-2 (0.04 samples/second on H100), critical for cost-effective virtual screening. Limitations: Performance drops on the DEL Chemical Library Split (AUROC ~0.56), suggesting challenges in generalizing to entirely new chemical libraries. Data binarization (binary binding labels) and noise in DEL screening results constrain model expressivity. Practical impact: Highlights DEL datasets as a scalable, unified alternative to fragmented public affinity data (e.g., ChEMBL), with potential to accelerate drug discovery pipelines. Conclusion Hermes demonstrates that DEL-derived data alone can train generalizable protein-ligand binding prediction models without reliance on traditional affinity measurements. Its success underscores the value of large-scale, consistent DEL screening data for capturing transferable biological interactions. As DEL datasets continue to grow beyond public affinity resources, DEL-trained models like Hermes are poised to drive the next generation of computational drug discovery, particularly for targets underrepresented in existing public data. Future improvements could incorporate structural augmentation and continuous binding strength modeling to address current limitations. Learn More
OpenDEL™ - Small Molecule Starting Your Journey to Access the Vast Chemical Space The Kit 57 Libraries ~3.8Bn compounds 10 DEL samples To Access Fully Enumerated Molecules Building Block Structures DNA Codon Sequences Scaffolds Information ✔ No Structure Disclosure Fee ✔ No Compound IP License Fee
OpenDEL™ Screening OpenDEL™ screening is carried out by our team of experienced professionals, proficient in handling over 50 different target types including protein-protein interactions, kinases, enzymes, transcription factors, and RNA targets. Our team typically completes the screening experiments within 1-2 weeks.
OpenDEL™ Sequencing HitGen offers high-quality and gold sequencing service includes. Global Sample Shipment Outstanding Sequencing Quality Lightning-speed Result Delivery Diverse Sequencing Options
OpenDEL™ Hit Proposal Analyzing DEL selection data and choosing the right compounds for follow-up necessitates multidisciplinary expertise encompassing biology, computational science, and chemistry. This includes a deep understanding of the experimental design and mechanisms of action (MOAs) in biology, data processing and analysis in computational science, and aspects of both synthetic and DEL chemistry
OpenDEL™ Off-DNA Synthesis HitGen Chemical Services: Innovation-Driven and Precision-Empowered. We transform your DEL hits into tangible results by delivering the pure, complex structures critical for validating discoveries and accelerating their advancement. Choose Your Path: A. Traditional Chemical Synthesis @ HitGen B. High Throughput Chemical Synthesis @ HitGen