Autochem is a computational drug discovery platform developed by Pauling.AI. Beyond identifying molecules that bind a single target, Autochem supports counterscreening: a selectivity-focused workflow that simultaneously docks a compound library against two binding sites and ranks candidates by how strongly they distinguish between those sites. The goal is not merely to find binders, but to find selective binders — molecules with high affinity for a desired site and low affinity for an undesired one.
Why Selectivity Matters
Binding affinity to a single target is a necessary but insufficient criterion for a successful drug candidate. Most protein families share conserved structural motifs, and a compound that binds one member potently will often bind closely related proteins with similar or greater affinity. The clinical consequences range from mild (reduced efficacy due to sequestration by an anti-target) to severe (toxicity from on-target activity at the wrong tissue, or off-target interference with an essential protein).
Counterscreening addresses this computationally, before any compound reaches a laboratory. By docking against both the desired and the undesired site within the same pipeline, it becomes possible to rank candidates by a selectivity score — the difference in predicted binding energy between the two sites — rather than by affinity to a single site alone. Compounds with a large positive selectivity score are predicted to engage the desired pocket strongly while leaving the off-target pocket largely unoccupied.
Typical use cases include:
- Isoform selectivity. Kinases, nuclear receptors, and many enzyme families have multiple isoforms with near-identical active sites. Counterscreening can identify compounds whose binding is sensitive to the subtle structural differences between isoforms.
- Allosteric-versus-orthosteric selectivity. A protein may have both an orthosteric active site and one or more allosteric pockets. If binding the orthosteric site is toxic but an allosteric pocket offers a safer modulation point, counterscreening can enrich for compounds that prefer the allosteric site.
- Anti-target avoidance. Early safety filters often flag specific proteins — hERG, CYP isoforms, nuclear hormone receptors — as anti-targets. Counterscreening against a known anti-target simultaneously with the desired target produces a selectivity-enriched hit list at the outset, reducing attrition in later stages.
Two Workflow Modes
Autochem offers two distinct counterscreening workflows, selected based on the biological question:
Mode 1: Two Pockets on the Same Protein
Used when the desired and off-target sites both reside on a single protein structure — for example, an orthosteric active site versus an allosteric site, or two distinct pockets with different biological outcomes. Both sites are identified from the same prepared structure, and docking runs against both pockets are performed in a single pass.
Mode 2: Two Different Protein Structures
Used when the desired target and the off-target are entirely different proteins — for example, when seeking inhibitors selective for one kinase over another structurally similar kinase, or when the off-target is an unrelated protein known to cause adverse effects. Each protein is independently downloaded, prepared, and quality-controlled, and the compound library is docked against each in turn before results are merged and ranked.
The pipeline steps below describe the full workflow. Steps 1–8 apply to both modes. Where Mode 2 diverges — because it requires a second, independent protein preparation — the parallel steps for the off-target protein are described in Phase 2.
Pipeline Steps
Phase 1: Desired Target Preparation
Step 1: Target Identification and Validation
The input identifier for the desired target is validated against three databases in priority order: RCSB PDB, UniProt, and HGNC. This normalization step determines whether the identifier is a PDB ID, a UniProt accession, a UniProt entry name, or a gene symbol, and confirms that it resolves to a known, unambiguous entry. Validation before any download prevents silent failures where a mistyped or deprecated identifier would propagate through the pipeline and produce meaningless results.
Step 2: Protein Metadata Download
Protein metadata is fetched from UniProt. This step resolves cross-references between databases and records gene names, organism, functional annotations, and known isoforms. Having this metadata before structure download allows the pipeline to select the biologically relevant chain and to flag cases where multiple orthologues or isoforms exist — information that is particularly relevant in counterscreening, where isoform context is often central to the selectivity question.
Step 3: Protein Structure Acquisition
The 3D structure is obtained in one of two ways depending on the input identifier.
Experimental structures (RCSB PDB): The pipeline fetches the PDB record and chain information, downloads the mmCIF file alongside the PDB file, converts mmCIF to PDB when no PDB file is available using the Gemmi library, and splits multi-model structures (e.g., NMR ensembles) into individual conformations for independent evaluation. It also queries the OPM (Orientations of Proteins in Membranes) database to flag membrane proteins, and fetches binding site annotations from NCBI for each chain.
Starting from an experimental structure is strongly preferred when one is available. X-ray crystal structures and cryo-EM structures capture the true three-dimensional arrangement of atoms in the binding site, including water molecules, metal ions, and co-crystallized ligands. In a selectivity context, even small structural differences between an isoform pair — a single residue substitution, a shifted loop, a different bound cofactor — can be decisive, so structure quality has a disproportionate effect on the reliability of selectivity predictions.
Predicted structures (AlphaFold DB): For targets without an experimentally determined structure, the pipeline fetches the predicted structure from the AlphaFold Protein Structure Database.[1] Per-residue confidence scores (pLDDT) are available and can flag regions where the predicted geometry is uncertain — particularly disordered loops that may border the binding site.
Alternatively, users may upload their own structure file directly in PDB or mmCIF format. Uploaded structures bypass the download steps and enter the pipeline at the preparation stage.
Step 4: Receptor Preparation — Fixing, Reducing, and Protonating
Raw PDB files from public databases are almost always unsuitable for docking in their downloaded form. The problems are systematic and, if left unaddressed, would produce artifactual selectivity scores:
- Missing hydrogens. X-ray crystallography does not resolve hydrogen atoms at typical resolutions. Hydrogen atoms are essential for docking: they participate in hydrogen bonds, alter the shape and electrostatic surface of the binding site, and are required by AutoDock-family scoring functions to compute partial charges.
- Missing atoms and residues. Flexible loops and termini are often disordered in crystal structures. If a missing loop borders the binding site, leaving it absent creates an artificial opening that distorts the pocket geometry.
- Non-standard residues. Modified amino acids and crystallization artifacts must be converted to their standard equivalents or removed, because force fields do not have parameters for them.
- Incorrect protonation states. The protonation state of titratable residues (histidine, aspartate, glutamate, lysine, arginine, cysteine, tyrosine) depends on the local pH and electrostatic environment. The wrong protonation state at the active site can shift docking scores by several kcal/mol. In selectivity analysis, a protonation error on one protein but not the other would produce a spurious apparent selectivity, making this step particularly important.
The preparation pipeline addresses all of these with a four-stage protocol:
PDBFixer is applied first. It adds missing heavy atoms using template-based modeling, resolves missing terminal residues, replaces non-standard residues with their standard equivalents, removes heterogens, and performs an initial coarse hydrogen placement.
Reduce (TRIM mode) is then applied to optimize hydrogen placement for key titratable and ambiguous groups.[2] This is especially important for histidine (which can be protonated at Nδ, Nε, or both) and for asparagine and glutamine (whose amide groups can be flipped 180° without changing heavy-atom positions but dramatically changing hydrogen-bond geometry). The TRIM variant removes the hydrogens it placed while preserving their optimized orientations, avoiding double-protonation in the following step.
PDB2PQR with the SWANSON force field is the core protonation engine.[3] It assigns protonation states at pH 7.4 using empirical pKa models and adds all hydrogen atoms accordingly. The SWANSON force field assigns partial charges compatible with the AutoDock/Vina scoring function; using charges parameterized for a different force field would create a systematic mismatch that degrades scoring accuracy. PDB2PQR outputs a PQR file — a variant of PDB format that includes per-atom partial charges and radii required by the docking engine.
OpenBabel converts the PQR file back to standard PDB format and then to PDBQT format, the native input format for AutoDock-family docking engines, encoding torsional flexibility information for ligands through BRANCH/ENDBRANCH annotation blocks.
Step 5: Receptor Energy Minimization
Even after fixing and protonation, the protein geometry carries artifacts from template-based atom additions and protonation state changes. These manifest as local steric clashes and bond strain that would bias docking results. The receptor_minimization pipeline applies energy minimization using OpenBabel with the MMFF94 (Merck Molecular Force Field).[4] The minimization runs a two-stage algorithm — steepest descent first to escape large energy barriers quickly, followed by conjugate gradient for efficient convergence near the local minimum. This relaxes strain and resolves clashes without driving the structure away from the experimental geometry.
Step 6: Structural Quality Control with MolProbity
Before committing to the computationally expensive docking stage, the pipeline evaluates the quality of the prepared receptor using MolProbity,[5] the standard tool in structural biology for assessing model quality (used by the wwPDB validation pipeline for all deposited structures). The pipeline calls a MolProbity server with four sequential analyses:
- Oneline analysis — Computes an overall MolProbity score expressed as a percentile relative to structures of comparable resolution in the PDB.
- Clashscore — Counts serious steric overlaps (> 0.4 Å) per 1000 atoms. A high clashscore in the binding site is particularly damaging for selectivity analysis, as it can create artificial voids or occlusions that skew scores relative to the other protein.
- Ramachandran analysis — Evaluates backbone φ/ψ dihedral angles against distributions derived from ultra-high-resolution crystal structures.
- Rotamer quality — Assesses whether side-chain conformations match preferred rotamer distributions. Rotamer outliers in or near the binding site directly affect the shape and electrostatic character of the pocket.
When multiple conformations are available (e.g., from NMR ensembles), the pipeline automatically selects the conformation with the best (lowest) MolProbity score as the representative structure for docking.
Step 7: Binding Pocket Prediction with P2Rank
With a validated, high-quality receptor structure, the pipeline identifies druggable binding sites using P2Rank (version 2.5).[6] P2Rank is a machine learning-based pocket predictor that uses a random forest model trained on the structural and physicochemical properties of protein surfaces. Unlike purely geometric methods that identify concave surface regions, P2Rank learns which surface features are associated with observed ligand binding in the PDB training set.
P2Rank operates on the protein surface as sampled by solvent-accessible surface (SAS) points. For each point, it computes a feature vector encoding local geometry (curvature, burial depth), physicochemical properties (hydrophobicity, charge, hydrogen-bond donor/acceptor capacity), and evolutionary conservation. The random forest outputs a per-point probability of belonging to a binding pocket; nearby high-probability points are clustered into pocket predictions, each ranked by a combined score.
The pipeline then: (1) extracts center coordinates of each predicted pocket; (2) calculates a bounding box using MDAnalysis, sized to enclose all SAS points plus a 2 Šmargin; (3) enforces a maximum box volume of 27,000 ų (the limit imposed by UniDock), scaling oversized boxes down while preserving their center and aspect ratio; and (4) checks new predictions against previously stored sites (within a 5 Šcenter tolerance) to avoid duplicate pockets.
In counterscreening, P2Rank is run with a larger max_number_pockets setting (up to 20 pockets per receptor by default) compared to standard single-target docking. This is intentional: when looking for a pocket to avoid, rare or cryptic pockets that might not appear in a low-budget search need to be surfaced so the user can make an informed selection.
Step 8: Binding Site Selection
After pocket detection, the agent presents all identified pockets to the user through a 3D visualization overlaid on the protein structure and a summary table of scores and positions.
In Mode 1 (two pockets on the same protein): The user selects two distinct pockets — the desired binding site and the off-target site to avoid — from the same receptor structure. The agent enforces that the two selected pocket IDs are different, preventing the trivial and uninformative case where the same pocket is designated as both target and counter-target.
In Mode 2 (two different proteins): The user selects one pocket from the desired protein's pocket list. This completes Phase 1. Phase 2 then begins independently for the off-target protein.
Phase 2: Off-Target Protein Preparation (Mode 2 Only)
In Mode 2, the off-target protein undergoes the same complete preparation pipeline as the desired target: identifier validation (Steps 1–2), metadata download, structure acquisition, receptor preparation with PDBFixer / Reduce / PDB2PQR / OpenBabel (Step 4), energy minimization (Step 5), MolProbity quality control (Step 6), and pocket detection with P2Rank (Step 7).
The off-target structure is stored in a separate project field from the desired structure, so neither overwrites the other and both are available to the downstream docking step. Similarly, the off-target pocket selection is stored separately from the desired pocket selection.
This independent preparation is important because different proteins frequently require different handling: they may have different resolution, different missing regions, different membrane associations, or different relevant protonation states. Sharing a preparation between two different proteins would introduce errors that would corrupt the selectivity comparison.
Phase 3: Counter-Screening Docking
Step 9: Virtual Screening with UniDock Against Both Sites
With both binding sites prepared and identified, the compound library is docked against each site using UniDock,[8] a GPU-accelerated molecular docking engine derived from AutoDock Vina.[9] UniDock reimplements Vina's Monte Carlo–based conformational search on GPU hardware, enabling batch processing of thousands of ligands simultaneously.
Before docking, the compound library is filtered at the database level:
- Drug-likeness (QED ≥ 0.2): The Quantitative Estimate of Drug-likeness[7] combines multiple Lipinski-style descriptors into a single score from 0 to 1. A threshold of 0.2 removes obvious non-drug-like compounds while retaining fragments and lead-like molecules.
- Nanoparticles: Carbon-only structures (fullerenes, nanotubes, etc.) are excluded.
- Metabolites: Endogenous metabolites are excluded to focus the screen on exogenous drug-like molecules.
A two-stage docking strategy balances throughput against accuracy:
Stage 1 — Fast screening across the full library: UniDock runs in fast search mode. Each ligand's best-scoring pose is evaluated by Vina's hybrid scoring function. Ligands scoring worse than −3.0 kcal/mol against the desired site are discarded.
Stage 2 — Detail re-docking of top candidates: The top candidates from Stage 1 are re-docked in detail search mode, which allocates significantly more Monte Carlo steps and evaluations per ligand. This stage uses PDBQT files with explicit hydrogen atoms rather than the implicit-hydrogen representation used in Stage 1, improving scoring accuracy for hydrogen-bond interactions.
Critically, in counterscreening, each compound is docked against both the desired site and the off-target site. The scoring against the off-target uses the same two-stage protocol and the same compound representations, ensuring that the scores from the two sites are directly comparable on the same energy scale.
Step 10: Selectivity Ranking
After docking is complete for both sites, compounds are ranked by a selectivity score, defined as the difference between the Vina score at the off-target site and the Vina score at the desired site:
Because Vina scores are negative (more negative = stronger predicted binding), a large positive selectivity score means the compound binds the desired site much more strongly than the off-target site. Compounds are sorted by this score in descending order, with the most selective compounds appearing first.
This ranking directly reflects the drug discovery objective: the ideal hit not only binds the desired target with high predicted affinity (a strongly negative desired-site score), but also fails to engage the off-target to any significant degree (a near-zero or weakly negative off-target score). The selectivity score encapsulates both criteria in a single quantity.
The top candidates are stored in the database with both docking scores, the selectivity score, SDF coordinates of both docked poses, and cross-references to the receptor structures and binding sites.
Step 11: Docking Quality Control with PoseBusters
High selectivity scores are a necessary but not sufficient condition for a compound to be a viable hit. The final step runs PoseBusters[10] on the docked poses for all top-ranked compounds. PoseBusters applies a battery of deterministic, physics-based checks that include:
- Molecular sanitization: Correct valence, no charged atoms, no radicals.
- Ligand bond geometry: Bond lengths, bond angles, and torsion angles within expected chemical ranges.
- Internal clashes: No steric overlaps between atoms within the ligand.
- Receptor clashes: No steric overlaps between the docked ligand and the protein heavy atoms.
- Binding site proximity: The ligand center of mass is within the declared binding site, not docked to a remote surface patch.
- Flat ring systems: Aromatic rings are planar, as required by their electronic structure.
In a selectivity context, PoseBusters checks are applied to the poses from both binding sites. A compound that passes PoseBusters for its desired-site pose but fails for the off-target pose suggests that the off-target score reflects a spurious or unphysical interaction — potentially inflating the apparent selectivity. Conversely, a compound that passes both checks is a physically plausible binder at both sites, and its selectivity score reflects a genuine geometric and energetic preference for the desired site over the off-target.
The final results table displays the top-ranked selective compounds with their target scores, off-target scores, selectivity scores, and PoseBusters validation status for each pose.
Summary of Tools
| Step | Tool | Purpose |
|---|---|---|
| Identifier validation | RCSB PDB / UniProt / HGNC | Normalize and confirm input identifiers |
| Structure acquisition | RCSB PDB | Download experimental 3D structures |
| Structure acquisition | AlphaFold DB | Download predicted 3D structures |
| Metadata | UniProt | Protein metadata and cross-references |
| Metadata | NCBI | Chain annotations and binding sites |
| Format conversion | Gemmi | mmCIF to PDB conversion |
| Membrane check | OPM | Identify membrane proteins |
| Structure fixing | PDBFixer | Missing atoms, non-standard residues |
| Hydrogen optimization | Reduce | Hydrogen flipping and orientation |
| Protonation | PDB2PQR (SWANSON FF) | pH-aware protonation and charge assignment |
| Format conversion | OpenBabel | PQR→PDB, PDB→PDBQT conversions |
| Energy minimization | OpenBabel (MMFF94) | Receptor geometry relaxation |
| Quality control | MolProbity | Structure validation and ranking |
| Pocket prediction | P2Rank v2.5 | ML-based binding pocket identification |
| Ligand filtering | QED | Drug-likeness pre-filter |
| Docking | UniDock | GPU-accelerated virtual screening against both sites |
| Docking scoring function | AutoDock Vina | Scoring function underlying UniDock |
| Pose quality control | PoseBusters | Physics-based docking pose validation |
| Distributed compute | Apache Beam / Dataflow | Scalable pipeline execution |
Infrastructure
All pipelines run as Apache Beam jobs on Google Cloud Dataflow. The docking steps use GPU workers provisioned on demand; the rest of the pipeline runs on CPU workers. Persistent data (structures, sites, docking results) is stored in Cloud SQL (MySQL), and large files (PDB, SDF, PDBQT, NDJSON) are stored in Google Cloud Storage. In Mode 2 (two proteins), both preparation pipelines can be executed in parallel, with the counter-screening docking step depending on both completing successfully.
References
-
Jumper, J. et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature, 596(7873), 583–589. https://doi.org/10.1038/s41586-021-03819-2
-
Word, J.M., Lovell, S.C., Richardson, J.S., Richardson, D.C. (1999). Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. Journal of Molecular Biology, 285(4), 1711–1733.
-
Dolinsky, T.J., Nielsen, J.E., McCammon, J.A., Baker, N.A. (2004). PDB2PQR: an automated pipeline for the setup of Poisson–Boltzmann electrostatics calculations. Nucleic Acids Research, 32(suppl. 2), W665–W667. https://doi.org/10.1093/nar/gkh381
-
Halgren, T.A. (1996). Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. Journal of Computational Chemistry, 17(5–6), 490–519. https://doi.org/10.1002/(SICI)1096-987X(199604)17:5/6<490::AID-JCC1>3.0.CO;2-P
-
Williams, C.J. et al. (2018). MolProbity: more and better reference data for improved all-atom structure validation. Protein Science, 27(1), 293–315. https://doi.org/10.1002/pro.3330
-
Krivák, R. and Hoksza, D. (2018). P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure. Journal of Cheminformatics, 10, 39. https://doi.org/10.1186/s13321-018-0285-8
-
Bickerton, G.R., Paolini, G.V., Besnard, J., Marechal, S., Hopkins, A.L. (2012). Quantifying the chemical beauty of drugs. Nature Chemistry, 4, 90–98. https://doi.org/10.1038/nchem.1243
-
Yu, Y. et al. (2023). Uni-Dock: GPU-accelerated docking enables ultralarge virtual screening. Journal of Chemical Theory and Computation, 19(11), 3336–3345. https://doi.org/10.1021/acs.jctc.2c01145
-
Trott, O. and Olson, A.J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry, 31(2), 455–461. https://doi.org/10.1002/jcc.21334
-
Buttenschoen, M., Morris, G.M., Deane, C.M. (2024). PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chemical Science, 15, 3130–3139. https://doi.org/10.1039/d3sc04185a