Synthesis#

The synkit.Synthesis package provides a unified interface for reaction prediction and chemical reaction network (CRN) exploration. It applies rule-based graph rewriting to molecular structures, allowing you to enumerate candidate products (forward mode) or candidate precursors (backward mode) from reaction templates.

Reaction Prediction: Reactor#

The synkit.Synthesis.Reactor submodule applies a reaction template (SMARTS / rule) to an input substrate (SMILES) and enumerates all valid transformations under a chosen graph-matching strategy.

Two interchangeable backends are available:

NetworkX-based reactor SynReactor (lightweight, pure-Python workflow and tight integration with synkit graphs)
MØD-based reactor MODReactor [3] (graph-grammar engine backend, suitable for robust rewriting and larger workloads)

Reactor parameters#

Name	Type	Default	Description
`invert`	bool	`False`	Direction of application. Use `False` for forward prediction (substrate → products) and `True` for backward prediction (target → precursors).
`explicit_h`	bool	`False`	When `True`, hydrogens in the reaction center are rendered explicitly in the output SMARTS. This is useful for debugging, auditing rule scope, and disambiguating closely related matches.
`strategy`	str	`'bt'`	Graph-matching strategy used to enumerate transformations: `'comp'`: component-aware matching (fastest; recommended for multi-component SMILES) `'all'`: exhaustive arbitrary subgraph search (most expensive) `'bt'`: fallback strategy (tries `comp` first, then `all` if no match is found)
`template_format`	str	`'typesGH'`	ITS representation used when the template is a reaction string. Use `'tuple'` for the Lewis State Graph representation.
`electron_diagnostics`	bool	`False`	When `True`, keep Lewis-state accounting diagnostics on generated ITS objects. This is useful when inspecting charge, lone-pair, or radical recomputation. The option name remains `electron_diagnostics` for API compatibility.
`automorphism`	bool	`True`	Deduplicate symmetry-equivalent matches before rewriting.

Example: Forward Prediction (NetworkX)#

Forward prediction with explicit H and backtracking strategy#

from synkit.Synthesis.Reactor.syn_reactor import SynReactor

input_fw = 'CC=O.CC=O'
template = '[C:2]=[O:3].[C:4]([H:7])[H:8]>>[C:2]=[C:4].[O:3]([H:7])[H:8]'

reactor = SynReactor(
    substrate=input_fw,
    template=template,
    invert=False,
    explicit_h=True,
    strategy='bt'
)

smarts_list = reactor.smarts_list
print(smarts_list)

Example output

[
  '[CH3:1][CH:2]=[O:3].[CH:4]([CH:5]=[O:6])([H:7])[H:8]>>[CH3:1][CH:2]=[CH:4][CH:5]=[O:6].[O:3]([H:7])[H:8]',
  '[CH3:4][CH:5]=[O:6].[CH:1]([CH:2]=[O:3])([H:7])[H:8]>>[CH:1]([CH:2]=[O:3])=[CH:5][CH3:4].[O:6]([H:7])[H:8]'
]

Example: Backward Prediction (NetworkX)#

Backward prediction targeting product to precursors#

from synkit.Synthesis.Reactor.syn_reactor import SynReactor

target = 'CC=CC=O.O'
template = '[C:2]=[O:3].[C:4]([H:7])[H:8]>>[C:2]=[C:4].[O:3]([H:7])[H:8]'

reactor_bw = SynReactor(
    substrate=target,
    template=template,
    invert=True,
    explicit_h=False,
    strategy='comp'
)

precursors = reactor_bw.smarts_list
print(precursors)

Example output

[
  '[CH3:1][CH:2]=[O:6].[CH3:3][CH:4]=[O:5]>>[CH3:1][CH:2]=[CH:3][CH:4]=[O:5].[OH2:6]',
  '[CH3:1][CH3:2].[CH:3]([CH:4]=[O:5])=[O:6]>>[CH3:1][CH:2]=[CH:3][CH:4]=[O:5].[OH2:6]'
]

Example: Implicit-H Template (NetworkX)#

If your template is written in an implicit-H form, enable it via implicit_temp=True while keeping explicit_h=False.

Backward prediction with an implicit-H template#

from synkit.Synthesis.Reactor.syn_reactor import SynReactor

target = 'CC=CC=O.O'
template = '[C:2]=[O:3].[CH2:4]>>[C:2]=[C:4].[OH2:3]'

reactor_imp = SynReactor(
    substrate=target,
    template=template,
    invert=True,
    explicit_h=False,
    strategy='comp',
    implicit_temp=True
)

precursors = reactor_imp.smarts_list
print(precursors)

Example output

[
  '[CH3:1][CH:2]=[O:6].[CH3:3][CH:4]=[O:5]>>[CH3:1][CH:2]=[CH:3][CH:4]=[O:5].[OH2:6]',
  '[CH3:1][CH3:2].[CH:3]([CH:4]=[O:5])=[O:6]>>[CH3:1][CH:2]=[CH:3][CH:4]=[O:5].[OH2:6]'
]

Lewis State Graph Templates#

The NetworkX reactor can consume Lewis State Graph (LSG) templates. This is the SynKit-native path for transformations where valence-state information matters: lone pairs, radicals, valence electrons, and sigma/pi bond components are stored in the template and used during matching/rewrite. In the current API LSG construction is requested with format="tuple".

There are two common entry points:

Build the LSG template explicitly#

from synkit.IO import rsmi_to_its
from synkit.Synthesis.Reactor.syn_reactor import SynReactor

smart = "[NH3:1].[CH3:2][Cl:3]>>[NH3+:1][CH3:2].[Cl-:3]"
substrate = "CCl.N"
template = rsmi_to_its(smart, core=False, format="tuple")

reactor = SynReactor(
    substrate=substrate,
    template=template,
    implicit_temp=True,
    explicit_h=False,
    electron_diagnostics=True,
)

print(reactor.smarts)

Let SynReactor build an LSG template from a reaction string#

reactor = SynReactor(
    substrate="CCl.N",
    template="[NH3:1].[CH3:2][Cl:3]>>[NH3+:1][CH3:2].[Cl-:3]",
    template_format="tuple",
    implicit_temp=True,
    explicit_h=False,
    electron_diagnostics=True,
)

LSG rewrite policy:

Concept	Policy
Bond truth	`sigma_order` and `pi_order` are authoritative in new mode.
Product reconstruction	`kekule_order` is computed from `sigma_order + pi_order` before conversion through RDKit.
Charge	Charge is recomputed from valence electrons, lone pairs, hydrogen count, radical count, and Kekule bond-order sum.
Aromaticity	Aromatic flags are still useful for matching and display, but aromatic `order=1.5` is not used as the LSG-authoritative rewrite value.

Note

LSG rewriting is currently a SynKit SynReactor path. MØD-backed reactors remain on the legacy rule representation.

Example: Forward Prediction (MØD)#

Forward prediction using the MØD backend#

from synkit.Synthesis.Reactor.mod_reactor import MODReactor

input_fw = 'CC=O.CC=O'
template = '[C:2]=[O:3].[C:4]([H:7])[H:8]>>[C:2]=[C:4].[O:3]([H:7])[H:8]'

reactor_mod = MODReactor(
    substrate=input_fw,
    rule_file=template,
    invert=False,
    strategy='bt'
)

reaction_list = reactor_mod.reaction_smiles
print(reaction_list)

Example output

['CC=O.CC=O>>CC=CC=O.O']

Example: Backward Prediction with AAM (MØD)#

When atom mapping must be retained end-to-end, use the AAM-aware variant (e.g., MODAAM) together with a GML rule representation.

Backward prediction with atom-map preservation#

from synkit.Synthesis.Reactor.mod_aam import MODAAM
from synkit.IO import smart_to_gml

input_bw = 'CC=CC=O.O'
rule_gml = smart_to_gml(
    '[C:2]=[O:3].[C:4]([H:7])[H:8]>>[C:2]=[C:4].[O:3]([H:7])[H:8]',
    core=True
)

reactor_aam = MODAAM(
    substrate=input_bw,
    rule_file=rule_gml,
    invert=True,
    strategy='bt'
)

smarts_list = reactor_aam.get_smarts()
print(smarts_list)

Example output

[
  '[CH3:1][CH:2]=[O:3].[CH:4]([CH:5]=[O:6])([H:7])[H:8]>>[CH3:1][CH:2]=[CH:4][CH:5]=[O:6].[O:3]([H:7])[H:8]',
  '[CH3:1][CH:2]([H:3])[H:4].[CH:5]([CH:6]=[O:7])=[O:8]>>[CH3:1][CH:2]=[CH:5][CH:6]=[O:7].[H:3][O:8][H:4]'
]

Radical-based linking#

RBLEngine links forward and backward template applications through a wildcard-aware reaction-centre overlap. It is useful when a direct reactor application is insufficient and the two sides need to be fused through a shared core.

Choose the execution mode according to the required recall and cost:

"fast_track" performs only a cheap reactor round-trip.
"early_stop" (the default) also constructs ITS candidates but stops before maximum-common-subgraph (MCS) fusion.
"full" performs wildcard-aware MCS fusion and returns all collected unique candidates; it is the most expensive mode.

Run the RBL engine with its default exact MCS matcher#

from synkit.Synthesis.Reactor.rbl_engine import RBLEngine

engine = RBLEngine(mode="early_stop")
result = engine.process(reaction_rsmi, template)
candidates = result.fused_rsmis

Use mode="full" only when the early path does not provide enough candidates. matcher_cls accepts ApproxMCSMatcher for a faster, heuristic alternative on large or highly symmetric ITS graphs.

Synthesis#

Reaction Prediction: Reactor#

Reactor parameters#

Example: Forward Prediction (NetworkX)#

Example: Backward Prediction (NetworkX)#

Example: Implicit-H Template (NetworkX)#

Lewis State Graph Templates#

Example: Forward Prediction (MØD)#

Example: Backward Prediction with AAM (MØD)#

Radical-based linking#

See Also#