One integer coordinate for all 64 codons, built from the two physical bits of each base. Given the code's degeneracy, it is the unique linearization that keeps each amino acid's codons contiguous. Selected by the degeneracy, not forced by mathematics.
UCAG 4×16×1, unique up to reading direction, fixed by the ground state · Paper on Zenodo
Four states, two bits: ring type and hydrogen-bond count. The two diagonals that flip both bits, U↔G and C↔A, are the transversions. This is the substrate.
From the two bits the substrate follows: the Gray-code adjacencies, the divergence classes, the homonucleotide anchors, and the symmetries. Three positions weighted 4×16×1 give the 64 addresses. All of this is true of the addressing regardless of biology, the same under any labeling.
The substrate is forced once you have four letters from two bits read in triples. What the center-dominant reading adds is the contiguous ordering of the families, and that is selected by the code's degeneracy, not forced by the mathematics.
Require that each amino acid's codons group together, and of the 144 frameworks two survive, related by reading direction. Fixing the direction by the ground state leaves one.
To organize codons systematically, amino acids with multiple codons must have those codons grouped together. Leucine has 6 codons - they must form a block. Methionine has 1 codon - it must separate from Isoleucine's 3 codons.
144 possible frameworks: 6 ways to weight positions × 24 ways to order nucleotides. Each tested against the grouping requirement.
Only UCAG ordering with 4×16×1 weighting creates proper grouping. Formula: CA = 4×1st + 16×2nd + 1×3rd. Everything else scatters amino acids across the coordinate space.
Weighting: the wobble position takes the unit weight, removing four of six weightings; leucine's six codons decide center over edge among the rest.
Order and direction: two frameworks keep the families contiguous, UCAG 4×16×1 and its reverse reading GACU 4×16×1. They are one coordinate read in opposite directions.
Result: fixing the reading direction by the ground state (UUU = 0) selects UCAG 4×16×1 uniquely, conditional on the degeneracy pattern.
UCAG 4×16×1 generates a three-dimensional coordinate system organizing all 64 codons as a 4×4×4 cube
Every codon maps to coordinates 0-63. Four homo-nucleotide codons (UUU, CCC, AAA, GGG) anchor the main diagonal at positions 0, 21, 42, 63.
Single nucleotide mutations produce predictable coordinate changes: ΔCA = ±1, ±4, or ±16 single-digit changes in quaternary notation.
19 of 20 amino acids have all codons within a single plane (same middle nucleotide). Natural boundaries at 10/11, 31/32, and 52/53 create four functional domains.
The coordinate is descriptive, not predictive. Distances like ΔCA largely re-express the known dominance of second-position substitutions, and the clinical correlations are confounded; no predictive use is claimed.
Each codon's middle nucleotide sets its own reference. Flanking positions are measured against it without external frame required. Local order.
The polar, charged and reactive residues and all three stops concentrate in CA 32-52. Under a degeneracy-preserving null this window is modest (p ≈ 0.05); the strong signal is the all level coherence.
Grouping amino acids by the middle base separates them by hydropathy far more than by the first base (F = 59 versus F = 2): the Woese correlation, shown on the level axis by the center reading. The level order itself is set by the bits, U = 00 to G = 11 and not by any global properties; a hydropathy ranking would put A opposite U, not G.
The coordinate scaffold shows where the code is irregular: serine and arginine each occupy two disjoint boxes, and the three stops form a connected cube corner that no linearization keeps contiguous.
The entire 4×4×4 cube reduces to a single mathematical object: a 2×2 Gray code iterated three times. Two molecular properties, ring structure (pyrimidine/purine) and pairing identity (UA/CG) define two binary orthogonal axes that place each nucleotide at a unique coordinate.
Traversing the square changes exactly one bit per step: a Gray code. Within chemical families (U↔C, A↔G), single-bit adjacency provides error tolerance. Crossing the C|A boundary flips both bits: an irreversible two-bit regime transition. This template, applied at each of three codon positions with the weights 4×16×1 generates the complete 64-state space. No new mathematics emerges at scale.
Every codon is a six-bit word: two bits per position, three positions, weights 4×16×1. The full 8×8 lattice, V = 8×row, H = column. No properties. Pure Combinatorics.
| V \ H | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|---|
| 56 | 011100 | 011110 | 011101 | 011111 | 111100 | 111110 | 111101 | 111111 |
| 48 | 001100 | 001110 | 001101 | 001111 | 101100 | 101110 | 101101 | 101111 |
| 40 | 010100 | 010110 | 010101 | 010111 | 110100 | 110110 | 110101 | 110111 |
| 32 | 000100 | 000110 | 000101 | 000111 | 100100 | 100110 | 100101 | 100111 |
| 24 | 011000 | 011010 | 011001 | 011011 | 111000 | 111010 | 111001 | 111011 |
| 16 | 001000 | 001010 | 001001 | 001011 | 101000 | 101010 | 101001 | 101011 |
| 8 | 010000 | 010010 | 010001 | 010011 | 110000 | 110010 | 110001 | 110011 |
| 0 | 000000 | 000010 | 000001 | 000011 | 100000 | 100010 | 100001 | 100011 |
U (00) 1 2 3 4 6
C (10) 4 4 4 4
A (01) 2 2 2 2 2 2 2 2
G (11) 1 1 2 2 4 6
The earlier preprints overstated the result as unconditional uniqueness. The consolidated paper tightens it to conditional uniqueness: given the code's degeneracy pattern and the physical ground state, the linearization is unique. It supersedes the four-preprint series and folds their content into a single statement.
The 144 frameworks, filtered by the requirement that synonymous families stay contiguous. Two survive: UCAG [4,16,1] and its reverse reading GACU [4,16,1]. Fixing the reading direction by the ground state leaves UCAG [4,16,1] unique. The integer coordinate and center-dominant weighting are prior art (Sánchez 2005).
Structural consequences once UCAG [4,16,1] is fixed: the level structure, the divergence geometry, the graded mutation-step magnitudes. The ordering tracks wobble decoding geometry. These describe the chosen coordinate; they do not show the assignment was forced.
The 64-state address space is complete: positional properties give each codon a unique signature. This is a property of the addressing, and places no constraint on the degeneracy pattern or the assignment.
The general algebra fixes the weight set and the block-carrying position. The stronger claim, that this position is forced to the center for odd n, does not hold without a layout assumption, which is why the center placement is sourced here from the degeneracy pattern instead.
Prior art. The integer coordinate and center-dominant weighting are Sánchez, Morgado and Grau (2005); the six-bit hypercube is Jiménez-Montaño and colleagues (1996); a separate uniqueness theorem for the assignment is Zamudio and José (2017). What this work adds is the substrate-based base order and the bounded-line serialization criterion, with leucine the deciding family.
Four molecular states (U, C, A, G) require binary representation in quaternary base
Three-position reading creates 64 unique addresses requiring 4×4×4 organization
Middle-base weight 16 is selected by the code's degeneracy; the wobble position takes the unit weight
Wobble sits at the unit-weight third position; the U, C, A, G order keeps each ring class contiguous.
One consolidated paper stating the conditional-uniqueness result. Interactive viewer and a coordinate calculator below.
The single consolidated statement. Each codon gets an integer place-value address from the two physical bits of each base. Of 144 frameworks, the ground state UUU = 0 leaves UCAG 4×16×1 unique, given the degeneracy. Supersedes the earlier four preprints.
Access on Zenodo InteractiveExplore the 4×4×4 cube. Rotate, zoom, and examine how 64 codons organize in three-dimensional space with domain segregation. Codon and Amino acid visualizations are provided.
Launch Viewer AnalysisCalculate coordinate addresses and distances for codons and variants. Batch processing supported. The coordinate is descriptive, not a predictor of pathogenicity.
Open Tool