A Positional Coordinate for the Standard Genetic Code

The Codons Are Binary. The Meaning Is Matched to Them.

One integer coordinate for all 64 codons, built from the two physical bits of each base. Given the code's degeneracy, it is the unique linearization that keeps each amino acid's codons contiguous. Selected by the degeneracy, not forced by mathematics.

UCAG 4×16×1, unique up to reading direction, fixed by the ground state · Paper on Zenodo

See the Result Read the Paper
Four nucleotide states: A(01)–G(11), U(00)–C(10)

Four states, two bits: ring type and hydrogen-bond count. The two diagonals that flip both bits, U↔G and C↔A, are the transversions. This is the substrate.

From the two bits the substrate follows: the Gray-code adjacencies, the divergence classes, the homonucleotide anchors, and the symmetries. Three positions weighted 4×16×1 give the 64 addresses. All of this is true of the addressing regardless of biology, the same under any labeling.

The substrate is forced once you have four letters from two bits read in triples. What the center-dominant reading adds is the contiguous ordering of the families, and that is selected by the code's degeneracy, not forced by the mathematics.

The Analysis

Require that each amino acid's codons group together, and of the 144 frameworks two survive, related by reading direction. Fixing the direction by the ground state leaves one.

The Requirement

To organize codons systematically, amino acids with multiple codons must have those codons grouped together. Leucine has 6 codons - they must form a block. Methionine has 1 codon - it must separate from Isoleucine's 3 codons.

The Test

144 possible frameworks: 6 ways to weight positions × 24 ways to order nucleotides. Each tested against the grouping requirement.

The Result

Only UCAG ordering with 4×16×1 weighting creates proper grouping. Formula: CA = 4×1st + 16×2nd + 1×3rd. Everything else scatters amino acids across the coordinate space.

What survives

Weighting: the wobble position takes the unit weight, removing four of six weightings; leucine's six codons decide center over edge among the rest.

Order and direction: two frameworks keep the families contiguous, UCAG 4×16×1 and its reverse reading GACU 4×16×1. They are one coordinate read in opposite directions.

Result: fixing the reading direction by the ground state (UUU = 0) selects UCAG 4×16×1 uniquely, conditional on the degeneracy pattern.

The Order

UCAG 4×16×1 generates a three-dimensional coordinate system organizing all 64 codons as a 4×4×4 cube

The Cube Structure

Every codon maps to coordinates 0-63. Four homo-nucleotide codons (UUU, CCC, AAA, GGG) anchor the main diagonal at positions 0, 21, 42, 63.

Single nucleotide mutations produce predictable coordinate changes: ΔCA = ±1, ±4, or ±16 single-digit changes in quaternary notation.

19 of 20 amino acids have all codons within a single plane (same middle nucleotide). Natural boundaries at 10/11, 31/32, and 52/53 create four functional domains.

A-Level (CA 32-47) and G-Level (CA 48-63) showing Chemistry and Adaptation domains U-Level (CA 0-15) and C-Level (CA 16-31) showing Foundation and Control domains

The coordinate is descriptive, not predictive. Distances like ΔCA largely re-express the known dominance of second-position substitutions, and the clinical correlations are confounded; no predictive use is claimed.

Self-Referential Coordinates

Each codon's middle nucleotide sets its own reference. Flanking positions are measured against it without external frame required. Local order.

Domain Formation

The polar, charged and reactive residues and all three stops concentrate in CA 32-52. Under a degeneracy-preserving null this window is modest (p ≈ 0.05); the strong signal is the all level coherence.

Level Coherence

Grouping amino acids by the middle base separates them by hydropathy far more than by the first base (F = 59 versus F = 2): the Woese correlation, shown on the level axis by the center reading. The level order itself is set by the bits, U = 00 to G = 11 and not by any global properties; a hydropathy ranking would put A opposite U, not G.

Faithful Where Irregular

The coordinate scaffold shows where the code is irregular: serine and arginine each occupy two disjoint boxes, and the three stops form a connected cube corner that no linearization keeps contiguous.

RNAcube showing all 64 codons organized by coordinate address
Codons by coordinate address
RNAcube showing amino acid assignments across the cube
Amino acid assignments

3D interactive visualization

2×2 Gray code template showing binary encoding of four nucleotides by ring structure and pairing identity
The 2×2 minimal unit

The Generating Template

The entire 4×4×4 cube reduces to a single mathematical object: a 2×2 Gray code iterated three times. Two molecular properties, ring structure (pyrimidine/purine) and pairing identity (UA/CG) define two binary orthogonal axes that place each nucleotide at a unique coordinate.

A (01) purine +UA, V operates
G (11) purine +CG, both active
U (00) pyrim. +UA, identity state
C (10) pyrim. +CG, H operates

Traversing the square changes exactly one bit per step: a Gray code. Within chemical families (U↔C, A↔G), single-bit adjacency provides error tolerance. Crossing the C|A boundary flips both bits: an irreversible two-bit regime transition. This template, applied at each of three codon positions with the weights 4×16×1 generates the complete 64-state space. No new mathematics emerges at scale.

The 64 Binary Addresses

Every codon is a six-bit word: two bits per position, three positions, weights 4×16×1. The full 8×8 lattice, V = 8×row, H = column. No properties. Pure Combinatorics.

V \ H 01234567
56 011100011110011101011111 111100111110111101111111
48 001100001110001101001111 101100101110101101101111
40 010100010110010101010111 110100110110110101110111
32 000100000110000101000111 100100100110100101100111
24 011000011010011001011011 111000111010111001111011
16 001000001010001001001011 101000101010101001101011
8 010000010010010001010011 110000110010110001110011
0 000000000010000001000011 100000100010100001100011

Degeneracy per level

U (00)  1  2  3  4  6

C (10)  4  4  4  4

A (01)  2  2  2  2  2  2  2  2

G (11)  1  1  2  2  4  6

How the claim has developed

The earlier preprints overstated the result as unconditional uniqueness. The consolidated paper tightens it to conditional uniqueness: given the code's degeneracy pattern and the physical ground state, the linearization is unique. It supersedes the four-preprint series and folds their content into a single statement.

From overclaim to conditional uniqueness

The 144 frameworks, filtered by the requirement that synonymous families stay contiguous. Two survive: UCAG [4,16,1] and its reverse reading GACU [4,16,1]. Fixing the reading direction by the ground state leaves UCAG [4,16,1] unique. The integer coordinate and center-dominant weighting are prior art (Sánchez 2005).

Structural consequences once UCAG [4,16,1] is fixed: the level structure, the divergence geometry, the graded mutation-step magnitudes. The ordering tracks wobble decoding geometry. These describe the chosen coordinate; they do not show the assignment was forced.

The 64-state address space is complete: positional properties give each codon a unique signature. This is a property of the addressing, and places no constraint on the degeneracy pattern or the assignment.

The general algebra fixes the weight set and the block-carrying position. The stronger claim, that this position is forced to the center for odd n, does not hold without a layout assumption, which is why the center placement is sourced here from the degeneracy pattern instead.

Prior art. The integer coordinate and center-dominant weighting are Sánchez, Morgado and Grau (2005); the six-bit hypercube is Jiménez-Montaño and colleagues (1996); a separate uniqueness theorem for the assignment is Zamudio and José (2017). What this work adds is the substrate-based base order and the bounded-line serialization criterion, with leucine the deciding family.

AXIOM 1

Binary Encoding

Four molecular states (U, C, A, G) require binary representation in quaternary base

AXIOM 2

Triplet Structure

Three-position reading creates 64 unique addresses requiring 4×4×4 organization

AXIOM 3

Positional Asymmetry

Middle-base weight 16 is selected by the code's degeneracy; the wobble position takes the unit weight

AXIOM 4

Single-Feature Adjacency

Wobble sits at the unit-weight third position; the U, C, A, G order keeps each ring class contiguous.

Research & Tools

One consolidated paper stating the conditional-uniqueness result. Interactive viewer and a coordinate calculator below.

Paper · Zenodo

Serializing the genetic code: a place-value coordinate from physical bits

The single consolidated statement. Each codon gets an integer place-value address from the two physical bits of each base. Of 144 frameworks, the ground state UUU = 0 leaves UCAG 4×16×1 unique, given the degeneracy. Supersedes the earlier four preprints.

Access on Zenodo
Interactive

3D Visualization

Explore the 4×4×4 cube. Rotate, zoom, and examine how 64 codons organize in three-dimensional space with domain segregation. Codon and Amino acid visualizations are provided.

Launch Viewer
Analysis

Coordinate Calculator

Calculate coordinate addresses and distances for codons and variants. Batch processing supported. The coordinate is descriptive, not a predictor of pathogenicity.

Open Tool