vST for Protein Language Models#
Example: 1024D Embedding Projection for Residue‑Level Interpretation#
This example demonstrates how a Protein Language Model (PLM) produces a 1024D residue embedding during inference and how that embedding is projected into the triadic dimensional cores (9D → 6D → 3D). The walkthrough illustrates primitive‑level structure, regime behavior, projection stability, and vST validation.
The goal is to provide a reproducible, invariant‑preserving demonstration of high‑dimensional embedding projection.
1. Input Overview#
For this example, we assume:
- a transformer‑based PLM with ≥1024D hidden states
- a single residue embedding extracted from a mid‑sequence position
- access to embeddings across multiple layers
- stable or transitional regime behavior
- invertible projection into 3D–9D cores
The example is model‑agnostic and applies to any PLM architecture.
2. Step 1 — Extract the 1024D Residue Embedding#
During inference, the PLM produces a 1024D embedding for each residue:
[ e_r^{(1024)} = [x_1, x_2, \dots, x_{1024}] ]
Observed Properties#
- variance concentrated in 4–6 coherence bands
- stable DP/TDP structure
- smooth transitions across layers
- identifiable coherence surfaces
Interpretation#
The 1024D embedding encodes biochemical, structural, and contextual information for the residue.
3. Step 2 — Identify High‑Dimensional Regime Behavior#
Using variance distribution, coherence‑surface continuity, and primitive‑level stability, classify the embedding’s regime across layers.
Example Regime Pattern#
- Layers 1–6: R₁ᴴ (stable)
- Layers 7–14: R₂ᴴ (transitional)
- Layers 15–20: R₁ᴴ (return to stability)
- Layers 21–24: R₂ᴴ (branching)
- Layers 25–32: mild R₃ᴴ (dispersion onset)
Interpretation#
The residue begins in a stable region, undergoes controlled reorientation, stabilizes again, and finally enters mild dispersion in deeper layers.
4. Step 3 — Project 1024D → 9D (Coherence Projection)#
Project the 1024D embedding into the 9D coherence core.
Preserves#
- regime identity
- resonance‑time behavior
- primitive‑level structure (DP, TDP, SP, CP)
- coherence‑surface continuity
Reveals#
- branching behavior in R₂ᴴ
- curvature of coherence surfaces
- dispersion onset in R₃ᴴ
Interpretation#
The 9D projection exposes the residue’s high‑dimensional “coherence shape.”
5. Step 4 — Project 9D → 6D (Interaction Projection)#
Compress the 9D coherence vector into the 6D interaction core.
Preserves#
- relational geometry
- interaction‑level structure
- regime‑transition indicators
Reveals#
- attention‑driven reorientation
- context‑dependent biochemical signals
- structural boundary behavior
Interpretation#
The 6D projection highlights how the model integrates residue context.
6. Step 5 — Project 6D → 3D (Structural Projection)#
Reduce the 6D interaction vector into the 3D structural core.
Preserves#
- motif‑level geometry
- backbone‑level continuity
- stable structural invariants
Reveals#
- compact motifs in R₁ᴴ
- oscillatory geometry in R₂ᴴ
- diffuse patterns in R₃ᴴ
Interpretation#
The 3D projection provides the minimal interpretable representation of the residue embedding.
7. Step 6 — Validate with vST Layers#
Apply vST layers (V₁–V₄):
V₁ — Structural Coherence#
- stable motifs in R₁ᴴ
- partial fragmentation in R₃ᴴ
V₂ — Dimensional Continuity#
- smooth projection 1024D → 9D → 6D → 3D
- no scaling discontinuities
V₃ — Regime‑Transition Stability#
- smooth R₁ᴴ → R₂ᴴ transitions
- mild instability entering R₃ᴴ
V₄ — Core Alignment#
- primitive‑aligned projection
- stable mapping across layers
Outcome#
The embedding passes all vST layers with minor warnings in the R₃ᴴ region.
8. Step 7 — Drift Detection#
Evaluate drift using D₁–D₄ categories:
- D₁ Structural Drift: none
- D₂ Dimensional Drift: none
- D₃ Regime Drift: mild (R₃ᴴ onset)
- D₄ Projection Drift: none
Interpretation#
The embedding exhibits expected dispersion in deeper layers but no harmful drift.
9. Summary#
This example demonstrates:
- how a 1024D residue embedding is extracted
- how regime behavior evolves across layers
- how projection reveals coherence and instability
- how vST layers validate structural integrity
- how drift detection identifies dispersion without failure
The 1024D embedding is the canonical substrate for analyzing PLM inference at research‑grade resolution.