LMRuntime.com / Terminology
Glossary
Plain-language definitions for the runtime, evidence, and governance terms used across LMRuntime.com.
Definitions for the package, model-container, tokenization, tensor, execution, sampling, session, and evidence terms used throughout LMRuntime.com.
- Local-first
- Model files, prompts, session state, and inference stay on the operator-controlled machine in the documented runtime path. The package does not silently fall back to a provider API.Back to top ↑
- GGUF
- A binary model-container format carrying metadata, tokenizer data, tensor descriptors, and tensor bytes. The runtime treats a GGUF artifact as untrusted input until bounded parsing and validation succeed.Back to top ↑
- Artifact identity
- The immutable evidence used to identify a model or associated file, including its canonical path policy, SHA-256 digest, and exact byte count.Back to top ↑
- Allowed root
- A canonical directory boundary under which model and associated-artifact paths must resolve. It is used to reject path escape and, when configured, reparse-point traversal.Back to top ↑
- Tensor shape
- The ordered logical dimensions of a tensor together with checked element-count semantics. Shape is distinct from storage type, byte layout, and allocation ownership.Back to top ↑
- Tensor storage type
- The physical representation used for tensor values, such as floating-point, integer, or a block-quantized GGML layout. Representation does not imply every kernel supports that type.Back to top ↑
- Quantized block
- A fixed group of logical elements encoded with a storage-specific byte layout, scales, and quantized values. Correct block geometry is required before dequantization or dot products.Back to top ↑
- Mapped tensor
- A tensor view backed by a mapped model file rather than a fully materialized managed array. The view must not outlive the object that owns the mapping.Back to top ↑
- Tokenizer
- The model-coupled component that encodes text to token IDs and decodes IDs to bytes or text using vocabulary, merges, pre-tokenizer rules, and special-token metadata.Back to top ↑
- Special token
- A model-defined control token such as beginning-of-sequence, end-of-sequence, padding, or chat-role marker. Adding, parsing, removing, or rendering it is an explicit option.Back to top ↑
- Chat template
- A model-specific rule for converting structured messages into the exact prompt text expected by the tokenizer and model. It is not interchangeable across arbitrary models.Back to top ↑
- Tensor binding
- The process of matching semantic model roles—such as token embeddings or attention weights—to compatible GGUF tensor descriptors and storage readers.Back to top ↑
- Reference path
- A deliberately legible managed implementation used as a correctness anchor for the operations and model fixtures it covers.Back to top ↑
- Parity
- Evidence that two implementations produce matching structural or numerical results for named cases under exact rules or declared tolerances.Back to top ↑
- Logits
- The model output scores over the vocabulary before token selection. Sampling transforms and selection consume logits; they do not compute the model forward pass.Back to top ↑
- Greedy sampling
- Deterministic token selection that chooses the highest-ranked eligible logit according to the package tie behavior.Back to top ↑
- Probability sampling
- Token selection after explicit temperature, top-k, top-p, minimum-p, repetition, frequency, presence, or bias processing using caller-owned sampling state.Back to top ↑
- Sampling state
- The random-generator and token-history state used by probability sampling. Reusing or resetting it changes the generation process.Back to top ↑
- KV cache
- Per-session key and value state retained by transformer attention so continuation tokens can reuse earlier sequence computations.Back to top ↑
- Prefill
- The phase that commits prompt tokens and builds session state before iterative next-token decoding.Back to top ↑
- Decode step
- One model evaluation and token-selection cycle that advances a session by a committed token.Back to top ↑
- Session isolation
- The rule that independent conversations own separate position, KV cache, logits, generation history, and context evidence rather than a process-global active state.Back to top ↑
- UAIX context evidence
- Display-safe, already validated profile/load-session and memory-route metadata retained with a local session. It is evidence only and grants no execution, command, network, provider, telemetry, export, or policy authority.Back to top ↑
- Host-owned responsibility
- Behavior intentionally outside the runtime package, including model acquisition, prompt assembly, profile import, persistence, transport, UI, commands, network access, providers, audit, and user approval.Back to top ↑
- Evidence-bounded claim
- A statement limited to what a named package artifact, fixture, managed run, parity record, model identity, or reproducible benchmark actually demonstrates.Back to top ↑
- Release gate
- A required check such as clean consumption, tests, model parity, security review, licensing review, or deployment validation before a consuming application makes a stronger support claim.Back to top ↑
- No-op decision
- A structured decision to make no change or take no external action when evidence, authority, budget, or review state is insufficient.Back to top ↑
- Claim boundary
- A policy that prevents language from exceeding evidence—for example, unsupported universal compatibility, benchmark leadership, certification, or authority claims.Back to top ↑
