LMRuntime.com / Generated reference

UAIX.LmRuntime API Reference

Search the documented public types and members generated from public package XML documentation.

A searchable reference generated from public package XML documentation: 376 documented types and 2,044 documented members across the package family.

Reference provenance Type and member summaries come from public package XML documentation. Package version numbers are intentionally excluded. The NuGet package pages remain authoritative for current frameworks, dependency versions, package hashes, and release metadata.

UAIX.LmRuntime.LocalEndpoint

23 documented types and 163 documented members.

LocalGgufRuntimeScopeUAIX.LmRuntime.LocalEndpoint 2 members

Identifies one caller-owned LocalEndpoint application scope and optional legacy UAIX package correlation label.

The values are opaque compatibility labels. They do not identify the active profile or per-application-instance load session; that evidence is supplied to . The runtime never opens, interprets, persists, or synchronizes UAIX content.

Property ApplicationInstanceId

Gets the LocalEndpoint application instance identifier.

Property UaixPackageId

Gets an optional caller-owned legacy UAIX package correlation identifier.

LocalGgufRuntimeCapabilitiesUAIX.LmRuntime.LocalEndpoint 17 members

Declares the fixed local-only capabilities of the LocalEndpoint integration facade.

Property SupportsExplicitFileVerification

Gets a value indicating whether callers can explicitly re-verify current model-file bytes before reuse.

Property SupportsAssociatedArtifactVerification

Gets a value indicating whether callers can bind reviewed companion artifacts to path-free evidence.

Property UsesAssociatedArtifactsForGeneration

Gets a value indicating whether associated artifacts alter managed token generation.

Property SupportsSynchronousTokenObservation

Gets a value indicating whether caller-owned synchronous token observation is supported.

Property UsesSubprocesses

Gets a value indicating whether the integration starts subprocesses.

Property UsesNetworkAccess

Gets a value indicating whether the integration performs network access.

Property PersistsRuntimeContent

Gets a value indicating whether the integration persists prompts, generated text, or session state.

Property IsManagedExecution

Gets a value indicating whether model execution is implemented by managed project code.

Property UsesNativeInference

Gets a value indicating whether model execution delegates to a native inference library.

Property UsesGpuAcceleration

Gets a value indicating whether the integration uses GPU or CUDA execution.

Property UsesTelemetry

Gets a value indicating whether the integration emits telemetry.

Property HostsServer

Gets a value indicating whether the integration hosts a server or public listener.

Property SupportsUaixRuntimeContextValidation

Gets a value indicating whether the facade validates LocalEndpoint-supplied UAIX runtime context metadata.

Property UsesGlobalUaixProfile

Gets a value indicating whether the runtime assumes one process-global active UAIX profile.

Property ParsesUaixPackages

Gets a value indicating whether the runtime opens or parses .uaix package containers.

Property UaixMemoryGrantsAuthority

Gets a value indicating whether UAIX memory metadata can grant runtime or external authority.

Property EmitsWorkerJsonlEvents

Gets a value indicating whether this managed facade emits LocalEndpoint worker JSONL events.

LocalGgufExecutionLimitsUAIX.LmRuntime.LocalEndpoint 3 members

Defines caller-visible resource ceilings for one verified model owner.

The limits are enforced before large generation buffers are allocated. They supplement, but do not replace, the model context-window checks performed by the underlying deterministic runtime.

Property MaximumPromptCharacters

Gets the maximum accepted prepared-prompt length in .NET UTF-16 code units.

Property MaximumGeneratedTokens

Gets the maximum generated-token buffer length permitted for one operation.

Property MaximumStopTokenCount

Gets the maximum number of caller-defined stop-token identifiers permitted for one operation.

LocalGgufModelLoadOptionsUAIX.LmRuntime.LocalEndpoint 7 members

Configures loading of one trusted local GGUF artifact through the LocalEndpoint-facing managed runtime surface.

The options govern local file validation and bounded managed execution only. They do not enable downloading, remote inference, process launch, telemetry, server hosting, or persistent runtime state.

Property MaximumModelBytes

Gets the maximum accepted model file length in bytes.

Property RejectReparsePoints

Gets a value indicating whether symbolic links and reparse points are rejected before the file is opened.

Property AllowedRootDirectory

Gets the optional normalized directory that must contain the resolved GGUF file.

Property MaximumReferenceMaterializationBytes

Gets the maximum number of bytes that compatibility-only float32 materialization may allocate.

Property ParseOptions

Gets GGUF parser safety limits.

Property BindingOptions

Gets semantic LLaMA tensor-binding validation controls.

Property ExecutionLimits

Gets the prompt, generated-token, and stop-token ceilings enforced by sessions created from the model.

LocalGgufModelMetadataUAIX.LmRuntime.LocalEndpoint 11 members

Describes the verified model and execution metadata exposed to the LocalEndpoint worker adapter.

Property ModelSha256

Gets the lowercase SHA-256 digest verified before model loading.

Property ModelByteCount

Gets the verified model file length in bytes.

Property AssociatedArtifacts

Gets the verified path-free identities of associated model artifacts supplied by LocalEndpoint.

Property GgufVersion

Gets the GGUF container version.

Property Architecture

Gets the model architecture identifier.

Property ModelName

Gets the optional model display name from GGUF metadata.

Property Tokenizer

Gets the exact tokenizer implementation name selected from GGUF metadata.

Property ContextLength

Gets the configured context length in tokens.

Property VocabularySize

Gets the configured vocabulary size.

Property BoundTensorCount

Gets the number of tensors accepted by semantic binding.

Property StorageSummary

Gets the mapped storage summary used by the managed reference runtime.

LocalGgufFileExpectationUAIX.LmRuntime.LocalEndpoint 2 members

Declares the path-free model-file evidence that LocalEndpoint requires before loading or reusing a model.

The contract contains only a SHA-256 digest and byte count. It intentionally excludes the private model path, model bytes, prompt content, generated content, registry state, audit state, and UAIX package data. Runtime entry points validate both values against current file bytes before model parsing or cached-model reuse.

Property ModelSha256

Gets the required SHA-256 digest for the current local model file.

Property ModelByteCount

Gets the required current model file length in bytes.

LocalGgufFileIdentityUAIX.LmRuntime.LocalEndpoint 2 members

Represents the verified content identity of one current local GGUF file.

The result contains only a digest and byte count. It deliberately omits the local path, model bytes, prompt content, generated content, UAIX state, and caller policy so it can be copied into LocalEndpoint evidence without expanding the runtime's persistence or trust responsibilities.

Property ModelSha256

Gets the normalized lowercase SHA-256 digest verified from current file bytes.

Property ModelByteCount

Gets the verified current file length in bytes.

LocalGgufAssociatedArtifactLoadInputUAIX.LmRuntime.LocalEndpoint 5 members

Supplies one LocalEndpoint-reviewed associated artifact for verification by the managed runtime facade.

The private path is accepted only as an input needed to verify current local bytes. The returned verification identity omits the path and contains only display-safe role, file name, SHA-256, and byte-count evidence.

Property ArtifactPath

Gets the private direct-local path to the associated artifact file that LocalEndpoint already reviewed.

Property Role

Gets the caller-owned role label for the associated artifact.

Property FileName

Gets the reviewed leaf file name expected at .

Property ArtifactSha256

Gets the required SHA-256 digest for the current associated artifact file.

Property ArtifactByteCount

Gets the required current associated artifact file length in bytes.

LocalGgufAssociatedArtifactIdentityUAIX.LmRuntime.LocalEndpoint 4 members

Represents the path-free verified identity of one associated model artifact.

The identity is constructed only after the runtime verifies the current associated artifact file under the same local path, containment, reparse-point, stable-length, byte-count, and SHA-256 rules used for primary model admission. It deliberately omits private paths, file bytes, prompt content, generated content, UAIX content, registry state, and audit state.

Property Role

Gets the caller-owned artifact role label copied into path-free evidence.

Property FileName

Gets the reviewed associated-artifact leaf file name.

Property ArtifactSha256

Gets the normalized lowercase SHA-256 digest verified from current associated-artifact bytes.

Property ArtifactByteCount

Gets the verified current associated-artifact file length in bytes.

LocalGgufTokenizerMetadataUAIX.LmRuntime.LocalEndpoint 9 members

Provides a bounded tokenizer metadata projection for LocalEndpoint runtime readiness and diagnostics.

Property TokenizerModel

Gets the tokenizer family declared by GGUF metadata.

Property PreTokenizer

Gets the optional pre-tokenizer identifier.

Property VocabularySize

Gets the vocabulary size.

Property BosTokenId

Gets the beginning-of-sequence token identifier when defined.

Property EosTokenId

Gets the end-of-sequence token identifier when defined.

Property UnknownTokenId

Gets the unknown-token identifier when defined.

Property AddBos

Gets whether model metadata requests automatic beginning-of-sequence insertion.

Property AddEos

Gets whether model metadata requests automatic end-of-sequence insertion.

Property ChatTemplate

Gets the optional chat template declared by the model.

LocalGgufGenerationRequestUAIX.LmRuntime.LocalEndpoint 12 members

Defines one bounded deterministic greedy-generation request.

The request contains only in-memory generation controls. It does not define persistence, network, provider, process, server, telemetry, or model-download behavior.

Property Prompt

Gets the exact prepared prompt to tokenize in memory.

Property MaximumTokens

Gets the positive maximum number of tokens to generate.

Property ResetSession

Gets whether the session is reset before prompt prefill.

Property AddSpecialTokens

Gets whether model-defined special-token insertion is enabled for the prompt.

Property ParseSpecialTokens

Gets whether special-token text embedded in the prompt is parsed as a special token.

Property EmitTokenizerTrace

Gets whether tokenizer trace entries are captured for the prompt.

Property InvalidUtf16Policy

Gets the invalid UTF-16 handling policy used during tokenization.

Property EndOfSequenceTokenId

Gets the optional explicit end-of-sequence token identifier; when omitted, model metadata is used.

Property StopTokenIds

Gets additional token identifiers that terminate generation after being emitted.

Property RemoveSpecialTokens

Gets whether special tokens are removed from decoded generated text.

Property UnparseSpecialTokens

Gets whether special tokens are emitted as their raw token text when they are not removed.

Property CleanSpaces

Gets whether tokenizer-defined spacing cleanup is applied during detokenization.

LocalGgufGeneratedTokenUAIX.LmRuntime.LocalEndpoint 4 members

Describes one generated token selected by the managed runtime for caller-owned streaming.

The value intentionally excludes prompt text, decoded output, model paths, UAIX content, registry state, audit state, and transport details. LocalEndpoint remains responsible for detokenization policy, worker-event mapping, terminal-event policy, and persistence boundaries.

Method LocalGgufGeneratedToken(int,int,float)

Initializes a new LocalGgufGeneratedToken instance with validated dependencies and operational bounds.

sequence
The zero-based selection sequence within the current generation operation.
tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
selectedLogit
The deterministic argmax logit associated with the selected token. NaN is rejected; positive and negative infinity are retained because the lower-level greedy sampler compares infinities deterministically and the runtime verifies the exact observed value against the completed generation result.
Property Sequence

Gets the zero-based selection sequence.

Property TokenId

Gets the selected model vocabulary identifier.

Property SelectedLogit

Gets the deterministic argmax logit associated with the selected token.

LocalGgufGenerationResultUAIX.LmRuntime.LocalEndpoint 6 members

Represents deterministic generated token identifiers and their decoded text.

Property PromptTokenCount

Gets the number of prompt tokens evaluated for the operation.

Property GeneratedTokenIds

Gets the generated token identifiers in selection order.

Property GeneratedText

Gets the decoded generated text.

Property StopReason

Gets the deterministic generation stop reason.

Property Position

Gets the next sequence position retained by the isolated session.

Property FinalSelectedLogit

Gets the selected logit of the final generated token, or negative infinity when no token was generated.

LocalGgufVerificationExceptionUAIX.LmRuntime.LocalEndpoint 5 members

Represents a local GGUF artifact verification failure.

Method LocalGgufVerificationException(string,string,string,System.Exception,System.Nullable<long>,System.Nullable<long>)

Initializes a new LocalGgufVerificationException instance with validated dependencies and operational bounds.

message
The display-safe diagnostic message describing the failure without embedding prompt text, generated text, credentials, or private file contents.
expectedSha256
The expected lowercase or uppercase 64-character SHA-256 digest used to authenticate the complete artifact; a mismatch fails the operation before a trusted result is published.
actualSha256
The observed SHA-256 digest retained as path-free diagnostic evidence when available, or null when hashing did not complete.
innerException
The optional lower-level failure used only to select a bounded, path-free diagnostic category. The original exception object, message, stack trace, data, and file-name properties are never retained because they can contain private model paths or other caller-owned content.
expectedByteCount
The required file length when byte-count evidence is supplied.
actualByteCount
The actual byte count used to bound this operation; it must be nonnegative and within the supported range.
Property ExpectedSha256

Gets the normalized expected SHA-256 value when available.

Property ActualSha256

Gets the normalized observed SHA-256 value when available.

Property ExpectedByteCount

Gets the required model byte count when supplied by caller-owned evidence.

Property ActualByteCount

Gets the observed current model byte count when available.

LocalGgufModelUAIX.LmRuntime.LocalEndpoint 11 members

Owns one verified mapped GGUF model and creates isolated deterministic inference sessions.

Property Scope

Gets the opaque application scope and optional legacy package label associated with this model owner.

Property Metadata

Gets verified model and runtime metadata.

Property TokenizerMetadata

Gets the bounded tokenizer metadata projection.

Property ExecutionLimits

Gets the immutable prompt and generation ceilings copied when this model was loaded.

Property ActiveSessionCount

Gets the number of caller-owned sessions that have not yet been disposed.

Property IsDisposed

Gets a value indicating whether this model owner has been disposed.

Method Tokenize(string,UAIX.LmRuntime.Tokenization.TokenizationOptions)

Tokenizes text using exact metadata loaded from the verified GGUF artifact.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
options
The optional TokenizationOptions controlling Tokenize; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The exact integer token identifiers and optional trace.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.MetadataDrivenGgufTokenizerDetokenizationOptions)

Decodes token identifiers using the tokenizer metadata from the verified GGUF artifact.

tokenIds
The token identifiers to decode in sequence order.
options
The optional MetadataDrivenGgufTokenizerDetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CreateSession

Rejects creation of a LocalEndpoint inference session when required UAIX load-session context is absent.

Returns: No value is returned; this compatibility member always throws to require an explicit session-scoped UAIX context.

Method CreateSession(UAIX.LmRuntime.LocalEndpoint.LocalGgufSessionContext)

Creates a new inference session with independent sequence state and validated UAIX load-session evidence.

sessionContext
The LocalEndpoint worker session identifier and display-safe UAIX context.

Returns: An isolated deterministic session owned by the caller.

Method Dispose

Releases the memory-mapped model after all child sessions have been disposed.

LocalGgufRuntimeUAIX.LmRuntime.LocalEndpoint 11 members

Loads verified local GGUF files into isolated pure-managed LLaMA runtime models.

This type performs local file validation and direct managed model loading only. It contains no subprocess, network, provider, telemetry, server, downloader, command-dispatch, or persistent-profile behavior. The type holds no static mutable model, session, profile, or wiki state. Separate application instances can therefore load independent models and bind each inference session to a distinct LocalEndpoint-owned UAIX load session.

Method LocalGgufRuntime

Initializes an unscoped model facade for compatibility with callers that do not require application labels.

Method LocalGgufRuntime(UAIX.LmRuntime.LocalEndpoint.LocalGgufRuntimeScope)

Initializes a runtime facade for one LocalEndpoint application instance and optional legacy package label.

scope
The immutable runtime scope whose name and instance identifier are validated once and then retained as path-free correlation evidence for model verification operations.
Property Scope

Gets the opaque caller-owned runtime scope.

Property Capabilities

Gets the fixed local-only capability declaration.

Method VerifyUaixRuntimeContext(UAIX.LmRuntime.LocalEndpoint.LocalUaixRuntimeContext)

Validates LocalEndpoint-supplied uaixRuntimeContext metadata without opening package or wiki files.

context
The display-safe profile, load-session, and long-term memory routing metadata.

Returns: An immutable path-relative evidence projection with a canonical context SHA-256.

Method VerifyLocalModelFile(string,string,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Verifies the current bytes and file-policy boundaries of one local GGUF artifact without loading a model.

modelPath
The local file-system model path processed by this operation; it must satisfy the containing component's path and scope policy.
expectedSha256
The expected lowercase or uppercase 64-character SHA-256 digest used to authenticate the complete artifact; a mismatch fails the operation before a trusted result is published.
options
The optional LocalGgufModelLoadOptions controlling VerifyLocalModelFile; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: A path-free identity containing the verified digest and current byte count.

Method VerifyAssociatedArtifacts(System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.LocalEndpoint.LocalGgufAssociatedArtifactLoadInput>,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Verifies LocalEndpoint-reviewed associated artifacts without exposing private local paths.

associatedArtifacts
The LocalEndpoint-reviewed associated artifact load inputs to verify before model publication or worker execution proceeds.
options
Optional bounded parser, binding, containment, and file-validation controls shared with primary model verification.

Returns: Path-free associated-artifact identities sorted by role, file name, and SHA-256 digest.

Method VerifyLocalModelFile(string,UAIX.LmRuntime.LocalEndpoint.LocalGgufFileExpectation,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Verifies current local model bytes against caller-owned digest and byte-count evidence without loading a model.

modelPath
The local file-system model path processed by this operation; it must satisfy the containing component's path and scope policy.
expectation
The path-free digest and byte-count evidence required by LocalEndpoint.
options
The optional LocalGgufModelLoadOptions controlling VerifyLocalModelFile; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: A path-free identity containing the verified digest and current byte count.

Method LoadVerifiedModel(string,string,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Loads a local GGUF model only after its exact SHA-256 digest and file-policy constraints are verified.

modelPath
The local file-system model path processed by this operation; it must satisfy the containing component's path and scope policy.
expectedSha256
The expected lowercase or uppercase 64-character SHA-256 digest used to authenticate the complete artifact; a mismatch fails the operation before a trusted result is published.
options
Optional bounded parser, binding, containment, and file-validation controls.

Returns: An owned mapped model that must be disposed after all child sessions are disposed.

Method LoadVerifiedModel(string,UAIX.LmRuntime.LocalEndpoint.LocalGgufFileExpectation,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Loads a local GGUF model after binding current file bytes to caller-owned digest and byte-count evidence.

modelPath
The local file-system model path processed by this operation; it must satisfy the containing component's path and scope policy.
expectation
The path-free digest and byte-count evidence required by LocalEndpoint.
options
Optional bounded parser, binding, containment, and file-validation controls.

Returns: An owned mapped model that must be disposed after all child sessions are disposed.

Method LoadVerifiedModel(string,UAIX.LmRuntime.LocalEndpoint.LocalGgufFileExpectation,System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.LocalEndpoint.LocalGgufAssociatedArtifactLoadInput>,UAIX.LmRuntime.LocalEndpoint.LocalGgufModelLoadOptions)

Loads a local GGUF model after binding the primary artifact and associated artifacts to caller-owned evidence.

modelPath
The local file-system model path processed by this operation; it must satisfy the containing component's path and scope policy.
expectation
The path-free digest and byte-count evidence required by LocalEndpoint for the primary GGUF artifact.
associatedArtifacts
The LocalEndpoint-reviewed associated artifacts to verify before the model is published.
options
Optional bounded parser, binding, containment, and file-validation controls.

Returns: An owned mapped model that must be disposed after all child sessions are disposed.

LocalGgufSessionUAIX.LmRuntime.LocalEndpoint 8 members

Owns isolated deterministic sequence and key/value-cache state for one verified local model.

A session serializes generation, reset, and disposal transitions through one lifecycle authority. It stores no prompt, generated text, audit record, registry record, UAIX content, provider state, or network state.

Property SessionId

Gets the validated LocalEndpoint worker session identifier preserved for this inference session.

Property UaixRuntimeContextEvidence

Gets immutable, path-relative evidence for the profile and UAIX load session bound to this inference session.

Property Position

Gets the next sequence position retained by this isolated session.

Property IsDisposed

Gets a value indicating whether this session has been disposed.

Method GenerateGreedy(UAIX.LmRuntime.LocalEndpoint.LocalGgufGenerationRequest,System.Threading.CancellationToken)

Runs bounded deterministic greedy generation and returns exact token identifiers with decoded text.

request
The LocalGgufGenerationRequest containing the complete caller-owned inputs for GenerateGreedy; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
A token observed before work and between committed model steps.

Returns: The generated token identifiers, text, stop reason, position, and final selected logit.

Method GenerateGreedy(UAIX.LmRuntime.LocalEndpoint.LocalGgufGenerationRequest,System.Action<UAIX.LmRuntime.LocalEndpoint.LocalGgufGeneratedToken>,System.Threading.CancellationToken)

Runs bounded deterministic greedy generation and synchronously reports each selected token.

request
The LocalGgufGenerationRequest containing the complete caller-owned inputs for GenerateGreedy; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
tokenObserver
The caller-owned observer invoked once for each selected token.
cancellationToken
A token observed before work and between committed model steps.

Returns: The generated token identifiers, text, stop reason, position, and final selected logit.

Method Reset

Clears sequence position and key/value-cache state retained by this session.

Method Dispose

Releases isolated session state without disposing the shared model owner.

LocalUaixLongTermMemoryModeUAIX.LmRuntime.LocalEndpoint 1 member

Identifies the supported long-term memory routing modes for one validated UAIX load session.

The value describes a caller-owned memory route only. It does not authorize file access, runtime execution, command execution, network access, provider access, telemetry, export, or any other capability.

Field Isolated

Routes the load session to a profile-owned long-term memory root.

LocalUaixRuntimeContextFailureCodeUAIX.LmRuntime.LocalEndpoint 12 members

Identifies the fail-closed reason produced while validating LocalEndpoint UAIX runtime context metadata.

Codes are stable, display-safe boundary identifiers. They do not include prompt text, generated text, private absolute paths, package bytes, wiki document contents, credentials, or unrestricted exception details.

Field MissingContext

The required runtime context object was absent.

Field ProfileNotLoaded

The caller did not identify a loaded UAIX profile.

Field MissingValue

A required identifier or display-safe value was absent.

Field InvalidIdentifier

An identifier was outside the bounded portable identifier grammar.

Field InvalidDisplayValue

A display-safe value contained unsupported control data or exceeded its limit.

Field InvalidRelativePath

A relative path was absolute, traversing, malformed, or outside the supported portable form.

Field ProfilePathMismatch

The profile-relative path did not match the declared profile identifier.

Field SessionPathMismatch

The load-session-relative path did not match the declared load-session identifier.

Field LongTermMemoryPathMismatch

The long-term memory path did not match its declared mode and root identity.

Field UnsupportedLongTermMemoryMode

The long-term memory mode was not one of the explicitly supported values.

Field AuthorityEscalation

One or more UAIX memory fields attempted to grant execution or external authority.

Field InvalidWorkerSession

The worker session identifier was absent or malformed.

LocalUaixRuntimeContextUAIX.LmRuntime.LocalEndpoint 17 members

Represents display-safe UAIX profile and long-term memory routing metadata supplied by LocalEndpoint.

This type mirrors the uaixRuntimeContext object in localendpoint.worker.request.v1. The runtime does not open or parse a .uaix archive, read .uai files, read wiki documents, expand a profile, choose a profile, or infer authority from this object. LocalEndpoint validates and imports packages, assembles the prompt, and supplies this metadata after its own policy gates. Every authority field must remain .

Property LoadedUaixProfilePresent

Gets a value indicating whether LocalEndpoint supplied one validated and loaded UAIX profile.

Property LoadedUaixProfileId

Gets the portable identifier of the loaded UAIX profile.

Property LoadedUaixProfileDisplayName

Gets the display-safe name of the loaded UAIX profile.

Property LoadedUaixLoadSessionId

Gets the LocalEndpoint load-session identifier that bound this profile to one desktop application instance.

Property LoadedUaixUaiRelativePath

Gets the app-local relative path to the expanded profile .uai root.

Property LoadedUaixSessionRelativePath

Gets the app-local relative path to the LocalEndpoint load-session evidence record.

Property LongTermMemoryRootId

Gets the portable identifier of the selected Documents-backed long-term memory root.

Property LongTermMemoryRootRelativePath

Gets the path relative to Documents/LocalEndpoint/Wikis for the selected long-term memory root.

Property LongTermMemoryMode

Gets whether the selected long-term memory root is profile-isolated or deliberately shared.

Property RuntimeExecutionAllowed

Gets a value that must remain false because UAIX memory does not authorize model runtime execution.

Property MemoryCanOverridePolicy

Gets a value that must remain false because memory cannot override LocalEndpoint policy.

Property CommandExecutionAllowed

Gets a value that must remain false because command execution requires a separate user-approved gate.

Property NetworkAccessAllowed

Gets a value that must remain false because UAIX memory does not authorize network access.

Property ProviderApisAllowed

Gets a value that must remain false because UAIX memory does not authorize provider APIs.

Property WebsitePromptIntakeAllowed

Gets a value that must remain false because UAIX memory does not authorize website prompt intake.

Property TelemetryEnabled

Gets a value that must remain false because UAIX memory does not authorize telemetry.

Property AutoExportAllowed

Gets a value that must remain false because UAIX memory does not authorize automatic export.

LocalGgufSessionContextUAIX.LmRuntime.LocalEndpoint 2 members

Represents one worker session identity and its required UAIX boundary metadata.

LocalEndpoint creates this value after package import, profile loading, prompt assembly, and policy validation. The runtime snapshots the context for one isolated session and retains no global active profile.

Property SessionId

Gets the LocalEndpoint worker session identifier that every mapped event must preserve.

Property UaixRuntimeContext

Gets the required display-safe UAIX profile and wiki routing metadata.

LocalUaixRuntimeContextEvidenceUAIX.LmRuntime.LocalEndpoint 10 members

Represents the immutable, path-relative evidence produced after UAIX runtime context validation.

The evidence contains no prompt, generated text, package bytes, wiki document content, model path, credential, command grant, network grant, provider grant, or execution grant. The SHA-256 binds a canonical field sequence for caller-owned registry and audit evidence without turning the context into authority.

Property LoadedUaixProfileId

Gets the validated loaded profile identifier.

Property LoadedUaixProfileDisplayName

Gets the validated display-safe profile name.

Property LoadedUaixLoadSessionId

Gets the validated LocalEndpoint load-session identifier.

Property LoadedUaixUaiRelativePath

Gets the validated app-local profile .uai relative path.

Property LoadedUaixSessionRelativePath

Gets the validated app-local load-session relative path.

Property LongTermMemoryRootId

Gets the validated long-term memory root identifier.

Property LongTermMemoryRootRelativePath

Gets the validated Documents-backed wiki relative path.

Property LongTermMemoryMode

Gets the validated long-term memory routing mode.

Property ContextSha256

Gets the canonical lowercase SHA-256 for this validated context.

Property AuthorityBoundaryClosed

Gets a value indicating that every UAIX authority field was verified false.

LocalUaixRuntimeContextExceptionUAIX.LmRuntime.LocalEndpoint 2 members

Represents a fail-closed UAIX runtime context validation error.

The exception exposes a stable code and field name for bounded diagnostics. It does not include prompt text, generated text, profile file contents, wiki document contents, credentials, or private absolute paths.

Property FailureCode

Gets the stable fail-closed validation code.

Property FieldName

Gets the display-safe contract field associated with the failure.

UAIX.LmRuntime.Models.Llama

102 documented types and 605 documented members.

LlamaReferenceSessionSnapshotUAIX.LmRuntime.Models.Llama 5 members

Captures complete deterministic reference-session state without retaining live model pointers.

Property SchemaVersion

Gets the in-memory snapshot schema version.

Property Position

Gets the next sequence position.

Property TokenHistory

Gets committed input token identifiers in sequence order.

Property LastLogits

Gets the most recently computed logits.

Property KeyValueCache

Gets complete capacity-shaped key/value state.

LlamaSessionPersistenceOptionsUAIX.LmRuntime.Models.Llama 13 members

Configures digest-bound complete session serialization.

Property PackageVersion

Gets the package version that emitted the artifact.

Property MinimumCompatiblePackageVersion

Gets the oldest supported package version.

Property MaximumCompatiblePackageVersion

Gets the newest supported package version.

Property ModelSha256

Gets the complete model artifact SHA-256.

Property ConfigurationFingerprint

Gets the LLaMA configuration fingerprint.

Property TokenizerFingerprint

Gets the GGUF tokenizer fingerprint.

Property CacheLayoutFingerprint

Gets the persistent cache-layout identity.

Property SamplerMode

Gets the deterministic sampler mode.

Property EndOfSequenceTokenId

Gets the optional end-of-sequence token identifier.

Property StopTokenIds

Gets configured stop-token identifiers.

Property GeneratedUtc

Gets the UTC generation time.

Property ClaimStatus

Gets the evidence claim status.

Property MaximumByteCount

Gets the maximum accepted artifact byte count.

LlamaSessionArtifactUAIX.LmRuntime.Models.Llama 15 members

Carries verified complete deterministic session state and compatibility identities.

Property SchemaVersion

Gets the portable schema version.

Property PackageVersion

Gets the package version that emitted the artifact.

Property MinimumCompatiblePackageVersion

Gets the oldest supported package version.

Property MaximumCompatiblePackageVersion

Gets the newest supported package version.

Property ModelSha256

Gets the complete model artifact SHA-256.

Property ConfigurationFingerprint

Gets the model configuration fingerprint.

Property TokenizerFingerprint

Gets the tokenizer fingerprint.

Property CacheLayoutFingerprint

Gets the cache-layout fingerprint.

Property SamplerMode

Gets the sampler mode.

Property EndOfSequenceTokenId

Gets the optional end-of-sequence token identifier.

Property StopTokenIds

Gets configured stop-token identifiers.

Property GeneratedUtc

Gets the artifact generation time in UTC.

Property ClaimStatus

Gets the evidence claim status.

Property ContentSha256

Gets the SHA-256 of every serialized byte preceding the digest.

Property Snapshot

Gets the complete session snapshot.

LlamaSessionArtifactSerializerUAIX.LmRuntime.Models.Llama 5 members

Serializes complete deterministic reference-session state in bounded little-endian form.

Field SchemaVersion

Gets the supported artifact schema version.

Method Serialize(UAIX.LmRuntime.Models.Llama.LlamaReferenceSessionSnapshot,UAIX.LmRuntime.Models.Llama.LlamaSessionPersistenceOptions)

Serializes complete session state and appends a SHA-256 digest.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
options
The optional LlamaSessionPersistenceOptions controlling Serialize; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: A newly allocated byte[] containing the ordered result of LlamaSessionArtifactSerializer.Serialize: Serializes complete session state and appends a SHA-256 digest. The caller owns the returned array and later mutation cannot alter the source object.

Method Deserialize(System.ReadOnlySpan<byte>,int)

Deserializes the llama session artifact from the validated persisted representation.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The LlamaSessionArtifact result produced by LlamaSessionArtifactSerializer.Deserialize for this contract: Deserializes the llama session artifact from the validated persisted representation. It is published only after all documented validation and ownership transitions succeed.

Method Save(string,UAIX.LmRuntime.Models.Llama.LlamaReferenceSessionSnapshot,UAIX.LmRuntime.Models.Llama.LlamaSessionPersistenceOptions)

Writes a complete artifact to a local file.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
options
The optional LlamaSessionPersistenceOptions controlling Save; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The LlamaSessionArtifact result produced by LlamaSessionArtifactSerializer.Save for this contract: Writes a complete artifact to a local file. It is published only after all documented validation and ownership transitions succeed.

Method Load(string,int)

Reads and verifies a complete artifact from a local file.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The verified artifact, with ownership and disposal obligations defined by the returned type and the Load contract.

FixtureVerificationDiagnosticUAIX.LmRuntime.Models.Llama 2 members

Represents one diagnostic emitted while verifying a checked-in GGUF fixture directory.

Property Code

Gets the stable diagnostic code.

Property Message

Gets the diagnostic message.

FixtureVerificationResultUAIX.LmRuntime.Models.Llama 5 members

Represents the result of bounded, offline fixture directory verification.

Property FixtureDirectory

Gets the normalized fixture directory.

Property ArtifactPath

Gets the normalized GGUF artifact path.

Property ArtifactSha256

Gets the verified SHA-256 digest.

Property Diagnostics

Gets verification diagnostics.

Property IsValid

Gets whether no verification diagnostics were emitted.

FixtureDirectoryVerifierUAIX.LmRuntime.Models.Llama 1 member

Verifies fixture manifests, artifact paths, digests, and basic loadability without network access.

Method Verify(string)

Verifies the supplied fixture directory and returns bounded evidence only after every required check succeeds.

fixtureDirectory
The directory containing a fixture manifest and GGUF artifact.

Returns: The FixtureVerificationResult result produced by FixtureDirectoryVerifier.Verify for this contract: Verifies the supplied fixture directory and returns bounded evidence only after every required check succeeds. It is published only after all documented validation and ownership transitions succeed.

LlamaWeightStorageModeUAIX.LmRuntime.Models.Llama 3 members

Identifies how a bound tensor participates in reference execution.

Field Mapped

The tensor remains a borrowed view over the mapped GGUF file.

Field Alias

The tensor aliases another mapped tensor.

Field CopiedForReference

The tensor was explicitly copied into a bounded float32 reference buffer.

LlamaBoundTensorUAIX.LmRuntime.Models.Llama 5 members

Represents one semantic LLaMA weight bound to mapped model storage.

Property Role

Gets the semantic tensor role.

Property BlockIndex

Gets the optional transformer block index.

Property Binding

Gets the validated binding manifest entry.

Property View

Gets the borrowed mapped tensor view.

Property StorageMode

Gets the storage mode represented by this binding.

LlamaBoundLayerWeightSetUAIX.LmRuntime.Models.Llama 10 members

Represents the mapped tensors required by one LLaMA transformer block.

Property BlockIndex

Gets the zero-based transformer block index.

Property AttentionNorm

Gets the attention normalization tensor.

Property AttentionQuery

Gets the query projection tensor.

Property AttentionKey

Gets the key projection tensor.

Property AttentionValue

Gets the value projection tensor.

Property AttentionOutput

Gets the attention output projection tensor.

Property FeedForwardNorm

Gets the feed-forward normalization tensor.

Property FeedForwardGate

Gets the feed-forward gate projection tensor.

Property FeedForwardUp

Gets the feed-forward up projection tensor.

Property FeedForwardDown

Gets the feed-forward down projection tensor.

LlamaReferenceMaterializationRecordUAIX.LmRuntime.Models.Llama 5 members

Records one explicit managed copy made for the bounded scalar reference runtime.

Property TensorName

Gets the source tensor name.

Property Role

Gets the semantic tensor role.

Property BlockIndex

Gets the optional transformer block index.

Property CopiedByteCount

Gets the copied byte count.

Property StorageMode

Gets the resulting storage mode.

LlamaReferenceWeightMaterializationUAIX.LmRuntime.Models.Llama 3 members

Contains immutable float32 weights and copy evidence for the scalar reference runtime.

Property Weights

Gets the immutable reference weights.

Property Records

Gets every bounded copy made while materializing the fixture.

Property TotalCopiedByteCount

Gets the total number of copied bytes.

LlamaBoundWeightSetUAIX.LmRuntime.Models.Llama 11 members

Resolves a complete LLaMA binding manifest into stable mapped tensor views.

This object does not own the operating-system mapping. Every view borrows storage from the supplied and becomes invalid when that mapping is disposed.

Method LlamaBoundWeightSet(UAIX.LmRuntime.Gguf.MappedGgufFile,UAIX.LmRuntime.Models.Llama.TensorBindingManifest,UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Initializes a mapped LLaMA weight set from a complete binding manifest.

mapping
The mapped GGUF file that owns tensor storage.
manifest
The validated manifest that binds tensor requirements, model identity, and storage diagnostics used by the operation.
config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
Property Mapping

Gets the mapping that owns all borrowed tensor bytes.

Property Configuration

Gets the validated model configuration.

Property Manifest

Gets the complete tensor binding manifest.

Property Bindings

Gets all semantic mapped tensor bindings.

Property TokenEmbeddings

Gets the token embedding tensor.

Property OutputNorm

Gets the final output normalization tensor.

Property Output

Gets the output projection tensor or tied embedding alias.

Property Layers

Gets the block-local mapped weight sets.

Method Get(UAIX.LmRuntime.Models.Llama.LlamaTensorRole,System.Nullable<int>)

Retrieves the llama bound tensor from the current LlamaBoundWeightSet state after validating the requested access.

role
The semantic LLaMA tensor role used to select the required bound tensor from the validated manifest.
blockIndex
The zero-based block index; it must identify an existing position within the relevant validated range.

Returns: The LlamaBoundTensor result produced by LlamaBoundWeightSet.Get for this contract: Retrieves the llama bound tensor from the current LlamaBoundWeightSet state after validating the requested access. It is published only after all documented validation and ownership transitions succeed.

Method MaterializeFloat32ReferenceWeights(int)

Materializes bounded float32 arrays for the scalar correctness runtime.

maximumCopiedBytes
The maximum total bytes that may be copied from mapped storage.

Returns: The immutable reference weights and explicit copy ledger.

LlamaRuntimeModeUAIX.LmRuntime.Models.Llama 1 member

Identifies the deterministic execution contract used by a mapped model session.

Field DeterministicParity

Runs only deterministic parity behavior without adaptive governance.

LlamaOneTokenFinishReasonUAIX.LmRuntime.Models.Llama 1 member

Identifies why a bounded one-token generation operation ended.

Field OneTokenCompleted

Exactly one greedy token was selected as requested.

LlamaMappedModelLoadOptionsUAIX.LmRuntime.Models.Llama 5 members

Configures loading of a mapped LLaMA GGUF artifact.

Property ParseOptions

Gets GGUF parser safety limits.

Property BindingOptions

Gets semantic tensor binding validation options.

Property RuntimeMode

Gets the runtime mode.

Property MaximumReferenceMaterializationBytes

Gets the maximum bytes that scalar reference sessions may copy from mapped F32 weights.

Property ComputeModelSha256

Gets whether a SHA-256 digest of the complete artifact should be computed during load.

LlamaMappedModelLoadTimingsUAIX.LmRuntime.Models.Llama 5 members

Records measured stages of mapped model loading.

Property ParseDuration

Gets metadata and tensor catalog parse duration.

Property MapDuration

Gets operating-system memory-map creation duration.

Property CompositionDuration

Gets architecture, tokenizer, and binding composition duration.

Property HashDuration

Gets optional complete-file digest duration.

Property TotalDuration

Gets total load duration.

LlamaMappedModelManifestUAIX.LmRuntime.Models.Llama 13 members

Describes the immutable evidence produced while loading a mapped LLaMA model.

Property ModelPath

Gets the normalized model path.

Property ModelByteCount

Gets the exact mapped GGUF file length observed during parsing.

Property ModelSha256

Gets the optional complete-file SHA-256 digest.

Property GgufVersion

Gets the GGUF version.

Property Architecture

Gets the architecture identifier.

Property ModelName

Gets the model display name.

Property Tokenizer

Gets the tokenizer implementation name.

Property BoundTensorCount

Gets the bound tensor count.

Property StorageSummary

Gets the physical tensor storage summary used by direct mapped execution.

Property ManagedModelWeightCopiedByteCount

Gets the managed model-weight byte count copied by the default execution path.

Property RuntimeMode

Gets the selected execution mode.

Property Timings

Gets load-stage timings.

Property Evidence

Gets load evidence messages.

LlamaOneTokenOptionsUAIX.LmRuntime.Models.Llama 4 members

Configures one deterministic mapped-model greedy-token operation.

Property ResetSession

Gets whether the session should reset before prompt evaluation.

Property ParseSpecialTokens

Gets whether raw special-token text should be recognized.

Property AddSpecialTokens

Gets whether model-defined BOS/EOS behavior should be applied.

Property EmitTokenizerTrace

Gets whether tokenizer trace events should be captured.

LlamaOneTokenTimingsUAIX.LmRuntime.Models.Llama 4 members

Records measured stages of exactly one mapped-model greedy decode operation.

Property TokenizationDuration

Gets prompt tokenization duration.

Property PrefillDuration

Gets prompt prefill duration.

Property SelectionDuration

Gets greedy selection and token decode duration.

Property TotalDuration

Gets total operation duration.

LlamaMappedGreedyTokenResultUAIX.LmRuntime.Models.Llama 20 members

Represents an end-to-end prompt-to-one-token result from a mapped GGUF model.

Property ModelPath

Gets the normalized GGUF model path used for the operation.

Property ModelSha256

Gets the optional complete-file model digest computed during load.

Property ModelName

Gets the model display name declared by GGUF metadata.

Property Architecture

Gets the model architecture identifier.

Property Prompt

Gets the input prompt.

Property PromptTokenIds

Gets the exact prompt token identifiers.

Property TokenizerTrace

Gets tokenizer trace events when requested.

Property TokenId

Gets the selected token identifier.

Property TokenText

Gets the selected token text.

Property SelectedLogit

Gets the selected token logit.

Property Logits

Gets the complete next-token logits for parity diagnostics.

Property StorageSummary

Gets the mapped storage-type summary.

Property ManagedModelWeightCopiedByteCount

Gets the managed model-weight bytes copied by the session path.

Property ManagedAllocatedByteCount

Gets managed bytes allocated on the current thread during the measured operation.

Property Position

Gets the sequence position that produced the logits.

Property KeyValueCacheTokenCount

Gets the resulting key/value cache token count.

Property FinishReason

Gets the deterministic finish reason.

Property RuntimeMode

Gets the runtime mode.

Property Timings

Gets measured operation timings.

Property Evidence

Gets evidence statements for the deterministic one-token operation.

LlamaMappedModelLoaderUAIX.LmRuntime.Models.Llama 1 member

Loads a local GGUF artifact into a mapped, tokenizer-aware LLaMA model composition.

Method Load(string,UAIX.LmRuntime.Models.Llama.LlamaMappedModelLoadOptions)

Loads and validates one mapped local model.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional LlamaMappedModelLoadOptions controlling Load; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The owned mapped model, with ownership and disposal obligations defined by the returned type and the Load contract.

LlamaMappedModelUAIX.LmRuntime.Models.Llama 14 members

Owns a mapped GGUF artifact and immutable LLaMA runtime composition.

Property Mapping

Gets the mapped model storage owner.

Property Configuration

Gets the validated LLaMA configuration.

Property TokenizerMetadata

Gets validated GGUF tokenizer metadata.

Property Tokenizer

Gets the exact metadata-driven tokenizer.

Property BindingManifest

Gets the tensor binding manifest.

Property Weights

Gets the mapped semantic weight set.

Property WeightSource

Gets the direct mapped execution weight source.

Property Options

Gets the load options retained for deterministic session creation.

Property Manifest

Gets the immutable load evidence manifest.

Property IsDisposed

Gets whether the model has been disposed.

Method CreateReferenceSession

Creates an independent scalar reference session with its own key/value state.

Returns: The new mapped reference session, with ownership and disposal obligations defined by the returned type and the CreateReferenceSession contract.

Method CreateMaterializedReferenceSession

Creates an independent compatibility session over explicitly materialized float32 arrays.

Returns: The materialized compatibility session, with ownership and disposal obligations defined by the returned type and the CreateMaterializedReferenceSession contract.

Method GetReferenceMaterialization

Gets the bounded reference materialization evidence, creating it on first use.

Returns: The LlamaReferenceWeightMaterialization result produced by LlamaMappedModel.GetReferenceMaterialization for this contract: Gets the bounded reference materialization evidence, creating it on first use. It is published only after all documented validation and ownership transitions succeed.

Method Dispose

Releases resources owned by LlamaMappedModel and transitions it to the disposed state.

LlamaMappedReferenceSessionUAIX.LmRuntime.Models.Llama 12 members

Combines exact GGUF tokenization with an independent scalar reference session.

Property Position

Gets the current next-token sequence position.

Property KvCache

Gets the typed session-local key/value cache.

Property IsDisposed

Gets whether this session has released its state.

Method Reset

Resets this session's sequence and key/value state.

Method DecodeOneGreedy(string,UAIX.LmRuntime.Models.Llama.LlamaOneTokenOptions)

Tokenizes a prompt, executes prefill, selects argmax, and decodes exactly one token.

prompt
The prompt processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
options
The optional LlamaOneTokenOptions controlling DecodeOneGreedy; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The LlamaMappedGreedyTokenResult result produced by LlamaMappedReferenceSession.DecodeOneGreedy for this contract: Tokenizes a prompt, executes prefill, selects argmax, and decodes exactly one token. It is published only after all documented validation and ownership transitions succeed.

Method GenerateGreedy(string,System.Span<int>,System.Span<float>,UAIX.LmRuntime.Models.Llama.LlamaGreedyGenerationOptions,UAIX.LmRuntime.Tokenization.TokenizationOptions,System.Threading.CancellationToken)

Tokenizes a prompt and generates greedy token identifiers into caller-owned buffers.

prompt
The prompt processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
generatedTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
finalLogits
The final logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
generationOptions
The generation options that define validation limits and execution behavior; required values are checked before use.
tokenizationOptions
The tokenization options that define validation limits and execution behavior; required values are checked before use.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The LlamaGreedyGenerationResult result produced by LlamaMappedReferenceSession.GenerateGreedy for this contract: Tokenizes a prompt and generates greedy token identifiers into caller-owned buffers. It is published only after all documented validation and ownership transitions succeed.

Method GenerateGreedy(string,System.Span<int>,System.Span<float>,UAIX.LmRuntime.Models.Llama.LlamaGreedyGenerationOptions,UAIX.LmRuntime.Tokenization.TokenizationOptions,System.Action<UAIX.LmRuntime.Models.Llama.LlamaGeneratedToken>,System.Threading.CancellationToken)

Tokenizes a prompt, generates greedy token identifiers, and reports each selected token synchronously.

prompt
The prompt processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
generatedTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
finalLogits
The final logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
generationOptions
The generation options that define validation limits and execution behavior; required values are checked before use.
tokenizationOptions
The tokenization options that define validation limits and execution behavior; required values are checked before use.
tokenObserver
The optional observer invoked once for each selected token.
cancellationToken
A token observed before selection and before the next committed model step.

Returns: The LlamaGreedyGenerationResult result produced by LlamaMappedReferenceSession.GenerateGreedy for this contract: Tokenizes a prompt, generates greedy token identifiers, and reports each selected token synchronously. It is published only after all documented validation and ownership transitions succeed.

Method ExportState(UAIX.LmRuntime.Models.Llama.LlamaSessionPersistenceOptions)

Exports complete deterministic state with model, configuration, tokenizer, and cache-layout identities.

options
Optional persistence metadata. Empty identity fields are resolved from the mapped model.

Returns: A newly allocated byte[] containing the ordered result of LlamaMappedReferenceSession.ExportState: Exports complete deterministic state with model, configuration, tokenizer, and cache-layout identities. The caller owns the returned array and later mutation cannot alter the source object.

Method SaveState(string,UAIX.LmRuntime.Models.Llama.LlamaSessionPersistenceOptions)

Saves complete deterministic state to a local artifact.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional LlamaSessionPersistenceOptions controlling SaveState; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The LlamaSessionArtifact result produced by LlamaMappedReferenceSession.SaveState for this contract: Saves complete deterministic state to a local artifact. It is published only after all documented validation and ownership transitions succeed.

Method RestoreState(System.ReadOnlySpan<byte>,int)

Restores verified complete state after enforcing mapped model and tokenizer identities.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The LlamaSessionArtifact result produced by LlamaMappedReferenceSession.RestoreState for this contract: Restores verified complete state after enforcing mapped model and tokenizer identities. It is published only after all documented validation and ownership transitions succeed.

Method LoadState(string,int)

Loads and restores complete deterministic state from a local artifact.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The verified artifact, with ownership and disposal obligations defined by the returned type and the LoadState contract.

Method Dispose

Releases resources owned by LlamaMappedReferenceSession and transitions it to the disposed state.

LlamaModelConfigUAIX.LmRuntime.Models.Llama 17 members

Represents LLaMA-family transformer configuration reconstructed from GGUF metadata.

Property Architecture

Gets the architecture name.

Property ModelName

Gets the optional model display name.

Property EmbeddingLength

Gets the embedding length.

Property BlockCount

Gets the transformer block count.

Property FeedForwardLength

Gets the feed-forward hidden length.

Property AttentionHeadCount

Gets the attention head count.

Property AttentionKeyValueHeadCount

Gets the attention key/value head count.

Property ContextLength

Gets the training context length.

Property VocabularySize

Gets the vocabulary size.

Property RopeDimensionCount

Gets the RoPE dimension count per attention head.

Property RopeFrequencyBase

Gets the RoPE frequency base.

Property RmsNormEpsilon

Gets the RMSNorm epsilon.

Property SupportsTiedOutputProjection

Gets whether the loader may use token embeddings as the output projection when output.weight is absent.

Property HeadDimension

Gets the dimension of one query attention head.

Property KeyValueDimension

Gets the flattened key/value projection dimension.

Method FromGguf(UAIX.LmRuntime.Gguf.GgufModel)

Creates a LLaMA-family configuration from GGUF metadata.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The LlamaModelConfig result produced by LlamaModelConfig.FromGguf for this contract: Creates a LLaMA-family configuration from GGUF metadata. It is published only after all documented validation and ownership transitions succeed.

Method Validate

Validates architectural invariants required by the scalar LLaMA runtime.

LlamaReferenceForwardPassUAIX.LmRuntime.Models.Llama 2 members

Provides tiny reference building blocks for LLaMA-family correctness tests.

Method RmsNorm(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>,float)

Applies the LLaMA RMSNorm operation through the CPU reference kernel.

input
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
weight
The weight sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
epsilon
The positive normalization epsilon added to the mean-square term to avoid division by zero while preserving deterministic numerical behavior.
Method ApplyRope(System.Span<float>,System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,int)

Applies LLaMA-style RoPE to a query or key vector in place.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
cos
The cos sequence used by this operation; its required length, ordering, and element bounds are validated before access.
sin
The sin sequence used by this operation; its required length, ordering, and element bounds are validated before access.
ropeDimensions
The even number of leading head dimensions transformed by rotary positional encoding.
LlamaReferenceLayerWeightsUAIX.LmRuntime.Models.Llama 9 members

Stores immutable float32 weights for one scalar/reference LLaMA transformer block.

Property AttentionNorm

Gets the attention RMSNorm scale.

Property AttentionQuery

Gets the query projection matrix in row-major logical order.

Property AttentionKey

Gets the key projection matrix in row-major logical order.

Property AttentionValue

Gets the value projection matrix in row-major logical order.

Property AttentionOutput

Gets the attention output projection matrix in row-major logical order.

Property FeedForwardNorm

Gets the feed-forward RMSNorm scale.

Property FeedForwardGate

Gets the feed-forward gate projection matrix in row-major logical order.

Property FeedForwardUp

Gets the feed-forward up projection matrix in row-major logical order.

Property FeedForwardDown

Gets the feed-forward down projection matrix in row-major logical order.

LlamaReferenceModelWeightsUAIX.LmRuntime.Models.Llama 5 members

Stores immutable float32 weights for the deterministic LLaMA reference runtime.

Property TokenEmbeddings

Gets the token embedding table in row-major logical order.

Property Layers

Gets transformer block weights in execution order.

Property OutputNorm

Gets the final RMSNorm scale.

Property OutputProjection

Gets the output projection matrix in row-major logical order. An empty value means tied embeddings.

Method Validate(UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Validates all reference-weight shapes against a LLaMA configuration.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
LlamaGreedyTokenResultUAIX.LmRuntime.Models.Llama 5 members

Represents exactly one greedily selected token produced by the reference runtime.

Property TokenId

Gets the selected token identifier.

Property TokenText

Gets the selected token text when a tokenizer is attached.

Property PromptTokenCount

Gets the number of prompt tokens evaluated.

Property Position

Gets the zero-based position whose logits selected this token.

Property SelectedLogit

Gets the selected token logit.

LlamaReferenceSessionUAIX.LmRuntime.Models.Llama 17 members

Executes a deterministic, scalar-first LLaMA forward path for tiny correctness fixtures.

This class is the numerical correctness anchor for later optimized kernels. It is intentionally limited to batch size one and F32, Q8_0, or Q4_0 mapped or array-backed weights. It performs no governance or adaptive policy operations and therefore belongs exclusively to deterministic parity mode.

Method LlamaReferenceSession(UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.LlamaReferenceModelWeights,UAIX.LmRuntime.Tokenization.IGgufTokenizer)

Initializes a reference session through the v1.8.0 array-backed compatibility path.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
tokenizer
The optional tokenizer used only to decode the selected token text.
Method LlamaReferenceSession(UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.ILlamaModelWeightSource,UAIX.LmRuntime.Tokenization.IGgufTokenizer)

Initializes a reference session over immutable array-backed or direct mapped weight sources.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
tokenizer
The optional tokenizer used only to decode the selected token text.
Property Position

Gets the next sequence position to be evaluated.

Property KvCache

Gets the typed key/value cache owned by this session.

Property WeightSource

Gets the immutable model weight source used by this session.

Property VocabularySize

Gets the configured vocabulary size.

Property ContextCapacity

Gets the configured sequence capacity.

Method Reset

Clears sequence state and all key/value cache contents.

Method CaptureState

Captures complete deterministic session state without serializing live model pointers.

Returns: The LlamaReferenceSessionSnapshot result produced by LlamaReferenceSession.CaptureState for this contract: Captures complete deterministic session state without serializing live model pointers. It is published only after all documented validation and ownership transitions succeed.

Method RestoreState(UAIX.LmRuntime.Models.Llama.LlamaReferenceSessionSnapshot)

Restores complete deterministic state after validating sequence, vocabulary, and cache identities.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
Method RunStep(int,System.Span<float>)

Evaluates one input token and writes next-token logits.

tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
logits
The destination buffer with at least vocabulary-size elements.
Method DecodeOneGreedy(System.Collections.Generic.IReadOnlyList<int>,bool)

Evaluates a prompt and returns exactly one greedily selected next token.

promptTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
resetSession
Whether existing key/value state should be cleared first.

Returns: The LlamaGreedyTokenResult result produced by LlamaReferenceSession.DecodeOneGreedy for this contract: Evaluates a prompt and returns exactly one greedily selected next token. It is published only after all documented validation and ownership transitions succeed.

Method Prefill(System.Collections.Generic.IReadOnlyList<int>,bool)

Evaluates every prompt token and leaves the final logits available for deterministic selection.

promptTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
resetSession
Whether existing sequence and cache state should be cleared first.
Method CopyLastLogitsTo(System.Span<float>)

Copies the most recently computed logits to a caller-provided destination.

destination
The destination with room for the configured vocabulary.
Method GenerateGreedy(System.Collections.Generic.IReadOnlyList<int>,System.Span<int>,System.Span<float>,UAIX.LmRuntime.Models.Llama.LlamaGreedyGenerationOptions,System.Threading.CancellationToken)

Generates deterministic greedy token identifiers into caller-owned buffers.

promptTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
generatedTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
finalLogits
The caller-owned destination for the final available logits.
options
The optional LlamaGreedyGenerationOptions controlling GenerateGreedy; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The LlamaGreedyGenerationResult result produced by LlamaReferenceSession.GenerateGreedy for this contract: Generates deterministic greedy token identifiers into caller-owned buffers. It is published only after all documented validation and ownership transitions succeed.

Method GenerateGreedy(System.Collections.Generic.IReadOnlyList<int>,System.Span<int>,System.Span<float>,UAIX.LmRuntime.Models.Llama.LlamaGreedyGenerationOptions,System.Action<UAIX.LmRuntime.Models.Llama.LlamaGeneratedToken>,System.Threading.CancellationToken)

Generates deterministic greedy token identifiers and reports each selection to a synchronous observer.

promptTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
generatedTokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
finalLogits
The caller-owned destination for the final available logits.
options
The optional LlamaGreedyGenerationOptions controlling GenerateGreedy; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
tokenObserver
The optional synchronous observer invoked once after each token is selected.
cancellationToken
A token observed before selection and before the next committed model step.

Returns: The LlamaGreedyGenerationResult result produced by LlamaReferenceSession.GenerateGreedy for this contract: Generates deterministic greedy token identifiers and reports each selection to a synchronous observer. It is published only after all documented validation and ownership transitions succeed.

Method SelectGreedyToken(int)

Selects and decodes one greedy token from the current logits.

promptTokenCount
The number of prompt tokens associated with the current logits.

Returns: The LlamaGreedyTokenResult result produced by LlamaReferenceSession.SelectGreedyToken for this contract: Selects and decodes one greedy token from the current logits. It is published only after all documented validation and ownership transitions succeed.

LlamaReferenceFixtureUAIX.LmRuntime.Models.Llama 5 members

Represents a deterministic tiny reference fixture with one transformer block.

Property Configuration

Gets the fixture model configuration.

Property Weights

Gets the fixture model weights.

Property Tokenizer

Gets the fixture tokenizer.

Property PromptTokenIds

Gets the canonical fixture prompt tokens.

Method CreateSession

Creates the session from the validated inputs required by LlamaReferenceFixture.

Returns: A session with empty key/value cache state.

LlamaReferenceFixtureFactoryUAIX.LmRuntime.Models.Llama 1 member

Creates deterministic tiny fixtures used by reference-runtime tests and examples.

Method CreateDeterministic

Creates a one-block, five-token deterministic LLaMA fixture.

Returns: The fixture configuration, weights, tokenizer, and prompt, with ownership and disposal obligations defined by the returned type and the CreateDeterministic contract.

ILlamaSessionUAIX.LmRuntime.Models.Llama 1 member

Defines the lifecycle for a LLaMA-family inference session.

Method DecodeAsync(int,System.Threading.CancellationToken)

Decodes the next token for the active sequence.

tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<int> that completes with the result of ILlamaSession.DecodeAsync: Decodes the next token for the active sequence. Fault and cancellation states are propagated without a successful partial result.

LlamaReferenceExecutorUAIX.LmRuntime.Models.Llama 1 member

Provides scalar/reference execution anchors for LLaMA-family graphs.

Method Forward(System.ReadOnlySpan<float>,UAIX.LmRuntime.Models.Llama.LlamaWeights,System.Span<float>)

Executes a minimal reference forward pass over hidden-state logits.

hiddenState
The hidden state sequence used by this operation; its required length, ordering, and element bounds are validated before access.
weights
The LLaMA weights used by the reference path.
logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
LlamaWeightsUAIX.LmRuntime.Models.Llama 2 members

Represents model-level LLaMA weights used by reference execution.

Property TokenEmbeddings

Gets token embedding weights.

Property OutputProjection

Gets the output projection matrix in row-major order.

LlamaLayerWeightsUAIX.LmRuntime.Models.Llama 3 members

Represents one transformer block's reference weights.

Property AttentionQuery

Gets the attention query projection matrix.

Property AttentionKey

Gets the attention key projection matrix.

Property AttentionValue

Gets the attention value projection matrix.

LlamaReferenceRmsNormUAIX.LmRuntime.Models.Llama 1 member

Provides reference RMSNorm behavior.

Method Apply(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>,float)

Applies the supplied input to the supplied values while preserving the operation's numeric and shape invariants.

input
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
weights
The weights sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
epsilon
The positive normalization epsilon added to the mean-square term to avoid division by zero while preserving deterministic numerical behavior.
LlamaReferenceRopeUAIX.LmRuntime.Models.Llama 1 member

Provides reference RoPE behavior.

Method Apply(System.Span<float>,int,float)

Applies rotary position embedding to adjacent hidden-state pairs.

values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
theta
The rotary angle in radians applied to the paired vector components at the addressed position.
LlamaReferenceAttentionUAIX.LmRuntime.Models.Llama 1 member

Provides reference causal attention behavior.

Method ApplyCausal(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,int,System.Span<float>)

Applies a minimal causal attention score computation.

query
The query sequence used by this operation; its required length, ordering, and element bounds are validated before access.
keys
The keys sequence used by this operation; its required length, ordering, and element bounds are validated before access.
values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
headSize
The numeric head size consumed by ApplyCausal; it must satisfy the member's documented range, geometry, and finite-value requirements.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
GroupedQueryAttentionMapUAIX.LmRuntime.Models.Llama 1 member

Maps query heads to grouped key/value heads.

Method MapHead(int,int,int)

Maps an attention query head to the corresponding KV head.

queryHead
The zero-based query-head index mapped deterministically to its corresponding key/value head.
queryHeadCount
The query head count used to bound this operation; it must be nonnegative and within the supported range.
keyValueHeadCount
The key value head count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The int value computed by GroupedQueryAttentionMap.MapHead for this contract: Maps an attention query head to the corresponding KV head. Range, finite-value, and overflow checks are completed before the value is returned.

LlamaSwiGluReferenceUAIX.LmRuntime.Models.Llama 1 member

Provides reference SwiGLU behavior.

Method Apply(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>)

Applies the SwiGLU activation to validated gate and up-projection vectors.

gate
The gate sequence used by this operation; its required length, ordering, and element bounds are validated before access.
up
The up sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
LlamaLogitComputerUAIX.LmRuntime.Models.Llama 1 member

Computes reference logits from a hidden state and output projection.

Method ComputeLogits(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>)

Computes logits from a hidden vector and a row-major projection matrix.

hiddenState
The hidden state sequence used by this operation; its required length, ordering, and element bounds are validated before access.
projection
The projection sequence used by this operation; its required length, ordering, and element bounds are validated before access.
logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
LlamaParityToleranceUAIX.LmRuntime.Models.Llama 3 members

Configures exact token and explicit floating-point tolerance checks for cross-storage parity.

Property AbsoluteTolerance

Gets the absolute per-logit tolerance.

Property RelativeTolerance

Gets the relative per-logit tolerance.

Method Validate

Validates the absolute and relative parity tolerances used for numerical comparison.

LlamaLogitComparisonUAIX.LmRuntime.Models.Llama 6 members

Summarizes a deterministic comparison of two next-token logit vectors.

Property IsWithinTolerance

Gets whether every compared logit satisfies the configured tolerance.

Property MaximumAbsoluteError

Gets the largest absolute logit difference.

Property MeanAbsoluteError

Gets the arithmetic mean absolute logit difference.

Property FirstFailingIndex

Gets the first failing logit index, or when none failed.

Property FirstFailingReferenceValue

Gets the reference value at the first failing index.

Property FirstFailingCandidateValue

Gets the candidate value at the first failing index.

LlamaLogitComparatorUAIX.LmRuntime.Models.Llama 1 member

Compares deterministic next-token vectors without widening caller-provided tolerances.

Method Compare(System.Collections.Generic.IReadOnlyList<float>,System.Collections.Generic.IReadOnlyList<float>,UAIX.LmRuntime.Models.Llama.LlamaParityTolerance)

Compares two logit vectors using absolute-or-relative error acceptance.

reference
The reference sequence used by this operation; its required length, ordering, and element bounds are validated before access.
candidate
The candidate sequence used by this operation; its required length, ordering, and element bounds are validated before access.
tolerance
The tolerance input of type LlamaParityTolerance read by LlamaLogitComparator.Compare; it must satisfy the member-specific nullability, identity, range, and ownership rules before dependent work begins.

Returns: The LlamaLogitComparison result produced by LlamaLogitComparator.Compare for this contract: Compares two logit vectors using absolute-or-relative error acceptance. It is published only after all documented validation and ownership transitions succeed.

LlamaStorageParityCandidateResultUAIX.LmRuntime.Models.Llama 7 members

Represents one candidate model's parity result against a selected reference model.

Property ModelPath

Gets the candidate model path.

Property ModelSha256

Gets the candidate model SHA-256.

Property StorageSummary

Gets the candidate storage summary.

Property TokenMatches

Gets whether the selected token identifier exactly equals the reference identifier.

Property LogitComparison

Gets the detailed logit comparison.

Property OneTokenResult

Gets the complete candidate one-token result.

Property Passed

Gets whether both exact-token and floating-point contracts passed.

LlamaStorageParityResultUAIX.LmRuntime.Models.Llama 4 members

Represents a cross-storage one-token parity run.

Property Prompt

Gets the prompt used for every model.

Property ReferenceResult

Gets the reference one-token result.

Property Candidates

Gets candidate results in caller order.

Property Passed

Gets whether every candidate passed the explicit parity contract.

LlamaStorageParityRunnerUAIX.LmRuntime.Models.Llama 1 member

Executes bounded offline one-token parity comparisons across local GGUF storage variants.

Method Run(string,System.Collections.Generic.IReadOnlyList<string>,string,UAIX.LmRuntime.Models.Llama.LlamaParityTolerance)

Runs one reference model and one or more candidate models with identical prompt settings.

referenceModelPath
The local file-system reference model path processed by this operation; it must satisfy the containing component's path and scope policy.
candidateModelPaths
The local file-system candidate model paths processed by this operation; it must satisfy the containing component's path and scope policy.
prompt
The prompt processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
tolerance
The tolerance input of type LlamaParityTolerance read by LlamaStorageParityRunner.Run; it must satisfy the member-specific nullability, identity, range, and ownership rules before dependent work begins.

Returns: The LlamaStorageParityResult result produced by LlamaStorageParityRunner.Run for this contract: Runs one reference model and one or more candidate models with identical prompt settings. It is published only after all documented validation and ownership transitions succeed.

LlamaTensorRoleUAIX.LmRuntime.Models.Llama 12 members

Identifies semantic roles for LLaMA-family tensors.

Field TokenEmbedding

Token embedding table.

Field OutputNorm

Final output normalization scale.

Field Output

Output projection matrix.

Field AttentionNorm

Per-block attention normalization scale.

Field AttentionQuery

Per-block query projection.

Field AttentionKey

Per-block key projection.

Field AttentionValue

Per-block value projection.

Field AttentionOutput

Per-block attention output projection.

Field FeedForwardNorm

Per-block feed-forward normalization scale.

Field FeedForwardGate

Per-block feed-forward gate projection.

Field FeedForwardUp

Per-block feed-forward up projection.

Field FeedForwardDown

Per-block feed-forward down projection.

TensorBindingStorageKindUAIX.LmRuntime.Models.Llama 2 members

Identifies where a bound tensor payload is stored.

Field MemoryMappedFile

The tensor remains in the GGUF memory-mapped artifact.

Field Alias

The tensor is an alias of another bound tensor.

TensorBindingOwnershipUAIX.LmRuntime.Models.Llama 2 members

Identifies ownership for a bound tensor payload.

Field BorrowedModelStorage

The binding borrows storage owned by the loaded model.

Field BorrowedAlias

The binding borrows storage through another tensor binding.

TensorBindingOptionsUAIX.LmRuntime.Models.Llama 4 members

Configures semantic validation performed by .

Property AllowTiedOutputProjection

Gets whether a missing output.weight may alias token_embd.weight.

Property ValidateSemanticShapes

Gets whether dimensions derived from model metadata must match the GGUF storage shape.

Property ValidateByteLengths

Gets whether physical byte lengths must match the registered tensor type traits.

Property ValidateFileBounds

Gets whether tensor ranges must fit inside the parsed source file length when available.

LlamaTensorRequirementUAIX.LmRuntime.Models.Llama 7 members

Describes one required LLaMA tensor contract.

Property Name

Gets the required tensor name.

Property Role

Gets the tensor role.

Property ExpectedRank

Gets the expected rank.

Property ExpectedStorageDimensions

Gets dimensions in GGUF storage order, where dimension zero is the row width.

Property ExpectedLogicalDimensions

Gets dimensions in logical row-major order for diagnostics and manifests.

Property BlockIndex

Gets the optional block index.

Property IsOptional

Gets whether the tensor may be satisfied by an explicit alias rule.

TensorBindingEntryUAIX.LmRuntime.Models.Llama 10 members

Represents one bound tensor entry.

Property Requirement

Gets the tensor requirement.

Property Descriptor

Gets the GGUF tensor descriptor supplying storage.

Property SourceTensorName

Gets the source tensor name when this binding is an alias.

Property LogicalDimensions

Gets the normalized logical dimensions.

Property ByteLength

Gets the physical storage byte length.

Property AbsoluteOffset

Gets the absolute source-file offset.

Property DataType

Gets the mapped runtime data type.

Property StorageKind

Gets the storage kind.

Property Ownership

Gets the ownership contract.

Property IsAlias

Gets whether this binding aliases another tensor.

TensorBindingDiagnosticUAIX.LmRuntime.Models.Llama 4 members

Represents a tensor binding diagnostic.

Property Code

Gets the diagnostic code.

Property TensorName

Gets the tensor name associated with the diagnostic.

Property BlockIndex

Gets the optional transformer block index.

Property Message

Gets the diagnostic message.

TensorBindingManifestUAIX.LmRuntime.Models.Llama 5 members

Represents the result of LLaMA tensor binding.

Property Bindings

Gets bound tensor entries.

Property Diagnostics

Gets binding diagnostics.

Property IsComplete

Gets a value indicating whether every required tensor was bound without diagnostics.

Method TryGetBinding(UAIX.LmRuntime.Models.Llama.LlamaTensorRole,System.Nullable<int>,UAIX.LmRuntime.Models.Llama.TensorBindingEntry&)

Attempts to find one bound tensor by semantic role and optional block index.

role
The semantic LLaMA tensor role used to select the required bound tensor from the validated manifest.
blockIndex
The zero-based block index; it must identify an existing position within the relevant validated range.
entry
When the method returns, contains the entry produced by the operation when successful; otherwise contains the type's default value.

Returns: True when try get binding succeeds for the supplied values; otherwise, false.

Method ThrowIfIncomplete

Throws when the manifest contains one or more diagnostics.

TensorBindingExceptionUAIX.LmRuntime.Models.Llama 2 members

Represents a failed LLaMA tensor schema binding operation.

Method TensorBindingException(UAIX.LmRuntime.Models.Llama.TensorBindingManifest)

Initializes a binding exception from a failed manifest.

manifest
The validated manifest that binds tensor requirements, model identity, and storage diagnostics used by the operation.
Property Manifest

Gets the failed binding manifest.

LlamaRequiredTensorRegistryUAIX.LmRuntime.Models.Llama 1 member

Builds the required LLaMA-family tensor registry from model configuration.

Method Build(UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Creates the required tensor list for the configuration.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.

Returns: An ordered read-only IReadOnlyList<LlamaTensorRequirement> result from LlamaRequiredTensorRegistry.Build: Creates the required tensor list for the configuration. Mutable internal collection aliases are not exposed through the returned contract.

LlamaTensorBinderUAIX.LmRuntime.Models.Llama 2 members

Binds and validates LLaMA-family GGUF tensors as a schema-validation phase.

Method Bind(UAIX.LmRuntime.Gguf.GgufModel,UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Binds required tensors from a parsed GGUF artifact using default validation options.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.

Returns: The TensorBindingManifest result produced by LlamaTensorBinder.Bind for this contract: Binds required tensors from a parsed GGUF artifact using default validation options. It is published only after all documented validation and ownership transitions succeed.

Method Bind(UAIX.LmRuntime.Gguf.GgufModel,UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.TensorBindingOptions)

Binds required tensors from a parsed GGUF artifact.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
options
The optional TensorBindingOptions controlling Bind; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The TensorBindingManifest result produced by LlamaTensorBinder.Bind for this contract: Binds required tensors from a parsed GGUF artifact. It is published only after all documented validation and ownership transitions succeed.

MappedFloat16VectorSourceUAIX.LmRuntime.Models.Llama 6 members

Reads an IEEE float16 vector directly from a mapped GGUF tensor view.

Method MappedFloat16VectorSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedFloat16VectorSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property Length
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyTo(System.Span<float>)

Copies the to into caller-owned storage after validating the requested range and capacity.

destination
The destination buffer that receives the produced values.
MappedBFloat16VectorSourceUAIX.LmRuntime.Models.Llama 6 members

Reads a brain-float16 vector directly from a mapped GGUF tensor view.

Method MappedBFloat16VectorSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedBFloat16VectorSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property Length
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyTo(System.Span<float>)

Copies the to into caller-owned storage after validating the requested range and capacity.

destination
The destination buffer that receives the produced values.
MappedFloat16MatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies an IEEE float16 matrix directly from a mapped GGUF tensor view.

Method MappedFloat16MatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedFloat16MatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedBFloat16MatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies a brain-float16 matrix directly from a mapped GGUF tensor view.

Method MappedBFloat16MatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedBFloat16MatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedQ4_KMatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies a Q4_K matrix directly from a mapped GGUF tensor view.

Method MappedQ4_KMatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedQ4_KMatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedQ6_KMatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies a Q6_K matrix directly from a mapped GGUF tensor view.

Method MappedQ6_KMatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedQ6_KMatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedVectorSourceFactoryUAIX.LmRuntime.Models.Llama 1 member

Selects a mapped scalar vector implementation from GGML storage metadata.

Method Create(UAIX.LmRuntime.Gguf.MappedTensorView)

Creates the read only vector source from the validated inputs required by MappedVectorSourceFactory.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.

Returns: The storage-specific vector source, with ownership and disposal obligations defined by the returned type and the Create contract.

LlamaGenerationStopReasonUAIX.LmRuntime.Models.Llama 5 members

Identifies why deterministic greedy generation stopped.

Field MaximumTokens

The requested maximum number of tokens was produced.

Field EndOfSequence

The configured end-of-sequence token was selected.

Field StopToken

A caller-configured stop token was selected.

Field ContextCapacity

The model context window could not accept another evaluated token.

Field Cancelled

Cooperative cancellation was observed between committed inference steps.

LlamaGreedyGenerationOptionsUAIX.LmRuntime.Models.Llama 4 members

Defines allocation-bounded deterministic greedy generation controls.

Property MaximumTokens

Gets the maximum number of generated tokens.

Property ResetSession

Gets whether the session is reset before prompt prefill.

Property EndOfSequenceTokenId

Gets the optional end-of-sequence token identifier.

Property StopTokenIds

Gets additional token identifiers that terminate generation after being emitted.

LlamaGeneratedTokenUAIX.LmRuntime.Models.Llama 4 members

Describes one token selected during deterministic greedy generation.

The value contains only a zero-based sequence number, token identifier, and selected logit. It does not contain prompt text, decoded output, model bytes, file paths, persistent state, or provider information.

Method LlamaGeneratedToken(int,int,float)

Initializes a new LlamaGeneratedToken instance with validated dependencies and operational bounds.

sequence
The zero-based selection sequence within the current generation operation.
tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
selectedLogit
The deterministic selected logit at this generation sequence position. NaN is rejected; infinities remain valid because compares them using the same deterministic ordering as finite values.
Property Sequence

Gets the zero-based token-selection sequence.

Property TokenId

Gets the selected model vocabulary identifier.

Property SelectedLogit

Gets the selected token's deterministic argmax logit.

LlamaGreedyGenerationResultUAIX.LmRuntime.Models.Llama 5 members

Describes an allocation-bounded greedy generation operation.

Property PromptTokenCount

Gets the number of prompt tokens evaluated for this operation.

Property GeneratedTokenCount

Gets the number of generated token identifiers written to the caller buffer.

Property StopReason

Gets the deterministic stop reason.

Property Position

Gets the next sequence position maintained by the session.

Property FinalSelectedLogit

Gets the selected logit of the final generated token, or negative infinity when none was generated.

RealModelSmokeStageUAIX.LmRuntime.Models.Llama 4 members

Identifies the deepest stage requested from the local real-model smoke workflow.

Field ParseOnly

Parses and validates the GGUF container only.

Field Tokenizer

Also constructs and validates the metadata-driven tokenizer.

Field TensorBinding

Also reconstructs LLaMA geometry and validates required tensor bindings.

Field OneToken

Also executes one deterministic greedy token when every storage contract is supported.

RealModelSmokeOptionsUAIX.LmRuntime.Models.Llama 15 members

Configures an explicitly local, opt-in GGUF smoke inspection.

Property ModelPath

Gets the local GGUF path.

Property AllowedRoot

Gets an optional root that the resolved model path must remain under.

Property MaximumFileByteCount

Gets an optional explicit maximum file length; zero disables this limit.

Property ComputeModelSha256

Gets whether the complete model SHA-256 should be computed.

Property Stage

Gets the deepest smoke stage to execute.

Property Prompt

Gets the prompt used by the one-token stage.

Property ExpectedTokenIdsPath

Gets an optional local JSON file containing expected prompt token identifiers.

Property ExpectedOneTokenPath

Gets an optional local JSON file containing the expected one-token result.

Property RequireEnvironmentGate

Gets whether the explicit environment gate is required.

Property PackageVersion

Gets the package version recorded in evidence.

Property CommitIdentity

Gets a commit or source identity supplied by the operator.

Property ProvenanceLabel

Gets an operator-supplied provenance label.

Property LicenseReviewStatus

Gets the operator-supplied license review status.

Property RedactModelPath

Gets whether the artifact model path is reduced to its file name.

Property EnvironmentGateName

Gets the environment variable that enables real-model execution.

RealModelSmokeStageEvidenceUAIX.LmRuntime.Models.Llama 3 members

Records one real-model workflow stage duration and current-thread allocation delta.

Property Stage

Gets the stage name.

Property ElapsedStopwatchTicks

Gets elapsed stopwatch ticks.

Property ManagedAllocatedByteCount

Gets managed bytes allocated on the measuring thread.

RealModelSmokeArtifactUAIX.LmRuntime.Models.Llama 29 members

Represents a versioned, machine-readable real-model smoke artifact.

Property Schema

Gets the artifact schema identifier.

Property PackageVersion

Gets the package version.

Property CommitIdentity

Gets the source/commit identity.

Property ProvenanceLabel

Gets the operator-supplied provenance label.

Property LicenseReviewStatus

Gets the operator-supplied license review status.

Property GeneratedUtc

Gets the generation time in UTC.

Property ClaimStatus

Gets the evidence claim status.

Property Succeeded

Gets whether the requested stage completed.

Property CompletedStage

Gets the deepest completed stage.

Property ModelPath

Gets the normalized local model path.

Property FileByteCount

Gets the model file length.

Property ModelSha256

Gets the optional complete-file SHA-256.

Property GgufVersion

Gets the parsed GGUF version.

Property Architecture

Gets the model architecture.

Property TokenizerFamily

Gets the tokenizer family.

Property StorageTypeCounts

Gets physical tensor counts by GGML storage name.

Property BindingDiagnostics

Gets binding diagnostic messages.

Property PromptTokenIds

Gets exact prompt token identifiers when tokenization completed.

Property SelectedTokenId

Gets the selected one-token identifier when execution completed.

Property SelectedTokenText

Gets the selected token text when execution completed.

Property ExpectedTokenIdsMatched

Gets whether the optional expected token-identifier evidence matched.

Property ExpectedOneTokenMatched

Gets whether the optional expected one-token evidence matched.

Property Alignment

Gets the effective GGUF tensor alignment.

Property PromptSha256

Gets the SHA-256 of the prompt text rather than requiring publication of the raw prompt.

Property StageEvidence

Gets stage timing and current-thread allocation measurements.

Property UnsupportedDiagnostics

Gets exact unsupported execution diagnostics.

Property CommandIdentity

Gets the non-secret command identity.

Property EnvironmentVariableNames

Gets environment-variable names used by the workflow without values.

Property Diagnostics

Gets bounded workflow diagnostics.

RealModelSmokeEnvironmentUAIX.LmRuntime.Models.Llama 1 member

Creates explicit local smoke options from the documented environment-variable contract.

Method Load(UAIX.LmRuntime.Models.Llama.RealModelSmokeStage)

Reads the local real-model smoke configuration from environment variables.

stage
The explicitly selected real-model smoke stage or stage implementation executed by the controlled pipeline.

Returns: The local smoke options, with ownership and disposal obligations defined by the returned type and the Load contract.

RealModelPathPolicyUAIX.LmRuntime.Models.Llama 1 member

Resolves local model paths under an optional root without following hidden network or download behavior.

Method Resolve(string,string,long)

Resolves and validates one local model path.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
allowedRoot
The normalized caller-authorized directory boundary, or null when no containment root was configured for the operation.
maximumFileByteCount
The optional explicit file-size limit; zero disables it.

Returns: The text produced by RealModelPathPolicy.Resolve for this contract: Resolves and validates one local model path. The returned string is detached from mutable caller storage and is not persisted by the operation.

RealModelSmokeRunnerUAIX.LmRuntime.Models.Llama 1 member

Executes staged, offline real-model validation and emits a bounded evidence artifact.

Method Run(UAIX.LmRuntime.Models.Llama.RealModelSmokeOptions)

Runs the requested local smoke stages in their required order.

options
The explicit local smoke options and evidence boundaries.

Returns: A bounded machine-readable artifact describing the deepest completed stage.

ReferenceKvWriteBehaviorUAIX.LmRuntime.Models.Llama 1 member

Identifies the deterministic write semantics used by the scalar reference key/value cache.

Field AppendOrOverwrite

Writes append new positions and deterministically overwrite already written positions.

ReferenceKvCacheFingerprintUAIX.LmRuntime.Models.Llama 1 member

Computes stable fingerprints for model configurations that own reference key/value cache snapshots.

Method Create(UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Creates a SHA-256 fingerprint from the configuration fields that determine cache geometry and semantics.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.

Returns: The text produced by ReferenceKvCacheFingerprint.Create for this contract: Creates a SHA-256 fingerprint from the configuration fields that determine cache geometry and semantics. The returned string is detached from mutable caller storage and is not persisted by the operation.

IReferenceKvCacheUAIX.LmRuntime.Models.Llama 13 members

Defines a typed, deterministic key/value cache contract for the scalar LLaMA reference runtime.

Property LayerCount

Gets the number of transformer layers.

Property ContextLength

Gets the maximum sequence capacity.

Property KeyValueHeadCount

Gets the number of key/value heads per layer.

Property HeadWidth

Gets the float width of one key/value head.

Property UsedTokenCount

Gets the highest contiguous token position written plus one.

Property ConfigurationFingerprint

Gets the configuration fingerprint required by compatible snapshots.

Property WriteBehavior

Gets the deterministic append-versus-overwrite behavior.

Method Write(int,int,System.ReadOnlySpan<float>,System.ReadOnlySpan<float>)

Appends or replaces one layer's key and value vectors at a sequence position.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
key
The flattened key vector for all key/value heads.
value
The flattened value vector for all key/value heads.
Method GetKey(int,int,int)

Retrieves the key from the current cache state after validating the requested access.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
headIndex
The zero-based head index; it must identify an existing position within the relevant validated range.

Returns: The bounded ReadOnlySpan<float> view produced by IReferenceKvCache.GetKey: Retrieves the key from the current cache state after validating the requested access. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Method GetValue(int,int,int)

Retrieves the value from the current cache state after validating the requested access.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
headIndex
The zero-based head index; it must identify an existing position within the relevant validated range.

Returns: The bounded ReadOnlySpan<float> view produced by IReferenceKvCache.GetValue: Retrieves the value from the current cache state after validating the requested access. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Method Reset

Resets the requested state to its validated initial state without publishing partial state.

Method CreateSnapshot

Creates a bounded snapshot for tiny-fixture testing and replay.

Returns: The immutable cache snapshot, with ownership and disposal obligations defined by the returned type and the CreateSnapshot contract.

Method Restore(UAIX.LmRuntime.Models.Llama.ReferenceKvCacheSnapshot)

Restores the supplied snapshot from a validated persisted representation.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
ReferenceKvCacheSnapshotUAIX.LmRuntime.Models.Llama 9 members

Represents an immutable snapshot of a tiny reference key/value cache.

Property SchemaVersion

Gets the snapshot schema version.

Property ConfigurationFingerprint

Gets the model/configuration fingerprint.

Property LayerCount

Gets the number of layers in the snapshot.

Property ContextLength

Gets the context capacity in the snapshot.

Property KeyValueHeadCount

Gets the key/value head count.

Property HeadWidth

Gets the per-head width.

Property UsedTokenCount

Gets the used token count.

Property Keys

Gets a copy of all key values.

Property Values

Gets a copy of all value values.

ReferenceKvCacheDiagnosticSnapshotUAIX.LmRuntime.Models.Llama 3 members

Represents a bounded, non-mutable diagnostic view of reference cache state.

Property ConfigurationFingerprint

Gets the configuration fingerprint.

Property UsedTokenCount

Gets the used token count.

Property ContentSha256

Gets the SHA-256 of the used key/value prefix.

ReferenceKvCacheUAIX.LmRuntime.Models.Llama 16 members

Stores reference key/value state in two contiguous arrays without per-token dictionaries.

Method ReferenceKvCache(int,int,int,int)

Initializes a reference key/value cache with a geometry-derived compatibility fingerprint.

layerCount
The layer count used to bound this operation; it must be nonnegative and within the supported range.
contextLength
The context length that supplies session-scoped identity and boundary state; it is validated before dependent work begins.
keyValueHeadCount
The number of key/value heads per layer.
headWidth
The positive number of scalar values stored for each attention head in one cache position.
Method ReferenceKvCache(int,int,int,int,string)

Initializes a reference key/value cache with an explicit model/configuration fingerprint.

layerCount
The layer count used to bound this operation; it must be nonnegative and within the supported range.
contextLength
The context length that supplies session-scoped identity and boundary state; it is validated before dependent work begins.
keyValueHeadCount
The number of key/value heads per layer.
headWidth
The positive number of scalar values stored for each attention head in one cache position.
configurationFingerprint
The configuration fingerprint text consumed by ReferenceKvCache.ReferenceKvCache; null, emptiness, length, encoding, identifier, or path rules are enforced as documented, and the value is not persisted by this operation.
Property LayerCount
Property ContextLength
Property KeyValueHeadCount
Property HeadWidth
Property UsedTokenCount
Property ConfigurationFingerprint
Property WriteBehavior
Method Write(int,int,System.ReadOnlySpan<float>,System.ReadOnlySpan<float>)

Writes the supplied layer index to the current cache state using the component's canonical representation.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
key
The key sequence used by this operation; its required length, ordering, and element bounds are validated before access.
value
The value sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method GetKey(int,int,int)

Retrieves the key from the current cache state after validating the requested access.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
headIndex
The zero-based head index; it must identify an existing position within the relevant validated range.

Returns: The bounded ReadOnlySpan<float> view produced by ReferenceKvCache.GetKey: Retrieves the key from the current cache state after validating the requested access. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Method GetValue(int,int,int)

Retrieves the value from the current cache state after validating the requested access.

layerIndex
The zero-based layer index; it must identify an existing position within the relevant validated range.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
headIndex
The zero-based head index; it must identify an existing position within the relevant validated range.

Returns: The bounded ReadOnlySpan<float> view produced by ReferenceKvCache.GetValue: Retrieves the value from the current cache state after validating the requested access. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Method Reset

Resets the reference KV cache contents and logical sequence position to their initial state.

Method CreateSnapshot

Creates the snapshot from the validated inputs required by ReferenceKvCache.

Returns: The ReferenceKvCacheSnapshot result produced by ReferenceKvCache.CreateSnapshot for this contract: Creates the snapshot from the validated inputs required by ReferenceKvCache. It is published only after all documented validation and ownership transitions succeed.

Method CreateDiagnosticSnapshot

Creates a small diagnostic snapshot without exposing mutable key/value arrays.

Returns: The bounded diagnostic snapshot, with ownership and disposal obligations defined by the returned type and the CreateDiagnosticSnapshot contract.

Method Restore(UAIX.LmRuntime.Models.Llama.ReferenceKvCacheSnapshot)

Restores the supplied snapshot from a validated persisted representation.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
ReferenceKvPortableSnapshotUAIX.LmRuntime.Models.Llama 6 members

Carries a deterministic portable key/value-cache snapshot and its compatibility identities.

Property SchemaVersion

Gets the portable schema version.

Property ConfigurationFingerprint

Gets the model-configuration fingerprint.

Property ModelArtifactFingerprint

Gets the optional model-artifact fingerprint.

Property CacheLayoutFingerprint

Gets the cache-layout fingerprint.

Property ContentSha256

Gets the SHA-256 of the serialized bytes preceding the digest field.

Property Snapshot

Gets the restored capacity-shaped snapshot.

ReferenceKvCacheSerializerUAIX.LmRuntime.Models.Llama 5 members

Serializes only logically used key/value positions in stable layer-position-head order.

Schema version two is additive and does not change the in-memory version-one snapshot contract retained for source compatibility. Unused capacity is reconstructed as zero during deserialization.

Field SchemaVersion

Gets the portable snapshot schema version.

Field DefaultMaximumByteCount

Gets the default maximum serialized snapshot size.

Method Serialize(UAIX.LmRuntime.Models.Llama.ReferenceKvCacheSnapshot,string,string,int)

Serializes a bounded cache snapshot in deterministic little-endian form.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.
modelArtifactFingerprint
The stable model-artifact fingerprint that binds the serialized state to the exact reviewed model identity.
cacheLayoutFingerprint
The stable cache-layout fingerprint used to reject state created for incompatible tensor geometry or storage layout.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The serialized snapshot bytes including a trailing SHA-256.

Method Deserialize(System.ReadOnlySpan<byte>,int)

Deserializes and verifies a portable key/value-cache snapshot.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
maximumByteCount
The maximum byte count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The ReferenceKvPortableSnapshot result produced by ReferenceKvCacheSerializer.Deserialize for this contract: Deserializes and verifies a portable key/value-cache snapshot. It is published only after all documented validation and ownership transitions succeed.

Method Restore(UAIX.LmRuntime.Models.Llama.ReferenceKvCache,System.ReadOnlySpan<byte>,string,string)

Restores verified portable bytes into a cache after validating model and layout identities.

cache
The validated ReferenceKvCache dependency consumed by Restore; ownership and lifetime remain with the caller unless this member explicitly documents a transfer.
bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
expectedModelArtifactFingerprint
The expected artifact identity, or an empty string to accept an empty serialized identity.
expectedCacheLayoutFingerprint
The required cache-layout fingerprint against which the serialized state is compared before restoration.
WeightSourceStorageDiagnosticsUAIX.LmRuntime.Models.Llama 7 members

Describes immutable storage used by one deterministic reference weight source.

Property TensorName

Gets the semantic tensor name.

Property StorageType

Gets the GGML physical storage type.

Property DataType

Gets the logical runtime data type.

Property ByteLength

Gets the physical byte length.

Property ManagedCopiedByteCount

Gets the number of bytes copied into persistent managed model-weight storage.

Property IsMemoryMapped

Gets a value indicating whether the source borrows memory-mapped storage.

Property IsAlias

Gets a value indicating whether this source aliases another semantic binding.

IReadOnlyVectorSourceUAIX.LmRuntime.Models.Llama 5 members

Exposes an immutable logical vector without requiring a particular storage representation.

Property Length

Gets the logical vector length.

Property DataType

Gets the logical runtime data type.

Property StorageType

Gets the physical GGML storage type.

Property StorageDiagnostics

Gets immutable storage diagnostics.

Method CopyTo(System.Span<float>)

Copies every vector value into a caller-owned float32 destination.

destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
IReadOnlyMatrixSourceUAIX.LmRuntime.Models.Llama 7 members

Exposes an immutable logical row-major matrix without requiring a particular storage representation.

Property RowCount

Gets the logical row count.

Property ColumnCount

Gets the logical column count.

Property DataType

Gets the logical runtime data type.

Property StorageType

Gets the physical GGML storage type.

Property StorageDiagnostics

Gets immutable storage diagnostics.

Method CopyRowTo(int,System.Span<float>)

Copies and, when required, dequantizes one logical row into a caller-owned float32 destination.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies this matrix by a float32 vector without materializing a complete float32 matrix.

vector
The input vector with at least ColumnCount values.
output
The output buffer with at least RowCount values.
ILlamaLayerWeightSourceUAIX.LmRuntime.Models.Llama 9 members

Exposes immutable weights required by one LLaMA transformer block.

Property AttentionNorm

Gets the attention normalization vector.

Property AttentionQuery

Gets the query projection matrix.

Property AttentionKey

Gets the key projection matrix.

Property AttentionValue

Gets the value projection matrix.

Property AttentionOutput

Gets the attention output projection matrix.

Property FeedForwardNorm

Gets the feed-forward normalization vector.

Property FeedForwardGate

Gets the feed-forward gate projection matrix.

Property FeedForwardUp

Gets the feed-forward up projection matrix.

Property FeedForwardDown

Gets the feed-forward down projection matrix.

ILlamaModelWeightSourceUAIX.LmRuntime.Models.Llama 8 members

Exposes immutable model weights required by the deterministic LLaMA reference session.

Property TokenEmbeddings

Gets the token embedding table.

Property Layers

Gets transformer-block weights in execution order.

Property OutputNorm

Gets the final output normalization vector.

Property OutputProjection

Gets the output projection matrix.

Property UsesTiedOutputProjection

Gets a value indicating whether output projection aliases token embeddings.

Property StorageDiagnostics

Gets storage diagnostics for every distinct semantic source.

Property StorageSummary

Gets a stable summary of physical storage types used by the model.

Property ManagedCopiedByteCount

Gets persistent managed model-weight bytes represented by this source.

ArrayVectorSourceUAIX.LmRuntime.Models.Llama 7 members

Provides an immutable array-backed vector adapter for compatibility and deterministic fixtures.

Method ArrayVectorSource(string,float[])

Initializes a new ArrayVectorSource instance with validated dependencies and operational bounds.

tensorName
The exact ordinal GGUF tensor catalog name used for lookup and diagnostics.
values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Property TensorName

Gets the semantic tensor name.

Property Length
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyTo(System.Span<float>)

Copies the to into caller-owned storage after validating the requested range and capacity.

destination
The destination buffer that receives the produced values.
ArrayMatrixSourceUAIX.LmRuntime.Models.Llama 9 members

Provides an immutable row-major array-backed matrix adapter for compatibility and deterministic fixtures.

Method ArrayMatrixSource(string,float[],int,int)

Initializes a new ArrayMatrixSource instance with validated dependencies and operational bounds.

tensorName
The exact ordinal GGUF tensor catalog name used for lookup and diagnostics.
values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
Property TensorName

Gets the semantic tensor name.

Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
ArrayLlamaLayerWeightSourceUAIX.LmRuntime.Models.Llama 10 members

Provides one array-backed LLaMA layer weight source.

Method ArrayLlamaLayerWeightSource(int,UAIX.LmRuntime.Models.Llama.LlamaReferenceLayerWeights,UAIX.LmRuntime.Models.Llama.LlamaModelConfig)

Initializes a new ArrayLlamaLayerWeightSource instance with validated dependencies and operational bounds.

blockIndex
The zero-based block index; it must identify an existing position within the relevant validated range.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
Property AttentionNorm
Property AttentionQuery
Property AttentionKey
Property AttentionValue
Property AttentionOutput
Property FeedForwardNorm
Property FeedForwardGate
Property FeedForwardUp
Property FeedForwardDown
ArrayLlamaModelWeightSourceUAIX.LmRuntime.Models.Llama 10 members

Adapts the v1.8.0 float-array model to the storage-neutral v1.9.0 execution contracts.

Method Create(UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.LlamaReferenceModelWeights)

Creates an array-backed source after validating its complete model contract.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.

Returns: The array-backed source, with ownership and disposal obligations defined by the returned type and the Create contract.

Method ArrayLlamaModelWeightSource(UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.LlamaReferenceModelWeights)

Initializes a new ArrayLlamaModelWeightSource instance with validated dependencies and operational bounds.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
Property TokenEmbeddings
Property Layers
Property OutputNorm
Property OutputProjection
Property UsesTiedOutputProjection
Property StorageDiagnostics
Property ManagedCopiedByteCount
Property StorageSummary
MappedFloat32VectorSourceUAIX.LmRuntime.Models.Llama 6 members

Reads a float32 vector directly from a mapped GGUF tensor view.

Method MappedFloat32VectorSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedFloat32VectorSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property Length
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyTo(System.Span<float>)

Copies the to into caller-owned storage after validating the requested range and capacity.

destination
The destination buffer that receives the produced values.
MappedFloat32MatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies an F32 matrix directly from a mapped GGUF tensor view.

Method MappedFloat32MatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedFloat32MatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedQ8_0MatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies a Q8_0 matrix directly from a mapped GGUF tensor view.

Method MappedQ8_0MatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedQ8_0MatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedQ4_0MatrixSourceUAIX.LmRuntime.Models.Llama 8 members

Reads and multiplies a Q4_0 matrix directly from a mapped GGUF tensor view.

Method MappedQ4_0MatrixSource(UAIX.LmRuntime.Gguf.MappedTensorView)

Initializes a new MappedQ4_0MatrixSource instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property RowCount
Property ColumnCount
Property DataType
Property StorageType
Property StorageDiagnostics
Method CopyRowTo(int,System.Span<float>)

Copies the row to into caller-owned storage after validating the requested range and capacity.

rowIndex
The zero-based row index; it must identify an existing position within the relevant validated range.
destination
The destination buffer that receives the produced values.
Method Multiply(System.ReadOnlySpan<float>,System.Span<float>)

Multiplies the supplied vector by the supplied vector without changing logical row order.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
MappedMatrixSourceFactoryUAIX.LmRuntime.Models.Llama 1 member

Creates supported matrix sources over mapped tensor views.

Method Create(UAIX.LmRuntime.Gguf.MappedTensorView)

Creates a direct mapped source for supported scalar and quantized storage.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.

Returns: The storage-specific matrix source, with ownership and disposal obligations defined by the returned type and the Create contract.

MappedLlamaLayerWeightSourceUAIX.LmRuntime.Models.Llama 10 members

Exposes one mapped LLaMA transformer block through storage-neutral execution contracts.

Method MappedLlamaLayerWeightSource(UAIX.LmRuntime.Models.Llama.LlamaBoundLayerWeightSet)

Initializes a new MappedLlamaLayerWeightSource instance with validated dependencies and operational bounds.

weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
Property AttentionNorm
Property AttentionQuery
Property AttentionKey
Property AttentionValue
Property AttentionOutput
Property FeedForwardNorm
Property FeedForwardGate
Property FeedForwardUp
Property FeedForwardDown
MappedLlamaModelWeightSourceUAIX.LmRuntime.Models.Llama 10 members

Exposes a complete mapped LLaMA model through storage-neutral execution contracts.

Method Create(UAIX.LmRuntime.Models.Llama.LlamaBoundWeightSet)

Creates and validates a complete mapped model weight source.

weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.

Returns: The validated mapped model weight source, with ownership and disposal obligations defined by the returned type and the Create contract.

Method MappedLlamaModelWeightSource(UAIX.LmRuntime.Models.Llama.LlamaBoundWeightSet)

Initializes a new MappedLlamaModelWeightSource instance with validated dependencies and operational bounds.

weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.
Property TokenEmbeddings
Property Layers
Property OutputNorm
Property OutputProjection
Property UsesTiedOutputProjection
Property StorageDiagnostics
Property ManagedCopiedByteCount

Gets the total number of persistent managed model-weight bytes copied by this source.

Property StorageSummary
LlamaWeightSourceValidatorUAIX.LmRuntime.Models.Llama 1 member

Validates storage-neutral LLaMA weight sources before deterministic execution begins.

Method Validate(UAIX.LmRuntime.Models.Llama.LlamaModelConfig,UAIX.LmRuntime.Models.Llama.ILlamaModelWeightSource)

Validates every global and block-local source against the configured model geometry.

config
The validated LLaMA model configuration defining context capacity, vocabulary size, tensor geometry, and attention dimensions for the operation.
weights
The validated model-weight source or bound weight set consumed read-only by the deterministic reference operation.

UAIX.LmRuntime.Tokenization

45 documented types and 160 documented members.

ChatTemplateRendererUAIX.LmRuntime.Tokenization 1 member

Renders a minimal safe chat template suitable for deterministic tests and initial GGUF tokenizer work.

Method Render(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Renders messages using a small role/content template rather than a general Jinja interpreter.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The text produced by ChatTemplateRenderer.Render for this contract: Renders messages using a small role/content template rather than a general Jinja interpreter. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufTokenizerFingerprintUAIX.LmRuntime.Tokenization 1 member

Computes a deterministic SHA-256 identity for model-facing GGUF tokenizer metadata.

Method Create(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Computes a canonical tokenizer fingerprint without treating decoded text as token-ID authority.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.

Returns: The text produced by GgufTokenizerFingerprint.Create for this contract: Computes a canonical tokenizer fingerprint without treating decoded text as token-ID authority. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufTokenTypeUAIX.LmRuntime.Tokenization 6 members

Identifies the tokenizer token type stored in GGUF metadata.

Field Normal

Normal token text.

Field Unknown

Unknown token.

Field Control

Control token.

Field UserDefined

User-defined token.

Field Unused

Unused token slot.

Field Byte

Byte-fallback token.

GgufTokenUAIX.LmRuntime.Tokenization 4 members

Represents one GGUF vocabulary token.

Property TokenId

Gets the token identifier used by model embedding rows.

Property Text

Gets the raw token text from the GGUF vocabulary.

Property Score

Gets the tokenizer score associated with the token.

Property Type

Gets the token type associated with the token.

GgufSpecialTokenMapUAIX.LmRuntime.Tokenization 6 members

Represents special token identifiers resolved from GGUF metadata.

Property BosTokenId

Gets the beginning-of-sequence token identifier.

Property EosTokenId

Gets the end-of-sequence token identifier.

Property UnknownTokenId

Gets the unknown token identifier.

Property SeparatorTokenId

Gets the separator token identifier.

Property PaddingTokenId

Gets the padding token identifier.

Method EnumerateKnownTokenIds

Enumerates the known token identifiers in stable source order without exposing mutable internal storage.

Returns: An ordered sequence containing the known token identifiers as produced by the validated operation.

GgufTokenizerMetadataUAIX.LmRuntime.Tokenization 21 members

Captures tokenizer metadata loaded from a GGUF model.

Property TokenizerModel

Gets the tokenizer model name from GGUF metadata.

Property PreTokenizer

Gets the tokenizer pre-tokenizer name, when present.

Property Tokens

Gets the vocabulary tokens indexed by token identifier.

Property Merges

Gets the BPE merge rules from GGUF metadata.

Property AddedTokens

Gets the added tokens from GGUF metadata.

Property SourceScoreCount

Gets the source score-array length, or zero when the metadata key was absent.

Property SourceTokenTypeCount

Gets the source token-type-array length, or zero when the metadata key was absent.

Property ScoresPresent

Gets a value indicating whether tokenizer.ggml.scores was present.

Property TokenTypesPresent

Gets a value indicating whether tokenizer.ggml.token_type was present.

Property PrecompiledCharsMap

Gets the optional precompiled SentencePiece normalization character map.

Property SpecialTokens

Gets the special token identifiers.

Property AddBos

Gets whether model-defined BOS insertion is enabled.

Property AddEos

Gets whether model-defined EOS insertion is enabled.

Property AddSeparator

Gets whether model-defined separator insertion is enabled.

Property AddSpacePrefix

Gets whether a leading space prefix is added before text fragments.

Property EscapeWhitespaces

Gets whether whitespace characters are escaped using SentencePiece whitespace notation.

Property RemoveExtraWhitespaces

Gets whether tokenizer-specific extra whitespace removal is enabled.

Property CleanSpaces

Gets whether detokenization should clean spaces around punctuation.

Property ChatTemplate

Gets the chat template from GGUF metadata, when present.

Property HuggingFaceTokenizerJson

Gets the embedded Hugging Face tokenizer JSON, when present.

Property VocabularySize

Gets the effective vocabulary size from the token array.

TokenizationOptionsUAIX.LmRuntime.Tokenization 6 members

Describes tokenization behavior for one encode operation.

Property AddSpecialTokens

Gets whether model-defined special tokens should be added.

Property ParseSpecialTokens

Gets whether raw special-token text should be parsed as special tokens.

Property OverrideAddBos

Gets an optional override for BOS insertion.

Property OverrideAddEos

Gets an optional override for EOS insertion.

Property EmitTrace

Gets whether content-minimized trace data should be emitted for parity diagnostics.

Property InvalidUtf16Policy

Gets the policy for invalid UTF-16 surrogate sequences.

DetokenizationOptionsUAIX.LmRuntime.Tokenization 3 members

Describes detokenization behavior for one decode operation.

Property RemoveSpecialTokens

Gets whether special tokens should be removed from decoded text.

Property UnparseSpecialTokens

Gets whether special tokens should be emitted as their raw token text.

Property CleanSpaces

Gets whether tokenizer-specific space cleanup should be applied.

MetadataDrivenGgufTokenizerDetokenizationOptionsUAIX.LmRuntime.Tokenization 4 members

Provides a stable LocalEndpoint-facing name for metadata-driven GGUF detokenization controls.

This compatibility type mirrors without inheritance because the canonical options type is sealed. It allows integration code to use a descriptive contract while the tokenizer retains one canonical internal representation.

Property RemoveSpecialTokens

Gets whether special tokens should be removed from decoded text.

Property UnparseSpecialTokens

Gets whether special tokens should be emitted as their raw token text.

Property CleanSpaces

Gets whether tokenizer-specific space cleanup should be applied.

Method ToDetokenizationOptions

Creates the canonical detokenization options consumed by the tokenizer engine.

Returns: A new canonical options instance with the same behavior flags.

TokenizationResultUAIX.LmRuntime.Tokenization 2 members

Represents the output of a tokenizer encode operation.

Property TokenIds

Gets the emitted token identifiers.

Property Trace

Gets optional content-minimized events used for tokenizer parity diagnostics.

IGgufTokenizerUAIX.LmRuntime.Tokenization 3 members

Encodes and decodes text for a GGUF-backed model.

Property Metadata

Gets the tokenizer metadata used by this tokenizer.

Method Encode(string,UAIX.LmRuntime.Tokenization.TokenizationOptions)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
options
The optional TokenizationOptions controlling Encode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The TokenizationResult result produced by IGgufTokenizer.Encode for this contract: Encodes the supplied text with the configured tokenizer and validated special-token policy. It is published only after all documented validation and ownership transitions succeed.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

GgufTokenizerMetadataValidationResultUAIX.LmRuntime.Tokenization 2 members

Represents tokenizer metadata validation output.

Property Diagnostics

Gets validation diagnostics.

Property IsValid

Gets a value indicating whether no diagnostics were emitted.

GgufTokenizerMetadataReaderUAIX.LmRuntime.Tokenization 2 members

Builds tokenizer metadata from a parsed GGUF artifact.

Method Read(UAIX.LmRuntime.Gguf.GgufModel)

Reads tokenizer metadata without throwing for validation failures.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The GgufTokenizerMetadata result produced by GgufTokenizerMetadataReader.Read for this contract: Reads tokenizer metadata without throwing for validation failures. It is published only after all documented validation and ownership transitions succeed.

Method ReadStrict(UAIX.LmRuntime.Gguf.GgufModel)

Reads tokenizer metadata and throws when validation fails.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The GgufTokenizerMetadata result produced by GgufTokenizerMetadataReader.ReadStrict for this contract: Reads tokenizer metadata and throws when validation fails. It is published only after all documented validation and ownership transitions succeed.

GgufTokenizerMetadataValidatorUAIX.LmRuntime.Tokenization 1 member

Validates GGUF tokenizer metadata before runtime use.

Method Validate(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Validates the supplied metadata against the invariants required by GgufTokenizerMetadataValidator.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.

Returns: The GgufTokenizerMetadataValidationResult result produced by GgufTokenizerMetadataValidator.Validate for this contract: Validates the supplied metadata against the invariants required by GgufTokenizerMetadataValidator. It is published only after all documented validation and ownership transitions succeed.

InvalidGgufTokenizerExceptionUAIX.LmRuntime.Tokenization 1 member

Thrown when GGUF tokenizer metadata is invalid.

Method InvalidGgufTokenizerException(string)

Initializes a new InvalidGgufTokenizerException instance with validated dependencies and operational bounds.

message
The display-safe diagnostic message describing the failure without embedding prompt text, generated text, credentials, or private file contents.
UnsupportedTokenizerExceptionUAIX.LmRuntime.Tokenization 1 member

Thrown when a GGUF tokenizer family is not supported.

Method UnsupportedTokenizerException(string)

Initializes a new UnsupportedTokenizerException instance with validated dependencies and operational bounds.

message
The display-safe diagnostic message describing the failure without embedding prompt text, generated text, credentials, or private file contents.
Gpt2BpeTokenizerEngineUAIX.LmRuntime.Tokenization 4 members

Implements GPT-2 byte-level BPE from GGUF vocabulary and merge metadata.

Method Gpt2BpeTokenizerEngine(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Initializes a GPT-2 BPE engine from validated GGUF tokenizer metadata.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
Property Name
Method EncodeRaw(string,UAIX.LmRuntime.Tokenization.TokenizerFragmentContext,System.Collections.Generic.IList<int>,System.Collections.Generic.IList<string>)

Encodes the raw with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
context
The context that supplies session-scoped identity and boundary state; it is validated before dependent work begins.
destination
The destination buffer that receives the produced values.
trace
The trace sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

MetadataDrivenGgufTokenizerUAIX.LmRuntime.Tokenization 12 members

Executes a GGUF tokenizer by combining special-token partitioning with a family-specific tokenizer engine.

Real GGUF execution never falls back to whitespace tokenization. Unsupported tokenizer families fail during construction so token identifiers cannot silently diverge from the model embedding table.

Method MetadataDrivenGgufTokenizer(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Initializes a tokenizer from validated GGUF metadata.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
Method MetadataDrivenGgufTokenizer(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata,UAIX.LmRuntime.Tokenization.IGgufTokenizerEngine)

Initializes a tokenizer with an explicitly selected family engine.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
engine
The validated IGgufTokenizerEngine dependency consumed by MetadataDrivenGgufTokenizer; ownership and lifetime remain with the caller unless this member explicitly documents a transfer.
Property Name
Property Metadata
Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text to process using the configured encoding and normalization rules.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Encode(string,UAIX.LmRuntime.Tokenization.TokenizationOptions)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
options
The optional TokenizationOptions controlling Encode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The TokenizationResult result produced by MetadataDrivenGgufTokenizer.Encode for this contract: Encodes the supplied text with the configured tokenizer and validated special-token policy. It is published only after all documented validation and ownership transitions succeed.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.MetadataDrivenGgufTokenizerDetokenizationOptions)

Decodes model token identifiers using the stable metadata-driven compatibility options contract.

tokenIds
The token identifiers to process in sequence order.
options
The optional MetadataDrivenGgufTokenizerDetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

text
The text to process using the configured encoding and normalization rules.

Returns: The int value computed by MetadataDrivenGgufTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by MetadataDrivenGgufTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. It is published only after all documented validation and ownership transitions succeed.

SentencePieceBpeTokenizerEngineUAIX.LmRuntime.Tokenization 4 members

Implements the SentencePiece-BPE execution path used by LLaMA-style GGUF vocabularies.

Method SentencePieceBpeTokenizerEngine(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Initializes the engine from validated GGUF tokenizer metadata.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
Property Name
Method EncodeRaw(string,UAIX.LmRuntime.Tokenization.TokenizerFragmentContext,System.Collections.Generic.IList<int>,System.Collections.Generic.IList<string>)

Encodes the raw with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
context
The context that supplies session-scoped identity and boundary state; it is validated before dependent work begins.
destination
The destination buffer that receives the produced values.
trace
The trace sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

SpecialTokenFragmentKindUAIX.LmRuntime.Tokenization 2 members

Identifies the type of fragment emitted by special-token partitioning.

Field RawText

A raw text fragment that must be processed by the tokenizer engine.

Field Token

A pre-resolved token identifier fragment.

SpecialTokenFragmentUAIX.LmRuntime.Tokenization 7 members

Represents one fragment emitted by special-token partitioning.

Property Kind

Gets the fragment kind.

Property Text

Gets the raw text fragment.

Property TokenId

Gets the token identifier for token fragments.

Property Offset

Gets the character offset in the source text.

Property Length

Gets the fragment length in source text characters.

Method Raw(string,int)

Creates a raw-text fragment representing an unmodified source slice at the supplied offset.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
offset
The zero-based offset into the relevant source or destination; range validation occurs before access.

Returns: The SpecialTokenFragment result produced by SpecialTokenFragment.Raw for this contract: Creates a raw-text fragment representing an unmodified source slice at the supplied offset. It is published only after all documented validation and ownership transitions succeed.

Method Token(int,string,int)

Creates a special-token fragment at the supplied source-text offset.

tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
offset
The zero-based offset into the relevant source or destination; range validation occurs before access.

Returns: The SpecialTokenFragment result produced by SpecialTokenFragment.Token for this contract: Creates a special-token fragment at the supplied source-text offset. It is published only after all documented validation and ownership transitions succeed.

SpecialTokenPartitionerUAIX.LmRuntime.Tokenization 1 member

Partitions raw text around tokenizer special tokens before normal tokenization.

Method Partition(string,UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata,bool)

Partitions text around known special tokens using longest-token-first matching.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
metadata
The metadata containing validated format or tokenizer metadata required by this operation.
parseSpecial
Whether control and unknown tokens should be parsed as special tokens.

Returns: An ordered read-only IReadOnlyList<SpecialTokenFragment> result from SpecialTokenPartitioner.Partition: Partitions text around known special tokens using longest-token-first matching. Mutable internal collection aliases are not exposed through the returned contract.

TokenizerFragmentContextUAIX.LmRuntime.Tokenization 2 members

Describes the position of one raw-text fragment within special-token partitioning.

Property IsFirstFragment

Gets a value indicating whether this is the first raw-text fragment in the input.

Property PreviousFragmentWasSpecial

Gets a value indicating whether the immediately preceding fragment was a special token.

IGgufTokenizerEngineUAIX.LmRuntime.Tokenization 3 members

Defines a family-specific tokenizer engine that operates after special-token partitioning.

Property Name

Gets the stable engine name.

Method EncodeRaw(string,UAIX.LmRuntime.Tokenization.TokenizerFragmentContext,System.Collections.Generic.IList<int>,System.Collections.Generic.IList<string>)

Encodes a raw-text fragment into model token identifiers.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
context
The context that supplies session-scoped identity and boundary state; it is validated before dependent work begins.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
trace
The trace sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

BpeMergeRuleUAIX.LmRuntime.Tokenization 3 members

Represents one parsed BPE merge rule.

Property Left

Gets the left symbol.

Property Right

Gets the right symbol.

Method TryParse(string,UAIX.LmRuntime.Tokenization.BpeMergeRule&)

Attempts to parse the Boolean result while reporting invalid input without a successful result.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
rule
When the method returns, contains the rule produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the rule contains two non-empty symbols.

GgufPreTokenizerRegistryUAIX.LmRuntime.Tokenization 2 members

Provides a conservative allow-list for tokenizer.ggml.pre values implemented by this build.

Method IsSupported(string)

Determines whether a pre-tokenizer identifier is supported.

name
The exact ordinal name used for catalog lookup, canonical hashing, or diagnostic labeling as defined by the containing member.

Returns: True when the identifier is absent or explicitly supported.

Method GetSupportedNames

Retrieves the supported names from the configured tokenizer after validating the requested access.

Returns: An ordered read-only IReadOnlyList<string> result from GgufPreTokenizerRegistry.GetSupportedNames: Retrieves the supported names from the configured tokenizer after validating the requested access. Mutable internal collection aliases are not exposed through the returned contract.

GgufTokenizerEngineFactoryUAIX.LmRuntime.Tokenization 1 member

Selects a concrete tokenizer engine from validated GGUF tokenizer metadata.

Method Create(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Creates the GGUF tokenizer engine from the validated inputs required by GgufTokenizerEngineFactory.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.

Returns: The concrete tokenizer engine, with ownership and disposal obligations defined by the returned type and the Create contract.

IGgufTokenizerFactoryUAIX.LmRuntime.Tokenization 1 member

Creates tokenizer instances from GGUF tokenizer metadata.

Method Create(UAIX.LmRuntime.Gguf.GgufModel)

Creates a tokenizer for a parsed GGUF model.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The tokenizer selected from metadata, with ownership and disposal obligations defined by the returned type and the Create contract.

GgufTokenizerFactoryUAIX.LmRuntime.Tokenization 2 members

Creates strict, metadata-routed tokenizers for parsed GGUF artifacts.

Method Create(UAIX.LmRuntime.Gguf.GgufModel)

Creates the tokenizer from the validated inputs required by GgufTokenizerFactory.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The ITokenizer result produced by GgufTokenizerFactory.Create for this contract: Creates the tokenizer from the validated inputs required by GgufTokenizerFactory. It is published only after all documented validation and ownership transitions succeed.

Method CreateStrict(UAIX.LmRuntime.Gguf.GgufModel)

Creates a tokenizer after strict GGUF tokenizer metadata validation.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The selected tokenizer, with ownership and disposal obligations defined by the returned type and the CreateStrict contract.

SentencePieceGgufTokenizerUAIX.LmRuntime.Tokenization 10 members

Executes the SentencePiece-BPE tokenizer path used by LLaMA-style GGUF artifacts.

Method SentencePieceGgufTokenizer(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Initializes the tokenizer from validated GGUF metadata.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
Property Name
Property Metadata
Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text to process using the configured encoding and normalization rules.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Encode(string,UAIX.LmRuntime.Tokenization.TokenizationOptions)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
options
The optional TokenizationOptions controlling Encode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The TokenizationResult result produced by SentencePieceGgufTokenizer.Encode for this contract: Encodes the supplied text with the configured tokenizer and validated special-token policy. It is published only after all documented validation and ownership transitions succeed.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

text
The text to process using the configured encoding and normalization rules.

Returns: The int value computed by SentencePieceGgufTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by SentencePieceGgufTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. It is published only after all documented validation and ownership transitions succeed.

Gpt2BpeTokenizerUAIX.LmRuntime.Tokenization 10 members

Executes the GPT-2 byte-level BPE tokenizer path from GGUF vocabulary and merge metadata.

Method Gpt2BpeTokenizer(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Initializes the tokenizer from validated GGUF metadata.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.
Property Name
Property Metadata
Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text to process using the configured encoding and normalization rules.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Encode(string,UAIX.LmRuntime.Tokenization.TokenizationOptions)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
options
The optional TokenizationOptions controlling Encode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The TokenizationResult result produced by Gpt2BpeTokenizer.Encode for this contract: Encodes the supplied text with the configured tokenizer and validated special-token policy. It is published only after all documented validation and ownership transitions succeed.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method Decode(System.Collections.Generic.IReadOnlyList<int>,UAIX.LmRuntime.Tokenization.DetokenizationOptions)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.
options
The optional DetokenizationOptions controlling Decode; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

text
The text to process using the configured encoding and normalization rules.

Returns: The int value computed by Gpt2BpeTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by Gpt2BpeTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. It is published only after all documented validation and ownership transitions succeed.

RwkvWorldTokenizerUAIX.LmRuntime.Tokenization 6 members

Marks the RWKV tokenizer family as an explicit unsupported boundary until a dedicated engine is implemented.

Property Name
Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text to process using the configured encoding and normalization rules.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

text
The text to process using the configured encoding and normalization rules.

Returns: The int value computed by RwkvWorldTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by RwkvWorldTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. It is published only after all documented validation and ownership transitions succeed.

HuggingFaceTokenizerJsonAdapterUAIX.LmRuntime.Tokenization 1 member

Provides an optional seam for embedded Hugging Face tokenizer JSON metadata.

Method Create(string)

Creates a tokenizer from embedded tokenizer JSON metadata when supported.

json
The json text consumed by HuggingFaceTokenizerJsonAdapter.Create; null, emptiness, length, encoding, identifier, or path rules are enforced as documented, and the value is not persisted by this operation.

Returns: A tokenizer instance, with ownership and disposal obligations defined by the returned type and the Create contract.

ChatTemplateConformanceSuiteUAIX.LmRuntime.Tokenization 1 member

Runs chat-template conformance checks against rendered message sequences.

Method RenderAndValidate(string,System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.Contracts.LlmMessage>)

Renders and validates a chat template against a message sequence.

template
The template text. The current safe subset ignores arbitrary code.
messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenizerParityReport result produced by ChatTemplateConformanceSuite.RenderAndValidate for this contract: Renders and validates a chat template against a message sequence. It is published only after all documented validation and ownership transitions succeed.

SpecialTokenMapUAIX.LmRuntime.Tokenization 4 members

Represents model special-token identities.

Property BeginningOfSequence

Gets the beginning-of-sequence token identifier.

Property EndOfSequence

Gets the end-of-sequence token identifier.

Property Padding

Gets the padding token identifier.

Property Unknown

Gets the unknown token identifier.

TokenBudgetTruncatorUAIX.LmRuntime.Tokenization 1 member

Truncates message sequences by token budget.

Method TruncateMessages(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>,UAIX.LmRuntime.Abstractions.ITokenizer,int)

Truncates messages so the total token count does not exceed the budget.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.
tokenizer
The validated ITokenizer dependency consumed by TruncateMessages; ownership and lifetime remain with the caller unless this member explicitly documents a transfer.
maxTokens
The numeric max tokens consumed by TruncateMessages; it must satisfy the member's documented range, geometry, and finite-value requirements.

Returns: An ordered read-only IReadOnlyList<LlmMessage> result from TokenBudgetTruncator.TruncateMessages: Truncates messages so the total token count does not exceed the budget. Mutable internal collection aliases are not exposed through the returned contract.

TokenizerGoldenCorpusUAIX.LmRuntime.Tokenization 1 member

Loads tokenizer golden corpora.

Method Load(string)

Loads ordered tokenizer golden record collection from a verified local source into TokenizerGoldenCorpus.

json
The json text consumed by TokenizerGoldenCorpus.Load; null, emptiness, length, encoding, identifier, or path rules are enforced as documented, and the value is not persisted by this operation.

Returns: An ordered read-only IReadOnlyList<TokenizerGoldenRecord> result from TokenizerGoldenCorpus.Load: Loads ordered tokenizer golden record collection from a verified local source into TokenizerGoldenCorpus. Mutable internal collection aliases are not exposed through the returned contract.

TokenizerGoldenRecordUAIX.LmRuntime.Tokenization 2 members

Represents one tokenizer golden record.

Property Text

Gets the source text.

Property ExpectedTokenIds

Gets expected token identifiers.

TokenizerParityReportUAIX.LmRuntime.Tokenization 1 member

Represents tokenizer parity diagnostics.

Property Mismatches

Gets tokenizer mismatches.

InvalidUtf16PolicyUAIX.LmRuntime.Tokenization 2 members

Defines how tokenizer entry points handle invalid UTF-16 surrogate sequences.

Field Reject

Rejects invalid UTF-16 before tokenizer-specific normalization or segmentation.

Field Replace

Replaces each invalid surrogate code unit with the Unicode replacement character.

TokenizerTextSafetyUAIX.LmRuntime.Tokenization 1 member

Validates and normalizes managed strings before tokenizer-specific processing.

Method NormalizeUtf16(string,UAIX.LmRuntime.Tokenization.InvalidUtf16Policy)

Validates a managed string and optionally replaces unpaired surrogate code units.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
policy
The policy that define validation limits and execution behavior; required values are checked before use.

Returns: The original string when valid, or a normalized replacement string when requested.

StreamingUtf8TokenDecoderUAIX.LmRuntime.Tokenization 2 members

Incrementally decodes byte-token payloads without corrupting UTF-8 sequences split across token boundaries.

Method Decode(System.ReadOnlySpan<byte>,bool)

Decodes one byte fragment and retains incomplete UTF-8 state for the next call.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
flush
The flush flag selecting the documented branch of Decode; it does not grant authority beyond this in-memory operation.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method Reset

Resets the requested state to its validated initial state without publishing partial state.

TokenizerVocabularyReconciliationResultUAIX.LmRuntime.Tokenization 3 members

Describes a consistency check between GGUF vocabulary order and embedded Hugging Face tokenizer JSON.

Property IsConsistent

Gets whether the embedded tokenizer JSON is absent or consistent with GGUF token identifiers.

Property EmbeddedJsonPresent

Gets whether embedded tokenizer JSON was present.

Property Diagnostics

Gets bounded deterministic diagnostics.

TokenizerVocabularyReconcilerUAIX.LmRuntime.Tokenization 1 member

Reconciles embedded Hugging Face vocabulary identifiers against authoritative GGUF token-array order.

Method Reconcile(UAIX.LmRuntime.Tokenization.GgufTokenizerMetadata)

Validates embedded tokenizer JSON without allowing it to reorder GGUF token identifiers.

metadata
The metadata containing validated format or tokenizer metadata required by this operation.

Returns: The TokenizerVocabularyReconciliationResult result produced by TokenizerVocabularyReconciler.Reconcile for this contract: Validates embedded tokenizer JSON without allowing it to reorder GGUF token identifiers. It is published only after all documented validation and ownership transitions succeed.

WhitespaceTokenizerUAIX.LmRuntime.Tokenization 6 members

Provides a deterministic tokenizer for tests, examples, and fallback token budgeting.

Property Name
Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text to process using the configured encoding and normalization rules.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes the supplied text with the configured tokenizer and validated special-token policy.

text
The text to process using the configured encoding and normalization rules.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes the supplied token sequence with the configured tokenizer while preserving sequence order.

tokenIds
The token identifiers to process in sequence order.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

text
The text to process using the configured encoding and normalization rules.

Returns: The int value computed by WhitespaceTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts the tokens using the same deterministic rules as the corresponding processing operation.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by WhitespaceTokenizer.CountTokens for this contract: Counts the tokens using the same deterministic rules as the corresponding processing operation. It is published only after all documented validation and ownership transitions succeed.

UAIX.LmRuntime.Gguf

32 documented types and 148 documented members.

GgufArtifactKindUAIX.LmRuntime.Gguf 5 members

Identifies the likely role of a GGUF artifact.

Field BaseModel

Base model artifact.

Field VocabOnly

Vocabulary-only artifact.

Field LoraSidecar

LoRA sidecar artifact.

Field MultimodalProjectorSidecar

Multimodal projector sidecar artifact.

Field MultiTokenPredictionSidecar

Multi-token-prediction sidecar artifact.

GgufByteOrderUAIX.LmRuntime.Gguf 3 members

Identifies the byte order used by a GGUF artifact.

Field Auto

Detect the byte order from the version field and reject ambiguous headers.

Field LittleEndian

Interpret multi-byte values as little-endian.

Field BigEndian

Interpret multi-byte values as big-endian.

GgufDiagnosticUAIX.LmRuntime.Gguf 3 members

Represents a structured GGUF parser diagnostic.

Property Code

Gets the stable diagnostic code.

Property Message

Gets the diagnostic message.

Property ByteOffset

Gets the byte offset associated with the diagnostic when known.

GgufParseResultUAIX.LmRuntime.Gguf 3 members

Represents a non-throwing GGUF parse result.

Property Model

Gets the parsed model when parsing succeeded.

Property Diagnostics

Gets parser diagnostics.

Property Succeeded

Gets a value indicating whether parsing succeeded without diagnostics.

GgufFormatExceptionUAIX.LmRuntime.Gguf 3 members

Represents a GGUF format violation with a stable code and byte offset.

Method GgufFormatException(string,string,System.Nullable<ulong>,System.Exception)

Initializes a new GgufFormatException instance with validated dependencies and operational bounds.

code
The stable machine-readable diagnostic code used to classify the failure without relying on localized message text.
message
The display-safe diagnostic message describing the failure without embedding prompt text, generated text, credentials, or private file contents.
byteOffset
The zero-based byte offset into the relevant source or destination; range validation occurs before access.
innerException
The underlying exception preserved for diagnostic chaining, or null when no lower-level failure is available.
Property Code

Gets the stable diagnostic code.

Property ByteOffset

Gets the byte offset associated with the violation when known.

GgufDumpWriterUAIX.LmRuntime.Gguf 1 member

Writes human-readable GGUF metadata and tensor catalog dumps.

Method Write(UAIX.LmRuntime.Gguf.GgufModel)

Creates a textual dump for a parsed GGUF model.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The text produced by GgufDumpWriter.Write for this contract: Creates a textual dump for a parsed GGUF model. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufMetadataAccessorsUAIX.LmRuntime.Gguf 6 members

Provides strongly typed accessors for GGUF metadata values.

Method TryGetBoolean(UAIX.LmRuntime.Gguf.GgufModel,string,bool&)

Tries to read a Boolean metadata value.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
value
When the method returns, contains the value produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains a Boolean value.

Method TryGetStringArray(UAIX.LmRuntime.Gguf.GgufModel,string,System.Collections.Generic.IReadOnlyList<string>&)

Tries to read a metadata string array.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
values
When the method returns, contains the values produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains only string elements.

Method TryGetSingleArray(UAIX.LmRuntime.Gguf.GgufModel,string,System.Collections.Generic.IReadOnlyList<float>&)

Tries to read a metadata single-precision floating point array.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
values
When the method returns, contains the values produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains numeric elements convertible to float.

Method TryGetInt32Array(UAIX.LmRuntime.Gguf.GgufModel,string,System.Collections.Generic.IReadOnlyList<int>&)

Tries to read a metadata 32-bit signed integer array.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
values
When the method returns, contains the values produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains integral elements convertible to int.

Method TryGetByteArray(UAIX.LmRuntime.Gguf.GgufModel,string,System.Collections.Generic.IReadOnlyList<byte>&)

Tries to read a metadata byte array.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
values
When the method returns, contains the values produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains integral elements convertible to bytes.

Method TryGetInt32(UAIX.LmRuntime.Gguf.GgufModel,string,int&)

Tries to read a metadata 32-bit signed integer scalar.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
value
When the method returns, contains the value produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains an integral value convertible to int.

GgufMetadataValueUAIX.LmRuntime.Gguf 4 members

Represents a typed GGUF metadata value.

Property Type

Gets the metadata value type.

Property Value

Gets the scalar value or a for arrays.

Method Create(UAIX.LmRuntime.Gguf.GgufMetadataValueType,object)

Creates the GGUF metadata value from the validated inputs required by GgufMetadataValue.

type
The type containing validated format or tokenizer metadata required by this operation.
value
The value input of type object? read by GgufMetadataValue.Create; it must satisfy the member-specific nullability, identity, range, and ownership rules before dependent work begins.

Returns: The metadata value, with ownership and disposal obligations defined by the returned type and the Create contract.

Method ToString

Returns the string representation of this value.

Returns: The text produced by GgufMetadataValue.ToString for this contract: Returns the string representation of this value. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufMetadataArrayUAIX.LmRuntime.Gguf 3 members

Represents a typed GGUF metadata array.

Property ElementType

Gets the element type.

Property Items

Gets the array values.

Method ToString

Returns a concise description of this array.

Returns: The text produced by GgufMetadataArray.ToString for this contract: Returns a concise description of this array. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufMetadataValueTypeUAIX.LmRuntime.Gguf 13 members

Identifies GGUF metadata value types.

Field UInt8

8-bit unsigned integer.

Field Int8

8-bit signed integer.

Field UInt16

16-bit unsigned integer.

Field Int16

16-bit signed integer.

Field UInt32

32-bit unsigned integer.

Field Int32

32-bit signed integer.

Field Float32

32-bit floating point.

Field Bool

Boolean.

Field String

UTF-8 string.

Field Array

Typed array.

Field UInt64

64-bit unsigned integer.

Field Int64

64-bit signed integer.

Field Float64

64-bit floating point.

GgufModelUAIX.LmRuntime.Gguf 15 members

Represents a parsed GGUF artifact catalog and metadata dictionary.

Property Path

Gets the source file path.

Property Version

Gets the GGUF format version.

Property ByteOrder

Gets the resolved byte order used by the artifact.

Property FileLength

Gets the source file length captured during parsing.

Property Metadata

Gets the parsed metadata dictionary.

Property Tensors

Gets the tensor catalog.

Property TensorDataOffset

Gets the absolute tensor-data section start offset.

Property Alignment

Gets the tensor-data alignment.

Property Shard

Gets shard metadata.

Property ArtifactKind

Gets the inferred artifact kind.

Method Load(string,UAIX.LmRuntime.Gguf.GgufParseOptions)

Loads the GGUF model from a verified local source into GgufModel.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional GgufParseOptions controlling Load; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The parsed model catalog, with ownership and disposal obligations defined by the returned type and the Load contract.

Method TryGetString(string,string&)

Tries to get a string metadata value.

key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
value
When the method returns, contains the value produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and contains a string.

Method TryGetUInt32(string,uint&)

Tries to get an unsigned 32-bit metadata value.

key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
value
When the method returns, contains the value produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and can be converted to an unsigned 32-bit integer.

Method TryGetSingle(string,float&)

Tries to get a single-precision metadata value.

key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.
value
When the method returns, contains the value produced by the operation when successful; otherwise contains the type's default value.

Returns: True when the key exists and can be converted to a single-precision value.

Method TryGetTensor(string,UAIX.LmRuntime.Gguf.GgufTensorDescriptor&)

Tries to resolve a tensor descriptor by its exact GGUF name.

name
The exact ordinal name used for catalog lookup, canonical hashing, or diagnostic labeling as defined by the containing member.
tensor
When the method returns, contains the tensor produced by the operation when successful; otherwise contains the type's default value.

Returns: True when try get tensor succeeds for the supplied values; otherwise, false.

GgufParseOptionsUAIX.LmRuntime.Gguf 7 members

Defines safety limits for GGUF parsing.

Property ByteOrder

Gets the requested GGUF byte order. Auto uses the version-field heuristic.

Property MaxTensorCount

Gets the maximum supported tensor count.

Property MaxMetadataCount

Gets the maximum supported metadata key-value count.

Property MaxDimensionCount

Gets the maximum supported dimension count per tensor.

Property MaxStringBytes

Gets the maximum metadata string byte length.

Property MaxArrayLength

Gets the maximum metadata array length.

Property MaxArrayDepth

Gets the maximum recursive array depth.

GgufReaderUAIX.LmRuntime.Gguf 2 members

Reads and validates GGUF model artifacts.

Method Read(string,UAIX.LmRuntime.Gguf.GgufParseOptions)

Reads a GGUF artifact from disk without copying tensor payloads to managed memory.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional GgufParseOptions controlling Read; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The GgufModel result produced by GgufReader.Read for this contract: Reads a GGUF artifact from disk without copying tensor payloads to managed memory. It is published only after all documented validation and ownership transitions succeed.

Method TryRead(string,UAIX.LmRuntime.Gguf.GgufParseOptions)

Parses a GGUF artifact and returns structured diagnostics instead of throwing for format failures.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional GgufParseOptions controlling TryRead; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: True when the GGUF parse result is produced successfully; otherwise, false and no successful result is published.

GgufShardInfoUAIX.LmRuntime.Gguf 3 members

Describes sharding metadata found in a GGUF artifact.

Property ShardIndex

Gets the shard index when present.

Property ShardCount

Gets the shard count when present.

Property IsSharded

Gets a value indicating whether sharding metadata was found.

GgufTensorDescriptorUAIX.LmRuntime.Gguf 7 members

Describes a tensor stored in a GGUF artifact without copying its payload.

Property Name

Gets the tensor name.

Property Dimensions

Gets tensor dimensions in GGUF order.

Property GgmlType

Gets the GGML storage type.

Property ElementCount

Gets the logical element count.

Property RelativeOffset

Gets the relative tensor offset from the tensor data section.

Property AbsoluteOffset

Gets the absolute file offset for this tensor.

Property ByteLength

Gets the physical tensor byte length.

GgufValidationReportUAIX.LmRuntime.Gguf 3 members

Represents the result of strict GGUF validation.

Property IsValid

Gets a value indicating whether validation succeeded.

Property Errors

Gets validation errors with byte offsets when known.

Property Model

Gets the parsed model when validation succeeded.

GgufValidationErrorUAIX.LmRuntime.Gguf 3 members

Represents a GGUF validation error.

Property Code

Gets the diagnostic code.

Property Message

Gets the diagnostic message.

Property ByteOffset

Gets the byte offset associated with the error, if known.

GgufStrictValidatorUAIX.LmRuntime.Gguf 1 member

Validates GGUF artifacts without claiming execution parity.

Method Validate(string,UAIX.LmRuntime.Gguf.GgufParseOptions)

Validates the supplied path and the supplied options against the invariants required by GgufStrictValidator.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional GgufParseOptions controlling Validate; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: The GgufValidationReport result produced by GgufStrictValidator.Validate for this contract: Validates the supplied path and the supplied options against the invariants required by GgufStrictValidator. It is published only after all documented validation and ownership transitions succeed.

GgufShardResolverUAIX.LmRuntime.Gguf 1 member

Resolves GGUF shard files adjacent to a root artifact.

Method ResolveShards(string)

Resolves likely shard paths for a GGUF model path.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.

Returns: An ordered read-only IReadOnlyList<string> result from GgufShardResolver.ResolveShards: Resolves likely shard paths for a GGUF model path. Mutable internal collection aliases are not exposed through the returned contract.

GgufTensorPayloadViewUAIX.LmRuntime.Gguf 3 members

Represents a zero-copy tensor payload range inside a GGUF file.

Property AbsoluteOffset

Gets the absolute file offset.

Property LengthBytes

Gets the payload byte length.

Method FromTensor(UAIX.LmRuntime.Gguf.GgufTensorDescriptor)

Creates a payload view from a tensor descriptor.

tensor
The validated GGUF tensor descriptor whose dimensions, offsets, storage type, and byte length drive the operation.

Returns: The GgufTensorPayloadView result produced by GgufTensorPayloadView.FromTensor for this contract: Creates a payload view from a tensor descriptor. It is published only after all documented validation and ownership transitions succeed.

GgufArtifactClassifierUAIX.LmRuntime.Gguf 1 member

Classifies GGUF artifacts from metadata.

Method Classify(UAIX.LmRuntime.Gguf.GgufModel)

Classifies the GGUF artifact kind from validated metadata and tensor-layout evidence.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The GgufArtifactKind result produced by GgufArtifactClassifier.Classify for this contract: Classifies the GGUF artifact kind from validated metadata and tensor-layout evidence. It is published only after all documented validation and ownership transitions succeed.

GgufMetadataKeyPolicyUAIX.LmRuntime.Gguf 1 member

Validates GGUF metadata key syntax.

Method Validate(string)

Validates the supplied key against the invariants required by GgufMetadataKeyPolicy.

key
The exact ordinal metadata or catalog key used for lookup; the operation does not normalize or retain a mutable alias of the key.

Returns: The GgufValidationError? result produced by GgufMetadataKeyPolicy.Validate for this contract: Validates the supplied key against the invariants required by GgufMetadataKeyPolicy. It is published only after all documented validation and ownership transitions succeed.

GgufHashingReaderUAIX.LmRuntime.Gguf 1 member

Computes integrity hashes for GGUF and related model artifacts.

Method ComputeSha256(string)

Computes the SHA-256 hash of a file.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.

Returns: The text produced by GgufHashingReader.ComputeSha256 for this contract: Computes the SHA-256 hash of a file. The returned string is detached from mutable caller storage and is not persisted by the operation.

GgufJsonDumpWriterUAIX.LmRuntime.Gguf 1 member

Writes machine-readable GGUF dump JSON.

Method Write(UAIX.LmRuntime.Gguf.GgufModel)

Writes a stable JSON dump for a parsed GGUF model.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.

Returns: The text produced by GgufJsonDumpWriter.Write for this contract: Writes a stable JSON dump for a parsed GGUF model. The returned string is detached from mutable caller storage and is not persisted by the operation.

IMappedModelFileUAIX.LmRuntime.Gguf 6 members

Defines bounded, zero-copy access to tensor payloads in a mapped model artifact.

Property Model

Gets the parsed GGUF model catalog associated with the mapping.

Property FileLength

Gets the mapped file length in bytes.

Method GetTensorBytes(UAIX.LmRuntime.Gguf.GgufTensorDescriptor)

Gets a synchronous read-only span over one tensor payload.

tensor
The validated GGUF tensor descriptor whose dimensions, offsets, storage type, and byte length drive the operation.

Returns: A read-only span valid until this mapping is disposed.

Method GetTensorMemory(UAIX.LmRuntime.Gguf.GgufTensorDescriptor)

Gets a read-only memory view over one tensor payload.

tensor
The validated GGUF tensor descriptor whose dimensions, offsets, storage type, and byte length drive the operation.

Returns: A read-only memory view whose owner remains the mapped file.

Method TryGetTensorMemory(string,System.ReadOnlyMemory<byte>&)

Attempts to get a read-only memory view by tensor name.

tensorName
The exact ordinal GGUF tensor catalog name used for lookup and diagnostics.
memory
When the method returns, contains the memory produced by the operation when successful; otherwise contains the type's default value.

Returns: True when try get tensor memory succeeds for the supplied values; otherwise, false.

Method GetMemorySegments(ulong,ulong,int)

Creates bounded read-only memory windows for a validated mapped-file range.

absoluteOffset
The zero-based absolute offset into the relevant source or destination; range validation occurs before access.
byteLength
The bounded payload length in bytes used to validate offsets and prevent arithmetic overflow before slicing mapped storage.
maximumSegmentByteCount
The positive maximum byte count per returned segment.

Returns: Ordered borrowed segments whose combined length equals byteLength.

MappedGgufFileUAIX.LmRuntime.Gguf 14 members

Owns a read-only operating-system mapping of a GGUF artifact and exposes bounded tensor payload views.

The mapping is the sole owner of the acquired unmanaged pointer. Returned spans and memory views borrow that pointer and become invalid immediately after . Public entry points verify that descriptors belong to the parsed catalog before pointer arithmetic occurs.

Method MappedGgufFile(string,UAIX.LmRuntime.Gguf.GgufParseOptions)

Opens, validates, and maps a GGUF file read-only.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional GgufParseOptions controlling MappedGgufFile; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
Method MappedGgufFile(UAIX.LmRuntime.Gguf.GgufModel)

Maps a GGUF file using an already validated catalog.

model
The parsed GGUF model whose validated metadata and tensor catalog are consumed by this operation.
Property Model

Gets the immutable GGUF catalog that was validated before the operating-system mapping was opened.

Property FileLength

Gets the mapped file length captured from the validated catalog.

Property IsDisposed

Gets a value indicating whether the mapping has been disposed.

Method GetTensorBytes(UAIX.LmRuntime.Gguf.GgufTensorDescriptor)

Gets a synchronous borrowed span over one catalog tensor payload.

tensor
The descriptor that must match the mapping's authoritative tensor catalog.

Returns: A read-only span whose lifetime cannot exceed the current synchronous call chain.

Method GetTensorMemory(UAIX.LmRuntime.Gguf.GgufTensorDescriptor)

Gets a borrowed memory object over one catalog tensor payload.

tensor
The descriptor that must match the mapping's authoritative tensor catalog.

Returns: A read-only memory view backed directly by the operating-system mapping.

Method TryGetTensorMemory(string,System.ReadOnlyMemory<byte>&)

Tries to resolve a named tensor and create a borrowed mapped-memory view.

tensorName
The exact ordinal GGUF tensor catalog name used for lookup and diagnostics.
memory
Receives the borrowed tensor memory when the tensor exists; otherwise receives empty memory.

Returns: True when the tensor exists and a view was created; otherwise false.

Method GetMemorySegments(ulong,ulong,int)

Splits a validated mapped-file range into bounded borrowed memory segments.

absoluteOffset
The absolute file offset at which the first segment begins.
byteLength
The total number of bytes represented by all returned segments.
maximumSegmentByteCount
The positive upper bound for each managed segment descriptor.

Returns: Ordered borrowed memory segments that exactly cover the requested range.

Method GetBytes(ulong,ulong)

Gets a read-only span over a validated file range.

absoluteOffset
The zero-based absolute offset into the relevant source or destination; range validation occurs before access.
byteLength
The bounded payload length in bytes used to validate offsets and prevent arithmetic overflow before slicing mapped storage.

Returns: The bounded ReadOnlySpan<byte> view produced by MappedGgufFile.GetBytes: Gets a read-only span over a validated file range. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Method CopyBytes(ulong,System.Span<byte>)

Copies a validated mapped-file range into a caller-owned destination.

absoluteOffset
The zero-based absolute offset into the relevant source or destination; range validation occurs before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method CopyTensorBytes(UAIX.LmRuntime.Gguf.GgufTensorDescriptor,System.Span<byte>)

Copies a complete tensor payload into a caller-owned destination.

tensor
The validated GGUF tensor descriptor whose dimensions, offsets, storage type, and byte length drive the operation.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method CreateTensorView(UAIX.LmRuntime.Gguf.GgufTensorDescriptor,System.Collections.Generic.IReadOnlyList<ulong>)

Creates a read-only typed tensor view whose lifetime is owned by this mapping.

tensor
The validated GGUF tensor descriptor whose dimensions, offsets, storage type, and byte length drive the operation.
logicalDimensions
Optional logical row-major dimensions; storage dimensions are used when omitted.

Returns: The borrowed mapped tensor view, with ownership and disposal obligations defined by the returned type and the CreateTensorView contract.

Method Dispose

Releases the acquired view pointer, operating-system mapping handles, and source file handle.

IReadOnlyTensorViewUAIX.LmRuntime.Gguf 8 members

Defines a read-only tensor view backed by validated model storage.

Property Descriptor

Gets the authoritative GGUF tensor descriptor.

Property StorageDimensions

Gets dimensions in GGUF storage order.

Property LogicalDimensions

Gets normalized logical row-major dimensions.

Property DataType

Gets the runtime data type.

Property ByteOrder

Gets the GGUF byte order.

Property IsDisposed

Gets whether the backing owner has been disposed.

Method GetMemory

Gets a borrowed read-only memory view over the tensor payload.

Returns: The memory region containing the memory, bounded to the validated range owned by the result.

Method GetSpan

Gets a synchronous borrowed span over the tensor payload.

Returns: The bounded ReadOnlySpan<byte> view produced by IReadOnlyTensorView.GetSpan: Gets a synchronous borrowed span over the tensor payload. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

MappedTensorViewUAIX.LmRuntime.Gguf 9 members

Carries validated tensor geometry and a borrowed read-only view into a mapped GGUF file.

Method MappedTensorView(UAIX.LmRuntime.Gguf.MappedGgufFile,UAIX.LmRuntime.Gguf.GgufTensorDescriptor,System.Collections.Generic.IReadOnlyList<ulong>)

Initializes a new MappedTensorView instance with validated dependencies and operational bounds.

owner
The mapped model-file owner that keeps the tensor payload alive for the lifetime of the created view.
descriptor
The validated tensor descriptor retained by the view; its offsets and lengths remain bounded by the mapped file.
logicalDimensions
The logical dimensions sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Property Descriptor
Property StorageDimensions
Property LogicalDimensions
Property DataType
Property ByteOrder
Property IsDisposed
Method GetMemory

Retrieves the memory from the current MappedTensorView state after validating the requested access.

Returns: The memory region containing the memory, bounded to the validated range owned by the result.

Method GetSpan

Retrieves the span from the current MappedTensorView state after validating the requested access.

Returns: The bounded ReadOnlySpan<byte> view produced by MappedTensorView.GetSpan: Retrieves the span from the current MappedTensorView state after validating the requested access. Its lifetime and ownership remain tied to the owner identified by the containing type; no out-of-range region is exposed.

Float32TensorReaderUAIX.LmRuntime.Gguf 5 members

Reads float32 values from a validated tensor view with explicit GGUF byte-order handling.

Method Float32TensorReader(UAIX.LmRuntime.Gguf.IReadOnlyTensorView)

Initializes a new Float32TensorReader instance with validated dependencies and operational bounds.

view
The bounded tensor view whose descriptor, shape, byte order, and mapped payload are read without transferring ownership of the underlying mapping.
Property ElementCount

Gets the logical float32 element count.

Method ReadElement(int)

Reads one element by flat storage index.

index
The zero-based index; it must identify an existing position within the relevant validated range.

Returns: The float value computed by Float32TensorReader.ReadElement for this contract: Reads one element by flat storage index. Range, finite-value, and overflow checks are completed before the value is returned.

Method CopyTo(System.Span<float>)

Decodes every element into a caller-provided destination.

destination
The destination with room for every tensor element.
Method ToArray

Creates an explicit bounded managed copy for scalar reference execution.

Returns: A newly allocated float[] containing the ordered result of Float32TensorReader.ToArray: Creates an explicit bounded managed copy for scalar reference execution. The caller owns the returned array and later mutation cannot alter the source object.

SegmentedModelFileOptionsUAIX.LmRuntime.Gguf 1 member

Configures bounded segmented reads from a model file whose logical offsets remain unsigned 64-bit values.

Property MaximumSegmentByteCount

Gets the maximum bytes returned by one owned segment.

ModelFileSegmentUAIX.LmRuntime.Gguf 5 members

Owns one pooled, bounded model-file segment returned by .

The segment owns an lease. Consumers may retain only while this instance is alive and must dispose the segment exactly once when the bytes are no longer needed.

Property Offset

Gets the unsigned logical file offset represented by this segment.

Property Length

Gets the number of valid bytes in the segment.

Property IsDisposed

Gets whether the pooled segment has been released.

Property Memory

Gets the read-only segment bytes while this owner remains alive.

Method Dispose

Returns the rented byte array to the shared pool and invalidates Memory.

SegmentedModelFileReaderUAIX.LmRuntime.Gguf 7 members

Reads bounded windows from a local model file without representing the complete file as one managed span.

Logical positions remain unsigned 64-bit values until the final checked conversion required by the operating-system random-access API. Returned owned segments are copies in pooled memory; caller-provided spans remain caller owned. This class performs no network access and never follows URLs.

Method SegmentedModelFileReader(string,UAIX.LmRuntime.Gguf.SegmentedModelFileOptions)

Opens a local model file for deterministic random access.

path
The local file-system path processed by this operation; it must satisfy the containing component's path and scope policy.
options
The optional SegmentedModelFileOptions controlling SegmentedModelFileReader; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
Property Length

Gets the file length as an unsigned 64-bit value.

Property MaximumSegmentByteCount

Gets the maximum owned segment size.

Property IsDisposed

Gets whether the reader has released its file handle.

Method ReadExactly(ulong,System.Span<byte>)

Reads exactly into a caller-owned bounded destination.

offset
The zero-based offset into the relevant source or destination; range validation occurs before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method ReadSegment(ulong,int)

Reads the segment from the current binary source using the component's validated representation.

offset
The zero-based offset into the relevant source or destination; range validation occurs before access.
length
The length used to bound this operation; it must be nonnegative and within the supported range.

Returns: The ModelFileSegment result produced by SegmentedModelFileReader.ReadSegment for this contract: Reads the segment from the current binary source using the component's validated representation. It is published only after all documented validation and ownership transitions succeed.

Method Dispose

Releases resources owned by SegmentedModelFileReader and transitions it to the disposed state.

UAIX.LmRuntime.Sampling

16 documented types and 71 documented members.

GenerationFinishReasonUAIX.LmRuntime.Sampling 7 members

Identifies the first decisive condition that ended a generation.

Field None

Generation remains active.

Field StopToken

An exact configured token identifier ended generation.

Field StopText

A configured decoded UTF-8 stop sequence ended generation.

Field TokenLimit

The configured maximum generated-token count was reached.

Field ContextLimit

The configured prompt-plus-generation context bound was reached.

Field Cancelled

Cancellation was observed before publishing another token.

Field ExecutionError

An execution error ended generation.

GenerationUsageUAIX.LmRuntime.Sampling 3 members

Records tokenizer-ID-based usage accounting.

Property PromptTokens

Gets the exact number of prompt token identifiers consumed.

Property CompletionTokens

Gets the exact number of generated token identifiers accepted by the controller.

Property TotalTokens

Gets the checked sum of prompt and completion token counts.

GenerationStepResultUAIX.LmRuntime.Sampling 3 members

Represents the observable result of attempting to publish one generated token.

Property TokenAccepted

Gets whether the token identifier was retained in generated-token output.

Property VisibleBytes

Gets bytes newly safe to publish after stop-prefix matching.

Property FinishReason

Gets the stable finish reason after this step.

GenerationControllerUAIX.LmRuntime.Sampling 9 members

Enforces stop, limit, cancellation, usage, and output-publication boundaries for one generation.

Method GenerationController(int,int,UAIX.LmRuntime.Sampling.SamplingOptions)

Initializes a controller from validated vocabulary and prompt-token bounds.

vocabularySize
The positive tokenizer vocabulary size used to validate token identifiers and size bounded result buffers.
promptTokenCount
The prompt token count used to bound this operation; it must be nonnegative and within the supported range.
options
The optional SamplingOptions controlling GenerationController; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
Property State

Gets the session-local sampling state used by the same generation.

Property OutputTokenIds

Gets generated token identifiers retained under stop-token emission policy.

Property FinishReason

Gets the first decisive finish reason, or while active.

Property Usage

Gets exact tokenizer-ID usage without deriving counts from text or bytes.

Method AcceptToken(int,System.ReadOnlySpan<byte>,System.Threading.CancellationToken)

Attempts to accept and publish one generated token at a bounded cancellation point.

tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.
decodedBytes
The decoded bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
cancellationToken
Cancellation observed before the token is recorded or published.

Returns: The token/output/finish transition produced by this step.

Method ObserveCancellation(System.Threading.CancellationToken)

Observes cancellation between decode steps without publishing another token.

cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The GenerationFinishReason result produced by GenerationController.ObserveCancellation for this contract: Observes cancellation between decode steps without publishing another token. It is published only after all documented validation and ownership transitions succeed.

Method Fail(System.Exception)

Records an execution failure without exposing exception details through the stable finish reason.

exception
The non-null execution failure observed by the caller.

Returns: The GenerationFinishReason result produced by GenerationController.Fail for this contract: Records an execution failure without exposing exception details through the stable finish reason. It is published only after all documented validation and ownership transitions succeed.

Method CompleteVisibleBytes

Completes an otherwise active stream and flushes any bytes retained as a possible stop prefix.

Returns: Remaining visible bytes; the finish reason remains GenerationFinishReason.None.

GreedySamplerUAIX.LmRuntime.Sampling 1 member

Provides deterministic greedy token selection with explicit non-finite input policy.

Method Select(System.ReadOnlySpan<float>)

Selects the highest logit index with deterministic lower-index tie-breaking.

logits
The non-empty source logits. NaN is rejected; infinities compare normally.

Returns: The int value computed by GreedySampler.Select for this contract: Selects the highest logit index with deterministic lower-index tie-breaking. Range, finite-value, and overflow checks are completed before the value is returned.

LogitProcessorUAIX.LmRuntime.Sampling 4 members

Applies validated, deterministic token-history and bias policies to logits.

Method Process(System.ReadOnlySpan<float>,UAIX.LmRuntime.Sampling.SamplingOptions,UAIX.LmRuntime.Sampling.SamplingState)

Produces a processed copy of the source logits without exposing partially mutated caller data on validation failure.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
options
The optional SamplingOptions controlling Process; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
state
The validated state value consumed by the operation; mutations, when applicable, are limited to the explicitly documented state owner.

Returns: A newly allocated float[] containing the ordered result of LogitProcessor.Process: Produces a processed copy of the source logits without exposing partially mutated caller data on validation failure. The caller owns the returned array and later mutation cannot alter the source object.

Method ApplyHistoryPenalties(System.Span<float>,UAIX.LmRuntime.Sampling.SamplingOptions,System.Collections.Generic.IReadOnlyDictionary<int,int>)

Applies sign-aware repetition, frequency, and presence penalties in one deterministic pass.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
options
The optional SamplingOptions controlling ApplyHistoryPenalties; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
tokenCounts
The ordered token counts collection of type IReadOnlyDictionary<int, int>; LogitProcessor.ApplyHistoryPenalties validates nullability, count, and element constraints before consuming or snapshotting it and does not retain a mutable caller alias.
Method ApplyBias(System.Span<float>,System.Collections.Generic.IReadOnlyDictionary<int,float>)

Adds all validated per-token biases exactly once.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
biases
The ordered biases collection of type IReadOnlyDictionary<int, float>; LogitProcessor.ApplyBias validates nullability, count, and element constraints before consuming or snapshotting it and does not retain a mutable caller alias.
Method SuppressEarlyStopTokens(System.Span<float>,UAIX.LmRuntime.Sampling.SamplingOptions,int)

Marks configured stop tokens ineligible before the exact minimum-token boundary.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
options
The optional SamplingOptions controlling SuppressEarlyStopTokens; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
generatedTokenCount
The generated token count used to bound this operation; it must be nonnegative and within the supported range.
LogitScoreUAIX.LmRuntime.Sampling 2 members

Represents a scored token candidate.

Property TokenIndex

Gets the token index.

Property Score

Gets the logit score.

SamplingCandidateUAIX.LmRuntime.Sampling 2 members

Represents one normalized token candidate retained after all filters.

Property TokenId

Gets the token identifier.

Property Probability

Gets the normalized candidate probability.

SamplingDecisionUAIX.LmRuntime.Sampling 3 members

Describes one deterministic or stochastic sampling decision.

Property TokenId

Gets the selected token identifier.

Property IsGreedy

Gets whether the zero-temperature greedy path made the decision.

Property Candidates

Gets the candidate distribution used for selection, ordered by probability and token identifier.

ProbabilitySamplerUAIX.LmRuntime.Sampling 2 members

Builds stable normalized distributions and samples them with session-local deterministic state.

Method Select(System.ReadOnlySpan<float>,UAIX.LmRuntime.Sampling.SamplingOptions,UAIX.LmRuntime.Sampling.SamplingState)

Processes logits and selects one token under the supplied deterministic policy.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
options
The optional SamplingOptions controlling Select; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.
state
The validated state value consumed by the operation; mutations, when applicable, are limited to the explicitly documented state owner.

Returns: The SamplingDecision result produced by ProbabilitySampler.Select for this contract: Processes logits and selects one token under the supplied deterministic policy. It is published only after all documented validation and ownership transitions succeed.

Method BuildDistribution(System.ReadOnlySpan<float>,UAIX.LmRuntime.Sampling.SamplingOptions)

Builds a stable, filtered, and normalized probability distribution without consuming random state.

processedLogits
The processed logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
options
The optional SamplingOptions controlling BuildDistribution; null selects the documented defaults, supplied limits are validated before allocation, and the instance is not mutated.

Returns: Candidates ordered by descending probability and ascending token identifier for ties.

SamplingOptionsUAIX.LmRuntime.Sampling 17 members

Defines deterministic logit-processing, candidate-selection, and generation-stop policies for one sampling session.

Instances are treated as immutable configuration after a is created. Validation is intentionally performed before a logit buffer is modified so an invalid option cannot leave partially adjusted data.

Property Temperature

Gets the non-negative temperature. Zero selects the deterministic greedy path.

Property TopK

Gets the maximum candidate count. Zero disables top-k filtering.

Property TopP

Gets the normalized nucleus probability threshold in the inclusive range zero through one.

Property MinimumP

Gets the minimum probability relative to the highest candidate probability.

Property RepetitionPenalty

Gets the positive sign-aware repetition penalty.

Property FrequencyPenalty

Gets the amount subtracted for each prior occurrence of a token.

Property PresencePenalty

Gets the amount subtracted once from any token that has previously appeared.

Property LogitBias

Gets finite per-token additive logit biases.

Property StopTokenIds

Gets exact token identifiers that terminate generation after the minimum-token boundary.

Property StopSequences

Gets UTF-8 stop strings matched across decoded token boundaries.

Property IncludeStopToken

Gets whether a matched stop token is retained in emitted token identifiers.

Property IncludeStopSequence

Gets whether matched stop-sequence bytes are included in visible output.

Property MinimumGeneratedTokens

Gets the minimum generated-token count before stop-token or stop-text policies become eligible.

Property MaximumGeneratedTokens

Gets the maximum number of generated tokens. Zero permits no generated tokens.

Property MaximumContextTokens

Gets the maximum prompt-plus-generation token count.

Property Seed

Gets the deterministic per-session pseudo-random generator seed.

Method Validate(int)

Validates every option and token-indexed policy against a vocabulary size without mutating caller data.

vocabularySize
The positive number of logits accepted by the model.
SamplingStateUAIX.LmRuntime.Sampling 6 members

Stores token history and deterministic pseudo-random state for exactly one generation session.

A state instance must not be shared by independent requests. Keeping history and random state together makes session isolation explicit and prevents interleaved requests from consuming one another's random sequence.

Method SamplingState(UAIX.LmRuntime.Sampling.SamplingOptions)

Initializes isolated state from the immutable session options.

options
The options whose seed initializes the session generator.
Property Random

Gets the session-local deterministic random generator.

Property GeneratedTokenCount

Gets the number of generated token identifiers recorded by this session.

Property TokenCounts

Gets the session-owned prior-token counts through a read-only interface.

Method RecordToken(int)

Records one generated token for repetition, frequency, presence, and usage policies.

tokenId
The non-negative tokenizer identifier generated by the model.
Method GetTokenCount(int)

Gets the prior count for one token without adding it to history.

tokenId
The token identifier to process; it must fall within the validated vocabulary and operation-specific range.

Returns: The recorded count, or zero when the token has not appeared.

Xoshiro256StarStarUAIX.LmRuntime.Sampling 3 members

Implements the xoshiro256** generator with SplitMix64 seed expansion and deterministic unsigned arithmetic.

The generator is intentionally session-local and not thread-safe. Its transition follows the published xoshiro256** reference algorithm; seed expansion prevents the prohibited all-zero state for a zero seed.

Method Xoshiro256StarStar(ulong)

Initializes the generator from one deterministic 64-bit seed.

seed
The numeric seed consumed by Xoshiro256StarStar; it must satisfy the member's documented range, geometry, and finite-value requirements.
Method NextUInt64

Returns the next 64-bit output and advances the generator exactly once.

Returns: The ulong value computed by Xoshiro256StarStar.NextUInt64 for this contract: Returns the next 64-bit output and advances the generator exactly once. Range, finite-value, and overflow checks are completed before the value is returned.

Method NextUnitDouble

Returns a uniformly distributed value in the half-open interval [0, 1).

Returns: The double value computed by Xoshiro256StarStar.NextUnitDouble for this contract: Returns a uniformly distributed value in the half-open interval [0, 1). Range, finite-value, and overflow checks are completed before the value is returned.

StopSequenceMatchResultUAIX.LmRuntime.Sampling 3 members

Represents visible bytes released by one bounded stop-sequence matching step.

Property VisibleBytes

Gets newly visible bytes that cannot participate in a future stop match.

Property MatchedStopSequence

Gets the exact stop string that completed during this step, if any.

Property Matched

Gets whether a terminal stop sequence has matched.

StopSequenceMatcherUAIX.LmRuntime.Sampling 5 members

Matches UTF-8 stop sequences across arbitrary decoded-byte boundaries while retaining only a bounded possible prefix.

Method StopSequenceMatcher(System.Collections.Generic.IEnumerable<string>,bool)

Initializes a matcher from non-empty stop strings.

stopSequences
The exact Unicode strings encoded as UTF-8 for byte matching.
includeMatchedBytes
Whether a terminal match is included in visible output.
Property MaximumRetainedBytes

Gets the maximum retained prefix bytes, bounded by the longest configured stop sequence.

Property RetainedByteCount

Gets the current possible stop-prefix byte count.

Method Append(System.ReadOnlySpan<byte>)

Appends one decoded byte chunk and releases bytes that can no longer participate in a stop match.

bytes
The next decoded UTF-8 bytes; chunks may split a code point or stop sequence.

Returns: The newly visible bytes and optional terminal match.

Method Complete

Completes matching and releases any retained non-matching prefix bytes.

Returns: A newly allocated byte[] containing the ordered result of StopSequenceMatcher.Complete: Completes matching and releases any retained non-matching prefix bytes. The caller owns the returned array and later mutation cannot alter the source object.

TopKSelectorUAIX.LmRuntime.Sampling 1 member

Provides partial top-k selection for logit arrays.

Method SelectTopK(System.ReadOnlySpan<float>,int)

Selects the highest scoring token candidates without sorting the full input.

logits
The logits sequence used by this operation; its required length, ordering, and element bounds are validated before access.
k
The numeric k consumed by SelectTopK; it must satisfy the member's documented range, geometry, and finite-value requirements.

Returns: The selected candidates in descending score order with deterministic index tie-breaking.

UAIX.LmRuntime.Kernels.Cpu

21 documented types and 87 documented members.

Q4_1DequantizerUAIX.LmRuntime.Kernels.Cpu 1 member

Dequantizes Q4_1 blocks for scalar reference parity.

Method DequantizeBlock(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one Q4_1 block into destination floats.

source
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Q5_0DequantizerUAIX.LmRuntime.Kernels.Cpu 1 member

Dequantizes Q5_0 blocks for scalar reference parity.

Method DequantizeBlock(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one Q5_0 block into destination floats.

source
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Q6_KDequantizerUAIX.LmRuntime.Kernels.Cpu 1 member

Dequantizes Q6_K blocks for scalar reference parity.

Method DequantizeBlock(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one exact GGML Q6_K block into destination floats.

source
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
FusedQuantizedDotProductUAIX.LmRuntime.Kernels.Cpu 1 member

Provides fused dequantize-and-dot reference kernels.

Method Dot(UAIX.LmRuntime.Tensors.GgmlTensorType,System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>)

Computes a dot product between a quantized block and float activations.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.
block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by FusedQuantizedDotProduct.Dot for this contract: Computes a dot product between a quantized block and float activations. Range, finite-value, and overflow checks are completed before the value is returned.

QuantizedKernelParityReportUAIX.LmRuntime.Kernels.Cpu 2 members

Represents the result of comparing an optimized quantized kernel to a reference kernel.

Property Passed

Gets a value indicating whether outputs are within tolerance.

Property MaxAbsoluteError

Gets the maximum absolute error observed.

QuantizedKernelParityRunnerUAIX.LmRuntime.Kernels.Cpu 1 member

Compares quantized kernels against scalar references.

Method CompareAgainstReference(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,float)

Compares two output vectors with an absolute tolerance.

reference
The reference sequence used by this operation; its required length, ordering, and element bounds are validated before access.
actual
The actual sequence used by this operation; its required length, ordering, and element bounds are validated before access.
tolerance
The numeric tolerance consumed by CompareAgainstReference; it must satisfy the member's documented range, geometry, and finite-value requirements.

Returns: The QuantizedKernelParityReport result produced by QuantizedKernelParityRunner.CompareAgainstReference for this contract: Compares two output vectors with an absolute tolerance. It is published only after all documented validation and ownership transitions succeed.

Avx2Float32KernelsUAIX.LmRuntime.Kernels.Cpu 2 members

Provides dedicated AVX2 float32 correctness kernels with scalar tails.

These kernels are selected only when AVX2 is explicitly requested and supported. Scalar implementations remain the numerical authority, and no throughput claim is implied until executed benchmark evidence exists.

Method Dot(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>)

Computes a float32 dot product with AVX/FMA vector arithmetic and a scalar tail.

left
The left sequence used by this operation; its required length, ordering, and element bounds are validated before access.
right
The right sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by Avx2Float32Kernels.Dot for this contract: Computes a float32 dot product with AVX/FMA vector arithmetic and a scalar tail. Range, finite-value, and overflow checks are completed before the value is returned.

Method MatVec(System.ReadOnlySpan<float>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a row-major float32 matrix-vector product by reusing the dedicated AVX2 dot kernel.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
CpuKernelTierUAIX.LmRuntime.Kernels.Cpu 6 members

Identifies a managed CPU kernel implementation tier.

Field Auto

Selects the highest supported tier implemented for the requested operation.

Field Scalar

Uses the scalar correctness implementation.

Field PortableVector

Uses portable operations.

Field Avx2

Uses an AVX2 implementation when the operation provides one.

Field Avx512

Uses an AVX-512 implementation when the operation provides one.

Field AdvSimd

Uses an ARM64 AdvSimd implementation when the operation provides one.

CpuKernelSelectionUAIX.LmRuntime.Kernels.Cpu 4 members

Describes the requested and selected CPU kernel tier for one operation.

Property RequestedTier

Gets the requested tier.

Property SelectedTier

Gets the selected tier.

Property Operation

Gets the operation name.

Property Reason

Gets the stable selection rationale.

CpuKernelDispatcherUAIX.LmRuntime.Kernels.Cpu 6 members

Dispatches correctness-first CPU kernels through explicitly selectable implementation tiers.

Scalar implementations remain the numerical authority. Portable and architecture-specific paths are additive and can always be bypassed by requesting .

Method SelectFloat32DotTier(UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier)

Selects the implemented tier for a float32 dot product.

requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.

Returns: The CpuKernelSelection result produced by CpuKernelDispatcher.SelectFloat32DotTier for this contract: Selects the implemented tier for a float32 dot product. It is published only after all documented validation and ownership transitions succeed.

Method DotFloat32(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier,UAIX.LmRuntime.Kernels.Cpu.CpuKernelSelection&)

Computes a float32 dot product through the selected implementation tier.

left
The left sequence used by this operation; its required length, ordering, and element bounds are validated before access.
right
The right sequence used by this operation; its required length, ordering, and element bounds are validated before access.
requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.
selection
When the method returns, contains the selection produced by the operation when successful; otherwise contains the type's default value.

Returns: The float value computed by CpuKernelDispatcher.DotFloat32 for this contract: Computes a float32 dot product through the selected implementation tier. Range, finite-value, and overflow checks are completed before the value is returned.

Method MatVecFloat32(System.ReadOnlySpan<float>,int,int,System.ReadOnlySpan<float>,System.Span<float>,UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier,UAIX.LmRuntime.Kernels.Cpu.CpuKernelSelection&)

Computes a row-major float32 matrix-vector product through the selected implementation tier.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.
selection
When the method returns, contains the selection produced by the operation when successful; otherwise contains the type's default value.
Method RmsNorm(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>,float,UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier,UAIX.LmRuntime.Kernels.Cpu.CpuKernelSelection&)

Applies RMS normalization using the selected scalar or portable-vector accumulation tier.

input
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
weight
The weight sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
epsilon
The positive normalization epsilon added to the mean-square term to avoid division by zero while preserving deterministic numerical behavior.
requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.
selection
When the method returns, contains the selection produced by the operation when successful; otherwise contains the type's default value.
Method DotQ8_0(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>,UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier,UAIX.LmRuntime.Kernels.Cpu.CpuKernelSelection&)

Computes a Q8_0 block dot product through a scalar or portable-vector correctness path.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.
selection
When the method returns, contains the selection produced by the operation when successful; otherwise contains the type's default value.

Returns: The float value computed by CpuKernelDispatcher.DotQ8_0 for this contract: Computes a Q8_0 block dot product through a scalar or portable-vector correctness path. Range, finite-value, and overflow checks are completed before the value is returned.

Method DotQ4_0(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>,UAIX.LmRuntime.Kernels.Cpu.CpuKernelTier,UAIX.LmRuntime.Kernels.Cpu.CpuKernelSelection&)

Computes a Q4_0 block dot product through a scalar or portable-vector correctness path.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
requestedTier
The requested tier containing caller-supplied values for this operation; all required fields are validated before processing.
selection
When the method returns, contains the selection produced by the operation when successful; otherwise contains the type's default value.

Returns: The float value computed by CpuKernelDispatcher.DotQ4_0 for this contract: Computes a Q4_0 block dot product through a scalar or portable-vector correctness path. Range, finite-value, and overflow checks are completed before the value is returned.

Q4KBlockUAIX.LmRuntime.Kernels.Cpu 4 members

Defines the audited packed GGML Q4_K block layout for 256 logical values.

Field Scale

Gets or sets the common little-endian IEEE half scale.

Field MinimumScale

Gets or sets the common little-endian IEEE half minimum scale.

Field ScaleMinimums

Stores eight packed 6-bit scales and eight packed 6-bit minimum factors.

Field QuantizedValues

Stores 256 four-bit quants in 128 bytes.

Q6KBlockUAIX.LmRuntime.Kernels.Cpu 4 members

Defines the audited packed GGML Q6_K block layout for 256 logical values.

Field LowBits

Stores the lower four bits for 256 quants.

Field HighBits

Stores the upper two bits for 256 quants.

Field Scales

Stores sixteen signed sub-block scales.

Field Scale

Gets or sets the common little-endian IEEE half scale.

KQuantizedBlockLayoutUAIX.LmRuntime.Kernels.Cpu 4 members

Describes one audited K-quantized block layout.

Property Format

Gets the format name.

Property ElementCount

Gets the logical element count.

Property ByteCount

Gets the physical byte count.

Property LayoutDescription

Gets the audited layout statement.

KQuantizedCpuKernelsUAIX.LmRuntime.Kernels.Cpu 17 members

Provides correctness-first scalar GGML Q4_K and Q6_K block kernels.

These methods operate on one exact 256-element block and never materialize a complete model matrix. All scale fields are interpreted as little-endian IEEE half values because current direct K-quant execution is limited to little-endian GGUF storage.

Field BlockElementCount

Gets the number of logical values in one K-quant block.

Field Q4KBlockByteCount

Gets the exact Q4_K block byte count.

Field Q6KBlockByteCount

Gets the exact Q6_K block byte count.

Field Q4_KBlockBytes

Gets the historical Q4_K block-byte constant retained for source compatibility.

Field Q6_KBlockBytes

Gets the historical Q6_K block-byte constant retained for source compatibility.

Property Q4KLayout

Gets the audited Q4_K block layout.

Property Q6KLayout

Gets the audited Q6_K block layout.

Method DequantizeQ4K(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one exact Q4_K block into a caller-owned destination.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method DotQ4K(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>)

Computes an allocation-free dot product for one exact Q4_K block.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by KQuantizedCpuKernels.DotQ4K for this contract: Computes an allocation-free dot product for one exact Q4_K block. Range, finite-value, and overflow checks are completed before the value is returned.

Method DequantizeQ6K(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one exact Q6_K block into a caller-owned destination.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method DotQ6K(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>)

Computes an allocation-free dot product for one exact Q6_K block.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by KQuantizedCpuKernels.DotQ6K for this contract: Computes an allocation-free dot product for one exact Q6_K block. Range, finite-value, and overflow checks are completed before the value is returned.

Method DequantizeQ4_K(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one Q4_K block using the historical method name retained for source compatibility.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method DequantizeQ6_K(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one Q6_K block using the historical method name retained for source compatibility.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method MatVecQ4_K(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a Q4_K matrix-vector product using the historical method name retained for source compatibility.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method MatVecQ6_K(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a Q6_K matrix-vector product using the historical method name retained for source compatibility.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method MatVecQ4K(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a row-major Q4_K matrix-vector product without whole-matrix dequantization.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method MatVecQ6K(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a row-major Q6_K matrix-vector product without whole-matrix dequantization.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
QuantizedCpuKernelsUAIX.LmRuntime.Kernels.Cpu 9 members

Provides allocation-free scalar correctness kernels for high-value GGML quantization formats.

Field Q4_0BlockBytes

Gets the byte length of a Q4_0 block.

Field Q8_0BlockBytes

Gets the byte length of a Q8_0 block.

Field BlockElementCount

Gets the logical element count in a Q4_0 or Q8_0 block.

Method DequantizeQ4_0(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one little-endian Q4_0 block into float32 values.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The destination buffer with room for 32 values.
Method DequantizeQ8_0(System.ReadOnlySpan<byte>,System.Span<float>)

Dequantizes one little-endian Q8_0 block into float32 values.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
destination
The destination buffer with room for 32 values.
Method DotQ4_0(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>)

Computes an allocation-free dequantize-and-dot operation for one Q4_0 block.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by QuantizedCpuKernels.DotQ4_0 for this contract: Computes an allocation-free dequantize-and-dot operation for one Q4_0 block. Range, finite-value, and overflow checks are completed before the value is returned.

Method DotQ8_0(System.ReadOnlySpan<byte>,System.ReadOnlySpan<float>)

Computes an allocation-free dequantize-and-dot operation for one Q8_0 block.

block
The block sequence used by this operation; its required length, ordering, and element bounds are validated before access.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by QuantizedCpuKernels.DotQ8_0 for this contract: Computes an allocation-free dequantize-and-dot operation for one Q8_0 block. Range, finite-value, and overflow checks are completed before the value is returned.

Method MatVecQ4_0(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a row-major Q4_0 matrix-vector product without materializing full-precision rows.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The number of logical columns; it must be divisible by 32.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Method MatVecQ8_0(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a row-major Q8_0 matrix-vector product without materializing full-precision rows.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The number of logical columns; it must be divisible by 32.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
ReferenceCpuKernelsUAIX.LmRuntime.Kernels.Cpu 6 members

Provides scalar and portable CPU reference kernels for correctness anchoring.

Method SoftmaxInPlace(System.Span<float>)

Computes softmax probabilities for the in place using numerically stable normalization.

values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method ApplyRopeInPlace(System.Span<float>,System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,int)

Applies RoPE rotation to one query or key vector in place using precomputed sine and cosine values.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
cos
The cos sequence used by this operation; its required length, ordering, and element bounds are validated before access.
sin
The sin sequence used by this operation; its required length, ordering, and element bounds are validated before access.
ropeDimensions
The even number of leading head dimensions transformed by rotary positional encoding.
Method Softmax(System.Span<float>)

Computes softmax probabilities for the supplied values using numerically stable normalization.

values
The values sequence used by this operation; its required length, ordering, and element bounds are validated before access.
Method ApplyRope(System.Span<float>,int,float)

Applies RoPE rotation using generated trigonometric tables for the supplied position.

vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
position
The zero-based sequence or cache position addressed by the operation; it must lie within the allocated context and readable or writable range.
theta
The rotary angle in radians applied to the paired vector components at the addressed position.
Method RmsNorm(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>,float)

Applies RMS normalization using the shared vector math implementation.

input
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
weight
The weight sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
epsilon
The positive normalization epsilon added to the mean-square term to avoid division by zero while preserving deterministic numerical behavior.
Method MatVec(System.ReadOnlySpan<float>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a matrix-vector product for row-major float32 weights.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Q4_0BlockUAIX.LmRuntime.Kernels.Cpu 2 members

Defines the exact packed Q4_0 block layout used by GGML storage.

Field Scale

Gets or sets the little-endian IEEE half scale field.

Field QuantizedValues

Stores 32 signed 4-bit values in 16 packed bytes.

Q8_0BlockUAIX.LmRuntime.Kernels.Cpu 2 members

Defines the exact packed Q8_0 block layout used by GGML storage.

Field Scale

Gets or sets the little-endian IEEE half scale field.

Field QuantizedValues

Stores 32 signed 8-bit values.

ReferenceMatrixStorageDescriptorUAIX.LmRuntime.Kernels.Cpu 4 members

Describes one supported scalar matrix storage layout.

Property GgmlType

Gets the GGML tensor type.

Property RowCount

Gets the logical row count.

Property ColumnCount

Gets the logical column count.

Property RequiredByteCount

Gets the exact required storage byte count.

ReferenceMatrixRowDispatcherUAIX.LmRuntime.Kernels.Cpu 2 members

Dispatches correctness-first matrix-vector operations for supported mapped scalar and quantized rows.

Method Describe(UAIX.LmRuntime.Tensors.GgmlTensorType,int,int)

Computes the exact storage byte count for a supported matrix.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The ReferenceMatrixStorageDescriptor result produced by ReferenceMatrixRowDispatcher.Describe for this contract: Computes the exact storage byte count for a supported matrix. It is published only after all documented validation and ownership transitions succeed.

Method MatVec(UAIX.LmRuntime.Tensors.GgmlTensorType,System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>)

Computes a little-endian matrix-vector product without materializing a complete dequantized matrix.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.
matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
activations
The activations sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
Scalar16CpuKernelsUAIX.LmRuntime.Kernels.Cpu 8 members

Provides correctness-first scalar F16 and BF16 decoding and matrix-vector kernels.

Method DecodeFloat16(System.ReadOnlySpan<byte>,bool)

Decodes one IEEE binary16 value with explicit byte order.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.

Returns: The float value computed by Scalar16CpuKernels.DecodeFloat16 for this contract: Decodes one IEEE binary16 value with explicit byte order. Range, finite-value, and overflow checks are completed before the value is returned.

Method DecodeFloat16(System.ReadOnlySpan<byte>,bool,bool)

Decodes one IEEE binary16 value with explicit byte order and non-finite policy.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.
rejectNonFinite
Whether NaN and infinity are rejected as invalid model weights.

Returns: The float value computed by Scalar16CpuKernels.DecodeFloat16 for this contract: Decodes one IEEE binary16 value with explicit byte order and non-finite policy. Range, finite-value, and overflow checks are completed before the value is returned.

Method DecodeBFloat16(System.ReadOnlySpan<byte>,bool)

Decodes one bfloat16 value with explicit byte order.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.

Returns: The float value computed by Scalar16CpuKernels.DecodeBFloat16 for this contract: Decodes one bfloat16 value with explicit byte order. Range, finite-value, and overflow checks are completed before the value is returned.

Method DecodeBFloat16(System.ReadOnlySpan<byte>,bool,bool)

Decodes one bfloat16 value with explicit byte order and non-finite policy.

bytes
The bytes sequence used by this operation; its required length, ordering, and element bounds are validated before access.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.
rejectNonFinite
Whether NaN and infinity are rejected as invalid model weights.

Returns: The float value computed by Scalar16CpuKernels.DecodeBFloat16 for this contract: Decodes one bfloat16 value with explicit byte order and non-finite policy. Range, finite-value, and overflow checks are completed before the value is returned.

Method CopyFloat16(System.ReadOnlySpan<byte>,System.Span<float>,bool)

Copies F16 values into a caller-owned float32 destination.

source
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
destination
The destination with one element per F16 value.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.
Method CopyBFloat16(System.ReadOnlySpan<byte>,System.Span<float>,bool)

Copies BF16 values into a caller-owned float32 destination.

source
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
destination
The destination with one element per BF16 value.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.
Method MatVecFloat16(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>,bool)

Computes a row-major F16 matrix-vector product without whole-matrix conversion.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.
Method MatVecBFloat16(System.ReadOnlySpan<byte>,int,int,System.ReadOnlySpan<float>,System.Span<float>,bool)

Computes a row-major BF16 matrix-vector product without whole-matrix conversion.

matrix
The matrix sequence used by this operation; its required length, ordering, and element bounds are validated before access.
rowCount
The row count used to bound this operation; it must be nonnegative and within the supported range.
columnCount
The column count used to bound this operation; it must be nonnegative and within the supported range.
vector
The vector sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
bigEndian
True to decode source bytes in big-endian order; false to decode the little-endian representation used by the local GGUF artifact.

UAIX.LmRuntime.Tensors

13 documented types and 76 documented members.

GgmlTensorTypeUAIX.LmRuntime.Tensors 21 members

Identifies GGML tensor storage types as encoded in GGUF tensor descriptors.

Field F32

32-bit floating point.

Field F16

16-bit floating point.

Field Q4_0

Q4_0 block quantization.

Field Q4_1

Q4_1 block quantization.

Field Q5_0

Q5_0 block quantization.

Field Q5_1

Q5_1 block quantization.

Field Q8_0

Q8_0 block quantization.

Field Q8_1

Q8_1 block quantization.

Field Q2_K

Q2_K block quantization.

Field Q3_K

Q3_K block quantization.

Field Q4_K

Q4_K block quantization.

Field Q5_K

Q5_K block quantization.

Field Q6_K

Q6_K block quantization.

Field Q8_K

Q8_K block quantization.

Field I64

64-bit signed integer.

Field I32

32-bit signed integer.

Field I16

16-bit signed integer.

Field I8

8-bit signed integer.

Field F64

64-bit floating point.

Field BF16

16-bit brain floating point.

Field IQ4_NL

IQ4_NL block quantization.

ITensorUAIX.LmRuntime.Tensors 2 members

Defines tensor metadata common to all backend placements.

Property Shape

Gets the tensor shape.

Property DataType

Gets the tensor element representation.

QuantizedBlockTraitUAIX.LmRuntime.Tensors 3 members

Describes a quantized block layout.

Property GgmlType

Gets the GGML tensor type.

Property BlockElementCount

Gets the logical elements in one block.

Property BlockByteCount

Gets the physical bytes in one block.

QuantizedBlockTraitsUAIX.LmRuntime.Tensors 1 member

Provides quantized block trait lookup.

Method Get(UAIX.LmRuntime.Tensors.GgmlTensorType)

Gets quantized block layout information for a GGML tensor type.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.

Returns: The QuantizedBlockTrait result produced by QuantizedBlockTraits.Get for this contract: Gets quantized block layout information for a GGML tensor type. It is published only after all documented validation and ownership transitions succeed.

Q4_1BlockUAIX.LmRuntime.Tensors 3 members

Represents a Q4_1 quantized block descriptor.

Property Scale

Gets the block scale.

Property Minimum

Gets the block minimum.

Property PackedValues

Gets the packed values.

Q5_0BlockUAIX.LmRuntime.Tensors 3 members

Represents a Q5_0 quantized block descriptor.

Property Scale

Gets the block scale.

Property HighBits

Gets the high-bit metadata.

Property PackedValues

Gets the packed low-bit values.

Q5_1BlockUAIX.LmRuntime.Tensors 4 members

Represents a Q5_1 quantized block descriptor.

Property Scale

Gets the block scale.

Property Minimum

Gets the block minimum.

Property HighBits

Gets the high-bit metadata.

Property PackedValues

Gets the packed low-bit values.

Q8_1BlockUAIX.LmRuntime.Tensors 3 members

Represents a Q8_1 quantized block descriptor.

Property Scale

Gets the block scale.

Property Sum

Gets the block sum metadata.

Property Values

Gets the quantized values.

TensorDataTypeUAIX.LmRuntime.Tensors 24 members

Identifies supported tensor element representations.

Field Unknown

Unknown or unsupported tensor representation.

Field Float32

32-bit IEEE floating point.

Field Float16

16-bit IEEE floating point.

Field BFloat16

16-bit brain floating point.

Field Int8

8-bit signed integer.

Field Int16

16-bit signed integer.

Field Int32

32-bit signed integer.

Field Int64

64-bit signed integer.

Field Q4_0

GGML Q4_0 block quantization.

Field Q4_1

GGML Q4_1 block quantization.

Field Q5_0

GGML Q5_0 block quantization.

Field Q5_1

GGML Q5_1 block quantization.

Field Q8_0

GGML Q8_0 block quantization.

Field Q8_1

GGML Q8_1 block quantization.

Field Q2_K

GGML Q2_K block quantization.

Field Q3_K

GGML Q3_K block quantization.

Field Q4_K

GGML Q4_K block quantization.

Field Q5_K

GGML Q5_K block quantization.

Field Q6_K

GGML Q6_K block quantization.

Field IQ4_NL

GGML IQ4_NL block quantization.

Field MXFP4

MXFP4 packed floating-point storage.

Field NVFP4

NVFP4 packed floating-point storage.

Field TQ1_0

TQ1_0 ternary quantized storage.

Field TQ2_0

TQ2_0 ternary quantized storage.

TensorShapeUAIX.LmRuntime.Tensors 3 members

Represents immutable tensor shape metadata.

Property Dimensions

Gets the tensor dimensions.

Property ElementCount

Gets the number of tensor elements.

Method From(long[])

Creates the result from the tensor shape after validating the supplied representation.

Returns: The TensorShape result produced by TensorShape.From for this contract: Creates the result from the tensor shape after validating the supplied representation. It is published only after all documented validation and ownership transitions succeed.

TensorTypeTraitsUAIX.LmRuntime.Tensors 5 members

Describes storage traits for a GGML tensor type.

Property GgmlType

Gets the GGML tensor type.

Property DataType

Gets the runtime tensor data type.

Property BlockElementCount

Gets the number of logical elements in one physical storage block.

Property BlockByteCount

Gets the number of physical bytes in one storage block.

Property IsQuantized

Gets a value indicating whether the type is block-quantized.

TensorTypeTraitsCatalogUAIX.LmRuntime.Tensors 2 members

Provides GGML tensor type trait lookup and byte-length validation.

Method Get(UAIX.LmRuntime.Tensors.GgmlTensorType)

Gets traits for the specified tensor type.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.

Returns: The TensorTypeTraits result produced by TensorTypeTraitsCatalog.Get for this contract: Gets traits for the specified tensor type. It is published only after all documented validation and ownership transitions succeed.

Method ComputeByteLength(UAIX.LmRuntime.Tensors.GgmlTensorType,ulong)

Computes the physical byte length required for a tensor type and element count.

type
The declared GGML tensor or metadata type used to select the corresponding decoding and validation rules.
elementCount
The element count used to bound this operation; it must be nonnegative and within the supported range.

Returns: The ulong value computed by TensorTypeTraitsCatalog.ComputeByteLength for this contract: Computes the physical byte length required for a tensor type and element count. Range, finite-value, and overflow checks are completed before the value is returned.

VectorMathUAIX.LmRuntime.Tensors 2 members

Provides allocation-free vector math kernels used by tests and CPU fallback paths.

Method Dot(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>)

Computes the dot product of two equal-length vectors.

left
The left sequence used by this operation; its required length, ordering, and element bounds are validated before access.
right
The right sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The float value computed by VectorMath.Dot for this contract: Computes the dot product of two equal-length vectors. Range, finite-value, and overflow checks are completed before the value is returned.

Method RmsNorm(System.ReadOnlySpan<float>,System.ReadOnlySpan<float>,System.Span<float>,float)

Applies RMS normalization using an explicit weight vector.

input
The source data consumed by the operation; caller-owned storage is not retained after the method returns.
weight
The weight sequence used by this operation; its required length, ordering, and element bounds are validated before access.
output
The caller-owned destination buffer that receives the result; required capacity is validated before any write occurs.
epsilon
The positive normalization epsilon added to the mean-square term to avoid division by zero while preserving deterministic numerical behavior.

UAIX.LmRuntime.Abstractions

99 documented types and 631 documented members.

AdapterStatusUAIX.LmRuntime.Contracts 3 members

Represents normalized adapter status information.

Property StatusCode

Gets the provider status code, if applicable.

Property RequestId

Gets the provider request identifier, if supplied by the backend.

Property Message

Gets a normalized warning or diagnostic message.

FinishReasonUAIX.LmRuntime.Contracts 8 members

Identifies why an inference response stopped.

Field Unknown

The provider or backend did not return a reason.

Field Stop

The model naturally stopped.

Field Length

The model hit the configured maximum output token budget.

Field ToolCall

The model emitted or requested a tool call.

Field ContentFilter

The runtime stopped generation because content was filtered.

Field Cancelled

The request was cancelled.

Field Error

The backend reported an execution error.

Field PolicyDenied

The runtime selected no-op because policy, budget, or claim-boundary rules blocked automatic execution.

InferenceRequestUAIX.LmRuntime.Contracts 14 members

Represents a provider-neutral inference request.

Property Model

Gets the requested model identifier.

Property ConversationId

Gets the conversation identifier when one is present.

Property Messages

Gets the message sequence.

Property MaxOutputTokens

Gets the maximum output token budget.

Property Temperature

Gets the sampling temperature.

Property TopP

Gets the nucleus sampling probability cutoff.

Property TopK

Gets the top-k sampling cutoff when one is present.

Property Seed

Gets the deterministic sampler seed when one is present.

Property StopSequences

Gets stop sequences used to terminate generation.

Property Tools

Gets tool definitions available to the model.

Property ToolChoice

Gets tool selection guidance.

Property ResponseFormat

Gets the requested response format.

Property Metadata

Gets caller-supplied metadata propagated to adapters and diagnostics.

Property UseMemory

Gets a value indicating whether .uai memory should be injected before execution.

InferenceResponseUAIX.LmRuntime.Contracts 11 members

Represents a normalized inference response.

Property ResponseId

Gets the response identifier.

Property ConversationId

Gets the conversation identifier when one is present.

Property Model

Gets the resolved model identifier.

Property Provider

Gets the provider or backend name.

Property OutputText

Gets the output text.

Property FinishReason

Gets the legacy textual finish reason.

Property FinishReasonKind

Gets the strongly typed finish reason.

Property CreatedUtc

Gets the UTC creation timestamp.

Property Usage

Gets normalized usage data.

Property AdapterStatus

Gets normalized adapter status information.

Property GovernanceReceipt

Gets the governance receipt emitted by budget or claim-boundary policy.

InferenceUsageUAIX.LmRuntime.Contracts 4 members

Represents normalized model usage data.

Property InputTokens

Gets the input token count.

Property OutputTokens

Gets the output token count.

Property CachedInputTokens

Gets the provider-cached input token count when available.

Property EstimatedCostMicros

Gets the estimated cost in one-millionth currency units, if known.

LlmMessageUAIX.LmRuntime.Contracts 8 members

Represents a canonical chat or completion message.

Property Role

Gets the message role.

Property Content

Gets the text content for the message.

Property ToolCallId

Gets the tool call identifier associated with a tool message when one is present.

Method System(string)

Creates an immutable system-role LLM message from caller-supplied content.

content
The content processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: A new system-role message whose content is never null.

Method Developer(string)

Creates an immutable developer-role LLM message from caller-supplied content.

content
The content processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: A new developer-role message whose content is never null.

Method User(string)

Creates an immutable user-role LLM message from caller-supplied content.

content
The content processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: A new user-role message whose content is never null.

Method Assistant(string)

Creates an immutable assistant-role LLM message from caller-supplied content.

content
The content processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: A new assistant-role message whose content is never null.

Method Tool(string,string)

Creates an immutable tool-role LLM message correlated to the supplied tool call.

toolCallId
The caller-owned tool call id used for deterministic correlation by Tool; it must satisfy the documented range and grammar and grants no additional authority.
content
The content processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: A new tool-role message containing the supplied correlation identifier and non-null content.

LlmRoleUAIX.LmRuntime.Contracts 5 members

Identifies the role of a message in a canonical inference request.

Field System

System-level instruction context.

Field Developer

Developer-level instruction context.

Field User

End-user message content.

Field Assistant

Assistant message content.

Field Tool

Tool input or output content.

MemoryQueryUAIX.LmRuntime.Contracts 3 members

Represents a query against .uai memory entries.

Property ConversationId

Gets the conversation identifier filter when one is present.

Property Text

Gets full-text query text when one is present.

Property MaxEntries

Gets the maximum number of entries to return.

ModelDescriptorUAIX.LmRuntime.Contracts 6 members

Describes a model visible to the runtime.

Property ModelId

Gets the model identifier.

Property Provider

Gets the provider or backend name.

Property SupportsStreaming

Gets a value indicating whether streaming is supported.

Property IsLocal

Gets a value indicating whether the model executes locally.

Property ContextLength

Gets the maximum context length in tokens.

Property Capabilities

Gets capability names exposed by the model.

ProviderErrorUAIX.LmRuntime.Contracts 4 members

Represents a normalized adapter or provider error.

Property Code

Gets the normalized error code.

Property Message

Gets the error message.

Property Retriable

Gets a value indicating whether retry may be safe.

Property RetryAfter

Gets the suggested retry delay when available.

ResponseFormatUAIX.LmRuntime.Contracts 3 members

Describes structured output requirements for a response.

Property Kind

Gets the response format kind.

Property JsonSchema

Gets the JSON schema document used when is .

Property Strict

Gets a value indicating whether the backend should enforce strict schema adherence when supported.

ResponseFormatKindUAIX.LmRuntime.Contracts 3 members

Identifies the canonical output format mode.

Field Text

Free-form text response.

Field JsonObject

JSON object response.

Field JsonSchema

JSON schema-constrained response.

RuntimeOptionsUAIX.LmRuntime.Contracts 24 members

Defines runtime orchestration settings.

Property DefaultModel

Gets the default model identifier.

Property MaxMemoryEntries

Gets the maximum number of memory entries injected into a request.

Property MaxMemoryCharacters

Gets the maximum memory characters injected into the system context.

Property MaxContextTokens

Gets the maximum context tokens accepted by the orchestrator before adapter execution.

Property EnableTeleodynamicGovernance

Gets a value indicating whether Teleodynamic governance gates are evaluated before execution.

Property ReturnNoOpResponseOnGovernanceDenial

Gets a value indicating whether budget-denied requests return a no-op response instead of throwing.

Property EnableConstraintPolicy

Gets a value indicating whether request-side constraint rules are evaluated before adapter execution.

Property EnableClaimBoundaryPolicy

Gets a value indicating whether generated text is evaluated against claim-boundary rules.

Property ReturnNoOpResponseOnClaimBoundaryViolation

Gets a value indicating whether claim-boundary violations return a no-op response.

Property EnableReviewGatePolicy

Gets a value indicating whether slow-loop review gates are evaluated before execution.

Property ReturnNoOpResponseOnReviewGateRequired

Gets a value indicating whether review-gated requests return a no-op response instead of throwing.

Property QuarantineGeneratedNeedsHumanReview

Gets a value indicating whether generated items needing human review are written to the quarantine ledger.

Property FailOnEvidenceLedgerError

Gets a value indicating whether ledger append errors should fail inference.

Property AvailableResourceBudget

Gets the available resource budget used by the default governor.

Property ViabilityFloor

Gets the minimum resource reserve below which automatic actions are blocked.

Property MaxToolDefinitions

Gets the maximum number of tool definitions exposed to a request.

Property MaxUncertaintyScore

Gets the maximum uncertainty score accepted in the automatic lane.

Property TokenCostWeight

Gets the resource weight assigned to each token.

Property ToolDefinitionCost

Gets the resource weight assigned to each exposed tool definition.

Property MemoryEntryCost

Gets the resource weight assigned to each injected memory entry.

Property ReviewMinuteCost

Gets the resource weight assigned to each declared review minute.

Property UncertaintyCost

Gets the resource weight assigned to normalized uncertainty.

Property ClaimBoundaryRules

Gets additional claim-boundary rules used by the default claim policy.

Property ConstraintRules

Gets additional request-side constraint rules used by the default constraint policy.

StreamingDeltaUAIX.LmRuntime.Contracts 8 members

Represents a normalized streaming inference event.

Property Type

Gets the event type.

Property ResponseId

Gets the response identifier.

Property Text

Gets the text delta for text events.

Property ToolCallId

Gets the tool-call identifier for tool deltas.

Property ToolArgumentsDelta

Gets a tool-call argument delta.

Property Usage

Gets the usage payload when available.

Property Error

Gets an error message for error events.

Property CreatedUtc

Gets the UTC event timestamp.

StreamingEventTypeUAIX.LmRuntime.Contracts 6 members

Identifies the type of a streaming inference event.

Field Start

The stream has started.

Field Delta

The event contains text delta content.

Field ToolCallDelta

The event contains a tool-call delta.

Field Usage

The event contains usage information.

Field Completed

The stream completed successfully.

Field Error

The stream completed with an error.

TokenCountResultUAIX.LmRuntime.Contracts 2 members

Represents tokenizer count output.

Property TokenCount

Gets the token count.

Property Tokenizer

Gets the tokenizer name.

ToolChoiceUAIX.LmRuntime.Contracts 2 members

Defines runtime guidance for model tool selection.

Property Automatic

Gets a value indicating whether tool selection is automatic.

Property RequiredToolName

Gets a required tool name when a specific tool must be used.

ToolDefinitionUAIX.LmRuntime.Contracts 3 members

Defines a callable tool exposed through the canonical inference contract.

Property Name

Gets the tool name.

Property Description

Gets the tool description.

Property JsonSchema

Gets the JSON schema document used to validate tool arguments.

UaiFileMemoryOptionsUAIX.LmRuntime.Contracts 6 members

Defines settings for the .uai file memory store.

Property RootDirectory

Gets the root directory for .uai memory files.

Property MemoryFileName

Gets the memory file name.

Property IncludeShortTermMemoryFiles

Gets a value indicating whether short-term .sui memory files are included when reading memory.

Property ShortTermMemoryDirectoryName

Gets the directory name under the .uai root that contains short-term .sui memory files.

Property ShortTermMemoryFilePattern

Gets the file search pattern used to discover short-term memory units.

Property SkipInvalidEntries

Gets a value indicating whether invalid entries should be skipped instead of throwing.

UaiMemoryEntryUAIX.LmRuntime.Contracts 6 members

Represents a persisted .uai memory entry.

Property EntryId

Gets the memory entry identifier.

Property ConversationId

Gets the associated conversation identifier.

Property Role

Gets the memory role.

Property Content

Gets the persisted memory content.

Property CreatedUtc

Gets the UTC creation timestamp.

Property ContentSha256

Gets the SHA-256 hash of normalized content.

RuntimeTelemetryNamesUAIX.LmRuntime.Diagnostics 16 members

Defines stable telemetry names emitted by the runtime core.

Field SourceName

The ActivitySource and Meter name.

Field RequestCounter

Request counter metric name.

Field FailureCounter

Failure counter metric name.

Field RequestDurationMs

Request duration histogram metric name.

Field BudgetDecisionCounter

Teleodynamic budget decision counter metric name.

Field BlockedActionCounter

Teleodynamic blocked action counter metric name.

Field ClaimBoundaryViolationCounter

Claim-boundary violation counter metric name.

Field ReviewGateDecisionCounter

Slow-loop review-gate decision counter metric name.

Field QuarantineRecordCounter

Quarantine record counter metric name.

Field ConstraintDecisionCounter

Request-side constraint decision counter metric name.

Field ConstraintViolationCounter

Request-side constraint violation counter metric name.

Field EvidenceReceiptCounter

Evidence receipt counter metric name.

Field TeleodynamicControlDecisionCounter

Explicit teleodynamic control-cycle decision counter metric name.

Field TeleodynamicNoOpCounter

Explicit teleodynamic control-cycle no-op counter metric name.

Field MemoryFirewallDecisionCounter

Memory-firewall decision counter metric name.

Field MemoryQuarantineCounter

Memory-firewall quarantine counter metric name.

BudgetDecisionStatusUAIX.LmRuntime.Governance 4 members

Identifies the outcome of a runtime budget evaluation.

Field Unknown

The decision has not been evaluated.

Field Approved

The requested action is affordable under the configured viability budget.

Field Blocked

The requested action is blocked and should not execute automatically.

Field NoOpSelected

No-op was selected as the dominant safe action.

ClaimBoundaryDecisionUAIX.LmRuntime.Governance 6 members

Represents the result of applying claim-boundary rules to text.

Property Allowed

Gets a value indicating whether the text stayed within claim boundaries.

Property ViolatedRuleIds

Gets violated rule identifiers.

Property NoOpReason

Gets the selected no-op reason when the decision blocks output.

Property Message

Gets a bounded decision message.

Property SafeReplacementText

Gets a replacement text that can be emitted in automatic lanes.

Property CreatedUtc

Gets the UTC decision timestamp.

ClaimBoundaryRuleUAIX.LmRuntime.Governance 5 members

Defines a bounded claim-boundary rule applied to generated text or runtime claims.

Property RuleId

Gets the stable rule identifier.

Property Pattern

Gets the case-insensitive text pattern that triggers the rule.

Property Severity

Gets the rule severity.

Property NoOpReason

Gets the no-op reason associated with this rule.

Property Message

Gets the human-readable rule message.

ClaimBoundarySeverityUAIX.LmRuntime.Governance 3 members

Identifies how strongly a claim-boundary rule should affect runtime behavior.

Field Advisory

The rule adds advisory context only.

Field Warning

The rule should be logged and surfaced for review.

Field Block

The rule should block or replace output in automatic lanes.

ClaimLifecycleStatusUAIX.LmRuntime.Governance 6 members

Identifies the bounded evidence lifecycle assigned to a runtime or release claim.

Field Raw

The claim has been captured but has not received an evidence review.

Field Reviewed

The claim and cited evidence have received an initial review.

Field Bounded

The claim is constrained to an explicit domain, digest, environment, and evidence scope.

Field Promoted

The bounded claim has received explicit human approval for its intended publication lane.

Field Restricted

The claim remains usable only under narrower restrictions than originally requested.

Field Rejected

The claim has been rejected and cannot be promoted without a new evidence cycle.

ClaimStatusUAIX.LmRuntime.Governance 6 members

Identifies the evidence status assigned to a claim, generated artifact, or externally visible output.

Field Unknown

No claim status was supplied.

Field PublicReadyTemplate

The artifact is a buyer-safe methodology template and does not claim a specific client outcome.

Field GeneratedNeedsHumanReview

The artifact was generated or transformed by automation and requires human review before promotion.

Field ApprovedPublicOutcome

The artifact is an approved public outcome with reviewed evidence.

Field MachineReadableEvidence

The artifact is a machine-readable evidence payload intended for agents and due-diligence workflows.

Field Quarantined

The artifact is quarantined and cannot be promoted automatically.

ClaimTransitionDecisionUAIX.LmRuntime.Governance 5 members

Reports whether a requested claim-lifecycle transition is permitted by evidence and authority rules.

Property ClaimId

Gets the stable claim identifier.

Property Allowed

Gets a value indicating whether the requested transition is allowed.

Property EffectiveStatus

Gets the status that remains effective after policy evaluation.

Property NoOpReason

Gets the no-op reason when the requested transition is denied.

Property Message

Gets the bounded policy explanation.

ClaimTransitionRequestUAIX.LmRuntime.Governance 11 members

Describes one requested transition in the explicit claim-evidence lifecycle.

Property ClaimId

Gets the stable claim identifier.

Property CurrentStatus

Gets the currently recorded claim lifecycle status.

Property RequestedStatus

Gets the requested claim lifecycle status.

Property IndependentEvidenceCount

Gets the number of independent evidence references attached to the claim.

Property DomainBounded

Gets a value indicating whether the claim has an explicit domain and applicability boundary.

Property ResourceTracePresent

Gets a value indicating whether a candidate-bound resource trace supports the claim.

Property AuditTracePresent

Gets a value indicating whether an auditable decision trace supports the claim.

Property HumanApproved

Gets a value indicating whether an authorized human reviewer approved the requested promotion.

Property ProximityOnlyEvidence

Gets a value indicating whether the transition is justified only by association with a nearby approved claim.

Property BoundaryViolation

Gets a value indicating whether a claim-boundary violation was identified.

Property Rationale

Gets the bounded reviewer rationale attached to restrictive or terminal transitions.

ConstraintClosureReportUAIX.LmRuntime.Governance 6 members

Reports whether active work and constraint nodes participate in closed maintenance cycles.

Property RegistryId

Gets the analyzed registry identifier.

Property Closed

Gets a value indicating whether every active work and constraint node participates in a closed cycle.

Property StronglyConnectedComponents

Gets strongly connected components in deterministic node-identifier order.

Property OpenNodeIds

Gets active node identifiers that do not participate in a closed directed cycle.

Property RetirementCandidateIds

Gets active nodes marked for bounded retirement because maintenance burden exceeds evidence strength.

Property Diagnostics

Gets validation diagnostics for duplicate identifiers, missing endpoints, or invalid values.

ConstraintDecisionUAIX.LmRuntime.Governance 8 members

Represents the result of request-side constraint evaluation.

Property Allowed

Gets a value indicating whether automatic execution may proceed.

Property RequiresReview

Gets a value indicating whether the resulting artifact requires review before promotion.

Property NoOpReason

Gets the selected no-op reason when automatic execution is blocked.

Property MatchedRuleIds

Gets matched rule identifiers.

Property EvidenceReferences

Gets evidence references associated with this decision.

Property Message

Gets a bounded explanation suitable for logs and receipts.

Property SafeReplacementText

Gets the safe replacement text when no-op is selected.

Property CreatedUtc

Gets the UTC decision timestamp.

ConstraintEdgeUAIX.LmRuntime.Governance 4 members

Defines one directed maintenance or channeling relationship in a work-constraint graph.

Property SourceNodeId

Gets the source node identifier.

Property TargetNodeId

Gets the target node identifier.

Property Relationship

Gets the bounded relationship label, such as maintains, channels, depends-on, or verifies.

Property Strength

Gets the normalized relationship strength in the inclusive range from zero through one.

ConstraintNodeUAIX.LmRuntime.Governance 6 members

Defines one bounded node in the work-constraint closure graph.

Property NodeId

Gets the stable node identifier.

Property Kind

Gets the semantic role of the node.

Property Status

Gets the lifecycle state of the node.

Property MaintenanceBurden

Gets the non-negative recurring maintenance burden attributed to the node.

Property EvidenceStrength

Gets the non-negative evidence strength associated with the node.

Property Metadata

Gets bounded metadata attached to the node.

ConstraintNodeKindUAIX.LmRuntime.Governance 4 members

Identifies the role a node plays in a work-constraint registry.

Field Unknown

No node role was supplied.

Field Work

The node represents work that consumes resources and maintains constraints.

Field Constraint

The node represents a constraint that channels or limits work.

Field Evidence

The node represents evidence that supports a work or constraint decision.

ConstraintNodeStatusUAIX.LmRuntime.Governance 3 members

Identifies the current lifecycle state of a work-constraint registry node.

Field Active

The node is active and participates in closure analysis.

Field Frozen

The node is frozen and remains auditable but cannot be expanded automatically.

Field Retired

The node is retired and excluded from active closure analysis.

ConstraintRegistrySnapshotUAIX.LmRuntime.Governance 4 members

Captures one immutable work-constraint graph supplied to closure analysis.

Property RegistryId

Gets the stable registry identifier.

Property Nodes

Gets the graph nodes.

Property Edges

Gets the directed graph edges.

Property CapturedUtc

Gets the UTC time at which the registry snapshot was captured.

ConstraintRuleUAIX.LmRuntime.Governance 5 members

Defines an evidence-bounded runtime constraint rule.

Property RuleId

Gets the stable rule identifier.

Property Scope

Gets the inspected request surface.

Property Pattern

Gets the case-insensitive substring pattern matched by the default policy.

Property Severity

Gets the severity applied when the pattern is matched.

Property Message

Gets the bounded explanation for the decision receipt.

ConstraintScopeUAIX.LmRuntime.Governance 5 members

Identifies the request surface inspected by a constraint rule.

Field All

Inspect all supported request surfaces.

Field Messages

Inspect message content.

Field Metadata

Inspect metadata keys and values.

Field Tools

Inspect tool names, descriptions, and schemas.

Field ResponseFormat

Inspect response-format hints and schemas.

ConstraintSeverityUAIX.LmRuntime.Governance 3 members

Identifies how strongly a runtime constraint rule affects automatic execution.

Field Information

The rule only records an informational observation.

Field ReviewRequired

The rule permits execution but marks the artifact as needing review before promotion.

Field Block

The rule blocks automatic execution and selects no-op.

EvidenceReferenceUAIX.LmRuntime.Governance 5 members

Identifies a source used to justify a governance decision.

Property Source

Gets the evidence source name.

Property Path

Gets the evidence path or stable identifier.

Property Sha256

Gets the SHA-256 hash when the evidence is file-backed.

Property Span

Gets the line, byte, or section span when one is present.

Property Note

Gets a bounded note describing why the evidence is relevant.

GovernanceActionKindUAIX.LmRuntime.Governance 8 members

Identifies a Teleodynamic structural operator selected by the runtime control plane.

Field Unknown

No operator was specified.

Field Add

Add a bounded structure such as a tool, memory edge, adapter, or prompt template.

Field Merge

Merge overlapping structures after evidence shows lower maintenance burden.

Field Split

Split a structure into narrower lanes when evidence shows ambiguous or overloaded behavior.

Field Retire

Retire a structure whose maintenance burden is no longer justified.

Field NoOp

Select no mutation because evidence, budget, or claim boundaries do not justify action.

Field Reactivate

Reactivate a previously frozen structure after fresh evidence repays its maintenance burden.

Field Freeze

Freeze a structure so it remains auditable while automatic expansion and promotion are disabled.

GovernanceDecisionReceiptUAIX.LmRuntime.Governance 14 members

Represents an immutable evidence-bearing receipt for a runtime governance decision.

Property ReceiptId

Gets the stable receipt identifier.

Property PackageVersion

Gets the package version that emitted the receipt.

Property Model

Gets the associated model identifier.

Property ConversationId

Gets the conversation identifier when one is present.

Property Action

Gets the selected structural operator.

Property NoOpReason

Gets the selected no-op reason when applicable.

Property BudgetDecision

Gets the budget decision that contributed to the receipt.

Property ClaimBoundaryDecision

Gets the claim-boundary decision that contributed to the receipt.

Property ConstraintDecision

Gets the request-side constraint decision that contributed to the receipt.

Property ReviewGateDecision

Gets the slow-loop review-gate decision that contributed to the receipt.

Property QuarantineRecord

Gets the quarantine record emitted for the receipt, when one was written.

Property EvidenceReferences

Gets evidence references associated with the receipt.

Property Metadata

Gets bounded metadata for downstream audit and telemetry correlation.

Property CreatedUtc

Gets the UTC receipt timestamp.

MemoryFirewallPolicyUAIX.LmRuntime.Governance 8 members

Defines source, freshness, contradiction, entropy, and review boundaries for memory promotion.

Property MaximumEntropyScore

Gets the maximum entropy accepted in the automatic memory lane.

Property MinimumTrustScore

Gets the minimum source trust accepted in the automatic memory lane.

Property MaximumShortTermAge

Gets the maximum age of short-term packets before retirement.

Property MaximumMediumTermAge

Gets the maximum age of medium-term packets before retirement.

Property MaximumLongTermAge

Gets the maximum age of long-term packets before re-review is required.

Property MinimumLongTermEvidenceReferences

Gets the minimum evidence count required for durable long-term promotion.

Property RequireHumanReviewForLongTerm

Gets a value indicating whether long-term promotion requires explicit human review.

Property ReviewContradictions

Gets a value indicating whether contradiction always routes a packet to review.

MemoryPacketUAIX.LmRuntime.Governance 14 members

Describes a privacy-preserving memory packet using provenance and integrity metadata rather than raw content.

Property PacketId

Gets the stable packet identifier.

Property Tier

Gets the requested source-routed memory tier.

Property Source

Gets the bounded provenance source.

Property DeclaredContentSha256

Gets the content SHA-256 declared by the producer.

Property ObservedContentSha256

Gets the independently observed content SHA-256 supplied to the firewall.

Property EntropyScore

Gets the normalized entropy or unresolved uncertainty score.

Property TrustScore

Gets the normalized trust score assigned to the packet source.

Property IsCorrupt

Gets a value indicating whether an upstream integrity check marked the packet corrupt.

Property HasContradiction

Gets a value indicating whether the packet contradicts active memory.

Property ContradictionReferences

Gets bounded references to memories or evidence involved in a contradiction.

Property EvidenceReferences

Gets evidence references supporting provenance, trust, or contradiction analysis.

Property Metadata

Gets bounded metadata that excludes raw memory content.

Property CreatedUtc

Gets the UTC packet creation time.

Property ExpiresUtc

Gets the explicit UTC expiry time when one is present.

MemoryFirewallRequestUAIX.LmRuntime.Governance 4 members

Describes the requested disposition and review proof evaluated by the memory firewall.

Property Packet

Gets the packet to evaluate.

Property RequestedStatus

Gets the requested packet status.

Property HumanReviewed

Gets a value indicating whether explicit human review was completed.

Property ReviewReference

Gets a bounded review reference when human review was completed.

MemoryFirewallDecisionUAIX.LmRuntime.Governance 7 members

Represents the memory-firewall disposition for one packet.

Property PacketId

Gets the packet identifier.

Property Status

Gets the resulting firewall status.

Property Reason

Gets the primary firewall reason.

Property TargetTier

Gets the target memory tier when promotion is allowed.

Property Message

Gets a bounded explanation of the disposition.

Property CreatedUtc

Gets the UTC time at which the firewall decision was produced.

Property PromotionAllowed

Gets a value indicating whether the requested promotion was approved.

MemoryTierUAIX.LmRuntime.Governance 3 members

Identifies a source-routed memory tier managed by the memory firewall.

Field ShortTerm

Ephemeral context retained only for the immediate execution window.

Field MediumTerm

Reviewable working memory retained across a bounded sequence of interactions.

Field LongTerm

Durable memory that requires provenance, contradiction checks, and governed promotion.

MemoryPacketStatusUAIX.LmRuntime.Governance 6 members

Identifies the current firewall disposition of a memory packet.

Field Candidate

The packet has not yet passed a firewall decision.

Field Quarantined

The packet is isolated from active retrieval while evidence or integrity is unresolved.

Field ReviewRequired

The packet is structurally valid but requires an explicit human or policy review.

Field Promoted

The packet may be persisted in the approved target tier.

Field Rejected

The packet is invalid or unsafe to retain as active memory.

Field Retired

The packet is no longer eligible for active retrieval because it expired or was superseded.

MemoryFirewallReasonUAIX.LmRuntime.Governance 11 members

Identifies the primary reason for a memory-firewall disposition.

Field None

No firewall restriction was required.

Field MissingSource

The packet does not identify a bounded source.

Field InvalidDigest

A declared or observed content digest is not a valid SHA-256 value.

Field DigestMismatch

The observed content digest does not match the packet declaration.

Field CorruptPacket

The packet was explicitly marked corrupt.

Field StalePacket

The packet exceeded its freshness or expiry boundary.

Field ExcessiveEntropy

The packet entropy or uncertainty exceeds the automatic-lane threshold.

Field LowTrust

The packet source trust score is below the configured threshold.

Field Contradiction

The packet conflicts with one or more active memory references.

Field ReviewRequired

The requested tier or disposition requires explicit review.

Field EvidenceRequired

The requested promotion does not carry sufficient evidence references.

NoOpReasonUAIX.LmRuntime.Governance 16 members

Identifies why the runtime selected no-op.

Field None

No no-op reason applies.

Field InsufficientBudget

The requested work would exceed the available runtime budget.

Field ViabilityFloor

The requested work would cross the configured viability floor.

Field ExcessiveUncertainty

The uncertainty score is too high for an automatic action.

Field ReviewRequired

The requested work requires human review before promotion.

Field ClaimBoundaryViolation

The request or output crosses a claim boundary.

Field UnsupportedAction

The requested operator is not supported in the current lane.

Field WeakEvidence

The evidence packet is too weak to justify a structural mutation.

Field HardLimitExceeded

The request exceeds a hard token, memory, or tool-count limit.

Field ConstraintViolation

The request crossed a configured runtime constraint.

Field NoFeasibleImprovement

No feasible proposal produced enough bounded benefit to dominate explicit no-op.

Field MaintenanceCycleOpen

The work-constraint registry contains active structure outside a closed maintenance cycle.

Field PromotionEvidenceMissing

The requested claim promotion lacks candidate-bound evidence required by the lifecycle policy.

Field ProximityOnlyEvidence

The requested claim promotion relies only on proximity to another approved claim.

Field PhaseUnstable

The current structural phase is unstable and does not permit the requested automatic action.

Field IrreversibleAction

The proposed action lacks a documented rollback path required for automatic execution.

QuarantineRecordUAIX.LmRuntime.Governance 11 members

Represents an append-only quarantine ledger record for generated or unreviewed runtime artifacts.

Property RecordId

Gets the stable quarantine record identifier.

Property PackageVersion

Gets the package version that emitted the quarantine record.

Property ArtifactKind

Gets the artifact kind associated with the quarantine record.

Property Model

Gets the associated model identifier.

Property ConversationId

Gets the conversation identifier when one is present.

Property ClaimStatus

Gets the evidence status assigned to the quarantined item.

Property ReviewGateStatus

Gets the review-gate status that caused the record.

Property Reason

Gets the reason the item was quarantined or review-gated.

Property EvidenceReferences

Gets evidence references associated with the quarantined item.

Property Metadata

Gets bounded metadata for downstream review tools.

Property CreatedUtc

Gets the UTC record timestamp.

ResourceEconomyInputUAIX.LmRuntime.Governance 13 members

Describes the measured benefits and burdens of one observation or proposed structural action.

Property CorrelationId

Gets the stable correlation identifier copied into the transition record.

Property PredictiveGain

Gets observed predictive-loss reduction expressed as a non-negative raw gain.

Property ActionCost

Gets the estimated one-time cost of validating and applying the action.

Property MaintenanceCost

Gets the estimated recurring cost of retaining and reviewing the resulting structure.

Property EnergyCost

Gets the estimated energy or compute cost associated with the action.

Property MemoryCost

Gets the estimated memory cost associated with the action.

Property ReviewCost

Gets the estimated human-review cost associated with the action.

Property UncertaintyCost

Gets an explicit uncertainty cost already expressed in resource units.

Property UncertaintyScore

Gets normalized proposal uncertainty in the inclusive range from zero through one.

Property CountAsObservation

Gets a value indicating whether the committed transition represents a fast-loop observation.

Property CountAsStructuralReservation

Gets a value indicating whether the committed transition represents a structural reservation.

Property EvidenceReferences

Gets evidence references supporting the gain and cost measurements.

Property TransitionUtc

Gets the zero-offset UTC timestamp assigned to the transition.

ResourceEconomyPolicyUAIX.LmRuntime.Governance 13 members

Defines the bounded resource-economy policy used when structural proposals and observations are evaluated.

The policy makes predictive gain, action cost, maintenance burden, compute, memory, review, uncertainty, viability, and capacity explicit so a structural action cannot be justified by accuracy alone.

Property InitialResource

Gets the resource balance assigned to a newly created stateful economy.

Property Capacity

Gets the maximum resource balance retained after a transition.

Property ViabilityFloor

Gets the minimum resource balance required for an automatically approved action.

Property DecayRate

Gets the fraction of the current resource balance removed as natural decay during each observation.

Property PredictiveGainWeight

Gets the multiplier applied to measured predictive gain.

Property ActionCostWeight

Gets the multiplier applied to one-time action cost.

Property MaintenanceCostWeight

Gets the multiplier applied to recurring maintenance cost.

Property EnergyCostWeight

Gets the multiplier applied to estimated compute or energy cost.

Property MemoryCostWeight

Gets the multiplier applied to memory cost.

Property ReviewCostWeight

Gets the multiplier applied to human-review cost.

Property UncertaintyReserve

Gets the resource reserve charged for a fully uncertain proposal.

Property UncertaintyReserveRatio

Gets the fraction of the current balance protected in addition to the viability floor.

Property MinimumNetGain

Gets the minimum positive net resource change required before growth is preferred over no-op.

ResourceEconomyStateUAIX.LmRuntime.Governance 16 members

Captures the endogenous resource state carried between fast observations and structural-control cycles.

This state is independent from model weights and inference tensors. It records bounded accounting context used by the slow-loop control plane and cannot alter deterministic parity mode without an explicit external actuator.

Property Cycle

Gets the monotonically increasing committed transition number.

Property CurrentResource

Gets the current resource balance available to sustain runtime structure.

Property CumulativePredictiveGain

Gets the cumulative weighted predictive gain accepted by the resource economy.

Property CumulativeDecay

Gets the cumulative endogenous decay charged by the resource economy.

Property CumulativeActionCost

Gets the cumulative weighted one-time action cost charged by the resource economy.

Property CumulativeMaintenanceCost

Gets the cumulative weighted maintenance cost charged by the resource economy.

Property CumulativeEnergyCost

Gets the cumulative weighted energy or compute cost charged by the resource economy.

Property CumulativeMemoryCost

Gets the cumulative weighted memory cost charged by the resource economy.

Property CumulativeReviewCost

Gets the cumulative weighted human-review cost charged by the resource economy.

Property CumulativeUncertaintyReserve

Gets the cumulative uncertainty reserve charged by the resource economy.

Property ObservationCount

Gets the number of accepted fast-loop observations.

Property StructuralReservationCount

Gets the number of approved structural reservations.

Property Capacity

Gets the configured capacity used to normalize resource-retention metrics.

Property ViabilityFloor

Gets the configured viability floor used by the current economy.

Property UpdatedUtc

Gets the UTC time at which the state was produced.

Property ViabilityMargin

Gets current resource minus the configured viability floor.

ResourceEconomyTransitionUAIX.LmRuntime.Governance 20 members

Reports the deterministic resource transition associated with an observation or structural proposal.

Property CorrelationId

Gets the observation or reservation identifier associated with the transition.

Property PreviousState

Gets the state supplied before the transition.

Property NextState

Gets the candidate state after gain, decay, costs, and capacity bounds are applied.

Property WeightedGain

Gets the weighted gain credited to the transition.

Property DecayCost

Gets the natural decay charged to the transition.

Property ProtectedReserve

Gets the resource reserve protected in proportion to the previous balance.

Property EffectiveViabilityFloor

Gets the effective viability boundary after the protected reserve is added to the policy floor.

Property WeightedActionCost

Gets the weighted one-time action cost.

Property WeightedMaintenanceCost

Gets the weighted recurring maintenance cost.

Property WeightedEnergyCost

Gets the weighted compute or energy cost.

Property WeightedMemoryCost

Gets the weighted memory cost.

Property WeightedReviewCost

Gets the weighted human-review cost.

Property WeightedUncertaintyCost

Gets the uncertainty cost and reserve charged to the transition.

Property WeightedCost

Gets the total weighted action, maintenance, compute, memory, review, and uncertainty cost.

Property NetChange

Gets the unconstrained net resource change before capacity clamping.

Property Viable

Gets a value indicating whether the transition remains above the configured viability floor.

Property NoOpReason

Gets the no-op reason when the transition is not viable or does not repay its burden.

Property EvidenceReferences

Gets the evidence references supporting the transition inputs.

Property Message

Gets the bounded diagnostic explaining the transition outcome.

Property CreatedUtc

Gets the UTC time at which the transition was produced.

ReviewGateDecisionUAIX.LmRuntime.Governance 9 members

Represents the result of slow-loop review gate evaluation.

Property Status

Gets the review-gate status.

Property Allowed

Gets a value indicating whether automatic execution may proceed.

Property NoOpReason

Gets the no-op reason when automatic execution is blocked.

Property ReviewState

Gets the review-state label assigned to the work.

Property TriggeredRuleIds

Gets triggered review rule identifiers.

Property EvidenceReferences

Gets evidence references used by the review decision.

Property Metadata

Gets bounded metadata for review and audit systems.

Property Message

Gets the bounded review-gate message.

Property CreatedUtc

Gets the UTC review-gate timestamp.

ReviewGateRequestUAIX.LmRuntime.Governance 9 members

Describes runtime work that may require slow-loop review before execution or promotion.

Property Model

Gets the model identifier associated with the request.

Property ConversationId

Gets the conversation identifier when one is present.

Property Action

Gets the requested structural operator.

Property UncertaintyScore

Gets the normalized uncertainty score declared by the caller or derived by the orchestrator.

Property ToolDefinitions

Gets the number of tool definitions exposed to the request.

Property EstimatedReviewMinutes

Gets the estimated review effort in minutes.

Property EvidenceReferences

Gets evidence references used to justify the action.

Property Metadata

Gets bounded request metadata used by the policy.

Property CreatedUtc

Gets the UTC timestamp when the review request was created.

ReviewGateStatusUAIX.LmRuntime.Governance 5 members

Identifies the slow-loop review gate disposition.

Field Unknown

No review decision has been made.

Field NotRequired

No additional review is required for the selected lane.

Field Approved

The work is already approved for the selected lane.

Field ReviewRequired

Human review is required before promotion or mutation.

Field Blocked

The work is blocked by review policy and must select no-op.

RuntimeBudgetUAIX.LmRuntime.Governance 12 members

Defines resource-economy limits used by the runtime budget governor.

Property AvailableResource

Gets the available resource budget for the evaluated action.

Property ViabilityFloor

Gets the minimum reserve below which automatic structural growth is blocked.

Property MaxInputTokens

Gets the maximum accepted input-token count before a hard limit block.

Property MaxOutputTokens

Gets the maximum accepted output-token count before a hard limit block.

Property MaxToolDefinitions

Gets the maximum number of tool definitions accepted before a hard limit block.

Property MaxMemoryEntries

Gets the maximum number of injected memory entries accepted before a hard limit block.

Property MaxUncertaintyScore

Gets the maximum uncertainty score accepted in an automatic lane.

Property TokenCostWeight

Gets the resource cost assigned to each token considered by the request.

Property ToolDefinitionCost

Gets the resource cost assigned to each exposed tool definition.

Property MemoryEntryCost

Gets the resource cost assigned to each injected memory entry.

Property ReviewMinuteCost

Gets the resource cost assigned to each declared review minute.

Property UncertaintyCost

Gets the resource cost assigned to uncertainty after normalization.

RuntimeBudgetDecisionUAIX.LmRuntime.Governance 10 members

Represents a resource-economy decision for a runtime action.

Property Status

Gets the decision status.

Property Action

Gets the selected or requested structural operator.

Property NoOpReason

Gets the no-op reason when no-op was selected.

Property Approved

Gets a value indicating whether the action may continue automatically.

Property ActionCost

Gets the computed action cost.

Property AvailableResource

Gets the configured available resource value used during evaluation.

Property ViabilityFloor

Gets the configured viability floor used during evaluation.

Property Message

Gets a bounded decision message.

Property CreatedUtc

Gets the UTC decision timestamp.

Property EvidenceReferences

Gets evidence references used by the decision.

RuntimeBudgetRequestUAIX.LmRuntime.Governance 12 members

Describes a request or structural action being evaluated by the runtime budget governor.

Property Model

Gets the model identifier associated with the action.

Property ConversationId

Gets the conversation identifier associated with the action when one is present.

Property Action

Gets the requested structural operator.

Property InputTokens

Gets the input-token count after prompt and memory preparation.

Property OutputTokens

Gets the requested output-token count.

Property ToolDefinitions

Gets the number of tool definitions exposed to the model.

Property MemoryEntries

Gets the number of memory entries injected into the prompt.

Property RetrievalFanOut

Gets the retrieval fan-out declared by the caller or retriever.

Property EstimatedReviewMinutes

Gets the expected human review burden in minutes.

Property UncertaintyScore

Gets the normalized uncertainty score in the range zero to one.

Property Source

Gets a bounded source label for the evaluated action.

Property EvidenceReferences

Gets evidence references associated with the decision.

StructuralChangeDecisionUAIX.LmRuntime.Governance 9 members

Represents the budget, review, and evidence disposition for a proposed structural change.

Property Status

Gets the decision status.

Property Action

Gets the selected structural operator.

Property NoOpReason

Gets the no-op reason when no mutation may occur.

Property BudgetDecision

Gets the resource-budget decision.

Property ReviewGateDecision

Gets the review-gate decision.

Property GovernanceReceipt

Gets the governance receipt emitted for the decision.

Property QuarantineRecord

Gets the quarantine record when the decision is held for review.

Property Message

Gets the bounded decision message.

Property CreatedUtc

Gets the UTC decision timestamp.

StructuralChangeDecisionStatusUAIX.LmRuntime.Governance 3 members

Identifies the disposition of a proposed structural change.

Field Approved

The proposal was approved for the caller's mutation lane.

Field RequiresReview

The proposal was converted to slow-loop review.

Field NoOpSelected

No mutation is allowed.

StructuralChangeRequestUAIX.LmRuntime.Governance 16 members

Describes a slow-loop structural change proposed for runtime configuration, memory, tools, prompts, or backend routing.

Property ChangeId

Gets the caller-provided change identifier.

Property Action

Gets the requested structural operator.

Property TargetKind

Gets the type of structure being changed.

Property TargetName

Gets the structure name or stable key.

Property Model

Gets the model affected by the change when one is present.

Property ConversationId

Gets the conversation affected by the change when one is present.

Property EstimatedTokens

Gets the estimated token cost of validating or applying the change.

Property EstimatedOutputTokens

Gets the estimated generated-token cost of validation or rollback evidence.

Property ToolDefinitions

Gets the number of tool definitions affected by the change.

Property MemoryEntries

Gets the number of memory entries affected by the change.

Property RetrievalFanOut

Gets the retrieval fan-out required to validate the change.

Property EstimatedReviewMinutes

Gets the estimated human review minutes required before promotion.

Property UncertaintyScore

Gets the normalized uncertainty score for the change.

Property ClaimText

Gets claim text associated with the change, if any.

Property EvidenceReferences

Gets evidence references supporting the requested change.

Property Metadata

Gets bounded metadata used for receipts and audit correlation.

StructuralObservationWindowUAIX.LmRuntime.Governance 10 members

Captures bounded fast-loop measurements used by the slow-loop structural phase detector.

Property SampleCount

Gets the number of observations represented by the window.

Property CurrentLoss

Gets the current normalized predictive loss.

Property PreviousLoss

Gets the normalized predictive loss from the preceding window.

Property CurrentComplexity

Gets the current normalized structural complexity.

Property PreviousComplexity

Gets the normalized structural complexity from the preceding window.

Property ActionRate

Gets the normalized rate of structural actions in the current window.

Property DriftScore

Gets the normalized cross-context or cross-window behavioral drift score.

Property ViabilityMargin

Gets the resource balance minus the configured viability floor.

Property MaintenancePressure

Gets the normalized recurring maintenance pressure.

Property ReviewPressure

Gets the normalized human-review pressure associated with recent changes.

StructuralPhaseUAIX.LmRuntime.Governance 7 members

Identifies the bounded structural regime inferred from a recent observation window.

Field Unknown

The observation window is too small or invalid for classification.

Field UnderStructured

Loss remains high while complexity is low and viable growth remains available.

Field Growth

Loss is improving while bounded structural change remains economically viable.

Field PhaseLocked

Loss, complexity, and action rate are stable within configured tolerances.

Field OverStructured

Complexity and maintenance burden are growing without sufficient predictive improvement.

Field ResourceConstrained

The resource margin is too small to support additional automatic structure.

Field Drifting

Behavior has changed materially across contexts or observation periods.

StructuralPhaseAssessmentUAIX.LmRuntime.Governance 5 members

Reports the structural phase and bounded reasoning derived from an observation window.

Property Phase

Gets the classified structural phase.

Property Confidence

Gets the normalized confidence in the classification.

Property LossDelta

Gets the current-minus-previous loss delta.

Property ComplexityDelta

Gets the current-minus-previous complexity delta.

Property Message

Gets the bounded explanation for the classification.

StructuralPhasePolicyUAIX.LmRuntime.Governance 11 members

Defines deterministic thresholds for structural phase classification.

Property MinimumSampleCount

Gets the minimum number of observations required for classification.

Property HighLossThreshold

Gets the loss level above which a low-complexity system is considered under-structured.

Property LowComplexityThreshold

Gets the complexity level below which a high-loss system may be considered under-structured.

Property GrowthImprovementThreshold

Gets the minimum loss improvement required to classify a window as bounded growth.

Property StableLossDelta

Gets the maximum absolute loss change accepted as phase-locked stability.

Property StableComplexityDelta

Gets the maximum absolute complexity change accepted as phase-locked stability.

Property StableActionRate

Gets the maximum action rate accepted as phase-locked stability.

Property DriftThreshold

Gets the drift score at or above which the phase is classified as drifting.

Property ComplexityGrowthThreshold

Gets the complexity-growth threshold used to identify over-structured behavior.

Property MaintenancePressureThreshold

Gets the maintenance-pressure threshold used to identify over-structured behavior.

Property ReviewPressureThreshold

Gets the review-pressure threshold used to identify over-structured behavior.

StructuralProposalUAIX.LmRuntime.Governance 15 members

Describes one reversible candidate considered by the slow structural-control loop.

Property ProposalId

Gets the stable proposal identifier used by evidence and trace records.

Property Action

Gets the proposed structural operator.

Property Target

Gets the stable name of the affected structure.

Property ExpectedPredictiveGain

Gets the expected predictive-loss reduction produced by the proposal.

Property ComplexityDelta

Gets the signed structural-complexity change; negative values reduce complexity.

Property ActionCost

Gets the one-time cost of validating and applying the proposal.

Property MaintenanceCost

Gets the recurring cost of retaining the resulting structure.

Property EnergyCost

Gets the estimated energy or compute cost of the proposal.

Property MemoryCost

Gets the estimated memory cost of the proposal.

Property ReviewCost

Gets the expected human-review cost of the proposal.

Property UncertaintyScore

Gets the normalized proposal uncertainty in the inclusive range from zero through one.

Property Reversible

Gets a value indicating whether the proposal can be rolled back through a documented inverse action.

Property RequiresHumanReview

Gets a value indicating whether the proposal requires explicit human approval before application.

Property EvidenceReferences

Gets the evidence references that justify the proposal estimates.

Property Metadata

Gets bounded metadata attached to the proposal.

StructuralProposalEvaluationUAIX.LmRuntime.Governance 6 members

Reports the viability and local objective for one structural proposal.

Property Proposal

Gets the evaluated proposal.

Property ResourceTransition

Gets the resource transition predicted for the proposal.

Property Objective

Gets the local objective, where lower values represent a more favorable bounded action.

Property Feasible

Gets a value indicating whether all automatic-action gates were satisfied.

Property NoOpReason

Gets the reason the proposal was excluded when it was not feasible.

Property Message

Gets the bounded diagnostic explaining the evaluation.

TeleodynamicControlRequestUAIX.LmRuntime.Governance 11 members

Aggregates one opt-in slow-loop evaluation without exposing or mutating model inference state.

Property CycleId

Gets the stable control-cycle identifier.

Property Proposals

Gets the structural proposals considered during the cycle.

Property ResourceState

Gets the resource state observed before the cycle.

Property ResourcePolicy

Gets the resource-economy policy.

Property DecisionPolicy

Gets the proposal-scoring and safety policy.

Property ConstraintRegistry

Gets the work-constraint registry snapshot.

Property ObservationWindow

Gets the recent fast-loop observation window.

Property PhasePolicy

Gets the structural-phase threshold policy.

Property ClaimTransition

Gets a claim-lifecycle transition evaluated during the cycle when one is present.

Property CreatedUtc

Gets the zero-offset UTC time assigned to the control cycle and trace entry.

Property Metadata

Gets bounded metadata copied into the decision trace.

TeleodynamicControlResultUAIX.LmRuntime.Governance 5 members

Reports the complete bounded output of one opt-in teleodynamic control cycle.

Property PhaseAssessment

Gets the structural phase inferred from the observation window.

Property ConstraintClosure

Gets the work-constraint closure analysis.

Property Decision

Gets the selected structural proposal or explicit no-op decision.

Property ClaimDecision

Gets the claim-lifecycle decision when one is present.

Property TraceEntry

Gets the tamper-evident trace entry appended for the cycle.

TeleodynamicDecisionUAIX.LmRuntime.Governance 8 members

Reports the deterministic selection made across structural proposals and the explicit no-op candidate.

Property DecisionId

Gets the stable decision identifier.

Property SelectedAction

Gets the selected action, including explicit no-op.

Property SelectedProposalId

Gets the selected proposal identifier, or when no-op wins.

Property NoOpReason

Gets the reason no-op was selected, or for an actionable proposal.

Property Evaluations

Gets all bounded proposal evaluations in stable proposal-identifier order.

Property SelectedTransition

Gets the resource transition associated with the selected proposal or the unchanged no-op state.

Property Message

Gets the bounded explanation for the selected action.

Property CreatedUtc

Gets the UTC time at which the decision was produced.

TeleodynamicDecisionPolicyUAIX.LmRuntime.Governance 8 members

Defines deterministic scoring and safety rules for choosing among structural proposals and no-op.

Property MaximumProposals

Gets the maximum number of proposals accepted in one bounded evaluation.

Property MaximumUncertainty

Gets the maximum uncertainty accepted for an automatically actionable proposal.

Property MinimumEvidenceCount

Gets the minimum evidence-reference count required for an automatically actionable proposal.

Property ComplexityWeight

Gets the weight applied to absolute structural-complexity growth in the local objective.

Property UncertaintyWeight

Gets the weight applied to uncertainty in the local objective.

Property NoOpAdvantage

Gets the non-negative advantage a proposal must have over no-op before it can be selected.

Property RequireReversibleAutomaticAction

Gets a value indicating whether automatic proposal selection requires a documented rollback path.

Property DeferHumanReviewProposals

Gets a value indicating whether proposals requiring human review are excluded from automatic selection.

TeleodynamicTraceEntryUAIX.LmRuntime.Governance 4 members

Represents one immutable entry in the SHA-256-linked teleodynamic decision trace.

Property Sequence

Gets the one-based sequence number assigned by the trace chain.

Property Request

Gets the canonical event request stored by the entry.

Property PreviousHash

Gets the preceding entry hash, or 64 zero characters for the first entry.

Property ContentHash

Gets the lowercase SHA-256 digest covering the sequence, previous hash, and canonical request fields.

TeleodynamicTraceRequestUAIX.LmRuntime.Governance 8 members

Describes the canonical fields appended to the tamper-evident teleodynamic decision trace.

Property EventId

Gets the stable decision or event identifier.

Property EventKind

Gets the bounded event kind.

Property Action

Gets the selected governance action.

Property NoOpReason

Gets the no-op reason when no structural action was selected.

Property ResourceBefore

Gets the resource balance observed before the event.

Property ResourceAfter

Gets the resource balance selected after the event.

Property CreatedUtc

Gets the UTC event time supplied by the orchestrator.

Property Metadata

Gets bounded metadata serialized in ordinal key order.

TraceChainVerificationResultUAIX.LmRuntime.Governance 3 members

Reports whether a teleodynamic trace chain is structurally and cryptographically intact.

Property Valid

Gets a value indicating whether every sequence, predecessor hash, and content hash is valid.

Property FirstInvalidSequence

Gets the one-based sequence number of the first invalid entry, or zero when the chain is valid.

Property Message

Gets the bounded verification diagnostic.

IClaimBoundaryPolicyUAIX.LmRuntime.Abstractions 1 member

Evaluates generated text against bounded claim-boundary rules.

Method EvaluateAsync(string,System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.Governance.ClaimBoundaryRule>,System.Threading.CancellationToken)

Evaluates the async against the supplied policy and bounded observation state.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
rules
The claim-boundary rules. Implementations may use defaults when this collection is empty.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<ClaimBoundaryDecision> that completes with the result of IClaimBoundaryPolicy.EvaluateAsync: Evaluates the async against the supplied policy and bounded observation state. Fault and cancellation states are propagated without a successful partial result.

IClaimLifecyclePolicyUAIX.LmRuntime.Abstractions 1 member

Enforces explicit evidence, boundary, and human-authority transitions for runtime claims.

Method Evaluate(UAIX.LmRuntime.Governance.ClaimTransitionRequest)

Evaluates the supplied request against the supplied policy and bounded observation state.

request
The requested transition and its bounded evidence signals.

Returns: The ClaimTransitionDecision result produced by IClaimLifecyclePolicy.Evaluate for this contract: Evaluates the supplied request against the supplied policy and bounded observation state. It is published only after all documented validation and ownership transitions succeed.

IConstraintClosureAnalyzerUAIX.LmRuntime.Abstractions 1 member

Analyzes a bounded work-constraint registry for closed maintenance cycles and retirement candidates.

Method Analyze(UAIX.LmRuntime.Governance.ConstraintRegistrySnapshot)

Computes deterministic graph closure information for the supplied registry snapshot.

snapshot
The immutable state snapshot being validated, serialized, restored, or analyzed without retaining caller-owned mutable aliases.

Returns: The closure report, including validation diagnostics and retirement candidates.

IConstraintPolicyUAIX.LmRuntime.Abstractions 1 member

Evaluates request-side constraints before budgeted execution is allowed.

Method EvaluateAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.Governance.ConstraintRule>,System.Threading.CancellationToken)

Evaluates the request against configured constraint rules.

request
The InferenceRequest containing the complete caller-owned inputs for EvaluateAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
rules
The rules sequence used by this operation; its required length, ordering, and element bounds are validated before access.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<ConstraintDecision> that completes with the result of IConstraintPolicy.EvaluateAsync: Evaluates the request against configured constraint rules. Fault and cancellation states are propagated without a successful partial result.

IEvidenceLedgerUAIX.LmRuntime.Abstractions 1 member

Appends immutable governance evidence receipts.

Method AppendAsync(UAIX.LmRuntime.Governance.GovernanceDecisionReceipt,System.Threading.CancellationToken)

Appends the async to the current IEvidenceLedger state after validating capacity, ordering, and ownership constraints.

receipt
The immutable governance decision receipt appended to the caller-owned evidence ledger in the supplied order.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: A task that represents completion of the asynchronous operation.

IInferenceRuntimeUAIX.LmRuntime.Abstractions 2 members

Defines the public runtime orchestration API.

Method GenerateAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Threading.CancellationToken)

Generates the async through the deterministic execution path owned by IInferenceRuntime.

request
The InferenceRequest containing the complete caller-owned inputs for GenerateAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous Task<InferenceResponse> that completes with the result of IInferenceRuntime.GenerateAsync: Generates the async through the deterministic execution path owned by IInferenceRuntime. Fault and cancellation states are propagated without a successful partial result.

Method StreamAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Threading.CancellationToken)

Streams the async in observable sequence order while honoring caller cancellation.

request
The InferenceRequest containing the complete caller-owned inputs for StreamAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The normalized streaming events, enumerated in source order with caller cancellation and failure propagation governed by StreamAsync.

IInferenceSessionUAIX.LmRuntime.Abstractions 3 members

Defines a stateful inference session with explicit prefill and decode phases.

Property SessionId

Gets the session identifier.

Method PrefillAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Threading.CancellationToken)

Prefills the async into the current model state after validating token and cache bounds.

request
The InferenceRequest containing the complete caller-owned inputs for PrefillAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The number of prompt tokens accepted by the session.

Method DecodeNextAsync(System.Threading.CancellationToken)

Decodes the next token or text chunk from the session.

cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<StreamingDelta> that completes with the result of IInferenceSession.DecodeNextAsync: Decodes the next token or text chunk from the session. Fault and cancellation states are propagated without a successful partial result.

IMemoryFirewallUAIX.LmRuntime.Abstractions 1 member

Evaluates source-routed memory packets before they can enter an active retrieval tier.

Method Evaluate(UAIX.LmRuntime.Governance.MemoryFirewallRequest)

Evaluates packet provenance, integrity, freshness, entropy, trust, contradiction, evidence, and review state.

request
The MemoryFirewallRequest containing the complete caller-owned inputs for Evaluate; required fields are validated and mutable collections are snapshotted before state changes or large allocations.

Returns: The memory region containing the supplied request, bounded to the validated range owned by the result.

IModelAdapterUAIX.LmRuntime.Abstractions 4 members

Defines the execution boundary for provider-hosted or local models.

Property ProviderName

Gets the provider or backend name.

Method GetModelAsync(string,System.Threading.CancellationToken)

Gets model metadata for the requested model.

model
The model whose validated metadata, tensors, or runtime state are consumed by this operation.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<ModelDescriptor> that completes with the result of IModelAdapter.GetModelAsync: Gets model metadata for the requested model. Fault and cancellation states are propagated without a successful partial result.

Method GenerateAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Threading.CancellationToken)

Generates the async through the deterministic execution path owned by IModelAdapter.

request
The InferenceRequest containing the complete caller-owned inputs for GenerateAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous Task<InferenceResponse> that completes with the result of IModelAdapter.GenerateAsync: Generates the async through the deterministic execution path owned by IModelAdapter. Fault and cancellation states are propagated without a successful partial result.

Method StreamAsync(UAIX.LmRuntime.Contracts.InferenceRequest,System.Threading.CancellationToken)

Streams the async in observable sequence order while honoring caller cancellation.

request
The InferenceRequest containing the complete caller-owned inputs for StreamAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: The normalized streaming event sequence, enumerated in source order with caller cancellation and failure propagation governed by StreamAsync.

IQuarantineLedgerUAIX.LmRuntime.Abstractions 1 member

Writes append-only records for generated or unreviewed artifacts that require quarantine.

Method AppendAsync(UAIX.LmRuntime.Governance.QuarantineRecord,System.Threading.CancellationToken)

Appends the async to the current IQuarantineLedger state after validating capacity, ordering, and ownership constraints.

record
The immutable quarantine record appended to the caller-owned ledger without altering its evidence fields.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: A task that represents completion of the asynchronous operation.

IResourceEconomyEngineUAIX.LmRuntime.Abstractions 1 member

Advances the structural-control resource economy using explicit gain and cost inputs.

Method Evaluate(UAIX.LmRuntime.Governance.ResourceEconomyState,UAIX.LmRuntime.Governance.ResourceEconomyInput,UAIX.LmRuntime.Governance.ResourceEconomyPolicy)

Evaluates the supplied state against the supplied policy and bounded observation state.

state
The immutable state observed before the candidate action.
input
The gain and burden estimates for the candidate action.
policy
The policy that define validation limits and execution behavior; required values are checked before use.

Returns: The ResourceEconomyTransition result produced by IResourceEconomyEngine.Evaluate for this contract: Evaluates the supplied state against the supplied policy and bounded observation state. It is published only after all documented validation and ownership transitions succeed.

IReviewGatePolicyUAIX.LmRuntime.Abstractions 1 member

Evaluates slow-loop review gates before consequential runtime work proceeds.

Method EvaluateAsync(UAIX.LmRuntime.Governance.ReviewGateRequest,System.Threading.CancellationToken)

Evaluates whether a request can continue in the automatic lane.

request
The ReviewGateRequest containing the complete caller-owned inputs for EvaluateAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<ReviewGateDecision> that completes with the result of IReviewGatePolicy.EvaluateAsync: Evaluates whether a request can continue in the automatic lane. Fault and cancellation states are propagated without a successful partial result.

IRuntimeBudgetGovernorUAIX.LmRuntime.Abstractions 1 member

Evaluates whether runtime work can proceed under the configured resource economy.

Method EvaluateAsync(UAIX.LmRuntime.Governance.RuntimeBudgetRequest,UAIX.LmRuntime.Governance.RuntimeBudget,System.Threading.CancellationToken)

Evaluates the requested runtime work against a resource budget.

request
The RuntimeBudgetRequest containing the complete caller-owned inputs for EvaluateAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
budget
The immutable runtime budget whose quantitative limits constrain the evaluated operation or proposal.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<RuntimeBudgetDecision> that completes with the result of IRuntimeBudgetGovernor.EvaluateAsync: Evaluates the requested runtime work against a resource budget. Fault and cancellation states are propagated without a successful partial result.

IStructuralChangePlannerUAIX.LmRuntime.Abstractions 1 member

Evaluates proposed structural mutations through budget, no-op, evidence, and review gates.

Method PlanAsync(UAIX.LmRuntime.Governance.StructuralChangeRequest,UAIX.LmRuntime.Governance.RuntimeBudget,System.Threading.CancellationToken)

Plans the disposition for a proposed structural change.

request
The StructuralChangeRequest containing the complete caller-owned inputs for PlanAsync; required fields are validated and mutable collections are snapshotted before state changes or large allocations.
budget
The immutable runtime budget whose quantitative limits constrain the evaluated operation or proposal.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<StructuralChangeDecision> that completes with the result of IStructuralChangePlanner.PlanAsync: Plans the disposition for a proposed structural change. Fault and cancellation states are propagated without a successful partial result.

IStructuralOperatorEngineUAIX.LmRuntime.Abstractions 1 member

Converts resource and claim-boundary decisions into an auditable structural operator receipt.

Method DecideAsync(UAIX.LmRuntime.Governance.RuntimeBudgetDecision,UAIX.LmRuntime.Governance.ClaimBoundaryDecision,System.Threading.CancellationToken)

Selects the structural operator for the current decision context.

budgetDecision
The prior budget decision that must authorize the structural operation before any proposal can advance.
claimBoundaryDecision
The claim-boundary decision used to constrain the resulting action, or null when no claim evaluation applies.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<GovernanceDecisionReceipt> that completes with the result of IStructuralOperatorEngine.DecideAsync: Selects the structural operator for the current decision context. Fault and cancellation states are propagated without a successful partial result.

IStructuralPhaseDetectorUAIX.LmRuntime.Abstractions 1 member

Classifies the current structural regime from bounded fast-loop measurements.

Method Detect(UAIX.LmRuntime.Governance.StructuralObservationWindow,UAIX.LmRuntime.Governance.StructuralPhasePolicy)

Classifies one observation window using deterministic threshold precedence.

window
The immutable observation window whose ordered measurements are evaluated to determine the structural phase.
policy
The policy that define validation limits and execution behavior; required values are checked before use.

Returns: The StructuralPhaseAssessment result produced by IStructuralPhaseDetector.Detect for this contract: Classifies one observation window using deterministic threshold precedence. It is published only after all documented validation and ownership transitions succeed.

ITeleodynamicControlPlaneUAIX.LmRuntime.Abstractions 1 member

Coordinates the resource, proposal, closure, phase, claim, and trace components on a slow timescale.

Method Evaluate(UAIX.LmRuntime.Governance.TeleodynamicControlRequest)

Evaluates one complete control cycle without mutating model weights, tokenizer state, or generated output.

request
The TeleodynamicControlRequest containing the complete caller-owned inputs for Evaluate; required fields are validated and mutable collections are snapshotted before state changes or large allocations.

Returns: The phase, closure, structural decision, claim decision when present, and tamper-evident trace entry.

ITeleodynamicDecisionEngineUAIX.LmRuntime.Abstractions 1 member

Selects a bounded structural proposal or explicit no-op using resource and evidence constraints.

Method Decide(string,System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.Governance.StructuralProposal>,UAIX.LmRuntime.Governance.ResourceEconomyState,UAIX.LmRuntime.Governance.ResourceEconomyPolicy,UAIX.LmRuntime.Governance.TeleodynamicDecisionPolicy,System.DateTimeOffset)

Evaluates the proposal set and returns one deterministic, no-op-aware decision.

decisionId
The stable identifier assigned by the caller to the decision cycle.
proposals
The proposals sequence used by this operation; its required length, ordering, and element bounds are validated before access.
state
The validated state value consumed by the operation; mutations, when applicable, are limited to the explicitly documented state owner.
resourcePolicy
The immutable resource-economy policy that defines allowed transitions and quantitative thresholds for the decision.
decisionPolicy
The immutable decision policy that constrains proposal selection, no-op behavior, and review requirements.
createdUtc
The caller-supplied UTC timestamp recorded in the deterministic decision receipt; non-UTC offsets must be normalized by the caller.

Returns: The TeleodynamicDecision result produced by ITeleodynamicDecisionEngine.Decide for this contract: Evaluates the proposal set and returns one deterministic, no-op-aware decision. It is published only after all documented validation and ownership transitions succeed.

ITeleodynamicTraceChainUAIX.LmRuntime.Abstractions 3 members

Maintains an append-only, SHA-256-linked trace of bounded structural-control decisions.

Method Snapshot

Gets an immutable snapshot of the current trace entries.

Returns: The current trace entries in ascending sequence order.

Method Append(UAIX.LmRuntime.Governance.TeleodynamicTraceRequest)

Appends one canonical event to the trace.

request
The TeleodynamicTraceRequest containing the complete caller-owned inputs for Append; required fields are validated and mutable collections are snapshotted before state changes or large allocations.

Returns: The immutable trace entry assigned to the event.

Method Verify(System.Collections.Generic.IReadOnlyList<UAIX.LmRuntime.Governance.TeleodynamicTraceEntry>)

Verifies an arbitrary trace snapshot without mutating the current chain.

entries
The entries to verify in their supplied order.

Returns: The first detected chain error or a valid result.

ITokenizerUAIX.LmRuntime.Abstractions 6 members

Defines tokenization behavior for runtime token budgeting and model parity work.

Property Name

Gets the tokenizer name.

Method Tokenize(string)

Tokenizes the supplied text with the configured metadata and preserves deterministic token order.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: An ordered read-only collection of token text values produced by the configured tokenizer.

Method Encode(string,bool,bool)

Encodes text into token identifiers when the tokenizer has a vocabulary.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.
addBos
A value indicating whether add BOS applies to this operation.
addEos
A value indicating whether add EOS applies to this operation.

Returns: An ordered read-only collection of token identifiers produced by the configured tokenizer.

Method Decode(System.Collections.Generic.IEnumerable<int>)

Decodes token identifiers into text when the tokenizer has a vocabulary.

tokenIds
The ordered token identifiers to process; sequence order is preserved and each identifier is validated where required.

Returns: The decoded text produced from the validated token sequence in the original sequence order.

Method CountTokens(string)

Counts tokens in a single text value.

text
The text processed by the configured encoding or normalization rules; it must satisfy the declared nullability contract.

Returns: The int value computed by ITokenizer.CountTokens for this contract: Counts tokens in a single text value. Range, finite-value, and overflow checks are completed before the value is returned.

Method CountTokens(System.Collections.Generic.IEnumerable<UAIX.LmRuntime.Contracts.LlmMessage>)

Counts tokens across a set of model messages.

messages
The messages sequence used by this operation; its required length, ordering, and element bounds are validated before access.

Returns: The TokenCountResult result produced by ITokenizer.CountTokens for this contract: Counts tokens across a set of model messages. It is published only after all documented validation and ownership transitions succeed.

IUaiMemoryStoreUAIX.LmRuntime.Abstractions 2 members

Defines append and query behavior for .uai-backed runtime memory.

Method AppendAsync(UAIX.LmRuntime.Contracts.UaiMemoryEntry,System.Threading.CancellationToken)

Appends the async to the current IUaiMemoryStore state after validating capacity, ordering, and ownership constraints.

entry
The entry examined or transformed by this operation; it must satisfy the declared type and range constraints.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: A task that completes when the entry has been written.

Method ReadAsync(UAIX.LmRuntime.Contracts.MemoryQuery,System.Threading.CancellationToken)

Reads the async from the current IUaiMemoryStore state using the component's validated representation.

query
The bounded memory query defining the caller-authorized selection criteria and result ceiling.
cancellationToken
The caller-provided token used to cancel the operation before additional work or results are published.

Returns: An asynchronous ValueTask<IReadOnlyList<UaiMemoryEntry>> that completes with the result of IUaiMemoryStore.ReadAsync: Reads the async from the current IUaiMemoryStore state using the component's validated representation. Fault and cancellation states are propagated without a successful partial result.

RuntimePackageVersionUAIX.LmRuntime.Abstractions 2 members

Provides the runtime package version emitted by the centrally configured assembly metadata.

Package projects obtain their version from Directory.Build.props. Runtime components should use this type instead of repeating a version literal, which keeps evidence, session, CLI, and governance records aligned with the assembly that produced them.

Property Current

Gets the normalized three-part package version of the active runtime distribution.

Method Resolve(System.Reflection.Assembly)

Resolves a normalized three-part package version from an assembly identity.

assembly
The assembly whose centrally generated version metadata is authoritative.

Returns: A major.minor.build version string, or 0.0.0 when the assembly has no version metadata.

UAIX.LmRuntime.Acceleration

13 documented types and 80 documented members.

DiagnosticRuntimeBackendUAIX.LmRuntime.Acceleration 5 members

Provides shared diagnostic behavior for backend packages that declare compatibility before native execution is proven.

The class keeps package-visible backend declarations DRY. It performs no native loads, subprocess execution, provider calls, network access, or model downloads; concrete packages remain responsible only for naming their compatibility API, runtime identifiers, and package-specific diagnostics.

Property Id

Gets the stable backend identifier.

Property Kind

Gets the backend kind.

Property DisplayName

Gets the human-readable backend name.

Property Capabilities

Gets the backend capability declaration.

Method ProbeAsync

Probes a diagnostic backend and reports unavailable until native assets, runtime libraries, and devices are proven.

options
The caller-supplied probing and selection options.
cancellationToken
A token that cancels the probe.

Returns: The backend probe result.

IRuntimeBackendUAIX.LmRuntime.Acceleration 5 members

Defines a local runtime backend that can report capabilities and probe available devices.

Backends are package-visible compatibility surfaces. Implementations must not perform hidden downloads, provider calls, subprocess execution, or remote inference during probing or selection.

Property Id

Gets the stable backend identifier.

Property Kind

Gets the backend kind.

Property DisplayName

Gets the human-readable backend name.

Property Capabilities

Gets the backend capability declaration.

Method ProbeAsync

Probes the backend for local execution availability and devices.

options
The caller-supplied probing and selection options.
cancellationToken
A token that cancels the probe.

Returns: The backend probe result.

IRuntimeBackendRegistryUAIX.LmRuntime.Acceleration 3 members

Defines a registry of runtime backends available to a host application.

The registry is explicit and local. Merely installing a package does not grant hidden runtime authority unless the host registers the backend instance.

Method Register

Registers one runtime backend.

backend
The backend to register.
Method GetBackends

Gets the currently registered runtime backends.

Returns: The ordered registered backend list.

Method FindById

Finds a backend by identifier.

backendId
The backend identifier to find.

Returns: The matching backend, or when no backend is registered for the ID.

IRuntimeBackendSelectorUAIX.LmRuntime.Acceleration 1 member

Defines backend selection over a registered set of local runtime backends.

Selection returns explicit backend and device identity so CPU fallback is never confused with GPU execution.

Method SelectAsync

Selects a backend according to the supplied policy and options.

options
The backend selection options.
cancellationToken
A token that cancels selection.

Returns: The backend selection result.

RuntimeBackendCapabilitiesUAIX.LmRuntime.Acceleration 13 members

Reports the execution and compatibility capabilities declared by a runtime backend.

Capabilities are package-visible declarations. Availability is still decided by probing native assets, runtime libraries, drivers, and devices before a backend is selected for execution.

Property UsesGpuAcceleration

Gets or sets a value indicating whether the backend uses GPU acceleration when it executes successfully.

Property UsesNativeInference

Gets or sets a value indicating whether the backend uses native inference components.

Property SupportsCpuOnlyExecution

Gets or sets a value indicating whether the backend can execute without GPU hardware.

Property SupportsGpuExecution

Gets or sets a value indicating whether the backend can execute on a GPU device.

Property SupportsGguf

Gets or sets a value indicating whether the backend supports GGUF model inputs.

Property SupportsLlama

Gets or sets a value indicating whether the backend supports LLaMA-family model binding.

Property SupportsStreaming

Gets or sets a value indicating whether the backend supports streaming token output.

Property SupportsDeviceSelection

Gets or sets a value indicating whether callers can select among backend devices.

Property SupportsMemoryQuery

Gets or sets a value indicating whether the backend can query device or execution memory.

Property BackendApiName

Gets or sets the backend API name, such as CUDA, DirectML, Vulkan, ROCm, Metal, or managed CPU.

Property RuntimeIdentifiers

Gets runtime identifiers supported or observed by the backend package.

Property NativeAssetState

Gets or sets the backend-reported native asset state.

Property Diagnostics

Gets diagnostic messages describing compatibility, package state, and probe status.

RuntimeBackendKindUAIX.LmRuntime.Acceleration 8 members

Identifies the accelerator API family or execution class represented by a runtime backend.

The value describes the backend package family, not a claim that a device, driver, or native library is available on the current machine.

Field CpuManaged

The managed CPU reference backend.

Field Cuda

An NVIDIA CUDA backend.

Field DirectML

A Windows DirectML backend.

Field Vulkan

A Vulkan compute backend.

Field Rocm

An AMD ROCm backend.

Field Metal

An Apple Metal backend.

Field ExternalNative

A backend supplied by an external native package or host adapter.

Field Unknown

An unknown or not-yet-classified backend.

RuntimeBackendOptionsUAIX.LmRuntime.Acceleration 7 members

Provides caller-supplied backend selection and probing options.

These options never authorize hidden downloads, provider fallback, subprocess execution, or remote inference. They only describe how registered local backends should be probed and selected.

Property Policy

Gets or sets the backend selection policy.

Property PreferredBackendId

Gets or sets the preferred or required backend identifier for backend-id policies.

Property PreferredDeviceId

Gets or sets the preferred backend-local device identifier when device selection is supported.

Property AllowCpuFallback

Gets or sets a value indicating whether prefer policies may fall back to CPU execution.

Property RequireNativeAssets

Gets or sets a value indicating whether a selected backend must have native assets available.

Property RequestedRuntimeIdentifier

Gets or sets the runtime identifier requested by the caller, when different from the current process.

Property NativeAssetDirectory

Gets or sets an optional directory that a backend may inspect for native assets.

RuntimeBackendProbeResultUAIX.LmRuntime.Acceleration 6 members

Reports the result of probing one registered runtime backend.

A backend may declare capabilities but still probe unavailable when native assets, runtime libraries, drivers, or devices are absent.

Property BackendId

Gets or sets the backend identifier.

Property BackendKind

Gets or sets the backend kind.

Property IsAvailable

Gets or sets a value indicating whether the backend is available for execution.

Property Capabilities

Gets or sets the backend capabilities.

Property Devices

Gets discovered devices reported by the backend.

Property Diagnostics

Gets probe diagnostics.

RuntimeBackendRegistryUAIX.LmRuntime.Acceleration 3 members

Provides an in-memory explicit registry of local runtime backends.

Registration order is preserved so callers can make deterministic selection decisions.

Method Register

Registers one runtime backend.

backend
The backend to register.
Method GetBackends

Gets the currently registered runtime backends.

Returns: The ordered registered backend list.

Method FindById

Finds a backend by identifier.

backendId
The backend identifier to find.

Returns: The matching backend, or when no backend is registered for the ID.

RuntimeBackendSelectorUAIX.LmRuntime.Acceleration 2 members

Selects a local runtime backend from an explicit registry.

The selector probes registered backends and reports fallback identity instead of silently substituting one execution class for another.

Constructor Constructor

Initializes a new instance of the class.

registry
The backend registry to select from.
Method SelectAsync

Selects a backend according to the supplied policy and options.

options
The backend selection options.
cancellationToken
A token that cancels selection.

Returns: The backend selection result.

RuntimeDeviceDescriptorUAIX.LmRuntime.Acceleration 12 members

Describes one CPU or accelerator device discovered by a runtime backend probe.

Device descriptors are diagnostic evidence. They should identify the device that would execute work without exposing private file paths, credentials, prompt text, or generated text.

Property BackendId

Gets or sets the identifier of the backend that reported the device.

Property BackendKind

Gets or sets the backend kind that reported the device.

Property DeviceId

Gets or sets the stable backend-local device identifier.

Property DisplayName

Gets or sets the human-readable device name.

Property Vendor

Gets or sets the vendor or implementation owner reported by the backend.

Property BackendApiName

Gets or sets the acceleration API name associated with the device.

Property RuntimeIdentifier

Gets or sets the runtime identifier associated with the probed process.

Property IsCpu

Gets or sets a value indicating whether the descriptor represents CPU execution.

Property IsGpu

Gets or sets a value indicating whether the descriptor represents GPU execution.

Property ComputeCapability

Gets or sets the backend-reported compute capability, feature level, or API version.

Property MemoryBytes

Gets or sets the backend-reported device memory in bytes when known.

Property Diagnostics

Gets diagnostic messages associated with the device descriptor.

RuntimeSelectionPolicyUAIX.LmRuntime.Acceleration 6 members

Defines how the selector chooses among registered runtime backends.

Policies that prefer a backend may fall back when the options allow it. Policies that require a backend fail clearly when the required capability is unavailable.

Field RequireCpu

Select only a CPU-capable backend.

Field PreferCpu

Prefer a CPU-capable backend but allow a different available backend when CPU is absent.

Field PreferGpu

Prefer an available GPU backend and fall back to CPU only when fallback is permitted.

Field RequireGpu

Select only an available GPU backend.

Field PreferBackendId

Prefer a specific backend identifier and fall back only when fallback is permitted.

Field RequireBackendId

Select only a specific backend identifier.

RuntimeSelectionResultUAIX.LmRuntime.Acceleration 9 members

Reports the selected backend, selected device, and any fallback used by a selection policy.

Selection results make CPU fallback explicit so callers never mistake fallback execution for GPU execution.

Property Succeeded

Gets or sets a value indicating whether selection succeeded.

Property Policy

Gets or sets the policy used for selection.

Property SelectedBackendId

Gets or sets the selected backend identifier.

Property SelectedBackendKind

Gets or sets the selected backend kind.

Property SelectedDevice

Gets or sets the selected device descriptor.

Property SelectedCapabilities

Gets or sets the selected backend capabilities.

Property UsedCpuFallback

Gets or sets a value indicating whether a GPU-prefer policy fell back to CPU execution.

Property FailureReason

Gets or sets the selection failure reason when selection did not succeed.

Property Diagnostics

Gets selection diagnostics.

UAIX.LmRuntime.Backends.CpuManaged

2 documented types and 8 documented members.

CpuManagedRuntimeBackendUAIX.LmRuntime.Backends.CpuManaged 7 members

Reports the UAIX managed CPU backend to the acceleration registry.

This backend represents compatibility required for systems without GPUs. It does not require GPU hardware, GPU drivers, native runtime libraries, provider APIs, subprocess execution, or model downloads.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

Property Id

Gets the stable backend identifier.

Property Kind

Gets the backend kind.

Property DisplayName

Gets the human-readable backend name.

Property Capabilities

Gets the backend capability declaration.

Method ProbeAsync

Probes the managed .NET backend for local execution availability.

options
The caller-supplied probing and selection options.
cancellationToken
A token that cancels the probe.

Returns: The available CPU probe result.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.CpuManaged 1 member

Provides registration helpers for the managed CPU backend.

Hosts can call this helper to make CPU fallback explicit in the acceleration registry.

Method AddUaixCpuManagedBackend

Adds the UAIX managed CPU backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

UAIX.LmRuntime.Backends.Cuda

2 documented types and 3 documented members.

CudaRuntimeBackendUAIX.LmRuntime.Backends.Cuda 2 members

Reports CUDA backend compatibility and diagnostics to the acceleration registry.

This package does not contain hidden CUDA inference binaries. Native assets and device proof must come from a declared native package or host adapter before CUDA execution can be selected.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.Cuda 1 member

Provides registration helpers for the CUDA backend.

Registration only exposes diagnostics and selection metadata. It does not authorize hidden native execution.

Method AddUaixCudaBackend

Adds the UAIX CUDA backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

UAIX.LmRuntime.Backends.DirectML

2 documented types and 3 documented members.

DirectMlRuntimeBackendUAIX.LmRuntime.Backends.DirectML 2 members

Reports DirectML backend compatibility and diagnostics to the acceleration registry.

DirectML support is declared as a package-visible compatibility path for Windows hosts. This package performs no hidden native inference and reports unavailable until a host supplies native runtime and device evidence.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.DirectML 1 member

Provides registration helpers for the DirectML backend.

Registration only exposes diagnostics and selection metadata. It does not authorize hidden native execution.

Method AddUaixDirectMlBackend

Adds the UAIX DirectML backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

UAIX.LmRuntime.Backends.Vulkan

2 documented types and 3 documented members.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.Vulkan 1 member

Provides registration helpers for the Vulkan backend.

Registration only exposes diagnostics and selection metadata. It does not authorize hidden native execution.

Method AddUaixVulkanBackend

Adds the UAIX Vulkan backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

VulkanRuntimeBackendUAIX.LmRuntime.Backends.Vulkan 2 members

Reports Vulkan backend compatibility and diagnostics to the acceleration registry.

Vulkan support is declared for vendor-diverse hardware. This package does not contain hidden Vulkan inference binaries and reports unavailable until runtime and device evidence exists.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

UAIX.LmRuntime.Backends.Rocm

2 documented types and 3 documented members.

RocmRuntimeBackendUAIX.LmRuntime.Backends.Rocm 2 members

Reports ROCm backend compatibility and diagnostics to the acceleration registry.

ROCm support is declared for AMD GPU systems. This package does not contain hidden ROCm inference binaries and reports unavailable until runtime and device evidence exists.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.Rocm 1 member

Provides registration helpers for the ROCm backend.

Registration only exposes diagnostics and selection metadata. It does not authorize hidden native execution.

Method AddUaixRocmBackend

Adds the UAIX ROCm backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

UAIX.LmRuntime.Backends.Metal

2 documented types and 3 documented members.

MetalRuntimeBackendUAIX.LmRuntime.Backends.Metal 2 members

Reports Metal backend compatibility and diagnostics to the acceleration registry.

Metal support is declared for Apple GPU systems. This package does not contain hidden Metal inference binaries and reports unavailable until runtime and device evidence exists.

Field BackendId

The stable backend identifier used by selection policies.

Constructor Constructor

Initializes a new instance of the class.

RuntimeBackendRegistryExtensionsUAIX.LmRuntime.Backends.Metal 1 member

Provides registration helpers for the Metal backend.

Registration only exposes diagnostics and selection metadata. It does not authorize hidden native execution.

Method AddUaixMetalBackend

Adds the UAIX Metal backend to a registry.

registry
The registry to update.

Returns: The same registry instance for fluent configuration.

UAIX.LmRuntime.Backends.Cuda.Native.win-x64

0 documented types and 0 documented members.

This RID-specific deployment package has no managed public types.

UAIX.LmRuntime.Backends.Cuda.LegacyK80.win-x64

0 documented types and 0 documented members.

This RID-specific deployment package has no managed public types.