// Chapter 04 · Protocol
Request Routing
The four signals, in priority order.
A deterministic function of four signals
// 4.0 · same inputs, same outcome, every time
// Key claim
The order is the policy.
Scheduling in ParalleliX is not a heuristic. Four signals are applied in fixed priority order; each stage narrows the candidate pool before the next stage runs. Capability runs first because it eliminates the largest cohort with the cheapest test. Reputation runs last because it is a tie-breaker, not a gate.
// ParalleliX AI · inbound requests
- req-4a1
- req-4a2
- req-4a3
- req-4a4
- req-4a5
- req-4a6
// node pool · each node serves one whole request
- node-04req-4a1
- node-09req-4a2
- node-15req-4a3
- node-22req-4a4
- node-31req-4a5
- node-07req-4a6
A single inference is served whole by one node, never split across machines. Throughput scales with the number of online nodes.
- // 01Binary
Capability match
Available now - // 02Priority
Capacity-aware queue
Available now - // 03Fit
Best-fit allocation
Available now - // 04Weighted
Reputation weighting
Planned
The four signals
// 4.1 · 4.2 · 4.3 · 4.4 · in priority order
// Signal register · coordinator schedulerlive now
- // 01
Capability match
· Binary· Available nowReads each node's declared workload classes and eliminates every node that did not register the unit of work's class. At launch every capable node is eligible; class-specific optimisation is planned.
- // 02
Capacity-aware queue
· Priority· Available nowReads each node's active work counter against its declared limit. Higher-complexity work jumps lower-complexity work; no node is over-subscribed beyond its declared capacity.
- // 03
Best-fit allocation
· Fit· Available nowAmong capable, available nodes, picks the smallest one that fits the unit of work. Larger nodes stay free for larger work, and capacity fragmentation drops.
- // 04
Reputation weighting
· Weighted· PlannedReads rolling uptime, validated-vs-dispatched completion rate, and 90-day validation success. At launch the multiplier is constant 1.0; once the reputation layer ships, it breaks ties at equal capability and capacity.
Worked example
// 4.5 · ai.inference · tier 2 · expedited · n=247
A single inference request, dispatched whole to one node. Watch the candidate pool shrink at each signal.
// Request specification
- workload_class
- ai.inference
- hardware_tier
- 2 (mid-GPU)
- priority
- expedited
- resource_req
- 8 GB VRAM · 2 CPU cores
- node_pool_n
- 247
// Candidate funnel · 247 → 1expedited · tier 2
- // 01Capability247 nodes declared ai.inference247 nodes
- // 02Capacity89 with available capacity for an expedited slot89 nodes−158
- // 03Best-fit73 with smallest-adequate resource_req73 nodes−16
- // 04Reputation10 top by uptime + validation success10 nodes−63
- // 05AssignedTop-ranked candidate wins the request.nd_8e91
Note·Routing in practice at launch
Signal inputs and behaviour over time
// 4.6 · per-signal schema
What each signal reads, what it outputs, and what changes as planned work lands.
// Signal arity · 4 axes × 4 columns
- 4.1 Capabilitydeclared workload classesall capable = eligibleunchangedclass-specific optimisation
- 4.2 Capacityactive counter per nodepriority queue by complexityunchangedunchanged
- 4.3 Best-fitresource_req vs availablesmallest adequateunchangedunchanged
- 4.4 Reputationuptime / completion / validationconstant 1.0rolling multiplierunchanged
// Where to go next · reading path