Phase 108: AI Inference and Serving

Part of the AI Encyclopedia · Phase 108 of 130 · Topics 2141–2160

This phase covers AI Inference and Serving. The 20 concepts grouped under it are listed below; each is a future article in the Insightful AI World encyclopedia.

2141 Model Serving

2142 Online Inference

2143 Batch Inference

2144 Streaming Inference

2145 Inference Latency

2146 Throughput

2147 Batching

2148 Dynamic Batching

2149 Caching

2150 Quantized Inference

2151 Speculative Decoding

2152 KV Cache

2153 Model Routing

2154 Load Balancing

2155 Autoscaling

2156 Edge Inference

2157 On-device AI

2158 Serverless AI

2159 Inference Cost Optimization

2160 Inference Reliability
