Phase 108: AI Inference and Serving
Phase 108 of the AI Encyclopedia — AI Inference and Serving. Topics 2141–2160.
This phase covers AI Inference and Serving. Below are the 20 concepts grouped under this phase — each is a future article in the Insightful AI World encyclopedia.
2141 Model Serving
2142 Online Inference
2143 Batch Inference
2144 Streaming Inference
2145 Inference Latency
2146 Throughput
2147 Batching
2148 Dynamic Batching
2149 Caching
2150 Quantized Inference
2151 Speculative Decoding
2152 KV Cache
2153 Model Routing
2154 Load Balancing
2155 Autoscaling
2156 Edge Inference
2157 On-device AI
2158 Serverless AI
2159 Inference Cost Optimization
2160 Inference Reliability