Edge AI
Edge AI runs artificial intelligence models directly on local devices or edge servers rather than in the cloud, enabling real-time processing, data privacy, and operation without internet connectivity.
What is Edge AI?
Why Edge AI Matters for Business
Related Terms
Explore further
FAQ
Frequently asked questions
Modern smartphones, tablets, IoT devices with NPUs, edge servers, Raspberry Pi-class devices, NVIDIA Jetson modules, industrial PLCs with AI capabilities, and many other devices. The model must be optimised for the specific device's compute and memory constraints.
Small language models (1-7 billion parameters) can run on capable edge devices with quantisation. Larger models require cloud or server deployment. Research in model compression is steadily expanding what is possible at the edge.
Over-the-air (OTA) model updates push new model versions to edge devices. This requires update infrastructure, version management, and rollback capabilities. Updates should be tested thoroughly before deployment to avoid degrading performance on devices in the field.
Need help implementing this?
Our team can help you apply these concepts to your business. Book a free strategy call.