Coral SDK

Support running LLM and building AI Agents locally and efficiently on edge devices

NEXA AI's Coral SDK

Blazing Inference Speed

20 token/s prefix and decoding speed for Phi-3 (data collected on SAMSUNG S23)

Multi-Processor Support

CPU, GPU, and hybrid CPU + GPU inference

Multi-Compression Options

1.5-bit, 2-bit, 4-bit and 8-bit integer quantization

Multi-Platform Availability

Android, iOS, MacOS and Windows Operating Systems

Explore our collection of 200+ Premium Webflow Templates