

A cross-platform SDK to run and ship LLMs, multimodal, ASR and TTS models on mobile, PC, automotive and IoT with NPU/GPU/CPU acceleration.

NexaSDK for Mobile is a developer toolkit for running and deploying large language models, multimodal models, automatic speech recognition (ASR), and text-to-speech (TTS) directly on-device on iOS and Android, as well as on broader device classes. Its runtimes and tooling are powered by the NexaML engine and run workloads on NPUs (including the Apple Neural Engine), GPUs, and CPUs for low-latency, private inference suited to production use. The SDK includes Android bindings, CLI and Python integrations, and Linux support, along with model conversion and optimization utilities and runtime acceleration that reduce resource use and latency while keeping data on-device and lowering cloud costs.



