Google: Gemma 3n E4B

Gemma-3n-E4B

Gemma 3n E4B-it is a highly efficient AI model optimized for mobile and low-resource devices, including phones, laptops, and tablets. It supports multimodal inputs—text, images, and audio—enabling a wide range of tasks such as text generation, speech recognition, translation, and image analysis. Powered by innovations like Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n intelligently manages memory and computation by selectively activating parameters, greatly reducing runtime resource demands.

Trained across 140+ languages and equipped with a flexible 32K token context window, the model adapts its parameter usage based on the task or device, ensuring both efficiency and versatility. This makes Gemma 3n ideal for privacy-focused, offline-capable applications and on-device AI solutions.

Conversations

Download TXT
Download PDF
CreatorGoogle
Release DateJune, 2025
LicenseGemma License
Context Window32,000
Image Input SupportYes
Open Source (Weights)Yes
Parameters8.4B, 4.0B active at inference time
Model WeightsClick here