BazaarLinkBazaarLink
登入

Gemma 3n E4b It

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks such as text generation, speech recognition, translation, and image analysis. Leveraging innovations like Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n dynamically manages memory usage and computational load by selectively activating model parameters, significantly reducing runtime resource requirements. This model supports a wide linguistic range (trained in over 140 languages) and features a flexible 32K token context window. Gemma 3n can selectively load parameters, optimizing memory and computational efficiency based on the task or device capabilities, making it well-suited for privacy-focused, offline-capable applications and on-device AI solutions. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)

定價資訊

輸入價格
$0.0200
/ 百萬 tokens
輸出價格
$0.0400
/ 百萬 tokens
上下文視窗
33K
tokens
供應商Google
模態text->text
發布日期2025年5月
Model IDgoogle/gemma-3n-e4b-it

快速開始

只需更改 base_url 即可透過 BazaarLink API 使用 Gemma 3n E4b It:

Python
from openai import OpenAI

client = OpenAI(
    base_url="https://bazaarlink.ai/api/v1",
    api_key="sk-bl-YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemma-3n-e4b-it",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
TypeScript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://bazaarlink.ai/api/v1",
  apiKey: "sk-bl-YOUR_API_KEY",
});

const response = await client.chat.completions.create({
  model: "google/gemma-3n-e4b-it",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(response.choices[0].message.content);

為什麼透過 BazaarLink 使用 Gemma 3n E4b It?

  • 美元計費(台幣報價)+ 統一發票 — 台灣團隊無需海外刷卡
  • OpenAI 相容 API — 無需改寫程式碼
  • 自動故障轉移 — 同模型多供應商備援
  • 中文客服支援 — 在地團隊即時協助
立即試用← 查看所有模型

Frequently Asked Questions

What is Gemma 3n E4b It?

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks such as text generation, speech recognition, translation, and image analysis. Leveraging innovations like Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n dynamically manages memory usage and computational load by selectively activating model parameters, significantly reducing runtime resource requirements. This model supports a wide linguistic range (trained in over 140 languages) and features a flexible 32K token context window. Gemma 3n can selectively load parameters, optimizing memory and computational efficiency based on the task or device capabilities, making it well-suited for privacy-focused, offline-capable applications and on-device AI solutions. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)

How much does the Gemma 3n E4b It API cost?

Gemma 3n E4b It costs $0.0200 per 1K input tokens and $0.0400 per 1K output tokens when accessed through BazaarLink.

How do I use Gemma 3n E4b It with the OpenAI SDK?

Set base_url to "https://bazaarlink.ai/api/v1" and use model ID "google/gemma-3n-e4b-it". All OpenAI SDK methods (chat.completions, embeddings, streaming) work without code changes.

What is the context window for Gemma 3n E4b It?

Gemma 3n E4b It supports a context window of 32,768 tokens.

Is Gemma 3n E4b It available for free?

Gemma 3n E4b It is a paid model. BazaarLink offers free trial credits on registration so you can test it without a credit card.

Support
Support
Hi! How can we help you?
Send a message and we'll get back to you soon.