Skip to content
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 9 additions & 1 deletion explore-analyze/elastic-inference/eis.md
Original file line number Diff line number Diff line change
Expand Up @@ -75,6 +75,14 @@
1. Navigate to [**Billing and subscriptions > Usage**](https://cloud.elastic.co/billing/usage) in the {{ecloud}} Console
2. Look for line items where the **Billing dimension** is set to "Inference"

### Fair usage during free trial

Accounts in the free trial period are subject to token limits that are considered "fair usage". Access to some models may be paused temporarily if this limit is exceeded.

Check notice on line 80 in explore-analyze/elastic-inference/eis.md

View workflow job for this annotation

GitHub Actions / vale

Elastic.WordChoice: Consider using 'can, might' instead of 'may', unless the term is in the UI.

Fair usage limits while account is in free trial:
- **Elastic Managed LLM:** 100 million input tokens in 24h or 5 million output tokens in 24h
- **ELSER**: 1 billion tokens in 24h

Check notice on line 84 in explore-analyze/elastic-inference/eis.md

View workflow job for this annotation

GitHub Actions / vale

Elastic.Acronyms: 'ELSER' has no definition.

## Rate limits

The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
Expand All @@ -88,7 +96,7 @@

We limit on both requests per minute and tokens per minute (whichever limit is reached first).

#### Ingest
#### Ingest

- 6,000 request per minute
- 6,000,000 tokens per minute
Expand Down
Loading