
Protect your OpenAI application with Arcjet

If you are building an AI application using OpenAI, you will want to protect it from abuse. Arcjet rate limiting and bot protection can help you manage your OpenAI token budget.

What is Arcjet? Arcjet helps developers protect their apps in just a few lines of code. Bot detection. Rate limiting. Email validation. Attack protection. Data redaction. A developer-first approach to security.

Example use case

  • You have a chat interface that uses OpenAI to generate responses.
  • You want to prevent automated bots from accessing your application.
  • You want to implement a rate limit for each user logged in to your application.
  • The rate limit should be based on OpenAI tokens, since that is how you are billed for your usage of the OpenAI API.

How it works

  • Arcjet rate limits support custom characteristics to identify the client and apply the limit. We provide the user ID to identify the logged-in user. This works with any authentication system you have in place, such as Clerk (a sketch follows this list).
  • Define a rate limit of 2,000 tokens per hour with a maximum of 5,000 tokens in the bucket. This allows for a reasonable conversation length without consuming too many tokens (see the worked example after this list).
  • Also apply a bot rule to block clients we are sure are automated.
  • Use the openai-chat-tokens package to count the number of tokens in each chat API request.
  • Pass the token estimate to the Arcjet protect call to deduct the tokens from the user’s rate limit.
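
For example, a user's bucket starts full with 5,000 tokens. A request estimated at 1,200 tokens drains it to 3,800. After an hour the bucket refills by 2,000 tokens, capped at the 5,000 capacity. A request that asks for more tokens than the bucket currently holds is denied until the bucket refills.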

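The full example below hard codes the user ID, but in a real application you would look it up from the session. Here is a minimal sketch of that step, assuming you use Clerk with the Next.js App Router (Clerk's auth() helper from @clerk/nextjs/server; adapt it to whatever auth system you use):

// A minimal sketch, assuming Clerk is installed and configured
import { auth } from "@clerk/nextjs/server";

export async function POST(req: Request) {
  // auth() is async in recent versions of Clerk
  const { userId } = await auth();
  if (!userId) {
    return new Response("Unauthorized", { status: 401 });
  }
  // ...then pass userId to aj.protect as in the full example below
}
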
The example below shows the API route for a Next.js application with a gpt-4-turbo AI chatbot. See the full example Next.js implementation on GitHub.

/app/api/chat/route.ts
// Adapted from https://sdk.vercel.ai/docs/getting-started/nextjs-app-router
import { openai } from "@ai-sdk/openai";
import arcjet, { detectBot, shield, tokenBucket } from "@arcjet/next";
import { streamText } from "ai";
import { promptTokensEstimate } from "openai-chat-tokens";

const aj = arcjet({
  // Get your site key from https://app.arcjet.com
  // and set it as an environment variable rather than hard coding.
  // See: https://nextjs.org/docs/app/building-your-application/configuring/environment-variables
  key: process.env.ARCJET_KEY!,
  characteristics: ["userId"], // track requests by user ID
  rules: [
    shield({
      mode: "LIVE", // will block requests. Use "DRY_RUN" to log only
    }),
    detectBot({
      mode: "LIVE", // will block requests. Use "DRY_RUN" to log only
      allow: [], // block all detected bots
    }),
    tokenBucket({
      mode: "LIVE", // will block requests. Use "DRY_RUN" to log only
      refillRate: 2_000, // refill the bucket with 2,000 tokens
      interval: "1h", // every hour
      capacity: 5_000, // up to a maximum of 5,000 tokens
    }),
  ],
});
// Allow streaming responses up to 30 seconds
export const maxDuration = 30;
// Edge runtime allows for streaming responses
export const runtime = "edge";

export async function POST(req: Request) {
  // The userId is hard coded for the example, but this is where you would do a
  // session lookup and get the user ID.
  const userId = "totoro";

  const { messages } = await req.json();

  // Estimate the number of tokens required to process the request
  const estimate = promptTokensEstimate({
    messages,
  });
  console.log("Token estimate", estimate);

  // Withdraw the estimated tokens from the user's token bucket
  const decision = await aj.protect(req, { requested: estimate, userId });
  console.log("Arcjet decision", decision.conclusion);
  if (decision.reason.isRateLimit()) {
    console.log("Tokens remaining", decision.reason.remaining);
  }

  // If the request is denied, return an error status
  if (decision.isDenied()) {
    if (decision.reason.isRateLimit()) {
      return new Response("Too Many Requests", {
        status: 429,
      });
    } else {
      return new Response("Forbidden", {
        status: 403,
      });
    }
  }

  // If the request is allowed, continue to use OpenAI
  const result = await streamText({
    model: openai("gpt-4-turbo"),
    messages,
  });
  return result.toDataStreamResponse();
}
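
To see the limit in action, you can call the route repeatedly until the bucket is empty. A quick test, assuming the dev server runs on localhost:3000 (run it as an ES module script so top-level await works):

// Hypothetical quick test against the local dev server
const res = await fetch("http://localhost:3000/api/chat", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ messages: [{ role: "user", content: "Hello!" }] }),
});
// 200 while tokens remain; 429 once the bucket is drained
console.log(res.status);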

Discussion