Bot protection reference

Arcjet bot detection allows you to manage traffic from automated clients and bots.

Plan availability

Arcjet bot detection functionality depends on your pricing plan.

Plan	Bot protection
Free	Basic - user agent + IP type analysis
Starter / Business	Advanced - IP reputation, verification, ML, and other signals
Enterprise	Custom

Configuration

Bot detection is configured by allowing or denying a subset of bots. The allow and deny lists are mutually exclusive. With allow, any detected bot that is not on the allow list results in a DENY decision. With deny, any detected bot that is not on the deny list results in an ALLOW decision.

You can use only one of the following configuration definitions:

type BotOptionsAllow = {
  mode?: "LIVE" | "DRY_RUN";
  allow: Array<ArcjetWellKnownBot | ArcjetBotCategory>;
};

type BotOptionsDeny = {
  mode?: "LIVE" | "DRY_RUN";
  deny: Array<ArcjetWellKnownBot | ArcjetBotCategory>;
};

The arcjet client is configured with one or more detectBot rules, each of which takes one or more BotOptions.
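The allow and deny semantics described above can be modeled in a few lines. The sketch below is a simplified illustration, not Arcjet's implementation: decideForBots and the local BotOptions types are hypothetical, bot identifiers are treated as plain strings, the identifiers in the usage note are illustrative, and category expansion is ignored.

```typescript
// Simplified model of allow/deny list semantics.
// Hypothetical helper for illustration; not part of the Arcjet SDK.
type Mode = "LIVE" | "DRY_RUN";
type BotOptionsAllow = { mode?: Mode; allow: string[] };
type BotOptionsDeny = { mode?: Mode; deny: string[] };
type BotOptions = BotOptionsAllow | BotOptionsDeny;

function decideForBots(
  options: BotOptions,
  detected: string[],
): "ALLOW" | "DENY" {
  if ("allow" in options) {
    // Allow list: any detected bot that is not listed is denied.
    return detected.every((bot) => options.allow.includes(bot))
      ? "ALLOW"
      : "DENY";
  }
  // Deny list: only listed bots are denied; everything else is allowed.
  return detected.some((bot) => options.deny.includes(bot)) ? "DENY" : "ALLOW";
}
```

In this model, decideForBots({ allow: ["CURL"] }, ["OTHER_BOT"]) yields "DENY", while decideForBots({ deny: ["CURL"] }, ["OTHER_BOT"]) yields "ALLOW". A request with no detected bots is allowed in both configurations.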

Allowing specific bots

Most applications want to block almost all bots. However, it is common to allow some bots to access your system, such as bots for search indexing or API access from the command line.

When allowing specific bots we recommend that you also check the verification status after an allow decision is returned to ensure that the bots are who they say they are.

This behavior is configured with an allow list from our full list of bots and/or bot categories.

Denying specific bots

Some applications may only want to block a small subset of bots, while allowing the majority continued access. This may be due to many reasons, such as misconfigured or high-traffic bots.

This behavior is configured with a deny list from our full list of bots and/or bot categories.

Decision

The quick start example will deny requests that match the bot detection rules, immediately returning a response to the client.

Arcjet also provides a single protect function that is used to execute your protection rules. This requires a request argument which is the request context as passed to the request handler.

This function returns a Promise that resolves to an ArcjetDecision object. This contains the following properties:

id (string) - The unique ID for the request. This can be used to look up the request in the Arcjet dashboard. It is prefixed with req_ for decisions involving the Arcjet cloud API. For decisions taken locally, the prefix is lreq_.
conclusion (ArcjetConclusion) - The final conclusion based on evaluating each of the configured rules. If you wish to accept Arcjet’s recommended action based on the configured rules then you can use this property.
reason (ArcjetReason) - An object containing more detailed information about the conclusion.
results (ArcjetRuleResult[]) - An array of ArcjetRuleResult objects containing the results of each rule that was executed.
ip (ArcjetIpDetails) - An object containing Arcjet’s analysis of the client IP address. See the SDK reference for more information.

To check whether a deny conclusion was returned by a bot protection rule, use decision.isDenied() together with decision.reason.isBot().
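As a minimal sketch of that check, the helper below maps a decision to a response status and message. DecisionLike and ReasonLike are local stand-ins that include only the two methods named above, and responseFor is a hypothetical helper rather than an SDK function.

```typescript
// Local stand-ins for the parts of ArcjetDecision used here.
// These shapes are assumptions for illustration only.
interface ReasonLike {
  isBot(): boolean;
}
interface DecisionLike {
  isDenied(): boolean;
  reason: ReasonLike;
}

// Hypothetical helper: choose a response based on the decision.
function responseFor(decision: DecisionLike): {
  status: number;
  message: string;
} {
  if (decision.isDenied()) {
    // Distinguish a bot denial from a denial by any other rule.
    return decision.reason.isBot()
      ? { status: 403, message: "No bots allowed" }
      : { status: 403, message: "Forbidden" };
  }
  return { status: 200, message: "OK" };
}
```

In a real handler, the object returned from aj.protect(req) would be passed in place of the stand-in.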

You can iterate through the results and check whether a bot protection rule was applied:

for (const result of decision.results) {
  console.log("Rule Result", result);
}

Identified bots

The decision also contains all of the identified bots and matched categories detected from the request. A request may be identified as zero, one, or more bots or categories, all of which will be available on the decision.allowed and decision.denied properties.

Error handling

Arcjet is designed to fail open so that a service issue or misconfiguration does not block all requests. The SDK will also time out and fail open after 1000ms when NODE_ENV or ARCJET_ENV is development and 500ms otherwise. However, in most cases, the response time will be less than 20-30ms.

If there is an error condition when processing the rule, Arcjet will return an ERROR result for that rule and you can check the message property on the rule’s error result for more information.

If all other rules that were run returned an ALLOW result, then the final Arcjet conclusion will be ERROR.
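The way rule results combine into a conclusion, and what failing open means for the handler, can be sketched as follows. This is a simplified model under stated assumptions: concludeFrom and shouldBlock are hypothetical helpers, and the precedence of DENY over ERROR is inferred from the wording above rather than taken from the SDK.

```typescript
type RuleState = "ALLOW" | "DENY" | "ERROR";

// Hypothetical model of combining per-rule results into one conclusion.
function concludeFrom(states: RuleState[]): RuleState {
  // Assumption: a DENY from any rule takes precedence over errors.
  if (states.includes("DENY")) return "DENY";
  // An error with every other rule allowing yields an ERROR conclusion.
  if (states.includes("ERROR")) return "ERROR";
  return "ALLOW";
}

// Failing open: only an explicit DENY blocks the request.
function shouldBlock(conclusion: RuleState): boolean {
  return conclusion === "DENY";
}
```

With this model, an ERROR conclusion lets the request through. A very sensitive route could instead choose to fail closed by also blocking on ERROR.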

Filtering categories

All categories are also provided as enumerations, which allows for programmatic access. For example, you may want to allow most of CATEGORY:GOOGLE except their “advertising quality” bot.
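A minimal sketch of that kind of filtering is shown below. The botCategories object here is a local stand-in for the enumerations the SDK provides, and the member names in it (including GOOGLE_ADSBOT) are illustrative assumptions, not an authoritative list.

```typescript
// Local stand-in for the SDK's category enumerations.
// Member names are assumptions chosen for illustration.
const botCategories = {
  "CATEGORY:GOOGLE": ["GOOGLE_CRAWLER", "GOOGLE_ADSBOT", "GOOGLE_FEEDFETCHER"],
};

// Keep every bot in the category except the one to exclude.
const allowedGoogleBots = botCategories["CATEGORY:GOOGLE"].filter(
  (bot) => bot !== "GOOGLE_ADSBOT",
);

// The filtered list can then be used as an allow list.
const googleBotOptions = { mode: "LIVE" as const, allow: allowedGoogleBots };
```

Because the allow list is built from the enumeration rather than written by hand, it picks up any members added to the category while still excluding the one bot.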

Bot verification

Requests analyzed by Arcjet on Starter or Business plans include automatic bot verification. For allow rules, Arcjet verifies the authenticity of detected bots by checking IP data and performing reverse DNS lookups.

This helps protect against spoofed bots where clients pretend to be someone else.

Example: Allowing verified bots

Well-behaved bots, such as search engine indexers, are often desirable traffic. The companies that operate these bots make them verifiable, so application developers can choose to ignore additional signals about the request once a bot is verified.

For example, when a request claims to be GoogleBot, Arcjet will check if the IP truly belongs to Google. You can check the verification status in your code and take actions based on the results, such as allowing all verified bots.

import { isVerifiedBot } from "@arcjet/inspect";

// ...
const aj = arcjet({
  // ...
  rules: [
    detectBot({
      mode: "LIVE",
      allow: ["CATEGORY:SEARCH_ENGINE"],
    }),
  ],
});

// ...
const decision = await aj.protect(req);
// ...

// Ignore other signals for verified search engine bots
if (decision.results.some(isVerifiedBot)) {
  return new Response("Hello Bot!");
}

// Leverage all Arcjet signals
if (decision.isDenied()) {
  return new Response(null, { status: 403 });
}

Check for spoofed bots

The following loop checks each rule result for a spoofed bot. You would usually return a 403 or similar response to block the request.

for (const { reason } of decision.results) {
  if (reason.isBot() && reason.isSpoofed()) {
    console.log("Detected spoofed bot", reason.spoofed);
    // Return a 403 or similar response
  }
}

Check bot verification

The following loop checks each rule result for a verified bot.

for (const { reason } of decision.results) {
  if (reason.isBot() && reason.isVerified()) {
    console.log("Verified bot", reason.verified);
    // Allow the request
  }
}

Testing

Arcjet runs the same in any environment, including locally and in CI. Set mode to DRY_RUN to log the results of rule execution without blocking any requests.

We have an example test framework you can use to automatically test your rules. Arcjet can also be triggered using a sample of your traffic.

See the Testing section of the docs for details.
