Creating API Endpoint for Webhooks

Do you want to connect Bloomreach Engagement to your own RESTful system? Due to some Engagement-specific behavior of campaigns and Bloomreach Engagement's webhooks, we have created a guide that will take you through the steps necessary to build an API to use with Bloomreach Engagement “Per customer” webhook.

This Guide Will Help You Understand:

  • What response codes of your API will mean to Bloomreach Engagement
  • What performance is required from your API
  • What security elements can be used to protect your data in transit

Instructions

Below you will find a short list of all the requirements. They are also explained in more detail in each respective section.

Your API requirements

  • Provides HTTPS endpoint with SSL certificate signed by one of Trusted CAs (trusted CA list)
  • Handles concurrent requests, this can be set per webhook, read more
  • Responds in less than 1000 ms
  • HTTP 200 for success or processing failure
  • HTTP 400 for errors or invalid requests to notify Bloomreach Engagement to stop sending more requests
  • HTTP 500 to notify Bloomreach Engagement to retry the same request later
  • Accepts and responds with JSON
  • Authenticates requests with HTTP authorization header (HTTP Basic Auth described in RFC 2617)

Response Codes

HTTP 200

Generally, your API will respond with HTTP 200 whenever a request was received, parsed and passed initial validation.

{
    "success": true,
    "result": {
        "code": 1,
        "status": "sent",
        "message: "SMS message sent successfully"
}
}

If request processing fails due to input specific to this single request, your API should return HTTP 200 with error message.

{
    "success": false,
    "errors": [
        { "code": 1004, "description": "SMS message too long. Limit is 160 characters."
    ]
}

HTTP 4xx

Each response HTTP 4xx will be translated as a campaign event with the status "failed".

The exceptions are the error codes 400, 404, 410, 412, 413, and 418, which prevent the webhook from sending any other request for all chunks.

A chunk of customers means a group of customers that entered the scenario at the same time (Now trigger, Regularly triggered scenarios). For event-triggered scenarios, every customer is a new chunk.

Do not respond with 4xx if the request is invalid only for 1 customer due to webhook personalization. For this, you need to use the 'per customer' webhook, which sends and processes for one customer only at a time. However, please bear in mind that this increases the load on the API endpoint.

As for the batch webhooks, it is impossible to tell from the response to which of the batched customers it belongs. Therefore it is being tracked for all. Even one customer causing an error when received and processed by the API endpoint that returns a 4xx response will add a 'failure' event for all customers in a batch, even if they did not cause that error on the API side.

Usual examples:

  • 400 Bad request - request generated by Bloomreach Engagement is invalid due to incorrect Webhook definition
  • 401 Unauthorized - credentials are not provided or invalid
  • 404 Not found

HTTP 500

HTTP 500 responses indicate that your API is currently unavailable to process the request. Bloomreach Engagement will attempt to retry the request later.

Retry Policy

Bloomreach Engagement will retry on HTTP 500 Internal Server Error or higher, 429 Too many reqeusts and also for infrequent connection issues - timeout, reject, drop.

There is a maximum of 3 retries (total of 4 requests attempts). Batch webhooks with batch limit of 10 or above may do up to 7 retries (8 attempts).

The time between each retry increases with some random factor (the average delays for 4 attempts are 7, 45 and 300 seconds respectively with randomized +-10%, which might change in future without announcement).

However, when Bloomreach Engagement encounters too many unrecoverable errors, it starts aborting Webhook requests. This aborting occurs for customers it hasn't processed yet, to protect itself from exhaustion.

Unrecoverable Errors:

  • HTTP 4xx, except 429 Too many requests
  • exhausts number of retries for recoverable errors*
  • frequent connection issues*
  • rendering repeatedly fails (Jinja personalization)

*The behavior is very specific and the definition is out of the scope of this document.

It's advised to build your API idempotent: require a unique request identifier and reject the processing of duplicate requests.

Response Format

Bloomreach Engagement supports any content-type in requests, however, only JSON and plain text format can be parsed as a response. Use of JSON formatted response is recommended.

Performance Requirements

Response Time

Bloomreach Engagement expects your API to respond in a timely fashion. The timeouts are set at 10 seconds by default, and we are no longer aborting subsequent requests.

However, we strongly recommend having latencies below 1 second as larger latencies will make the performance of your campaigns worse and risk getting close to the 10-second timeout. It's not not true that we may retry requests with latencies above 1 second.

Concurrency

Bloomreach Engagement scenarios run in parallel by default. If not set differently, It's recommended to scale your API to handle at least 200 concurrent HTTP requests. You can set the desired maximum of parallel requests in each webhook (See more). In the worst case, you may use HTTP 500 or connection reject, with Bloomreach Engagement retry policy in mind, to throttle the number of concurrent requests.

Security

Authentication

HTTP Basic Authentication is implemented natively. See RFC 2617

HTTPS

Your server must expose the API on a secure endpoint to ensure In-Transit encryption. Bloomreach Engagement requires your server's SSL certificate to be signed by Trusted CA. The common name (CN) of the supplied SSL certificate needs to match the API endpoint.

The current list of Trusted CAs can be found here

IP Firewall Whitelisting

A pool of outgoing IP addresses can be limited by enabling "Use static IPs" in Webhook configuration, where you will find the list of those IPs.

📘

Configuration of Static IPs

Configuration of static IPs on instances other than app.exponea.com may differ.

❗️

Overusage of Static IPs in Webhook Nodes

Be very careful with usage of static IPs in large amount of webhooks.
Try to use them only when absolutely necessary.
If overused, such API requests has ability to create bottle necks and slower campaign execution on the whole Bloomreach Engagement instance.

As a rule of thumb, few thousands API requests with static IPs should be OK, but more than hundred thousand requests could introduce slower execution.

As a workaround we recommend:

  • Instead of IP whitelisting use https and authorization header as a main security measure
  • If possible, call webhook API requests with static IPs in bulk. This should lower the amount of campaign API requests.