xlibb/pipeline

1.0.0

Overview

Building message-driven applications often involves complex tasks such as data transformation, message filtering, and reliable delivery to multiple systems. Developers frequently find themselves writing repetitive code for common patterns like retries, error handling, and parallel delivery. This leads to increased development time, inconsistent implementations, and systems that are harder to maintain and evolve.

This package simplifies these challenges by offering a standardized, declarative way to define message pipelines. It centralizes message flow management, reduces boilerplate code, and makes it easier to build resilient, fault-tolerant applications, thereby improving developer experience and promoting consistent, reliable integration patterns across Ballerina projects.

Core components

The package provides a set of core components that facilitate message-driven application development.

Handler

The Handler is the fundamental building block of the handler chain. It represents a processing unit that can either process messages or serve as a destination for them. Handlers can be configured with various properties, such as retry policies and error handling strategies.

Processor

The message processors are just Ballerina functions that are annotated to indicate their type and purpose. All processors are assumed to be idempotent, meaning that running them multiple times with the same input will always produce the same result. This is crucial for safe message replay. It is developer's responsibility to ensure that the logic within these processors adheres to this principle.

The package provides three types of processors:

Filter: A processor that can drop messages based on a condition. This accepts the Context and returns a boolean indicating whether the message should continue processing.


@pipeline:Filter {id: "filter"}
isolated function filter(pipeline:MessageContext context) returns boolean|error {
    // Check some condition on the message
}

Transformer: A processor that modifies the message content or metadata. It accepts the Context and returns a modified message content.


@pipeline:Transformer {id: "transformer"}
isolated function transformer(pipeline:MessageContext context) returns anydata|error {
    // Modify the message content or metadata
    // Return the modified message content
}

Generic Processor: A processor that can perform any action on the Context. It accepts the Context and returns nothing.


@pipeline:Processor {id: "generic"}
isolated function generic(pipeline:MessageContext context) returns error? {
    // Perform any action on the context
}

Destination

A destination is similar to a generic processor but is used to deliver the message to an external system or endpoint. It accepts a copy of the Context and returns an error if the delivery fails. Additionally, it can return any result that is relevant to the delivery operation, such as a confirmation or status.

A destination can be configured with retry policies to ensure reliable delivery.


@pipeline:Destination {
    id: "destination"
    retryConfig: {
        maxRetries: 3,
        retryInterval: 2
    }
}
isolated function destination(pipeline:MessageContext context) returns anydata|error {
    // Deliver the message to an external system or endpoint
}

Message

The Message is the core data structure that represents the message being processed. It contains the actual payload and any metadata required for processing. The Message is passed through the handler chain, allowing each processor to access and modify it as needed.

Message context

The MessageContext is a mutable container that holds the current state of the message being processed. It encapsulates the Message itself, along with any additional properties or metadata that processors and destinations need to share or update during the message's journey through the handler chain. This allows for a flexible and dynamic processing flow, where each component can access and modify the context as needed.

The following methods are available on the pipeline:MessageContext:

Method	Description
`getContent()`	Returns the message content as `anydata`.
`getContentWithType()`	Returns the message content as a specific type.
`getId()`	Returns the unique identifier of the message.
`setProperty(string key, anydata value)`	Sets a property in the context.
`getProperty(string key)`	Gets a property from the context.
`getPropertyWithType(string key)`	Gets a property from the context with a specific type.
`hasProperty(string key)`	Checks if a property exists in the context.
`removeProperty(string key)`	Removes a property from the context.
`toRecord()`	Converts the context to a record type for easier inspection and debugging.

Failure store

The FailureStore is a crucial component that captures messages that fail during processing or delivery. It stores the original message content along with a snapshot of the MessageContext at the time of failure. This allows for later inspection, debugging, and potential replay of failed messages.

Replay listener

The ReplayListener is an optional component that listens for failed messages stored in the FailureStore or a dedicated ReplayStore. It attempts to re-process these messages through the handler chain's defined pipeline, including retry policies. If a message consistently fails replay attempts, it can be routed to a Dead Letter Store for manual intervention.

Handler chain

The HandlerChain is the central component that orchestrates the entire message processing flow. It manages the sequence of handlers, the MessageContext, and the interaction with the FailureStore and ReplayListener. The HandlerChain is responsible for executing the defined processing logic, handling failures, and ensuring messages are processed in a consistent manner.


pipeline:HandlerChain handlerChain = check new(
    name = "exampleHandlerChain", // Name of the handler chain
    processors = [
        filter, // a Filter processor
        transformer, // a Transformer processor
        generic // a generic Processor
    ],
    destinations = [
        destination // a Destination handler
    ],
    failureStore = failureStore, // an instance of FailureStore
    replayListenerConfig = {
        pollingInterval: 5, // Polling interval for the replay listener
        maxRetries: 3, // Maximum retries for replaying messages
        retryInterval: 2 // Interval between retries
        deadLetterStore: deadLetterStore // Dead Letter Store
        replayStore: replayStore // Optional Replay Store
    }
);

Component interaction

The flow of a message through a pipeline:HandlerChain is meticulously orchestrated to ensure reliability and flexible processing:

Message Ingress: A raw message content (e.g., a string, json, byte[], or anydata) enters the Handler Chain through its execute method. This content typically originates from an external source (e.g., an HTTP request, a message queue subscription, a file read, or a direct function call).
Context Creation: The Handler Chain immediately wraps this incoming raw content into a Message record. This Message is then encapsulated within a new Message Context instance. This Message Context becomes the central, dynamic container for all subsequent operations, allowing processors and destinations to share and update state throughout the message's journey. A unique identifier is assigned to the Message and stored within the Context.
Sequential Processing (Processors):
- The Handler Chain iteratively processes the Message Context through its configured Processors in the defined order.
- Each Processor receives the Message Context as input. It can access and modify the message's content, update its internal metadata, or add new properties to the Message Context itself.
- Filtering: If a Filter processor returns false (indicating the message should be dropped) or an error, the Handler Chain immediately stops further processing for that message within the current handler chain. The message is considered successfully handled (dropped) and is not passed to subsequent processors or destinations.
- Error Handling (Processors): If any Processor encounters an error and returns an error type, the Handler Chain catches this exception. It then persists the original Message and the initial Message Context into the configured Failure Store. This ensures that the state leading to the failure is preserved for later inspection and potential replay.
Parallel Delivery (Destinations):
- If the message successfully traverses all Processors (i.e., it wasn't dropped and no processor returned an unhandled error), the Handler Chain proceeds to its Destinations flow.
- The Destinations configured in the Handler Chain are executed in parallel.
- Crucially, each Destination receives a copy of the Message Context (which includes the fully processed Message). This ensures isolation; actions performed by one destination (e.g., external API calls, logging specific to that destination) do not unintentionally interfere with the Message Context being used by other concurrently executing destinations.
- Error Handling (Destinations): If any Destination fails to deliver the message (returns an error), the Handler Chain intercepts this. Similar to processor failures, the original Message and the initial Message Context are sent to the Failure Store.
- Execution Result: If all Destinations succeed, the execute method returns a pipeline:ExecutionSuccess containing a map of results from each destination, keyed by the destination's name.
Failure Store Interaction:
- The Failure Store is a required configuration for the Handler Chain.
- When enabled, it acts as the central repository for messages that encounter an error during either the Processor phase or the Destination phase.
- The Handler Chain serializes and persists the Message and the state of its Message Context into the Failure Store. This comprehensive capture is vital for debugging, re-analyzing failure causes, and enabling the replay mechanism.
Replay Mechanism:
- The Handler Chain can be configured with an optional Replay Listener (leveraging ballerina/messaging:StoreListener) that automatically monitors the Failure Store/Replay Store to reply failed messages.
- The replayListener will poll the Failure Store/Replay Store at a configured pollingInterval for new failed messages.
- When a failed message is retrieved by the replayListener, it triggers a re-processing attempt through the original Handler Chain's execute method, but with an intelligent context.
- Intelligent Replay: During replay, the Handler Chain inspects the Message Context snapshot. If the Message Context contains information about destinations that already successfully processed the message in previous attempts, the Handler Chain will intelligently skip those already successful Destinations. This prevents redundant deliveries to systems that have already received the message, ensuring idempotency at the destination level where possible and preventing unintended side effects.
- If a replayed message consistently fails even after the configured number of maxRetries, it can be sent to a Dead Letter Store (another messaging:Store instance) for manual inspection or further automated handling outside the main handler chain flow.

Handler Chain Interaction

Example usage


import ballerina/constraint;
import ballerina/http;

import xlibb/pipeline;

public enum OrderStatus {
    PENDING,
    APPROVED,
    COMPLETED,
    FAILED
}

public type Order record {|
    @constraint:String {pattern: re `^OR[0-9]{5}$`}
    readonly string id;
    @constraint:String {pattern:  re `^[a-zA-Z0-9]+$`}
    readonly string customerId;
    @constraint:Number {minValue: 0.0d}
    decimal unitPrice;
    @constraint:Int {minValue:  1}
    int quantity;
    OrderStatus status;
|};

public type CalculatedOrder record {|
    *Order;
    decimal amount;
|};

@pipeline:ProcessorConfig {id: "validate_order"}
isolated function validateOrder(pipeline:MessageContext msgCtx) returns error? {
    Order 'order = check msgCtx.getContentWithType();
    Order _ = check constraint:validate('order);
}

@pipeline:FilterConfig {id: "filter_pending_orders"}
isolated function orderFilter(pipeline:MessageContext msgCtx) returns boolean|error {
    Order 'order = check msgCtx.getContentWithType();
    return 'order.status == PENDING || 'order.status == APPROVED;
}

@pipeline:TransformerConfig {id: "calculate_amount"}
isolated function calculateOrderAmount(pipeline:MessageContext msgCtx) returns CalculatedOrder|error {
    Order 'order = check msgCtx.getContentWithType();
    return {
        ...'order,
        amount: 'order.unitPrice * 'order.quantity
    };
}

@pipeline:TransformerConfig {id: "approve_order"}
isolated function approveOrder(pipeline:MessageContext msgCtx) returns CalculatedOrder|error {
    CalculatedOrder 'order = check msgCtx.getContentWithType();
    if 'order.status == APPROVED {
        return 'order; // Skip further processing if already approved
    }
    if 'order.amount > 100000.0d {
        'order.status = FAILED;
        return error("Order amount exceeds limit");
    }
    'order.status = APPROVED;
    return 'order;
}

@pipeline:ProcessorConfig {id: "get_discount"}
isolated function checkForOrderDiscount(pipeline:MessageContext msgCtx) returns error? {
    CalculatedOrder 'order = check msgCtx.getContentWithType();
    http:Client discountService = check new("http://discount-service:8080");
    float discount = check discountService->/discounts/['order.customerId];
    msgCtx.setProperty("discount", discount);
}

@pipeline:TransformerConfig {id: "apply_discount"}
isolated function applyOrderDiscount(pipeline:MessageContext msgCtx) returns CalculatedOrder|error {
    CalculatedOrder 'order = check msgCtx.getContentWithType();
    decimal discount = check msgCtx.getPropertyWithType("discount");
    'order.amount = 'order.amount - ('order.amount * discount);
    return 'order;
}

@pipeline:DestinationConfig {id: "add_order_to_inventory"}
isolated function addOrderToInventory(pipeline:MessageContext msgCtx) returns json|error {
    CalculatedOrder 'order = check msgCtx.getContentWithType();
    http:Client inventoryService = check new("http://inventory-service:8080");
    return inventoryService->/orders.post('order);
}

final rabbitmq:MessageStore failureStore = check new("order-failure-store");
final rabbitmq:MessageStore deadLetterStore = check new("order-dead-letter-store");
final rabbitmq:MessageStore replayStore = check new("order-replay-store");

final pipeline:HandlerChain orderPipeline = check new (
    name = "orderPipeline",
    processors = [
        validateOrder,
        orderFilter,
        calculateOrderAmount,
        approveOrder,
        checkForOrderDiscount,
        applyOrderDiscount
    ],
    destinations = addOrderToInventory,
    failureStore = failureStore,
    replayListenerConfig = {
        pollingInterval: 5,
        maxRetries: 3,
        retryInterval: 2,
        deadLetterStore: deadLetterStore,
        replayStore: replayStore
    }
);

service /api/v1 on new http:Listener(8080) {

    resource function post orders(Order 'order) returns http:Accepted|error {
        _ = start orderPipeline.execute('order.clone());
        return http:ACCEPTED;
    }
}

Classes

pipeline: HandlerChain

Isolated

Represents a handler chain that processes messages through a series of processors and destinations. A handler chain can be defined with a sequence of processors and a set of destinations. The destinations are executed in parallel after all processors have been executed. If any processor or destination fails, the message can be stored in a store for later processing or for error handling. Additionally, the handler chain supports pausing the execution of a message, allowing it to be resumed later with the same state.

Constructor

Creates a new handler chain.

init (string name, Processor|readonly & Processor[] processors, Destination|readonly & Destination[] destinations, Store failureStore, ReplayListenerConfiguration? replayListenerConfig)

name string - The name of the handler chain

processors Processor|readonly & Processor[] - The processors to be used in the handler chain, which can be a single processor or an array of processors

destinations Destination|readonly & Destination[] - The destinations to be used in the handler chain, which can be a single destination or an array of destinations

failureStore Store - The store to be used for storing messages on failure

replayListenerConfig ReplayListenerConfiguration? () -

getName

Isolated Function

function getName() returns string

Get the name of the handler chain.

Return Type

string - Returns the name of the handler chain

getFailureStore

Isolated Function

function getFailureStore() returns Store

Get the failure store of the handler chain.

Return Type

Store - Returns the failure store of the handler chain

replay

Isolated Function

function replay(Message message) returns ExecutionSuccess|ExecutionError

Replay the execution of a failed message in the handler chain. Failed messages in replay will not be stored automatically.

Parameters

message Message - The message to be replayed, which should contain the ID and the relevant state of the failed message

Return Type

ExecutionSuccess|ExecutionError - Returns an error if the message could not be processed, otherwise returns the execution result

execute

Isolated Function

function execute(anydata content) returns ExecutionSuccess|ExecutionError

Dispatch a message to the handler chain for processing with the defined processors and destinations.

Parameters

content anydata - The message content to be processed

Return Type

ExecutionSuccess|ExecutionError - Returns the execution result or an error if the processing failed

pipeline: MessageContext

Isolated

MessageContext encapsulates the message and the relevant properties. Additionally, it provides methods to manipulate the message properties and metadata.

getId

Isolated Function

function getId() returns string

Get the unique identifier of the message.

Return Type

string - The unique identifier of the message

toRecord

Isolated Function

function toRecord() returns Message

Get the message as a record.

Return Type

Message - A record version of the message

getContent

Isolated Function

function getContent() returns anydata

Get the content of the message.

Return Type

anydata - The content of the message, which can be of anydata type

getHandlerChainName

Isolated Function

function getHandlerChainName() returns string

Get the name of the handler chain that processed this message.

Return Type

string - The name of the handler chain

getContentWithType

Isolated Function

function getContentWithType(typedesc<anydata> targetType) returns targetType|Error

Get the content of the message with a specific type.

Parameters

targetType typedesc<anydata> (default <>) - The type to which the content should be converted, defaults to anydata

Return Type

targetType|Error - The content of the message converted to the specified type, or an error if the conversion fails

getProperty

Isolated Function

function getProperty(string key) returns anydata

Get message property by key.

Parameters

key string - The key of the property to retrieve

Return Type

anydata - The value of the property if it exists, otherwise panics

getPropertyWithType

Isolated Function

function getPropertyWithType(string key, typedesc<anydata> targetType) returns targetType|Error

Get message property by key with a specific type.

Parameters

key string - The key of the property to retrieve

targetType typedesc<anydata> (default <>) - The type to which the property value should be converted, defaults to anydata

Return Type

targetType|Error - The value of the property converted to the specified type, or an error if the conversion fails. The function will panic if the property does not exist

setProperty

Isolated Function

function setProperty(string key, anydata value)

Set a property in the message.

Parameters

key string - The key of the property to set

value anydata - The value to set for the property

removeProperty

Isolated Function

function removeProperty(string key) returns anydata

Remove a property from the message.

Parameters

key string - The key of the property to remove

Return Type

anydata - If the property exists, it is removed; otherwise, it panics

hasProperty

Isolated Function

function hasProperty(string key) returns boolean

Check if a property exists in the message.

Parameters

key string - The key of the property to check

Return Type

boolean - Returns true if the property exists, otherwise false

Annotations

pipeline: DestinationConfig

DestinationConfiguration

function

Destination configuration annotation.

pipeline: FilterConfig

HandlerConfiguration

function

Filter configuration annotation.

pipeline: ProcessorConfig

HandlerConfiguration

function

Processor configuration annotation.

pipeline: TransformerConfig

HandlerConfiguration

function

Transformer configuration annotation.

Records

pipeline: DestinationConfiguration

Closed record

Represents a destination configuration that can be used to configure a destination with retry capabilities.

Fields

Fields Included from *HandlerConfiguration

id string

retryConfig? RetryDestinationConfig - The retry configuration for the destination. By default, it is disabled

pipeline: ErrorDetails

Closed record

Error details.

Fields

message Message - The message associated with the errored execution

pipeline: ErrorInfo

Closed record

Error information type.

Fields

message string - A descriptive error message

stackTrace string[] - An array of strings representing the stack trace of the error

detail map<anydata> - A map containing additional details about the error, which can include any data type

cause? ErrorInfo - An optional cause of the error, which can be another error or an error info

pipeline: ExecutionSuccess

Closed record

Channel execution success result.

Fields

message Message - The message that was processed

destinationResults map<anydata>(default {}) - A map of destination names to their respective results

pipeline: HandlerConfiguration

Closed record

Handler related configuration.

Fields

id string - The unique identifier for the handler

pipeline: Message

Closed record

Message type represents a message with content, error information, metadata, and properties.

Fields

idreadonly string - Unique identifier for the message

handlerChainNamereadonly string - The name of the handler chain that processed this message

content anydata - The actual content of the message, which can be of anydata type

errorMsg? string - Optional error message if an error occurred during processing

errorStackTrace? string[] - Optional stack trace of the error if available

errorDetails? map<anydata> - Additional error details, which can include anydata type

destinationErrors? map<ErrorInfo> - A map of errors associated with specific destinations, where the key is the destination name and the value is an ErrorInfo record

metadata MessageMetadata(default {}) - Metadata associated with the message, such as destinations to skip

properties map<anydata>(default {}) - A map of additional properties associated with the message

destinationResults? map<anydata> - A map of successful destination results

pipeline: MessageMetadata

Closed record

Message metadata.

Fields

destinationsToSkip string[](default []) - An array of destination names that should be skipped when processing the message

pipeline: ReplayListenerConfiguration

Closed record

Represents the listener configurations to replay message processing in a handler chain.

Fields

Fields Included from *ServiceRetryConfiguration

maxRetries int
retryInterval decimal

pollingInterval decimal(default 1) - The interval in seconds to poll for messages to replay

deadLetterStore Store - The store to be used for storing messages that could not be processed even after retries

replayStore? Store - Optional store to be used for replaying messages in the handler chain. If not provided, the handler chain's failure store will be used.

pipeline: RetryDestinationConfig

Closed record

Represents a retry configuration for a destination.

Fields

maxRetries int(default 3) - The maximum number of retries for the destination

retryInterval decimal(default 1) - The interval in seconds between retries

pipeline: ServiceRetryConfiguration

Closed record

Retry configuration for the replay service.

Fields

maxRetries int(default 3) - The maximum number of retries to attempt for a message

retryInterval decimal(default 1) - The interval in seconds to wait before retrying a message

Errors

pipeline: Error

Distinct

Generic error type.

Union types

pipeline: Processor

GenericProcessor|Filter|Transformer

Processor

Represents a processor that can be a filter, transformer, or processor and can be attached to a channel for processing messages. Processors should be idempotent i.e. repeating the execution with the same message should not change the outcome or the channel state.

pipeline: Handler

Processor|Destination

Handler

Represents a handler which can be either a Processor or a Destination.

Intersection types

pipeline: ExecutionError

distinct Error & error

ExecutionError

Execution error type for the channel execution.

Function types

pipeline: Transformer

function(MessageContext) returns (anydata|error)

Transformer

Represents a transformer function that processes the message content and returns a modified message content.

pipeline: Filter

function(MessageContext) returns (boolean|error)

Filter

Represents a filter function that checks the message context and returns a boolean indicating whether the message should be processed further.

pipeline: GenericProcessor

function(MessageContext) returns (error?)

GenericProcessor

Represents a generic message processor that can process the message and return an error if the processing fails.

pipeline: Destination

function(MessageContext) returns (anydata|error)

Destination

Represents a destination function that processes the message context and returns a result or an error if it failed to send the message to the destination. Destinations are typically contains a sender or a writer that sends or writes the message to a specific destination.

Import

import xlibb/pipeline;

Other versions

1.0.0

Metadata

Released date: 24 days ago

Version: 1.0.0

License: Apache-2.0

Compatibility

Platform: java21

Ballerina version: 2201.12.0

GraalVM compatible: Yes

Pull count

Total: 0

Current verison: 0

Weekly downloads

Source repository

Keywords

pipeline

handler

replay

processor

destination

store

Contributors

Dependencies

ballerina/io/1.8.0 ballerina/log/2.12.0 ballerina/uuid/1.10.0

Cookie policy

Delete policy

classes

annotations

configurables

records

errors

unionTypes

intersectionTypes

functionTypes

xlibb/pipeline

Overview

Core components

Handler

Processor

Destination

Message

Message context

Failure store

Replay listener

Handler chain

Component interaction

Example usage

Classes

pipeline: HandlerChain

Constructor

getName

Return Type

getFailureStore

Return Type

replay

Parameters

Return Type

execute

Parameters

Return Type

pipeline: MessageContext

getId

Return Type

toRecord

Return Type

getContent

Return Type

getHandlerChainName

Return Type

getContentWithType

Parameters

Return Type

getProperty

Parameters

Return Type

getPropertyWithType

Parameters

Return Type

setProperty

Parameters

removeProperty

Parameters

Return Type

hasProperty

Parameters

Return Type

Annotations

pipeline: DestinationConfig

pipeline: FilterConfig

pipeline: ProcessorConfig

pipeline: TransformerConfig

Records

pipeline: DestinationConfiguration

Fields

pipeline: ErrorDetails

Fields

pipeline: ErrorInfo

Fields

pipeline: ExecutionSuccess

Fields

pipeline: HandlerConfiguration

Fields

pipeline: Message

Fields

pipeline: MessageMetadata

Fields