Module aws.redshiftdata

ballerinax/aws.redshiftdata Ballerina library

1.1.0

Overview

Amazon Redshift is a powerful and fully-managed data warehouse service provided by Amazon Web Services (AWS), designed to efficiently analyze large datasets with high performance and scalability.

The ballerinax/aws.redshiftdata package allows developers to interact with Amazon Redshift Data API seamlessly using Ballerina. The Redshift Data API simplifies data access by eliminating the need for managing persistent database connections or the Redshift JDBC driver.

Setup guide

Log into the AWS Management Console. If you don’t have an AWS account yet, you can create one by visiting the AWS sign-up page. Sign up is free, and you can explore many services under the Free Tier.

Create a user

In the AWS Management Console, search for IAM in the services search bar.
Click on IAM
Click Users
Click Create User
Provide a suitable name for the user and continue
Add necessary permissions by adding the user to a user group, copy permissions or directly attach the policies. And click next.
Review and create the user

Get user access keys

Click the user that created
Click Create access key
Click your use case and click next.
Record the Access Key and Secret access key. These credentials will be used to authenticate your Ballerina application with the Redshift cluster.

Setup a Cluster

To use the Ballerina AWS Redshift data connector, follow these steps to set up an Amazon Redshift cluster:

Step 1: Navigate to Amazon Redshift and create a cluster

In the AWS Management Console, search for Redshift in the services search bar.
Click on Amazon Redshift.
Click on the Create cluster button to initiate the process of creating a new Amazon Redshift cluster.

Step 2: Configure cluster settings

Configure your Redshift cluster settings, including cluster identifier, database name, credentials, and other relevant parameters.
Configure security groups to control inbound and outbound traffic to your Redshift cluster. Ensure that your Ballerina application will have the necessary permissions to access the cluster.
Record the username during the cluster configuration. This will be used to authenticate your Ballerina application with the Redshift cluster.
Finally, review your configuration settings, and once satisfied, click Create cluster to launch your Amazon Redshift cluster.

Step 3: Wait for cluster availability

It may take some time for your Redshift cluster to be available. Monitor the cluster status in the AWS Console until it shows as "Available".

Note: Amazon Redshift now offers a serverless option, allowing you to use the data warehouse without managing infrastructure. Redshift Serverless automatically scales to handle your workloads, providing a flexible and efficient way to run analytics. To configure a Redshift serverless setup, please refer to AWS documentation.

Quickstart

To use the aws.redshiftdata connector in your Ballerina project, modify the .bal file as follows:

Step 1: Import the module


import ballerinax/aws.redshiftdata;

Step 2: Instantiate a new connector

Create a new redshiftdata:Client by providing the region, authentication configurations and dbAccessConfig.

The dbAccessConfig in the ConnectionConfig record defines the database access configuration for connecting to the Redshift Data API. It can be set to either a Cluster or a WorkGroup (Serverless mode). Additionally, users can override this configuration for specific requests by providing it in individual calls to methods like execute or batchExecute, allowing for more granular control over database access per execution.


configurable string accessKeyId = ?;
configurable string secretAccessKey = ?;
configurable redshiftdata:Cluster dbAccessConfig = ?;

redshiftdata:Client redshiftdata = check new ({
   region: redshiftdata:US_EAST_2,
   auth: {
      accessKeyId,
      secretAccessKey
   },
   dbAccessConfig
});

Step 3: Invoke the connector operations

Now, utilize the available connector operations.


redshiftdata:ExecutionResponse response = check redshiftdata->execute(`SELECT * FROM Users`);

redshiftdata:DescriptionResponse descriptionResponse = check redshiftdata->describe(response.statementId);

stream<User, redshiftdata:Error?> statementResult = check redshiftdata->getResultAsStream(response.statementId);

Step 4: Run the Ballerina application

Use the following command to compile and run the Ballerina program.


bal run

Examples

The aws.redshiftdata connector provides practical examples illustrating usage in various scenarios. Explore these examples.

Manage users - This example demonstrates how to use the Ballerina Redshift Data connector to perform SQL operations on an AWS Redshift cluster. It includes creating a table, inserting data, and querying data.
Music store - This example illustrates the process of creating an HTTP RESTful API with Ballerina to perform basic CRUD operations on a database, specifically AWS Redshift, involving setup, configuration, and running examples.

Clients

aws.redshiftdata: Client

Isolated

The AWS Redshift Data API client.

Constructor

Initialize AWS Redshift Data API client.

redshiftdata:Client redshift = check new (region = redshiftdata:US_EAST_2,
   auth = {
       accessKeyId: "<aws-access-key>",
       secretAccessKey: "<aws-secret-key>"
   },
   dbAccessConfig = {
       id: "<cluster-id>",
       database: "<database-name>",
       dbUser: "<db-user>"
   }
);

init (*ConnectionConfig connectionConfig)

connectionConfig *ConnectionConfig - The Redshift Data API client configurations If a dbAccessConfig is provided, it will be used for the statement executions It can be overridden using the dbAccessConfig at the API level

execute

Isolated FunctionRemote Function

function execute(ParameterizedQuery statement, *ExecutionConfig executionConfig) returns ExecutionResponse|Error

Runs an SQL statement, which can be data manipulation language (DML) or data definition language (DDL).


redshiftdata:ExecutionResponse response = check redshift->execute(`SELECT * FROM Users`);

Parameters

statement ParameterizedQuery - The SQL statement to be executed

executionConfig *ExecutionConfig - The configurations related to the execution of the statement

Return Type

ExecutionResponse|Error - The redshiftdata:ExecutionResponse or a redshiftdata:Error if the execution fails

batchExecute

Isolated FunctionRemote Function

function batchExecute(ParameterizedQuery[] statements, *ExecutionConfig executionConfig) returns ExecutionResponse|Error

Runs one or more SQL statements, which can be data manipulation language (DML) or data definition language (DDL). The batch size should not exceed 40.


redshiftdata:ExecutionResponse response = check redshift->batchExecute([`<statement>`,
   `<statement>`]);

Parameters

statements ParameterizedQuery[] - The SQL statements to be executed

executionConfig *ExecutionConfig - The configurations related to the execution of the statements

Return Type

ExecutionResponse|Error - The redshiftdata:ExecutionResponse or a redshiftdata:Error if the execution fails

getResultAsStream

Isolated FunctionRemote Function

function getResultAsStream(StatementId statementId, typedesc<record {}> rowTypes) returns stream<rowTypes, Error?>|Error

Retrieves the results for a previously executed SQL statement.


stream<User, Error?> response = check redshift->getResultAsStream("<statement-id>");

Parameters

statementId StatementId - The identifier of the SQL statement

rowTypes typedesc<record {}> (default <>) - The typedesc of the record to which the result needs to be returned

Return Type

stream<rowTypes, Error?>|Error - Stream of records in the type of rowTypes or a redshiftdata:Error if the retrieval fails

describe

Isolated FunctionRemote Function

function describe(StatementId statementId) returns DescriptionResponse|Error

Retrieves the execution status for a previously executed SQL statement.


redshiftdata:DescriptionResponse response = check redshift->describe("<statement-id>");

Parameters

statementId StatementId - The identifier of the SQL statement

Return Type

DescriptionResponse|Error - The redshiftdata:DescriptionResponse or a redshiftdata:Error if the execution fails

close

Isolated FunctionRemote Function

function close() returns Error?

Gracefully closes AWS Redshift Data API client resources.


check redshift->close();

Return Type

Error? - A redshiftdata:Error if there is an error while closing the client resources or else nil

Enums

aws.redshiftdata: Region

An Amazon Web Services region that hosts a set of Amazon services.

Members

AF_SOUTH_1

AP_EAST_1

AP_NORTHEAST_1

AP_NORTHEAST_2

AP_NORTHEAST_3

AP_SOUTH_1

AP_SOUTH_2

AP_SOUTHEAST_1

AP_SOUTHEAST_2

AP_SOUTHEAST_3

AP_SOUTHEAST_4

AWS_CN_GLOBAL

AWS_GLOBAL

AWS_ISO_GLOBAL

AWS_ISO_B_GLOBAL

AWS_US_GOV_GLOBAL

CA_WEST_1

CA_CENTRAL_1

CN_NORTH_1

CN_NORTHWEST_1

EU_CENTRAL_1

EU_CENTRAL_2

EU_ISOE_WEST_1

EU_NORTH_1

EU_SOUTH_1

EU_SOUTH_2

EU_WEST_1

EU_WEST_2

EU_WEST_3

IL_CENTRAL_1

ME_CENTRAL_1

ME_SOUTH_1

SA_EAST_1

US_EAST_1

US_EAST_2

US_GOV_EAST_1

US_GOV_WEST_1

US_ISOB_EAST_1

US_ISO_EAST_1

US_ISO_WEST_1

US_WEST_1

US_WEST_2

aws.redshiftdata: Status

The status of the SQL statement being described.

Members

SUBMITTED - The query was submitted, but not yet processed

PICKED - The query has been chosen to be run

STARTED - The query run has started

FINISHED - The query has finished running

ABORTED - The query run was stopped by the user

FAILED - The query run failed

ALL - A status value that includes all query statuses. This value can be used to filter results

Records

aws.redshiftdata: Cluster

Closed record

Represents the configuration details required for connecting to an Amazon Redshift cluster.

Fields

id string - The cluster identifier

database string - The name of the database

dbUser? string - The database user name

secretArn? string - The name or ARN of the secret that enables access to the database

sessionKeepAliveSeconds? int - The number of seconds to keep the session alive after the query finishes

aws.redshiftdata: ConnectionConfig

Closed record

Represents connection configurations related to Redshift Data API.

Fields

region Region - The AWS region with which the connector should communicate

auth StaticAuthConfig|EC2IAMRoleConfig - The authentication configurations for the Redshift Data API

dbAccessConfig? Cluster|WorkGroup - The database access configurations for the Redshift Data API This can be overridden in the individual execute and batchExecute requests

aws.redshiftdata: DescriptionResponse

Closed record

Describes the details about a specific instance when a query was run by the Amazon Redshift Data API.

Fields

Fields Included from *StatementData

statementId StatementId
createdAt Utc
duration decimal
error string
hasResultSet boolean
queryString string
redshiftQueryId int
resultRows int
resultSize int
status Status
updatedAt Utc

subStatements? StatementData[] - The SQL statements from a multiple statement run

redshiftPid int - The process identifier from Amazon Redshift

sessionId? SessionId - The session identifier of the query

aws.redshiftdata: EC2IAMRoleConfig

Closed record

Represents the EC2 IAM role based authentication configurations for the Redshift Data API.

Fields

profileName? string - Configure the profile name used for loading IMDS-related configuration, like the endpoint mode (IPv4 vs IPv6)

profileFile? string - The path to the file containing the profile configuration

aws.redshiftdata: ErrorDetails

Closed record

The error details type for the AWS Redshift Data module.

Fields

httpStatusCode? int - The HTTP status code for the error

httpStatusText? string - The HTTP status text returned from the service

errorCode? string - The error code associated with the response

errorMessage? string - The human-readable error message provided by the service

aws.redshiftdata: ExecutionConfig

Closed record

Represents the configuration details required for execute method.

Fields

dbAccessConfig? Cluster|WorkGroup|SessionId - The database access configurations for the Redshift Data If a dbAccessConfig is provided in the ExecutionConfig , it will override the init level dbAccessConfig

clientToken? string - A unique, case-sensitive identifier that you provide to ensure the idempotency of the request

statementName? string - The name of the SQL statement

withEvent? boolean - Flag which indicates to send an event after the SQL statement execution to an event bus instance running in Amazon EventBridge

aws.redshiftdata: ExecutionResponse

Closed record

The response from the execute method.

Fields

createdAt Utc - The date and time (UTC) the statement was created

dbGroups? string[] - A list of colon (:) separated names of database groups

statementId StatementId - The identifier of the SQL statement whose results are to be fetched

sessionId? SessionId - The session identifier of the query

aws.redshiftdata: StatementData

Closed record

Information about an SQL statement.

Fields

statementId StatementId - The identifier of the SQL statement described

createdAt Utc - The date and time (UTC) when the SQL statement was submitted to run

duration decimal - The amount of time in seconds that the statement ran

'error? string - The error message from the cluster if the SQL statement encountered an error while running

hasResultSet boolean - A value that indicates whether the statement has a result set

queryString? string - The SQL statement text

redshiftQueryId int - The identifier of the query generated by Amazon Redshift

resultRows int - Either the number of rows returned from the SQL statement or the number of rows affected

resultSize int - The size in bytes of the returned results

status Status - The status of the SQL statement being described

updatedAt Utc - The date and time (UTC) that the statement metadata was last updated

aws.redshiftdata: StaticAuthConfig

Closed record

Represents static authentication configurations for the Redshift Data API.

Fields

accessKeyId string - The AWS access key ID, used to identify the user interacting with AWS

secretAccessKey string - The AWS secret access key, used to authenticate the user interacting with AWS

sessionToken? string - The AWS session token, used for authenticating a user with temporary permission to a resource

aws.redshiftdata: WorkGroup

Closed record

Represents the configuration details required for connecting to an Amazon Redshift serverless workgroup.

Fields

name string - The serverless workgroup name or Amazon Resource Name (ARN)

database string - The name of the database

secretArn? string - The name or ARN of the secret that enables access to the database

sessionKeepAliveSeconds? int - The number of seconds to keep the session alive after the query finishes

Errors

aws.redshiftdata: Error

Distinct

ErrorDetails

Represents a AWS Redshift Data distinct error.

String types

aws.redshiftdata: SessionId

string

SessionId

The session identifier of the query.

aws.redshiftdata: StatementId

string

StatementId

The identifier of the SQL statement

Import

import ballerinax/aws.redshiftdata;

Metadata

Released date: 4 months ago

Version: 1.1.0

License: Apache-2.0

Compatibility

Platform: java21

Ballerina version: 2201.11.0

GraalVM compatible: Yes

Pull count

Total: 5

Current verison: 5

Weekly downloads

Source repository

Keywords

Data Warehouse

Columnar Storage

Cost/Paid

vendor/aws

Contributors

Other versions

1.1.0

1.0.1 1.0.0

Dependencies

ballerina/time/2.6.0 ballerina/constraint/1.6.0 ballerina/sql/1.15.0

Cookie policy

Delete policy

clients

enums

records

errors

stringTypes

ballerinax/aws.redshiftdata Ballerina library

Overview

Setup guide

Login to AWS Console

Create a user

Get user access keys

Setup a Cluster

Step 1: Navigate to Amazon Redshift and create a cluster

Step 2: Configure cluster settings

Step 3: Wait for cluster availability

Quickstart

Step 1: Import the module

Step 2: Instantiate a new connector

Step 3: Invoke the connector operations

Step 4: Run the Ballerina application

Examples

Clients

aws.redshiftdata: Client

Constructor

execute

Parameters

Return Type

batchExecute

Parameters

Return Type

getResultAsStream

Parameters

Return Type

describe

Parameters

Return Type

close

Return Type

Enums

aws.redshiftdata: Region

Members

aws.redshiftdata: Status

Members

Records

aws.redshiftdata: Cluster

Fields

aws.redshiftdata: ConnectionConfig

Fields

aws.redshiftdata: DescriptionResponse

Fields

aws.redshiftdata: EC2IAMRoleConfig

Fields

aws.redshiftdata: ErrorDetails

Fields

aws.redshiftdata: ExecutionConfig

Fields

aws.redshiftdata: ExecutionResponse

Fields

aws.redshiftdata: StatementData

Fields

aws.redshiftdata: StaticAuthConfig

Fields

aws.redshiftdata: WorkGroup

Fields

Errors

aws.redshiftdata: Error

String types

aws.redshiftdata: SessionId

aws.redshiftdata: StatementId