Druid Ansible

This project can setup a druid cluster on AWS

AWS preparations

Instances:

Instances are automatically created

Storage:

Create a bucket (druidbucket) with the following 3 folders in it:

logs
data
deepstorage

User:

Create a IAM user (druiduser) for accessing the data in S3. Be sure to capture the credentials

Add the following policy to the user

{
    "Statement": [
        {
            "Effect": "Allow",
            "Action": "s3:ListAllMyBuckets",
            "Resource": "arn:aws:s3:::*"
        },
        {
            "Effect": "Allow",
            "Action": "s3:*",
            "Resource": [
                "arn:aws:s3:::druidbucket",
                "arn:aws:s3:::druidbucket/*"
            ]
        }
    ]
}

AWS ansible playbook variable configuration in vars/cluster.yml supply the needed ami, vpc, subnet and key filename variables:

ami: <ami-id>
main_vpc_id: <vpc-id>
subnet_id_a: <subnet-id-region-a>
subnet_id_b: <subnet-id-region-b>
subnet_id_c: <subnet-id-region-c>
key_name: <druid-key-pem-file>

in roles/security-groups/default/main.yml change the external ip range to which the security group will be opened up:

ip_range: "192.168.1.0/32"

in vars/druid.yml supply the needed s3 variables:

deep_storage_type: s3
druid_s3_accessKey: ACCESS_KEY_ID
druid_s3_secretKey: SECRET_KEY
druid_s3_baseKey: deepstorage
druid_s3_bucketname: druidbucket
deep_storage_location: {{ druid_s3_bucketname }}/data
deep_storage_log_location: {{ druid_s3_bucketname }}/logs

AWS deployment

ansible-playbook create-druid-cluster.yml ansible-playbook --user centos --private-key ./druid.pem ping.yml ansible-playbook --user centos --private-key ./druid.pem playbook.yml

./data/index-data.sh to populate druid with the wikiticker data.

For upgrading druid to version 0.xx.x one can use the following scripts.

To see if druid still works during the rolling upgrade you can use the ./data/query-druid.sh script. This reports an error if the HTTP status is not 200 or if the response is empty. In the non-HA setup of druid this will only be the case for a short period when te broker is being restarted.

ansible-playbook --user centos --private-key ./druid.pem upgrade.yml

Testing

To insert data into druid one can use the scripts in the data directory.

First step is to create the ./data/index-data.json file based on the ./data/index-data.json.tmpl file.
Second step is to run the ./data/index-data.sh script.

The ./data/query-druid.sh script will fire queries to druid and can be used for testing the connection.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
hosts		hosts
resources		resources
roles		roles
upgrade/tasks		upgrade/tasks
vars		vars
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
ansible.cfg		ansible.cfg
create-druid-cluster.yml		create-druid-cluster.yml
ping.yml		ping.yml
playbook.yml		playbook.yml
upgrade.yml		upgrade.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Druid Ansible

AWS preparations

AWS deployment

Testing

About

Releases 1

Packages

Contributors 2

Languages

godatadriven/druid-ansible

Folders and files

Latest commit

History

Repository files navigation

Druid Ansible

AWS preparations

AWS deployment

Testing

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages