Do I need to configure the data pipeline daily for the AWS Dynamo database?

Question

Do I need to configure the data pipeline daily for the AWS Dynamo database?

I am considering using AWS DynamoDB for the application we are creating. I understand that setting up a backup job that exports data from DynamoDB to S3 includes a data pipeline with EMR. But my question is: do I need to worry about the backup task being configured on day 1? What are the chances of losing data?

+3

amazon-web-services amazon-dynamodb amazon-data-pipeline

Dahoopster Feb 07 '14 at 2:01

source share

3 answers

Chen Harel · Answer 1 · 2014-02-07T15:38:07+0000

This is really subjective. IMO, you should not worry about them "now." You can also use simpler solutions besides pipleline . Perhaps this will be a good start.

DynamoDB , . . , , SDK .

SudheerT · Answer 2 · 2014-02-07T18:07:30+0000

DynamoDB :

(1) S3 , , , ( ?)

(2) S3, -. , S3, , , RDBMS (RDS ) S3 . EMR Redshift (ETL) BI. Redshift, ELT- - Redshift

(3) ( ) ( , ) - . - , , . , , DynamoDB, .

(4) S3. , - DynamoDB - concurrency .

AWS Data Pipeline ( EMR ).

, , , , .

Sony Kadavan · Answer 3 · 2014-02-10T14:14:18+0000

S3. .

Dynamo DB , ( ). - .

You can say that Pipeline only consumes, say, 25% of the capacity when backing up so that your real users do not notice a delay. Each backup is "full" (not incremental), so at some periodic time interval you can delete several old backups if you are concerned about storage.

Do I need to configure the data pipeline daily for the AWS Dynamo database?

More articles: