[BUG] Empty DLQ Objects and DLQ objects with data even though data is loaded correctly #4304

Open
amitkirdatt opened this issue Mar 20, 2024 · 1 comment
Labels: bug (Something isn't working)

Comments

@amitkirdatt

Describe the bug

  • A pipeline with DynamoDB as the source and an OpenSearch Serverless sink creates empty DLQ objects: {"dlqObjects":[]}
  • Non-empty DLQ objects are also created even though the data is loaded into OpenSearch. They contain messages like this: "status":0,"message":"Number of retries reached the limit of max retries (configured value 10)"

To Reproduce
Steps to reproduce the behavior:

  1. Define a pipeline with a DynamoDB table as the source (ideally with at least 10M records)
  2. Define an OpenSearch Serverless sink
  3. Define an S3 bucket and prefix for the DLQ
  4. Run the pipeline
  5. The DLQ S3 bucket will contain several empty S3 objects that are 17.0 bytes in size ({"dlqObjects":[]})
  6. Some DLQ S3 objects contain data, but those items are loaded in OpenSearch
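
To triage what steps 5 and 6 leave in the bucket, a small script along these lines could separate the empty DLQ objects from the ones that carry entries. This is only a sketch: the `dlqObjects` key and the 17-byte empty payload come from this report, but the layout of each entry is not specified here, so the helper searches for `message` keys at any depth. The function names are hypothetical, and downloading the object bytes from S3 is left out.

```python
import json
from collections import Counter

# The empty payload reported in this issue; it is exactly 17 bytes long.
EMPTY_DLQ_PAYLOAD = b'{"dlqObjects":[]}'


def find_values(node, key):
    """Yield every value stored under `key` anywhere in a nested JSON structure."""
    if isinstance(node, dict):
        for k, v in node.items():
            if k == key:
                yield v
            else:
                yield from find_values(v, key)
    elif isinstance(node, list):
        for item in node:
            yield from find_values(item, key)


def summarize_dlq_payload(payload: bytes) -> dict:
    """Summarize one DLQ object body (hypothetical helper, not part of Data Prepper).

    Assumes only that the body is JSON with a top-level "dlqObjects" list.
    """
    doc = json.loads(payload)
    entries = doc.get("dlqObjects", [])
    # Count failure messages without assuming where they sit inside an entry.
    messages = Counter(
        str(m) for entry in entries for m in find_values(entry, "message")
    )
    return {
        "empty": len(entries) == 0,
        "entries": len(entries),
        "messages": dict(messages),
    }
```

Running `summarize_dlq_payload` over every object under the DLQ prefix gives a count of empty objects and a tally of failure messages, which can then be compared against what is actually present in the OpenSearch index.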

Expected behavior

  • No DLQ objects are created if the data has been loaded successfully.
  • If the data load is not successful and a DLQ S3 object is created, then dlqObjects should be populated with the relevant data.
  • If a document is ingested into OpenSearch, no DLQ object containing that document's ID should be created.

Additional context

  • max_retries is set to 10
  • The pipeline has a minimum of 1 OCU and a maximum of 20 OCUs
  • The DynamoDB table has ~100M records
  • OpenSearch Serverless sink
amitkirdatt added the bug and untriaged labels on Mar 20, 2024
amitkirdatt changed the title from "[BUG] Empty DLQ Objects and DLQ objects with data even though data appears to be loaded correctly" to "[BUG] Empty DLQ Objects and DLQ objects with data even though data is loaded correctly" on Mar 20, 2024
dlvenable self-assigned this on Apr 9, 2024
dlvenable added this to the v2.8 milestone on Apr 9, 2024
dlvenable modified the milestones: v2.8, v2.9 on May 16, 2024
@dlvenable
Member

@amitkirdatt, we are releasing Data Prepper 2.8.0 today with a fix that may resolve this. See #4301.
