You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+16-1Lines changed: 16 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,20 +15,25 @@ The connector has the following capabilities:
15
15
-`UPDATE`
16
16
-`DELETE` (single row deletes)
17
17
- High scalability - able to split work accross multiple Kafka Connect workers
18
-
- Faulttolerant - connector periodically saves its progress and can resume from previously saved offset (with at-least-once semantics)
18
+
- Fault-tolerant - connector periodically saves its progress and can resume from previously saved offset (with at-least-once semantics)
19
19
- Support for many standard Kafka Connect converters, such as JSON and Avro
20
20
- Compatible with standard Kafka Connect transformations
21
21
- Metadata about CDC events - each generated Kafka message contains information about source, such as timestamp and table name
22
22
- Seamless handling of schema changes and topology changes (adding, removing nodes from Scylla cluster)
23
23
- Preimage support ([optional](#advanced-configuration-parameters)) - messages generated for row-level changes can have their [`before`](#data-change-event-value) field filled with information from corresponding preimage row.
24
+
- Post image only: You can configure the connector to produce only `POST_IMAGE` cdc events as `CREATE` events.
24
25
25
26
The connector has the following limitations:
26
27
- Only Kafka 2.6.0+ is supported
27
28
- Only row-level operations are produced (`INSERT`, `UPDATE`, `DELETE`):
28
29
- Partition deletes - those changes are ignored
29
30
- Row range deletes - those changes are ignored
30
31
- No support for collection types (`LIST`, `SET`, `MAP`) and `UDT` - columns with those types are omitted from generated messages
32
+
<<<<<<< HEAD
31
33
- No support for postimage, preimage needs to be enabled - By default changes only contain those columns that were modified, not the entire row before/after change. More information [here](#cell-representation)
34
+
=======
35
+
- No support for preimage - changes only contain those columns that were modified, not the entire row before/after change. More information [here](#cell-representation)
36
+
>>>>>>> 4c29e07 (Adding option to push only post image events to kafka)
32
37
33
38
## Connector installation
34
39
@@ -684,6 +689,7 @@ The connector will generate the following data change event's value (with JSON s
684
689
685
690
In addition to the configuration parameters described in the ["Configuration"](#configuration) section, Scylla CDC Source Connector exposes the following (non-required) configuration parameters:
|`scylla.query.time.window.size`| The size of windows queried by the connector. Changes are queried using `SELECT` statements with time restriction with width defined by this parameter. Value expressed in milliseconds. |
@@ -692,6 +698,15 @@ In addition to the configuration parameters described in the ["Configuration"](#
692
698
|`scylla.local.dc`| The name of Scylla local datacenter. This local datacenter name will be used to setup the connection to Scylla to prioritize sending requests to the nodes in the local datacenter. If not set, no particular datacenter will be prioritized. |
693
699
|`experimental.preimages.enabled`| False by default. If enabled connector will use `PRE_IMAGE` CDC entries to populate 'before' field of the debezium Envelope of the next kafka message. This may change some expected behaviours (e.g. ROW_DELETE will use preimage instead of its own information). Relies on correct ordering of rows within same stream in CDC tables. |
694
700
701
+
=======
702
+
| Property | Description |
703
+
| --- | --- |
704
+
|`scylla.query.time.window.size`| The size of windows queried by the connector. Changes are queried using `SELECT` statements with time restriction with width defined by this parameter. Value expressed in milliseconds. |
705
+
|`scylla.confidence.window.size`| The size of the confidence window. It is necessary for the connector to avoid reading too fresh data from the CDC log due to the eventual consistency of Scylla. The problem could appear when a newer write reaches a replica before some older write. For a short period of time, when reading, it is possible for the replica to return only the newer write. The connector mitigates this problem by not reading a window of most recent changes (controlled by this parameter). Value expressed in milliseconds.|
706
+
|`scylla.consistency.level`| The consistency level of CDC table read queries. This consistency level is used only for read queries to the CDC log table. By default, `QUORUM` level is used. |
707
+
|`scylla.local.dc`| The name of Scylla local datacenter. This local datacenter name will be used to setup the connection to Scylla to prioritize sending requests to the nodes in the local datacenter. If not set, no particular datacenter will be prioritized. |
708
+
|`post.image.only`| Push only the post image events from scylla cdc to kafka. The events are pushed as `CREATE` events. |
709
+
>>>>>>> 4c29e07 (Adding option to push only post image events to kafka)
.withDescription("Whether the connector should publish only post image events. The cdc settings must have `'postimage': 'true'`. To get full image with all fields, set `'preimage': 'full'`.");
0 commit comments