abort the whole table transaction if any updates in the transaction has failed #1246

stevie9868 · 2024-10-23T09:14:41Z

We have encountered a data loss issue when using PyIceberg to perform an overwrite operation. Typically, an overwrite operation involves creating both a delete snapshot and an append snapshot. However, if an exception occurs during the creation of the append snapshot, the current code still attempts to commit the delete snapshot, leading to potential data loss. Note that this is not limited to overwrite; it can potentially affect other operations as well.

To address this issue, we need to ensure that the entire transaction is aborted if any part of the update process fails.

I have also provided a simple test case. Before this change, the transaction contains only a delete snapshot update, which deletes the data. After this fix, the data present before the partially failed transaction is kept, since the whole transaction is now aborted.

stevie9868 · 2024-10-25T17:06:27Z

@HonahX

Thanks for unblocking the testing actions!
But it looks like the curl command in Python CI/lint-and-test 3.10 times out.

kevinjqliu · 2024-10-25T22:18:04Z

Thanks for the PR @stevie9868. This sounds like an important bug to address.

Do you know if this bug only applies to the overwrite function or all functions in Transactions?

PS I reran the CI

kevinjqliu · 2024-10-25T22:16:06Z

pyiceberg/table/__init__.py

+    ) -> None:
+        """Close and commit the transaction, or handle exceptions."""
+        # Only commit the full transaction, if there is no exception in all updates on the chain
+        if exctb is None:


What is the difference between exctype, excinst, and exctb here? Why do we use exctb?

I'm a bit confused about this part. Typically __exit__ is called as the last step of the with statement.
Here, __exit__ calls self.commit_transaction(), which will process the transaction.

It seems like the issue here is to catch partial exceptions from self.commit_transaction(), which won't be caught here.

kevinjqliu · 2024-10-25T23:02:53Z

We have encountered a data loss issue when using pyIceberg to perform an overwrite operation. Typically, an overwrite operation involves creating both a delete snapshot and an append snapshot. However, if an exception occurs during the creation of the append snapshot, the current code still attempts to commit the delete snapshot, leading to potential data loss.

I'm a bit confused about the chain of events. Here's what I found digging through the code:

table.overwrite creates a transaction and calls its overwrite function

iceberg-python/pyiceberg/table/__init__.py

Lines 1044 to 1045 in de976fe

    with self.transaction() as tx:
        tx.overwrite(df=df, overwrite_filter=overwrite_filter, snapshot_properties=snapshot_properties)

In the transaction's overwrite function, it calls both self.delete and self.update_snapshot(snapshot_properties=snapshot_properties).fast_append()

iceberg-python/pyiceberg/table/__init__.py

Lines 507 to 516 in de976fe

    self.delete(delete_filter=overwrite_filter, snapshot_properties=snapshot_properties)
    with self.update_snapshot(snapshot_properties=snapshot_properties).fast_append() as update_snapshot:
        # skip writing data files if the dataframe is empty
        if df.shape[0] > 0:
            data_files = _dataframe_to_data_files(
                table_metadata=self.table_metadata, write_uuid=update_snapshot.commit_uuid, df=df, io=self._table.io
            )
            for data_file in data_files:
                update_snapshot.append_data_file(data_file)

self.delete ultimately creates an UpdateSnapshot (_OverwriteFiles)

iceberg-python/pyiceberg/table/__init__.py

Lines 594 to 600 in de976fe

    with self.update_snapshot(snapshot_properties=snapshot_properties).overwrite(
        commit_uuid=commit_uuid
    ) as overwrite_snapshot:
        for original_data_file, replaced_data_files in replaced_files:
            overwrite_snapshot.delete_data_file(original_data_file)
            for replaced_data_file in replaced_data_files:
                overwrite_snapshot.append_data_file(replaced_data_file)

and self.update_snapshot(snapshot_properties=snapshot_properties).fast_append() also creates an UpdateSnapshot (_FastAppendFiles).

iceberg-python/pyiceberg/table/__init__.py

Lines 594 to 600 in de976fe

    with self.update_snapshot(snapshot_properties=snapshot_properties).overwrite(
        commit_uuid=commit_uuid
    ) as overwrite_snapshot:
        for original_data_file, replaced_data_files in replaced_files:
            overwrite_snapshot.delete_data_file(original_data_file)
            for replaced_data_file in replaced_data_files:
                overwrite_snapshot.append_data_file(replaced_data_file)

Both _OverwriteFiles and _FastAppendFiles subclass _SnapshotProducer, which, combined with UpdateTableMetadata, updates the transaction.

iceberg-python/pyiceberg/table/update/__init__.py

Lines 62 to 70 in de976fe

    @abstractmethod
    def _commit(self) -> UpdatesAndRequirements: ...

    def commit(self) -> None:
        self._transaction._apply(*self._commit())

    def __exit__(self, _: Any, value: Any, traceback: Any) -> None:
        """Close and commit the change."""
        self.commit()

iceberg-python/pyiceberg/table/update/snapshot.py

Lines 241 to 279 in de976fe

    def _commit(self) -> UpdatesAndRequirements:
        new_manifests = self._manifests()
        next_sequence_number = self._transaction.table_metadata.next_sequence_number()
        summary = self._summary(self.snapshot_properties)
        manifest_list_file_path = _generate_manifest_list_path(
            location=self._transaction.table_metadata.location,
            snapshot_id=self._snapshot_id,
            attempt=0,
            commit_uuid=self.commit_uuid,
        )
        with write_manifest_list(
            format_version=self._transaction.table_metadata.format_version,
            output_file=self._io.new_output(manifest_list_file_path),
            snapshot_id=self._snapshot_id,
            parent_snapshot_id=self._parent_snapshot_id,
            sequence_number=next_sequence_number,
        ) as writer:
            writer.add_manifests(new_manifests)
        snapshot = Snapshot(
            snapshot_id=self._snapshot_id,
            parent_snapshot_id=self._parent_snapshot_id,
            manifest_list=manifest_list_file_path,
            sequence_number=next_sequence_number,
            summary=summary,
            schema_id=self._transaction.table_metadata.current_schema_id,
        )
        return (
            (
                AddSnapshotUpdate(snapshot=snapshot),
                SetSnapshotRefUpdate(
                    snapshot_id=self._snapshot_id, parent_snapshot_id=self._parent_snapshot_id, ref_name="main", type="branch"
                ),
            ),
            (AssertRefSnapshotId(snapshot_id=self._transaction.table_metadata.current_snapshot_id, ref="main"),),
        )

At this point, nothing has been committed yet. All updates are queued up in the transaction.
commit_transaction is used to apply the changes in the transaction.
For the above scenario, all updates are applied as one transaction. This transaction is either accepted or rejected as a whole, so there cannot be a scenario where the deletes are applied while the append is not.

kevinjqliu · 2024-10-25T23:03:05Z

Please let me know if the above makes sense

kevinjqliu · 2024-10-25T23:07:43Z

Ah, do you have _autocommit set to True?
Since both delete and fast_append ultimately call transaction's _apply to queue up the updates, having _autocommit set to True will trigger commit each time.

iceberg-python/pyiceberg/table/__init__.py

Lines 260 to 261 in de976fe

    if self._autocommit:
        self.commit_transaction()

This seems like a potential footgun. Perhaps we should get rid of _autocommit; it's not used anywhere: https://github.com/search?q=repo%3Aapache%2Ficeberg-python%20_autocommit&type=code
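
To illustrate the concern, here is a minimal sketch of how an autocommit flag changes commit granularity. ToyTransaction, its attributes, and the string "updates" are hypothetical stand-ins, not pyiceberg's Transaction; only the `if self._autocommit: self.commit_transaction()` check mirrors the excerpt above.

```python
from typing import List


class ToyTransaction:
    """Hypothetical stand-in for a transaction; not pyiceberg's Transaction."""

    def __init__(self, autocommit: bool = False) -> None:
        self._autocommit = autocommit
        self.pending: List[str] = []        # updates queued but not yet committed
        self.commits: List[List[str]] = []  # each entry models one commit to the catalog

    def _apply(self, update: str) -> None:
        # Queue the update; with autocommit on, every update is committed on its own.
        self.pending.append(update)
        if self._autocommit:
            self.commit_transaction()

    def commit_transaction(self) -> None:
        if self.pending:
            self.commits.append(self.pending)
            self.pending = []


# Without autocommit, both updates land in a single commit.
batched = ToyTransaction(autocommit=False)
batched._apply("delete")
batched._apply("append")
batched.commit_transaction()

# With autocommit, the delete is committed before the append is even queued.
eager = ToyTransaction(autocommit=True)
eager._apply("delete")
eager._apply("append")
```

In this toy model, the batched transaction produces one commit holding both updates, while the eager one produces two separate commits, so a failure between them would leave only the delete applied.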

stevie9868 · 2024-10-26T01:41:45Z

@kevinjqliu
Thanks for the detailed walkthrough!

I believe that if self.update_snapshot(snapshot_properties=snapshot_properties).fast_append throws an exception, it will still trigger the transaction's __exit__, and commit_transaction will then contain only one update: the delete, since the append was never added to the updates list.

For example, if fast_append() fails at any point during the commit (in our case, we see an AWS S3 exception), the exception will propagate back to the transaction's __exit__.
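
The failure mode described above can be sketched with a toy context manager whose __exit__ always commits. AlwaysCommitTransaction and its delete/append methods are hypothetical and only model the control flow; they are not pyiceberg code, and the OSError stands in for the storage exception.

```python
from typing import Any, List


class AlwaysCommitTransaction:
    """Hypothetical toy; models a context manager whose __exit__ always commits."""

    def __init__(self) -> None:
        self.pending: List[str] = []    # updates queued during the with-body
        self.committed: List[str] = []  # updates that were actually committed

    def __enter__(self) -> "AlwaysCommitTransaction":
        return self

    def __exit__(self, exc_type: Any, exc_value: Any, traceback: Any) -> None:
        # Commits whatever is queued, even when the with-body raised.
        self.committed.extend(self.pending)
        self.pending = []

    def delete(self) -> None:
        self.pending.append("delete")

    def append(self) -> None:
        # Stands in for a storage failure while producing the append snapshot.
        raise OSError("simulated storage failure")


txn = AlwaysCommitTransaction()
try:
    with txn:
        txn.delete()
        txn.append()  # raises before "append" is ever queued
except OSError:
    pass
```

After this runs, txn.committed holds only the delete, which is the partial commit reported in this PR.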

Let me know if I missed anything, thanks. I would also prefer getting rid of _autocommit in the Transaction class.

stevie9868 · 2024-10-26T02:09:34Z

Thanks for the PR @stevie9868. This sounds like an important bug to address.

Do you know if this bug only applies to the overwrite function or all functions in Transactions?

PS I reran the CI

Thank you. I think this would potentially apply to all functions that try to combine more than one update into one transaction.

stevie9868 · 2024-10-26T02:17:38Z

Ah, do you have _autocommit set to True? Since both delete and fast_append ultimately call transaction's _apply to queue up the updates, having _autocommit set to True will trigger commit each time.

iceberg-python/pyiceberg/table/__init__.py

Lines 260 to 261 in de976fe

    if self._autocommit:
        self.commit_transaction()

This seems like a potential footgun. Perhaps we should get rid of _autocommit, its not used anywhere https://github.com/search?q=repo%3Aapache%2Ficeberg-python%20_autocommit&type=code

Ah, I don't think we set _autocommit to True.

HonahX · 2024-10-27T00:03:37Z

@stevie9868 @kevinjqliu Thanks for the great PR and discussions! I agree that there is some issue with the current Transaction mechanism: the commit_transaction can be incorrectly called when we should just abandon everything

The following pattern has been the most common practice of updating tables in pyiceberg since the beginning

with tbl.transaction() as txn:
    txn.overwrite(...)
    ...

The "with" statement ensures that the context manager's __exit__() (here the Transaction object's, which calls commit_transaction) is always called, even if there is an exception, as long as the transaction object is successfully initialized. However, we should only call commit_transaction when there is no exception along the way.

A simpler example would be:

import pyarrow as pa
import pytest

# `catalog` and `identifier` are assumed to come from the surrounding test fixtures.
pa_table_with_column = pa.Table.from_pydict(
    {
        "foo": ["a", None, "z"],
        "bar": [19, None, 25],
    },
    schema=pa.schema([
        pa.field("foo", pa.large_string(), nullable=True),
        pa.field("bar", pa.int32(), nullable=True),
    ]),
)

tbl = catalog.create_table(identifier=identifier, schema=pa_table_with_column.schema)

with pytest.raises(ValueError):
    with tbl.transaction() as txn:
        txn.append(pa_table_with_column)
        raise ValueError
        txn.append(pa_table_with_column)  # never reached

# The transaction raised, so nothing should have been written.
assert len(tbl.scan().to_pandas()) == 0

Since I explicitly raise an error during the transaction, the whole transaction should be abandoned. But this code block still inserts 3 rows (the first append) into the table.
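
A minimal sketch of the intended behavior, assuming a toy context manager rather than pyiceberg's Transaction (GuardedTransaction, its append method, and the list-based state are hypothetical): __exit__ commits only when it receives no exception. The diff in this PR checks the traceback argument (exctb); this sketch checks the exception type, which is likewise None on a clean exit.

```python
from types import TracebackType
from typing import List, Optional, Type


class GuardedTransaction:
    """Hypothetical toy transaction; not pyiceberg's implementation."""

    def __init__(self) -> None:
        self.pending: List[str] = []    # queued updates
        self.committed: List[str] = []  # updates that reached the "table"

    def __enter__(self) -> "GuardedTransaction":
        return self

    def __exit__(
        self,
        exc_type: Optional[Type[BaseException]],
        exc_value: Optional[BaseException],
        traceback: Optional[TracebackType],
    ) -> None:
        # Commit only if the with-body finished without raising.
        if exc_type is None:
            self.commit_transaction()

    def append(self, rows: str) -> None:
        self.pending.append(rows)

    def commit_transaction(self) -> None:
        self.committed.extend(self.pending)
        self.pending = []


# The body raises after the first append, so nothing is committed.
aborted = GuardedTransaction()
try:
    with aborted as txn:
        txn.append("first batch")
        raise ValueError("simulated failure")
except ValueError:
    pass

# A clean exit commits every queued update together.
succeeded = GuardedTransaction()
with succeeded as txn:
    txn.append("first batch")
    txn.append("second batch")
```

Returning None from __exit__ lets the original exception propagate to the caller, so the failure is still visible while the queued updates are dropped.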

Please let me know if these make sense. Would love to hear your thoughts on this!

stevie9868 · 2024-10-27T00:22:36Z

@HonahX

Thanks for providing a detailed example, and I agree that we should only call commit_transaction when there is no exception along the way.

HonahX · 2024-10-27T00:29:09Z

This seems like a potential footgun. Perhaps we should get rid of _autocommit, its not used anywhere https://github.com/search?q=repo%3Aapache%2Ficeberg-python%20_autocommit&type=code

The _autocommit flag/autocommit parameter in Transaction is used in some of Table's APIs:

iceberg-python/pyiceberg/table/__init__.py

Lines 991 to 1006 in de976fe

    def update_schema(self, allow_incompatible_changes: bool = False, case_sensitive: bool = True) -> UpdateSchema:
        """Create a new UpdateSchema to alter the columns of this table.

        Args:
            allow_incompatible_changes: If changes are allowed that might break downstream consumers.
            case_sensitive: If field names are case-sensitive.

        Returns:
            A new UpdateSchema.
        """
        return UpdateSchema(
            transaction=Transaction(self, autocommit=True),
            allow_incompatible_changes=allow_incompatible_changes,
            case_sensitive=case_sensitive,
            name_mapping=self.name_mapping(),
        )

iceberg-python/pyiceberg/table/__init__.py

Lines 1077 to 1078 in de976fe

    def update_spec(self, case_sensitive: bool = True) -> UpdateSpec:
        return UpdateSpec(Transaction(self, autocommit=True), case_sensitive=case_sensitive)

iceberg-python/pyiceberg/table/__init__.py

Lines 976 to 989 in de976fe

    def manage_snapshots(self) -> ManageSnapshots:
        """
        Shorthand to run snapshot management operations like create branch, create tag, etc.
        Use table.manage_snapshots().<operation>().commit() to run a specific operation.
        Use table.manage_snapshots().<operation-one>().<operation-two>().commit() to run multiple operations.
        Pending changes are applied on commit.
        We can also use context managers to make more changes. For example,
        with table.manage_snapshots() as ms:
           ms.create_tag(snapshot_id1, "Tag_A").create_tag(snapshot_id2, "Tag_B")
        """
        return ManageSnapshots(transaction=Transaction(self, autocommit=True))

The idea is to make the code simpler if we only want to evolve schema/spec/...
i.e.

with table.update_schema() as update:
    update.add_column("some_field", IntegerType(), "doc")

instead of another with..transaction wrapper

with table.transaction() as transaction:
    with transaction.update_schema() as update_schema:
        update_schema.add_column("some_other_field", IntegerType(), "doc")

Since the recommended way to start a transaction is txn = tbl.transaction(), this option is in general not exposed to users directly: #471 (comment)

However, there may still be some concerns around this since Transaction is a public class. If this is the case, I think we can start from making the parameter "private" (autocommit -> _autocommit) and/or adding some doc to explain the usage.

Please let me know what you think!

pyiceberg/table/__init__.py

tests/catalog/test_base.py

stevie9868 · 2024-10-27T17:45:44Z

However, there may still be some concerns around this since Transaction is a public class. If this is the case, I think we can start from making the parameter "private" (autocommit -> _autocommit) and/or adding some doc to explain the usage.

Ah, I think the parameter autocommit is private in the Transaction class.

Having a doc is a good first step, and I believe autocommit=True is currently applied to ManageSnapshots, UpdateSpec, and UpdateSchema.

Also, correct me if I am wrong, but I believe the current Java Iceberg library doesn't have the auto_commit option in its Transaction class?

stevie9868 · 2024-10-27T17:51:05Z

I have also updated the PR based on existing comments, and thanks everyone for the inputs!

kevinjqliu · 2024-10-27T21:25:40Z

pyiceberg/table/__init__.py

+        """Close and commit the transaction, or handle exceptions."""
+        # Only commit the full transaction, if there is no exception in all updates on the chain


Suggested change

-        """Close and commit the transaction, or handle exceptions."""
-        # Only commit the full transaction, if there is no exception in all updates on the chain
+        """Close and commit the transaction if no exceptions have been raised."""

Exit the runtime context related to this object. The parameters describe the exception that caused the context to be exited. If the context was exited without an exception, all three arguments will be None.

From https://docs.python.org/3/reference/datamodel.html#object.__exit__
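
As a quick illustration of the quoted documentation, the sketch below records the three arguments __exit__ receives on a clean exit and after an exception. Recorder is a hypothetical helper written only for this example.

```python
from typing import Any, List, Tuple


class Recorder:
    """Hypothetical helper that records the arguments passed to __exit__."""

    def __init__(self) -> None:
        self.seen: List[Tuple[Any, Any, Any]] = []

    def __enter__(self) -> "Recorder":
        return self

    def __exit__(self, exc_type: Any, exc_value: Any, traceback: Any) -> bool:
        self.seen.append((exc_type, exc_value, traceback))
        return False  # do not suppress the exception


# Clean exit: all three arguments are None.
clean = Recorder()
with clean:
    pass

# Exit caused by an exception: type, instance, and traceback are all set.
failed = Recorder()
try:
    with failed:
        raise ValueError("boom")
except ValueError:
    pass
```

Because the three arguments are either all None or all set, checking any one of them (the type, the instance, or the traceback) is enough to tell whether the with-body raised.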

kevinjqliu · 2024-10-27T21:29:29Z

tests/catalog/test_base.py

@@ -766,3 +766,26 @@ def test_table_properties_raise_for_none_value(catalog: InMemoryCatalog) -> None
    with pytest.raises(ValidationError) as exc_info:
        _ = given_catalog_has_a_table(catalog, properties=property_with_none)
    assert "None type is not a supported value in properties: property_name" in str(exc_info.value)
+
+
+def test_abort_table_transaction_on_exception(catalog: InMemoryCatalog) -> None:


nit: can this test be moved to tests/table/test_init.py instead? it doesn't really belong in the "catalog"

kevinjqliu · 2024-10-27T21:30:54Z

tests/catalog/test_base.py

+    # Populate some initial data
+    data = pa.Table.from_pylist(
+        [{"x": 1, "y": 2, "z": 3}, {"x": 4, "y": 5, "z": 6}],
+        schema=TEST_TABLE_SCHEMA.as_arrow(),
+    )


nit: can just use the test fixture arrow_table_with_null: pa.Table like so

iceberg-python/tests/integration/test_inspect_table.py

Line 77 in de976fe

spark: SparkSession, session_catalog: Catalog, arrow_table_with_null: pa.Table, format_version: int

kevinjqliu · 2024-10-27T21:32:43Z

tests/catalog/test_base.py

+    with pytest.raises(ValueError):
+        with tbl.transaction() as txn:
+            txn.overwrite(data)
+            raise ValueError
+


Suggested change

-    with pytest.raises(ValueError):
-        with tbl.transaction() as txn:
-            txn.overwrite(data)
-            raise ValueError
+    with pytest.raises(ValueError):
+        with tbl.transaction() as txn:
+            txn.overwrite(data)
+            raise ValueError
+            txn.overwrite(data)

maybe another call after the exception

oops yea, honah already mentioned this

kevinjqliu · 2024-10-27T21:39:12Z

Thanks @HonahX @stevie9868! Glad we were able to get to the bottom of this important correctness issue.

I started #1253 to continue the conversation on autocommit

stevie9868 · 2024-10-28T22:50:19Z

I have decided to move the test to tests/integration/test_writes/test_writes.py instead of tests/table/test_init.py, given that:

1. Many of the existing test fixtures have an S3 prefix in their file locations, which necessitates a Docker setup. While we could create our own fixtures, this would require additional work, and I am not sure it is worth doing if we can also test this in the integration tests.
2. Most transaction-related tests live in the integration tests, which seems more logical and appropriate.

kevinjqliu

LGTM! Thanks for addressing the previous comments.
I left a few nit comments on making the test more readable

tests/integration/test_writes/test_writes.py

Co-authored-by: Kevin Liu <[email protected]>

kevinjqliu

There's a weird linting error, can you try running make lint?

tests/integration/test_writes/test_writes.py

kevinjqliu

LGTM!

HonahX

LGTM!

kevinjqliu · 2024-10-29T17:03:48Z

Thank you @stevie9868 for discovering and fixing this important correctness bug!

stevie9868 · 2024-10-29T17:14:13Z

Thanks @kevinjqliu @HonahX for the guidance and the quick review, really appreciate it!

…as failed (apache#1246)

* abort the whole transaction if any update on the chain has failed
* Update tests/integration/test_writes/test_writes.py
  Co-authored-by: Kevin Liu <[email protected]>
* Update tests/integration/test_writes/test_writes.py
  Co-authored-by: Kevin Liu <[email protected]>
* add type:ignore to prevent lint error

---------

Co-authored-by: Yingjian Wu <[email protected]>
Co-authored-by: Kevin Liu <[email protected]>

stevie9868 changed the title ~~abort the whole transaction if any update on the chain has failed~~ abort the whole table transaction if any updates in the transaction has failed Oct 23, 2024

kevinjqliu reviewed Oct 25, 2024

HonahX reviewed Oct 27, 2024

stevie9868 force-pushed the yingjianw/abortWholeTransactionWhenThereIsUpdateFailure branch from 34ca959 to f7a7a87 Compare October 27, 2024 17:49

kevinjqliu reviewed Oct 27, 2024

kevinjqliu mentioned this pull request Oct 27, 2024

[discuss] Transaction API's autocommit #1253

Open

abort the whole transaction if any update on the chain has failed

945d61d

stevie9868 force-pushed the yingjianw/abortWholeTransactionWhenThereIsUpdateFailure branch from 4f6faa9 to 945d61d Compare October 28, 2024 22:42

kevinjqliu reviewed Oct 28, 2024

kevinjqliu requested review from Fokko, HonahX and sungwy October 28, 2024 23:15

stevie9868 and others added 2 commits October 28, 2024 16:38

Update tests/integration/test_writes/test_writes.py

ab8a0a0

Co-authored-by: Kevin Liu <[email protected]>

Update tests/integration/test_writes/test_writes.py

af5d165

Co-authored-by: Kevin Liu <[email protected]>

kevinjqliu reviewed Oct 29, 2024

add type:ignore to prevent lint error

fbb7604

kevinjqliu approved these changes Oct 29, 2024

HonahX approved these changes Oct 29, 2024

kevinjqliu merged commit fba79ba into apache:main Oct 29, 2024
7 checks passed

kevinjqliu mentioned this pull request Jan 8, 2025

UpdateSchema does not respect transaction abort #1497

Draft