track uncompressed bytes for mongodb connector #3715
Conversation
Question: have we explored providing the record size from the pull side along with the record? Basically the only Avro schemas Mongo data can have are the raw table schema and the (_id String, doc String) one, which should be easy to size, so the generic solution that supports geometry types and all types of integers wouldn't be needed. But that doesn't account for flattened mode, which could end up being easier to support the generic way (or not).
@ilidemi great question/observation. Agree it would have been a simpler implementation for Mongo's case, but as you highlighted, it does not extend to flatten mode as easily. (I did consider adding a …) Benefits of the current approach: (1) it can be extended to other data sources easily, and (2) it leads to better reliability around OOM issues in general; we haven't seen as many OOM issues with other connectors, but that could also be because of our conservative default value. With tracking uncompressed bytes as the default approach for connectors, we would be able to set a more reasonable default given the available memory on an instance.
```go
arrTerminationByteSize := int64(1)
size := int64(0)
switch v := value.(type) {
```
Seems a little bad that the switch branches are duplicated when the size is tied to the value. How about:
```go
func QValueToAvro(..., calcSize bool) (any, int64, error) {
	switch v := value.(type) {
	case types.QValueFloat64:
		return c.processNullableUnion(float64(v.Val)), constSize(8, calcSize), nil
	case types.QValueString:
		return c.processNullableUnion(v.Value()), stringSize(v.Value(), calcSize), nil
	...
}

func constSize(n int64, calcSize bool) int64 {
	if !calcSize {
		return 0
	}
	return n
}

func stringSize(s string, calcSize bool) int64 {
	if !calcSize {
		return 0
	}
	return calcVarIntSize(int64(len(s))) + int64(len(s))
}
```
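For context, a minimal sketch of what a helper like calcVarIntSize (referenced above) could look like, assuming Avro's zigzag varint encoding of the string-length prefix; this is illustrative and may not match the PR's actual helper:

```go
// Illustrative only: Avro writes a string as a zigzag-varint length prefix
// followed by the raw bytes, so the prefix size depends on the length value.
func calcVarIntSize(n int64) int64 {
	u := uint64((n << 1) ^ (n >> 63)) // zigzag-encode the value
	size := int64(1)
	for u >= 0x80 { // 7 payload bits per encoded byte
		u >>= 7
		size++
	}
	return size
}
```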
```go
			{Name: "doc", Type: types.QValueKindJSON, Nullable: false},
		},
	}
	tmpFile := fmt.Sprintf("/tmp/test_avro_size_%s_%d.avro", tc.name, time.Now().Unix())
```
TestWriteRecordsToAvroFileHappyPath has a pattern for temp files
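For illustration only (the actual pattern in TestWriteRecordsToAvroFileHappyPath may differ), a common Go approach is t.TempDir(), which the test framework cleans up automatically:

```go
// Illustrative assumption: place the file under t.TempDir() instead of /tmp;
// the directory is removed when the test finishes (uses path/filepath and fmt).
tmpFile := filepath.Join(t.TempDir(), fmt.Sprintf("test_avro_size_%s.avro", tc.name))
```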
```go
t.Run(tc.name, func(t *testing.T) {
	schema := types.QRecordSchema{
		Fields: []types.QField{
			{Name: "_id", Type: types.QValueKindString, Nullable: false},
```
Is the CDC path / raw table schema covered too? 👉 👈 (ok if not)
MongoDB Avro files have a very high compression ratio, which can result in OOM issues during file upload to ClickHouse during initial snapshot. The proposed solution computes the uncompressed bytes of QRecords based on their Avro encoding. It reuses the PEERDB_S3_BYTES_PER_AVRO_FILE variable to determine the limit, and then, based on the connector source type, decides whether to track compressed bytes (which the WatchWriter conveniently computes, and how we've tracked this historically) or uncompressed bytes (for the MongoDB connector, where we manually compute bytes as we process QValues); a rough sketch of this decision follows the test plan below. If we want a different default, we could also introduce a new env variable instead, e.g. PEERDB_S3_UNCOMPRESSED_BYTES_PER_AVRO_FILE. Keeping this to initial snapshot for now since that's where the OOM issue is happening; if we extend it to CDC in the future, the computation would also differ due to the schema of the staging table.

- Add unit tests
- Run manual smoke test
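As mentioned above, a rough sketch of how the limit check could branch on compressed vs. uncompressed bytes; all names here are hypothetical and not the PR's actual code:

```go
// Hypothetical sketch: flush the current Avro file once the tracked byte
// count crosses the configured limit, using uncompressed bytes for the
// MongoDB connector and compressed (written) bytes for other sources.
type avroSizeTracker struct {
	trackUncompressed bool  // true on the MongoDB connector path
	compressedBytes   int64 // bytes actually written to the compressed file
	uncompressedBytes int64 // running sum of per-QValue Avro-encoded sizes
	limitBytes        int64 // from PEERDB_S3_BYTES_PER_AVRO_FILE
}

func (t *avroSizeTracker) add(compressed, uncompressed int64) {
	t.compressedBytes += compressed
	t.uncompressedBytes += uncompressed
}

func (t *avroSizeTracker) shouldFlush() bool {
	if t.trackUncompressed {
		return t.uncompressedBytes >= t.limitBytes
	}
	return t.compressedBytes >= t.limitBytes
}
```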