Figure out how to use RetryHelper in insertAll #470
Closed
Labels
- api: bigquery (Issues related to the BigQuery API.)
- priority: p2 (Moderately-important priority. Fix may not be included in next release.)
- triaged for GA
- type: feature request ('Nice-to-have' improvement, new feature or different behavior or design.)
Right now `bigquery.insertAll` does not use `RetryHelper`. Retrying a failed `insertAll` operation might insert duplicates into the table unless an ID is associated with each row.

We have 2 options here:

1. Use `RetryHelper` and document that, unless row IDs are used, duplicates might be inserted.
2. Use `RetryHelper` and, when retrying, add a `randomUUID()` to all rows that have no ID specified.

It is worth noting that BigQuery duplicate detection based on row ID is best effort. Opinions/suggestions are welcome!
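
To make the second option concrete, here is a rough, self-contained sketch. It does not use the real `RetryHelper` or any BigQuery client type: `Row`, `Sender`, `withIds`, and `insertAllWithRetry` are hypothetical names invented for illustration, and the loop is a plain fixed-count retry with no backoff. The one point it tries to show is that missing IDs should be generated once, before the first attempt, so that every retry resends the same IDs. Generating fresh UUIDs on each attempt would give the service nothing to deduplicate on.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.UUID;

// Illustrative sketch only; none of these types come from the client library.
final class InsertAllRetrySketch {

  /** Hypothetical row: id is null when the caller did not supply one. */
  static final class Row {
    final String id;
    final String content;

    Row(String id, String content) {
      this.id = id;
      this.content = content;
    }
  }

  /** Hypothetical transport: throws RuntimeException on a transient failure. */
  interface Sender {
    void send(List<Row> rows);
  }

  /** Returns a copy of the batch in which every row without an ID gets a random UUID. */
  static List<Row> withIds(List<Row> rows) {
    List<Row> result = new ArrayList<>(rows.size());
    for (Row row : rows) {
      result.add(row.id != null ? row : new Row(UUID.randomUUID().toString(), row.content));
    }
    return result;
  }

  /** Sends the batch, retrying on failure; returns the number of attempts used. */
  static int insertAllWithRetry(List<Row> rows, Sender sender, int maxAttempts) {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be at least 1");
    }
    // IDs are assigned once, outside the loop, so each retry reuses them.
    List<Row> identified = withIds(rows);
    RuntimeException last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        sender.send(identified);
        return attempt;
      } catch (RuntimeException e) {
        last = e; // remember the failure and try again
      }
    }
    throw last;
  }

  public static void main(String[] args) {
    List<Row> rows = Arrays.asList(new Row(null, "a"), new Row("caller-id", "b"));
    int[] calls = {0};
    // Simulated sender that fails on the first call and succeeds afterwards.
    int attempts = insertAllWithRetry(rows, batch -> {
      if (++calls[0] == 1) {
        throw new RuntimeException("simulated transient failure");
      }
    }, 3);
    System.out.println("attempts=" + attempts);
  }
}
```

Since duplicate detection on row IDs is best effort, even this approach would only reduce the chance of duplicates, not eliminate it.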
/cc @jtigani