feat(bigquery-jdbc): add `EnableProjectDiscovery` connection property for metadata methods by keshavdandeva · Pull Request #13344 · googleapis/google-cloud-java

keshavdandeva · 2026-06-03T14:14:59Z

b/499078725

This PR implements the EnableProjectDiscovery connection property into the BigQuery JDBC driver. Enabling this property (default false) allows JDBC database metadata methods (like getCatalogs() and getSchemas()) to discover and query datasets across all Google Cloud projects accessible to the client credentials, rather than being confined to the single default ProjectId specified in the connection URL.

Changes

Connection Parameter Parsing: Added parsing for EnableProjectDiscovery in BigQueryJdbcUrlUtility and configured it in DataSource.
SDK Integration: Implemented BigQueryConnection.getDiscoveredProjects() to utilize the BigQuery.listProjects() core SDK method
Resilience and Caching:
- Connection-scoped caching is implemented via discoveredProjectsCache.
- Non-transient errors (HTTP 400, 401, 403) cache an empty list to avoid redundant, failing calls to the BigQuery API.
- Transient errors (like 500) are not cached, allowing subsequent metadata scans to trigger retries.
Metadata Integration: Integrated discovered projects inside BigQueryDatabaseMetaData.getAccessibleCatalogNames() so that catalog and schema metadata fetches automatically query across all accessible projects when EnableProjectDiscovery is enabled.

…ethods

gemini-code-assist

Code Review

This pull request introduces an EnableProjectDiscovery configuration property to automatically discover and list all accessible Google Cloud projects as catalogs. To support this, schema fetching in BigQueryDatabaseMetaData has been refactored to run concurrently using an executor service. Feedback on these changes highlights two main areas for improvement: first, replacing the fragile use of reflection to access the low-level BigQuery client with the standard public BigQuery.listProjects() API; second, ensuring that outstanding asynchronous tasks are properly cancelled if the schema fetching loop is interrupted to prevent resource leaks.

keshavdandeva · 2026-06-03T18:30:32Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces an automatic project discovery feature to the BigQuery JDBC driver, allowing users to discover all accessible Google Cloud projects via a new EnableProjectDiscovery connection property. It also updates BigQueryDatabaseMetaData to parallelize schema fetching across multiple projects. The review feedback highlights several areas for improvement: reusing a shared ExecutorService instead of creating a new one on every getSchemas() call, avoiding caching empty results on transient exceptions in getDiscoveredProjects(), using GsonFactory.getDefaultInstance() for better resource reuse, and preserving the stack trace when logging execution exceptions.

logachev · 2026-06-04T04:23:41Z

  }

  @Override
  public ResultSet getSchemas(String catalog, String schemaPattern) {


I'd suggest some refactoring for this method, few ideas to simplify:

Move code to fetch list of schemas in a specific catalog to a separate function;

Keep this method simple - single catalog only, essentially calls into helper & transforms to jsonResultSet. No need for background threads since it is single catalog

Refactor getSchemas() to be the one that fans out multiple requests & assembles data.

Probably breaking up CLs in 2 will be easier:

Add proper threadPool to connection-layer (and remove static threadPool in statement, it is not used)

update getSchemas

Also I'd suggest to reuse more code. There are 4 metadata methods that can generate large # of rest calls:

getCatalogs()

getSchemas()

getTables()

getColumns()

getProcedures()

In a way, they build results on top of each other. Right now we duplicate a lot, e.g. listDatasets() is called in 7 different places.

Also please don't tackle all of them in a single CL, lets do it incrementally :)
(Otherwise it is pain to review)

Yeah, makes sense. I have created 3 bugs:

b/520400589 - Refactor Metadata Thread Pool Management to Reuse Connection-Scoped Executor

b/520407325 - Refactor getSchemas for Catalog-Based Routing (Synchronous & Async Fan-out)

b/520406763 - Deduplicate metadata API calls

keshavdandeva · 2026-06-18T16:29:05Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces an automatic project discovery feature to the BigQuery JDBC driver, allowing users to discover all accessible Google Cloud projects via the EnableProjectDiscovery connection property. The implementation includes caching of discovered projects and error handling for transient and non-transient BigQuery exceptions. The review feedback correctly identifies compilation errors in BigQueryConnection.java where invalid LOG.warning(e, ...) calls are used instead of LOG.log(Level.WARNING, ..., e).

feat: add EnableProjectDiscovery connection property for metadata m…

8f75b30

…ethods

gemini-code-assist Bot reviewed Jun 3, 2026

View reviewed changes

Comment thread java-bigquery-jdbc/src/main/java/com/google/cloud/bigquery/jdbc/BigQueryConnection.java

Comment thread java-bigquery-jdbc/src/main/java/com/google/cloud/bigquery/jdbc/BigQueryDatabaseMetaData.java Outdated

address pr feedback

81e80a0

gemini-code-assist Bot reviewed Jun 3, 2026

View reviewed changes

address pr feedback

86d62b1

keshavdandeva marked this pull request as ready for review June 3, 2026 19:13

keshavdandeva requested review from a team as code owners June 3, 2026 19:14

keshavdandeva requested review from Neenu1995 and logachev June 3, 2026 19:14

logachev reviewed Jun 4, 2026

View reviewed changes

keshavdandeva requested a review from logachev June 5, 2026 16:12

chore: address pr feedback

f440367

Neenu1995 reviewed Jun 8, 2026

View reviewed changes

Comment thread java-bigquery-jdbc/src/main/java/com/google/cloud/bigquery/jdbc/BigQueryConnection.java Outdated

keshavdandeva added 3 commits June 18, 2026 16:47

Merge branch 'main' into jdbc/get-all-projects

c45cec9

use the SDK listProjects API with connection-scoped caching

3387cf5

Merge branch 'main' into jdbc/get-all-projects

b0aa5cd

gemini-code-assist Bot reviewed Jun 18, 2026

View reviewed changes

Comment thread java-bigquery-jdbc/src/main/java/com/google/cloud/bigquery/jdbc/BigQueryConnection.java

Comment thread java-bigquery-jdbc/src/main/java/com/google/cloud/bigquery/jdbc/BigQueryConnection.java

keshavdandeva requested a review from Neenu1995 June 18, 2026 16:38

Merge branch 'main' into jdbc/get-all-projects

47a47ac

Conversation

keshavdandeva commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

keshavdandeva commented Jun 3, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

logachev Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

keshavdandeva Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

keshavdandeva commented Jun 18, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

keshavdandeva commented Jun 3, 2026 •

edited

Loading