Add connector SPI for returning redactable properties #24562

piotrrzysko · 2024-12-23T11:35:07Z

Description

An alternative approach to #23103. The main difference is that in this approach, the properties requiring redaction are selected from those provided by the user, rather than always returning a static set of predefined security-sensitive properties. The benefits are as follows:

By default (if a connector doesn't implement the SPI), all properties are masked.
Unknown (potentially misspelled) properties can also be treated as redactable.

This PR includes an implementation of the new SPI for the PostgreSQL connector. Once we confirm that the approach is correct, we will apply it to the remaining connectors.

Here is a PR demonstrating how the new SPI could be used to mask security-sensitive properties in queries related to creating catalogs: #24563.

Additional context and related issues

Resolves #22887.

Release notes

( ) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
(x) Release notes are required, with the following suggested text:

## SPI
* Add connector SPI for returning redactable properties ({issue}`22887`)

The SPI will be used by the engine to redact security-sensitive information in statements that manage catalogs. It has been added at the connector factory level, rather than the connector level, to allow more flexibility in retrieving properties. In some cases, we want to perform redacting before a connector is initiated. For example, when we create a new catalog by issuing the CREATE CATALOG statement.

Exposed properties fall into one of the following categories: they are either explicitly marked as security-sensitive or are unknown. The connector assumes that unknown properties might be misspelled security-sensitive properties. The purpose of the included test is to identify security-sensitive properties that may be used by the connector. It uses the output generated by the maven-dependency-plugin, configured in the connector's pom.xml file. This output contains the connector's runtime classpath, which is then scanned to identify all property names annotated with @config. Scanning the classpath ensures that all configuration classes are included, even those used conditionally.

hashhar · 2025-01-02T08:27:19Z

core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorFactory.java


 public interface ConnectorFactory
 {
    String getName();

    @CheckReturnValue
    Connector create(String catalogName, Map<String, String> config, ConnectorContext context);
+
+    /**
+     * Extracts property names from the provided set that may include security-sensitive


nit: Extracts -> Returns

feels clearer to me.

getRedactablePropertyNames -> getSecuritySensitivePropertyNames? (to align with Airlift's ConfigurationMetadata#isSecuritySensitive and @ConfigSecuritySensitive)? WDYT?

I had the exact same thought about renaming to security sensitive

hashhar · 2025-01-02T08:29:18Z

lib/trino-plugin-toolkit/src/main/java/io/trino/plugin/base/config/ConfigPropertyMetadata.java

in commit message: even those used conditionally. -> even those used conditionally or contributed by other modules.

Am I right that scanning the classpath will also include cases where properties are contributed from other modules?

hashhar · 2025-01-02T08:30:34Z

plugin/trino-postgresql/src/test/java/io/trino/plugin/postgresql/TestPostgreSqlPlugin.java

@@ -34,4 +53,97 @@ public void testCreateConnector()
                        "bootstrap.quiet", "true"),
                new TestingPostgreSqlConnectorContext()).shutdown();
    }
+
+    @Test
+    void testUnknownPropertiesAreRedactable()


testUnknownPropertiesAreRedactable -> testUnknownPropertiesAreSecuritySensitive (if you decide to change the SPI method name). Here and elsewhere.

dain · 2025-01-02T19:26:32Z

core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorFactory.java


 public interface ConnectorFactory
 {
    String getName();

    @CheckReturnValue
    Connector create(String catalogName, Map<String, String> config, ConnectorContext context);
+
+    /**
+     * Extracts property names from the provided set that may include security-sensitive


I had the exact same thought about renaming to security sensitive

dain · 2025-01-02T19:46:37Z

plugin/trino-base-jdbc/src/main/java/io/trino/plugin/jdbc/JdbcConnectorFactory.java

    {
        checkArgument(!isNullOrEmpty(name), "name is null or empty");
        this.name = name;
        this.module = requireNonNull(module, "module is null");
+        Set<Class<?>> configClasses = ImmutableSet.<Class<?>>builder()


Instead of attempting to list every configuration class, I think we should modify ConfigurationFactory in Airlift to extract the properties for us. I'm thinking (just thoughts after a brief look) that we have a method to extract all properties from a set of modules, and classify them into used, unused, and unknown. With used and unused having sub classification for secure or unsecure.

piotrrzysko added 2 commits December 23, 2024 12:31

cla-bot bot added the cla-signed label Dec 23, 2024

This was referenced Dec 23, 2024

Redact sensitive information in catalog queries #24563

Draft

Add connector SPI for returning security-sensitive properties #23103

Closed

hashhar reviewed Jan 2, 2025

View reviewed changes

hashhar approved these changes Jan 2, 2025

View reviewed changes

piotrrzysko mentioned this pull request Jan 2, 2025

Extend syntax for Dynamic Catalogs #22188

Open

3 tasks

dain reviewed Jan 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add connector SPI for returning redactable properties #24562

Add connector SPI for returning redactable properties #24562

piotrrzysko commented Dec 23, 2024 •

edited

Loading

hashhar Jan 2, 2025

dain Jan 2, 2025

hashhar Jan 2, 2025

hashhar Jan 2, 2025

dain Jan 2, 2025

dain Jan 2, 2025

Add connector SPI for returning redactable properties #24562

Are you sure you want to change the base?

Add connector SPI for returning redactable properties #24562

Conversation

piotrrzysko commented Dec 23, 2024 • edited Loading

Description

Additional context and related issues

Release notes

hashhar Jan 2, 2025

Choose a reason for hiding this comment

dain Jan 2, 2025

Choose a reason for hiding this comment

hashhar Jan 2, 2025

Choose a reason for hiding this comment

hashhar Jan 2, 2025

Choose a reason for hiding this comment

dain Jan 2, 2025

Choose a reason for hiding this comment

dain Jan 2, 2025

Choose a reason for hiding this comment

piotrrzysko commented Dec 23, 2024 •

edited

Loading