
Allow passing of profile in Spark options instead of a profile-file (#102) #103

Open · wants to merge 1 commit into main

Conversation


@jacob-heldenbrand-cl jacob-heldenbrand-cl commented Jan 6, 2022

This PR adds the ability to pass the secrets stored in the profile file directly to Spark as options.

While I've added an integration test, I was unable to run it directly since I do not have access to the test environment. However, using a debugger, I confirmed that the options reach the parameters of DeltaSharingDataSource's createRelation method, and that the method constructs the RemoteDeltaLog object correctly.
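To illustrate the intent, the sketch below shows the same credential fields a Delta Sharing profile file carries, expressed as an in-memory options map instead of a file on disk. The `profile.`-prefixed option names are hypothetical placeholders, not necessarily the names this PR uses:

```python
import json

# A Delta Sharing profile file stores connection secrets as JSON fields.
profile = {
    "shareCredentialsVersion": 1,
    "endpoint": "https://sharing.example.com/delta-sharing/",
    "bearerToken": "<token>",
}

# Instead of writing this JSON to a file and passing its path, the idea is
# to supply the same fields as Spark read options (hypothetical key names):
options = {f"profile.{key}": str(value) for key, value in profile.items()}
print(json.dumps(options, indent=2))
```

The secrets never touch disk; they flow from wherever the caller obtained them straight into the reader's options.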

Closes #102

Signed-off-by: Jacob Heldenbrand jacob.heldenbrand@closedloop.ai


zsxwing commented Jan 7, 2022

Thanks for the contribution. We discussed how to pass secrets in the past and decided on the profile-file approach. Setting secrets directly in code is discouraged, so we don't want to support this.


jacob-heldenbrand-cl commented Jan 7, 2022

The reason we want to pass credentials as read options is not so we can hardcode secrets in the code, but so we can inject them into Spark dynamically. For our use case, we are not allowed to save secrets to an arbitrary S3 file; instead we must store them in an audited secret management system (in our case, HashiCorp Vault). We follow this pattern with other technologies as well, such as JDBC.
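The JDBC pattern referred to above looks roughly like this: credentials are fetched from the secret store at runtime and passed as reader options, never written to source or to disk. `get_secret` here is a hypothetical stand-in for a Vault lookup:

```python
def get_secret(name: str) -> str:
    """Hypothetical stand-in for an audited secret-store lookup (e.g. Vault)."""
    vault = {"db/user": "svc_reader", "db/password": "s3cr3t"}
    return vault[name]

# Credentials are injected into the reader options at runtime:
jdbc_options = {
    "url": "jdbc:postgresql://db.example.com/analytics",
    "user": get_secret("db/user"),
    "password": get_secret("db/password"),
}

# With Spark, this map would then feed a standard JDBC read, e.g.:
#   spark.read.format("jdbc").options(**jdbc_options).load()
```

The ask is for the Delta Sharing connector to accept its credentials the same way.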

Is there an alternative approach we could use/implement to inject these secrets into Spark dynamically?


zsxwing commented Jan 12, 2022

This is a fair point. Let me think about this, and also about how to support SQL.

> Is there an alternative approach we could use/implement to inject these secrets into Spark dynamically?

As a workaround, you can manually read them from your audited secret management system, store them in a temp file, and use the temp file path for access.

@ssimeonov

I'll second the request for dynamic configuration.

The original design decision to use files seems to have been made on a faulty assumption. Putting secrets in source code has been discouraged since the earliest days of source code. It's not this project's responsibility to save developers from themselves, especially not at the cost of increased configuration and deployment complexity.

@stevenayers-bge

@linzhou-db @zsxwing could we get this merged in, please? I'm happy to help resolve the conflicts.
