Offer Vertex-AI as a provider #706

hellovai · 2024-06-21T21:15:03Z

Vertex-ai is another commonly offered interface for speaking to LLMs on-top of gemini directly. It has issues in the way we get the authorization token (needs to be refreshed frequently w/ oauth). We can support this if many people require it.

Foxicution · 2024-07-05T11:22:52Z

We'd like this feature.

anish-palakurthi · 2024-07-10T16:58:26Z

We'll add support for this by end of the week!

hellovai · 2024-07-14T23:43:43Z

@Foxicution: @anish-palakurthi has got a prototype of this working (in the playground and runtime)! We should be pushing out the PR and merging soon!

To ensure we support what you need, how do you currently do authentication for vertex-ai?

Foxicution · 2024-07-15T07:13:46Z

Currently we use the google-cloud-sdk to authenticate using account credentials, and then we do

import vertexai

vertexai.init(project="***", location="***")
model = vertexai.generative_models.GenerativeModel("***")

generation_config = {
    "max_output_tokens": 8192,
    "temperature": 1,
    "top_p": 0.95,
}

safety_settings = {
    vertexai.preview.generative_models.HarmCategory.HARM_CATEGORY_HATE_SPEECH: vertexai.preview.generative_models.HarmBlockThreshold.BLOCK_ONLY_HIGH,
    vertexai.preview.generative_models.HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT: vertexai.preview.generative_models.HarmBlockThreshold.BLOCK_ONLY_HIGH,
    vertexai.preview.generative_models.HarmCategory.HARM_CATEGORY_SEXUALLY_EXPLICIT: vertexai.preview.generative_models.HarmBlockThreshold.BLOCK_ONLY_HIGH,
    vertexai.preview.generative_models.HarmCategory.HARM_CATEGORY_HARASSMENT: vertexai.preview.generative_models.HarmBlockThreshold.BLOCK_ONLY_HIGH,
}
responses = mode.generate_content(["message"], generation_config=generation_config, safety_settings=safety_settings)

hellovai · 2024-07-15T08:44:28Z

We'll confirm and ensure it works!

anish-palakurthi · 2024-07-15T20:04:07Z

@Foxicution
We are getting ready to ship this today!
Take a look and see if this fits your needs!
https://github.com/BoundaryML/baml/blob/vertex-ai/docs/docs/snippets/clients/providers/vertex.mdx

Here is a sample BAML client with your configuration:

client<llm> Vertex {
  provider vertex-ai

  options {
    model gemini-1.5-pro
    project_id my-project-id
    location us-central1
    safetySettings  [
        { 
            category HARM_CATEGORY_HATE_SPEECH
            threshold BLOCK_ONLY_HIGH
        },
        {
            category HARM_CATEGORY_DANGEROUS_CONTENT
            threshold BLOCK_ONLY_HIGH
        },
        {
            category HARM_CATEGORY_SEXUALLY_EXPLICIT
            threshold BLOCK_ONLY_HIGH
        },
        {
            category HARM_CATEGORY_HARASSMENT
            threshold BLOCK_ONLY_HIGH 
        }
      ]
 
    generationConfig {
      maxOutputTokens 10
      temperature 1
      topP 0.95
    }
  }
}

Foxicution · 2024-07-16T04:32:07Z

Read through the docs, looks good I think. Will try it out.

hellovai added the enhancement New feature or request label Jun 21, 2024

anish-palakurthi linked a pull request Jul 15, 2024 that will close this issue

Support Vertex AI (Google Cloud SDK) #790

Merged

anish-palakurthi closed this as completed in #790 Jul 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offer Vertex-AI as a provider #706

Offer Vertex-AI as a provider #706

hellovai commented Jun 21, 2024 •

edited

Loading

Foxicution commented Jul 5, 2024

anish-palakurthi commented Jul 10, 2024

hellovai commented Jul 14, 2024

Foxicution commented Jul 15, 2024 •

edited

Loading

hellovai commented Jul 15, 2024

anish-palakurthi commented Jul 15, 2024 •

edited

Loading

Foxicution commented Jul 16, 2024

Offer Vertex-AI as a provider #706

Offer Vertex-AI as a provider #706

Comments

hellovai commented Jun 21, 2024 • edited Loading

Foxicution commented Jul 5, 2024

anish-palakurthi commented Jul 10, 2024

hellovai commented Jul 14, 2024

Foxicution commented Jul 15, 2024 • edited Loading

hellovai commented Jul 15, 2024

anish-palakurthi commented Jul 15, 2024 • edited Loading

Foxicution commented Jul 16, 2024

hellovai commented Jun 21, 2024 •

edited

Loading

Foxicution commented Jul 15, 2024 •

edited

Loading

anish-palakurthi commented Jul 15, 2024 •

edited

Loading