[Bug]: Llamaindex managed Vertex AI index needs to be updated. #14637

wadave · 2024-07-08T18:08:54Z

Bug Description

I tried this https://docs.llamaindex.ai/en/stable/examples/managed/VertexAIDemo/ but it gave errors.

print(index.list_files()) gave list_files() got an unexpected keyword argument 'corpus'

It seems Vertex AI RAG api (from vertexai.preview import rag) evolved. The latest RAG API actually uses 'corpus_name' instead of 'corpus'.

https://github.com/run-llama/llama_index/tree/main/llama-index-integrations/indices/llama-index-indices-managed-vertexai/llama_index/indices/managed/vertexai needs to be updated.

query_engine = index.as_query_engine() gave Could not load OpenAI model. If you intended to use OpenAI, please check your OPENAI_API_KEY. at beginning, after I set up Settings.llm = vertex_gemini (nor sure if this is the right way), it gave AttributeError: Unknown field for RetrieveContextsResponse: rag_contexts

Version

0.10.52

Steps to Reproduce

Run this notebook: https://docs.llamaindex.ai/en/stable/examples/managed/VertexAIDemo/
The errors will be reproducced.

Relevant Logs/Tracbacks

No response

The text was updated successfully, but these errors were encountered:

dosubot · 2024-07-08T18:14:27Z

Hey there, @wadave! I'm here to help you with any bugs, questions, or becoming a contributor. Let's squash those bugs together! 🤖

To address the issues with the list_files() and as_query_engine() methods in the LlamaIndex managed Vertex AI index, you can update the code as follows:

Update list_files() Method:
Ensure the list_files() method correctly lists all files in the index by using the corpus_name argument:

def list_files(self) -> Sequence[str]:
    """List all files in the index."""
    files = []
    with telemetry.tool_context_manager(self._user_agent):
        for file in rag.list_files(corpus=self.corpus_name):
            files.append(file.name)
    return files

Update as_query_engine() Method:
Ensure the as_query_engine() method correctly returns a RetrieverQueryEngine:

def as_query_engine(self, **kwargs: Any) -> BaseQueryEngine:
    from llama_index.core.query_engine.retriever_query_engine import (
        RetrieverQueryEngine,
    )

    kwargs["retriever"] = self.as_retriever(**kwargs)
    return RetrieverQueryEngine.from_args(**kwargs)

Handle rag_contexts Attribute:
Modify the VertexAIRetriever class to handle the rag_contexts attribute in the RetrieveContextsResponse to avoid the AttributeError:

from typing import Any, List, Optional

from llama_index.core.base.base_retriever import BaseRetriever
from llama_index.core.schema import NodeWithScore, QueryBundle, TextNode

from google.cloud.aiplatform import telemetry
from vertexai.preview import rag

class VertexAIRetriever(BaseRetriever):
    def __init__(
        self,
        corpus_name: str,
        similarity_top_k: Optional[int] = None,
        vector_distance_threshold: Optional[float] = 0.3,
        user_agent: Optional[str] = None,
        **kwargs: Any,
    ) -> None:
        """Initialize the Vertex AI Retriever."""
        self.rag_resources = [rag.RagResource(rag_corpus=corpus_name)]
        self._similarity_top_k = similarity_top_k
        self._vector_distance_threshold = vector_distance_threshold
        self._user_agent = user_agent or "llama-index/0.0.0"

    def _retrieve(self, query_bundle: QueryBundle) -> List[NodeWithScore]:
        """Retrieve from the platform."""
        with telemetry.tool_context_manager(self._user_agent):
            response = rag.retrieval_query(
                text=query_bundle.query_str,
                rag_resources=self.rag_resources,
                similarity_top_k=self._similarity_top_k,
                vector_distance_threshold=self._vector_distance_threshold,
            )

        if response.rag_contexts:
            return [
                NodeWithScore(node=TextNode(text=context.text), score=context.distance)
                for context in response.rag_contexts.contexts
            ]
        else:
            return []

    async def _aretrieve(self, query_bundle: QueryBundle) -> List[NodeWithScore]:
        """Asynchronously retrieve from the platform."""
        return self._retrieve(query_bundle=query_bundle)

Dependencies:
Ensure you have the necessary dependencies in your requirements.txt or setup.py file:

llama-index-llms-gemini
llama-index-indices-managed-vertexai
llama-index
google-cloud-aiplatform==1.53.0
llama-index-vector-stores-vertexaivectorsearch
llama-index-core
llama-index-llms-vertex

These updates should resolve the issues you encountered with the list_files() and as_query_engine() methods, as well as the AttributeError related to the rag_contexts attribute ^[1]^[2]^[3].

To continue talking to Dosu, mention @dosu.

wadave · 2024-07-08T19:54:48Z

Hi Bot, it should be 'response.contexts.contexts', not response.rag_contexts.contexts

dosubot · 2024-07-08T20:02:57Z

Thank you for the correction. Here is the updated code for the VertexAIRetriever class to handle the contexts attribute correctly:

from typing import Any, List, Optional

from llama_index.core.base.base_retriever import BaseRetriever
from llama_index.core.schema import NodeWithScore, QueryBundle, TextNode

from google.cloud.aiplatform import telemetry
from vertexai.preview import rag

class VertexAIRetriever(BaseRetriever):
    def __init__(
        self,
        corpus_name: str,
        similarity_top_k: Optional[int] = None,
        vector_distance_threshold: Optional[float] = 0.3,
        user_agent: Optional[str] = None,
        **kwargs: Any,
    ) -> None:
        """Initialize the Vertex AI Retriever."""
        self.rag_resources = [rag.RagResource(rag_corpus=corpus_name)]
        self._similarity_top_k = similarity_top_k
        self._vector_distance_threshold = vector_distance_threshold
        self._user_agent = user_agent or "llama-index/0.0.0"

    def _retrieve(self, query_bundle: QueryBundle) -> List[NodeWithScore]:
        """Retrieve from the platform."""
        with telemetry.tool_context_manager(self._user_agent):
            response = rag.retrieval_query(
                text=query_bundle.query_str,
                rag_resources=self.rag_resources,
                similarity_top_k=self._similarity_top_k,
                vector_distance_threshold=self._vector_distance_threshold,
            )

        if response.contexts:
            return [
                NodeWithScore(node=TextNode(text=context.text), score=context.distance)
                for context in response.contexts.contexts
            ]
        else:
            return []

    async def _aretrieve(self, query_bundle: QueryBundle) -> List[NodeWithScore]:
        """Asynchronously retrieve from the platform."""
        return self._retrieve(query_bundle=query_bundle)

This should correctly handle the contexts attribute in the RetrieveContextsResponse to avoid the AttributeError.

To continue talking to Dosu, mention @dosu.

…14641)

wadave added bug Something isn't working triage Issue needs to be triaged/prioritized labels Jul 8, 2024

wadave mentioned this issue Jul 8, 2024

Fix #14637 Llamaindex managed Vertex AI index needs to be updated. #14641

Merged

19 tasks

logan-markewich closed this as completed in #14641 Jul 8, 2024

logan-markewich pushed a commit that referenced this issue Jul 8, 2024

Fix #14637 Llamaindex managed Vertex AI index needs to be updated. (#…

09384e5

…14641)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Llamaindex managed Vertex AI index needs to be updated. #14637

[Bug]: Llamaindex managed Vertex AI index needs to be updated. #14637

wadave commented Jul 8, 2024

dosubot bot commented Jul 8, 2024

wadave commented Jul 8, 2024

dosubot bot commented Jul 8, 2024

[Bug]: Llamaindex managed Vertex AI index needs to be updated. #14637

[Bug]: Llamaindex managed Vertex AI index needs to be updated. #14637

Comments

wadave commented Jul 8, 2024

Bug Description

Version

Steps to Reproduce

Relevant Logs/Tracbacks

dosubot bot commented Jul 8, 2024

wadave commented Jul 8, 2024

dosubot bot commented Jul 8, 2024