Chroma Vector Store
The Chroma Vector Store node is a component that integrates Chroma, an open-source embedding database, into a vector store system. It allows for upsert operations of embedded data and similarity search functionality.
Node Details
-
Name: Chroma_VectorStores
-
Type: Chroma
-
Version: 2.0
-
Category: Vector Stores
Base Classes
-
Chroma
-
VectorStoreRetriever
-
BaseRetriever
Inputs
-
Document (optional, list)
-
Type: Document
-
Description: List of documents to be stored in the vector store.
-
-
Embeddings
-
Type: Embeddings
-
Description: Embedding model to use for vectorizing the documents.
-
-
Record Manager (optional)
-
Type: RecordManager
-
Description: Keeps track of records to prevent duplication.
-
-
Collection Name
-
Type: string
-
Description: Name of the Chroma collection to use.
-
-
Chroma URL (optional)
-
Type: string
-
Description: URL of the Chroma instance if using a remote server.
-
-
Chroma Metadata Filter (optional)
-
Type: json
-
Description: Metadata filter for querying specific documents.
-
-
Top K (optional)
-
Type: number
-
Description: Number of top results to fetch (default: 4).
-
Outputs
-
Chroma Retriever
-
Type: retriever
-
Base Classes: [Chroma, VectorStoreRetriever, BaseRetriever]
-
-
Chroma Vector Store
-
Type: vectorStore
-
Base Classes: [Chroma, …BaseClasses(Chroma)]
-
Credential (Optional)
-
Type: chromaApi
-
Description: API key for Chroma cloud services (if applicable)
Methods
Upsert
Adds or updates documents in the Chroma vector store.
Delete
Removes documents from the Chroma vector store based on provided IDs.
Init
Initializes the Chroma vector store or retriever based on the provided configuration.
Usage
This node is used for creating and managing a Chroma-based vector store within a larger system. It can be used to:
-
Store and index documents with their embeddings.
-
Retrieve similar documents based on query embeddings.
-
Manage and update the vector store contents.
It’s particularly useful in applications requiring semantic search, document retrieval, or any task benefiting from efficient similarity comparisons of embedded data.