Web Vector Storage (WVS) is a lightweight and efficient vector database that stores document vectors in the browser's IndexedDB. This package allows you to perform semantic similarity searches on text documents using vector embeddings. Semantic search refers to the ability to understand the meaning and context of text documents and queries, enabling more accurate and relevant search results.
Web Vector Storage supports a variety of embedding providers and models to generate embeddings to convert text documents into vectors and provides an interface for searching similar documents based on cosine similarity.
Vector stores are a core component in building Retrieval Augmentation Generation (RAG) Generative AI applications using Large Language Models (LLMs). When running a LLM on the edge (i.e. in-browser), having a web browser native vector store can be useful. This enables local storage of user's data as well as leveraging the local compute capacity within the user's device thus alleviating overhead and cost.
Cosine similarity is a measure of similarity between two non-zero vectors in an inner product space. It is defined as the cosine of the angle between the two vectors. The cosine similarity value ranges from -1 to 1, where 1 indicates complete similarity, 0 indicates no similarity, and -1 indicates complete dissimilarity.
In this package, cosine similarity is used to measure the similarity between document vectors and the query vector. The cosine similarity score is calculated using the dot product of the vectors, divided by the product of their magnitudes.
The Least Recently Used (LRU) mechanism is used to manage the storage size and automatically remove documents when the storage size exceeds the specified limit. Documents are sorted by their hit counter (ascending) and then by their timestamp (ascending). Documents with the lowest hit count and oldest timestamps are removed first until the storage size is below the limit.
Install the package using npm:
npm i web-vector-storage
Here is a basic example of how to use the VectorStorage class:
import { VectorStorage } from "web-vector-storage";
import { OpenAIEmbedder } from "web-vector-storage";
// Create an instance of VectorStorage
const vectorStore = new VectorStorage(new OpenAIEmbedder({ apiKey: "your-openai-api-key" }));
// Add a text document to the store
await vectorStore.addText("The quick brown fox jumps over the lazy dog.", {
category: "example",
});
// Perform a similarity search
const results = await vectorStore.similaritySearch({
query: "A fast fox leaps over a sleepy hound.",
});
// Display the search results
console.log(results);
Here is a basic example of how to use the VectorStorage class:
import { VectorStorage } from "web-vector-storage";
import { OllamaEmbedder } from "web-vector-storage";
// Create an instance of VectorStorage
const vectorStore = new VectorStorage(new OllamaEmbedder({ embeddingModel: "your-favorite-ollama-embedding-model" }));
// Add a text document to the store
await vectorStore.addText("The quick brown fox jumps over the lazy dog.", {
category: "example",
});
// Perform a similarity search
const results = await vectorStore.similaritySearch({
query: "A fast fox leaps over a sleepy hound.",
});
// Display the search results
console.log(results);
Here is a basic example of how to use the VectorStorage class:
import { VectorStorage } from "web-vector-storage";
import { HFTransformerEmbedder } from "web-vector-storage";
// Create an instance of VectorStorage
const vectorStore = new VectorStorage(new HFTransformerEmbedder({ embeddingModel: "your-favorite-hf-transformer-embedding-model" }));
// Add a text document to the store
await vectorStore.addText("The quick brown fox jumps over the lazy dog.", {
category: "example",
});
// Perform a similarity search
const results = await vectorStore.similaritySearch({
query: "A fast fox leaps over a sleepy hound.",
});
// Display the search results
console.log(results);
The main class for managing document vectors in IndexedDB.
Creates a new instance of VectorStorage.
embedder: An instance of an embedder (OpenAIEmbedder, OllamaEmbedder, HFTransformerEmbedder) class
interface IEmbedderOptions {
apiKey?: string; // The API key to use. Only applicable to OpenAIEmbedder.
baseUrl?: string; // The base URL to use to connect to remote service. Only applicable to OllamaEmbedder and defaults to http://localhost:11434
embeddingModel?: string; // The specific embedding model to use. Each embedder has a default if none is specified.
}
options: An object containing the following properties:
interface IWVSOptions {
maxSizeInMB?: number; // The maximum size of the storage in megabytes. Defaults to 2GB
debounceTime?: number; // The debounce time in milliseconds for saving to IndexedDB. Defaults to 0.
}
Adds a text document to the store and returns the created document.
Adds multiple text documents to the store and returns an array of created documents.
Performs a similarity search on the stored documents and returns an array of matching documents.
params: An object containing the following properties:
The IWVSDocument interface represents a document object stored in the vector database. It contains the following properties:
interface IWVSDocument {
hits?: number; // The number of hits (accesses) for the document. Omit if the value is 0.
metadata: object; // The metadata associated with the document for filtering.
text: string; // The text content of the document.
timestamp: number; // The timestamp indicating when the document was added to the store.
vectorMag: number; // The magnitude of the document vector.
vector: number[]; // The vector representation of the document.
}
Contributions to this project are welcome! If you would like to contribute, please follow these steps:
Please ensure that your code follows the project's coding style and that all tests pass before submitting a pull request. If you find any bugs or have suggestions for improvements, feel free to open an issue on GitHub.
This project is licensed under the MIT License. See the LICENSE file for the full license text.
The Web Vector Storage is built on the great work by Nitai Aharoni. All rights reserved.