Apache Doris
Apache Doris is a modern data warehouse for real-time analytics. It delivers lightning-fast analytics on real-time data at scale.
Apache Doris is usually categorized as an OLAP database, and it has shown excellent performance in ClickBench, a benchmark for analytical DBMSs. Because it has a very fast vectorized execution engine, it can also be used as a fast vector database.
You'll need to install langchain-community with pip install -qU langchain-community to use this integration.
Here we'll show how to use the Apache Doris Vector Store.
Setup
%pip install --upgrade --quiet pymysql
Set update_vectordb = False at the beginning. If no documents have been updated, there is no need to rebuild their embeddings.
!pip install sqlalchemy
!pip install langchain
from langchain.chains import RetrievalQA
from langchain_community.document_loaders import (
    DirectoryLoader,
    UnstructuredMarkdownLoader,
)
from langchain_community.vectorstores.apache_doris import (
    ApacheDoris,
    ApacheDorisSettings,
)
from langchain_openai import OpenAI, OpenAIEmbeddings
from langchain_text_splitters import TokenTextSplitter
update_vectordb = False
API Reference: RetrievalQA | DirectoryLoader | UnstructuredMarkdownLoader | ApacheDoris | ApacheDorisSettings | OpenAI | OpenAIEmbeddings | TokenTextSplitter
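To illustrate how the update_vectordb flag might gate the expensive embedding step, here is a minimal sketch. The helper open_vector_store and its rebuild and connect arguments are hypothetical names introduced for illustration; they are not part of LangChain or Apache Doris, and the stand-in functions only return strings so the example runs without a Doris cluster or an OpenAI key.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def open_vector_store(
    update_vectordb: bool,
    rebuild: Callable[[], T],
    connect: Callable[[], T],
) -> T:
    """Return a vector store, re-embedding documents only when asked to.

    `rebuild` and `connect` are hypothetical callables supplied by the caller;
    neither is part of LangChain or Apache Doris.
    """
    if update_vectordb:
        # Slow path: load, split, embed, and write the documents again.
        return rebuild()
    # Fast path: reuse the embeddings already stored in the table.
    return connect()


# Stand-ins so the sketch runs without a Doris cluster or an OpenAI key.
def fake_rebuild() -> str:
    return "store rebuilt from documents"


def fake_connect() -> str:
    return "store opened on existing table"


update_vectordb = False
print(open_vector_store(update_vectordb, fake_rebuild, fake_connect))
```

In a real notebook, rebuild would presumably wrap the document loading, splitting, and embedding steps, while connect would open the store against a table that already holds embeddings. Passing both as callables means the slow path is never evaluated unless the flag is set.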